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NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

( » 

1. TECHNICAL FIELD 

4 The present invention provides novel polynucleotides and proteins encoded by such 

E 

polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and.research methods. 

8 2. BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g. , cytokines, such as 
lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past 

■ 

decade. The now routine hybridization cloning and expression cloning techniques clone novel 
12 polynucleotides "directly" in the sense that they rely on inforination directly related to the 

* * ■ 

discovered protein (i e. , partial DNA/amino acid sequence of the protein in the case of 
hybridization cloning; activity of the protein in the case of expression cloning). More recent 
"indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences 
16 based on the presence of a now well-recognized secretory leader sequence motif, as well as 

various PCR-based or low stringency hybridization-based cloning techniques, have advanced the 
state of the art by making available large numbers of DNA/amino acid sequences for proteins 
that are known to have biological activity, for example, by virtue of their secreted nature in the 
20 case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 
techniques, or by virtue of structural similarity to other genes of known biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications in, for 
example, diagnostics, forensics, gene mapping; identification of mutations responsible for 
24 genetic disorders or other traits, to assess biodiversity, and to produce many other types of data 
and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

28 The compositions of the present invention include novel isolated polypeptides, novel 

m 

isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, 
cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic 
variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more 
32 epitopes present on such polypeptides, as well as hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including expression 
vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such 
polynucleotides and cells genetically engineered to express such polynucleotides. 



t 

WO 01/64835 PCT/US01/04927 

# 

* 

The present invention relates to a collection or library of at least one novel nucleic acid 
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 

■ * 

hybridization (SBH), and in some cases, sequences obtained from one or more public databases. 
4 The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, 

diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid . ; 

sequences are designated as SEQ ID NOS: 1-139.01. The polypeptides sequences are designated ' 1 

SEQ ID NOS: 13902-27802. The nucleic acids and polypeptides are provided in the Sequence 
8 Listing. In the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is 

guanine; T is thymine; and N is any of the four bases. In the amino acids provided in the Sequence ;, 

Listing, * corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences that 
12 hybridizeto the complement of SEQ ID NO: 1-13901 under stringent hybridization conditions; 

nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid 

sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific 

domain or truncation of the peptides encoded by SEQ ID NOS: 1-13901. A polynucleotide 
1 6 comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ 

ID NO: 1-13901 or a degenerate variant or fragment thereof. The identifying sequence can be 100 

basepairs in length. 

The nucleic acid sequences of the present invention also include the sequence information 

20 from the nucleic acid sequences of SEQ ED NO: 1-13901. The sequence information can be a 
segmentof any oneof SEQ ID NO: 1-13901 that uniquely identifies or representsthe sequence 
infoimationof SEQ ID NOS: 1-13901. 

A collection as used in this application can be a collection of only one polynucleotide. The 

24 collection of sequence information or identifying information of each sequence can be provided on 
a nucleic acid array. In one embodiment, segments of sequence information is provided on a 
nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed 
to detect full-match or mismatch to the polynucleotide that contains the; segment. The collection 

28 can also be provided in a computer-readablefonnat 

This invention also includes the reverse or direct complement of any of the nucleic acid 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and 
host cells or organisms transformed with these: expression vectors. Nucleic acid sequences (or their 

32 reverse or direct complements) according to the invention have numerous applications in a variety 
of techniques known to those skilled in the art of molecular biology, such as use as hybridization 
probes, use as primers for PCR, use in an array, use in computer-readable media, use in sequencing 

2 
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full-length genes, use for chromosome and gene mapping, use in the recombinant production of : 
protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like. 
In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-1 390 1 or novel 
4 segments or parts of the nucleic acids of the invention are used as primers in expression assays that 

* 

are well known in the art In a particularly preferred embodiment, the nucleic acid sequences of 

« 

SEQ ID NO: 1-13901 or novel segments or parts of the nucleic acids provided herein are used in 
diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath 
8 et al., Science 258:52-59 (1 992), as expressed sequence tags for physical mapping of the human 
genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-13901; a 

12 polynucleotide comprising any of the Ml length protein coding sequences of SEQ IDNOS: 1- 
1 390 1 ; and a polynucleotide comprising any of the nucleotide sequences of the mature protein 
coding sequences of SEQ ID NOS: 1-13901 . The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that hybridizes under stringent hybridization 

16 conditionsto (a)the complement of any one of the nucleotide sequences set forth in SEQ ID NO: 1- 
1 390 1 ; (b) a nucleotide sequence encoding any one of the amino acid sequences set forth in the 
Sequence Listing (e.g., SEQ ID NOS: 1 3902-27802); (c) a polynucleotide which is an allelic variant 
of any polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g. 

T 

20 orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of any of the polypeptides comprising an amino acid 
sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 

24 comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding 
full length or mature protein. Polypeptides of the invention also include polypeptides with 
biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence 
set forthin SEQ ID NO: 1-13901 ; or (b) polynucleotides that hybridize to the complement of the 

28 polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically 
active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial 
equivalents" thereof (e.g, with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% 
amino acid sequence identity) that preferably retain biological activity are also contemplated. The 

32 polypeptides of the invention may be wholly or partially chemically synthesized but are preferably 

» 

produced by recombinant means using the genetically engineered cells (e.g. host cells) of the 
invention. 
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The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a 
hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 
4 The invention also provides host cells transformed or transfected with a polynucleotide of 

the invention. 

i 

■ 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
8 under conditions permitting expression of the desired polypeptide, and purifying the polypeptide 
from the culture or from the host cells. Preferred embodiments include those in which the 
protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
12 techniques known to those skilled in the art of molecular biology. These techniques include use 
as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gehe 
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA 
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is 
16 largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used 
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample 
using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
20 expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et aL, Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional 
24 procedures and methods that are currently applied to other proteins. For example, a polypeptide 
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such 
antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the 
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight 
28 markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical condition 
which comprises the step of administering to a mammalian subject a therapeutically effective 
amount of a composition comprising a polypeptide of the present invention anda 
32 pharmaceutical^ acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be utilized, for 
example, in methods for the prevention and/or treatment of disorders involving aberrant protein 
expression or biolojgical activity. 

4 
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The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be 
1 utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the 
4 identification of subjects exhibiting a predisposition to such conditions. The invention provides 
a method for detecting the polynucleotides of the invention in a sample, comprising contacting 
the sample with a compound that binds to and forms a complex with the polynucleotide of 
interest for a period sufficient to form the complex and under conditions sufficient to form a 
8 complex and detecting the complex such that if a complex is detected, the, polynucleotide of 
interest is detected. The invention also provides a method for detecting the polypeptides of the 
invention in a sample comprising contacting the sample with a compound that binds to and forms 
a complex with the polypeptide under conditions and for a period sufficient to form the complex 
12 and detecting the formation of the complex such that if a complex is formed, the polypeptide is 
detected. 

The invention also provides kits comprising polynucleotide probes and/or monoclonal 
antibodies, and optionally quantitative standards, for carrying out methods of the invention. 
16 Furthermore, the invention provides methods for evaluating the efficacy of drugs, and 

monitoring the progress of patients, involved in clinical trials for the treatment of disorders as 

♦ 

recited above. 

The invention also provides methods for the identification of compounds that modulate 

20 (/. e. , increase or decrease) the expression or activity of the polynucleotides and/or polypeptides 
of the invention. Such methods can be utilized, for example, for the identification of compounds 
that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are 
not limited to, assays for identifying compounds and other substances that interact with (e.g., 

24 bind to) the polypeptides of the invention. The invention provides a method for identifying a 
compound that binds to the polypeptides of the invention comprising contacting the compound 
with a polypeptide of the invention in a cell for a time sufficient to form a 
polypeptide/compound complex, wherein the complex drives expression of a reporter gene 

28 sequence in the cell; and detecting the complex by detecting the reporter gene sequence 

expression such that if expression of the reporter gene is detected the compound the binds to a 
polypeptide of the invention is identified. 

The methods of the invention also provides methods for treatment which involve the 

32 administration of the polynucleotides or polypeptides of the invention to individuals exhibiting 
symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or 
disorders as recited herein comprising administering compounds and other substances that 
modulate the overall activity of the target gene products. Compounds and other substances can 
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* 

effect such modulation either on the level of target gene/protein expression or target protein 
activity. 

The polypeptides of the present invention and the polynucleotides encoding them are also 
4 useful for the same functions known to one of skill in the art as the polypeptides and- 

polynucleotides to which they have homology (set forth in Table 2). If no homology is set forth 
for a sequence, then the polypeptides and polynucleotides of the present invention are useful for 
a variety of applications, as described herein, including use in arrays for detection. 

8 

i 

4. DETAILED DESCRIPTION OF THE INVENTION 

i 

t t 

12 4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms "a", 
"an" and "the" include plural references unless the context clearly dictates otherwise. 

The terra "active" refers to those forms of the polypeptide which retain the biologic 
16 and/or immunologic activities of any naturally occurring polypeptide. According to the 

invention, the terms "biologically active" or "biological activity" refer to a,protein or peptide 
having structural, regulatory or biochemical functions of a naturally occurring molecule. 
Likewise "immunologically active" or "immunological activity" refers to the capability of the 
20 natural, recombinant or synthetic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are engaged in 
extracellular or intracellular membrane trafficking, including the export of secretory or 
24 enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence S'-AGT-S* binds to the 
complementary sequence 3*-TCA-5\ Complementarity between two single-stranded molecules 

► - > 

28 may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between the nucleic acid strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. 

32 The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 

differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line 
stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady 
and continuous source of germ cells for the production of gametes. The term "primordial germ 

6 
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• ■ 

cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly 
from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 
• differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells 
4 are derived The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells 
not only populate the germ line and give rise to a plurality of terminally differentiated cells that 
comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides which 
8 modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
sequence" .when the expression of the sequence is altered by the presence of the EMF. EMFs 
include, but are not limited to, promoters, and promoter modulating sequences (inducible 

* r 

12 elements). One class of EMFs are nucleic acid fragments which induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
"oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 

1 6 sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the 
sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A, C, G or T 

20 (U). It is contemplated that where the polynucleotide is RNA, the T (thymine) in the sequences 
provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this 
invention may be assembled from fragments of the genome and short oligonucleotide linkers, or 
from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic 

24 acid which is capable of being expressed in a recombinant transcriptional unit comprising 
regulatory elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
"segment" or 6f probe" or "primer" are used interchangeably and refer to a sequence of nucleotide 

28 residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, 

* ■ 

more preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and 
most preferably at least about 17 nucleotides. The fragment is preferably less than about 500 
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100 
32 nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides, 
preferably from about 15 to about 50 nucleotides, more preferably from about 1 7 to 30 
nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can 
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I 

be used in polymerase chain reaction (PCR), various hybridization procedures or microarray 
. procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A 
fragment or segment may uniquely identify each polynucleotide sequence of the present 
4 invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ 
ID NOs: 1-13901 . 

i 

Probes may, for example, be used to determine whether specific mRNA molecules are 
present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as 
8 described by Walsh et al. (Walsh, P.S. et al., 1 992, PCR Methods Appl 1 :241-250). They may 
be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the 
art Probes of the present invention, their preparation and/or labeling are elaborated in 
Sambrook, J. et al., 1 989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
12 Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John 
Wiley & Sons, New York NY, both of which are incorporated herein by reference in their ' 
entirety. 

The nucleic acid sequences of the present invention also include the sequence 

16 information from the nucleic acid sequences of SEQ ID NOS: 1-13901. The sequence 

information can be a segment of any one of SEQ ID NO: 1-13901 that uniquely identifies or 
represents the sequence information of that sequence of SEQ ID NOS: 1-13901. One such 
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty-mer is 

20 fully matched in the human genome is 1 in 300. In the human genome, there are three billion 
base pairs in one set of chromosomes. Because 4 20 possible twenty-mers exist, there are 300 
times more twenty-mers than there are base pairs in a set of human chromosomes. Using the 
same analysis, the probability for a seventeen-mer to be fully matched in the human genome is 

24 approximately 1 in 5. When these segments are used in arrays for expression studies, fifteen- 
mer segments can be used. The probability that the fifteen-mer is fully matched in the expressed 
sequences is also approximately one in five because expressed sequences comprise less than 
approximately 5% of the entire genome sequence. 

28 Similarly, when using sequence information for detecting a single mismatch, a segment can 

be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome 
with a single mismatch is calculated by multiplying the probability for a fiill match (l-^ 25 ) times the 
increased probability for mismatch at each nucleotide position (3 x 25). The probability that an 

32 eighteen mer with a single mismatch can be detected in an array for expression studies is 
approximately one in five. The probability that a twenty-mer with a single mismatch can be 
detected in a human genome is approximately one in five. 

8 
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The term "open reading frame," ORF, means a series of nucleotide triplets coding for 
amino acids, without any termination codons and is a sequence translatable into protein. 

The terms "operably linked" or "operably associated" refer to functionally related nucleic 

4 acid sequences. For example, a promoter is operably associated or operably linked with a coding 

sequence if the promoter controls the transcription of the coding sequence. While operably 

linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic 

elements e.g. repressor genes are not contiguously linked to the coding sequence but still control 

8 transcription/translation of the coding sequence. , 
• ■ . , 

The term "pluripotent" refers to the capability of a cell to differentiate into a number of 

differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its 

differentiation capability in comparison to a totipotent cell. 

12 The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, 

peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or 
synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino 
acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more 

1 6 preferably at least about 9 amino acids and most preferably at least about 1 7 or more amino 

acids. The peptide preferably is not greater than about 200 amino acids, more preferably less 

- 

than 150 amino acids and most preferably less than 100 amino acids. Preferably the peptide is 
from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient 
20 length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that 
have not been genetically engineered and specifically contemplates various polypeptides arising 
. from post-translational modifications of the polypeptide including, but not limited to, 
24 acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

The term "translated protein coding portion" means a sequence which encodes for the full 
length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a peptide 
28 or protein without a signal or leader sequence. The "mature protein portion" means that portion 
of the protein which does not include a signal or leader sequence. The peptide may have been 
produced by processing in the cell which removes any leader/signal sequence. The mature 
protein portion may or may not include the initial methionine residue. The methionine residue 
32 may be removed from the protein during processing in the cell. The peptide may be produced 
synthetically or the protein may have been produced using a polynucleotide only encoding for 
the mature protein coding sequence. 

9 
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The term "derivative" refers to polypeptides chemically modified by such techniques as 
ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer 
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 
4 substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur ~ 

The term "variant" (or "analog") refers to any polypeptide differing from naturally ■» 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., 
8 recombinant DNA techniques. Guidance in detennining which amino acid residues may be 
replaced, added or deleted without abolishing activities of interest, may be found by comparing 
the sequence of the particular polypeptide with that of homologous peptides and minimizing the 
number of amino acid sequence changes made in regions of high homology (conserved regions) 

12 or by replacing amino acids with consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides may be 
synthesized or selected by making use of the "redundancy" in the genetic code. Various codon 
substitutions, such as the silent changes which produce various restriction sites, may be 

16 introduced to optimize cloning into a plasmid or viral vector or expression in a particular 

prokaryotic or eukaryotic system. Mutations in the polynucleotide sequenqe may be reflected in 
the polypeptide or domains of other peptides added to the polypeptide to modify the properties of 
any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain 

20 affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, i.e., conservative amino 
acid replacements. "Conservative" amino acid substitutions may be made on the basis of 

24 similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity , and/or the amphipathic 
nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar 
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

28 glutamine; positively charged (basic) amino acids include argiiiine, lysine, and histidine; and 
negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or 
"deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 
amino acids. The variation allowed may be experimentally determined by systematically making 

32 insertions, deletions, or substitutions of amino acids in a polypeptide molecule using 

* 

recombinant DNA techniques and assaying the resulting recombinant variants for activity. 

Alternatively, where alteration of function is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such alterations 
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can, for' example, alter one or more of the biological functions or biochemical characteristics of 
the polypeptides of the invention. For example, such alterations may change polypeptide 
• characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover 

4 rate. Further, such alterations can be selected so as to generate polypeptides that are better suited 
for expression, scale up and the like in the host cells chosen for expression. For example, 
cysteine residues can be deleted or substituted with another amino acid residue in order to 
eliminate disulfide bridges. 

8 The terms "purified" or "substantially purified" as used herein denotes that the indicated 

nucleic acid or polypeptide is present in the substantial absence of other biological 
macromolecules, e.g, polynucleotides, proteins, and the like. In one embodiment, the 
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more 

« 

12 preferably at least 99% by weight, of the indicated biological macromolecules present (but water, 

■ 

buffers, and other small molecules, especially molecules having a molecular weight of less than 
1000 daltons, can be present). 

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from 

i 

16 at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic acid or 
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in 
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a 
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or 

20 polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian) 
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in 

24 bacterial or fungal (e.g. , yeast) expression systems. As a product, "recombinant microbial" 
defines a polypeptide or protein essentially free of native endogenous substances and 
unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most 
bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or 

28 proteins expressed in yeast will have a glycosylation pattern in general different from those 
expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus 
or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can 

32 comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements 

having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural 
or coding sequence which is transcribed into mRNA and translated into protein, and (3) 
appropriate transcription initiation and termination sequences. Structural units intended for use 
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in yeast or eukaryotic expression systems preferably include a leader sequence enabling 
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant 
protein is expressed without a leader or transport sequence, it may include an amino terminal 
4 methionine residue. This residue may or may not-be subsequently cleaved from the expressed 
recombinant protein to provide a final product. 

The term "recombinant expression system" means host cells which have stably integrated 
a recombinant transcriptional unit into chromosomal DN A or carry the recombinant 
8 transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will 
express heterologous polypeptides or proteins upon induction of the regulatory elements linked 
to the DNA segment or synthetic gene to be expressed. This term also means host cells which . 
have stably integrated a recombinant genetic element or elements having a regulatory role in 

12 gene expression^ for example, promoters or enhancers. Recombinant expression systems as 

defined herein will express polypeptides or proteins endogenous to the cell upon induction of the 
regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells 
can be prokaryotic or eukaryotic. 

16 The term "secreted" includes a protein that is transported across or through a membrane, 

including transport as a result of signal sequences in its amino acid sequence when it is 
expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted 
wholly (e.g. , soluble proteins) or partially (eg. , receptors) from the cell in which they are 

20 expressed. "Secreted" proteins also include without limitation proteins that are transported 
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to 
include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, 

P.A. and Young, P.R. (1992) Cytokine 4(2):134 -143) and factors released from damaged cells 

■. 

24 (e.g. Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. (1998) Annu. Rev. Immunol. 
16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence 

28 may be naturally present on the polypeptides of the present invention or provided from 
heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood in the 
art as stringent Stringent conditions can include highly stringent conditions (i.e. 9 hybridization 

32 to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 
65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent conditions 
washing in 0.2X SSC/0. 1 % SDS at 42°C). Other exemplary hybridization conditions are 
described herein in the examples. 

12 
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In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent 
hybridization conditions include washing in 6X SSC/0>05% sodium pyrophosphate at 37°G (for 
' 14-base oligonucleotides), 48°C (for 17-base oligos), 55°C (for 20-base oligonucleotides), and 
4 60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" can refer both to nucleotide and amino acid 
sequences, for example a mutant sequence, that varies from a reference sequence by one or more 
substitutions, deletions, or additions, the net effect of which does not result in an adverse 
8 functional dissimilarity between the reference and subject sequences. Typically, such a 

substantially equivalent sequence varies from one of those listed herein by no more than about 
35% (i.e., the number of individual residue substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the corresponding reference sequence, divided 

♦ 

12 by the total number of residues in the substantially equivalent sequence is about 0.35 or less). 
Such a sequence is said to have 65% sequence identity to the listed sequence. In one 
embodiment, a substantially equivalent, e.g. , mutant, sequence of the invention varies from a 
listed sequence by no more than 30% (70% sequence identity); in a variation of this 

1 6 embodiment, by no more than 25% (75% sequence identity); and in a further variation of this 
embodiment, by no more than 20% (80% sequence identity) and in a further variation of this 
embodiment, by no more than 1 0% (90% sequence identity) and in a further variation of this 
embodiment, by no more that 5% (95% sequence identity). Substantially equivalent, e.g., 

20 mutant, amino acid sequences according to the invention preferably have at least 80% sequence 
identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more 

■ 

preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably 
at least 98% identity, and most preferably at least 99% identity. Substantially equivalent 

24 nucleotide sequences of the invention can have lower percent sequence identities, taking into 
account, for example, the redundancy or degeneracy of the genetic code. Preferably, nucleotide 
sequence has at least about 65% identity, more preferably at least about 75% identity, more 
preferably at least about 80% sequence identity, more preferably at least about 85% sequence 

28 identity, more preferably at least about 90% sequence identity, and most preferably at least about 
95% identity, more preferably at least about 98% sequence identity, and most preferably at least 
about 99% sequence identity. For the purposes of the present invention, sequences having 
substantially equivalent biological activity and substantially equivalent expression characteristics 

32 are considered substantially equivalent. For the purposes of determining equivalence, truncation 
of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be 
disregarded. Sequence identity may be determined, e.g., using the Jotun Hein method (Hein, J. 
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(1990) Methods Enzymol. 183:626-645). Identity between sequences can also be determined by 
other methods known in the art, e.g. by varying hybridization conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of the cell 
4 types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that the 
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The 
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether 
8 or not any coding sequences are in fact expressed. The term "infection" refers to the introduction 
of nucleic acids into a suitable host cell by use of a virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides 
which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified 

♦ 

12 using known UMFs as a target sequence or target motif with the computer-based systems 
described below. The presence and activity of a UMF can be confirmed by attaching the 1 
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated 
with an appropriate host under appropriate conditions and the uptake of the marker sequence is 

16 determined. As described above, a UMF will increase the frequency of uptake of a linked 
marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless the 
context dictates otherwise. 

♦ 

20 

4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

The isolated polynucleotides of the invention include a polynucleotide comprising the 

24 nucleotide sequences of SEQ ID NO: 1-13901 ; a polynucleotide encoding any one of the 
peptide sequences of SEQ ID NOS: 13902-27802; and a polynucleotide comprising the 
nucleotide sequence encoding the mature protein coding sequence of the polypeptides of any one 
of SEQ ID NOS: 13902-27802. The polynucleotides of the present invention also include, but 

28 are not limited to, a polynucleotide that hybridizes under stringent conditions to (a) the 
complement of any of the nucleotides sequences of SEQ ID NO: 1-1 3901 ; (b) nucleotide 
sequences encoding any one of the amino acid sequences set forth in the Sequence Listing; (c) a 
polynucleotide which is an allelic variant of any polynucleotide recited above; (d) a 

32 polynucleotide which encodes a species homolog of any of the proteins recited above; or (e) a 

■ 

polynucleotide that encodes a polypeptide comprising a specific domain or truncation of the 
polypeptides of SEQ ID NOS: 1 3902-27802. Domains of interest may depend on the nature of 
the encoded polypeptide; e.g., domains in receptor-like polypeptides include ligand-binding, 

14 
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extracellular, transmembrane, or cytoplasmic domains, or combinations thereof; domains in 
immunoglobulin-like proteins include the variable immunoglobulin-like domains; domains in 
• enzyme-like polypeptides include catalytic and substrate binding domains; and domains in ligand 
4 polypeptides include receptor-binding domains. 

The polynucleotides of the invention include naturally occurring or wholly or partially 
synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 

■ 

g region of the cDNA. 

The present invention also provides genes corresponding to the cDN A sequences disclosed 
herein. The corresponding genes can be isolated in accordance with known methods using the 
sequence information disclosed herein. Such methods include the preparation of probes or primers 
1 2 from the disclosed sequence information for identification and/or amplification of genes in 

' * 

appropriate genomic libraries or other sources of genomic materials. Further 5' and 3' sequence can 
be obtained using methods known in the art. For example, full length cDNA or genomic DNA that 
corresponds to any of the polynucleotides of SEQ ID NO: 1-13901 can be obtained by screening 
1 6 appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of 
the polynucleotides of SEQ ID NO: 1-13901 or a portion thereofas a probe. Alternatively, the 
polynucleotides of SEQ ID NO: 1 -13901 may be used as the basis for suitable primer(s) that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

* 

20 The nucleic acid sequences of the invention can be assembled from ESTs and sequences 

(including cDNA and genomic sequences) obtained from one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, 
representative fragment or segment information, or novel segment information for the full-length 

24 gene. * 

The polynucleotides of the invention also provide polynucleotides including nucleotide 
sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides 
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 

28 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 
88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at 
least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above. 

Included within the scope of the nucleic acid sequences of the invention are nucleic acid 

32 sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences 
of SEQ ID NO: 1-13901, or complements thereof which fragment is greater than about 5 
nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most 
preferably greater than 17 nucleotides. Fragments of, e.g. 1 5, 17, or 20 nucleotides or more that 

15 
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are selective for {i.e. specifically hybridize to any one of the polynucleotides of the invention) 
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can 
differentiate polynucleotide sequences of the invention from other polynucleotide sequences in 
4 the same family of genes or can differentiate human genes from genes of other species, and are 
preferably based on unique nucleotide sequences. ■ ■ 

The sequences falling within the scope of the present invention are not limited to these 
specific sequences, but also include allelic and species variations thereof. Allelic and species 
8 variations can be routinely determined by comparing the sequence provided SEQ ID NO: 1 -1 3901 , 
a representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95% ;, 
identical,toSEQEDNO: 1-13901 with a sequence from another isolate of the same species. 
Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules 

12 coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words, 
in the coding region of an ORF, substitution of one codon for another codon that encodes the same 
amino acid is expressly contemplated. 

The nearest neighbor or homology result for the nucleic acids of the present invention, 

16 including SEQ ID NO: 1-13901 can be obtained by searching a database using an algorithm or a 
program. Preferably, a BLAST which stands for Basic Local Alignment Search Tool is used to 
search for local sequence alignments (Altshul, SJ\ J Mol. EvoL 36 290-300 (1 993) and Altschul 
S.F.etal. J. MoLBioL 21:403-410(1990)). Alternatively a FASTA version 3 searchagainst 

20 Genpept, using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also 
provided by the present invention. Species homologs may be isolated and identified by making 
suitable probes or primers from the sequences provided herein and screening a suitable nucleic 

24 acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which also 
encode proteins which are identical, homologous or related to that encoded by the 

28 polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences which 
encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 

32 native or variant polynucleotide; There are two variables in the construction of amino acid 
sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids 
encoding the amino acid sequence variants are preferably constructed by mutating the 
polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic 

16 



WO 01/64835 PCT/US01/04927 

acid alterations can be made at sites that differ in the nucleic acids from different species 
(variable positions) or in highly conserved regions (constant regions). Sites at such locations 
1 will typically be modified in series, e.g., by substituting first with conservative choices (e.g. , 

4 hydrophobic amino acid to a different -hydrophobic amino acid) and then with more distant 
choices (e.g., hydrophobic amino acid to a charged amino acid), and then deletions or insertions 
may be made at the target site. Amino acid sequence deletions generally range from about 1 to 
30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid 

8 insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one 
hundred or more residues, as well as intrasequence insertions of single or multiple amino acid 
residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, 
preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal 

12 sequences necessary for secretion or for intracellular targeting in different host cells and 

■ 

sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method use£ oligonucleotide sequences to alter a 

16 polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 

nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the 
site of being changed. In general, the techniques of site-directed mutagenesis are well known to 
those of skill in the art and this technique is exemplified by publications such as, Edelman et al., 

20 DNA 2:1 83 (1983). A versatile and efficient method for producing site-specific changes in a 
polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 
(1982). PCRmay also be used to create amino acid sequence variants of the novel nucleic acids. 
When small amounts of template DNA are used as starting material, primer(s) that differs 

24 slightly in sequence from the corresponding region in the template DNA can generate the desired 
amino acid variant. PCR amplification results in a population of product DNA fragments that 
differ from the polynucleotide template encoding the polypeptide at the position specified by the 
primer. The product DNA fragments replace the corresponding region in the plasmid and this 

28 gives a polynucleotide encoding the desired amino acid variant. 

A further technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques well 
known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current 

32 Protocols in Molecular Biology;, Ausubel et al. Due to the inherent degeneracy of the genetic 
code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be used in the practice of the invention for the cloning and expression 
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■ i 

of these novel nucleic acids. Such DNA sequences include those which are capable of , . , 
hybridizing to the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can be used 
4 . to generate polynucleotides encoding chimeric or fusion proteins comprising one or more 
domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of the 
polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or 
8 synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known 
to those of skill in the art and can include, for example, methods for determining hybridization , 
conditions that can routinely isolate polynucleotides of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 

1 2 protein coding sequences corresponding to any one of SEQ ID NO: 1 - 1 390 1 , or functional 
equivalents thereof, may be used to generate recombinant DNA molecules that direct the 
expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also 
included are the cDNA inserts of any of the clones identified herein. 

16 A polynucleotide according to the invention can be joined to any of a variety of other 

nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful 
nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g. , 

20 plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the 
art. Accordingly, the invention also provides a vector including a polynucleotide of the 
invention and a host cell containing the polynucleotide. In general, the vector contains an origin 
of replication functional in at least one organism, convenient restriction endonuclease sites, and a 

24 selectable marker for the host cell. Vectors according to the invention include expression 
vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell 

* * * 

according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular 
organism or part of a multicellular organism. 

28 The present invention further provides recombinant constructs comprising a nucleic acid 

having any of the nucleotide sequences of SEQ ID NO: 1-13901 or a fragment thereof or any 
other polynucleotides of the invention. In one embodiment, the recombinant constructs of the 
present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid 

32 having any of the nucleotide sequences of SEQ ID NO: 1-13901 or a fragment thereof is 

inserted, in a forward or reverse orientation. In the case- of a vector comprising one of the ORFs 
of the present invention, the vector may further comprise regulatory sequences, including for 
example, a promoter, operably linked to the ORF. Large numbers of suitable vectors and 
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promoters are known to those of skill in the art and are commercially available for generating the 
recombinant constructs of the present invention. The following vectors are provided by way of 
1 example. Bacterial: pBs^hagescriptPsiXiy^pBluescriptSKjpBsKS^pNHSajpNHiea, 
_ 4 pNHl 8a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). 
Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pS VK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
8 control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et aL, 
Nucleic Acids Res. 1 9, 4485-4490 (1 991), in order to produce the protein recombinantly. Many 
suitable expression control sequences are known in the art. General methods of expressing 
recombinant proteins are also known and are exemplified in R. Kaufman, Methods in 
12 Enzymology 1 85, 537-566 (1 990). As defined herein "operably linked" means that the isolated 
polynucleotide of the invention and an expression control sequence are situated within a vector 
or cell in such a way that the protein is expressed by a host cell which has been transformed 

* 

(transfected) with the ligated polynucleotide/expression control sequence. 

16 Promoter regions can be selected from any desired gene using CAT (chloramphenicol 

transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, 
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine 

20 kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-L Selection of 
the appropriate vector and promoter is well within the level of ordinary skill in the art. 
Generally, recombinant expression vectors will include origins of replication and selectable 
markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli 

24 and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct 

transcription of a downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 

» ■ 

28 assembled in appropriate phase with translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing secretion of translated protein into the 
periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desired 

32 characteristics, e.g. , stabilization or simplified purification of expressed recombinant product 
Useful expression vectors for bacterial use are constructed by inserting a structural DNA 
sequence encoding a desired protein together with suitable translation initiation and termination 
signals in operable reading phase with a functional promoter. The vector will comprise one or 
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more phenotypic selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for 
• transformation include E coti, Bacillus subtilis, Salmonella typhimurium and various species 
4 within the genera Pseudomonas, Streptornyces, mdrStaphylococcus, although others may also be 

» * * • 

employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived from commercially 
8 available plasmids comprising genetic elements of the well known cloning vector pBR322 
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural 
12 sequence to be expressed. Following transformation of a suitable host strain and growth of the 

■ 

host strain to an appropriate cell density, the selected promoter is induced or derepressed by 
appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an 
additional period. Cells are typically harvested by centrifugation, disrupted by physical or 

1 6 chemical means, and the resulting crude extract retained for further purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et aL, Nat. Biotech. 17:870-872 (1999), incorporated herein by 
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 

20 against the encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intramuscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form of 
naked DNA. 

24 

43 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 

28 sequence of SEQ ID NO: 1-13901 , or fragments, analogs or derivatives thereof. An "antisense" 
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid 

32 molecules are provided that comprise a sequence complementary to at least about 1 0, 25, 50, 

100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid 
molecules encoding fragments, homologs, derivatives and analogs of a protein of any of SEQ ID 
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NOS: 13902-27802 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ 
ID NO: 1 - 1 390 1 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
4 of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers 
to the region of the nucleotide sequence comprising codons which are translated into amino acid 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence of the invention. The term 
8 "noncoding region" refers to 5' and 3 ! sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., SEQ 
. ID NO: 1-13901 ), antisense nucleic acids of the invention can be designed according to the rules 

12 of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 

complementary to the entire coding region of a mRNA, but more preferably is an oligonucleotide 
that is antisense to only a portion of the coding or noncoding region of a mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 

16 start site of a mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 
30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the.invention can be 
constructed using chemical synthesis or enzymatic ligation reactions using procedures known in 
the art. For example, an antisense nucleic acid (e.g. , an antisense oligonucleotide) can be 

20 chemically synthesized using naturally occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 

24 Examples of modified nucleotides that can be used to generate the antisense nucleic acid 

include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 

28 inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 

2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5 f -methoxycarboxymethyluracil, 5-methoxyuracil, 

32 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluraciI, 
uracil-5 -oxy acetic acid methylester, uracil-5-oxy acetic acid (v), 5 -methy 1-2-thiouracil, 

3- <3-amino-3-N-2-ciarboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 

21 
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antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (Le., RNA transcribed from the 
1 inserted nucieic acid will be of an antisense orientation to a target nucleic acid of interest, 
4 described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
8 protein, e.g. , by inhibiting transcription and/or translation. The hybridization can be by 

♦ 

conventional nucleotide complementarity to form a stable duplex, or, for example , in the case of 
an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 

12 nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g. , 

16 by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the 

20 control of a strong pol II or pol HI promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
-a n omeric nucleic acid molecule. An -anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual -uni ts, the 

24 strands run parallel to each other (Gaultier et ah (1987) Nucleic Acids Res 15: 6625-6641). The 
antisense nucleic acid molecule can also comprise a 2-o-methylribonucleotide (Inoue et al. 
(1987) Nucleic Acids Res 15: 6131-6148) or achimeric RNA -DNA analogue (Inoue etal. (1987) 
FEBSLett 2 15: 327-330). 

28 

- 

4.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 
32 single-stranded nucleic acid, such as a mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave a mRNA transcripts to thereby inhibit 

T * 

* 

translation of a mRNA. A ribozyme having specificity for a nucleic acid of the invention can be 
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designed based upon the nucleotide sequence of a DNA disclosed herein (Le., SEQ ID NO; 1- 
13901 ). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in 
which the nucleotide sequence of the active site is complementary to the nucleotide sequence to 
4 be cleaved in a SECX-encoding mRNA. See, e.g. , Cech et al U.S. Pat No. 4,987,071; and 
Cech et al. U.S. Pat. No. 5,1 16,742. Alternatively, SECX mRNA can be used to select a 
catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e. g. , 
Bartel et al., (1993) Science 261:1411-1418. 
8 Alternatively, gene expression can be inhibited by targeting nucleotide sequences 

complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical ;, 
structures that prevent transcription of the gene in target cells. See generally, Helene. (1991) , 
Anticancer Drug Des. 6: 569-84; Helene. et al (1992) Arm. N.Y. Acad Sci 660:27-36; and 

12 Maher (1992) Bioassays 14: 807-15. 

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g. , the stability, hybridization, or 
solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic 

16 acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1 996) Bioorg Med 

Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g. , DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 

20 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis ofPNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above; 
Perry-O'Keefee/*/. (1996) PNAS93: 14670-675. 

24 PNAs of the invention can be used in therapeutic and diagnostic applications. For 

example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of . 
gene expression by, e.g. , inducing transcription or translation arrest or inhibiting replication. 
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a 

28 gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e.g. , SI nucleases (Hyrup B. (1996) above); or as probes or 
primers for DNA sequence and hybridization (Hyrup et al (1996), above; Perry-O'Keefe (1996), 
above). 

32 In another embodiment, PNAs of the invention can be modified, e.g. , to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras can be generated that may 
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combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA 
portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked 
4 using linkers of appropriate lengths selected in terms of base stacking, number of bonds between 
the hucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras 

• ■ 

can be performed as described in Hyrup (1 996) above and Finn et al. (1 996) Nucl Acids Res 24: 
3357-63. For example, a DNA chain can be synthesized on a solid support using standard 
8 phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 

5 , -(4-methoxytrityl)ainino-5 , -deoxy-thymidine phosphoramidite, can be used between the PNA 
and the 5' end of DNA (Mag et al (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then 
coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' 

12 DNA segment (Finn et al (1996) above). Alternatively, chimeric molecules can be synthesized 

i 

with a 5' DNA segment and a 3' PNA segment. See, Petersen et al (1975) BioorgMed Chem 
Lett 5: 1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such as 
1 6 peptides (e.g. , for targeting host cell receptors in vivo), or agents facilitating transport across the 
cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl Acad Set U.S.A. 86:6553-6556; 
Lemaitre et al 9 1987, Proc. Natl Acad. Set 84:648-652; PCT Publication No. W088/09810) or 
the blood-brain bairier (see, e.g., PCT Publication No. W089/10134). In addition, 
20 oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g. , Krol et 
al., 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharnu Res. 
5: 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
24 cleavage agent, etc. 

4,5 HOSTS 

The present invention further provides host cells genetically engineered to contain the 
28 polynucleotides of the invention. For example, such host cells may contain nucleic acids of the 
invention introduced into the host cell using known transformation, transfection or infection 
methods. The present invention still further provides host cells genetically engineered to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 
32 vvith a regulatory sequence heterologous to the host cell which drives expression of the 
polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous 

24 
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recombination) to provide increased polypeptide expression by replacing, in whole or in part, the 
naturally occurring promoter with all or part of a heterologous promoter so that the cells express 
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it 
A is operatively linked to the encoding sequences. See, for example, PCT International Publication ~~ 
No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International 
Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter ' 1 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 

8 encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 

» 

intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding, 
sequence, amplification of the marker DNA by standard selection methods results in co- 
amplification of the desired protein coding sequences in the cells. 
12 The host cell can be a higher eukaiyotic host cell, such as a mammalian cell, a lower 

i 

eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 

■ « • 

bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfectioh, DEAE, dextran mediated transfection, or electroporation (Davis, 

16 L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the 
polynucleotides of the invention, can be used in conventional manners to produce the gene 
product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 

20 Any host/vector system can be used to express one or more of the ORFs of the present 

invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, 
COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. colt and B. subtilis. 
The most preferred cells are those which do not normally express the particular polypeptide or 

24 protein or which expresses the polypeptide or protein at low natural level. Mature proteins can 
be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins using • 
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and 

28 expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et 
al, in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New 
York (1 989), the disclosure of which is hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express recombinant 

32 protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney 
fibroblasts, described by Gluzman, Cell 23 : 1 75 ( 1 98 1 ). Other cell lines capable of expressing a 
compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary 
(CHO) cells, humah kidney 293 cells, human epidermal A43 1 cells, human Colo205 cells, 3T3 

* 
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cells, CV-1 cells, other transformed primate cell lines a normal diploid cells, cell strains derived 
from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, 
1 HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of 

4 replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation 
site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, 
S V40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide 

8 the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced 
in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or 
more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature protein. 
12 Finally, high performance liquid chromatography (HPLC) can be employed for final purification 

* • 

steps. Microbial cells employed in expression of proteins can be disrupted by any convenient 
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 
agents. 

16 Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast 

or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Klvyveromyces strains, Candida, or 
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 

20 strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 

strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it 
may be necessary to modify the protein produced therein, for example by phosphorylation or 
glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent 

24 attachments may be accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene 

28 may be replaced by homologous recombination. As described herein, gene targeting can be used 
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different 
gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 
regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, 

32 negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences which affect the structure or stability 
of the RN A or protein produced may be replaced, removed, added, or otherwise modified by 
targeting. These sequence include polyadenylation signals, mRNA stability elements, splice 
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sites, leader sequences for enhancing or modifying transport or secretion properties of the . 
protein, or other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

. 4 The targeting event may be a simple insertion of the regulatory sequence, placing the 

gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion 
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element 

8 Alternatively, the targeting event may replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than ; 
the naturally occurring elements. Here, the naturally occurring sequences are deleted and new . 
sequences are added. In all cases, the identification of the targeting event may be facilitated by 
12 the use of one or more selectable marker genes that are contiguous with the targeting DNA, 

i 

allowing for the selection of cells in which the exogenous DNA has integrated into the host' cell 
genome. The identification of the targeting event may also be facilitated by the use of one or 
more marker genes exhibiting the property of negative selection, such that the negatively 

16 selectable marker is linked to the exogenous DNA, but configured such that the negatively 
selectable marker flanks the targeting sequence, and such that a correct homologous 
recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers useful for this purpose include the 

20 Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 

24 Chappel; U.S. Patent No. 5,578,461 to Sherwin et ah; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et aL; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et aL, each of which is incorporated by reference 
herein in its entirety. 

28 

4.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising: the amino acid sequences set forth as any one of SEQ ED NOS: 13902-278Q2 or an 
32 amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1-13901 or 

■ 

the corresponding full length or mature protein. Polypeptides of the invention also include 
polypeptides preferably with biological or immunological activity that are encoded by: (a) a 
polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO: 1-13901 or 

27 
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(b) polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NOS: : 
1 3902-27802 or (c) polynucleotides that hybridize to the complement of the polynucleotides of 
• either (a) or (b) under stringent hybridization conditions. The invention also provides 

4 biologically active or immunologically active variants of any of the amino acid sequences set 
forth as SEQ ID NOS: 13902-27802 or the corresponding full length or mature protein; and 
"substantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least about 
75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at least about 90%, 91%, 

8 92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically at least about 98%, or 
most typically at least about 99% amino acid identity) that retain biological activity. 
Polypeptides encoded by allelic variants may have a similar, increased, or decreased activity 
compared to polypeptides comprising SEQ ID NOS: 1 3902-27802. 

12 Fragments of the proteins of the present invention which are capable of exhibiting 

■ 

biological activity are also encompassed by the present invention. Fragments of the protein may 

be in linear form or they may be cyclized using known methods, for example, as described in H. 

U. Saragovi, et aL, Bio/Technology 1 0, 773-778 (1 992) and in R. S. McDowell, et ah, J. Amer. 
16 Chem. Soc. 1 14, 9245-9253 (1992), both of which are incorporated herein by reference. Such 

fragments may be fused to carrier molecules such as immunoglobulins for many purposes, 

including increasing the valency of protein binding sites. 

The present invention also provides both full-length and mature forms (for example, 
20 without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding 

sequence is identified in the sequence listing by translation of the disclosed nucleotide 

♦ 

sequences. The mature form of such protein may be obtained by expression of a full-length 
polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form 

24 of the protein is also determinable from the amino acid sequence of the full-length form. Where 
proteins of the present invention are membrane bound, soluble forms of the proteins are also 
provided. In such forms, part or all of the regions causing the proteins to be membrane bound 
are deleted so that the proteins are fully secreted from the cell in which they are expressed. 

28 Protein compositions of the present invention may further comprise an acceptable carrier, 

such as a hydrophilic, e.g. , pharmaceutical^ acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the nucleic acid 
fragments of the present invention or by degenerate variants of the nucleic acid fragments of the 

32 present invention. By "degenerate variant" is intended nucleotide fragments which differ from a 
nucleic acid fragment of the present invention (e.g. , ah ORF) by nucleotide sequence but, due to 
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic 

acid fragments of the present invention are the ORFs that encode proteins. 

■ 
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WO 01/64835 PCT/US01/04927 

i 

A variety of methodologies known in the art can be utilized to obtain any one of the . 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid 
sequence can be synthesized using commercially available peptide synthesizers. The 
4 synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary 
structural and/or conformational characteristics with proteins may possess biological properties 
in common therewith, including protein activity: This technique is particularly useful in 
producing small peptides and fragments of larger polypeptides. Fragments are useful, for 
8 example, in generating antibodies against the native polypeptide. Thus, they may be employed 
as biologically active or immunological substitutes for natural, purified proteins in screening of 
therapeutic compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified from 
12 cells which have been altered to express the desired polypeptide or protein. As used herein, a 

i ' 

cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or 
which the cell normally produces at a lower level. One skilled in the art can readily adapt 
16 procedures for introducing and expressing either recombinant or synthetic sequences into 

► 

eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides 
or proteins of the present invention. 

■ 

The invention also relates to methods for producing a polypeptide comprising growing a 
20 culture of host cells of the invention in a suitable culture medium, and purifying the protein from 
the cells or the culture in which the cells are grown. For example, the methods of the invention 
include a process for producing a polypeptide in which a host cell containing a suitable 
expression vector that includes a polynucleotide of the invention is cultured under conditions that 
24 allow expression of the encoded polypeptide. The polypeptide can be recovered from the 

culture, conveniently from the culture medium, or from a lysate prepared from the host cells and 
further purified. Preferred embodiments include those in which the protein produced by such 
process is a full length or mature form of the protein. 

- 

28 In an alternative method, the polypeptide or protein is purified from bacterial cells which 

naturally produce the polypeptide or protein. One skilled in the art can readily follow known 
methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present inventioa These include, but are not limited to, 

32 immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, 
and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and 
Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory 
Manual, Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that 
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retain biological/immunological activity include fragments comprising greater than about 100 
amino acids, or greater than about 200 amino acids, and fragments that encode specific protein 
1 domains. 

4 The purified polypeptides can be used in in vitro binding assays which are well known in 

the art to identify molecules which bind to the polypeptides. These molecules include but are not 
limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other 
proteins. The molecules.identified in the binding assay are then tested for antagonist or agonist 

8 activity in in vivo tissue culture or animal models that are well known in the art In brief, the 

■ • 

molecules are titrated into a plurality of cell cultures or animals and then tested for either 
cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the peptides 

12 may be complexed with toxins, e.g., hem or cholera, or with other compounds that are toxic to 

% 

cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the 
specificity of the binding molecule for SEQ ID NOS: 1 3902-27802. 

The protein of the invention may also be expressed as a product of transgenic animals, 

16 e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized 
by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid sequences 
similar to those of purified proteins but into which modification are naturally provided or 

20 deliberately engineered. For example, modifications, in the peptide or DNA sequence, can be 
made by those skilled in the art using known techniques. Modifications of interest in the protein 
sequences may include the alteration, substitution, replacement, insertion or deletion of a 
selected amino acid residue in the coding sequence. For example, one or more of the cysteine 

24 residues may be deleted or replaced with another amino acid to alter the conformation of the 
molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are 
well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such 
alteration, substitution, replacement, insertion or deletion retains the desired activity of the 

28 protein. Regions of the protein that are important for the protein function can be determined by 
various methods known in the art including the alanine-scanning method which involved 
systematic substitution of single or strings of amino acids with alanine, followed by testing the 
resulting alanine-containing variant for biological activity. This type of analysis determines the 

32 importance of the substituted amino acid(s) in biological activity. Regions of the protein that are 
important for protein function may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be expected to 
retain protein activity in whole or in part and are useful for screening or other immunological 
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methodologies may also be easily made by those skilled in the art given the disclosures herein. 
Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 
4 invention to suitable control sequences in one or more insect expression vectors, and employing 
an insect expression system. Materials and methods for baculovims/insect cell expression 
systems are commercially available in kit form from, e.g. , Invitrogen, San Diego, Calif., U.S A. 
(the MaxBat™ kit), and such methods are well known in the art, as described in Summers and 
8 Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by 
reference. As used herein, an insect cell capable of expressing a polynucleotide of the present 
invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells under 
12 culture conditions suitable to express the recombinant protein. The resulting expressed protein 
may then be purified from such culture (i.e., from culture medium or cell extracts) using known 
purification processes, such as gel filtration and ion exchange chromatography. The purification 
of the protein may also include an affinity column containing agents which will bind to the 
16 protein; one or more column steps over such affinity resins as concanavalin A-agarose, 

i 

heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving 
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoaffinity chromatography. 

20 Alternatively, the protein of the invention may also be expressed in a form which will 

facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a 
His tag. Kits for expression md purification of such fusion proteins are commercially available 

24 from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) arid Invitrogen, 

respectively. The protein can also be tagged with an epitope and subsequently purified by using 
a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially 
available from Kodak (New Haven, Conn.). 

28 Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) 

steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other 
aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing 
purification steps, in various combinations, can also be employed to provide a substantially 

32 homogeneous isolated recombinant protein. The protein thus purified is substantially free of 

other mammalian proteins and is defined in accordance with the present invention as an "isolated 
protein." 
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The polypeptides of the invention include analogs (variants). This embraces fragments, 
as well as peptides in which one or more amino acids has been deleted, inserted, or substituted. 
1 Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or 
4 modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to 

• ■ 

another moiety or moieties, e.g. , targeting moiety or another therapeutic agent. Such analogs 
may exhibit improved properties such as activity and/or stability. Examples of moieties which 
may be fused to the polypeptide or an analog include, for example, targeting moieties which 
8 provide for the delivery of polypeptide to pancreatic cells, e.g. , antibodies to pancreatic cells, 
antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well 
as receptor and ligands expressed on pancreatic or immune cells. Other moieties which may be 
fused to the polypeptide include therapeutic agents which are used for treatment, for example, 
12 immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and 

steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as 
alpha or beta interferon. 

16 4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 

AND SIMILARITY 

Preferred identity and/or similarity are designed to give the largest match between the 
sequences tested. Methods to determine identity and similarity are codified in computer 

20 programs including, but are not limited to, the GCG program package, including GAP 

(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTNf, BLASTX, PASTA (Altschul, S.F. 
et al., J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al., Nucleic Acids Res. 

24 vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp. 
Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software (Neyill- 
Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software 
(Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by 

28 reference) and the Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 
105-3 1 (1 982), incorporated herein by reference). The BLAST programs are publicly available 
from the National Center for Biotechnology Information (NCBI) and other sources (BLAST 
Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. 

32 Biol. 215:403-410 (1990). 

4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 

protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 

32 



WO 01/64835 PCT/US01/04927 

■ 

f 

another polypeptide. Within a fusion protein the polypeptide according to the invention can 
correspond to all or a portion of a protein according to the invention. In one embodiment, a 
fusion protein comprises at least one biologically active portion of a protein according to the 

4 invention. In another embodiment, a fusion protein comprises at least two biologically active 
portions of a protein according to the invention. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the polypeptide according to the invention and the other 
polypeptide are fused in-frame to each other. The polypeptide can be fused to the N-terminus or 

8 C-terminus. 

For example, in one embodiment a fusion protein comprises a polypeptide according to 
the invention operably linked to the extracellular domain of a second protein. 

In another embodiment, the fusion protein is a GST-fusion protein in which the 

i2 polypeptide sequences of the invention are fused to the C-terminus of the GST (i. e. , glutathione 
S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in which 
the polypeptide sequences according to the invention comprises one or more domains are fused 

16 to sequences derived from a member of the immunoglobulin' protein family. The 

immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo. 

20 The immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand. 
Inhibition of the ligand/protein interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, e,g., cancer as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be 

24 used as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays 
to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand. 

A chimeric or fusion protein of the invention can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide sequences 

28 are ligated together in-frame in accordance with conventional techniques, e.g. , by employing 
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can 

32 be synthesized by conventional techniques including automated DNA synthesizers. 

■ 

Alternatively, PGR amplification of gene fragments can be carried out using anchor primers that 
give rise to complementary overhangs between two consecutive gene fragments that can 
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for 
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example, Ausubel et al. (eds.) Current Protocols in Molecular Biology, John Wiley & 
Sons, 1 992). Moreover, many expression vectors are commercially available that already encode 
a fusion moiety (e.g. , a GST polypeptide). A nucleic acid encoding a polypeptide of the 
4 invention can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the protein of the invention. 

* * 

4.8 GENE THERAPY 
8 Mutations in the polynucleotides of the invention gene may result in loss of normal 

function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides of 
the invention. Delivery of a functional gene encoding polypeptides of the invention to 

* 

12 appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly 
viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of ■ 
physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, 
Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). ¥ot additional reviews of 

16 gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific 

American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of 

it. 

the nucleotides of the present invention or a gene encoding the polypeptides of the present 
invention can also be accomplished with extrachromosomal substrates (transient expression) or 

20 artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 
Alternatively, it is contemplated that in other human disease states, preventing the expression of 

24 or inhibiting the activity of polypeptides of the invention will be useful in treating the disease 

states. It is contemplated that antisense therapy or gene therapy could be applied to negatively 

regulate the expression of polypeptides of the invention. 

* i • ■ ... 

Other methods inhibiting expression of a protein include the introduction of antisense 

28 molecules to the nucleic acids of the present invention, their complements, or their translated RNA 
sequences, by methods known in the art. Further, the polypeptides of the present invention can be 
inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such 
as a silencer, which is tissue specific. 

32 The present invention still further provides cells genetically engineered in vivo to express the 

polynucleotides of the invention, wherein such polynucleotides are in operative association with a 
regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in 
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the cell.' These methods can be used to increase or decrease the expression of the polynucleotides of 
the present invention. 

Knowledge of DN A sequences provided by the invention allows for modification of cells to 
4 permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g. , by 
homologous recombination) to provide increased polypeptide expression by replacing, in whole or 
in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells 
express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is 
8 operatively linked to the desired protein encoding sequences. See, for example, PCT International 
Publications. WO 94/1 2650, PCT International PubUcationNo. WO 92/20808, and PCT 
InternationalPubUcationNo. WO 91/09955. It is also contemplated that, in addition to heterologous 
promoter DNA, amplifiable marker DNA (e.g. , ada, dhfr, and the multifunctional CAD gene which 

12 encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired 
protein coding sequence, amplification of the marker DNA by standard selection methods results in 
co-amplification of the desired protein coding sequences in the cells. 

16 In another embodiment of the present invention, cells and tissues may be engineered to 

express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may 
be replaced by homologous recombination. As described herein, gene targeting can be used to 

20 replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene 
or a novel regulatory sequence synthesized by genetic engineeringmethods. Such regulatory 
sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative 
regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations 

24 of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or 

protein produced may be replaced, removed, added, or otherwise modified by targeting. These 

• ■ 

sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences 
for enhancing or modifying transport or secretion properties of the protein, or other sequences 

28 which alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the gene 
under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both 
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory 

32 element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the 
targeting event may replace an existing element; for example, a tissue-specific enhancer can be 

9 

replaced by an enhancer that has broader or different cell-type specificity than the naturally 
occurring elements. Here, the naturally occurring sequences are deleted and new sequences are 
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added In all cases, the identification of the targeting event may be facilitated by the use of one or 
more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection 
of cells in which the exogenous DNA has integrated into the cell genome. The identification of the 

4 targeting event may also be facilitated by the use of one or more marker genes exhibiting the 

property of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and 
such that a correct homologous recombination event with sequences in the host cell genome does 
8 not result in the stable integration of the negatively selectable marker. Markers useful for this 
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial 
xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with this 

1 2 aspect of the invention are more particularly describedin U.S. Patent No. 5,272,07 1 to Chappel; 
U.S. Patent No. 5^578,461 to Sherwin et aL; International AppUcationNo. PCT/US92/09627 * 
(WO93/09222)by Seldenet aL; and International AppUcationNo. PCT/US90/06436 
(W09 1 /06667) by Skoultchi et aL, each of which is incorporated by reference herein in its entirety. 

16 * 

4.9 TRANSGENIC ANIMALS ',. 

In preferred methods to determine biological functions of the polypeptides of the 

_ . -. 

invention in vivo, one or more genes provided by the invention are either over expressed or 

20 inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination are 

24 referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 
animals are useful to determine the roles polypeptides of the invention play in biological 
processes, and preferably in disease states. Transgenic animals are useful as model systems to 

28 identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28 122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 

32 polynucleotides of the invention is either activated or inactivated to alter the level of expression 

■ 

of the polypeptides of the invention; Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or even 
replacing the homologous promoter to provide for increased protein expression. The homologous 
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promoter can be supplemented by insertion of one or more heterologous enhancer elements 
known to confer promoter activation in a particular tissue. 
. The* polynucleotides of the present invention also make possible the development, 

4 through, e.g. , homologous recombination or knock out strategies, of animals that fail to express 

* • * 

polypeptides of the invention or that express a variant polypeptide. Such animals are useful as 
models for studying the in vivo activities of polypeptide as well as for studying modulators of the 
polypeptides of the invention. 

* * i ■ 

8 In preferred methods to determine biological functions of the polypeptides of the 

i 

invention in vivo, one or more genes provided by the invention are either over expressed or 
. inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 

12 control of exogenous or endogenous promoter elements, are known as transgenic animals. 

Animals in which an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 

1 6 animals are useful to determine the roles polypeptides of the invention play in biological 

processes, and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 

20 Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of the 
invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous recombination 

24 methods described above. Activation can be achieved by supplementing or even replacing the 
homologous promoter to provide for increased protein expression. The homologous promoter 
can be supplemented by insertion of one or more heterologous enhancer elements known to 
confer promoter activation in a particular tissue. 

28 

4.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one or 
more of the uses or biological activities (including those associated with assays cited herein) 
32 identified herein. Uses or activities described for proteins of the present invention may be 

provided by administration or use of such proteins or of polynucleotides encoding such proteins 
(such as, for example, in gene therapies or vectors suitable for introduction of DNA). The 
mechanism underlying the particular condition or pathology will dictate whether the . 
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polypeptides of the invention, the polynucleotides of the invention or modulators (activators or 
inhibitors) thereof would be beneficial to the subject in need of treatment. Thus, "therapeutic 
compositions of the invention" include compositions comprising isolated polynucleotides 

4 (including recombinant DNA molecules, cloned genes and degenerate variants thereof) or ~~ 
polypeptides of the invention (including full length protein, mature protein and truncations or 
domains thereof), or compounds and other substances that modulate the overall activity of the 
target gene products, either at the level of target gene/protein expression or target protein 

8 activity. Such modulators include polypeptides, analogs, (variants), including fragments and 
fusion proteins, antibodies and other binding proteins; chemical compounds that directly or 
indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening 
assays as described herein); antisense polynucleotides and polynucleotides suitable for triple 
12 helix formation; and in particular antibodies or other binding partners that specifically recognize 
one or more epitopes of the polypeptides of the invention. • 

The polypeptides of the present invention may likewise be involved in cellular activation 
or in one of the other physiological pathways described herein. 

16 

4.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express recombinant 

20 protein for analysis, characterization or therapeutic use; as markers for tissues in which the 

- 

corresponding protein is preferentially expressed (either constitutively or at a particular stage of 
tissue differentiation or development or in disease states); as molecular weight markers on gels; 
as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene 

24 positions; to compare with endogenous DNA sequences in patients to identify potential genetic 
disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; for selecting and making 

28 oligomers for attachment to a "gene chip" or other support, including for examination of 

expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as 
an antigen to raise anti-DNA antibodies or elicit another immune response. Where the 
polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for 

32 example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap 
assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of 
the binding interaction. 
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The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 
• screening; to raise antibodies or to elicit another immune response; as a reagent (including the 
4 labeled reagent) in assays designed to quantitatively determine levels of the protein (or its 
receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is 
preferentially expressed (either constitutively or at a particular stage of tissue differentiation or 
development or in a disease state); and, of course, to isolate correlative receptors or ligands. 
8 Proteins involved in these binding interactions can also be used to screen for peptide or small 
molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or 
kit format for commercialization as research products. 
12 Methods for performing the uses listed above are weli known to those skilled in the art. 

References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Samhrook, J., E. F. Fritsch 
and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning 
16 Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

4.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as nutritional 
20 sources or supplements. Such uses include without limitation use as a protein or amino acid 

supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In 
such cases the polypeptide or polynucleotide of the invention can be added to the feed of a 
particular organism or can be administered as a separate solid or liquid preparation, such as in the 
24 form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the 
polypeptide or polynucleotide of the invention can be added to the medium in or on which the 
microorganism is cultured. 

28 4.103 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 

ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) 
32 activity or may induce production of other cytokines in certain cell populations. A 

polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many 
protein factors discovered to date, including all known cytokines, have exhibited activity in one 
or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient 
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confirmation of cytokine activity. The activity of therapeutic compositions of the present 

. ■ • 

invention is evidenced by any one of a number of routine factor dependent cell proliferation 
assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/1 1, BaF3, 

4 MC9/G, M+(preB M+V2E8, ^ 

HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: 

Assays for T-cell or thymocyte proliferation include without limitation those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 

8 M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 

■ » 

145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli, 
12 et al., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or 

t 

- ■ 1 

thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 

16 eds. Vol 1 pp. 3.12.1-3,12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse 
and human interleukin-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 

20 include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 
and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; 
deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988; 

24 Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse 
and human interleukin 6-Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol 
1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. 
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 1 1-Bennett, F., Giannotti, J., 

28 Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 
6. 1 5. 1 John Wiley and Sons, Toronto. 1 991 ; Measurement of mouse and human Interleukin 
9-Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. 
J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

32 Assays for T-cell clone responses to antigens (which will identify, among others, proteins 

that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and 
cytokine production) include, without limitation, those described in: Current Protocols in 
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E, M. Shevach, W Strober, 
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Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, 
• Immunologic studies in Humans); Weinberger et aL, Proc. Natl. Acad. Sci. USA 77:6091-6095, 
4 1980; Weinberger et : aL, Eur. J. Immun. 1 1 :405-4 11,1981; Takai et aL, J. Immunol. 
137:3494-3500, 1986; Takai et aL, I Immunol. 140:508-512, 1988. 

4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 
8 A polypeptide of the present invention may exhibit stem cell growth factor activity and 

be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem 
cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or 
germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or 

• * 

12 ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential 
state which would be useful for re-engineering damaged or diseased tissues, transplantation, 
manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce 

large quantities of human cells has important working applications for the production of human 

■ . ■ 

16 proteins which currently must be obtained from non-human sources or donors, implantation of 
cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; 
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including 
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs 

20 for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines may 
be administered in combination with the polypeptide of the invention to achieve the desired 
effect, including any of the growth factors listed herein, other stem cell maintenance factors, and 

24 specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Flt- 
3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage 
inflammatory protein 1 -alpha (MOM -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet 
factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic 

28 fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of 
these cells in culture will facilitate the production of large quantities of mature cells. Techniques 
for culturing stem cells are known in the art and administration of polypeptides of the invention, 

32 optionally with other growth factors and/or cytokines, is expected to enhance the survival and 
proliferation of the stem cell populations. This can be accomplished by direct administration of 
the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected 

■ « 

with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder 
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layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers 
may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
• cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 
4 Stem cells themselves can be transfected with a polynucleotide of the invention to induce 

autocrine expression of the polypeptide of the invention. This will allow for generation of 
undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be 
differentiated into the desired mature cell types. These stable cell lines can also serve as a source 
8 of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for 
polymerase chain reaction experiments. These studies would allow for the isolation and 
identification of differentially expressed genes in stem cell populations that regulate stem cell 
proliferation and/or maintenance. 

12 Expansion and maintenance of totipotent stem cell populations will be useful in the 

treatment of many pathological conditions. For example, polypeptides of the present invention 
may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be 
used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or 

16 genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation 
of neural cells and for the regeneration of nerve and brain tissue, for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic 
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, 

20 the expanded stem cell populations can also be genetically altered for gene therapy purposes and 
to decrease host rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell 

24 types. A broadly applicable method of obtaining pure populations of a specific differentiated 
cell type from undifferentiated stem cell populations involves the use of a cell-type specific 
promoter driving a selectable marker. The selectable marker allows only cells of the desired type 
to survive. For example, stem cells can be induced to differentiate into candiomyocytes (Wobus 

28 et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) 
or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., 
Academic Press (1997)). Alternatively, directed differentiation of stem cells can be 
accomplished by culturing the stem cells in the presence of a differentiation factor such as 

32 retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the 
effects of endogenous stem cell factor activity and allow differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the invention 
exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell 
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sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder 
layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in 
the presence of the polypeptide of the invention alone or in combination with other growth 
4 factors or cytokines. The ability of the polypeptide of the invention to induce stem cells 
proliferation is determined by colony formation on semi-solid support e.g. as described by 
Bernstein etal., Blood, 77: 23 16-2321 (1991). ' 

8 4.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal 
biological activity in support of colony forming cells or of factor-dependent cell lines indicates 

12 involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of 
erythroid progenitor cells alone or in combination with other cytokines, thereby indicating ■ 
utility, for example, in treating various anemias or for use in conjunction with 
irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid 

16 cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and 

monocytes/macrophages (f.e., traditional CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or 

20 treatment of various platelet disorders such as thrombocytopenia, and generally for use in place 
of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned 
hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as 

24 those usually treated with transplantation, including, without limitation, aplastic anemia and 
paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment 
post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) 

28 as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines are 
cited above. 

32 Assays for embryonic stem cell differentiation (which will identify, among others, 

proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15: 141-151, 1995; Keller et al., Molecular 
and Cellular Biology 13:473-486, 1993; McClanahan etal., Blood 81:2903-2915, 1993. 
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Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
1 Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. 
4 Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al., 
Proa Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic colony forming cells 
with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic 
Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et 
8 al., Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 

Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, 
Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of 
stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
12 Freshney, et al. eds. Vol pp. 1 63-179, Wiley-Liss, Inc., New York, N.Y. 1 994; Long term culture 
initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al. 
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 

> 

* * 

16 4.10.6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue 
repair and replacement, and in healing of burns, incisions and ulcers. 

20 A polypeptide of the present invention which induces cartilage and/or bone growth in 

circumstances where bone is not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of the invention may have 

24 prophylactic use in closed as well as open fracture reduction and also in the improved fixation of 
artificial joints. De novo bone formation induced by an osteogenic agent contributes to the 
repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and 
also is useful in cosmetic plastic surgery. 

28 A polypeptide of this invention may also be involved in attracting bone-forming cells, 

stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 

32 inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) 
mediated by inflammatory processes may also be possible using the composition of the 
invention. 

* 
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Another category of tissue regeneration activity that may involve the polypeptide of the 
present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or 
other tissue formation in circumstances where such tissue is not normally formed, has 

4 application s the healing of tendon or ligament tears, deformities and other tendon or ligament 
defects in humans and other animals. Such a preparation employing a tendon/ligament-like 
tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament 
tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and 

8 in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation 
induced by a composition of the present invention contributes to the repair of congenital, trauma - 
induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic 
surgery for attachment or repair of tendons or ligaments. The compositions of the present 
12 invention may provide environment to attract tendon- or ligament-forming cells, stimulate 

grcwth of tendon- or ligament-fonning eells, indue, diff.rond.don of progenitors of tendon- or 
ligament-forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for 
return in vivo to effect tissue repair. The compositions of the invention may also be useful in the 
16 treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The 

i 

compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is 
well known in the art. 

The compositions of the present invention may also be useful for proliferation of neural 

20 cells and for regeneration of nerve and brain tissue, Le. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a 
composition may be used in the treatment of diseases of the peripheral nervous system, such as 

24 peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous 
system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in 
accordance with the present invention include meqhanical and traumatic disorders, such as spinal 

28 cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies 
resulting from chemotherapy or other medical therapies may also be treatable using a 
composition of the invention. 

Compositions of the invention may also be useful to promote better or faster closure of 

32 non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular 
insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, 

* 

45 



t 

WO 01/64835 PCTAJS01/04927 

kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular 
endothelivfm) tissue, or for promoting the growth of cells comprising such tissues. Part of the 
' desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue 
4 to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. ~~ 
A composition of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 
8 A composition of the present invention may also be useful for promoting or inhibiting 

differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 
1 2 Assays for tissue generation activity include, without limitation, those described in: 

International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. . 
WO91/07491 (skin, endothelium). 
16 Assays for wound healing activity include, without limitation, those described in: Winter, 

Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 
71:382-84(1978). 

20 

4.10,7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or immune 
suppressing activity, including without limitation the activities for which assays are described 

24 herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A 
protein may be useful in the treatment of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells 

28 and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g., 
HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More 
specifically, infectious diseases causes by viral, bacterial, fungal or other infection may be 
treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, 

32 herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such 
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful 
. where a boost to the immune system generally may be desirable, in the treatment of cancer. 
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Autoimmune disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, 

- 

rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, 
4 autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host 
disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof, 
including antibodies) of the present invention may also to be useful in the treatment of allergic 
reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect 
8 venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, 
angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, 
Stevens-Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
. 12 (particularly allergic asthma) or other respiratory problems. Other conditions, in which immune 
suppression is desired (including, for example, organ transplantation), may also be treatable 
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the 
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals 

16 models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 
1998), skin prick test (Hoffmann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization 
test (Vohr et al., Arch. ToxocoL 73: 501-9), and murine local lymph node assay (Kimber et aL, 
J. Toxicol. Environ. Health 53: 563-79). 

20 Using the proteins of the invention it may also be possible to modulate immune 

♦ 

responses, in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve preventing the induction of an immune 
response. The functions of activated T cells may be inhibited by suppressing T cell responses or 

24 by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is 

generally an active, non-antigen-specific, process which requires continuous exposure of the T - 
cells to the suppressive agent Tolerance, which involves inducing non-responsiveness or anergy 
in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and 

28 persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 

demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence 
of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 

32 limitation B lymphocyte antigen functions (such as, for example, B7)), e.g. , preventing high 

» 

level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and 
organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell 
function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue 
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transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, 
followed by an immune reaction that destroys the transplant The administration of a therapeutic 
' composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, 

4 and thus acts as an immunosuppressant Moreover, a lack of (^stimulation may also be sufficient 
to anergize the T cells, thereby inducing tolerance in a subject Induction of long-term tolerance 
by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration 
of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it 

8 may also be necessary to block the function of a combination of B lymphocyte antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in 

12 rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine 

■ 

the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et 
al., Science 257:789-792 (1992) and Turka et at, Proc. Natl. Acad. Sci USA, 89:1 1 102-1 1 105 
(1992). In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven 

1 6 Press, New York, 1 989, pp. 846-847) can be used to determine the effect of therapeutic 
compositions of the invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are 

20 reactive against self tissue and which promote the production of cytokines and autoantibodies 
involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may 
reduce or eliminate disease symptoms. Administration of reagents which block stimulation of T 
cells can be used to inhibit T cell activation and prevent production of autoantibodies or T 

24 cell-derived cytokines which may be involved in the disease process. Additionally, blocking 
reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to 
long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal models of 

28 human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, 
systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune 
collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental 
myasthenia gravis (see Paxil ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 

32 840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a means 
of up regulating immune responses, may also be useful in therapy. Upregulation of immune 
responses may be in the form of enhancing an existing immune resjponse or eliciting an initial 
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immune response. For example, enhancing an immune response may be useful in cases of viral 

infection, including systemic viral diseases such as influenza, the common cold, and 

' encephalitis. 

4 — Alternatively, anti-viral immune responses may be enhanced in an infected patient by 
removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory form of 
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the 

8 patient. Another method of enhancing anti-viral immune responses wouldbe to isolate infected 
cells from a patient, transfect them with a nucleic acid encoding a protein of the present 
invention as described herein such that the cells express all or a portion of the protein on their 
surface, and reintroduce the transfected cells into the patient. The infected cells would now be 

12 capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo. 

■ 

A polypeptide of the present invention may provide the necessary stimulation signal to T 
cells to induce a T cell mediated immune response against the transfected tumor cells. In 
addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 

16 reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with 
nucleic acid encoding all or a portion of (e.g., a cytoplasmiodomain truncated portion) of an 
MHC class I alpha chain protein and p2 microglobulin protein or an MHC class II alpha chain 
protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II 

20 proteins on the cell surface. Expression of the appropriate class I or class II MHC in conjunction 
with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T 
cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding 
an antisense construct which blocks expression of an MHC class II associated protein, such as 

24 the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity 
of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce 
tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human 
subject may be sufficient to overcome tumor-specific tolerance in the subject 

28 The activity of a protein of the invention may, among other means, be measured by the 

following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 
32 H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 

Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
78:2488-2492, 1981; Herrmann etal., J. Immunol. 128:1968-1974, 1982; Handaetal., J. 

- 

■ 
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Immunol. 135:1564-1572, 1985; Takai etal., I. Immunol. 137:3494-3500, 1986; Takai etal., J. 
Immunol., 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et al., 
1 Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994. 
4 Assays for T-eell-dependent immunoglobulin responses and isotype switching (which 

will identify, among others, proteins that modulate T-cell dependent antibody responses and that 
affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. 
Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, 
8 Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. ca. Coligan eds. Vol 1 
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins 
that generate predominantly Thl and CTL responses) include, without limitation, those described 
12 in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
M. Shevach, W, Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 
16 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins expressed 
by dendritic cells that activate naive T-cells) include, without limitation, those described in: 
Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine 
20 173:549-559, 1991; Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et 
al., Journal of Experimental Medicine 182:255-260, 1995; Nair et al., Journal of Virology 
67:4062-4069, 1993; Huang et al., Science 264:961-965, 1994; Macatonia etal., Journal of 
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 
24 94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172:63 1-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins 
that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 
28 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorcsyca et al., Cancer Research 
53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology 

145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International 

■■ 

Journal of Oncology 1:639-648, 1992. 
32 Assays for proteins that influence early steps of T-cell commitment and development 

include, without limitation, those described in: Antica et al., Blood 84:1 11-117, 1994; Fine et al., 
Cellular Immunology 155:1 1 1-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et al., 
Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 
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4.10.8 ACTTSW/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
4 activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the 
release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention, 
8 alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive 
based on the ability of inhibins to decrease fertility in female mammals and decrease 
spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can 
induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a 
12 homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as 
a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH 
release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A 
polypeptide of the invention may also be useful for advancement of the onset of fertility in 
16 sexually immature mammals, so as to increase the lifetime reproductive performance of domestic 

i 

animals such as, but not limited to, cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods. 

20 Assays for activin/inhibin activity include, without limitation, those described in: Vale et 

al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et aL, Nature 
321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. 
USA 83:3091-3095, 1986. 

24 

4.10.9 CHEMOTACTIC/CHEMOKINETTC ACTIVITY 

r " 

A polypeptide of the present invention may be involved in chemotactic or chemokinetic 
activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, 

28 T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the 
invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic 
receptor activation can be used to mobilize or attract a desired cell population to a desired site of 
action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, or 

32 modulators of the invention) provide particular advantages in treatment of wounds and other 
trauma to tissues, as well as in treatment of localized infections. For example, attraction of 
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved 
immune responses against the tumor or infecting agent. 
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A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, .directly or indirectly, the directed orientation or movement of such cell population. 
' Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. 
4 Whether a particular protein has chemotactic activity for a population of cells can be readily 
determined by employing such protein or peptide in any known assay for cell chemotaxis. 
Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or prevent 
8 chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. 

i 

12 M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates 
and Wiley-Interscience (Chapter 6. 1 2, Measurement of alpha and beta Chemokines 
6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 
1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 

16 1994; Johnston et al. J. of Immunol 153:1762-1768, 1994. 

► 

- 

4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 

* • 

20 thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 

attributes. Compositions may be useful in treatment of various coagulation disorders (including 
hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 
in treating wounds resulting from trauma, surgery or other causes. A composition of the 

24 invention may also be useful for dissolving or inhibiting formation of thromboses and for 

treatment and prevention of conditions resulting therefrom (such as, for example, infarction of 
cardiac and centred nervous system vessels (e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 

28 Assay for hemostatic and thrombolytic activity include, without limitation, those 

described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 
45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
35:467-474, 1988. 

32 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation or 
metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 

■ • 
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invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. . For 
example, the presence or increased expression of a polynucleotide/polypeptide of the invention 
may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
4 Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancel 
condition. Identification of single nucleotide polymorphisms associated, with cancer or a 
predisposition to cancer may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
8 inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) 
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic 
compositions of the invention may be effective in adult and pediatric oncology including in solid 
phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic 

12 cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 
acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, 
larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-sinall cell 
cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal 

16 cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 

i 

associated with colorectal neoplasia, pancreatic cancers,- liver cancer, urologic cancers including 
bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian 
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian folhcle, 

20 kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, 
tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 

24 hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including 
inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 

28 effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 

chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 
effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

32 The composition can also be administered in therapeutically effective amounts as a 

portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or . 
modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically 
acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. 
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Anti-cancer drugs that are well known in the art and can be used as a treatment in combination 
with the pplypeptide or modulator of the invention include: Actinomycin D, Aminoglutethiinide, 
Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis- 

4 DDP), Cyclophosphamide, Cytarabirie HC1 (Cytosine arabinoside), Daicarbazine, Dactinomycin, 
Daunorubicin HC1, Doxorubicin HG1, Estramustine phosphate sodium, Etoposide (V 1 6-2 1 3), 
Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, 
Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 

8 Lomustine, Mechlorethamine HC1 (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, 

Methotrexate (MTX), Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, 
Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 

12 Semustine, Teniposide, and Vindesine sulfate. 

■ 

In additipn, therapeutic compositions of the invention may be used for prophylactic 
treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. 

» * 

exposure to carcinogens) known in the art that predispose an individual to developing cancers. 
16 Under thesis circumstances, it may be beneficial to treat these individuals with therapeutically 
effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

» ' ■ 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays of 

20 cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1 987) Culture of 
Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 18 and Ch 21), 
tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst, 52: 921-30 
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in 

24 Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction 
of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial 
cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1 1 89-97 (1 999) and Li et al., 
Clin- Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, 

28 e.g. from American Type Tissue Culture Collection catalogs. 

4.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
32 receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors 
and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and 
their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions 
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and their ligands (including without limitation, cellular adhesion molecules (such as selectins, 
integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses. Receptors and ligands 
4 are also useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without limitation, 
fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand 
interactions. 

8 The activity of a polypeptide of the invention may, among other means, be measured by 

the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. 
12 Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, 
Measurement of Cellular Adhesion under static conditions 7.28. 1 - 7.28.22), Takai et al., Proc. 
Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer etal., J. Exp. Med. 168:1145-1156, 1988; 
Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 
1 6 1 75:59-68, 1 994; Stitt et al., Cell 80:661 -670, 1 995, 

i 

By way of example, the polypeptides of the invention may be used as a receptor for a 
ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified 
through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel 

20 overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a . 
partial antagonist require the use of other proteins as competing ligands. The polypeptides of the 
present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 

24 colorimetric molecules or a toxin molecules by conventional methods. ("Guide to Protein 
Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic 
Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and 
carbon-14 . Examples of colorimetric molecules include, but are hot limited to, fluorescent 

28 molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of 
toxins include, but are not limited, to ricin. 

4.10.13 DRUG SCREENING 

32 This invention is particularly useful for screening chemical compounds by using the 

• » 

novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. 
The polypeptides or fragments employed in such a test may either be free in solution, affixed to a 
solid support, borne on a cell surface or located intracellularly. One method of drug screening 
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i 

utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic ac;ds expressing the polypeptide or a fragment thereof. Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can 
4 be used for standard binding assays. One may measure, for example, the formation of 

complexes between polypeptides of the invention or fragments and the agent being tested or 
examine the diminution in complex formation between the novel polypeptides and an 
appropriate cell line, which are well known in the art 
8 Sources for test compounds that may be screened for ability to bind to or modulate (i.e., 

increase or decrease) the activity of polypeptides of the invention include (1) inorganic and 
organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

12 Chemical libraries may be readily synthesized or purchased from a number 

commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and 

16 fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 

screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine 
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include 
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a 

20 review, see Science 252:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or 
organic compounds and can be readily prepared by traditional automated synthesis methods, 
PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and 

24 oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, 
peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. 

■ 

For a review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin. 

Biotechnol 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see 
28 Al-Obeidi et al., MoL Biotechnol 9(3):205-23 (1 998); Hruby et al., Curr Opin Chem Biol 

1(1):1 14-19 (1997); Dorner et al., BioorgMed Chem, 4(5):709-15 (1996) (alkylated dipeptides). 
Identification of modulators through use of the various libraries described herein permits 

modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a 
32 polypeptide of the invention. The molecules identified in the binding assay are then tested for 

antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the 

art. In brief, the. molecules are titrated into a plurality of cell cultures or animals and then tested 

for either cell/animal death or prolonged survival of the animal/cells. 
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The binding molecules thus identified may be complexed with toxins, e.g., ricin or ... 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the specificity of the binding 
4 molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

8 The invention also provides methods to detect specific binding of a polypeptide e.g. a 

ligand or a receptor. The art provides numerous assays particularly useful for identifying 

■ 

previously unknown binding partners for receptor polypeptides of the invention. For example, 
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used 

12 to identify polynucleotides encoding binding partners. As another example, affinity 

chromatography with the appropriate immobilized polypeptide of the invention can be used to 
isolate polypeptides that recognize and bind polypeptides of the invention. There are a number 
of different libraries used for the identification of compounds, and in particular small molecules, 

1 6 that modulate (i e., increase or decrease) biological activity of a polypeptide of the invention. 

r 

Ligands for receptor polypeptides of the invention can also be identified by adding exogenous 
ligands, or cocktails of ligands to two cells populations that are genetically identical except for 
the expression of the receptor of the invention: one cell population expresses the receptor of the 

20 invention whereas the other does not The response of the two cell populations to the addition of 
ligands(s) are then compared. Alternatively, an expression library can be co-expressed with the 
polypeptide of the invention in cells and assayed for an autocrine response to identify potential . 
Iigand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known 

24 in the art can be used to identify binding partner polypeptides, including, (1) organic and 
inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of random peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade of the 

28 polypeptide of the invention can be determined. For example, a chimeric protein in which the 
cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a 
protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated 
with the ligand specific for the extracellular portion of the chimeric protein, thereby activating 

32 the chimeric receptor. Known downstream proteins involved in intracellular signaling can then 
be assayed for expected modifications i.e. phosphorylation. Other methods known to those in the 
art can also be used to identify signaling molecules involved in receptor activity. 

■ 
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4.10.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. The 
1 anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the 

4 inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, 
cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an inflammatory response. Compositions 

8 with such activities can be used to treat inflammatory conditions including chronic or acute 
conditions), including without limitation intimation associated with infection (such as septic 
shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 

i 

12 chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from 
over production, of cytokines such as TNF or IL-1 . Compositions of the invention may also be 
useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 
Compositions of this invention may be utilized to prevent or treat conditions such as, but not 

16 limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 
arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1 , 
graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary 
disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 

20 acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 

4.10.16 LEUKEMIAS 

24 Leukemias and related disorders may be treated or prevented by administration of a 

therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders include but are not limited to acute leukemia, 
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblasts, promyelocytic, 

28 myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic 

* • • 

(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et aL, 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia). 

32 4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
. intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
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■ 

therapeutic utility, include but are not limited to nervous system injuries, and diseases or 

disorders which result in either a disconnection of axons, a diminution or degeneration of 

■ 

neurons, or demyelination. Nervous system lesions which may be treated in a patient (including 
4 human and non-human mammalian patients) according to the invention include but are not 
limited to the following lesions of either the central (including spinal cord, brain) or peripheral 
nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated with 
8 surgery, for example, lesions which sever a portion of the nervous system, or compression 

injuries; . 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 

12 infarction or ischemia; 

i 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by human 
immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 

16 tuberculosis, syphilis; 

■ 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration associated 

with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 

< 
■ 

20 sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of the 
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B 1 2 deficiency, folic acid deficiency, Wernicke disease, 

24 tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus 
callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not limited to 
diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or 

28 sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular, 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
32 injured by a demyelinating disease including but not limited to multiple sclerosis, human 

immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 

■ 
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Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 

■ 

differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 



8 choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 
(iv) decreased symptoms of neuron dysfunction in vivo. 
Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method set 

12 forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may 

■ 

be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. 
(1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may 

* 

be measured by bioassay, en2ymatic assay, antibody binding, Northern blot assay, etc., 
16 depending on the molecule to be measured; and motor neuron dysfunction may be measured by 
assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron 

* ■ 

conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 

20 invention include but are not limited to disorders such as infarction, infection, exposure to toxin, 
trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as 
well as other components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal 

24 muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 
muscular atrophy, progressive bulbar paralysis of childhood (F azio-Londe syndrome), 
poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy 
(Charcot-Marie-Tooth Disease). 



4 



any of the following effects may be useful according to the invention: 




28 



4.10.18 



OTHER ACTIVITIES 



32 



A polypeptide of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, 
including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing 
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye 
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape 



(such as, for example, breast augmentation or diminution, change in bone form or shape); 
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r * 

effecting biorhythms or circadian cycles or rhythms; effecting the fertility of male or female 
subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or 
elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other 
— 4 nutritional factors or components); effecting behavioral characteristics, including, without 
limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 
(including depressive disorders) and violent behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other 
8 than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such 
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen 

12 in a vaccine composition to raise an immune response against such protein or another material or 

- 

entity which is pross-reactive with such protein. 

4.10.19 IDENTIFICATION OF POLYMORPHISMS 

1 6 The demonstration of polymorphisms makes possible the identification of such 

polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis 
and treatment Such polymorphisms may be associated with, e.g., differential predisposition or 
susceptibility to various disease states (such as disorders involving inflammation or immune 

20 response) or a differential response to drug administration, and this genetic information can be 
used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
polymorphism associated with a predisposition to inflammation or autoimmune disease makes 
possible the diagnosis of this condition in humans by identifying the presence of the 

24 polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally 
involving isolation or amplification of the DNA, and identifying the presence of the 

28 polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment 
of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected io 
allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are 
hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a 

32 single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately 

adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). 
In addition, traditional restriction fragment length polymorphism analysis (using restriction 
enzymes that provide differential digestion of the genomic DNA depending on the presence or 
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♦ 

* 

absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the . 

present invention can be used to detect polymorphisms. The array can comprise modified 

nucleotide sequences of the present invention in order to detect the nucleotide sequences of the 
4 present invention. In the alternative, any one of the nucleotide sequencesof the present 

invention can be placed on the array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 

also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., 
8 by an antibody specific to the variant sequence. 

M 

4.10.20 ARTHRITIS AND IOTLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 
12 arthritis is determined in an experimental animal model system. The experimental model system 
is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et at, 1 983, 
Science, 219:56, or by B. Waksman et al., 1963, Int. Arch- Allergy Appl. Immunol., 23:129. 
Induction of the disease can be caused by a single injection, generally intradermal!/, of a 
16 suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The 

i 

route of injection can vary, but rats may be injected at the base of the tail with an adjuvant 
mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 
1-5 mg/kg. The control consists of administering PBS only. 
20 The procedure for testing the effects of the test compound would consist of intradermally 

• * 

* 

injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the 
test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 
24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as 
24 described by J. Holoskitz above. An analysis of the data would reveal that the test compound 
would have a dramatic affect on the swelling of the joints as measured by a decrease of the 

- » 

arthritis score. 



28 4.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies or 
other binding partners or modulators including antisense polynucleotides) of the invention have 
numerous applications in a variety of therapeutic methods. Examples of therapeutic applications 
32 include, but are not limited to, those exemplified herein. 

4.11.1 EXAMPLE 
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6ne embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
1 disorder that can be modulated by regulating the peptides of the invention. While the mode of 
4 administration is not particularly important, parenteral administration is preferred. An 
exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, weight, 
8 conciition and response of the individual patient. Typically, the amount of polypeptide 

administered per dose will be in the range of about O.Ol^ig/kg to 100 mg/kg of body weight, with 
the preferred dose being about 0.1}ig/kg to 10 mg/kg of patient body weight. For parenteral 
administration, polypeptides of the invention will be formulated in an injectable form combined 
12 with a pharmaceutical^ acceptable parenteral vehicle. Such vehicles are well known in the art 
and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting 
of small amounts of the human serum albumin. The vehicle may contain minor amounts of 
additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. 
1 6 The preparation of such solutions is within the skill of the art. 

* 

4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

20 A protein or other composition of the present invention (from whatever source derived, 

including without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered 
to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable 

24 carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 
may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 

■ 

"pharmaceutical^ acceptable" means a non-toxic material that does not interfere with the 
28 effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of the 
invention may also contain cytokines, lymphokines, or other hematopoietic factors such as 
M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, 
32 IL-13, IL-14, BL-15, IFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 
factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of the disease or disorder in question. These agents 
include various growth factors such as epidermal growth factor (EGF), platelet-derived growth 
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1 

factor (PDGF), transforming growth factors (TGF-a and TGF-p), insulin-like growth factor. 
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which either enhance 

4 the activity of the protein or other active ingredient or complement its activity or use in 
treatment. Such additional factors and/or agents may be included in the pharmaceutical 
composition to produce a synergistic effect with protein or other active ingredient of the 
invention, or to minimize side effects. Conversely, protein or other active ingredient of the 

8 present invention may be included in formulations of the particular clotting factor, cytokine, 
lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti- 
inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine, other 
hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as 
12 IL-IRa, Hyl, DL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein 

i 

I 

of the present invention may be active in multimers (e.g. , heterodimers or homodimers) or 
complexes with itself or other proteins. As a result, pharmaceutical compositions of the 
invention may comprise a protein of the invention in such multimeric or complexed form. 

16 As an alternative to being included in a pharmaceutical composition of the invention 

including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein (e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 

20 Techniques for formulation and administration of the compounds of the instant application may 

4 

be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition. A therapeutically effective dose further refers to that amount of the compound sufficient 
to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the 

24 relevant medical condition, or an increase in rate of treatment, healing, prevention or 

amelioration of such conditions. When applied to an individual active ingredient, administered 
alone, a therapeutically effective dose refers to that ingredient alone. When applied to a 
combination, a therapeutically effective dose refers to combined amounts of the active 

28 ingredients that result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 

In practicing the method of treatment or use of the present invention, a therapeutically 
effective amount of protein or other active ingredient of the present invention is administered to 
32 a mammal having a condition to be treated. Protein or other active ingredient of the present . 
invention may be administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines or other 
hematopoietic factors. When co- administered with one or more cytokines, lymphokines or other 
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hematopoietic factors, protein or other active ingredient of the present invention may be , . , 
administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic 
factors), thrombolytic or antithrombotic factors, or sequentially. If administered sequentially, 
4 the attending physician will decide on the appropriate sequence of administering protein or other 
active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or antithrombotic factors. 

8 4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, transmucosal, or , 
intestinal administration; parenteral delivery, including intramuscular, subcutaneous, 
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, 
12 intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active 
ingredient of the present invention used in the pharmaceutical composition or to practice the 

■ . * 

method of the present invention can be carried out in a variety of conventional ways, such as oral 
ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral 

16 or intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic manner, for 
example, via injection of the compound directly into a arthritic joints or in fibrotic tissue, often 
in a depot or sustained release formulation. In order to prevent the scarring process frequently 

20 occurring as complication of glaucoma surgery, the compounds may be administered topically, 
, for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery 
system, for example, in a liposome coated with a specific antibody, targeting, for example, 
arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the 

24 afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an effective 
dosage to the desired site of action. The determination of a suitable route of administration and 
an effective dosage for a particular indication is within the level of skill in the art. Preferably for 
28 wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage 
ranges for the polypeptides of the invention can be extrapolated from these dosages or from 
similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the 
clinician to provide maximal therapeutic benefit 

32 

4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus may 
be formulated in a conventional manner using one or more physiologically acceptable carriers 

■ 
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comprising excipients and auxiliaries which facilitate processing of the active compounds into 

« 

preparations which can be used pharmaceutically. These pharmaceutical compositions may be 
' manufactured in a manner that is itself known, e.g., by means of conventional mixing, 

4 dissolving, granulating, dragee-making, levigating, emulsifying; encapsulating, entrapping or 

"t 

lyophilizing processes. Proper formulation is dependent upon the route of administration 
chosen. When a therapeutically effective amount of protein or other active ingredient of the 
present invention is administered orally, protein or other active ingredient of the present 
8 invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered 
in tablet form, the pharmaceutical composition of the invention may additionally contain a solid 
carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 
95% protein or other active ingredient of the present invention, and preferably from about 25 to 

> ■ 

* 

12 90% protein or other active ingredient of the present invention. When administered in liquid 

■ 

form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, 
mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid fonn of the 
pharmaceutical composition may further contain physiological saline solution, dextrose or other 

■ 

16 saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. 
When administered in liquid form, the pharmaceutical composition contains from about 0.5 to 
90% by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

20 When a therapeutically effective amount of protein or other active ingredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or 
other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally 
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other 

24 active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within 
the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or 
subcutaneous injection should contain, in addition to protein or other active ingredient of the 
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, 

28 Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or 

other vehicle as known in the art. The pharmaceutical composition of the present invention may 
also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of 
skill in the art. For injection, the agents of the invention may be formulated in aqueous 

32 solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's 
solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate 
to the barrier to be permeated are used in the formulation. Such penetrants are generally known 
in the art 
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For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutical^ acceptable carriers well known in the art. Such carriers 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 

4 liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 
treated. Pharmaceutical preparations for oral use can be obtained from a, solid excipient, . 
optionally grinding a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in 

8 particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, 
gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents 

12 may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 

i ■ 

i 

thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this 
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 

16 solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be 

added to the tablets or dragee coatings for identification or to characterize different combinations 
of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 

20 gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 

sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 

24 suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

28 For administration by inhalation, the compounds for use according to the present 

invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g. , 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or 

32 other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by 
providing a valve to deliver a metered amount Capsules and cartridges of, e.g. , gelatin for use 
in an inhaler or insufflator may be formulated containing a powder mix of the compound and a 
suitable powder base such as lactose or starch. The compounds may be formulated for parenteral 

67 



WO 01/64835 PCT/US01/04927 

• * • 

• ■ ■ • 

administration by injection, e.g., by bolus injection or continuous infusion. Formulations for 

injection may be presented in unit dosage form, e.g 9 in ampules or in multi-dose containers, with 

» 

• an added preservative. The compositions may take such forms as suspensions, solutions or 
4 emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, 
stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
the active compounds in water-soluble form. Additionally, suspensions of the active compounds 
8 may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or 

vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or 
triglycerides, or liposomes. Aqueous injection suspensions may contain substances which 
increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 

12 dextran. Optionally, the suspension may also contain suitable stabilizers or agents which 

. ■ ■ 

increase the solubility of the compounds to allow for the preparation of highly concentrated 
solutions. Alternatively, the active ingredient may be in powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-free water, before use. 
16 The compounds may also be formulated in rectal compositions such as suppositories or 

retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 

■ » 

♦ 

glycerides. In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 

20 implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt 

24 A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent 

system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and 
an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution 
of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v 

28 polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system 

■ 

(VPD: 5 W) consists of VPD diluted 1 : 1 with a 5% dextrose in water solution. This co-solvent 
system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic 
administration. Naturally, the proportions of a co-solvent system may be varied considerably 

* 

32 without destroying its solubility and toxicity characteristics. Furthermore, the identity of the 
co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may 
be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other 
biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other 
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sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for 
hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well 
known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents 
~ -4 such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. ~ 
Additionally, the compounds may be delivered using a sustained-release, system, such as » 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent 
Various types of sustained-release materials have been established and are well known by those 
8 skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the 

compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 

- ■ 

biological stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 
12 The pharmaceutical compositions also may comprise suitable solid or gel phase carriers 

or excipients. Examples of such carriers or excipients include but are not limited to calcium 

* 

carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. Many of the active ingredients of the invention may be 

16 provided as salts with pharmaceutical^ compatible counter ions. Such pharmaceutical^ 
acceptable base addition salts are those salts which retain the biological effectiveness and 
properties of the free acids and which are obtained by reaction with inorganic or organic bases 
such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, 

20 monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and 
the like. 

The pharmaceutical composition of the invention may be in the form of a complex of the 
protein(s) or other active ingredient(s) of present invention along with protein or peptide 

24 antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins including 

28 those encoded by class I and class II MHC genes on host cells will serve to present the peptide 
antigen(s) to T lymphocytes. The antigen components could also be supplied as purified 
MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells. 
Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as 

32 well as antibodies able to bind the TCR and other molecules on T cells can be combined with the 
pharmaceutical composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a liposome in 
which protein of the present invention is combined, in addition to other pharmaceutical^ 
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acceptable carriers, with,amphipathic agents such as lipids which exist in aggregated form as 

♦ • 

■ 

micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable 
1 lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, 
4 sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such 
liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. 
Patent Nos. 4,235,871 ; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated 
herein by reference. 

8 The amount of protein or other active ingredient of the present invention in the 

pharmaceutical composition of the present invention will depend upon the nature and severity of 
the condition being treated, and on the nature of prior treatments which the patient has 
undergone. Ultimately, the attending physician will decide the amount of protein or other active 
12 ingredient of the present invention with which to treat each individual patient. Initially, the 

' ( * T 

attending physician will administer low doses of protein or other active ingredient of the present 
invention and observe the patient's response. Larger doses of protein or other active ingredient 
of the present invention may be administered until the optimal therapeutic effect is obtained for 

* * ' * 

16 the patient, and at that point the dosage is not increased further. It is contemplated that the 

various pharmaceutical compositions used to practice the method of the present invention should 

• ■ 

contain about 0.01 |ig to about 100 mg (preferably about 0.1 fxg to about 10 mg, more preferably 
about 0.1 jig to about 1 mg) of protein or other active ingredient of the present invention per kg 

20 body weight. For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the composition 
topically, systematically, or locally as an implant or device. When administered, the therapeutic 
composition for use in this invention is, of cotirse, in a pyrogen-free, physiologically acceptable 

24 form. Further, the composition may desirably be encapsulated or injected in a viscous form for 
delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable 
for wound healing and tissue repair. Therapeutically useful agents other than a protein or other 
active ingredient of the invention which may also optionally be included in the composition as 

28 described above, may alternatively or additionally, be administered simultaneously or 

sequentially with the composition in the methods of the invention. Preferably for bone and/or 
cartilage formation, the composition would include a matrix capable of delivering the 
protein-containing or other active ingredient-containing composition to the site of bone and/or 

32 cartilage damage, providing a structure for the developing bone and cartilage and optimally 

capable of being resorbed into the body. Such matrices may be formed of materials presently in 
use for other implanted medical applications. 
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1fhe choice of matrix material is based on biocompatibility, biodegradability, mechanical 
properties^ cosmetic appearance and interface properties. The particular application of the 
1 compositions will define the appropriate formulation. Potential matrices for the compositions 
4 may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, 

hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials 
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further 

> 

matrices are comprised of pure proteins or extracellular matrix components. Other potential 
8 matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above 
mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalcium phosphate. The bioceramics may be altered in composition, such as in 

p i 

12 calcium-alumiriate-phosphate and processing to alter pore size, particle size, particle shape, and 

- 

biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and 
glycolic acid in the form of porous particles having diameters ranging from 1 50 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl 
16 cellulose or autologous blood clot, to prevent the protein compositions from disassociating from 
the matrix. 

+ - 

A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses 
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, 

20 hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and 

carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose 
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, 
poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol). 

24 The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on 
total formulation weight, which represents the amount necessary to prevent desorption of the 
protein from the polymer matrix and to provide appropriate handling of the composition, yet not 
so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the 

28 protein the opportunity to assist the osteogenic activity of the progenitor cells. In further 

■ . 

compositions, proteins or other active ingredients of the invention may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor (EGF), 
32 platelet derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-p), and 
insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
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patients for such treatment with proteins or other active ingredients of the present invention. The 

«>< 

dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 

regeneration wall be determined by the attending physician considering various factors which 

* 

4 modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of 
damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., 
bone), the patients age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution 
8 and with inclusion of other proteins in the pharmaceutical composition. For example, the 

addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final „ 
composition, may also effect the dosage. Progress can be monitored by periodic assessment of 
tissue/bone growth and/or repair, for example, X-rays, histomorphometric determinations and 

1 2 tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other known 

16 methods for introduction of nucleic acid into a cell or organism (including, without limitation, in 
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 

20 

4.123 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include . 
compositions wherein the active ingredients are contained in an effective amount to achieve its 

24 intended purpose. More specifically, a therapeutically effective amount means an amount 
effective to prevent development of or to alleviate the existing symptoms of the subject being 
treated. Determination of the effective amount is well within the capability of those skilled in 
the art, especially in light of the detailed disclosure provided herein. For any compound used in 

28 the method of the invention, the therapeutically effective dose can be estimated initially from 
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a 
circulating concentration range that can be used to more accurately determine useful doses in 
humans. For example, a dose can be formulated in animal models to achieve a circulating 

32 concentration range that includes the IC50 as determined in cell culture (i.e., the concentration of 

■ 

the test compound which achieves a half-maximal inhibition of the protein's biological activity). 
Such information can be used to more accurately detennine useful doses in humans. 
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A therapeutically, effective dose refers to that amount of the compound that results in 
amelioration of symptoms or a prolongation of survival in a patient Toxicity and therapeutic 
' efficacy of such compounds can be determined by standard pharmaceutical procedures in cell 
4 cultures or experimental animals, e.g., for determining the LD 5 o (the dose lethal to 50%of the 

'i 

population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose 
ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the 
ratio between LD50 and EDso- Compounds which exhibit high therapeutic indices are preferred. 
8 The data obtained from these cell culture assays and animal studies can be used in formulating a 
range of dosage for use in human. The dosage of such compounds lies preferably within a range 
of circulating concentrations that include the ED50 with little or no toxicity. The dosage may 

vary within this range depending upon the dosage form employed and the route of administration 

■ 

12 utilized. The exact formulation, route of administration and dosage can be chosen by the . 
individual physician in view of the patient's condition. See, e.g., Fingl et aL, 1975, in "The 

* • ■ * 

Pharmacological Basis of Therapeutics", Ch. 1 p.l . Dosage amount and interval may be adjusted 
individually to provide plasma levels of the active moiety which are sufficient to maintain the 

16 desired effects, or minimal effective concentration (MEC). The MEC will vary for each 

compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will 
depend on individual characteristics and route of administration. However, HPLC assays or 
bioassays can be used to determine plasma concentrations. 

20 Dosage intervals can also be determined using MEC value. Compounds should be 

administered using a regimen which maintains plasma levels above the MEC for 10-90% of the 
time, preferably between 30-90% and most preferably between 50-90%. In cases of local 

■ 

administration or selective uptake, the effective local concentration of the drug may not be 
24 related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the invention 

will be in the range of about 0.01 fig/kg to 100 mg/kg of body weight daily, with the preferred 

dose being about 0. 1 \xg/kg to 25 mg/kg of patient body weight daily, varying in adults and 
28 children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter 

intervals. 

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
32 administration and the judgment of the prescribing physician. 

4.12.4 PACKAGING 
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The compositions may, if desired, be presented in a pack or dispenser device which may 
contain one or more unit dosage forms containing the active ingredient. The pack may, for 
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
4 be accompanied by instructions for administration. Compositions comprising a compound of the 

invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 

* 

appropriate container, and labeled for treatment of an indicated condition. 

8 4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the '„ 

■ * 

invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain 

12 an antigen binding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F a t>, Fab' and F^'p 
fragments, and an expression library. In general, an antibody molecule obtained from 
humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another 

16 by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, 
such as IgG], IgG2, and others. Furthermore, in humans, the light chain may be a kappa chain or 
a lambda chain. Reference herein to antibodies includes a reference to all such classes, 
subclasses and types of human antibody species. 

20 An isolated related protein of the invention may be intended to serve as an antigen, or a 

portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 

24 invention provides antigenic peptide fragments of the antigen for use as immunogens. An 

antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence 
of the full length protein, (for example the amino acid sequence shown in SEQ ID NO: 1351), 
and encompasses an epitope thereof such that an antibody raised against the peptide forms a 

28 specific immune complex with the full length protein or with any fragment that contains the 
epitope. Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 
1 5 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 

32 located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the . 
antigenic peptide is a region of -related protein that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobicity analysis of the human related protein sequence will 
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indicate which regions of a related protein are particularly hydrophilic and, therefore, are likely 
to encode surface residues useful for targeting antibody production. As a means for targeting 
antibody production, hydropathy plots showing regions of hydrophilicity and hydrophobicity 

4 may be generated by any method well known in the art, including, for example, the Kyte 

Doolittle or the Hopp Woods methods, either with or without Fourier transformation. See, e.g. , 
Hopp and Woods, 1981, Proc. Nat. Acad Set, USA 78: 3824-3828; Kyte and Doolittle 1982, J. 
Moi Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 

8 Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, 
fragments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
12 immunospecifically bind these protein components. 

i 

Various procedures known within the art may be used for the production of polyclonal or 
monoclonal antibodies directed against a protein of the invention, or against derivatives, 
fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 
16 Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 

5.13.1 Polyclonal Antibodies 

20 For the production of polyclonal antibodies, various suitable host animals (e.g. , rabbit, 

goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 

24 protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 

recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated to 
a second protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 

28 bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further include an 
adjuvant. Various adjuvants used to increase the.immunological response include, but are not 
limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface 
active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 

32 dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and 

Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of 
adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, 
synthetic trehalose dicorynomycolate). 
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The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal (e.g., from the blood) and further purified by well known techniques, 
' such as affinity chromatography using protein A or protein G, which provide primarily the IgG 
~4 fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the 

target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to 
purify the immune specific antibody by immunoaffinity chromatography. Purification of 
immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The 
8 Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 



5.13.2 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 

12 herein, refers to a population of antibody molecules that contain only one molecular species of 

■ 

antibody molecule consisting of a unique light chain gene product and a unique heavy chain 
gene product. In particular, the complementarity determining regions (CDRs) of the monoclonal 
antibody are identical in all the molecules of the population. MAbs thus contain an antigen 

16 binding site capable of immunoreacting with a particular epitope of the antigen characterized by 
a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature. 256:495 (1 975). In a hybridoma method, a mouse, 

20 hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or a 

24 fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal 

28 Antibodies: Principles and Practice. Academic Press, (1986) pp. 59-103). Immortalized cell 

lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridomia cells can 
be cultured in a suitable culture medium that preferably contains one or more substances that 

32 inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and 
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 
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Preferred immortalized cell lines are those that fuse efficiently, support stable high level 

i*i . | 

expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 

* * ■ 

4 can be obtained, for instance, from the Salk Institute Cell Distribution Centeri'San Diego, 

California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and ; 
mouse-human heteromyeloma cell lines also have been described for the production of human ' 
monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); Brodeur et aL, Monoclonal 
8 Antibody Production Techniques and Applications. Marcel Dekker, Inc., New York, (1 987) pp. 
51-63). ,, 

The culture medium in which the hybridoma cells are cultured can then be assayed for . 
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 

12 specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the 
art. The binding affinity of the monoclonal antibody can, for example, be determined by the 

16 Scatchard analysis of Munson and Pollard, Anal. Biochem.. 107:220 (1980). Preferably, 

antibodies having a high degree of specificity and a high binding affinity for the target antigen 
are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 
20 dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium . and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 
24 culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

■ 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 
28 those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 

* ■ 

invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 
32 source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for 
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example, by substituting the coding sequence for human heavy and light chain constant domains 
in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 
1 812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 

4 coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 

polypeptide can be substituted for the constant domains of an antibody of the invention, or can 
be substituted for the variable domains of one antigen-combining site of an antibody of the 
invention to create a chimeric bivalent antibody. 

8 .. 

i 

5.13.2 Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further comprise 
humanized antibodies or human antibodies. These antibodies are suitable for administration to 

i 

12 humans without engendering an immune response by the human against the administered 

* ■ 

immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab r )2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 
16 immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones et al., 

■ • * 

Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., 
Science. 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
20 corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
instances, Fv framework residues of the human immunoglobulin are replaced by corresponding 
non-human residues. Humanized antibodies can also comprise residues which are found neither 
in the recipient antibody nor in the imported CDR or framework sequences. In general, the 

» 

24 humanized antibody will comprise substantially all of at least one, and typically two, variable 

♦ ■ 

domains, in which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
28 least a portion of an immunoglobulin constant region (Fc), typically that of a human 

immunoglobulin (Jones et al, 1986; Riechmann et al., 1 988; and Presta, Curr. Op. Struct. Biol., 
2:593-596(1992)). 

32 5.133 Human Antibodies 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
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Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell ... 
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: MONOCLONAL 
4 Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 

antibodies may be utilized in the practice of the present invention and may be produced by using 
human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by 
transforming human B-cells with Epstein Bair Virus in vitro (see Cole, et al., 1 985 In: 

8 Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol.. 227:381 (1991); 
Marks et al., J. Mol. Biol.. 222:581 (1991)). Similarly, human antibodies can be made by 

12 introducing human immunoglobulin loci into transgenic animals, e.g. , mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans 
in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach 

16 is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 

5,633,425; 5,661,016, and in Marks et al. (Bio/Technology. 1 0. 779-783 (1992)); Lonberg et al. 
(Nature 368 856-859 (1994)); Morrison ( Nature 368 , 812-13 (1994)); Fishwildet al,(Nature 
Biotechnology 14, 845-5 1 (1996)); Neuberger (Nature Biotechnology 14, 826 (1996)); and 

20 Lonberg and Huszar (Intern. Rev . Immunol 13 65-93 ( 1 995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's endogenous 
antibodies in response to challenge by an antigen. (See PCT publication WO94/02602). The 

24 endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host 
have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins 
are inserted into the host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing the requisite human DNA segments. An animal which 

28 provides all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 

*i*k>i 

preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse 
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B 
32 cells which secrete fully human immunoglobulins. The antibodies can be obtained directly from 
the animal after immunization with an immunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
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immunoglobulins with human variable regions can be recovered and expressed to obtain the 
antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 
1 example, single chain Fv molecules. 

4 -An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 

't 

expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 
5,939,598. It can be obtained by a method including deleting the J segment genes from at least 

r 

■ 

one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 

■ 

8 locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy'chain locus, 
the deletion being effected by a targeting vector containing a gene encoding a selectable marker; 
and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells 
contain the gene encoding the selectable marker. 

r 

12 A method for producing an antibody of interest, such as a human antibody, is disclosed in 

U.S. Patent No., 5,916/771 . It includes introducing an expression vector that contains a 
nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing 
an expression vector containing a nucleotide sequence encoding a light chain into another 

16 mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an 
antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically relevant 
epitope on an immunogen, and a correlative method for selecting an antibody that binds 

20 immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication 
WO 99/53049. 

5.13.4 Fab Fragments and Single Chain Antibodies 

24 According to the invention, techniques can be adapted for the production of single-chain 

antibodies specific to an antigenic protein of the invention (see e.g. , U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of Fab expression libraries (see e.g., 
Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of 

28 monoclonal Fab fragments with the desired specificity for a protein or derivatives, fragments, 

analogs or homologs thereof. Antibody fragments that contain the idiotypes to a protein antigen 
may be produced by techniques known in the art including, but not limited to: (i) an F^-ja 
fragment produced by pepsin digestion of an antibody molecule; (ii) an F a b fragment generated 

32 by reducing the disulfide bridges of an F^b-p fragment; (iii) an F a b fragment generated by the 
treatment of the antibody molecule with papain and a reducing agent and (iv) F v fragments. 

5.13.5 Bispecific Antibodies 
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Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is any 
4 other antigen, and advantageously is a cell-surface protein or receptor or receptor subunitf" 
Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
8 specificities (Milstein and Cuello, Nature, 305 :537-539 (1 983)). Because of the random 

assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a;, 
potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by affinity 

12 chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 
1993, and in Traunecker etaL 9 1991 EMBO J., 10:3655-3659. - - ■ ' ■ 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 

16 preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 

20 light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et 
al., Methods in Enzvmology, 121 :210 (19861. 

According to another approach described in WO 96/2701 1 , the interface between a pair 

24 of antibody molecules can be engineered to maximize the percentage of heterodimers which are 
recovered from recombinant cell culture. The preferred interface comprises at least a part of the 
CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains from the interface of the first antibody molecule are replaced with larger side chains (e.g> 

28 tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing large amino 
acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 
increasing the yield of the heterodimer over other unwanted end-products such as homodimers. 

32 Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. 

F(ab*)2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 
fragments have been described in the literature. For example, bispecific antibodies can be 
prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure 

, » * 
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wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. These 
fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to . 
• stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
4 generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconverted to the Fab' -thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 

* 

8 immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled toform bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe 
the production of a folly humanized bispecific antibody F(ab')2 molecule. Each Fab* fragment 

■ * 

12 was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form 

■ 

the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity 
of human cytotoxic lymphocytes against human breast tumor targets. 

- 

16 Various techniques for making and isolating bispecific antibody fragments directly from 

recombinant cell culture have also been described. For example, bispecific antibodies have been 
produced using leucine zippers. Kostelny et al., J. Immunol. 148(5): 1547- 1553 (1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 

20 different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can 
also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger ,et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 

24 alternative mechanism for making bispecific antibody fragments. The fragments comprise a 
heavy-chain variable domain (V^) connected to a light-chain variable domain (Vl) by a linker 
which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the Vh and V L domains of one fragment are forced to pair with the complementary Vl and Vh 

28 domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 

■ ■ 

making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 

reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

... t . 

Antibodies with more than two valencies are contemplated. For example, trispecific 
32 antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an 
immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on 
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a leukocyte such as a T-cell receptor molecule {e.g. CD2, CD3, CD28, or B7), or Fc receptors for 
IgG(Fc R),suchasFc RI (CD64),Fc RII (CD32) and Fc RIII (CD16) so as to focus 
cellular defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies 
can also be used to direct cytotoxic agents to cells which express a particular antigen. These 
antibodies possess an antigen-binding aim and an arm which binds a cytotoxic agent or a 
radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody 
of interest binds the protein antigen described herein and further binds tissue factor (TF). 

■ . 

■ 

5.13.6 Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 
have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03Cf89). 
It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic 
protein chemistry, including those involving crosslinking agents. For example, immunotoxins 
can be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed* for example, in U.S. Patent No. 4,676,980. 

5.13.7 Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector function, so as 
to enhance, e.g , the effectiveness of the antibody in treating cancer. For example, cysteine 
residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide bond 
formation in this region. The homodimeric antibody thus generated can have improved 
internalization capability and/or increased complement-mediated cell killing and antibody- 
dependent cellular cytotoxicity (ADCC). See Caron et al, J. Exp Med., 176: 1191-1 195 (1992) 
and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti- 
tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff 
et al. Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that 
has dual Fc regions and can thereby have enhanced complement lysis and ADCC capabilities. 
See Stevenson et al., Anti-Cancer Drug Design, 3 : 219-230 (1 989). 

5.13.8 Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a 
cytotoxic agent such as a chemotherapeutic agent, toxin (e.g. , an enzymatically active toxin of 
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bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (z.e., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugate.s have been 
4— described above. Enzymatically active toxins and fragments thereof that can be used include 
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exptoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 
8 PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples include 
212 Bi, 13, I, ,3, In, 90 Y, and 186 Re. 

12 Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional 

protein-coupling agents such as N-sucxinimidyl-3-(2-pyridyldithiol) propionate (SPDP), 
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), 
active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido 

16 compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as 
bis-(p-diazoriiumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), 
and bis-active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a 
ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987). 

20 Carbon- 14-labeled l-isothibcyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX- 
DTP A) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
24 streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a.clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn t 
conjugated to a cytotoxic agent 

28 

4,14 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention can 
be recorded on computer readable media. As used herein, "computer readable media" refers to 
32 any medium which can be read and accessed directly by a computer. Such media include, but 
are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled 

♦ 
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artisan can readily appreciate how anyof the presently known computer readable mediums can 
be used to, create a manufacture comprising computer readable medium having recorded thereon 
' a nucleotide sequence of the present invention. As used herein, "recorded" refers to a process for 

4 storing information on computer readable medium. A skilled artisan can readily adopt any of the 

*» 

presently known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present invention. 
A variety of data storage structures are available to a skilled artisan for creating a 
8 computer readable medium having recorded thereon a nucleotide sequence of the present 

invention. The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs and formats 
can be used to store the nucleotide sequence information of the present invention on computer 

12 readable medium. The sequence information can be represented in a word processing text file, 
formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, 
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring 

16 formats (e.g. text file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present invention. 

By providing any of the nucleotide sequences SEQ ID NO: 1-13901 or a representative 
fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 

20 sequences of SEQ ID NO: 1-13901 in computer readable form, a skilled artisan can routinely 
access the sequence information for a variety of purposes. Computer software is publicly 
available which allows a skilled artisan to access sequence information provided in a computer 
readable medium. The examples which follow demonstrate how software which implements the 

24 BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. 
Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading 
frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments 
and may be useful in producing commercially important proteins such as enzymes used in 

28 fermentation reactions and in the production of commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 

» 

present invention. The minimum hardware means of the computer-based systems of the present 
32 invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of the currently available 
computer-based systems are suitable for use in the present invention. As stated above, the 
computer-based systems of the present invention comprise a data storage means having stored 
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therein a nucleotide sequence of the present invention and the necessary hardware means and 
software means for supporting and implementing a search means. As used herein, "data storage 
means" refers to memory which can store nucleotide sequence information of the present . 
4 invention, or a memory access means which can access 'manufactures having recorded thereon 
the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are implemented 
on the computer-based system to compare a target sequence or target structural motif with the 

- * 

8 sequence information stored within the data storage means. Search means are used to identify 
fragments or regions of a known sequence which match a particular target sequence or target 
motif. A variety of known algorithms are disclosed publicly and a variety of commercially 
available software for conducting search means are and can be used in the computer-based 

12 systems of the present invention. Examples of such software includes, but is not limited to, 
Smith-Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTEDEIA). A' 
skilled artisan can readily recognize that any one of the available algorithms or implementing 
software packages for conducting homology searches can be adapted for use in the present 

16 computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino 
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will be 

present as a random occurrence in the database. The most preferred sequence length of a target 

* 

20 sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide 
residues. However, it is well recognized that searches for commercially important fragments, 
such as sequence fragments involved in gene expression and protein processing, may be of 
shorter length. 

24 As used herein, "a target structural motif," or "target motif," refers to any rationally 

selected sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif. There are 
a variety of target motifs known in the art. Protein target motifs include, but are not limited to, 

28 enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, promoter sequences, hairpin structures and inducible expression elements (protein binding 
sequences). 

32 4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to 
control gene expression through triple helix formation or antisense DNA or RNA, both of which 

methods are based on the binding of a polynucleotide sequence to DNA or RNA, 

- 
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Polynucleotides suitable, for use in these methods are preferably 20 to 40 bases in length and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
' Lee et al, Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 15241:456 (1988); and Dervan 

4 et al., Science 251:1360 (1991))-or to the mRNA itself (antisense - Olmno, J. Neurochem. 

56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA 
transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA 

8 molecule into polypeptide. Both techniques have been demonstrated to be effective in model 
systems. Information contained in the sequences of the present invention is necessary for the 
design of an antisense or triple helix oligonucleotide. 

* 

12 4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of 
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic 
acid probe or antibodies of the present invention, optionally conjugated or otherwise associated 

* 

16 with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the polynucleotide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 

20 detected, a polynucleotide of the invention is detected in the sample. Such methods can also 
comprise contacting a sample under stringent hybridization conditions with nucleic acid primers 
that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed 
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is 

24 detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise contacting 
a sample with a compound that binds to and forms a complex with the polypeptide for a period 
sufficient to form the complex, and detecting the complex, so that if a complex is detected, a 

28 polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucleic acid probes or antibodies to components within the test sample. 

32 Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 

Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid prohe or antibody used in the assay. One 
skilled in the art will recognize that any one of the commonly available hybridization, 
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amplification or immunological assay formats can readily be adapted to employ the nucleic acid 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, 
' T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 

4 Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, 
Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice 
and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers, Amsterdam, The Netherlands (1 985). The test samples of the 

8 present invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described method 
will vary based on the assay format, nature of the detection method and the tissues, cells or 
extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane 

? i 

12 extracts of cells are well known in the art and can be readily be adapted in order to obtain a 

sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 

necessary reagents to cany out the assays of the present invention. Specifically, the invention 
1 6 provides a compartment kit to receive, in close confinement, one or more containers which 

comprises: (a) a first container comprising one of the probes or antibodies of the present 

« - 

invention; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of a bound probe or antibody. 

20 ■ In detail, a compartment kit includes any kit in which reagents are contained in separate 

containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allows one to efficiently transfer reagents from one compartment to 
another compartment such that the samples and reagents are not cross-contaminated, and the 

24 agents or solutions of each container can be added in a quantitative fashion from one 

compartment to another. Such containers will include a container which will accept the test 
sample, a container which contains the antibodies used in the assay, containers which contain 
wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which 

28 contain the reagents used to detect the bound antibody or probe. Types of detection reagents 
include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the 
primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of 
reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed 

32 probes and antibodies of the present invention can be readily incorporated into one of the 
established kit formats which are well known in the art. 

4.17 MEDICAL IMAGING 
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The novel polypeptides and binding partners of the invention are useful in medical . . . 
imaging of sites expressing the molecules of the invention (e.g. , where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
4 See, e.g., Kunkel et al., U.S. Pat NO. 5,413,778. Such methods involve chemical attachment of 
a labeling or imaging agent, administration of the labeled polypeptide to a subject in a 
pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target 
site. 

8 

4.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present invention . 
further provides methods of obtaining and identifying agents which bind to a polypeptide 
12 encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NO: 
1-13901 , or bind to a specific domain of the polypeptide encoded by the nucleic acid. In de^, 
said method comprises the steps of: * 

(a) contacting an agent with an isolated protein encoded by an ORF of the present 
16 invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 

polynucleotide of the invention can comprise contacting a compound with a polynucleotide of 
20 the invention for a time sufficient to form a polynucleotide/compound complex, and detecting 

the complex, so that if a polynucleotide/compound complex is detected, a compound that binds 

to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to a 
24 polypeptide of the invention can comprise contacting a compound with a polypeptide of the 

invention for a time sufficient to form a polypeptide/compound complex, and detecting the 

■ 

complex, so that if a polypeptide/compound complex is detected, a compound that binds to a 
polynucleotide of the invention is identified. 

28 Methods for identifying compounds that bind to a polypeptide of the invention can also 

comprise contacting a compound with a polypeptide of the invention in a cell for a time 
sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a 
receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 

32 sequence expression, so that if a polypeptide/compound complex is detected, a compound that 

* * 

binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 
activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 
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activity observed in the absence of the compound). Alternatively, compounds identified via such 
methods cpn include compounds which modulate the expression of a polynucleotide of the 
1 invention (that is, increase or decrease expression relative to expression levels observed in the 
4 absence of the compound). Compounds, suclras compounds identified via the methods of the 
invention, can be tested using standard assays well known to those of skill in the art for their 
ability to modulate activity/expression. 

■ 

The agents screened in the above assay can be, but are not limited to, peptides, 
8 carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 
and screened at random or rationally selected or designed using protein modeling techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and 
the like are selected at random and are assayed for their ability to bind to the protein encoded by 

12 the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 

■ 

As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen 
based on the configuration of the particular protein. For example, one skilled in the art can 
readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 

16 like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: Antisense 
Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

20 In addition to the foregoing, one class of agents of the present invention, as broadly 

described, can be used to control gene expression through binding to one of the ORFs or EMFs 
of the present invention. As described above, such agents can be randomly screened or 
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 

24 sequence specific or element specific agents, modulating the expression of either a single ORF or 
multiple ORFs which rely on the same EMF for expression control. One class of DNA binding 
agents are agents which contain base residues which hybridize or form a triple helix formation 

- 

by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, 
28 ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have 
base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
32 Lee et al., NucL Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et 
al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription 
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from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques have been demonstrated to be effective in model systems. 
Information contained in the sequences of the present invention is necessary for the design of an 

4 antisense or triple helix oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of tlje present invention can 
be used as a diagnostic agent. Agents which bind to a protein encoded by one of the ORFs of the 
present invention can be formulated using known techniques to generate a pharmaceutical 

8 composition. 

4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid 

12 hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The 
hybridization probes of the subject invention may be derived from any of the nucleotide 
sequences SEQ ID NOS: 1-13901 . Because the corresponding gene is only expressed in a 
limited number of tissues, a hybridization probe derived from of any of the nucleotide sequences 

1 6 SEQ ID NO : 1 -1 3 90 1 can be used as an indicator of the presence of RNA of cell type of such a 
tissue in a sample. „ 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides 

20 additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in 
PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The 
probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a 
degenerate pool of possible sequences for identification of closely related genomic sequences. 

24 Other means for producing specific hybridization probes for nucleic acids include the 

cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors 
are known in the art and are commercially available and may be used to synthesize RNA probes 
in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA 

28 polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may 
be used to construct hybridization probes for mapping their respective genomic sequences. The 
nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a 
chromosome using well known genetic and/or chromosomal mapping techniques. These 

32 techniques include in situ hybridization, linkage analysis against known chromosomal markers, 
hybridization screening with libraries or flow-sorted chromosomal preparations specific to 
known chromosomes, and the like. The technique of fluorescent in situ hybridization of 
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chromosome spreads ha$ been described, among other places, in Verma et al (1988) Human 
Chromosqmes: A Manual of Basic Techniques, Pergamon Press, New York NY. 
' Fluorescent in situ hybridization of chromosomal preparations and other physical 

4 — chromosome mapping techniques may be correlated with additional genetic map data. Examples 
of genetic map data can be found in the 1994 Genome Issue of Science (265:19811). Correlation 
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or 
predisposition to a specific disease) may help delimit the region of DNA associated with that 

8 genetic disease. The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected individuals. 

4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e. , small nucleic acid segments, may be readily prepared by, for 
12 example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced 
using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to those of 
skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to 
16 precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be 

♦ 

achieved using passive adsorption (Inouye & Hondo, ( 1 990) J. Clin. Microbiol. 28(6) 1469-72); 

using UV light (Nagataef al t 1985;Dahlenefa/., 1987; Morrissey & Collins, (1989) Mol. Cell 

Probes3(2) 1 89-207) or by covalent binding of base modified DNA (Keller et aL, 1988; 1989); all 
20 references being specifically incorporated herein. 

Another strategy that may be employed is the use of the strong biotm-streptavidin 

interaction as a linker. For example, Broude et aL (1994) Proc. Natl. Acad. ScL USA 91(8) 3072-6, 

describe the use of biotinylated probes, although these are duplex probes, that are immobilized on 
24 streptavi din-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal, 

Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin. 

Biotinylated probes may be purchased from various sources, such as, e.g. , Operon Technologies 

(Alameda, CA). 

28 Nunc Laboratories (Naperviile, IL) is also selling suitable material that could be used. Nunc 

Laboratories have developed a method by which DNA can be covalently bound to the microwell 
surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with secondary amino 
groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be 

32 purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 
5'-end by aphosphoramidatebond, allowing immobilization of more than 1 pmol of DNA 
(Rasmussenef al, (1991) Anal. Biochem. 198(1) 138-42). 



92 



WO 01/64835 PCTYUS01/04927 



The use of CovaLink NH strips for covalent binding of DNA molecules at the 5 r -end has 
been described (Rasmussen et al., (1991). In this technology, a phosphoramidatebond is employed 
(Chu et al., (1 983) Nucleic Acids Res. 1 1(8) 65 13-29). This is beneficial as immobilizationusing 
4 only a single covalent bond is preferred. The phosphoramidate bond joins the DNATb the 
CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently 
grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to 
CovaLink NH via an phosphoramidatebond, the oligonucleotide terminus must have a 5'-end 
8 phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and 
then streptavidin used to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and 
denaturingfor 10 min. at 95°C and cooling on ice for 10 min: Ice-cold 0. 1 M 1-methylimidazole, 
12 pH 7.0 (1 -Melm?), is then added to a final concentration of 10 mM l-Melm?. A ss DNA solution is 
then dispensed into CovaLink NH strips (75 ul/well) standing on ice. 

Carbodiimide 0.2 M 1 -ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved in 
1 0 mM l-Melmv, is made fresh and 25 ul added per well. The strips are incubated for 5 hours at 
16 50°C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; first the wells are 
washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 
3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is that 
20 described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein by 
reference. This method of preparing an oligonucleotide bound to a support involves attaching a 
nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic 
hydroxyl groups carried by the support The oligonucleotide is then synthesized on the supported 
24 nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard 
conditions that do not cleave the oligonucleotide from the support Suitable reagents include 
nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

« 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 
28 arrays may be employed. For example, addressable laser-activated photodeprotectionmay be 

employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by 
Vodoretal. (1991) Science 251(4995) 767-73, incorporatedherein by reference. Probes may also 
be immobilized on nylon supports as described by Van Ness et al ( 1 99 1 ) Nucleic Acids Res. 
32 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 
169(1) 104-8; all references being specifically incorporated herein. 
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To link an oligonucleotide to a nylon support, as describedby Van Ness etcd. (1991), 
requires activation of the nylon surface via alkylation and selective activation of the S'-amine of 
oligonucleotides with cyanuric chloride. 

■ 

4 One particular way to prepare support bound oligonucleotides is to utilize the 

light-generated synthesis describedby Pease et al., (1994) PNAS USA 91(1 1) 5022-6, incorporated 

■ . 

herein by reference). These authors used current photolithographic techniques to generate arrays of 
immobilized oligonucleotide probes (DNA chips). These methods, in which light is used to direct 
8 the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile 
5-protected iV-acyl-deoxynucleosidephosphoramidites, surface linker chemistry and versatile 
. combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be 
generated in this manner. 

12 421 PREPARATION OF NUCLEIC ACID FRAGMENTS 

t 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic 
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA, 
including mRNA without any amplification steps. For example, Sambrooke/ al (1989) describes 
1 6 three protocols for the isolation of high molecular weight DNA from mammalian cells (p. 
9.14-923). 

DNA fragments may be prepared as clones in M 1 3 , plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples 
20 may be prepared or dispensed in multiwell plates. About 1 00-1 000 ng of DNA samples may be 
prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of skill 
in the art including, for example, using restriction enzymes asdescribed at 9.24-9.28 of Sambrook et 
24 al (1989), shearing by ultrasound and NaOH treatment 

Low pressure shearing is also appropriate, as described by Schriefer et al (1 990) Nucleic 
Acids Res. 1 8(24) 7455-6, incorporated herein by reference). In this method, DNA samples are 
passed through a small French pressure cell at a variety of low to intermediate pressures. A lever 
28 device allows controlled application of low to intermediate pressures to the cell. The results of 

these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA 
fragmentationmethods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the two 
32 base recognition endonuclease, Cv/JI, described by Fitzgerald et al (1 992) Nucleic Acids Res. 

20(14)3753-62. These authors described an approach for the rapid fragmentation and fractionation 
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of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and 
sequencing. 

1 . The restriction endonuclease CviJl normally cleaves the recognition sequence PuGCPy 
4 between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of 
this enzyme (Cvz JI* *), yield a quasi-random distribution of DNA fragments form the small 
molecule pUC19 (2688 base pairs). Fitzgerald et al (1992) quantitatively evaluated the 
randomness of this fragmentation strategy, using a Cv/JI** digest of pUCl 9 that was size 
8 fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus 
Ml 3 cloning vector. Sequence analysis of 76 clones showed that Cv/JI* * restricts pyGCPy and 
. PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate 
consistent with random fragmentation. 

T » 

1 

12 As reported in the literature, advantages of this approach compared to sonication arid 

agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 
ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel 
electrophoresis and elution are needed 

1 6 Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, it is 

important to denature the DNA to give single stranded pieces available for hybridization. This is 
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled 

• » 

quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the 
20 chip. Phosphate groups must also be removed from genomic DNA by methods known in the art. 

422 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. 
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an 

24 array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a DNA solution to a 
nylon membrane. By offset printing, a density of dots higher than the density of the wells is 
achieved. One to 25 dots may be accommodated in 1 mm , depending on the type of label used. By 
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) 

28 may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same 
gene) from different individuals, or may be different, overlapped genomic clones. Each of the 
subarrays may represent replica spotting of the same samples. In one example, a selected gene 
segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in 

32 one 96-well plate (all 96 wells containing the same sample). A plate for each of the 64 patients is 
prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane. 
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Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the 
dot span may be 1 mm 2 and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois) 

* 

4 which may be partitioned by physical spacers e.g. a plastic grid molded over the membrane, the grid 
being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic 
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage 
screens or x-ray films. 

8 The present invention is illustrated in the following examples. Upon consideration of the 

present disclosure, one of skill in the art will appreciate that many other embodiments and variations; 
may be made in the scope of the present invention. Accordingly, it is intended that the broader 
aspects of the present invention not be limited to the disclosure of the following examples. The 

1 2 present invention is not to be limited in scope by the exemplified embodiments which are intended 
as illustrations of single aspects of the invention, and compositions and methods which are 
functionally equivalent are within the scope of the invention. Indeed, numerous modifications and 
variations in the practice of the invention are expected to occur to those skilled in the art upon 

1 6 consideration of the present preferred embodiments. Consequently, the only limitations which 
should be placed upon the scope of the invention are those which appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated by 
reference in their entirety. 

20 5.0 EXAMPLES 

5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 
A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various 
24 human tissues and in some cases isolated from a genomic library derived from human chromosome 
using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The 
inserts of the library were amplified with PCR using primers specific for the vector sequences 
which flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and 
2& screened with oligonucleotide probes (e.g. , 7-mers) to obtain signature sequences. The clones were 
clustered into groups of similar or identical sequences. Representative clones were selected for 
sequencing. 

In some cases, the 5' sequence of the amplified inserts was then deduced using a typical 
32 Sanger sequencing protocol. PCRproducts were purified and subjected to fluorescent dye 

tenninator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems 
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(ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Random 
Amplification of cDNA Ends) was performed to further extend the sequence in the 5' direction. 

■ 

« * 

■ + * 

4 5.2 EXAMPLE 2 

Novel Contigs 

The novel contigs of the invention were assembled from sequences that were obtained from 

* 

a cDNA library by methods described in Example 1 above, and in some cases sequences obtained 

8 from one or more public databases. The sequences for the resulting nucleic acid contigs are 

» 

designated as SEQ ID NO: 1-13901 and are provided in the attached Sequence Listing. The contigs 

i 

were assembled using an EST sequence as a seed Then a recursive algorithm was used to extend 
the seed EST into an extended assemblage, by pulling additional sequences from different databases 

12 (z. e. , Hy seq' s database containing EST sequences, dbEST version 1 14, gb pri 114, and UniGene 
version 101) that belong to this assemblage. The algorithm terminated when there was no 
additional sequences from the above databases that would extend the assemblage. Inclusion of 
component sequences into the assemblage was based on a BLASTN hit to the extending 

16 assemblage with BLAST score greater than 300 and percent identity greater than 95%. 

* - 1 

Table 3 sets forth the novel predicted polypeptides (including proteins) encoded by the 
novel polynucleotides (SEQ ID NO: l-13901)of the present invention, and their corresponding 
nucleotide locations to each of SEQ ID NO: : 1-13901. Table 3 also indicates the method by which 

20 the polypeptide was predicted. Method A refers to a polypeptide obtained by using a software 

program called FASTY (available from http ://fasta.bioch. virginia.edu) which selects a polypeptide 
based on a comparison of the translated novel polynucleotide to known polynucleotides (W.R. 
Pearson, Methods in Enzymology, 183:63-98 (1990), herein incorporated by reference). MethodB 

24 refers to a polypeptide obtained by using a software program called GenS can for human/vertebrate 
sequences (available from Stanford University, Office of Technology Licensing) that predicts the 
polypeptide based on a probabilistic model of gene structure/compositionalproperties (C. Burge 
and S. Karlin, J. Mol Biol., 268:78-94 (1997), incorporated herein by reference). Method C refers 

28 to a polypeptide obtained by using a Hy seq proprietary software program that translates the novel 
polynucleotide and its complementary strand into six possible amino acid sequences (forward and 
reverse frames) and chooses the polypeptide with the longest open reading frame. 

The nearest neighbor results for SEQ ID NO: 1-13901 were obtained by a BLASTP 

32 version 2.0al 19MP-WashU search against Genpept release 120 and Geneseq database release 
200101 (Derwent), using BLAST algorithm. The nearest neighbor result showed the closest 
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homologue for SEQ ID NOS: 1-13901 . The nearest neighbor results for SEQ ID NO: 1 -1 390 1 

are shown in Table 2 below. 

Tables 1, 2 and 3 follow. Table 1 shows the various tissue sources of SEQ ID NOS: 1- 
4 13901. Table 2 shows the nearest neighbor result for the assembled contig. The nearest neighbor 

result shows the closest homolog with an identifiable function for each assemblage. Table 3 . 

contains the start and stop nucleotides for the translated amino acid sequence for which each 

assemblage encodes. Table 3 also provides a correlation between the amino acid sequences set forth 
8 in the Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID 

NO. in USSN 09/515,126 ' 



98 



WO 01/64835 PCT/US01/04927 



TABLE 1 



Tissue 1 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 


adult brain 


GIBCO 

* 

■ 


AB3001 

• 


83 544 597-598 600-607 616 841 1004 1148 1346 1493 1974 
2138 2141 2143 2161 22662345 2363 2511 256928762880 
3001 3099-3 1 0 1 3 1 05-3 1 06 3 1 1 0-3 1 11 3 1 1 5-31 1 7 3 1 99 3272 
3282 3284 3356 3425 3537 3634 3689 3709 3797 3810 3839 
3899 4006 4021-4022 4025 4043 4194 4201 4253 4277 4297 
4388 4399 4410 4667 4671 4722 4747-4748 4750 4755 4767 
4845 4865 4940 5037 5075 5093 5118 5163 5171-51725268 
5481 5523 5553 5656 5724 5894 5902 5938 6052 6170-6173 
6176 6214 6307 6336 6369 6374 6793 6894-6897 6979 7058 
7169 7455 7492-7493 7495-7499 7501 7504 7577 7586 7761 
7792 7864.7870 8035 8065 8085 81 10 8120 8140 8224 8226 
8298 8372 8427 8452 8456 8535 8648 8672 8674-86798681- 
8684 8816 8838-8839 8870 8898 9012 9041 9079 9128 9257 
9264 9304 93 17 9460 9503 95 17 9567 9623 9734 978 1 9792- 
9798 9929 9964 9999 10296 10330 10469-10470 10578 10679 
10778 10786 10895 10984-10986 11032 11052 11069 11130 
11145 1123911289 11402 11818 1186211870-1187611878- 
1 1881 12017 12037 12127 12160 12294 12363 12375 12405 
12424 12438 12467 12539 12570 12590 12615-12616 12618 
12685 12688 12712 12739 12748 12830 12913 12916 12948- 
12950 13002 13064 13073 13083 13141 13150 13153 13164- 
13166 13257 13391 13456 13479 13489 13492 13494 13499 
13501 13503 13560 13595-13596 13627 13645 13679 13782 
13795 13861 13866 13869 13882 


adult brain 


GIBCO 


ABD003 


67 83 142 443 587 598 608-609 611 613-624 633 731 734 737- 
742 760 799-800 809 1 148 1 152 1 167-1184 1 193 1346 1433 
1516 1552 1575 1671 1756 1774 1833 1974 21382145 2176- 
2178 2237 2266 2299-2301 2303-2306 2343 2363 2412 2444 
2449 251 1 2516 2555 2569 2576 2614 2716 2809 2876 2911 
2926 3001 3093 3114 31 193121-3124 3126 3128-3130 3234 
3254-3256 3258-3263 3265-3267 3270-3274 3276-3277 3280- 
3281 3284 3286 3348 3356 3378 3435 3459 3484 3537 3548 
3595 3605 3625 3627 3634 3686-3697 3700 3702 3709 371 1 
3720 3722 3737 3757 3797 3804 3810 3839 3856 4006 4019 
4025 4040 4055 4057-4058 4060 4078 4194 4201 4246 4253 
4277 4282 4390 4405 4412 443 1 4620 4622 4641 4689 4751- 
4764 4791 4808 4837 4845 4847-4849 4852-4858 4860-4862 
4864-4869 4940 4957 4962 4972 4998 5021 5031 5037-5038 
5040 5076 5093 5108 51 18 5167 5169 5171-5172 5251-5261 
5263-5265 5270 5364 5401 5481 5492 5521 5523 5535 5656 
5674 5693 5766 5788 5817 5906-5909 5938 6005 6027 6057 
6064 6147 6178 6180-6182 6189 6214 6229-6233 6254 6272 
6369 6371 6421-6426 6555 6595 6598 6601 6799 6803 6825 
6836 6886 6894 6913 6972 6995 7058 7104 7130 7133 7148 
7164 7169 7339 7347 7386 7426 7455 7494 7502 7507 7509 
751 1-7512 7516 7520 7584-75877590-7596 7598-7601 7603- 
7604 7608 7632 7677 7743 7748 7761 7768 7792 7797 7807 

4k ^m. ^41 4^h ■! 4k 4k 4 4k ^ 4^fe 4k ^# M 4^k ^k 4V ^k ^k ^k 4k ^k 4V ^k 4B- 4k 4k ^4 ^4) 

7815 7839 7849-7861 7864 7870 7930 7937 8035 8065 8067 
8080 8087 8095 81 10 8120 8139-8140 8209 8224 8226 8235 
8246 8262 8285 8298 8320 8323 8336 8354 8361 8365 8370 
8375 8387 8452 8456 8535 8556 8576-8577 8603 8630 8648 
8674 8685-8686 8688-8690 8693 8695 8702 8712 8742 8760- 
8761 8763-8764 8766-8769 8813 8815-8816 8830 8834 8838- 
8839 8848 8863 8870 8898 8921 8943-8944 8951 8989 9010 
9041 9050-9056 9058-9064 9076 9079 9092 9097 9128 9144- 
9 145 9257 9264 927 1 9278-9279 9304 93 1 5 93 1 7 9455 9466 
9472 9475 9480 9503 951 1 9517 9525 9539 9689 9734 9773 
9781 9791 9799-9802 9847 9852 9873 9928-9929 9964 9999- 
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Tissue 
Origin 


RNA 
I Source - 


Library 
1 Name 


SEQ ID NOS: 


* 


• 


i 


10001 10003-10004 10053 10175 10267 10276 10330 10349 
10434 10449 10466 10471-10474 10492 10496 10509 10566 
10578 10597-10599 10645 10679 10788 10891 10948 10988- 
10990 11032 11039 11041-11043 11052 11065 11069 11105 
1 1 108 1 1 130 1 1 145 1 1 167 1 1 196 1 1 203-1 1207 1 1209 1 1239 
1 1399 1 1401-1 1402 1 1406 1 1459 1 1470 1 1604 1 1606 1 1642 
1 1 76 1 1 1 8 1 8 1 1 862 1 1 877 1 1 882-1 1 884' 1 1 886 1 1 889-1 1 893 
11944-11946 11981 11988 12016 12019 12022 12037 12083 
12127 12143 12164-12165 12168-12171 12178 12195 12236 
12265 12305 12327 12363 12375 12405 12423-12424 12430 
12438 12546 32570 12590 12594 12612 12615-12618 12630 
12670 12674 12685- 1268o 12693 12704 12706-12707 12748 
12772 12830 12885 12904 12913 12916 12923 12933 12951 
12956 12993 13001 13020-13021 13038 13047 13064 13072- 
13073 13084-13085 13092 13117 13142 13167-13171 13191 
13254 13257 13260 13295 13390-13391 13394 13456 13479 
13483 13489 13497 13501 13503 13505-13507 13512 13516 
13546 13551 13555 13575 13590 13592 13597 13613-13614 
13645 13649 13659 13711 13782 13795 13838 13861 13869 
13875 13882 13884-13885 13888 13892 13896 


adult brain . 


Clontech 


ABR001 


142 858 1542 2174 2407 2483 2652 3272 3287 3460 3492 3535 
3595 3737 3839 4005 40604282 4434 4791 4972 5040 5293 
5523 5530 5535 5788 5906 6082 6601 6799 6980 7373 7577 
7587 7759 7788 7851 8081-8082 8110 9167 9455 9466 9781 
9928 10422 10774 10791 1 1069 1 1401 1 1406 11459 1 1604 
1 1607 11791 11818 11865 11961 11979 12022 12122 12160 
12327 12442 12594 12615 12640 12670 12705 12935 12957 
12985 13047 13197 13257 13456 13511-13512 13546 13554 
13646 13793 13885 13889 13893 


adult brain 


Clontech 


ABR006 


6 67 1004 1908 3272 3286 3548 401 1 4282 4998 5923 5928 
6374 6730 6815 6867 68907067 8365 9264 9729 9780 10776 
11587 11618 12596 12601 12605 12704 12749 12754 12951 
13047 13051 13090 13479 13488 13498-13499 13503 13512 
13575 13882 


adult brain 


Clontech 


ABR008 


6 11 21 41 51 88 142 364 376 579 598 651 736 800 1050 1 148 
1 184 1251-1265 1291 1346 1404 1479 1529 1543 1671-1674 
1697 1699-1710 1820 1830 1832-1838 1840 1848-1849 1908 
1914 1919 1927 1957 1964 1974 1976 1978-19792005-2006 
2050 2081 2090 21 10-21 11 2129 2150 2174 2200 2310 2327 
2342 2408-2410 2420 2444 2449 2461-2467 2484 2490 2499- 
2506 25 1 1 2553 2574 2576 26 1 1 2652 2809 2827 2866 2894 
3032 3207 3535 3591 3610 3634 3715 3722 3737 3766-3770 
3819 4006-4007 401 1 4025 4032 4060 4078 4095 4109 4128 
4143-4)55 4182 4194 4247-4257 4277 4282 4294 4296 4310 
4330 4348 4355 4360 4381 4395 4399 4411 4431 4543 4641 
4662 4694 4698 4767 4781 4791 4808 4833 4837 4985 5001 
5022 5040 5075 5094 5108 5163 5303-5306 5308-5314 5320 
5380 5523 5553 5615 5625-56265634 5638-5644 5701 5706 
571 1 5727-5742 5766 5772 5775 5783 5801 5814 5817 5820- 
5821 5829 5837 5851 5855 5858 5864 5867 58745885 5890 
5897 5901 5906 5923 6057 6125 6214 6223 6288 6302 6456- 
6464 6545 6598 6601-6602 6624 6676-6685 6699 6726 6728 

fHAtL-AlAQ f^'lt^'s ^OO fiR^A £BAO fiRQ^ ARQA 1(\(\A.1(\(\1 

o /*to-o Jny o /o j o / yy ooio ooj4 osou ooyj-ooy* /uu**- / uu / 
7049 7076 7078 7081-7083 7105 7117 71 19 7133 7153 7166 
7431 7579 7708 7768 7849 7900-7905 8018 8083-8084 8095 
8110 8196-8208 8262 8288 8312 8320 8331 8336 8356 8375 
8452 8482 8633 8681 8710 8739 8777 8815 8817 8830 8839 
8963 8965 8983 9010 90979100 9102-9108 9111 9128 9142 
9257 9264 9313 9364 9378-9384 9401 9454-9455 9458 9460- 
9464 9503 9509 951 1 9515-9516 9522 9528-9529 9533 9539 
9542 9544 9573 9577 9646 9773 9780 9924 10000 10025- 
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i 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 


■ 


■ 

» 

i 

• 


« 

■ 

* 


10029 10097 10148 10209 10218-10223 10225-10229 10267 
10275-10276 10284 10292 10296 10303 10313 10326 10329- 
10331 10333-10334 10343 10346 10393 10430 10494 10496 
10513 10542 10557 10613-10616 10679 10688 10691 10719- 
10721 10729 10743 10758 10760 10776 10782 10786 10795 
10895 1110011114 1113211195 1124011242-1125411401 
1 1406 1 1437 1 1454 1 1459 1 1462 1 146&1 1473 1 1475-1 1476 
11519 11532-11535 11550 11553 11555-11556 11559 11588- 
115 89 11606116151161811 62 111 627- 1 1 62 8 11 633 1 1 76 1 
11791 11807 11818 11932 11955 12006 12039 12041 12092 
12212-12224 12231 12236 12305 12363 12368 12396-12399 
12405 12424 12439 12442 12465-12466 12468 12477 12479- 
12482 12490 12521 12546 12552-12553 12576-12577 12579 
12582 12585 12590 12601 12609 12617-12618 12636 12658 
12707 12725 12735-12736 12749 12754 12776-12779 12859 
12868 12894 12905 12909 12940 12955 12959 12977 12980 
12990 13002 13004-13005 13020 13035 13038 13042-13044 

13047 13051-13052 13056 13062 13073 13082-13083 13196 
13249 13280 13311-13312 13336 13387 13417 13421-13426 
13436 13445 13456 13458 13479-13482 13488 13490 13494- 
13495 13497-13500 13503 13507 13512 13516 13533 13546 
13554-13555 13590 13613 13630 13649 13659 13670 13678 
13713 13724 13769 13793-13794 13808 13827-13828 13838 
13861 13867-13868 13875 13882 13884-13885 13888-13889 
13893 13896 13898 


adult brain 


Clontech 


ABR011 


1006 1257 3797 4006 4025 5535 6057 7169 7870 8262 8937 
8966 9257 10778 12736 13394 13679 13793 13861 


adult brain 


BioChain n 


ABR012 


88 598 1007 1 134 2597 3557 3590 3627 3797 4006 4192 4246 
4282 4391 4940 5523 5535 6288 6338 7138 81 10 8898 9076 
9401 9455 9476 10772 1 1061 1 1 1 14 12989 13394 1351 1 13866 


adult brain 


Invitrogen 


ABR013 


598 2614 3191 4355 4391 5523 5788 8085 8486 11513 12521 
12989 13861 


adult brain 

• 


Invitrogen 


ABT004 


40 51 598 1050-1057 1148 1777-1778 1947 19762270-2272 
2327 2490 2617 3050 3600-3602 3722 3987 4390-4391 4434 
4543 4689 5031 5157-5159 5167 5169 5466 5505 5682-5683 
5701 5766 5778 5794 5902 6147 6367-6371 6459 6545 6709 
6728 6783 6801 6971 7104 7175 7815 7839 7864 8139 8342 
8345 8355 8363 8372 8452 8633 8963 8975-8976 9012 9133 

VJ~<J S**J*JtJ UJwJ 1 Am UtJA UUJJ D//i/ U7IU 71JJ 

9423-9424 9511 9515 9517 9528 9556 9827 9949 10260 10267 
10275 10570-10571 10733 10767 11132 11159 11406 11459 
11932 12009 12092 12109-12111 12127 12283 12428 12511 
12579 12605 12725 12747 12830 12885-12886 12910 12913 
12954 12987-1298913051 13054 13062 13073 13090 13249- 
13253 13438 13445 13456 13489 13500 13512 13516 13533 
13546 13590 13622 13649 13683-13684 13713 13803 13838 
13861 13866 13896 


cultured 

d re adi doc vtes 


Strategene 

■ 


ADP001 


1 134 1346 2343 2614 3272 3426 3610 3720 3839 3885 401 1 
4277 4282 4297 4346 4388 4391 4405 4434 4641 4833 4940 
4985 5018 5030 5040 5163 5167 5523 5581 5778 5788 5794 
5895 5951 6082 6147 6272 6607 7067 7141 8093 8235 8285 
83 12 8363 8629 8648 8830 8839 9290 9401 9466 9503 9781 
10346 10470 10776 10795 10971 1 1 108 1 1 170 1 1513 1 1818 
12034 12037 12046 12093 12375 12387 12405 12424 12570 
12636 12670 12674 12688 12735 12749 12913 12940 13126 
13163 13295 13489 13494 13497 13499 13511 13516 13575 
13652-13653 13866 13888-13889 


adrenal gland 


Clontech 


ADR002 


8 83 142225 351 443 553 569 731 864 1134 1266-1271 1273- 
1274 1276-1292 1294-1295 1381 1391 1544-1545 1658 1671 
1 908 1 959 1 983 20 1 0 2023 2 1 45 21 75 2283 23 1 0 2328-2334 
2343 2444 24492510 2522 2576 3032 3069 3153 3166 3272 
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Tissue 
Origin 


RNA 
Source h 


; Library 
Name 


SEQIDNOS: 


• 

1 


i 




3378 3416 3548 3625 3709 371 1 3771-3788 3790-3791 3797 
3839 3870 3899 3985 4019 4054 4060 4109 4173 4192 4194 
4201 4253 4277 4282 4389 4391 4395-4396 443 1 4667 4687 
4694 4783 4957 4966 4985 4998 5037 5108 5293 5316-5323 
5325-5336 5481 5505 5527 5531-5533 5560 5628 5656 5701 
5766 5865 5874 5902 5939 5979 6020 6052 6057 6227 6288 
6354 6405 6449 6465-6482 6521 6603 6*726 6894 6979 7008- 
701 1 7155-7156 7204 7604 7634 7845 7871 7906-7912 7915- 
7918 7920-7930 8022 8067 8085-8086 8095 8110 81 16 8224 
8262 8363 8365 8412 8520 8535 8554 8699 8742 8831 8870 
8950-8951 9002 9010 9012 9097 9109-91 19 9121-9127 9190 
9264 9280 9304 9317 9455 9457 9495 9503 9553 9556 9586 
9709 9780-9781 9895 9927 10007 10030-10032 10034-10035 
10037-10045 10119 10277 10284 10346 10595 10617-10620 
10623-10627 10645 10675 10679 10760-10761 10766 10774 
10782 11108 11194 11255 11258-1 1270 1 1289 11400 11406 
11457 11519 11588 11600 11621 11626 11818 11952 12080 

191^0 1999^ 19997.1993ft 19939-1 9939 1 9930-1 99Afi 199A9 

12249 12251 12305 12363 12375 12404-12405 12424 12439 
12577 12599 12601 12630 12636 12657-12658 12663 12688 • 
12693 12706 12713 12715 12735 12749 12754 12780-12787 
12848 12863 12909 12913 12923 12957 12990 13006-13007 
13020-13021 13064 13095 13104 13295 13313 13315 13388- 
13389 13391 13456 13473 13494 13507 13515 13546 13613 
13631 13679 13725-13733 13858 13866 13872 13883-13884 
13888-13889 


adult heart 


G1BCO 


AHR001 


51 83 88 94 221 239 360 366-367404410-41 1 413 415 458- 
459 461 465-468 471 473-478 486 545-546 559 567 616 625- 
630 743-744799 802-806 808 810-835 837-842 959 1004 1066- 
1085 1134 11781184-1193 1346 1512 1516 1546-1547 1556 
1575 1671 1727 1774 1829 1959 1976-197720902096 2108 
21 10 2128 2138 2145-21472161 2179 2195-2198 2257 2276 
2278-2281 2302 2307-2309 2363 2398-2399 2409 241 1-2412 
2444 2449 2497 25 1 6 2529 2563 2569 2575-2576 2597 2605 
2614 2617 2762 2809 2816 2879-2880 291 1-2924 2926 2961 
2978-2980 2985-2986 2993 2995-3002 3032 3042 305 1 3058 
3069 3081 3091-309431093114 3132-3133 3135 3137-3138 
3141 3191 31963199 32153263 32723282-32863317 3349- 
3350 3353-3359 3361-3362 3364-3367 3370-3393 3396-3399 . 
3403 3406 3425 3451 3465-3466 3479 3500 3503 3537-3538 
3544 3548 3550 3555 3557 3590-3591 3595 3604 3606-3612 
3614-3621 3623-36273634 3689 3697-3698 3701-3709371 1- 
3713 3720 3722 3737 3757 3797 3839 3885 3898 3988-3989 
3996 4005-4006 4008 4010-401 1 4019 4021-4022 4025 4040 
4043 4054-40554058-40604078 4183 4192 41944201 4246 
4253 4269 4277 4282 4341 4351 4391 4403 4405 4434 4517 
4543 4553 4590 46 1 5 4622 463 1 4633-4634 464 1 4654 4664 
4666-4667 4672-4675 4688-4689 4693-4694 4738 4740 4755 
4783 4828 4870-4871 4905-4906 4909-4915 4917-4919 4921- 
4924 4926-4927 4940 4957 4972 4985 4991 4998 5001 5030 
5037-5038 50405076 5118 5163 5171-5175 5177-5178 5180- 
5182 5266-5268 5380 5422 5481 5490 5492 5506-5507 5523 
5534-5535 5581 5656 5682 571 1 5740 5766 5788 5895 5908 
5923-5924 5938 6047 6057 6083 6085 6104 6110-6111 6117 
6147 6184-6185 6189 6195 6215 6255-6257 6259-6266 6268 
6272 6288 6307 6336 6375-6379 6382 6427-6429 6458 6555 
6588 6595 6607 6629 6665 6689 6765 6767 6799 6815 6871- 
6873 6879 6881-6882 6898 6926-6928 6975-6978 6987 7058 
7090 7148 7313-73147341 7343-7344 7346-73477350 7354 
7359 7362 7373 7380 7394 7402 7407-7408 7410 7413 7415- 
7416 7418-741974267431 7468 7473 7480 7494 7505 7513- 
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Tissue 


RNA 


Library 


SEQ W NOS: 


Origin 


Source 


Name 




* 


i 

i 

i 


i 

• 


7516 7520 7544 7561 75847587 7599 7601-76047635 7638- . 
7643 7645-7649 7651-7655 7657 7659-7662 7733 7735 7743 
7748 7783-7796 7815 7852 7857 7862-7863 7865-7870 7930 
7933 7983 8062-8065 8067 8087-8088 8093 8095 81 10 81 16 
8120 8139-8140 8224 8226 8235 8262 8298 8336 8344-8345 
8354 8356 8363 8368 8372 8378 8387 8410 8427 8452 8456 . 
853 1 -8532 8534-8535 8563-8569 8572 8576-8577 8592-8593 
8597 8603 8606 8610 8613-8614 8616-8617 8646 8648 8670 
868 J 8691 8698-8699 8702 8712 8742 8756 8760 8763 8795 
8807.-8809 8811-8814 8816-8819 8821-8833 8835-8839 8858 
8863 8870 8898 8921 8927 8936 8939 8943 8946 8950-8951 
8966 8988-8992 8994-8995 9010 9049 9059 9065-9067 9070- 
9072 9076 9097-9098 9167 9190 9257-9260 9262 9269 9281 
9287 9301 9304 9315 9317 9401 9454-9455 9466 9476 9480 
9484 9556 9577 96 12 9689 9698 9720-972 1 9734 974 1-9743 
9747-9750 9758 9781 9791 9804 9871-9882 9884-9885 9928 
9939 9942 9954-9960 9999-10000 10005 10175 10179 10275 
10284 10292 10296 10329-10331 10346 10400 10422 10430- 
10431 10437 10442 10444-10447 10475 10511-10512 10514- 
10521 10557 10576-10577 10616 10645 10679 10691 10729 
10742 10744 10772 10774 10777-10778 10782 10788 10839 

10KQ1 10894-1089^ 10007 10917-10970 10937 1094? 1094£ 

10948 10969-10970 10992 11032 11044-11045 11061 11066- 
11074 111081111411132 11145 11153 13165-1117011173 
11205 1120811210-1121511283 112891138611388 11401- 
11402 11406 11462 11559 11565 11576 11596 11606 11615 
11620 11744 11788-11789 11793 11818 11823-1 1825 11831- 
11832 11869 11894 11947 11961 11982-1198912000 12006 
12009 1201 1 12019-12020 12028 12037 12044 12078 12081 
12093 12119-12122 12143 12160 12166 12172-12175 12177- 


i 

- 


■ 


• 

•< 


12179 12197 12305 12335 12363 12375-12376 12383 12387 
12400 12402 12405 12424 12428 12438 12479 12521 12523 
12546 12560 12564 12570 12590 12599 12601 12605 12609 
12631 12616-12618 12653 12662-12663 12670 12674 12688 
12717-12720 12734 12752 12754 12772 12905 12907 12914 
12916-12917 12923 12925 12940 12961 32963-12965 12989 
12991 12993 13020-13021 13033 13035 13072-13073 13082- 

A A m ^ V <b A ™* M pa aj»F aaja- aajpa- p> *a» •aa* apa^ a •jp* W ^aaj 4 ^P *W ™ *P^ ™P ^P aaaM 

13083 13104 13117 13126 13132 13136 13141-13142 13148 
13169 13203-13210 13212 13260-13261 13293 13295 13326 
13377 13394 13413 13442 13456 13477 13480 13488 13490 
13494-13496 13501-13503 13506-13507 13511 13516 13533 
13568 13570-13571 13575 13582 13592 13613 13624-13630 
13632 13644 13646 13659-13660 13678-13679 13689 13701 
13711-13713 13775 13782 13795 13797 13866-13869 13872 
13882 13884-13885 13893 


adult kidney 

* 


GIBCO 

• 


AKD001 


49 67 83-84 142 354 405-407 415-429 431-432 445 460 462 
479-484 486 488 492-493 524 548-549 598 616 63 1-638 744 
787 809 841 1004 1068 1086 11601163-1166 3171 11841193 
1346 1359 1449 1479 1516 1552 1556 1671 1724 1727 1774 
1826 1858 1914 1974 1976 1978-19792081 2097-20992111- 
2113 2117-2118 2129-213021382145 2148-215021612186 
2218 2266 2268 2302 2310 2327 2343 2363 2409 2412 2444 
2449 2468 2483 2523 2569 2576 2614 261 7 2827 2845 2876 
2910 2915 2926-2931 2933-2934 2938-2943 2945-2947 2955- 
2956 2976 2981 2983-2984 3001-3016 3018 3053 3109 31 14 
3140 3142-31463149-315031993254 3265 32723283-3284 
3286 3333 3350 3356 3378 3406 3435 3445 3460 3492 3503 
3535 3537 3544 3548 3590-3591 3598 3625 3627 3634 3676- 
3677 3679-3685 3689 3697 3709 3711 3720 3722 3737 3757 
3797 3808 3810 3839 3885 3989 4005-40064011 4019 4022 
4025 4040 4043 4054-4055 4060 4078 4109 4192 4194 4201 
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Tissue 
Origin 



RNA 
Source * 



Library 
Name 



SEQ 1J> NOS: 



4246 4253 4269 4277 4341 4354 4387-4388 4390 4398 4402 
4405 4410 4434 4473 4540 4543 4583 4591-4592 4607 4637 
4640^4641 4649 4654 4665 4668 4676-4677 4680-4685 4688- 
4689 471 1-4712 475 1 4758 4766-4768 4783 4808 4833 4836- 
4837 4845 4874 4904 4940 4957 4962-4963 4972 4983 4991 
4998 5022 5037 5040 5076 5093 51 18 5143 5163 5171-5172 
5246-5249 5281 5362 5364 5380 5422 5481 5521 5523 5526- 
5527 5535 5656 5693 5726 5740 5766 5769 5778 5788 5794 
5894 5902 5908 591 1 5916-5917 5923 5928 5938 5990 6005 
6049-6051 6057 6078 6082 6086 6088 6105-6107 6112-61 17 
6120-6122 6147 6159 618^-6189 6215 6257 6272 6288 6369 
6418 6423 6430 6555 6595 6598 6738 6743 6765 6767 6793 
6799 6805 6815 6836 6857 6867-6870 6872-6873 6878 6884 
6886 6893 6899-6900 6931 6947 6976 6979 6988 7024 7045 
7058 7104 7118 7155 7275 7281 7306 7312 7338 7347-7348 
7351-7352 7355 7357 7359-7363 7366 7369 7371-7373 7381 
7383 7386 7400 7402 7404-7406 7408 7420 7422 7424 7426- 
7428 7430-7431 7433-7435 7439 7455 7465-7468 7494 7502 
7506 7517-7521 7579 7587 7604 7634 7639 7642 7733 7735 
7743 7748 7792 7797 7839 7842-7846 7848 7852 7857 7862 
7864 7870 7930 7933 8065 8077 8084 8087 8093 8095 8105 
8110 8116 8120 8139-8140 8156 8224 8226 8235 8262 8320 
8336 8345 8351 8354 8359 8364 8368 8370 8372 8375 8377 
8387 8427 8452 8456 8461 8486 8518 8520 8527 8535 8563- 
8564 8566 8572-8573 8575-8577 8580-8583 8585 8588-8589 
8597 8603 8618-8621 8623 8625-8626 8628 8630 8634 8647- 
8650 8664 8674 8677-8678 8691 8694 8701-8704 8706 871 1 
8722 8740 8742 8763 8792 8798 8806 8812-8813 8816 8830 
8838-8839 8848 8853 8863 8876 8898 8921 8935 8943-8944 
8948 8951 8966 8989 9012 9041 9047-9049 9076 9092 9128 
9 1 37 9 1 67 92 1 8 9244 9249 9257 9264 9287 9304 93 1 5 93 17 
9424 9455-9457 9466 9472 9475 9484 9503 951 1 95 17 9529 
9536 9542 9544 9558 9612 9683 9699 9703 9722-9723 9725 
9734 9744 9752-9755 9758 9772-9773 9780-9781 9805-9807 
9841 9843 9927-9929 9939 9942 9969 9998-10000 10007 
10175 10275 10284 10287 10292 10319 10346 10376 10423- 
10424 10431 10442 10446 10448 10450 10475 10480 10496 
10542 10557 10645 10679 10774 10778 10782 10788 10895 
10921-10928 10947 10949-10953 10986 10994-10996 11032 
11052 11061 110691111411130 1113211143 11145 11197- 
11202 11205 11208 11212 11239 11289 11401-11402 11406 
11459 11513 115861159611604 116071161811620 11695 
11711 1175911761 11790-11791 11793-1179411818 11820- 
1 1 822 1 1 834- 1 1 836 1 1 857 1 1 865 1 1 869 1 1 895- 1 1 899 1 1 950 
11970 12006 12022 12041 12078 12143 12159-12160 12178 
12195 12197 12256 12265 12305 12327 12359 12363 12375 
12387 12405 12424 12428 12438 12442 12467 12511 12521 
12523 12535 12539 12546 12564 12570 12590 12599 12601 
12609-12611 12616 12618 12630 12653 12656 12664 12666- 
12667 12670 12674 12688-12689 12691 12739 12754 12830 
12834 12904 12913-12914 12916-12917 12923 12940-12941 
12951-12952 12956 12972 12976-12977 12993 12999 13002 
13052 13062 13064 13066 13072-13073 13082-13083 13095 
13104 13127 13131 13133 13136 13141 13143-13146 13148 
13154-13155 13169 13172 13184 13195-1319613249 13254 
13260 13263 13280 13295 13307 13319 13394 13442 13456 
13477 13479-13480 13488-13490 13492 13494-13495 13497- 
13503 13506-13507 13512 13515-13516 13546 13551 13554- 
13556 13572-1357413590 13592 13613 13627 13631 13644- 
13645 13656 13659-13660 13665 13670 13710 13713 13776 
13782 13795 13859-13860 13864 13866-13868 13872 13882- 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 






i 


13885 13888 13891-13892 


adult kidney 

i 

i 


Invitrogen 

< 

■ 


AKT002 

• 


67 142 550-552 598 639-641 10041015 1493 1516 19472215 
2299 2343 2363 2449 2618 3054-3055 3152-3153 3155-3157 
3434 3535 3590 3709 3797 3808 3839 3885 401 1 4022 4025 
4040 4253 4277 4391 4405 4671 4759 4767 4769 4837 4949 
4972 5001 5035 5037 5052 5108 5526 5581 5615 5726 5788 
5895 6062 6139 6190-6191 6803 6900 6947 6975 7468-7469 
7473 7733 8095 81 10 8139 8262 8323 8361 8363 8375 8520 
8539 8648 871 1 8798 8912 8950 8966 8983 9076 9264 9368 
9510 9517 9665 9703 9734 10175 10476 10791 10989 10997 
11132 11618 11745 11900 12006 12039 12160 12363 12375 
12405 12424 12685 12702 12707 12904 13035 13060 13104 
13136 13295 13394 13456 13488 13495 13501 13512 13533 
13554 13583 13644 13670 13679 13696 13713 13795 13866 
13888 13891 


adult' lung 


GIBCO 

* 


ALG001 


83-84 553 598 642-644 650 747 975 1004 1009 1015-1022 
1449 1516 1816 2161 21862215 2343 2444 29683056 3158 
3160-3162 3345 3466 3503 3519 3566 3568-3573 3595 3709 
3797 3810 3839 3885 4025 4039 4059 4194 4405 4622 4755 
4767 4770 4797 4865 4940 4949 4963 4972 4987 4991 4998 
5001 5117-512? 5171-5172 5233 5401 5481 5581 5724 5788 
5938 5960 6123 6125 6140 6215 6322 6336 6343-6348 6371 
6886 6966-6967 7024 7143 7275 7441 7444 7502 7522-7523 
7749-7752 8093 81 10 8140 8336 8345 8375 8378 8449 8535 
8612 8622 8635 8648 8651 8674 8702 8707-8708 8838-8839 
8898 8941-8942 8944 8948 8951 9076 9128 9457 9466 9475 
9497 9503 9544 9567 9703 9756 9758 9808-9810 9843 9873 

101^1 10175 10775 1019Q 10^44 10400 104SR 
yyz,\> yyyy iuioi iui /j iua/j i\Joz,y i\jjh*t ivrtuu iu*tjo 

10554-10555 10796 10998 11032 11046 11069 11132 11144- 

11145 11483 11627 11818 11837 11901 12006 12028 12039 

12041 12087-12092 12146 12363 12375 12424 12438 12539 

12570 I260I 12617 12661 12670 12674 12688 12738 12749 

12754 12904 12940 12974 13062 13173 13195 13232 13234 

13260 13295 13307 13456 13472 13477 13490 13494 13497- 

13498 13569 13584 13592 13660 13663-13666 13670 13743 

13776 13882 13885 13889 13891 


lymph node 

■ 


Clontech 

• 


ALN001 

- 


83 142 364 487 495-497 554 629 645-646 648-650 716 938-951 
953-962 1134 1516 1549 1671 1774 19762138 2225-2232 
2234-2235 2340 2879 3019-3020 3057 3103 3163-3165 3272 
3356 3498-3505 3507-3513 3515-35163538 3548 3628 3697 
4194 4201 4253 4405 4641 4687-4688 4771-47724783 4808 
4845 4963 4972 4987 4998 5042-5065 5076 5163 5504 5523 
5835 5895 5917 6027 6142 6192 6272 6288 6308-631 1 6313- 
6314 6765 6805 6871 6949-6951 7700-7705 7707-7709 8085 
8088 8091 8110 8235 8375 8387 8432 8629 8631 8633 8648 
8677 8709 8713-8715 8830 8863 8887-8889 8891-8896 8943- 
8944 8966 9010 9076 91119 12 8 91 42 9222 9455 9472 9520 
9544 9734 9774 9780 981 1 9905-9907 9928 9939 9999 10027 
10129 10296 10439 10452 10501 10543-10545 10679 10777- 
10778 10788 10891 10999-11000 11111-11113 11115 11130 
11145 11344 11406 11513 11584 11885 12006 12028 12050- 
12056 12143 12256 12363 12405 12442 12570 12674 12690 
12913 12917 12940 12968-12970 13173 13220-13221 13394 
13400 13492 13503 13511 13533 13630 13642 13645 13713 
13868 13885 13889 13891 


young liver 


GIBCO 


ALV001 


211 498 598 651-653 1008-1014 1 193 1264 1575 1976-1977 
2131 2161 2254-2255 2269 2363 2568 2617 2627 2633 2636 
2961 3021 3059 3093 3166 321$ 3272 3356 3378 3426 3479 
3559-3565 3590 3597 3627 3634 3673 3709 3797 3810 3885 
3993 4006 40 1 1 40 1 9 4025 4 1 94 4246 4253 4277 4422 4426 
4431 4434 4437-4438 4634 4654 4687 4714 4722 4755 4773- 
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Origin 


RNA 
Source * 


Library 
Name 


SEQIDNOS: 








4774 4808 4957 5001 5037 5052 5113-5115 5163 5167 5171 
5508 5510 5526 5581 5616 5693 5726 5757 5772 5788 5928 
6125 6141-6142 6193 6288 6340-6342 6589 6765 6815 6886 
6901 7024 71 18 7141 7442 7524 7735 7747-7748 7792 7852 
7870 7924 7930 8066 81 10 8120 8262 8375 8378 8518 8520 
8648 8652 8691 8702 8716-8718 8816 8838-8839 8938-8940 
9097 91 19 9263-9264 9301 9317 9424 9454-9455 9544 9689 
9703 9734 9758 9775 9925 9947 10393 10477 10769 10774 
1 1001 1 1 132 1 1 140-1 1 142 1 1 173 1 1208 1 13 18 1 1406 1 1587- 
11588 11725 11804 11902 12081-12086 12118 12160 12178 
12375 12377 12405 12424 12479 12511 12570 12599 12601 
12618 12670 12674 12688 12737 12830 12913 12916 12933 
12953 12977 12993 13047 13062 13073 13126 13169 13174 
33230-13231 13295 13488-13489 13494 13498-13499 33502 
13506-13507 13575 13592 13646 13660-13662 13670 13866 
13868-13869 13882 13888 13901 


adult liver 

■ 


Invitrogen 


ALV002 

■ 


6 25 60 142 598 1004 1213 1296-1301 1381 1493 1513-1514 
1977 2139 2161 2269 2310 2335 2342 2400 2413 2449 2458 
2497 2568-2569 2576 2636 2809 2827 2880 2926 3471 3484 
3503 3597 3722 3792-3796 3885 3994-3995 4025 4060 4201 • 
4326 4422 4426 4618 4689 4767 4786 4985 4998 5093 5163 
5337-5339 551 1 5526 5581 5693 5701 5724 5726 5757 5895 
5922-5923 5979 6020 6027 6083 6125 6192 6195 6253 6333 
6346 6483-6485 6716 6765 6797 6975 7169 7422 7468 7614 
7642 7807 7932 8089 81 10 8140 8262 8323 8378 8633 8677 

R71 1 R7A0 R7^Q R7R6 WXA RR^Q RQdA 0007 Q17R-Q17Q Q7AA 

9265 9282 9466 9484-9485 9517 9677 9700 9773 10007 10046- 
10047 10616 10660 10777 1 1037 1 M 14 11 1^7 1 1 149 

1119411271 1138911462 11502 115601158711602 11818 
11980 11989 12086 12160 12195 12253-12254 12316 12327 
12363 12378-1237912411 12424 12511 12570 12630 12693 
12706 12788-12789 12840 12863 12913-12914 13047 13062 
13072-13073 13090 13242 13256 13378-13379 13389 13479 
13487 13489 13497 13555 13734-13738 13859 13864 13889 


adult liver 


Clontech 


ALV003 


346 2529 3548 3797 3885 4438 4940 5101 5801 5902 9597 
10769 10778 11587 1 1927 12086 


adult ovary 

« 

* * 


Invitrogen 


AOV001 


6 13 40 67 83 88 142 196 444 522 555-560 577 579 598 609 
654-657 659-665 667-668 708 740 745-749 751-754 756-761 
806 841 871 1004 1023 1028-1034 1036-1039 1041-1047 1055 
1087-1115 1119 1151-1162 1298 1346 1359 1456 1493 1507 
1516 1658 1697 1752 1774 1812 1826 1848 1914 1974 1976- 
1977 1979 1983 20992111 2138 2153-21552161 2175 2180- 
2181 2186 2258-2267 2284-2286 2288-2290 2298-2299 2342- 
2343 2449 2483 2523 2529 2555 2569 2576 2591 2597 2618 
2708 2750 2783 2818 2839 2926 2961-2962 2988 3007 3022 
303 1 3060-306 1 3063 3069 3 1 66-3 1 72 3 1 74-3 1 75 3 1 77-3 1 89 
3226 3272 3286 3288-3289 3291-3293 3295-3299 3435 3450 
3460 3479 3486 3503 3535 3577-3589 3612 3627 3629 3631- 
3643 3673 3675 3720 3722 3737 3797 3806 3810 3812-3813 
3839 3885 3985 4006 4012-4014 4023 4025 4040 4043 4060 
4078 4133 4192 4194 4201 4246 4269 4277 4282 4341 4362 
4381 4383 4385 4388 4390-4391 4396 4399 4402 4405 4410 
4434 4465 4473 45 15 4543 4582 4610 4654 4689 4694 4715- 
4717 4722 4759 4767 4775-4782 4836-4837 4873-4878 4880 
4940 4944 4957 4960 4962-4964 4972 4975 498 1 4985 4998 
5001-5002 5018 5030 5037 5040 5045 5075-5076 5093 5108 
5135-5136 5138-51465148 5163 5191-52095213 5238-5245 
5262 5374 5380 5451 5455 5490 5503 5505 5521 5523 5527 
5560 5581 5674 5701 5724 5757 5778 5783 5788 5794 5817 
5874 5894-5895 5902 5906 5908-5909 5916-5917 5923 5932 
5938 5979 5989-5990 6005 6027 6048 6054 6057 6061 6078 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 


i 

■ 

* 

■ 

• 


* 

♦ 


• 


6082 60866125 6140 6143-6144 6165 6171 6192 6194-6200 
6215 6235-6239 6241 6270 6288 6307 6333 6336 6354-6360 
6362-6364 6374 6385-6392 6410 6412 6415-6417 6458 6461 
6508 6555 6583 6595 6598 6604 6607 6624 6626 6629 6708 
6765 6767 6793 6797 6799 6801 6805 6825 6860 6870-6871 
6873 6886 6888 6900 6902-6904 6915-6917 6947 6969-6970 
6975 69 80-6987 6998 7054 705 8 7 1 04 7 1 09 71 1 8 7 1 33 71 37 
71417155 7166 7169 71717191 7204 7281 73867408 7431 
7443 7468 7471-7472 7494 7507 7525-7528 7604-7609 7614 
7642 7687 7716 7729 7762-7763 7765-7767 7769-7771 7773- 
7775 7788 7802-7818 7829-7835 7838-7841 7864 7905 7937 
7977 8007 8035 8077 8088 8093 8095 8110 8120 8134 8139- 
8140 8143 8156 8235 8246 8262 8292 8320 8336 8345 8355 
8358 8363 8365 8368 8370 8372 8375 8387 8444 8452 8456 
8486 8520 8593 8610 8633-8634 8653 8719-8724 8773-8777 
8779 8781-8782 8792 8798 8813 8830 8839 8853 8863 8870 
8876 8898 8928 8944 8950 8956-8961 8963-8971 8989 9004- 
9007 9009-9019 9042-9046 9055 9076 9097 9128 9134 9185 
9264 9283-9284 9291 9313 9358 9424 9445 9454-9455 9457 
9460 9466 9471-9472 9475 9480 951 1 9517 9533 9539 9542 
9544 9553 9565 9577 9586 9612 9626 9677 9694 9703 9722 
9734 9757-9758 9773 9776-9778 9791 9812-9819 9827 9838 
9843 9848-9853 9922 9927-9940 9942 9962-9970 9989-9992 
9994-9997 10007 10027 10097 10149-10152 10249 10252 
10275 10298 10333-10334 10346 10349 10351 10383 10386 
10417 10438-10439 10452 10459 10470 10478-10480 10493- 
10498 10557-10562 10564-10565 10580-10586 10594-10596 
10645 10658 10676 10679 10767 10772 10774 10778 10782 
10788 10791 10954 10971-10972 11002-11004 1 1032 11047- 
1 1052 1 1061 11069 1 1083 1 1 108 1 1 130 1 1 132 1 1 15 1-1 1 153 
11155 11157-111581116711170 11172-1117611193-11196 
11274 11343 11369 11406 11411 11431 1158811596 11600 
11604 11606-1160711618 11620-11621 11629 1166811814 
11860 11865 11876 11903-11909 11932 11949-11954 11980 
11985 12000 12006 12017 12022 12033 12039 12080 12083 
12092-12093 12095-12104 12127-12129 12131-12142 12146 
12159-12160 12162-12163 12178 12213 12216 12236 12256 
12260 12305 12327 12363 12368 12371 12375 12379 12387 
12401-12402 12405 12424 12430 12467 12520 12522 12546 
12570 12576 12590 12594 12599 12605 12609 12611 12615 ' 
12617-12618 12630 12636 12643 12657-12658 12663 12670 
12674 12685 12688 12691 12693 12702 12705-12707 12713 
12724 12729 12735 12740-12745 12749 12754-12760 12765- 
12770 12777 12842 12848 12875 12904 12906 12910 12913 
12916-12917 12935 12940 12955 12957 12963 12972 12977- 
12980 12982-12984 12992-12994 12997-13000 13020 13034 
13047 13051 13054 13056 13060 13062 13066 13070 13072- 
13073 13075 13082 13090 13092-13093 13104 13126 13136 
13141 13175-1317713179-1318013193-1319413196-13197 
13202 13236-13241 13243 13245 13249 13254 13263-13275 

m w ^r^^ a ^m ^^mr ^m — *m ■ ^ • wm^^ » mm • mm 7 ■ m^ » mm ^m ■ ^ mm m>m^r • • mm ^m^mmm 7 A w • tr 

13277 13286-13292 13295 13319 13351 13377 13389-13391 
13394 13420 13436 13456 13477 13479 13488 13492 13494- 
13495 13497-13499 13502-13503 13506 13512 13516 13533 
13546 13549 13554-13555 13575 13590 13597 13600-13601 
13613 13616-13620 13627 13631 13644-13645 13649 13659- . 
13660 13670 13674-13679 13693-13699 13707-13709 13713 
13782 13803 13864 13866 13868-13869 13872 13875 13882- 
13885 13888-1388913892 


adult placenta 


Clontech 


APL001 


669-671 1006 1134 1184 15512053 20902156-21582342 
2490 2716 3001 3064-3065 3190 3272 3625 3670 4019 4055 
4194 4201 4246 4641 4718 472(M721 4783-4786 4957 5523 
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Library 
Name 


SEQIDNOS: 








5536 5581 5788 5895 6145-61466201-6202 6358 6803 7049 
7204 7529 8367 8375 8609 863 1 8630 8725-8727 9097 9480 
9734 9820 10319 10460 10579 10795 10895 11519 11723 
11737 11776 11827-11828 11959 12403 12546 12570-12663 
12848 12875 12913 13051 13394 13782 13869 13889 


placenta 


Invitrogen 


APL002 


142 561 1023 1551 19762449 2569 2614 319133403592 3668 
4060 4346 4767 4787-4789 4892 4985 5536 5674 5693 5757 
5772 5794 5906 6147 6203 6215 6598 6788 6799 6979 7158 
7530-7531 8139 8361 8611 8863 94579484 9517 9563 9703 
9758 9821 9927 9939 9942 10481 10557 10778 10802 10973 
1 1 006-1 101 1 1 1723 1 173 3 1 1737 1 1776 1 1 807 11 866 11910- 
11916 11959 12379 12403 12590 12685 13020 13054 13392 
13489 13533 13554-13555 13867 13882 13888 13891 


adult spleen 

« 

, i 


GIBCO 

i 


ASP001 


463 499 562-563 598 672 990-993 995 1004 1020 1346 1515- 
1516 1556 1774 1877 1914 1977 21592161 2215 2248 2257 
2363 2529 2569 2614 2617 2812 2827 3093 3 192 3195 3199 
3272 3286 3350 3356 3434 3470 3503 3537 3544-3546 3548 
3604 3634 3689 3709 3720 3722 3796-3798 3812 3839 3885 
4006 401 1 4022 4025 4043 4194 4201 4246 4253 4277 4388 
4391 43964405 4434 4641 4689-46904781 49574987 4998 » 
5001 5017 5030 5037 5052 5076 5118 5512-5513 5523 5526 
5701 5740 5778 5788 5796 5801 5895 5938 6005 6064 6125 
6140 6147 6192 6204 6272 6329 6458 6551 6590 6607 6873 
68866931 6958 7018 7058 7109 7118 7133 7171 7233 7506 
7532-7533 7561 7586 7733 7736 7807 7842 7845 7933 8065 
8085 8093 8095 8110 81 16 8139-8140 8226 8235 8262 8323 
8359 8363 8365 8368 8372 8456 8535 8648 8702 8792 8816 
8838-8839 8858 8863 8865 8876 8923-8928 8943 8950-8951 
9128 9257 9264 931 7 9455 9457 9466 9474 9544 9560 9773 
9781 9919-9920 9927 9939 9942 10000 10048-10049 10155 
10175 10267 10275 10296 10331 10549 10566 10774 10777 
10828 10944 11032 11061 11108 11113 11132 11173 11272 
11401 11406 11620-11621 11804 11818 11861 11917 12006 
12039 12041 12073-12074 12078 12118 12127 12236 12327 
12375 12387 12402 12405 12424 12426 12511 12546 12560 
12570 12601 12617-12618 12670 12674 12734-12735 12749 . 
12772 12790 12913 12916-12917 12972 12977 13002 13021 
13066 13082 13084-13085 13169 13228-13229 13256 13260 
13295 13316-13317 13394 13456 13488-13489 13494 13497 . 
13512 13516 13546 13645 13649 13651 13659 13679 13739 
13776 13864 13866-13867 13888 


testis 


GIBCO 


ATS001 


50 142 500-502 564-565 598 673-678963-968 1193 1346 1556 
1671 18261968 1977 2160-21612236-22372555 2590 2597 
3024 3093 3196-3202 3284 3378 3517-3522 3524 3720 3797 
3839 3885 4006 4025 4055 4109 4253 4354 4377 4388 4405 
4434 4473 4480 4543 4688 4792 4940 4957 4962 4964 4975 
5001 5018 5030 5037 5067-5072 5076 51 18 5171 5481 5525 
5535 5740 5757 5788 5907 5924 6125 6147 6257 6316-6322 
6324 6607 6886 6952 7058 7141 7288 7445 7484 7534-7540 
7677 7710-7711 7713-7717 7735 7792 7815 8087 8110 8120 
8262 8368 8370 8375 8535 8577 8648 8655 8702 8729 8838- 
8839 8848 8898-8906 8936 8939 8966 9076 9087 9244 9264 
9304 9455 9466 9485 9542 9567 9703 9758 9822-9825 9908- 
9910 9928 10462 10475 10478 10482 10551 10583 10679 
10772 10778 10795 10955 10974 11108 11113 11116 11118- 
1 1 120 1 1 1 32 1 1 173 1 1239 1 1425 1 1 606 1 1 620 1 1629 1 1 695 
13807 11862 11918-11919 12000 12006 12017 12033 12057- 
12061 12375 12405 12424 12570 12577 12599 12601 12605 
12609 12668 12692 12730-12731 12748 12904 12917 12923 
12929 12935 12940 12956 12971 13047 13051 13073 13090 
13169 13222-13225 13249 13394 13442 13456 13479 13494 
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Library 
Name 


SEQ1DNOS: 


1 






13497 13502-13503 13506 13512 13515 13533 13549 13555 
13575 13613 13616 13627 13644-13646 13660 13866 13875 . 
13882 13889 


Genomic 
DNA from 
BAC 63118 


Research 

Genetics 

(CITB 

BAC 

Library^ 


BAC001 


8711 


adult bladder 


Inyitrogen 

• 


BLD001 

■ 


731 1710 1779-1791 2076 2367 2479-2481 4067 4208-4214 
4681 4767 4775 5163 5169 5553 5560 5581 56745684-5688 
5794 6082 6244 6716 6718-6721 6799 6931 7045 7094-7097 
7967 8110 8226 8249-8258 8364 8648 8737 9012 9097 9425- 
9428 9430 9626 9703 9928 10007 10261 10557 10734-10735 
11459 11504-1150612000 12212 12512-12513 12515-12517 
12670 12735 12914 13085 13439 13512 13687 13838-13841 
13866 


bone marrow 

» 

4 


Clontech 

* 

t 


BMD001 


11 70 83 85 142 150 162-184 186-198 200-210230-243 245- 
277 279-281 306 370 373-387 389 433-435 438-440 457 483 
503-510 566 568-569 595-596 598 609 616 679-683 740 770 
843-857 859 861 864 867-876 880-884 886-890 892-893 896 
900 902-920 1 000 1 004 1 1 16-1120 1 122-1 126 1 128-1 134 1 1 84 
1211 1346 1359 1516 1552-1554 1575 1583 1671 1724 1774 
1877 1894 1927 1974 1976 1983 2012-20242031 2033-2038 
2040 2043-2044 2084-2088 21 1 1 2120-2121 2132-2133 2137- 
2138 2161 2163-2164218621892199-22002202-2203 2205- 
2210 2213-2218 2266 2291-2295 2342-2343 2414-2416 2444 
2529 2555 2566 2569 2575-2576 2591 2597 2652 2681-2709 
271 1-2716 2735-2738 2740-2744 2746-2748 2750-2756 2758- 
2759 2761-2764 2766-2768 2770 2772-2781 2783-2787 2806 
2812-2813 2816 2873 2875 2880-2881 2883-2885 2887-2897 
2899 290 1 2926 2948 2954 2958 2977 2984 3026-3029 3032 
3068-3069 3071 3081 3093 3096-3097 3139 3203-3205 3207 
3238 3257 3272 3282-3283 3286 3317 3345 3356 3404-3418 
3422-3434 3437-3438 3440 3442 3447 3449-3450 3456-3459 
3461-3464 3466-3473 3483 3497 3535 3538 3548 3557 3572 
3588 3593 3600 3604-3605 3610 3612 3625-3627 3634 3644- 
3647 3649-3651 3653-3657 3709 371 1 3722 3725 3727 3737 
3797 3804 3808 3810 3839 3899 4005-4006 401 1 4015-4019 
4023 4025 4040 4043 4058 4060 4129 4132 4192 4194 4201 
4246 4253 4277 4282 4367 4403 4466-4472 4474-4477 4479- 
4484 4486-4490 4492-4497 4509 4512-4540 4582 4595 4597- 
4606 4608-4615 4622 4642 4648 4650 4654 4667 4691-4695 
4723 4741-4742 4759 4767 4783 4794-4796 4808 4836 4928- 
4929 493 1-4934 4938 4940 4942 4944-4948 4950-4953 4955- 
4960 4962-4964 4967-4971 4973 4976-4979 4985-4987 4992- 
4998 5000 5004-5016 5030 5037 5052 5075-5076 5093 5143 
5149-5151 5163 5169 5210-5215 5217-5226 5262 5317 5357 
5503 5523 5535 5537-5540 5560 5604 5695 5740 5748 5766 
5788 5796 5801 5862 5874 5895 5906 5908 5938 5965-5967 
5969-5979 5991-5994 5996-6005 6007-6009 6027 6047 6057 
6065 6067-6068 6071-6072 6082 6086 6089-6091 61 12 6125- 
6127 6148-6152 6195 6214 6233 6257 6269-6276 6278-6280 
6282-6283 6285-6289 6292-6300 6321 6374 6393-6403 6508 
6555 6605 6607 6722 6730 6788 6815-6822 6826-6832 6836 
6859-6861 6867 6870-6871 6873-6874 6905 6915 6929 6933- 
6937 6940-6942 6944-6945 6988-6992 7051 7126 7155 7166 
7169 7199-7210 7212-7219 7226 7233-7253 7256-7264 7275 
73 1 7-73 1 8 7320 7322-7326 7347 7349 7373-7374 7379 73 82 
7446-7448 7473-7474 7491 7502 7542 7544 7569 7577 7579 
7663-7671 7674 7677-7678 7680-7684 7686 7689-76967742 
7768 7791 7815 7819-7824 7847 7864 7930 7995 8065 8085 
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Tissue 
Origin 


RNA 
Source * 


Library 
Name 


SEQ1DNOS: 


* 

• 






8088 8090-8093 8095 81 10 8 J 20 8134 8139 8224 8226 8235 
8241 8246 8262 8336 8354 8359 8363 8365 8368 8375 8421- 
8427 8429-8440 8444 8452 8456-8460 8463-8465 8467-8476 
8478-8489 8538-8539 8542-8551 8563 8566 8577 8593-8595 
8597 8603 8633 8635-8637 8648 8656-8657 8671 8677 8679 
8701 8792 8796 8817 8830 8840-8845 8847-8849 8852 8855- 
8860 8863-8868 8870 8873 8875 8879-8881 8898 8935 8944 
8963 8965 8995 9010 9012 9020-903 1 9049 9051 9066 9076 
9092 9097-9098 9128 9257 9264 9285-9288 9304 9317 9339 
9381 9399 9401 9454-9455 9466 9471-94729474-9476 9516- 
9517 9519 9529 9544 9622 9630-9643 9646 9650-9665 9689 
9705-9712 9730 9734 9740 9758 9779-9781 9826 9885-9889 
9892-9902 9928-9929 9939 9942 9972-9976 9978-9982 9999- 
10000 10068 10153 10175 10249 10275 10277 10284 10287 
10296 10319 10321 10343-10344 10346 10364-10374 10379- 
10382 10385-10394 10414 10416 10426 10429 10439 10447 . 
10452 10467-10468 10475 10483 10520 10522-10526 10528- 
10529 10531-10535 10587-10590 10595 10616 10677-10679 
10691 10750 10760 10767 10772 10778 10788 10794-10795 
10828 10831-10837 10846-10847 10849-10851 10853-10858 < 
10891 10895 10897 10899-10903 10905-10907 10931-10933 
10956-10959 10975-1097611012 11032 110461105711059 
1 1 06 1 1 1 077- 11 082 1 1 085- 1 1 086 1 1 088-1 1 098 1 1 1 08 1 1 1 1 3 
11132 11145 11177-11181 11194 11208 1 1289 11344 11401- 
11402 11513 11618 11620 11631 11673-1167511677-11690 
11695 11697-1170411706 11708-11715 117361174211762- 
1 1772 1 1774 1 1795-11796 1 1802-1 1804 1 1817-11818 1 1829 
11839-11842 11863 11920 11946 11992-12002 12004-12008 
12010 12012-12013 12015-12019 12022-12029 12033 12041 
12081 12142-1214912160 12178 12195 1220012231 12236 
12283 12305 12308 12316 12327 12363 12368 12404-12405 
12424 12426 12430 12439 12527 12546 12570 12576 12590 
12608 12610 12616 12634-12636 12641-12645 12654-12655 
12657-12659 12669-12670 12674 12721-12724 12754 12761 
12834 12840 12842 12904-12905 12910-12911 12913 12916- 
12917 12922-12923 12925-12928 12933 12938 12956 12966 
12977 12989-12990 12993 12995 13020 13033 13047 13060 

l^ftfi? Ilflfid. nnrttf llfiQS 1 1101-1 110*5 11107 111 11. 

13113 13129 13135 13137 13141 13150 13163 13173 13181 
13213-13217 13234 13276-13278 13280 13391 13394 13400 
13456 13472 13477 13490 13492 13494 13498 13501-13502 
13506-13507 13511 13516 13528 13530-13534 13538 13540 
13546 13551-13552 13554-13555 13560 13585-13586 13594 
13613 13630 13633-13639 13644-13646 13649 13659-13660 
13670 13673 13679 13713 13775 13795 13m 13872 13875 
13882-13885 13889 13891 13893 ' 


bone marrow 


Clontech 


BMD002 


51 242 442 654 1004 1134 1841-1904 1908 1927 2023 2107 
22 1 5 2342 2408 2507-2529 2576 2597 2806 2866 3286 3434 
3722 3736-3737 3817 3823 3839 4060 4246 4258-42904389 
4396 441 1 4618 4641 4828 4836 4957 4987 5030 5037 5052 
5108 5163 5526 571 1 5743-5772 5774-5804 5813 5895 5939 
6131 6236 6266 6287 6306-6307 6333 6655 6675 6728 6730 
6750-6762 6764-6769 6788 7049 7106-71 17 7137 7156 7738 

w 9 m^ J m^ mmw J w w m^ W • ^mw mw » mr w mm W mm ^ w w mm mmw W V mm mm* m^ m W mmr ^mr 

8068 8085 8134 8156 8290-8297 8299-8305 8307-8312 8323 
8368 8482 8499 8858 8863 8865 8928 8935 9076 9111 9128 
9465-9502 9626 9700 9703 9923 9928-9929 9942 10277-10280 
10282-10284 10286-10297 30396 10434 10515 10551 10645 
30675 10693 10744-10753 10772 10778 11057 11098 11108 
13132 11232 11252 11519 11536-11553 11606 11620 12033 
12039 12146 12260 12305 12387 12402 12405 12500 12554- 
12578 12594 12599 12608 12674 12754 12777 12839 12895- 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 


* 




» 


12904 12916 12923 13020-13021 13043 13057-13064 13169 
13249 13446-13454 33479 13492 13494 13513 13533 13555 . 
13659 13713 13775 13803 13863-13875 13877-13887 13889 
13891 


bone marrow 


Clontech 


BMD004 


2249 2529 3286 3494 3548 355 i 3797 3839 4025 4058 4201 
4277 4282 5052 5108 6545 6961 8262 8898 9474 10000 1 1098 
11818 13021 13893 


bone marrow 


Clontech 


BMD007 


8539 9780 9927 13021 


adult colon 

• 

i 


Invitrogen 

* 


CLN001 


319 346487 731 799 1792 1848 20502161 24492482-2483 
3431 3901 4215-4217 4940 4957 49875163 5239 5560 5689- 
5695 5865 591 1 5923 6722 6765 7098 7815 7864 7880 81 10 
8259-8262 8486 8597 8951 9484 9529 9542 9556 10376 1 1507- 
11508 11617 11869 12127 12236 12424 12518-12523 12601 
12610 32777 12976 13062 13073 13367 13440 13507 13512 
13630 13713 13843-13844 13864 13868-13869 13888 


Mixture of 16 
tissues - 
mRNAs* 


Various 
Vendors* 


CTL016 


6815 10776 12977 13064 13512 

* 


Mixture of 16 

tissues- 

mRNAs* 


Various 
Vendors* 


CTL021 


1671 6738 8432 8648 8863 8944 951 1 10769 13021 13062 
13064 


adult cervix 

* * 
* 


BioChain 

< 


CVX001 


50 67 142 158 308 332 346 475 598 654 895 1004 1086 1286 
1449 1516 1671 1698 1701 1711-1756 1758-1776 1828 1848 
1959 2134 2186 2257 2267 2343 2408 2414 2468-2474 2476- 
2478 2608 2716 3002 31363166 3191 3199 35293535 3554 
3572 3627 3722 3737 3777 3797 3839 3985 41 58-4176 4178- 
4195 4197 4199-4207 4246 4277 4391 4396 4434 4641 4667 
4759 4783 4828 4885 4940 4957 4963 4987 4998 5001 5038 
5075 5108 5163 5293-5294 5455 5481 5523 5552 5581 5646- 
5652 5654-56595661-5671 5673-5681 5687 5701 5711 5723 
5740 5788 5794 5848 5902 5908 5923-5924 5964 6020 6052 
6057 6062 6091 6106 6112 6125 6129 6181 63506371 6374 
6410 6446 6458 6504 6508 6512 6551 6598 6686-6687 6689- 
6705 6707-6715 6788 6873 6893 69176998 70087045 7078 
7084-7093 7095 7130 7141 7148 7169 7204 7507 7579 7608 
7675 7733 7768 7815 7871 7880 7893 8078 8138 8209-8215 
8217-8236 8238-8242 8244-8248 8298 8345 8370 8444 8456 
8486 8499 8535 8558 8592 8633 8635 8648 8669 8679 8742 
8853 8863 8870 8898 8921 8939 8948 9012 9061 9098 9107 
9128 9137 9153 9304 9308 9317-93 18 9355 9385-9391 9393- 
9403 9405-9406 9408-9418 9420-9422 9457 9466 9475 9510 
9539 9612 9734 9773 9927-9928 9939 9947 9960 101 10 10175 
10230-10256 10258-10259 10267 10274 10319 10329 10344 
10491 10496 10540 10616 10660 10691 10722-10732 10778 
10782 11055 11145 11217 11376 1 1462 11477-1 1489 11491- 
11503 115191158411604 11695 11853 1186911891 11980 
12006 12066 12081 12127 12160 12195 12216 12240 12266 
12308 12363 12379 12402 12405 12424 12438 12483-12494 
12496-12510 12579 12605 12610-12611 12617-12618 12643 
12653 12670 12674 12688 12691 12703 12707 12735 12740 
12754 12830 12840 12866 12870-12881 12883-12884 12905 



* The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 
10) human leukemia lymphablastic mRNA (Clontech), 11) human thymus mRNA (Clontech), 12) human lympb 
node mRNA (Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15) 
human esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain). 
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Tissue 
Origin 


RNA 
Source * 


Library 
Name 


SEQ 11> NOS: 

* 








12913-12914 12917 12923 12951 12955 12957 12965 12989- 
12990 12993 13002 13020 13035 13045-13050 13062 13064 
13072 13092 13136 13141 13174 13197 13254 13283 13307 
13391 13428 13430-13437 13442 13473 13479 13492 13494- 
13495 13497-13498 13502 13532 13554-13555 13575 13590 
13597 13613 13616 13627 13644 13679 13713 13775 13829- 
13837 13866 13868-13869 13872 13884 13888 13891 


diaphragm 


BioChain 


DIA002 


731 1346 3548 371 1 3885 4282 4654 5895 6873 8120 8931 
8936 9455 1 1 132 1 1818 12405 12609 


endothelial 
cells 

• 

* 

i 

< 

• 


Strategene 

< 

■ 


EDTOOi 

i 

■ 


21 51 67 83 332 569 598 609 762 796 1004 1024-1026 1086 
1561 1848 1928 1959 1976-1977 1983 2138 2161 2166 2257 
2282-2283 2417 2483 24902555 25692614 2926 3042 3189 
3191 3272 3300-3303 3426 3494 3503 3548 3574-3576 3605 
3627-3628 3673 3709 3720 3722 3737 3797 3839 3885 4005 
401 1 4019 4055 4133 4192 4246 4269 4282 43404354 4365 
4384 4388 4399 4405 4410-441 1 4434 4543 4641 4654 4767 
4797-4799 4802 4881-4882 4885-4886 4888 4940 4957 4964 
4972 4985 4998 5002 5017 5030 5076 5103-5104 5132-5133 
5163 5167 5183-5185 5187-5190 5380 5523 5527 5535 5541- 
5542 5544 5674 5684 5693 5724 5766 5778 5788 5794 5796 » 
5874 5895 5916 5923 5928 5938 6005 6048 6057 6068 6082 
6165 6205 6215 6240-6241 6307 6321-6322 6349-6353 6383 
6458 6595 6598 6606-6607 6765 6799 6805 6815 6860 6871 
6873 6890 6918 6968 6972 6976 6979-6980 6998 7058 7067 
7104 71 13 71 16 7137 7139 7 169 7275 7468 7613 7716 7755- 
7757 7759-7760 7797 7799-7801 7930 8077 8084 8093-8095 
8120 8139 8235 8262 8320 8323 8335-8336 8345 8354 8358 
8363-8364 8370 8372 8375 8387 8452 8592 8648 8786 8788 
8792 8813 8863 8898 8944-8955 8965 8996-9001 9051 9076 . 
9097 9128 9264 9289-9291 9304 9315 9414 9455-94569466 
9472-9473 9475 9484 9504 9517 9529 9542 9563 9570 9626 
9703 9780-9781 9843 9927 9939 9961 10000 10027 10154 
10267 10285 10321 10330-10331 10342 10344 10349 10496 
10500 10550 10556-10557 10579 10679 10772 10776 10778 
10788 10795 10802 11013 11132 11136 11146-11149 11406 
11483 11565 1158811600 11606-11607 11615 1162611807 
11818 11932 11955 12006 12034 12037 12041 12044 12078 
12092-12094 12123-12126 12150 12213 12375 12381 12387 
12405 1241 1 12424 12426 12522 12570 12576 12590 12601 
12610 12612 12615 12617-12618 12663 12670 12674 12707 
12729 12739 12749 12753-12754 12777 12830 12842 12913- 
12914 12916-12918 12929 12940 12972 12975-12977 13002 
13024 13047 13051 13054 13062 13064 13082-13084 13090 
13092 13094 13123 13126 13136 13195 13235 13263 13380 
13389 13392 13394 13400 13456 13479 13488-13489 13492 
13494-13499 13502 13506-13507 13514 13516 13546 13555 
13568 13575 13590 13592 13613 13616 13621 13630 13649 
13659-13660 13667-13668 13670-13673 13678-13679 13690- 
13692 13713 13796-13797 13838 13866-13869 13872 13882 
13884 13888 13893 


Genomic 
clones from 
the short arm 
of 

chromosome 
8 


Genomic 

DNA 

from 

Genetic 

Research 


EPM001 


150 2023 2327 2490 4109 4783 5503 5560 10267 10760 12017 
12160 12557 12582 12923 13020 13514 


Genomic 
clones from 
the short arm 
of 

chromosome . 


Genomic 

DNA 

from 

Genetic 

Research 


EPM003 


5560 12017 12146 
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Tissue 
Origin 


RNA 
Source # 


Library 
Name 


SEO ID NOS: 


8 








Genomic 
clones from 
the short arm 
of 

chromosome 
8 


Genomic 

DKA 

from 

Genetic 

Research 


EPM004 


4783 4798 5560 10817 11926 12017 12160 

I" 


esophagus 


BioChain 


ESO002 


999-1000 2449 3272 33 15 3548 3550 3634 3697 3796 4011 
4025 4058 4201 4282 5106 5163 5553 6082 6873 7739 9304 
10296 11133 11818 12033 12570 13869 


fetal brain 


Clontech 


FBR001 


51 142 1184 3664 40604109 4940 5021 52705523 5553 6112 
6805 6908 7294 8558 9457 10376 11059 11985 12006 12122 
12160 12754 13438 13507 13888 


fetal brain 


Clontech 


FBR004 


60 2704 371 1 4025 4109 4783 5001 6082 7597 9010 9504 
9949 11837 12033 12039 12363 12705 12905 13020 13503 
13512 13891 


fetal brain 


Clontech 

* 

■ 

> 


FBR006 

i 

i 


6 60 67 598 800 932 1004 1 170 1793-1794 1796-1797 1799- 
1805 1905-1914 1916-1958 1974 1976 1979 1983 2057 2129 
2 1 74 222 1 2407 2444 2449 2484-2492 2530-2554 2556-256 1 
2563 2576 2857 3064 3207 3479 3556 3673 3 709 3722 4060 ' 
4078 4157 421 8-4221 4223-4224 4277 4291-4334 4338 4355 
4364 4369 4431 4957 5001 5109 5270 5380 5553 5634 5696- 
5706 571 1 5724 5766 5788 5794 5801 5805-5832 5834-5879 
5882-5901 5936 5990 6057 6723-6732 6765 6770-6791 6797 
6805 6894 7049-7050 7100-71027105 7118-7123 7125 7127 
7169 7905 8263-8265 8267-8273 8294 8312-8333 8359 8361 
8375 8452 8633 8664 8740 8757 8884 9010 911 1 9432-9436 
9503-9516 9518-9545 9547-9551 9556 957b 9577 9780 9895 
9923-9924 9928 9942 10007 10027 10202 10263-10268 10276 
10284 10298-10310 10329 10331 10496 10542 10595 10621 
10736-10737 10755-10761 10772 10774 10795 11108 11132 
11406 11483 11509-11523 11555-11582 1 1589-11590 1 1600 

1 1 1/?9 1 11713 11790 1 1 Rft7 1 15H7 17/W)/» 17H3Q 
17009 171 n 1771R 77731 1771fi 17177 171/H 173QR 174A5 

12465 12511 12524-12530 12576-12577 12579-12601 12729 
12735. 12754 12863 12869 12889 12906-12910 12914 12954 
12973 13020-13021 13051-13052 13054 13065 13082-13083 
13427 13445 13455-13470 13488 13490 13496 13498-13501 
13507 13516 13560 13613 13630 13649 13708 13713 13769 
13831 13845-13855 13868 13872 13882 13884 13888-13894 
13896-13900 


fetal brain 


Clontech 


FBRs03 


1005 4405 5111 6337 69647742 13084 13864 13891 


fetal brain 

< * 

* 


Invitrogen 


FBT002 


51 83 142 321 430 746 932 1054 1058-1065 1493 1833 1947 
2273-2275 2299 2444 2449 2926 3479 3492 3885 4347 4354 
4391 4405 4410 4434 4530 4804 4985 4998 5075 5160-5169 
5380 5428 5466 5750 5788 5801 5895 6132 6215 6371-6374 
6458 6598 6973-6974 7067 7096 7776-7778 7780-7782 7937 
8143 8323 8361 8364 8372 8377 8452 8633 8977-8984 8986 
90 1 0 91 42 9264 9332 9457 9474 9503 051 1 95 1 7 9539 95 R7 

9827 9848 9927 9950-9953 10027 10161 10329 10430 10492 
10573-10575 11014 11160-11164 11406 11628 11742 11814 

A V m* § ^ J. V mJ 9 mf A A. \J .X i AAA VI V* AAA w T & A » w v A J. VSA# VJ A A / A. X V A 

11830 11985 12092 12112-1211412116-12117 12127 12424 
12511 12521 12570 12576 12643 12696 12735 12748-12751 
12754 12830 12835 12913 12957 12977 12990 13002 13020 
13062 13072 13083 13117 13254-13259 13377 13486 13489 
13496 13499 13507 13590 13649 13685-13688 13713 13867 
13888 13891 13893 


fetal heart 


Invitrogen 


FHR001 


1001 1004 2250 4025 6334 6765 7740 8933 8935 9457 9544 
10000 11132 12599 12609 13021 13568 13656 13866 


fetal kidney 


Clontech 


FKD001 | 142 346 364 511-517 570-572 574 598 685-690 969-970 972- 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 


* 

* 


« 

• 




981 983-987 1134 13462123 2167-21692238-2246 23422444 
2483 25 16 2555 2617 2728 2843 2876 3032 3049 3072 3206 
3208 3282-3283 3525-3531 3533-3543 3548 3591 3709 3722 
3797 3839 3878 401 5 40 1 9 4043 4246 4277 4367 4405 4696- 
4698 4725 4767 4805-4810 4940 4947 4957 4986 4998 5037 
5056 5073-5080 5082-5091 5099-5100 5108 5258 5504 5523 
5560 5923 6005 6207-6208 6225 6272 6288 6325-6332 6478 
6603 6702 6793 6815 6906 6953-6959 7045 7058 7204 7355 
7426 7449-7450 7520 7543-7546 7561 7587 7718-7732 7930 
8077 8097 8262 8375 8387 8452 8520 8638 8658 8736-8737 
8834 8863 8898 8907-8918 8922 8950 9010 9134 9257 9401 
9457 9544 9597 9760 9781 9791 9828-9830 9912 9914-991 8 
10296 10440 10484 10546-10548 10772 11108 11121-11129 
11131-11132 11170 11513 11638 11695 11923-11924 12006 
12033 12062-12070 12072 12160 12405 12522 12570 12594 
12599 12605 12626 12663 12670 12732-12733 12749 12848 
12904 12914 12940-12941 12990 13020 13083 13188 13226- 
13227 13234 13263 13277 13280 13351 13391 13394 13491 
13501 13512 13590 13644 13647-13650 13713 13782 13867- 
13868 13872 13875 


fetal kidney 


Clontech 


FKD002 


3286 5030 5037 5105 1 1 108 12033 12490 12570 13494 13866 


fetal kidney 


Invitrogen 


FKD007 


3272 3806 4025 4253 4277 4654 5 1 12 5535 5788 5801 8863 
8935 9401 9466 10553 11628 11818 13494 13646 13866 


fetal lung 


Clontech 


FLG001 


79 2367 2395 3010 3460 3885 4828 4948 4962 5001 5723 5748 
5902 5908 6186 6738 7051 7067 7677 7759 9264 9553 9700 
10007 10478 11098 12017 12383 12417 12424 12749 12917 
13020 13169 13472 13554 13644 13782 13835 


fetal \un& 


Invitrogen 


FLG003 

i 


142 319 364 629 1671 1806-1814 1816-18191877 2129 2161 

A im ^ A — ^ *J W « V*/ A V / A M W V/ A W A T A w A \J A W A ✓ A V r / mm A mm ^ +m M, V/ A 

2169 2367 2449 2493 2529 3191 3503 3610 4109 4225-4234 
4367 44344957 5108 5380 5421 5581 5707-57105712 5714 
5788 5801 6057 6733-6741 7034 7103 8274-8278 8365 8597 
8948 9264 9327 9437-9442 9444 9466 9510 9525 9530 9539 
9677 9773 9841 10007 10190 10198 10269-10271 10329 11519 
11524-11527 11927 12531-12539 12848 12890 12904 13021 
13072 13249 13445 13472 13489 13551 13575 13649 13670 
13679 13856-13857 


fetal lung 


Clontech 


FLG004 


1003-1004 2597 5110 69639924 10552 11138-11139 12080 
12990 13659 


fetal liver- . 
spleen 


Columbia 
University 


FLS001 


-2 4-14 16-22 24 26 28-31 33^6 48-49 51-61 63-68 71-91 93- 
102 104-110 112-124 126-156 158-162282-283 285-290292- 
299 301-304 307-3 12 3 14-326 328-338 340-344 346-353 355- 
365 369 390-400 402 436 441 483 557 567 575-585 595 598 
629 673 678 691-699 701-702 708 731 736 763-767 769-776 
778-786 788-791 793-794 796 925 975 1004 1015 1023 1038 
1068 1104 1134 1144 1184 1192 1216 1264 1298 1346 1482 
1493 1516 1518-1521 1551 1556 1575 1583 1594 1636 1641 
1707 1724 1774 1826-1829 1841 1858 1927 1959 1962-1965 
1967-1972 1974-1979 1981-19982000-20092011 2045-2051 
2053-2055 2057-2058 2060-2063 2065-2083 2089-2094 2100- 
2101 2161217021742184-21942215 2222 22692290 2310 
2342 24092411 2414 24442449 24582483 24902497-2498 
2510 2516 2523 2529 2555 2562 2566-2576 2578-2586 2588- 
2591 2593-2601 2604 2607-2608261 1-2612 2614-2618 2620 
2622-2642 2644 2646-2653 2655-2664 2666 2668 2670-2680 
2696 2750 2788-2793 2795-28 1 1 28 14-2826 2828-2835 2837- 
2842 2844 2846-2848 285 1-2858 2860 2862-2871 2876 2878 
2893 2900-2905 2907-2909 2926 2929 2949 2952-2953 2959- 
2960 2984 2992 3032 3058 3069 3073-3076 3078-3080 3082 
3093 3166 3 194 3196 3207 3210-3211 3213-3214 3217-3225 
3249 3257 3272 3282 3286-3287 3304-3307 3310-331 1 3314- 



114 



WO 01/64835 PCT/US01/04927 



Tissue 
Origin 


RNA 
Source *■ 


Library 
Name 


SEQIDNOS: 


* 

• 


i 




3317 3319 3321-3332 3334-3346 3356 3419 3426 3434 3446 
3450 3455 3468 3470 3479 3484 3494 3503 3535 3537 3548 
3552 3557 3572 3590 3594 3597 3604-3605 3610 3612 3625 
3627 3634 3668 3670 3709 371 1 3720 3722 3729 3737 3777 
3797 3806 3808 3810 3813 3839 3885 3926 3990 3996-3997 
4006 4009 401 1 4019-4022 4025 4040 4043 4060 4078 4095 
4109 4129 4192 4194 4201 4245-4246 4253 4261 4277 4282 
4297 4335-4338 4340-4357 4359-4361 4364 4366-4367 4370- 
4372 4374-4377 4379-4382 4384-4386 4388-4392 4395-4396 
4398-4414 4417-4424 4426-4455 4457-4458 4460-4465 4542- 
4543 4545 4547-4553 4555-4562 4565-4575 4577 4579 4581- 
4582 4585-4588 4593 4596 4607 4616-4629 4644-4645 4647 
4654 4671 4676 4687 4689 4694 4721 4726-4729 4759 4767 
4775 4783-4784 4788 4790-4791 481 1-4819 4830 4837 4845 
4862 4874 4889-4902 4930 4940 4948-4949 4957 4962-4965 
4972 4985 4998 5022 5029-5030 5037 5040 5075-5076 5092 
5108 5132 5152 5163 5167 5171-5172 5335 5380 5398 5473 
5503 5514 5516 5523 5526-5527 5535-5536 5553 5581 5598 
5604 5616 5674 5684 5691 5693 571 1 5715 5724 5726 5748 
5750 5757 5778 5788 5794 5801 5817 5865 5874 5894-5895 • 
5902-5914 5916-5919 5921-5928 5930-5949 5951-5954 5956- 
5964 5966 5979 6005 6010 6013-6015 6017-6018 6020-6022 
6024-6025 6028-6032 6034-6048 6052-6062 6064 6068 6073- 
6081 6086 6092-6094 6104 6112 6125 6135 6140 6147 6149 
6153-6157 6165 6186 6195 6209-621 1 6240-6241 6243-6248 
6250-6253 6272 6276 6287-6288 6307 6313 6338 6371 6374 
6419 6430 6446 6451 6458 6478 6496 6508 6545 6579 6595 
6598 6607 661 1 6624 6629 6642 6658 6695 6726 6728 6730 
6738 6745 6754 6765 6767 6788 6793-6797 6799-6808 6810- 
6815 6833-6843 6845-6847 6850-6856 6858 6860 6862-6865 
6870-6871 6875 6877 6891 6893-6894 6900 6907-6908 6917 
69 19-6925 6950 6979 7049 7058 7067 7077 7096 7109 71 16 
7118 7128-7131 7133-7135 7137-7141 7144-7147 7149-7150 
7152-7153 7155-7179 7181-7192 7194-71987265-7271 7273- 
7280 7282-7305 7307-7312 7316 7327-7336 7375-7376 7383 
7386 7451 7455 7468 7473 7475-7477 7479 7484 7548-7553 . 
7555-7558 7561 7608 7617-7618 7620-7631 7642 7675 7687 
7695 7716 7768 7809 781 1 7839 7842 7864 7883 7933 7977 
8069 8093 8095 8105 81 10 81 16 8139 8224 8226 8235 8241 
8262 8323 8334-8340 8342-8350 8352-8355 8357-8359 8361- 
8408 8410-841 1 8414-8419 8452 8456 8490-8498 8500-8502 
8504-8526 8529-8530 8537 8540-8541 8553-8560 8563 8566 
8568 8577 8592 861 1 8639 8648 8659-8661 8664 8668-8669 
8710 8738-8739 8741 8777 8792-8804 8830 8834 8839 8853 
8858 8863 8865 8876 8898 8926 8935 8939 8950-8951 8957 
8963 9010 9012 9076 9092 9097 91 12 91 19 9128 9257 9264 
9266 9284 9291 9301 9304 9313 9318 9352 9375 9399 9414 
9424 9445 9455 9457 9466 9473-9474 9476 9480 9484-9485 
951 1 9517 9528-9529 9536 9539 9542 9544 9552-9557 9559 
9561-9564 9566-9572 9574-9577 9579-9630 9646 9655 9666- 
9667 9669-9671 9673 9675-9697 9700-9701 9703-9704 9713- 
9715 9718-9719 9722 9726-9729 9734 9752 9758 9761 9777 
9780-9784 9813 9831 9854-9867 9869 9906 9927 9929 9939 
9942-9943 9969 9983 10000 10007 10205 10267 10284 10296 
10305 10311 10313-10316 10318-10325 10327 10329-10331 
10333-10354 10356-10363 10395-10397 10400-10403 10405- 
10410 10413 10417-10422 10427 10446-10447 10463 10470 ! 
10485 10496 10502-10507 10515 10542 10557 10565-10566 
10645 10679 10688 10691 10729 10750 10760 10762 10764- 
10769 10772-10774 10776-10782 10784 10786-10795 10797- 
10815 10817-10830 10859 10861-10869 10871-10879 10881- 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 

■ 




• 

* 


* 


10893 10908-10916 10929 10934 10943 10948 10968 10971 
10975 10977-10983 11005 11009 11011 11015-11018 1 1028 . 
11031 11054-11063 1109411108 11114 111321113611142 
11194 1120811213 1124611390-11391 11401 11406 M411 
11414 11459 11462 11483 11508 11513 11521 11534 11559- 
1 1560 1 1565 1 1573 1 1585-1 1603 1 1606-1 1610 11612-11616 
11618-11638 11640 11642-11657 11659-11667 11669-11670 
1 1672 1 1699 1 1716-1 17 1 8 1 1720-1 1722 1 1724-1 1741 1 1743- 
11758 11760-11761 11775-11787 11797 11799-11801 11818 
11830 11862 11864-11866 11869 11876 11889 11911 11925- 
11928 11958-11961 11963-11976 11979-11980 12000 12006 
12017 12019 12028 12033-12034 12041 12078 12086 12092 
12122 12127 12159-12160 12178 12216 12218 12236 12245 
12256 12260 12289 12303 12305 12308 12327 12330 12363 
12375 12382 12387 12402-12405 12424 12426 12439 12442 
12465 12470 12476 12511 12522-12523 12546 12551 12560 
12570 12576 12590 12601-12602 12604-12619 12621-12633 
12636 12639 12646-12653 12670-12671 12674 12682-12683 
12688 12693 12696 12706 12709-12715 12735 12749-12750 
12772 12790 12823 12830-12832 12835 12842 12849 12855 
12863 12904-12905 12910-12911 12913-12916 12918-12924 
12929 12931-12932 12934-12937 12939-12940 12954-12955 
12965 12973 12976-12977 12989-12990 12993 12999 13020- 
13021 13035 13051 13054-13055 13060 13062 13066-13067 
13069-13087 13089-13095 13097-13102 13114-13117 13119- 
13120 13122-13126 13134-13136 13141 13147 13156 13174 
13193 13195-13196 13198-13201 13254 13260 13264 13277 
lizoU 13295 1335 1 13306 133/7 13391 13394 13400 13409 
13412 13420 13456 13472-13477 13479-13484 13486-13492 
13494-13496 13498-13499 13501-13510 13512-13516 1351&- 
13527 13533 13541-13544 13546-13551 13553-13555 13560 
13575 13587-13589 13597 13603-13604 13613 13616 13622- 
13623 13630-13631 13644-13646 13649 13659 13670 13679 
13713 13743 13748 13769 13775 13782 13793 13803 13808 
13818-13819 13858-13860 13864 13866-13869 13872 13882 
13884 13888-13889 13891 13893 13901 


fetal liver- 
spleen 

♦ 


Columbia 
University 


FLS002 


6 16 24 30-31 63 67 81 83 89 95 103 115 1 17 126 140 142 147 
150-151 158 162211225 287 308 332 356 358 390-391438- 
439 483 551 556 641 654 694 701 708 731 788 997 1006 1012 
1047 1082 1151 1154 1178 1184 1208 1212 1356 1480 1507 
1551 1556 1623-1647 1649-1667 1669-1671 1675 1677-1698 
1828-1829 1858 1877 1885 1889 1914 1927 1947 1961 1997 
1999 20282057 2070 2092 2098 2138-213921542161 2174- 
2175 2189 2191 2257 2282 2327 2342 2363 2399 2403 2409 
2440-2441 2443-2450 2452-2459 2490 2498 2522-2523 2544 
2555 2568 2575-2576 2592 2597 2605 2619 2623 2659 2806 
2809 2812 2860 2869 2879 2903 2926 2932 2988 303 1 3066 
3075 3089 3188-31893286-3287 3319 3343-3344 3356 3426 
3535 3548 3552 3554-3555 3583 3591 3610 3625 3634 3673 
3709 371 1 3720 3722 3839 3899 3926 3985 4005-4006 401 1 
4020 4025 4055 4058 4060 4078 4091-41 16 41 18-4126 4128- 
4142 4172 4194 4201 4253 4277 4282 4347 4350 4353-4355 
4362 4367 4374 4379 4386 4391 4394 4396 4402 4426 443 1 
4435 4437-4439 4512 4553 4578 4586 4607 4622 4644 4654 
4671 4758 4767 4783 4798 4836 4845 4899 4940 4948 4962- 
4963 49S5 4991 4998 5001 5037 5108 5167 5171 5177 5198 
5237 5293-5294 5380 5400 5523 5535-5536 5581-5591 5593- 
5599 5601-5613 5615-5624 5627-5637 5653 5674 5691 5693 
571 1 5724 5726 5733 5748 5757 5772 5778 5794 5817 5874 
5894 5902 5904 5906-5907 5909 591 1 5916-5919 5923-5924 
5927 5929 5932 5938 5941 5948 5957 5959-5960 5962 5964 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ1BNOS: 


• 

• 


• 

i 

• 


i ■ 

i 

• 

- 


5979 6005 6020 6027 6037-6038 6052 6054 6057 6060-6062 
6064 6068 61 12 6140 61 95 6225 6246 6287-6288 6306-6307 . 
.6336 6354 6371 6374 6423 6430 6451 6458 6461 6469 6478 
6496 6508 6538 6545 6563 6580 6583 6595 6598 6607 6629 
6638 6641-6643 6645-6675 6709 6726 6730 6765-6766 6788 
6792-6793 6795 6797-6798 6800-6801 6808 6812 6836 6843 
6850-685 1 6855 6860 6864 6870 6875 6888 6908 6924 6950 
6958 6968 6998 7003 70187045 7049 7056 7071-70797095- 
7096 7109 71 16 71 18 7126 7133 7135 7137-7139 7155-7156 
7166-7167 7169 7175 7178 7275 7291 7294 7329 7422 7426 
7478 7608 7627 7687 7695 7716 7815 7839 7871 7893 7914 
7937 7977 8022 8095 8120 8133-8134 8148-8149 8151-8167 
8169-8182 8184 8186-8192 8194-8196 8241 8246 8276 8289 
8298 8336 8339 8343 8345 8349 8355-8356 8361 8363 8365 
8367-8368 8370 8373 8375-8378 8385 8388 8417 8496 8518 
8520 8543 8558 8561 8563 8646 8667 8710 8738-8740 8786 
8803 8813 8865 8926 8946 8948 8963 8970 9010 9049 91 19 
9128 9142 9164 9222 9264 9289 9296 9301 9317-9318 9320 
9322 9335-9341 9343-9362 9365-9368 9370 9372-9373 9375- 
9377 9399 9455 9457 9466 9472 9475 9480 9483 9495 9526 
9533 9536 9553 9556 9558 9560 9563 9567 9570 9582 9597 
9601 9630 9646 9655 9671 9683 9695 9700 9703 9715 9722 
9729 9733 9752 9758 9783 9843 9848 9855 9880 9936 9942 
9983 1000710027 1010310142 10186-1021710274 10284 
10287 10296 10313-10314 10329 10331 10349 10352 10354 
10400 10405 10430 10439 10496 10500 10507 10542 10621 
10709-10710 10712-10717 10750 10760 10776-10779 10782 
10794 10809 10816 10837 10885 10891 10895 10913 10960 
10971 10975 11057 11071 11098 11142-11143 11148 11194 
11246 11401 11406 11430-11453 1 1455-11464 1 1483 11504 
11508 11556 11560 11589 11596 11615 11618 11626 11631 
11714 11729 11830 11833 11865 11988 12000 12006 12017 
12019-12020 12041 12044 12081 12092 12178 12195 12245 
12256 12277 12297 12327 12363 12383 12402-12403 12405. 
12426-12427 12444-12455 12457-12465 12467-12473 12475- 
12476 12520 12546 12576 12599 12607 12611 12614-12615 
12617-12618 12622 12627 12630-12632 12636 12647-12648 
12650 12685 12688 12691 12693 12696 12703 12739-12740 
12743 12763 12808 12830 12834 12842 12849-12858 12860- 
12866 12885 12895 12906 12910 12913 12915 12917 12920 
12922 12929 12933 12940-12941 12954-12955 12957 12965 
12977 12990 12993 12999 13021 13039-13040 13047 13051 
13054-13055 13060 13062 13072-13073 13077 13082-13083 
13085 13092-13095 13098 13117 13123 13136 13173 13195- 
13197 13202 13249 13254 13362 13377 13391 13394 13406- 
13409 13411-13415 13417-13420 13456 13473-1347413477 
13480 13484 13486 13488-13489 13496-13498 13500 13502 
13504 13506-13507 13511-13512 13515 13521 13546 13551 
13554-13555 13560 13575 13590 13597 13613 13616-13617 
13622 13630 13659-13660 13670 13678-13679 13695-13696 
13701 13708 13713 13726 13748 13775 13795-13796 13815- 
13826 13835 13838 13859-13860 13864 13867 13869 13872 
13882-13885 13888-13889 13891 13893 


fetal liver- 
spleen 


Columbia 
University 


FLS003 


525 2269 2529 2627 2636 3552 3554-3555 3597 4201 4246 
4253 4422 4426 4434 4438 5536 5801 8935 9536 1 1 136 1 1 142 
11519 11626 11927 12028 12077-12079 12086 12305 12403 
12424 13021 13472 


fetal liver 


Invitrogen 


FLV001 


40 51 60 598 731 1023 1048-1049 1820 197621 11 2268-2269 
2449 2483 2529 2568 2576 2614 2809 3207 3220 3468 3595- 
3599 3885 4020 4346 4390 4422 4434 4543 4615 4767 4833 
5092 5152-5155 5270 5505 5526 5553 5581 5715 5724 5726 
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Origin 


RNA 
Source w 


Library 
Name 


SEQIDNOS: 








5750 5766 5788 5794 5801 5902 6125 6147 6458 6598 6624 
6645 6765 6799 6805 6862 7104 7856 8070 8156 8359 8361 
8432 8740 8928 8935 8972-8974 9012 9041 9128 9264 9445 
9474 9485 951 1 9536 9593 9597 9703 9780 9927 9943 9945- 
9946 9948 10007 10137 10275 10334 10529 10557 10566- 
10569 10776 10828 11142 1 1560 11587 1 1618 11927 12070 
12086 12105-12108 12127 12218 12236 12327 12424 12511 
12541-12542 12605 12746 12916 12977 12985-12986 13020- 
13021 13060 13064 13135 13246-13247 13472 13479 13488- 
13489 13497 13533 13554 13590 13659 13680-13682 13859- 
13860 13882 13888-13889 


fetal liver 


Clontech 


FLV002 


360 996 1 134 3226 5102 9401 


fetal liver 

* 


Clontech 


FLV004 


998 1927 2449 2627 4025 4043 4426 4438 4834 5030 5726 
6730 9474 9942 10769 1 1 142 1 1587 12590 12608 13889 


fptal miiQple 

IvUU AilUdwlw 


Tnvitropen 

HI ¥111 UgVll 

t 


FMS001 

i 


40 150 731 894 1264 1555 1613 1821-1822 1883 1959 2161 
2343 2494-2496 ?555 3^03 4235-4237 4434 4671 4694 4783 
4885 4940 4985 5037 5045 5481 5716-5718 5724 5788 5902 
5932 6624 6742-6743 6825 7054 7104 7141 7632 7800 8279- 
8281 8863 8935 9204 92649446-9449 9451 9474 951 1 9536 
9556 9722 9780-9781 10097 10273-10274 10330 10738-10739 
10750 11098 11406 11528-11530 11560 12305 12544-12546 
12590 12609 12618 12663 12670 42702 12735 12891-12892 
12916 13051 13053 13195 13441-13442 13479 13613 13782 


fetal muscle 


Invitrogen 


FMS002 


3378 4348 4434 8932 1 1 132 12033 12570 13782 


fetal skin 

■ 

■ 


Invitrogen 

■ 


FSK001 


60 142 235 319 641 683 800 1015 1050 1346 1774 1823-1825 
2044 2099 2 11 1 2161 2215 2223 2280 2367 2401 2408 2513 
2516 2614 28122871 29263207 3356 3468 34903503 3548 
3599 3720 3722 3885 4020 4032 4060 4109 4238 4240-4244 
4246 4253 4277 4340 4353 4355 4388-4389 4391 4405 4434 
4543 4562 4568 4654 4667 4671 4767 4940 4944 4947 5030 
5037 5075 5163 5198 5296 5380 5481 5514 5581 5656 5674 
5691 5719-5724 5748 5757 5801 5894 5902 5923 6057 6076 
6078 612561476215 6336 6374 64096437-64386551 6563 
6744 6783 6803 6805 6815 6873 6900 6908 6917 6985 6987 
7037 7054 7058 7067 7075 7275 7355 7431 7484 7507 7587 
7627 7642 7653 7687 7871 7977 81 10 8134 8186 8226 8235 
8251 8262 8282-8287 8298 8356 8365 8368 8452 8455-8456 
8539 8597 8648 8679 8813 8863 8898 8921 8935 8939 8948 
8951 8957 8983 9002 9041 9061 9122 9128 9257 9285 9414 
9452-9453 9466 9474 9484 9503 9517 9529 9553 9563 9671 
9677 9703 9714 9781 9841 9927 10007 10175 10296 10343 
10491 10741-10742 10778 10795 10907 10948 11239 11513 
1 153 1 1 1573 1 1606 1 1626 1 1807 1 1869 1 1952 12000 12006 
12160 12190 12218 12256 12327 12363 12371 12375 12378 
12426 12438 12465 12470 12488 12549-1255012570 12604 
12615 12617 12663 12670 12735 12749 12754 12830 12893 
12904 12910 12916-12917 12976-12977 13038 13084 13090 
13116 13249 13254 13367 13389 13391 13443-13445 13456 
13472 13479-13480 13494 13496 13499 13505 13512 13516 
13551 13554-13555 13575 13590 13613 13630-13631 13644 
13670 13713 13782 13784 13793 13803 13858 13866 13869 
13882 13891 


fetal skin 


Invitrogen 


FSK002 


1004 3544 4834 5523 9922 9942 1 1 134-1 1 135 12570 13495 
13499 13793 13884 13889 


fetal spleen 


BioChain 


FSP001 


997 5030 9466 11108 12033 12749 13590 


umbilical 
cord 


BioChain 


FUC001 


60 83 89 142 166 567 609 760 997 1302-1304 1306-1307 1309 
1671 1697 1724 1848 1917 1978 2111 2154 2161 22072215 
2315 2343 2444 2569 2576 2591 2597 2652 2866 2926 3468 
3526 3599 3602 3625 3697 3722 3799-3805 3813 3839 3885 
4025 4060 4173 4197 4246 4277 4340 4364 4387 4391 4395- 
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Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 


i 




* * 

* 


4396 4405 4410 4434 4543 4589 4667 4694 4767 4834 4865 
4940 4957 4963 4985 5030 5093 5096 5108 5163 5171 5213 
5313 5340 5342-5344 5380 5481 5521 5526 5627 5656 5674 
5714 5743 5750 5757 5801 5895 5902 5912 5938 6027 6061 
6082 6125 6186 6195 6218 6241 6313 6321 64106458 6486- 
6495 6595 6607-6608 6668 6745 6825 6860 6870 6979 7054 
7141 7260-7261 7275 7329 7355 7373 7383 7444 7579 7586 
7677 7716 7807 7930 7935-79367995 8077 8093 8226 8246 
8262 8298 8343 8345 8358 8387 8452 8535 8563 8635 8669 
871 1 8819 8858 8863 8921 8935 8943-8944 8948 8963 9001- 
9002 9012 9097 9130 9132 9134-9138 9264 9303-9304 9313 
9332 9401 9424 9466 9472 9474 9544 9597 9665 9677 9700 
9722 9734 9758 9834 9841 9848 10050 10175 10183 10267 
10277 10311 10337 10446 10470 10508 10529 10566 10691 
10772 10828 10961 11108 11132 11145 11239 11241 11273- 
11276 11293 11408 11484 11513 11596 11604 11607 11620 
1 1695 1 1823 1 1909 11927 11942 12000 12077 12160 12245 
12255-12256 12292 12305 12308 12363 12383 12404-12405 
12426 12438 1251 1 12570 12601 12605 12630 12700 12706 
12754 12875 12913 12916 12935 12957 12993 13002 13009 
13020-13021 13062 13135-13136 13141 13254 13283 13318- 
13319 13336 13394 13400 13472 13479 13481 13488-13489 
13495-13496 13499 13507 13511-13512 13516 13551 13555 
13568 13575 13590 13613 13630 13644 13713 13740-13742 
13866-13867 13882 13884 13888 


fetal brain 

* 

• 


GIBCO 

« 


HFB001 

• 


51 70 211-213 215-222 224-229 445 586-588 598 608 703-710 
712-716 i068 1148 1178 1184 1193 1308 1516 1556 1774 1778 
1957 1974 1976-1977 2010 2025-2029 2134-2135 2145 2161 
2171 2174 2220 2223 2257 2266 2283 2444 2449 2510 2555 
2563 2569 2590 2597 2614 2617 2717 2719-2723 2726 2729- 
2734 2809 2997 3002 3081 3083 3086 3109 3226-3237 3239 
3257 3272 3278 3282-3283 3286 3356 3419 3460 3479 3492 
3495 3544 3548 3557 3590 3604-3605 3625 3627 3634 3673 
3689 3697 3709 3722 3797 3810 3839 3885 4006401 1 4019 
4022 4025 4040 4054 4059 4095 4194 4201 4253 4277 4282 
4355 4364 4383 4399 4405 4410 4412 4434 4473 4480 4498- 
4504 4507-4508 4517 4543 4654 4689 4694 4730-4733 4735 
4755 4783 4820-4823 4825-4830 4845 4885 4940 4949 4957 
4962 4972 4985 5017 5019 5022 5031 5037 5108 5132 5167 
5171-5172 5262 5380 5481 5521 5523 5527 5553 5616 5656 
5711 5740 5788 5801 5894 5902 59065908 5923 5938 5979- 
5989 6057 6082 6096 6125 6147 6159 6161-6163 6165 6207 
6212-6213 6215-6222 6241 6257 6288 6336 6369 6374 6419 
6422 6512 6551 6555 6595 6598 6607 671 1 6767 6788 6823- 
6825 6860 6871 6879 6892-6894 6900 6909-6910 7058 7118- 
71 19 7169 7220 7222 7224-7225 7228-7229 7231-7233 7275 
7426 743 1 7444 7481-7483 7485-7488 7560-7567 7569 7608 
7743 7768 7792 7814 7845 7864 7930 7977 7995 8093 8095 
81 10 8140 8226 8235 8262 8345 8358 8361 8370 8372 8387 
8441-8443 8445 8447 8450-8455 8520 8535 8558 8597 8603 
8648 8663-8666 8677 8702 8742-8749 8811 8838-8839 8943 
8951 9010 9092 9134 91379257 9287 930493179455 9457 
9466 9472 9475 9484 9510 9515 9533 9553 9567 9644-9645 
9647-9649 9734 9758 9781 9785-9786 9791 9832-98379927- 
9928 9939 9942 9970 10053 10175 10275 10277 10296 10329 
10375-10378 10434 10464 10486 10496 10645 10679 10691 
10778 10782 10791 10838-10844 10928 11019-11025 11027 
11032 11055 11061 11108 11132 11145 11153 11208 11239 
11343 11483 11513 11588 11596 116041160611620-11621 
11668 11691-1169511818 11867 1186911929-11930 12006 
12033 12039 12041 12044 12047 12066 12078 12197 12218 



119 



Tissue 
Origin 


RNA 
Source * 


Library 
Name 


SEQIDNOS: 


• 






12327 12375 12405 12411 12424 12438 12521-12522 12564 
12570 12576 12582 12590 12599 12601 12604 12611 12626 
12630 12637-12640 12670 12674 12688 12694-12695 12703 
12734 12748 12754 12785 12835 12840 12913 12916-12917 

19999-19930 19915 19940 19946 19959 199R9 19991 11090 

13051-13052 1305413073 13083 13090 13108-13110 13117 
13131 13136 13148 13158 13160 13169 13184-13186 13254 

13263 13277 13295 13389 13394 13438 13456 13477 13479- 
13481 13489-1349013494-13500 13502-13503 13506 13512 
13516 13535-13536 13555 13575 13590 13592 13605-13606 
13613 13616 13627 13630 13644-13645 13660 13670 13679 
13687 13713 13793 13803 13838 13861 13866 13868-13869 
13875 1388813891 13896 


macrophage 


Invitrogen 


HMP001 


1002 2253 3548 401 1 4058 4201 4246 4282 5526 8093 8262 
11137 12039 12426 12511 12521 13888 


infant brain 

• 

• 


Columbia 
University 


IB2002 

i 

■ 

■ 

• 


6 89 142 21 1 276 307 518 589-590 598 644 717-718 720-721 
773 841 921-924 926-932 937 1023 1091 1147-1150 1289 1493 
1522-1525 1557-1564 1724 1778 1928 1947 1976 1978 2097 
2111 21382172 2218-2223 2257 2283 2401 2418-24192444 
2449 2516 2555 2563 25692575 2597 2627 28092818 2880 1 
2932 3087 3166 3226 3241-3245 3272 3435 3474 3476-3477 
3481 3483-3486 3488-3489 3492 3494-3495 3497 3538 3544 
3548 3572 3595 3597 3669 3671-3672 3709 3720 3722 3737 
3797 3817 3885 3985 3997-3999 4006 401 1 4022-4031 4040 
4095 4201 4246 4253 4277 4297 4355 4405 4410 4434 4676 
4689 4767 4775 4831-48344837 4885 4962 4983 4998 5017- 
5019 5021-503 1 5035-5037 5040 5095 5143 5164 5167 5169 
5233-5236 5380 5505 5517-5519 5523 5536 5546-5548 5691 
5695 5724 5750 5766 5788 5801 5895 6027 6048 6082 6095 
6111 61196132 6135 6147 6223 6301-6303 6305-6307 6374 
641 1 6459 6555 6563 6591-6595 6601 6606-6607 6609-661 1 
6613-6614 6624 671 1 6767 6799 6805 6909 6946 7018 7052 
7067 7118 7133 7166 7168-7169 7386 7464 7469 7473 7571- 
7573 7697-7698 7716 7814 7827-7828 7905 7934 7977 8071- 
8073 8084 8098-8104 8235 8320 8331 8351 8354-8356 8364- 
8365 8370 8372 8376 8452 8520 8572 8667-8668 8750-8752 
8813 8830 8863 8883-8885 8950 8958 8963 9040-9041 9251 
9257 9267 9293-9298 9313 9424 9454 9456-9457 9466 9510 
95 14-95 1 5 9533 9542 9556 9576-9577 9597 9626 9646 9722 
9758 9787 9903-9904 9988 10027 10138-10139 10155-10160 
10284 10296 10319 10346 10496 10536-10542 10557 10670- 
10672 10680 10682 10778 10791 10897 10971 11028-11030 
11052 11100-1110911122 11132 1 1192 1 1392 11403-11404 
11447 11533 11568 11595 11606 11626 11638 11818 11927 
11931-11933 12028 12030-12031 12033-12034 12036-12039 
12041 12047 12049 12146 12155-12157 12160 12327 12344 . 
12363 12383-12385 12402 12405-12408 12424 12439 12521- 
12522 12570 12590 12601 12611 12615-12616 12626 12639 
12684-12685 12688 12696-12697 12707 12725-12727 12729 

1974R-1974Q 19*754 197^1-19764 19&10 19904 19911 1901 A- 

12917 12923 12929 12959 12977 12990 12996 13000 13047 
13051 13054 13062 13073 13082-13084 13117 13141 13187- 
13188 13196 13218-13219 13249 13257 13277 13284-13285 

X. yl \J \M J m if V *■ m ^-^ m, mm Mh m^ A *^ mm mm m *m mm+m W M mm mm m 9 A- %J I m- m i^Uhi 

13351 13389 13394-13395 13456 13458 13479-13481 13488 
13494-13495 13499 13503 13512 13516 13530 13535 13575 
13607-13608 13613 13616 13641 13649 13659 13679 13705- 
13706 13708 13713 13793 13798-13801 13803 13861 13869 
13872 13875 13883-13884 13888 13891-13893 13896 


infant brain 


Columbia 
University 


IB2003 


6 46 746 1914 1947 1959 19942111 22202257 25162962 
3166 3226 3272 3435 3572 3885 4277 4377 4410 4833 4837 
5029-5030 5040 5147 5259 5851 6147 6423 6595 6598 661 1 
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• ■ < 



i 



Tissue 
Origin 


RNA 
Source 


Library 

• 

Name 


SEQIDNOS: 




• 


* 


7145 7169 7716 8262 8354-8355 8364 8452 8863 9264 9457 
9515 95569577 9626 96469715 9781 9928 9970 10027 10329. 
10376 1055710791 11083 11289 114061145911513 11663 
1 1933 12039 1-2405 12546 12570 12601 12616 12685 12754 
12913 12916 12954 12977 12990 13000 13056 13062 13195 
13257 13392 13456 13458 13480-13481 13488 13497 13499 
13506-13507 13511-135121351413516 13549 13616-13617 
13649 13793 13861 13884 13888 13891 13896 


infant brain 

■ 


Columbia 
University 


IBM002 


1564-1565 19762218 242031664391 5788 61476302 10329 
11108 11513 12039 12729 12830 13062 13257 13512 13617 
13803 13859 13861 


infant brain 

• 


Columbia 
University 


IBS001 


927 931 1564 2915 3166 3737 3885 5029 5044 5095 5143 5701 
5894-5895 6135 6307 7018 8452 8963 9424 9556 9626 10296 
10542 11083 1110011108 11167 12383 12611 12785 13257 
13280 13479 13495 13506 13617 13679 13713 13896 


lung, 
fibroblast 

* 


Strategene 

i 


LFB001 


67 235 552 641 746 1346 20902186 2343 2770 2812 2871 
3203 3537 3548 3634 3812 3839 4073 4253 4434 4654 4783 
4786 4821 4940 4957 4963 4985 5163 5172 5492 5523 5788 
591 1 5938 6027 6048 6057 6223 6272 6322 6599 6873 6886 

£007 77ftA 77^0 78 1 ^ 1Q11 7077 ft77/C Q7A0 JKi (MCO 

8648 8742 8791 8830 8863 8898 8944 8948 8951 9000 9076 
9097 9128 9290 9304 9414 9455 9466 9472 9476 9503 9544 
9563 9671 10000 10050 10342 10376 10434 10447 10679 
10760 10857 11055 11094 11289 11402 11607 11818 11837 
12000 12006 12178 12363 12405 12462 12582 12617 12636 
12670 12754 12913 12916-12917 12925 12940 13126 13377 
13394 13446 13456 13494 13497 13499 13502 13575 13613 
13670 13679 13691 13795 13797 13867-13868 13882 


lung tumor 

• 


Invitrogen 


LGT002 

■ 


5 1 69 88 142 158 21 1 444 598 608 722-723 733 795-797 799 
1004 1023 1135-1145 1308 1338 1346 1493 1526-1533 1535 
1566-1569 1626 1654 1816 1841 1848 1860 1976 1983 2090 
2150 2161 2173 2223 2257 2266 2296-2297 2342 2402 2449 
2483 2555 2569 2576 2591 261 1 2623 2724 2809 2868 2880 
2926 3088 3166 3272 3347 3571 3658-3666 3673 3722 3737 
3797 3839 3885 4000-4002 4006 401 1 4025 4032-40344060 
4133 4201 4246 4253 42774282 4340-4341 43844388 4391 
4399 4402 4405 4434 4543 4622 4671 4686 4767 4783 4791 
4833-4834 4836 4885 4903 4940 4957 4987 4998 5001 5017- 
5018 5030 5037 5052 5108 5163 5210 5227-5232 5422 5451 
5520-5528 5581 5627 5637 5674 5693 5713 5724 5748 5766 
5772 5788 5894-5895 5902 5907 5917 5938 5990 6005 6020 
6094 6129 6147 6171 6224-6225 6239 6241 6253 6322 6336 
6404-6410 6461 6508 6595-659% 6624 6765 6793 6798 6851 
6870 6873 6994 7003 7008 7045-7047 7053-7055 7067 7085 
7109 711671397141 71587169 71877468 75177524 7579 
7608 7676 7687 7716 7761 7825-7826 8074-8078 8105-8106 . 
8139 8235 8298 8323 8335 8345 8354 8359 8363 8365 8370 
8372 8375 8456 8563 8633 8648 8678 8741 8792 8798 8805 
8831 8863 8870 8989 9002 9010 9012 9032-9037 9039 9051 
9076 9080 9092 9128 9257 9269-9271 9273-9274 9276 9299- 
9301 9399 9414 9424 9457 9466 9472 9480 9484-9485 9517 
9533 9536 9539 9544 9567 9612 9626 9665 9677 9689 9700 
9703 9758 9773 9784 9788 9838-9841 9870 9928-9929 9942 
9970 9983-9986 10140-10143 10149 10161 10175 10275 10296 
10319 10330 10346 10349 10386 10496 10508 10561 10591- 
10593 10673-10674 10742 10774 10778 10782 10788 10802 
10836 10971 11031-11033 11064 1 1108 11132 11182-11191 
11393-11395 11406 11459 11462 11560 11565 11588 11596 
1160411606-1160711615 11618 1162911821 11823 11845 
11869 11932 11934 11961 11978-11980 12000 12006 12017 
12022 12033 12078 12080 12127 12150-12154 12178 12236 
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* 

t 



Tissue 
Origin 


RNA 
Source * 


Library 
Name 


SEQIDNOS: 






• 


12327 12363 12371 12379 12386-12393 12409 12424 12521 
12570 12590 12616-12618 12630 12637 12674 12691 12693 
12698-12699 12716 12749 12754 12760 12762 12830 12832- 
12833 12842 12904 12907 12913 12917 12923 12993 13020 
13031-13032 13035 13060 13062 13064 13072 13083 13090 
13136 13150 13193 13197 13202 13234 13277 13279 13281- 
13282 13295-13296 13381-13383 13385-1338613394 13400 
13456 13477 13479-13480 13484 13488-13489 13492 13494 
13496-13497 13499 13503 13506-13507 13511 13554 13575 
13597 13616 13630 13646 13649 13659-13660 13670 13701- 
13704 13713 13743 13782 13788-13789 13791-13792 13802- 
13803 13864 13867-13868 13872 13882-13884 13888 13892 


lymphocytes 

■ 

• 


ATCC 


LPC001 


142 316 1034 19772161 23422367 2483 2513 25292932 2962 
3272 371 1 3722 3839 4006 4277 4282 4405 4434 4836 4970 
5037 5239 5451 5706 5796 5924 5941 6048 6147 6301 6765 
6793 6805 6815 7133 7171 7275 7320 7687 7815 8033 8093 
8095 8105 81 10 8246 8262 8345 8365 8368 8370 8535 8664 
8674 8722 8834 8858 8898 8919 8963 9012 9257 9456-9457 
9484 9539 9544 9560 9758 9928 9939 10175 10201 10284 
10333 10496 10520 10529 10679 10744 10766 1 1032 1 1 108 • 
11508 11513 11519 11853 11862 12033 1216012363 12387 
12402 12424 12438 12520 12560 12570 12663 12830 12913 
12923 12968 13021 13066 13072 13085 13140 13147-13148 
13394 13479 13488 13495 13499 13507 13575 13867 13869 
13888 


leukocyte 

* 


GIBCO 


LUC001 


21 49 51 67 83 88 94 142 21 1 316 326 340 368 371 403 438 
443-455 464 485 518-523 525-530 532-535 591-592 616 724- 
726 744 800 841 889 943 1004 1346 1556 1570-1572 1826 
1914 1916 1927 1959 1976-19772095 2103-2107 2124-2127 
2134 2138 2145 2161 2163 2175 2215 2223 2237 2266 2343 
2363 2412 2421 2449 2483 2529 2555 2569 2575 2597 2617- 
2618 2695-2696 2827 2876-28772899 2903 2961 2965-2975 
3002 3035-3038 3040-3046 3109 3132 3166 3199 3203 3207 
3246-3251 3265 3272 3286-3287 33 12 3344 3356 3361 3378 
3400 3434 3451 3468 3479 3484 3494 3503 3552 3590 3610 
3627 3634 3704 3709 3720 3722 3725 3757 3797 3804 3810 
3839 3885 3985 3996 4006 4009-401 1 4019 4022-4023 4025 
4035-4037 4058 4060 4194 4253 4269 4277 4297 4341 4362 
4388 439<M391 4396 4399-44004402 4405 44104412 4431 
4434 4534 4543 4594 4615 4630 4641 4651-4663 4670 4688- 
4689 4694 4699-4706 4736 4755 4758-4759 4767 4775 4783 
4791 4798 4828 4835-4840 4845 4865 4930 4940 4947-4948 
4957 4962 4964 4972 4976 4985 4991 4998 5001 5003 5017 
5022 5030 5037 5040 5075-5076 5108 51 18 5143 5163 5171- 
5172 5313 5481 5503 5505 5521 5523 5526 5535 5549 5552 
5656 5691 5724 5726 5740 5750 5766 5772 5788 5794 5796 
5801 5865 5874 5894-5895 5906 5908 5923-5924 5928 5938 
5989 6027 6042 6057 6063-6064 6082 6094-6103 6125 6130- 
6137 6142 6147 6166-6167 6171 6181 6214 6226 6239 6253 
6301 6307 6371 6374 6418 6512 6662 6716 6730 6788 6792 
6799 6815 6836 6860 6873 6876-6877 6886-6887 6945 6975 
6979 7018 7037 7056 7058 7067 71 16 7118 7137 7155 7158 
7171 7275 7315 7384-7385 7387-7389 7391-7393 7395-7398 
7452-7463 7468 7494 7608 7671 7676 7687 7714 7733 7792 
7815 7845 7864 7870 7905 7930 8093 8107 81 10 8120 8139- 
8140 8224 8226 8262 8276 8320 8363-8365 8368 8375 8387 
8432 8452 8456 8520 8535-8536 8539 8562-8563 8577 8579 
8597 8599-8601 8603-8605 8608 8640-8646 8648 8664 8669 
8674 8677 8691 8702 8722 8753 8755 8798 8815-8816 8830 
8838-8839 8858 8863 8870 8876 8898 8943-8944 8948 8951 
9001 9010 9012 9061 9076 9092 9097 9128 9257 9304 9454- 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ1DNOS: 


1 

I 


; * 


\ 

* • ■ 


9456 9466 9472-9475 9484 9497 951 1 9515 9517 9529 9536 
9542 9556 9567 9612 9626 9702-9703 9723 9731-9738 9745 
9752 9758 9762-9770 9780-9781 9789 9791 9808 9832 9842 
9927-9928 9939 9942 9970 10000 10175 10275 10319 10329- 
10331 10333 10345-10346 10412 10428 10430 10433 10435- 
10436 10453-10454 10456 10465 10470 10475 10488 10496 
10513 10566 10645 10679 10683 10691 10774 10776 10778 
10788 10791 10795 10802 10891 10928 10930 10935-10941 
10948 10961-10962 10964-10967 11034 11055 11108 11132 
1 1 136 1 1 145 1 1239 1 1289 1 1405-1 1406 1 1459 1 1483 11519 
11588 11604 11606 11618 11620 11693 11695 11805 11809- 
1 1 8 1 6 1 1 8 1 8 1 1 830 1 1 833 1 1 848 1 1 850-1 1 856 1 1 868 1 1 927 
11932 11934-11935 11937-11939 11954 11988 12000 12006 
12009 12028 12033 12037 12039 12047 12053 12092 12097 
12118 12127 12143 12159-12160 12178 12186 12327 12336 
12363 12375 12387 12402 12404-12405 12410 12424 12438 
12442 12490 12511 12521-12522 12556 12560 12564 12570 
12590 12599 12601 12605 12615-12617 12636 12643 12653 
12660 12670 12672-12680 12693 12700-12701 12706-12707 
12715 12735 12754 12823 12895 12904 12910 12913 12916- 
12918 12923 12925 12935 12945 12947 12955-12956 12972 
12977 12993 13002 13020-13021 13051 13062 13066 13072- 

13073 13082 13085 13117 13126 13130-13131 13135 13138- 
13140 13147-13151 13161 13169 13189 13254 13295 13391 
13394 13425 13436 13442 13456 13477 13479 13488-13490 
13494-13496 13498-13503 13505-13507 13512 13530 13546 
13550-13551 13554 13557-1356413575-13581 13590 13592- 
13593 13609 13613 13616 13627 13630 13645-13646 13660 
13679 13713 13782 13803 13864 13866-1386913872 13883 
13885 13888 13893 


leukocyte 


Clontech 


LUC003 


536 539 541-543 593 728 1552 1927 1974 3089 3252-3253 
3434 3548 3709 371 1 3722 3797 401 1 4019 4040 4060 4194 
420 1 4277 4282 4622 4707 479 1 484 1 -4842 4949 4998 500 1 

5030 5345 5895 6052 6138 6227 6419 6595 6754 6765 6767, 
6788 6911-6912 7049 7139 7171 74647575-7577 8093 81 10 
8116 8365 8370 8375 8592 8648 8830 8863 8944 9466 9544 
9612 9758 9771 9773 9790 9929 10326 10346 10679 10779 
10961 11035-11036 11132 1 1401 11513 11853 11940 12160 
12256 12405 12670 12674 12681 12840 12904 12906 13020 
13051 13162 13280 13400 13488 13511 13516 13554 13670 
13679 13860 13869 13883 13889 


melanoma 
from cell line 
ATCC #CRL 
1424 


Clontech 

• 


MEL004 

• 


83 142 360447 841 1061 1346 1516 1573 1724 1959 1963 
2159 2175 2343 2367 2513 2652 2812 2876 2961 3001 3272 
3345 3604 3627 3722 3754 3839-3840 3985 3988 4040 4194 
4246 4341 4515 4568 4641 4667 4862 4940 5075 5132 5163 
5481 5492 5523 5740 5788 6008 61 12 6119 6147 6189 6195 
6272 6287 6418 6423 6508 6563 6726 6860 6870 7134 7166 
7169 7275 7294 7579 7677 7748 7831 7852 7980 8110 8116 
8226 8235 8262 8320 8345 8363 8370 8456 8563-8564 8633 
8838 9001-9002 9128 9296 9304 93 13 9484 9542 9544 9646 
9703 9758 9780-9781 9808 9999 10007 10027 10296 10346 
10470 11145112391128911401 11406115081159611606 
11620 11823 11899 11950 12019 12107 12160 12292 12329 
12363 12405 12436 12523 12599 12658 12670 12749 12754 
12774 12842 12930 13020 13296 13394 13458 13489 13498 
13501 13507 13551 13554 13575 13613 13616 13649 13660 
13743 13775 13868 


mammary 
gland 


Invitrogen 


MMG001 


51 67 142449 594 598 616 708 729-733 1004 1060 1194-1196 
1310-1315 1329 1536.1541 1826 1848 1858 1914 1947 1974 
1976-1978 2174-2175 2283 2299 23 10 2336 2343 2403-2406 
2449 2483 2555 2563 25762597 261 1 2614 26172812 2827 
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Tissue 
Origin 


RNA 
Source * 


Library 
Name 


SEQIDNOS: 


* 






2926 3191 3207 3490 3503 3673 3714-3715 3722 3737 3777 
3796 3807-3819 3885 4003-4004 4060 4330 4346 4355 4391 
4399 4434 4543 4671 4767 4791 4836-4837 4843 4940 4964 
4975 4985 4998 5001 5017 5019 5022 5052 5108 5155 5163 
5168-5169 5269 5345-5347 5466 5526-55275529 5581 5615 
5724 5772 5778 5788 5794 5894-5895 5902 5907 591 1 5917 
5923-5924 6048 6057 6064 6094 61096125 6147 6156-6157 
6215 6228 6321 6430 6458 6496-6499 6545 6579 6598-6599 
6607 67 1 1 6765 6780 6793 6797 6799 6805 6880 693 1 6979 
6987 6996 7012-7013 7048 7054 7057 7104 7141 7158 7169 
7517 7578-7581 7583 7872 7937-7940 81 10 8139 8156 8224 
8262 8292 8298 8336 8359 8363 8365 8372 8452 8619 8628 
8646 8648 8722 8758-8759 8782 8813 8815 8863 8882 8939 
8963 8983 9012 9097 91 1 1 9139-9141 9143 9157 9164 9264 
9313 9327 9347 9466 9484 951 1 9517 9544 9553 9563 9577 
9646 9700 9703 9734 9746 9780 979 1 9844-9846 9927-9928 . 
9942 10006-10007 10051-10055 10144 10146-10147 10162 
10285 10330 10346 10439 10490-10491 10496 10542 10557 
10600 10628 10645 10691 10729 10772 10796 10983 11038 
11108 11132 11216-11217 11277-11279 11396-11397 11406 ■ 
1 1 459 1 1 5 13 1 1596 1 1606-1 1607 1 1 626 1 1628 11 823 1 1830 
11833 11902 11941-11943 11979 12000 12009 12037 12047 
12078 12093 12146 12160 12181 12258-12263 12281 12327 
12371 12383 12387 12395 12424 12426 12438 12442 12484 
12511 12564 12570 12590 12594 12601 12605 12609 12615- 
12616 12618 12621 12653 12670 12691 12699 12702-12703 
12749 12771 12791-12794 12834-12835 12885 12898 12905 
12913 12916-12917 12920 12923 12955 12976-12977 13002 
13020 13035 13047 13062 13064 13066 13072-13073 13084 
13090 13092 13127 13135 13169 13190 13193 13195 13249 
13254 13296 13320-13322 13392 13456 13477 13479-13480 
13488-13489 13494 13497-13499 13505-13507 13512 13514 
13516 13546 13551 13554-13555 13590 13611-1361213631 
13649 13670 13713 13743-13745 13838 13864 13866-13867 
13869 13872 13875 13882-13885 13888 13891 


induced 
neuron cells 

i 


Strategene 

• 


NTD001 


88 1493 1552 15612034 20902510 2522 2570 25762623 3226 

"5nn "join Af\f\£. AFkZO /no*^ AIOA ACCkA A*701 AQQQ C t f\1 Clif\A 

3272 3839 4006 4058 4282 43 84 4694 4783 4888 5103-5104 

5172 5523 5541-5542 5684 5766 5788 5794 5938 6082 6195 
64 1 8 6449 6607 6960 6972 7058 7494 78 1 5 7937 8370 8929- 
oyju oyyo yuuu yuuo yizo yi o / yzoy-y^yi y*f iz. yyzo wjjli j 
10550 10729 10760 11607 11736 11818 12039 12075 12094 
12256 12363 12381 12423 12522 12707 12735 12863 12906 
12958-12959 12963 13020 13052 13054 13196 13202 13310 
13368 13378 13438 13479 13486 13506-13507 13546 13616 
13654 13659 13691 13795 13891 


retinoid acid 

inuuccQ 

neuronal cells 


Strategene 


NTO001 


1552 38393885 4282 4434 5030 93089466 11108 11132 

H(YIA lOmO MAfiS MAI A 17^70 lOGfl^ 11(\A1 


neuronal ceiis 


o u diegene 


IN 1 uuui 

• 


8R 7W* 1**1 1S£1 10A7 1111 lAQfi 7&1/1 1111 1&1*\ 1C11 

06 ivo ijjz ijoi iy**f zzzj z^yu c.ol** jz/z joiu oozd do&i 
4032 4201 4253 4297 4337 4434 4543 4622 4641 5104 5163 

**11 **A1 KARA ^7RR f\(\QA (\\ATl f\1(Y7 ££01 £££9 11** KfZAk 

8740 8996 9097 9128 9289 9308 9466 951 1 9544 10333 10779 
10963 11032 11 108 1 1 132 12034 12039 12075 12405 12424 
12570 12590 12706 12749 12830 12905 12917 12963 13035 
13073 13280 13490 13507 13613 13616 


pituitary 
gland 


Clontech 


PIT004 


1 148 5674 6068 6371 7759 8093 10430 12605 12636 12658 
12916 13021 13082 13875 


placenta 


Clontech 


PLA003 


3885 4025 5030 5536 5711 5923 8349 8367 9921 12033 12076 
12403 12511 13655 13885 


prostate 


Clontech 


PRT001 


67 83 142 225 235 379 486 572 616 1286 1316-1329 1538 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 




> 
• 


* 


2023 2139 2147 2161 2266 2290 2337-2344 2346 2367 2422 
24442511 2631 28093001-3002 308434193563 3625 3627 
3797 3810 3821-3827 3829-3835 4019 4022 4173 4282 4332 
443446674755 4791 48624865 4998 5001 50665171 5349 
5352-5356 5358-5361 5363-5364 5481 5581 5656 5674 5723 
5740 5902 6027 6047 6125 6321 6371 6374 6427 6458 6463 
6500-6507 6509 6563 6598 6643 6793 6803 6871 7014-7015 
7058 7104 7275 7320 7608 7635 7733 7842 7852 7864 7941- 
7943 7946-7951 7953-7954 8077 8087 8093 81 10 8224 8226 
8452 8487 8520 8558 8635 8677 8863 8950 8963 8983 8997 

9646 9703 9780 10000 10027 10056-10060 10491 10629 10691 
10777 10891 11145 11200 11239 11281 11283-11295 11344 
11406 11761 11837 11862 12006 12166 12264-12278 12305 
12363 12368 12411-12412 12438 12467 12685 12691 12729 
12734 12795-12798 12800 12830 12863 12904 13010-13011 
13104 13146 13295 13323-13326 13377 13394 13456 13473 
13477 13489 13516 13533 13550 13611 13678-13679 13746 

13oOO-13o67 13554 13o8y 


rectum 


Invitrogen 

f 


REC001 


6 67 142 683 731 997 1 178 1909 1959 2005 2023 2596 261 1 
2614 2809 2926 3314 3333 3455 3722 3780 38704173 4355 
4837 4949 4987 5526 5580 5615 5674 5691 5724 5788 5895 
5909 5924 6057 61 12 6195 6765 6805 7815 7833 8095 8664 
8863 9517 9539 9544 9884 9927 10400 10666 11401 11513 
11606 11985 12160 12327 12428 12693 12848 12910 12977 
13051 13064 13072 13420 13494 13497 13507 13512 13515- 
13516 13866 13oo9 13oo4 13500 


salivary gland 

- 


Clontech 

* 


OAT AA1 

SAL001 

• 


67 73 1 800 997 IU54 }y 14 2207 23yj 2o2y 3 136 3595 3627 

/1ft 1 1 A 1ft*7 A1A6. A 1 1 ft AA1A A&A 1 AQZH AQS1 Cft/I ft €ftC/7 ciri 
4U 1 1 HIV2 4Z40 433U 4434 404 1 4iO / 4Vo / DU4U JVjZ D 103 

^4.^1 <L4R1 ^7C\A <.77* 57RR ^RO^ fD 1Q£/V?1 fALCM AQAfi fiQ75 
7045 7733 Rl 10 R37? K5ft"3 R#V?5 RR3ft £951 QftflO Qfllfl 

9051 9313 9472 9475 9671 9724 9758 9927 10027 1 1 145 
1 1695 1 1725 12017 12284 12363 12424 12427 12570 12609 
12670 12674 12693 12977 13035 13307 13554 13617 13867 
13872 13889 13891 


salivary gland 


Clontech 


SALs03 


1516 1724 1858 5030 6186 13657 13864 


skin fibroblast 


ATCC 


SFB001 


2251-2252 5788 6068 12511 


skin fibroblast 


ATCC 


SFB002 


6068 8951 12511 


skin fibroblast 


ATCC 


SFB003 


4025 5895 7741 


small 
intestine 

• 


Clontech 

• 


SIN001 


142 319 627 654 1034 1063 1197-1198 1330-1338 1340-1359 
1575 1646 1774 1814 1978 2161 2347-2354 2409 2876 3046 
3419 3460 3605 3716-3718 3737 3797 3837-3839 3841-3843 
3845-3857 3885 3986 4060 4201 4301 4351 4385 4568 4689 
4694 5076 5163 5270-5273 5304 5326 5365 5367-5372 5374 
5503 5550 5701 5772 6064 6094 6171 6288 6427 6430-6432 
6438 6510-6522 6598 6615 6793 6815 6997-6998 7016-7018 
7054 7058 7072 7309 7450 7604 7769 781 1 7873-7876 7955 

*70<7 *70»;0 TO£7 HQ£A C17A 570°. fi1<A C/l<7 CCIA OOjCO QQCA_ 

fyjj fyjy-fyoZ /yo4 olzU ezyo 533U o4Dz oo3U ooo3 oJOU- 

8951 8966 9010 9073-9075 9119 9126 9128 9155-91669303 
9544 9560 9780 9884 9928-9929 10008-10010 10061-10068 
10097 10262 10330 10351 10601 10630-10634 10760 10983 
11061 11219 11296-11308 11310-11313 11513 11620 11693 

X X \J\J X 1 1^1/ 11 V/ X X WW X » — * * V A X mS X *J X J^U XX \J^\f X X \J *S —J 

12182-12183 12280-12287 12327 12363 12488 12707 12799- 
12801 12922 12991 13012-13014 13035 13051 13064 13297 
13307 13328-13332 13335 13382 13499 13506 13554 13560 
13575 13631 13695 13714 13747-13749 13751 13882 13884 


skeletal 
muscle 


Clontech 


SKM001 


1 104 1346 2363 2367 2495 2555 2876 2880 3555 3634 3722 
401 1 4022 4194 4201 4253 4277 4282 4434 4641 4940 4972 
4998 5343 5481 5523 5801 6005 63366873 7408 7995 81 10 
8120 8235 8262 8292 8345 8372 8576 8740 8830 8936 8951 | 
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Tissue 
. Origin 


RNA 
Source" 


Library 
Name 


SEQIDNOS: 








9303 9689 10616 10679 11132 11145 11825 11869 11927 
12387 12442 12467 12570 12663 12735 12916 13280 13503 
13872 13885 


skeletal 
muscle 


Clontech 


SKM002 


8535 


skeletal 
muscle 


Clontech 


SKMs03 


6336 6962 8936 


skeletal 
muscle 


Clontech 


SKMs04 


770 1724 3797 4277 6336 12405 13658 


spinal cord 

i * 

• 

* 


Clontech 

> 

- 


SPC001 

i 

- 


83 142 390415 598 668 708 731 11841199-1207 1360-1375 
1377-1396 1516 1574-1576 1595-1596 1849 1927 2070 2129 
2 1 6 1 23 1 1 -23 14 2345 2355-2368 2423-2424 2430 2484 2529 
2569 2576 2876 3215 3249 3272 3283 3532 3584 3627 3634 
371 1 3719-3722 3737 3839 3860-3884 401 1 4025 4038-4039 
4043 4055-4056 4173 4246 4282 4354 4375 4391 4434 4681 
4767 4781 4808 4964 4985 4998 5037 5163 5233 5274-5277 
5375-5392 5394-5402 5523 5569-5570 5581 5615 5723 5788 
5835 5902 5928 5936 6047 6078 6082 621 1 6288 6374 6433- 
6435 6512 6523-6531 6534 6595 6616 6625-6626 6788 6894 
6979 6999 7018-7026 7126 7166 7359 7473 7642 7653 7807 ' 
7814 7877-7879 7965-7968 7970 7972-7980 8105 8108-81 10 
8139 8246 8298 8345 8363 8368 8482 8603 8646 8884 8898 
8981 9010 9012 9076-9078 9098 9167-9179 9184-9189 9264 
9302 9304 93 19-9320 9455 9466 9520 9530 9544 9556 9567 
9781 9895 9901 9928 9942 9947 9969 9999 10007 10069- 
10077 10079-10085 10177 10296 10326 10346 10376 10422 
10566 10602 10635-10638 10679 10685-10686 10729 10776 
1 1 132 1 1220 1 1246 1 1314-1 1323 1 1325-11330 1 1417-1 1418 
11459 11513 11818 12000 12011 12017 12033 12039 12160 
12184-12185 12288-12292 12295-12299 12301-12305 12363 
12375 12383 12387 12402 12413 12442 12468 12527 12605 
12617 12636 12657-12658 12739-12740 12754 12772 12802- 
12809 12830 12835 12841-12842 12905 12923 12940 12976 
13003 13015 13017-13021 13051-13052 13117 13126 13136 
13260 13277 13283 13295 13336-13343 13367 13442 13456 
13473 13477 13481 13495 13497 13499-13500 13507 13516 
13659 13670 13713 13715-13716 13748 13752-13759 13803 
13869 13872 13884-13885 13888 13893 13896 


adult spleen 


Clontech 


SPLcOl 


800 1 927 4032 4834 6064 6 1 35 6 1 95 6446 6788 6873 7 1 66 
7455 8966 9929 10744 12402 12564 12590 12691 12904 12933 
13082 13500 13506-13507 13516 13575 13864 13869 13883 
13889 


stomach 


Clontech 


STO001 

■ 


21 83 142 1004 1208-1215 1217-1219 1397 1399-1405 1671 
23 1 5-23 1 6 2345 2369-2373 2375 2575-2576 2809 2846 2984 
3136 3166 3537 3610 3698 3723-3725 3839 3885-38974057- 
4059 4173 4277 4410 4480 4667 4791 4808 4940 4987 5262 
5278-5281 5283-5284 5403-5405 5407-54245481 56565674 
5796 5904 641 8 6436-6440 6535-6540 6563 6627-6629 6765 
6940 7000-7001 7027-7030 7064 7135 7509 7604 7880-7885 
7981-7990 8087 8110 8120 8143 8226 8452 8535 9010 9079- 
9081 9191 9193-91969304-9306 9313 9317 9321 9715 10007 
10011-10013 10086-10093 10178-10179 10603-10605 10640- 
10642 1 1069 1 1 167 1 1221-1 1222 1 133 1-1 1337 1 1339-1 1343 
11419 11513 11818 12186-12190 12307-12314 12327 12363 
12425-12427 12438 12617 12773-12774 12810-1281 1 12834 
13082 13103 13298-13299 13344-13349 13592 13630 13670 
13717 13760-13764 13782 13888 


thalamus 


Clontech 


THA002 


579 598 616 1065 1148 1220-1221 1223-1226 1407-1432 1597 
2266 2317-2319 2340 2342 2376-2378 2380 2431 2444 2555 
3093 3230 3286 3537 3722 3726-3732 3737 3898-3902 3904- 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 


« 

i 


■ 

• 


1 


391 8 3920-3922 4060-4062 4173 4201 4282 4360 4391 5270 
5285-5288 5425-5438 5440-5449 5701 5902 5938 6137 6437 . 
6458 6461 6541-€544 6546-6550 6630 6755 7031 7095 71 1? 
7166 7484 7579 7815 7849 7905 7977 7991-8000 8002-8005 
8126-8128 8134 8363 8558 8664 8786 8890 8930 8963 9082- 
9085 9128 9197-9199 9201-92169251 9264 93089503 9515 
9556 9646 9703 9928 10014-10017 10027 10094-10100 10102 
10267 10496 10606-10608 10643-10650 10696 10891 11145 
. 1 1223-1 1225 1 1344-1 1355 1 1406-1 1407 1 1420 1 1513 1 1604 
11695 11791 11950 12022 12107 12191-1219912315-12331 
12363 12375 12405 12428-12430 12688 12706 12735 12748 
12754 12812-12813 12815 12835 12914 12959 13020 13023- 
13025 13060 13073 13300 13351-13358 13402-13403 13489 
13496 13506 13512 13719-13720 13765-13768 13770 13872 
13883 


thymus 


Clontech 


THM001 


51 142 150 332 346 360 438 546 731 760 895 1004 1104 1227- 
1234 1264 1391 1516 1577-1585 1812 1860 187721292174 
2215 2263 2321-2322 2408 2414 2425 2449 2490 2555 2569 
2575 261 1 2847 2880 3435 3530 3722 3727 3733 3735-3742 
3839 3870 4006 4041 4043-4046 4060 4109 4375 4396 4399 
4434 4667 4671 4759 4791 4885 4976 4987 5108 5289-5290 
5466 5481 5526 5553-5554 5796 5956 5979 6020 6186 6253 
6336 6371 641 1 6438 6441-6447 6617-6621 6765 6788 6797 . 
6870 6886 6908 6972 6994 7059-7060 7126 7141 7166 7168 
7310 7383 7450 7494 7632 7716 7779 7887 7889-7891 8087 
81 1 1-81 14 81 17-8120 8139 8226 8343 8368 8370 8452 8456 
8633 8830 8898 8921 8963 9010 9076 9086-9087 9128 9134 
9264 9304 9307-9313 9401 9466 9497 9544 9563 9582 9612 
9626 9646 9758 9781 9927 9937 9969 10018 10065 10163- 
10166 10175 10284 10329 10414 10557 10609-10610 10617 
10679 10687-10691 10742 10760 10772 10776 10778 10881 
10891 11187 11194 11226-11229 11274 11406 11408-11410 
11412 11459 11513 11676 11695 11830 11865 11942 12000 
12006 12016 12022 12092 12160 12186 12200-12204 12327 
12363 12414-12417 12427 12462 12470 12490 12564 12693 
12706 12735 12740 12835 12840 12898 12905 12910 12913- 
12914 12916 12977 13020 13036 13051 13062 13111 13126 
13141 13295 13301-13305 13326 13351 13391 13396-13397 
13456 13484 13498 13505 13507 13512 13516 13546 13713 
13803 13805 13808 13866 13869 13885 13888 


thymus 

* 


Clontech 

• 

9 


THMc02 


16 27 51 67 142 390 598 1233 1493 1508 1586-1588 1598-1603 
1724 1841 1918 1927 1959 1976 1979 1989 205721072161 
2223 2290 2407 2426-2428 2432-2433 2444 2487 2569 26 1 7 
3537 3664 3711 3720 3722 3737 3780 3817 4021 4025 4047- 
4049 4060 4063-4068 4246 4277 4377 4405 4688 4694 4759 
4791 4837 4949 4957 4992 4998 5037 5052 5108 5505 5553 
5555-5564 557 1-5572 5683 5766 5772 5788 5796 5894 59 1 1 
6048 6186 6225 6288 6306 6333 6621 6728 6730 6765 6767 
6780 6788 6815 6867 6873 6979 7049 7061-7062 7166 7169 
7676 7687 7809 8084 8121 8129-8130 8179 8262 8354 8363 
8365 8375 8482 8597 8654 8740 8786 8791 8963 9076 9157 
9257 9264 93 14 9323-9324 9401 9454 9466 95 19 9529 9536 
9637 9700 9703 9927-9929 9940 10004 10007 10070 10167 
10169-10172 10180 10182 10267 10284 10326 10331 10383 
10439 10452 10542 10605 10691 10693-10694 10697 10744 
10775 10778 10795 1 1046 11098 11274 11413-11415 11462 
11519 11576 11592 1 1606 11618 11621 11627-11628 11693 
11807 11814 12022 12034 12044 12080-12081 12086 12160 
12236 12256 12327 12335 12363 12368 12387 12418-12419 
12424 12433 12556 12560 12564 12570 12577 12594 12599 
12612 12663 12735 12754 12836-12839 12844 12905 12913 
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Tissue 
Origin 


RNA 
Source H ' 


Library 
Name 


SEQIDNOS: 








12955 12977 13020 13035 13037 13051 13062 13072 13087 
13193 13280 13295 13398 13404 13456 13488 13499-13500 
13503 13507 13516 13533 13535 13546 13611 13613 13630 
13646 13649 13659 13670 13678-13679 13713 13769 13775 
13803 13806-13808 13810-13811 13869 13872 13882-13883 
13885 13888-13889 


thyroid gland 

■ 

♦ 

• 


Clontech 

i 

■ 


THR001 

• 

» 

f 

■ 


49 1 15 142 360 641 698 800 1004 1 1 34 1 193 1233 1235 1237- 
1241 1329 1433-1471 1473-1476 1478-1479 1481 1589-1591 
1593-1594 1604 1606-1612 1614-1615 1639 1671 1906 1927 
1959 1976 19942090 2099213421502161 217421862215 
2343 2367 2381-2387 2429 2434-2437 2449 2490 2510 2529 
2555 2569 2662 2827 2901 2961 2997 3001 3109 3265 3286 
3425 3490 3503 3560 3643 3698 3722 3737 3743-3747 3797 
3839 3885 3923-3941 3943-3954 3956-3961 40094050-4053 
4060 4069-40824109 4173 4194 4253 4277 4282 4348 4354- 
4355 4376 4391 4396 4412 4434 4641 4689 4694 4755 4759 
4783 4791 4834 4957 4972 4985 4987 501 8 5037 5052 5075- 
5076 5108 5147 5163 5171 5270 5292-5294 5317 5380 5450- 
5461 5464-5481 5503 5523 5565-5568 5573-5574 5693 571 1 
5724 5757 5772 5788 5894-5895 5917 5923 5938 5959 5966 ' 
6005 6013 6027 6047-6048 6061 6064 6125 6135 6139 6189 
6215 6240 6336 6371 6374 6448-6449 6458 6508 6538 6551- 
6554 6556-6561 6563-6572 6595 6598 6607 6622-6623 6631- 
6636 6793 6803 6815 6873 6899 6955 6958 7032-7038 7040 
7043 7063 7066-7067 7079 71 16 7155 7233 7275 7455 7669 
7743 7792 7839 7845 7857 7892-7893 7910 7930 7989 8006- 
8008 8010-8033 8053 8067 8087 8092-8093 81 10 8116 8120 
8122-8124 8131-8143 8196 8226 8235 826i2 8345 8365 8368 
8370 8372 8375 8452 8520 8535 8543 8597 8646 87J 1 8760 
8816 8839 8863 8898 8930 8948 8951 8963 8966 9010 9055 
9088-9091 9142 9157 9217-9220 9222-9227 9229-9242 9244 . 
9257 9264 9304 9315-9316 9325-9330 9368 9401 9455 9466 
9484 9525 9542 9544 9563 9646 9695 9703 9780-9781 9855 
9928 9942 9947 10000 10019 10027 10103-10117 10119-10120 
10173-10175 10183-10185 10346 10376 10440 10470 10496 
1061 1 10651-10657 10666 10679 10695 10698-10704 10760 
10772 10778 10782 10788 10891 11033 11051-11052 11061 
11063 1111211231 11256 11356-11357 11359-11360 11362- 
1 1367 1 1369-1 1372 1 1376 1 1385 1 1406 114161 1421-1 1425 
11462 11513 11588 11605-11607 11620 11833 11869 11932 
11980 11985 12006 12017 12033 12143 12160 12193 12195 
12205 12207-12208 12292 12332-12336 12338-12345 12347 . 
12349-12356 12358-12360 12363 12368 12405 12420 12422 
12424 12428 12434-12439 12465 12523 12601 12605 12609- 
12611 12617 12636 12645 12670 12691 12707 12740 12749 
12754 12768 12775 12777 12817-12824 12830 12840 12845- 
12848 12863 1291 1 12940 12955 12977 13020 13026-13028 
13038 13047 13051 13062 13072-13073 13090 13104 13117 
13126 13195 13202 13254 13295-13296 13360-13371 13389 
13391 13399 13412 13479 13488-13489 13492 13494 13498 
13500 13503-13504 13506-13507 13511-13512 13516 13533 
13551 13554 13590 13613 13627 13644 13646 13656 13659- 
13660 13670 13713 13721 13743 13771-1377413782 13803 
13809 13812-13814 13860 13866-13867 13869 13872 13882 
13884-13885 13888 13891 13893 


trachea 


Clontech 


TRC001 


83 438 483 858 1006 1034 1242-1245 1359 1483-1494 1616- 
1619 1621 1671 2266 23242388-2391 2409 2438-24392444 
2575 3001 3136 3272 3425 3460 3535 3548 3748-3756 3810 
3962-3967 4083-4086 4530 4755 4758 4949 4957 4987 5075 
5213 5295-5299 5481-5482 5484 5486-5488 5490-5491 5535 
5576-5580 5656 5941 6091 6450-6453 6574-6578 6637 6797 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 


'i 




* 


6955 7041-7042 7049 7068 7204 7347 7733 7864 7894 8034- 
8043 8045 8120 8139 8144 8563 8635 8646 9012 9092-9099 
9245-9251 9317 9331 9333 9484 9503 9517 9734 10020-10022 
10027 10097 10121-10123 10125-10127 10284 10344 10478 
10612 10658-10659 10705 10812 10948 1 1 145 11233-11239 
11373-11376 11428-11429 11942 12022 12209-12211 12292 
12361-12363 12365-12366 12438 12440-12441 12611 12617 
12734-12735 12825-12826 12900 12904-12905 12916 12923 
12965 12972 12976 12989 13029-13030 13306-13308 13372- 
13373 13492 13670 13713 13722 13775-13781 13803 13884 


uterus 

i 

T 

f 


Clontech 

i 

■ 


UTR001 


1036 1134 1246-1250 1324 1493 1495-1511 1622 1671 1928 
2145 2266 23 10 2326 2343 2367 2392-2397 2555 3281 3479 
3720 3757-3760 3762 3969-3986 .4088-4090 4192 4201 4246 
4277 4390-4391 4434 4515 4998 5002 5184 5300-5301 5481 
5492-5504 5656 5695 5712 5794 5936 61 16 6475 6579-6587 
6595 6870 7002 7043-7044 731 1 7484 7895-7899 8046-8055 
8057-8059 8061 8110 8146-8147 8152 8226 8262 8622 8677 
9008 9092 9137 9252-9257 9334 9370 9414 9466 9539 9703 
10023-10024 10128-10134 10346 10595 10660-10668 10706- 
10708 11145 11239 11289 11377 11379-1 1385 11695 11S30 
12367-12369 12371-12374 12442-12443 12570 12670 12693 
12827-12829 12914 13002 13047 13073 13083-13084 13131 
13260 13277 13309-13310 13374-13376 13394 13489 13512 
13713 13782 13784-137871386613869 
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TABLE 2 

• 



SEO ID 
NO: 


Accession No. 


Sneclcs 


Deserintion 


Smith- 
Water man 
Score 


% 

Ideotit 

y 


1 


M36501 


Homo sapiens 


alpha-2-macroglobulin 


118 


69 


2 


AF1 18090 


Homo sapiens 


PRO2044 


247 


59 


3 . 


X01683 


Homo sapiens 


alpha 1 -antitrypsin 


544 


78 


4 


L27428 


Homo sapiens 


reverse transcriptase 


79 


27 


5 


M22332 


Homo sapiens 


unknown protein 


89 


40 


6 


AFO 15539 


Mytilus edulis 


precollagen P 


113 


33 


7 


X03325 


Homo sapiens 


apolipoprotein B fragment 


540 


83 


8 


AB019280 


Mus musculus 


sprouty-4 


91 


35 


9 


D88 152 


Homo sapiens 


acetyl-coenzyme A transporter 


625 


87 


10 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


124 


58 


11 


AL049569 


Homo sapiens 


(U37C10.5 (KIAA0445) 


182 


82 


12 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


85 


37 


13 


L27428 


Homo sapiens 


reverse transcriptase 


135 


61 


14 


U49973 


Homo sapiens 

» 


ORF1; MER37; putative transposase 
similar to pogo element 


266 

• 


72 


15 


U93569 


Homo sapiens 


putative pi 50 


135 


37 


16 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


67 


17 


X53581 


Rattus 
norvegicus 


ORF4 


124 


34 


18 


AF1 83961 


Homo sapiens 


carbon catabolite repression 4 protein 
homolog 


431 

i 


75 


19 


AJ002190 


Homo sapiens 


dihydroxyacetone phosphate 
acyltransferase 


551 


88 


20 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


127 


45 


21 


AK001269 


Homo sapiens 


unnamed protein product 


1643 


99 


22 

. 


U49973 


Homo sapiens 

- 


ORF1 ; MER37; putative transposase 
similar to pogo element 


275 


58 


23 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


123 


75 


24 


AF 156550 


Mus musculus 


putative E1-E2 ATPase 


168 


58 


25 


AF1 19856 


Homo sapiens 


PR01851 


585 


83 


26 


U49974 


Homo sapiens 


mariner transposase 


187 


46 


27 


G00901 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4982. 


86 


30 


28 


AF295773 


Homo sapiens 


ral guanine nucleotide dissociation 
stimulator 


126 


74 


29 


AF1 13685 


Homo sapiens 


PRO0974 


92 


73 


30 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


102 


50 


31 


Y91577 


Homo sapiens 

• 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


279 


75 


32 


AF003535 


Homo sapiens 


ORF2-like protein 


114 


47 


33 


M15386 


Homo sapiens 


gamma-globin 


370 


84 


34 


M19419 


Mus musculus 


proline-rich salivary protein 


110 


35 


35 


AF211943 


Homo sapiens 


WW domain-containing protein 
WWOX 


586 


83 


36 


X13885 

* 


Nicotiana 
tabacum 


extensin(AA 1-620) 


103 


35 


37 


U93563 


Homo sapiens 


putative pi 50 


127 


58 


38 


U93564 


Homo sapiens 


putative pi 50 


103 


77 


39 


AF069732 


Homo sapiens 


ADA2-like protein 


524 


88 


40 


X61046 


Hydra sp. 


mini-collagen 


101 


34 


41 


AK000322 


Homo sapiens 


unnamed protein product 


566 


80 


42 


G03646 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


57 



130 
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• ■ 



SEQ1D 
NO: 


t Accession No, 


Species 

* * 


Description 


Smith- 

Waterman 

Score 


% 

Ideotit 

y 








mmmm*\mm 

7727. 






43 


A CAT 1 noi 

ArU71081 


Mycobacterium 
tuberculosis. 


proline-nch mucin bomolog 

■ 


1U4 

♦ 


A 1 

41 


AA 

44 


a "cno^ ii c 


Homo sapiens 






41 


45 


Y /J3 j3 


Homo sapiens . 


H l KM clone 1 870y 14 protein 

4V 4k > • AM Aa 

sequence. 






A4L 

4o 


At 1 1 oU&z 


Homo sapiens 




1 10- 
1 \y 


A A 

44 


/IT 

4/ 


A/oyzo 


Homo sapiens 


zinc linger protein 






A Q 

4o 


*V<A"l'>_C 


Homo sapiens 


giutaminyi-tKJNA syntnetase 




51 . 


4y 


UjU04j 


Homo sapiens 






yj 


50 


M92439 


Homo sapiens 


leucine-rich protein 


344 


80 


51 


U28963 


Homo sapiens 


Gps2 


CM 




52 


U41806 


Homo sapiens 


P 60 


660 


81 


jm 4ft 

53 


AF181490 


IT • 

Homo sapiens 


prenylcysteine lyase 


461 


nit 

78 


54 


U93570 


Homo sapiens 


putative pl50 


147 


36 


55 


W73499 


Homo sapiens 


Von Willebrand factor. 


529 


76 


56 


AF1 19851 


Homo sapiens 


PR01722 


126 


57 


57 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


115 


61 


58 


AL021939 


Homo sapiens 


dJ352A20.2 (aldehyde dehydrogenase 
family protem) 


422 


^k 4k ^ 

90 


59 


L24158 


Homo sapiens 


mtegrm alpha 9 protem 


117 


71 


60 


Y32157 


Homo sapiens 


Human SH3D1A protein. 


530 


91 


61 


<■« w 4* A 4k 4k ^ 

X61296 


Rattus 
norvegicus 


* ft 4ft /k 

open reading frame 2 

i 


117 


4k « 

31 


62 


W _f 4**, 4k 4ft 4* A 

AK002064 


Homo sapiens 


unnamed protem product 


AAA 

330 


4k 4k 

80 


63 


ABO 12223 


Canis familiaris 


ORF2 


80 


56 


64 


U93570 


Homo sapiens 


putative p!50 


113 


37 


65 


U15647 


Mus musculus 


reverse transcriptase 


152 


55 


66 


G03411 


Homo sapiens 


V V « ft y>K ^k ftft^ ^fA w ^. 

Human secreted protem, SEQ ID NO: 
7492. 


* >k, 4^ 

126 


54 


^ mm 

67 


Y48359 


Homo sapiens 


Human prostate cancer-associated 
protein 56. 


590 


99 


68 


W74879 


Homo sapiens 


Human secreted protem encoded by 
gene 151 clone HTLEF62. 


4>* f-ft 

368 


98 


69 


AF 104921 


Homo sapiens 


succinyl-CoA synthetase alpha subumt 


4* 4% 4J 

604 


^k M 

93 


70 


* * ftJ»k « .ft*/ *• m\ £* 

AF 175265 


Homo sapiens 


vacuolar sortmg protein 35 


4* A O 

632 


OA 

88 


71 


U93571 


Homo sapiens 


p40 


106 


33 


72 


XI 5324 


Homo sapiens 


angiotensmogen 


330 


84 


73 


Z98204 


Hordeum 
vulgare 


extensin 


111 


38 


74 


Y30713 


Homo sapiens 


Ammo acid sequence of a human 
secreted protein. 


232 


61 


75 


AF 11 8092 


Homo sapiens 


PRO2061 


453 


79 


76 


M63175 


Homo sapiens 


autocrine motility factor receptor 


190 


85 


77 


M26361 


■k M 1 

Mus musculus 


LINE/Ig H-cham fusion protem 


153 


38 


78 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


171 


58 


79 


X78926 


Homo sapiens 


zinc finger protem 


199 


37 


80 


k. Mmmmmm* 4% * 

M77381 


V w ft 

Homo sapiens 


* 

acrosin 


98 


4J 

54 


0 1 


• 


T I AMU 4k 4* 4% M.* * M fft 

Homo sapiens 


Human secretea protem, oty jnu. 
6619. 




44 


82 


U93569 


Homo sapiens 


putative pi 50 


94 


38 


83 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


84 


AF255446 


Crypthecodinium 
cohnii 


Dip 1 -associated protein C 

* 


129 


34 


85 


R59837 


Homo sapiens 


Sequence of human microtubule- 


82 


48 
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SEQ ID 


Accession No. 


Species 1 


Description 


Smith- 


% 


NO: 




- 


Waterman . 


Identit 










Score 


y 








A 4b A« AA, — Ah -^L A. ^ _. A - m A — _ _ 

associated protem tau. 






86 


Y02749 


Homo sapiens 


Human secreted protein encoded oy 


1 ^7 
ID/ 


76 








gene 100 clone HNFlU9o. 






87 


AF116712 


Homo sapiens 


rKU273o 




CO 


A A 

88 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


77 


A A 

44 








6566. 






89 


4 7 A A n A A 


Homo sapiens 


DNA replication and repair associated 


315 


TA 

79 








protein ^UKAorj. 






90 


Ml 6961 


Homo sapiens 


alpna-2-HS-giycoprotem 




T/l 


A 1 

91 


U49973 


Homo sapiens 


UKr J , Mbiv> /, putative transposase 




DO 








similar to pogo element 






92 


AF220656 


Homo sapiens 


apoptosis-associated nuclear protem 


62 










TJTTT T-\ A 1 

PHLDA1 






93 


U65928 


Homo sapiens 


Jun activation domain bindmg protem 


loo 


T^ 
/D 


94 


U93568 


Homo sapiens 


putative pi 50 


1 AT 
102 


48 


95 


S80119 


Rattus sp. 


reverse transcriptase homolog 


130 


53 


96 


U93563 


Homo sapiens 


m A • • ATA 

putative pl50 


242 


50 


97 


Y86472 


Homo sapiens 


Human gene 52-encoded protem 


105 


46 






i 


fragment, SEQ ID NO:387. 






98 


G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


132 


62 


• 






7795. 






99 


X74045 


Equus caballus 


preproalbumin 


289 


65 


100 


AF 118090 


Homo sapiens 


PRO2044 


A ^ A 

269 


A A 

90 


101 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


198 


51 








» *l A 1 A_ 

similar to pogo element 






102 


G03133 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


88 


• C A 

53 








TO 1 A 

7214. 






103 


Y86573 


Homo sapiens 


Human gene 91 -encoded protem 




OT 








fragment, SEQ ID NO:490. 






104 


AF003535 


Homo sapiens 


ORF2-Iike protem 


1 A A 

1 14 


AT 1 

47 


105 


AF 130079 


Homo sapiens 


TVT» /"NO O 

PR02852 


133 


56 


106 


AF130089 


Homo sapiens 


PRO2550 


107 


T1 

71 


107 


M63473 


t T 

Homo sapiens 


alpha-5 type IV collagen 


131 


A C 

45 


108 


AF1 16661 


Homo sapiens 


PRO 1438 


1 12 


54 


109 


X92485 


Plasmodium 


pval 


1 A 1 

101 


41 






* 

vivax 


• 






110 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


111 

113 


OA 

oil 


111 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


1 AA 

199 


<CA 

oy 








similar to pogo element 






112 


AF 194537 


Homo sapiens 


NAG13 




44 


113 


V A A iAA 

L27428 


Homo sapiens 


reverse transcriptase 


160 


34 


114 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


1 9 T 

137 


50 


115 


U49973 


t T 

Homo sapiens 


ORF1; MER37; putative transposase 


1 A1 

191 


^T 

67 








similar to pogo element 






116 


AF130052 


Homo sapiens 


PRO0956 


1 aT<^ 

163 


AT 

47 


117 


L27428 


T T * 

Homo sapiens 


reverse transcriptase 


1 1 T 

117 


36 


1 18 


U93569 


Homo sapiens 


A A • 4 AV f\ 

putative pi 50 


104 


66 


119 


W48351 


Homo sapiens 


Human breast cancer related protein 


96 


66 








t"> on t>'*j 

BCRB2. 






120 


G00357 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


81 


57 








» A A^ ^fc 

4438. 






121 


G03438 


Homo sapiens 


Human secreted protein^ ocv^ 1JJ inu. 


/o 


- 1 
ji 








7519. 






122 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


80 








6613. 






123 


X61296 


Rattus 


open reading frame 2 


94 


36 






norvegicus 








124 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


60 








7882. 







132 
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SEQID 
NO: 


\ Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


125 


Af 11 9900 


Homo sapiens 


PR02822 


168 


68 


126 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


209 


58 


127 


L27428 


Homo sapiens 


reverse transcriptase 


102 


35 


128 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


99 


63 


129 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


1 13 


73 


130 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


170 


36 


131. 


U93572 


Homo sapiens 


putative pi 50 , 


168 


38 


132 

* 


U37263 


Homo sapiens 

♦ 


KRAB zinc finger protein; Method: 
conceptual translation supplied by 
author 


155 

• 


57 


133 


G00403 

> 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


137 


92 


134 


B08918 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:75. 


58 


61 


135 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


128 


66 


136 

• 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


102 


38 


137 

■ 


U49973 

< 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


230 


55 


138 


AK000496 


Homo sapiens 


unnamed protein product 


127 


46 


139 


X53581 


Rattus 
norvegicus 


ORF4 


136 


38 


140 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


158 


48 


141 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


162 


65 


142 


AF090930 


Homo sapiens 


PRO0478 


127 


65 


143 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


141 


58 


144 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


98 


65 


145 


AJ238588 


Sciurus vulgaris 


cytochrome c oxidase subunit III 


417 


72 


146 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


139 


76 


147 


Y36156 


Homo sapiens 


Human secreted protein #28. 


91 


40 


148 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


121 


70 


149 


Y76184 


Homo sapiens 


Human secreted protein encoded by 
gene 61. 


214 


85 


150 


Y91577 

• 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


95 


57 


151 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


126 


66 


152 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


105 


51 


153 


AF 119900 


Homo sapiens 


PR02822 


116 


62 


154 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


149 


66 


155 


AB009993 


Mus musculus 


collagen al(V) 


105 


36 


156 


U49973 


Homo sapiens 

• 


ORFl; MER37; putative transposase 
similar to pogo element 


155 


69 


157 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


348 


71 


158 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


146 


54 


159 


AF247705 


Oryctolagus 
cuniculus 


alpha 1 type X collagen 


102 


42 
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■ * B T 

I - 



SEO ID 
NO: 


» Accession No. 


Species 


Description 


Smitn- 

Waterman 

Score 


Identit 

y 


160 


R95913 


Homo, sapiens 


Neural thread protein. 


99 


56 


161 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


52 


162 


X71442 


Rattus 
norvegicus 


ORF 1; putative 


96 


47 


163 


U93570 


Homo sapiens 


putative pi 50 


118 


38 


164 


U23515 

■ 


Caenorhabditis 
elegans 


weak similarity to adenylyl cyclase- 
associated protein (CAP) and to P. 
chabaudi adami major merozoite surfae 
antigen protein (P1R:A32555). Final • 
exon overlaps gene predicted on other 
strand. 


93 

- 

* 


37 

. 


165 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


143 


66 


166 


AF130079 


Homo sapiens 


PR02852 


143 


90 


167 


L27428 


Homo sapiens 


reverse transcriptase 


200 


53 


168 


G04072 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


92 


51 


169 


R95913 


Homo sapiens 


Neural thread protein. 


116 


54 


170 

• 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


324 


85 


171 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


111 


50 


172 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


111 


69 


173 


AF130089 


Homo sapiens 


PRO2550 


126 


59 


174 


S801 19 


Rattus sp. 


reverse transcriptase homolog 


151 


46 


175 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


113 


53 


176 


U 15647 


Mus musculus 


reverse transcriptase 


104 


46 


177 


M24732 


Homo sapiens 


lam in -like protein 


112 


42 


178 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


137 


38 


179 


AF010I44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


58 


180 


AF194537 


Homo sapiens 


NAG 13 


92 


90 


181 


U93564 


Homo sapiens 


putative pi 50 


131 


53 


182 


U93574 


Homo sapiens 


putative pi 50 


86 


46 


183 


Y14166 


Gallus gallus 


attachment region binding protein 


91 


40 


184 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


104 


64 


185 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


183 


73 


186 


U93572 


Homo sapiens 


putative pi 50 


139 


64 


187 


M22332 


Homo sapiens 


unknown protein 


79 


41 


188 


Y87202 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:241. 


75 


71 


189 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


132 


37 


190 


S80119 


Rattus sp. 


reverse transcriptase homolog 


172 


43 


191 


AF 194537 


Homo sapiens 


NAG 13 


81 


75 


192 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


112 


64 


193 


X92485 


Plasmodium 

• 

vivax 


pval 


96 


40 


194 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


112 


45 


195 


L27428 


Homo sapiens 


reverse transcriptase 


141 


37 


196 


U93570 


Homo sapiens 


putative pi 50 


201 


41 


197 


X92485 


Plasmodium 
vivax 


pval 

* 


120 


48 


198 


AF130089 


Homo sapiens 


PRO2550 


137 


60 



134 



i 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 




• 




Waterman 


Identit 








• 


IJVVI c 


y 


199 


V00662 


Homo s aniens 


URF 1 fNATlH rfphv/lropenase subunirt 


377 


84 


200 


AF010144 


Homo sariens 


neuronal tVirpflH nrot^in AT^7c-NTP 


119 


52 


201 


Y86573 


Homo ^aniens 


Human opnp nrotptn 
xiuiiiaii gviic 7i*cutuucu piuicxu 


151 


68 








fraement SEO ID NO490 






202 


U49973 


Homo saniens 


ORF1 * MFR^7* nntative transnosase 


247 


78 








similar to no cm element 






203 


W48351 


Homo ^aniens 


Human hrpa^t mncpr tp fated orotein 


126 


56 














204 


Y91577 


Homo ?anien<i 


Human ^prrptpf) nrotpin Qeniifince 

XAlUliOll JvvlvlwU piULwJJJ JvUUvlivW 


362 


80 








encoded bv ffene 2 SEO ID NO^50 






205 


R59842 


Homo Qanien«i 


AooR4T.l nrotpa^p 


100 


82 


206 

XOO 


AF161356 


Homo <santpn«i 


HSPC0Q3 


78 


62 


707 
xo / 


fS00500 


nUUIU bap Id Us 


Human <prrp^f»rf nrotpfn ^PO TD TsIO* 

riuiiitui Svcrcveu piuiciiiy ocy xxj 


111 
1 1 1 


48 








4^R1 






lax 


VJ 0->XOv 


J_/Cisuillailia 


KX/a UllUgCU 


171 

IX J 


53 






/ionovnm 








OftO 

X07 




Homo sapiens 


Unman CA/*ra>A^ nmfaf'n CCH TTl XTW 

Lull an SQcreiGu protein, ocv^ jnl?. 


o*f 


RO 
















1 I49Q73 


nuiiiu sapiens 


\_/.tvr i } ivixiivj / } puLaLivv uaiispvisasc 


146 


77 






1 


simuar lu pogu clciucuu 






Xl 1 


M6Q1Q7 


H<"\mr» cnnipnc 

numu sapiens 




344 


02 

7X 


717 


AF03A61 1 


noiiio sapiens 


!Tl4*riTlci^» TQftftF. Til 7 TPI*Pl^t'ftT T^TPPHTCfVr* 

liuiiiibiL lawuji'Dix rcuepiur pxci/uxsui) 


173 


37 








ciilitltn 






213 


Y86248 


Homo canipnQ 
jiuuiv sapjciis 


Human sprreted nrotein HCHPF68 


129 


65 






SFO ID NO •163 






214 

X J *t 


V00662 

r V« OOX 


Homo ciinipnc 
ouiiiu sapiens 


vjr lUwHi UIIIC UJvlUaSC i 


485 

tOJ 


87 


715 


O07514 


OUIIIO oaplCTlo 


Y-Tiityion CAi^r^f'A/l rnvvfAtn Q17^1 TT^ XTO* 
xlUJXuUl aCl/I ClCU pii/WiJUy 0£y 1JL/ iNw. 


?74 
ixt 


ow 














216 


1135312 

W mJ<m>*J X dim 


l"lUO JUUOvUJUd 


miclpflr rp r.prstrNr piv>reT>irftR<5or 


115 
1 1 j 


47 


217 
xi / 


L26953 


nuiuu sapjcus 


r^Viromocomal nt¥it*Ptn 
wirviiivaUiiiai piuiwiii 


143 


77 


XI 0 


ni7fion 

U IX07U 


jnomo sapiens 


cyiocjironic oxiuasc suuuntt n 


774 

XX *T 


70 


210 

XI 7 


(303453 


jnuiuLi sapiens 


ri union sccrctcu piuicm, jcy iu 


17^ 

IX J 


75 








7534 






270 

XXV 


AR0181 14 


A mHiHr»ncic 
ni auiiiupsis 


1? TWf"3 fin ctpt rwr*t p tn.li ]rt> 
rSJ.li VJ IJilKd pi vlClJl"JIA.C 


1 1 1 
j j j 


38 






thalinnn 








221 

XX *• 


D38112 

vJvj AX 


Homo canipnc 

X WJill\s SOpiGLUk 


A nrPacp cnfiimit ^ 
/v x * ase aUUUUll v 




84 

o^ 


277 

XXX 


V00667 


XX\JL\1\J SOpieUS 


rvtrtrhrnmp 
&JrwUU UJJ1C O 


466 
•too 


77 


223 

XXv 


YX624R 
i ov&to 


Hnmfl ennipne 
jnuuiu sapiens 


Hum an «prrptpr1 nrotpin HPHPF/iR 


175 

X 1 J 


85 








OCA TTN Xin* 1 






224 

XX** 


AF010144 


Homo caniRnc 
* aviiio sapiens 


npnronnl thrpfl/i nrotpin A07/*— l^T'P 


125 


38 


225 


Y02999 


Homo QJinipnc 


Fraornpnt nfhiiman cp^reted nrotein 


86 


65 








priPivled ISv cene 171 






226 


X77816 


Rattiis 


PR-Vbetal 


130 

l — ' V/ 


54 






norvp{*im«i 








227 


U09500 


Homo Qanienc 


rvtoeViTome l> 


274 


62 


228 
xxo 


A FORI 104 


XVIUS JJIUoLUJUS 


ORF7 
u/xvrx 


111 
i j i 


36 






dompcticiic 








229 


AF090942 


Homo ^smipnc 
xxxjuixj oapivits 


PRO0657 


88 


57 


230 


G04091 


Hnmrt conipnc 
jiUiiic/ sapiens 


Hitman cp^rpfpd nrotpin QPO TO 1VO* 
jniuuiaii acucicu piuicui, oov^ u/ i>v. 


76 

lO 


57 








R177 
oi /x. 






231 

X J A 


G03472 


Homo Qsmienc 

j Awiiiu sap j ciia 


Human ferreted motein SFO TD NO* 


101 


66 








7553 






232 


B12310 


Homo sapiens 


Human secreted protein encoded by 


116 


54 








gene 10 clone HDPGP94. 






233 


AF0I0400 


Homo sapiens 


transaldolase-related protein 


253 


77 


234 


AF 19791 3 


HeJicoverpa 


basic DNA-binding protein BDBP 


137 


50 






arm igera nuclear 












polyhedrosis 












virus 
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SEQID 
NO: 


Accession No. 


Species 


Description 

* 


Smith- 

Waterman 

Score 


Identit 

y 


235 


AF090931 


Homo sapiens 


PRO0483 


123 


75 


236 


U49973 


Homo saoiens 


ORF1* MER37* mutative transoosase 
similar to pogo element 


185 


86 


237 


M19503 


Homo sapiens 


ORF1 * outative 


99 


40 


238 


G00368 


Homo saoiens" 


Human secreted n rote in SEO ID NO: 
4449. 


93 


58 


239 


AFO 14883 


Homo sapiens 


NADH dehydrogenase subunit 2 


•305 


65 


240 


G03102 


Homo saniens 


Human secreted nrotein SEO ID NO* 
7183. 


60 


44 


241 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment SEO ID NO*387 


79 


50 


242 


U 15647 


Mus musculus 


reverse transcriptase 


117 


47 


243 


AF010144 


Homo saoiens 

A J \J1AA \J JU|/lWUiJ 


neuronal fhreaH nrotein ADTrs-NTP 


104 


56 


244 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214 


94 


45 


245 


AK023542 


Homo sapiens 


unnamed protein product 


82 


38 


246 




melanogaster 


puiycQmu piuicin 


OA 


71 
J I 


247 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/O/U. 


129 


65 




no? st) 


nuiuo sapiens 


fiuman secrctcQ proxcin, oLy iu in w. 

ODl J. 


ISA 1 


/U 


249 


YKdRQO 

I IrtOyu 




Unman ^' PQT r**Int/»H nolvT>f>rtfrt Hf> C*170 
riULIlall J CO 1 I CicllCU pUiypcpiIUC JCy 

IDNO:1051. 






250 


Y 17832 


Human 
endn&enniiq 

retrovirus K 




?07 


71 


251 


U93568 


Homo srrniei^Q 








252 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


201 


47 


253 




iiuiuu oitpiciio 


PROfl 1 1 7 
rjwvi i / 




□V 


254 


Y02886 


Homo ennipne 


Fraompnt' nf human c^rrpfprl nrrttpin 
.rjogjjiciii i/i iiuiiiaii dvviCLvu pruiciii 

ennnrteH hv pene Qfl 


lOJ 


7? 


255 


L27428 


Homo sapiens 


reverse transcriptase 


156 


40 


256 


U49973 




similar to novo element 


1S7 


so 


257 


G00333 


Homo saoiens 


Human secreted nrotein SEO ID NO* 
4414. 


118 

X A V 


70 


258 


AF194537 


Homo saoiens 


NAG 13 


141 




259 


B01372 


Homo saniens 


Neuron-associated nrotein 


115 


71 


260 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718 

T § A VJ * 


113 


53 


261 


M22332 


Homo sapiens 


unknown protein 


78 


45 


262 


AFO 10 144 


Homo saniens 


neuronal thread nrotein AD7c-NTP 


70 


78 


263 


X61296 


Rattus 
norvefficus 


open reading frame 2 


108 


36 


264 


AFO 16099 


Mus musculus 

ITAUa lIIUOv U4UO 


enrlnmielpase/revpr^e transerinta*4e 1 


178 

I/O 


45 


265 


G03303 


Homo saoiens 


Human secreted orotein. SFO TD NO* 
7384. 


81 


63 


266 


Y01154 


Homo saniens 


Protein ^ponence Sen Td *54. frnm 

WO9901020. 


116 


84 


267 


G03786 


Homo sanienQ • 


T-Tuman spcrpfpH nrntein ^»PO TD ^JO" 

7867. 




S2 


268 


AF1 19855 


Homo sapiens 


PRO 1847 


74 


70 


269 


AF 109907 


Homo sapiens 


S164 


85 


61 


270 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


137 




271 


X92485 


Plasmodium 
vivax 


pval 


107 


72 


272 


AF1 94537 


Homo sapiens 


NAG 13 


167 


51 
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• 



NO: 


Accession No. 

i 


Species 

» 


Description 


Smith- 

Waterman 
Score 


% 

Ideobt 

v 


273 


U93564 


Homo sapiens 


p40 


104 


40 


274 

mm I ■ 


L27428 


Homo saoiens 


reverse transcriotase 


142 


56 


( 275 


G00354 


Homo saoiens 


Human secreted orotein. SEO ID NO: 
4435. 


150 


60 


276 


X61296 


Rattus 
norvesicus 


onen reading frame 2 


96 


48 


in 


AF090931 


Homo sapiens 


PRO0483 


140 


65 


278 


U49973 


Homo saniens 


ORFl * MRR37* nutative trans nosase 
similar to dopo element 


231 


66 


279 


AF130089 


Homo saoiens 


PRO2550 

* 


164 


60 


280 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
nrotein 


98 


58 


281 


L22548 


Homo sapiens 


collagen type XVIII alpha 1 


92 


38 


282 


AP000616 


OtvTfl 'sativa 


similar to RTNC5-H9 finder nrotein 

RHAla(AF078683) 


130 


68 




AF116715 




PRQ2829 




75 


284 


AF010144 

ill WIV 1 I I 






111 


53 


285 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7071 

f Of A. 


94 


53 


286 


W48351 


Homo sapiens 


Human breast cancer related protein 


120 


53 


287 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 

citnilnr tr> Tin or* f*\f*mf*T\'t 

oirniiar 10 pugu cicmcui 


147 


66 


288 


U93572 


Homo sapiens 


putative pi 50 


125 


32 




A I OCfVlQQ 


ATttDiaopSlS 

thai i ami 
Ul all alia 


puiauve prpiine**ncn protein 

* 


1 A7 






Y9948S 


PI u cm r> A ti ttn 

vivAV 
T 1 VuA 


pval 

■ 






291 


AB047600 


Macaca 

fii^fMPiilnriQ 

ICUwlwUlcU A2> 


hypothetical protein 


172 


66 


292 


G037R9 


Worn ft saniens 


Hinman swretftrf nrotein SFO ID NO* 

7870. 


108 

1 wo 


55 


293 


D38115 


Ponpo nvomapns 


NADH dehvHroo'pna'ie Quhnnit S 
iN^vLsin UMiyui VRCi moc oul/ui iil j 


342 


71 


294 


AF090942 


Homo sapiens 


PRO0657 


99 


66 


295 




"Rnc tannic 


rr 1 1 1 tt\ 1 c qi > iH_i*i/*Vi T%TT\t"<»in 
^ lLllculilU a^lU'I J Ul JDI v>ivJll 


1 14 




296 


M13100 


norvefneus 


tinlfnnwn TYrfvtotfl 

uiiivjiuvvJi piuicui 


107 


43 


297 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


114 


50 


298 


X92485 


PljUOTinrfinm 

A idJlllUUIUlU 

vivax 

▼ * t Ha 


nvn 1 
|ivcii 

1 




78 


299 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253 

9 mm ■ 1 1 1 ■ 


127 


75 


300 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


115 


42 


301 


L24521 


Homo saniens 


transfonnat ion -related nrotein 


117 


60 


302 


X83413 


Human 
herpesvirus 6 


U88 


219 


49 


303 

mJ \J 


U93567 


Homo saniens 


nutative nl "50 


130 


48 

. »w 


304 


B08918 


Homo saniens 


Human secreted nrotein seouencfi 
encoded bv eene 28 SEO LD NO:75 


72 


61 


305 


AJ242540 


Volvox carteri £ 
naeariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


153 


68 


306 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


532 


79 


307 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


84 


53 


308 


L27428 


Homo sapiens 


reverse transcriptase 


151 


72 


309 


M69297 


Homo sapiens 


ORF3 


145 


43 


310 


X92485 


Plasmodium 
vivax 


pval 


81 


60 


311 


L27428 


Homo sapiens 


reverse transcriptase 


103 


41 
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SEQ ID 

NO: 


Accession Na 


Species 


Description 

• 


Smith- 

Waterman 

Score 


% 

Identit 

v 
J 


312 


AF130079 


Homo sapiens 


PR02852 


135 


49 


313 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867 . 


no 


58 


314 


AF090928 


Homo sapiens 


PRO0470 


88 


48 


315 


G00352 


Homo saDiens 


Human secreted nrotein. SEO ID NO: 

* ******* M k^w^M WW r V*VUA| *j^*W^^ fc* A ^ « 

4433. 


175 


64 


316 


U93568 


Homo sapiens 


putative pi 50 


148 


46 


317 


API 19855 


Homo saoiens 


PRO 1847 


108 


84 


318 


P60839 


Homo saniens 


Sefluencfi of human ^emm albumin 

(USA.) on Dlasmid dXL53 


175 

X t mi 


50 


319 


W46424 


Homo sapiens 


Human macrophage stimulating protein 
(MSP} 


257 . 


69 


320 


AL049547 


Homo sapiens 


dJ34F7.2 (CREB-RP (G13)) 


247 


64 


321 


G02532 


Homo ^aniens 


Human secreted nrotein SKO TO NO* 

6613. 


110 


66 


322 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


102 


48 


323 


AF090930 




PRO0478 


141 
in 


79 


324 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 

MlllllCU LU jJlsgU CICUIGUI 


126 


44 


325 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


59 


326 




Hattia caniptic 

numu aapicii2> 


/\pujc.HJL» i protease. 




ou 




W4R151 


nomo sapiens 


numan oreasi cancer iciaieu protein 
BCRB2 




Ol 


328 


G00354 




PTnmfln «ArrpteH nmtpin 9.PO in *MO* 
nmiicui j^viciw ^liviLciiiy ody ul/ iw. 

4435 




TO 


329 


G00354 


Homo sapiens 


Human secreted protein, SEQ IDNO: 
4435 


125 

»» 


65 


330 


AF1 19855 


Homo sapiens 


PR01847 


121 


80 


331 


L78671 


Homo ^anipn<» 


f^ovlT/D-Ioon DNA fusion nrotpin 


364 


71 


332 


AK000496 


Homo sapiens ; 


unnamed protein product 


145 


41 


333 
•j -j -j 


D00570 




nnpn fpnHtno from p HQn A 
open icauuig ii oJiiv iyj\) 






334 


AF1 19855 


Homo sapiens 


PR01847 


116 


74 


335 
j j j 




HAirtA contMie 
nUlUU SaplCIlS 


O III Hall SCvlClCU pJUldll, uCy 1U IN w. 

7871. 




50 

Ju, 






i iUiiiu sapiens 


r roiein encouea Qownsircam oi nnc__ivi 
onronrofpin 


19Q 


• 


337 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7075 


107 


59 


338 


Y01158 


a X \JXXX \t SOLJlwllO 


fiprrpfprf nrntpin pnpoHprf hv optip 1 R 

clone HCACJ81. 


MS 


79 


339 


AL359782 


Trvnan o^om a 

brucei 


nos^ihle ^hhv-/»^ nil (Y? variant a Hna 

complete virion genome. 


117 

XXI 


50 


340 


AK022217 


Homo <taniens 

XX.\JXMX\J iJCilJlVilO 


unnamed nrotein nrorluct 




70 


341 


U43360 


Peromvscus 
maniculatus 

******* *^*<a**;**i*."tJ 


reverse transcrintase 


115 

X XmJ 


75 


342 


AF 11 8086 


Homo sapiens 


PR01992 


141 


73 


343 


X92485 


Plasmodium 
vivax 


nval 


96 

■ * 




344 


AF106677 


Drosoohila 
melanogaster 


d Ksati ^faction 


90 


48 

"TO 


345 


U12693 


Homo sapiens 


cytochrome oxidase subunit II 


239 


91 


346 


L27428 


Homo sapiens 


reverse transcriptase 


95 


56 


347 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


133 


69 


348 


G02532 


Homo sapiens 

4 


Human secreted protein, SEQ ID NO: 
6613. 


81 


51 


349 


X51616 


Volvox carteri 


SULFATED SURFACE 
GLYCOPROTEIN 185 


110 


41 
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SEQID 
NO: 


Accession No. 

t 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


350 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870 


158 


55 


351 


U49973 


Homo saDiens 


ORFl* MER37* nutative transDOsase 
similar to doso element 


183 


60 


352 

^ ^ -m* 

*i 


AL390U4 


Leishmania 
major 


extremely cvsteine/valine rich protein 


151 


51 


353 


R95913 


Homo saniens 


Neural thread motein. 


95 


56 


354 


AF061340 


Artibeus 
iamaicensis 


cvtochrome c: oxidase subunit 111 


346 


70 


355 


AF090895 


Homo sapiens 


PRO0I17 


126 


60 


356 

«J *J \J 


AF0 16099 


Mus musculus 

X^X**4J XXX * LAW 


endonuclease/re verse transcrintase • 


121 


48 


357 


AF1 18086 


Homo saniens 


PR01992 


159 


73 


358 


Y14482 

* 


Homo sapiens 


Fragment of human secreted protein 
encoded fev pene \ 7 


85 


89 


359 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


98 


50 




AF010144 


Hnmn sampn^ 

X I l/l 1 1 1/ JQyi C/I JO 


nfiirnnpl thrpflH nrotein AD7c-NT*P 


110 

X X w 


57 


361 


M 1 3 1 no 


Rath is 

norvptriniQ 
iivi vvgiwuj 


unkrinwr) nrntPtn 

UUJVIii/ Vrli LU ULCUI 


122 


34 


362 


Y36203 


Homo ^aniens 


Human secreted orotfiin ^75 


108 


63 


363 


G03052 


Homo <!anien<i 


Human secreted "orotein SEO ID NOi 
7133. 


83 


75 


364 




Hnmn o^ni Pn <! 


Human secreted nrotein SEO ID NO* 
7878 


212 


65 


365 


AF 130051 


Homo saniens 


PRO0898 


136 


71 


366 


AF068294 


Homo saniens 


HDCMB45P 


188 


65 


367 


Ml 0546 


Homo saniens 


cvtochrome oxidase I 


225 

i^X» 


70 


368 


S80119 


Rattus sp. 


reverse transcriptase homolog 


188 


45 


3fiQ 




Pf»rr>m v on > c 

rci uiJj jjluo 

maniculatus 




75 


48 


370 


AF1 1 8082 


Hnmn «5flnifttis 

X X\Jx 11 \J JuUlbm 


PRO1902 

1 lv\y l _/ \J J— 


98 


79 

9 ^ 


371 


AF010144 


Homo ^aniens 


npiirotial threaH nrotein AD7c-NTP 

li will vUUJ UilvUU jVl WvlU 1V&/ # w il 1A 


106 


40 


372 


AF014903 


Pan troglodytes 


NADH dehydrogenase subunit 2 


169 


41 


373 


AF010144 




Tipiimnfll thrpj»H nrntpin AD7f*»WT*P 


93 

✓ 


48 


374 


W4R35 V 




Human hTpji<it panrer related nrotein 

1 lUUldll Ul v~doi woilWvl t viaivu piuivui 
t-> v/ r\ 1 1 ■ ii » 


94 


66 


375 


G03 1 07 


Homo saniens 


Hirman secreted nrotein SEO ID NO" 
7188. 


90 


80 


376 


U93568 


Homo saniens 


nutative dI50 


140 


56 


377 


G02994 


Homo saniens 

• 


Human secreted nrotein. SEO ID NO: 
7075. 


140 


50 


378 


AF090942 


Homo sapiens 


PRO0657 


154 


66 


379 

^ 9 J 


U93568 


Homo saniens 


nutative d150 


149 


36 


380 


U93570 


Homo saniens 


p40 


184 


57 


381 


L27428 


Homo sapiens 


reverse transcriptase 


128 


60 


382 


AF194537 


Homo saniens 


NAG 13 


114 


35 


383 


AF1 16712 


Homo saniens 


PR02738 


109 


56 


384 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


405 


81 


385 


AP000616 


Orvzfi <totiva 


similar to RING-H2 finfter nrotein 

J tj|_| 1 ILTI iv XV^LX 1 V^J X I ii X>XJtJ.^Vi L/l 

RHAla (AF078683 , i 


116 


63 


386 


M22334 


Homo sapiens 


unknown protein 


124 


39 


387 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


97 


52 


388 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


327 


94 


389 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


157 


70 


390 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


129 


62 


391 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


259 


66 
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SEQO) 
NO: 


Accession No. 


Species 

* 


Description 

• 


Smith- 
Waterman 

jcorc 


Identic 

y 


392 


G00577 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4658 


137 


63 


'393 


U43360 


PeroTnv»nu«; 

maniculatus 


rpVPrcp trancr*r*ntnQP 


129 


54 


394 


AK023582 


Homo ^aniens 

l^V/JllvF .JUJJIVIU 


liTinampri nrntp'n nroHiicf 

ULLU1CU11CVJ LU VlvUi 1/1 UUUU 


148 


46 


395 


M22332 


Homo sapiens 


unknown protein 


128 


41 


3Q6 


AF1 IR0R6 




PR0 1009 


160 


7! 
/ J 


397 


00236*3 


• 


xiuiuan secreicu pruiciii, ocy 11^ :> 
6444 


101 


S9 


398 


D38112 


Homo 3anien<; 




199 


66 


399 


U49973 

* 


Homo <iarnpn*; 

* 


similar to no^o plpment 

Villi! 1CU WJ UVCV ViWUivUt 


234 

dC*mJ*+ 


78 


400 


AL3 59782 


Trvnan osom a 

brucei 


orohahle cimilnr to rimy-h2 ^fn^er 

pUUaUlb j ill lUCli W/ Illig IMi XJLUgvl 

protein rhala. 


107 


40 


401 


AF000996 


Homo ^antpn^ 


nhimiftotic TPR mAtif Y icrtfonn 


116 


61 


402 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


145 


37 




AF1 1R0R9 


ounjo sapiens 


DpAl AA-> 


07 




404 


AF202635 


Homo sapiens 


PP1200 


126 


55 


AAC 


v uuooz 


j-iomo sapiens 


cytoenrome oxidase i 




oo 


406 


AF229067 


Homo sapiens 


PADI-H protein 


129 


71 


4U7 


AioyUJ 14' 


Leishmania 
major 


extremely cysteine/valine rich protein 


197 


38 


4Uo 


L20z D 1 


Trypanosoma 
orucei 


CK5 


AC 

95 


46 






Jriomo sapiens 


OKr l; MJBK37; putative transposase 
similar to pogo element 


1 1 A 

310 


62 


A1 A 




riasmoaium 

* 

vivax 


pval 


o^ 
96 


OO 


HI 1 




Kauus 
norvegicus 


salivary proune-ncn protein 


Izo 


AO ] 


419 
*r!Z 


V1Q109 


x aipa europaca 


cycoenrome oxioase suoumr i 


*fJ 1 


R^ 


41^ 




riomo sapiens 


cytoenrome oxioase i 




OO 




YR694R 


nunio adpicns 


Unman CA<*rAfaH nmfpi'n UPUPH/tfi 

riuinan secreieo pruiein riU/Xir Jroo, 


990 


RI 
01 


415 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


296 


89 


416 


V^RA^R 

AJOHjO 


JVLU5 in US CUIUS 


proline ncn proiein 


1 1Z 


*n 

jU 




T T70Q39 


rcioiay SCUS 


reverse uonscnpuise 






418 


V00662 


Haitia canipnc 


vy LLAViir vine uajuooc hi 


90,0 


R4 
o*t 


419 


AF0 17789 




riiitaHv/p franc/^inHnn fiipfiAFOA l^fl 


190 


41 


420 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


183 


69 


421 


Al 3597R9 


Tnmanncnmo 
1 1 ypaiiusuxila 

brucei 


pU^SlUlC ^1111 V*0 J Ul 11/^, VoTUUll a Cilia, 

complete virion genome. 


16/C 


44 


42? 


AF130051 

/TJT 1 JUUJ1 


xivijuu sapiens 


PRO0RQR 


1 JO 


^0 


423 


RQ6R00 


Hatha ennipne 
rivJUU sapicua 


Wiiman Kief in/ v vt i p.cp/ , rf^('pH fiii*tAr WClt? 

ri liiz i aii iiisiiUbyxc'Sccrcicu xauiur nor - 


R6 


C9 


424 


D3R1 16 


Oar* nflflicnie 
i oLI JjamaOuS 


uyiuLiiruiiic u oxiuasc auounn j 


"549 1 


R9 
oz 


425 


U93570 


Homo sapiens 


putative pi 50 


133 


41 


496 


YR694R 


numu sapiens 


riuiudn bccrcicQ. protein rt^rtrr oo, 

^ifo Tn>jr>»i6^ 


100 


O/ 


427 


D13Q51 


tabacum 


calcijsid precursor 

* 


1 AO 


49 1 


428 


L27428 


Homo sapiens 


reverse transcriptase 


104 


34 


429 


R95913 


Homo sapiens 


Neural thread protein. 


118 


49 


430 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


206 


97 


431 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


88 


55 


432 


Y12713 


Mus musculus 


Pro-PoI-dUTPase polyprotein 


98 


54 


433 


J05042 


OryctoJagus 


alpha- 1 (VIH) collagen precursor 


91 


48 



140 



j 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitfa- 

Waterman 

Score 


% 

Identit 

y 






cuniculus 








414 




noino sapiens 


Unman Pa/vMfa /J Hmtain CUA li t "XT/"^» 

riuman secretea protein, o£>v^ in w. 


i 

1UD 


cc 
DO 


435 


U93572 


Homo sapiens 


putative pi 50 


118 


38 




U/JJO/ 


no mo sapiens 


mi tot ns a «%1 CA 

putative pi ou 


1 OA 

1 uu 


OA 

30 


437 


nnns6 


nomo sapiens 


riuman secrexea protein, iu inu. 
7437. 


1Z0 


ol 


AIR 




riomo sapiens 


jruman secrete a protein, ocv^ id nu. 
6613. 


ICC 

loo 


71 


43Q 


11^7077 


riomo sapiens 


manner transposase 


187 


CO 1 

52 


AAC\ 




riomo sapiens 


riuman secretea protein, bbQ id NU: 
7214. 


OA 

80 


A C 

45 


441 


ATTA1 HI 44 
AJrvlvl^T- 


riomo sapiens 


neuronal tnreaa protein au/c-in l r 


104 


71 


442 


AE003727 


Drosophila 
meianogaster 


CGI 67 18 gene product 


301 


48 


443 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


221 


74 


AAA 
444 


T 

Ujj /JU 


Mus musculus 


jerky 


159 


26 


445 


X53581 


Rattus 
norvegicus 


ORF3 


192 


46 


A AC 

440 


VJ0U4I6 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4497. 


142 


52 


447 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


156 


38 


448 


AF 194537 


Homo sapiens 


NAG 13 


315 


70 


AAQ 


W4o351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


93 


66 




YOOAOO 


Brugia pahangi 


collagen 


126 


44 


A<1 

451 


Ar 090930 


Homo sapiens 


PRO0478 


88 


60 


452 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


93 


40 


A^l 
40j 


A EAfi 111/1 

ArUol 1 14 


Mus musculus 

J AtVl AA^I Alt A 

aomesticus 


ORF2 


108 


32 


454 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/o/l. 


173 


65 


455 


S80119 


Rattus sp. j 


reverse transcriptase homolog 


197 


54 


450 




Homo sapiens 


Human secretea protem, SEQ ID NO: 

00 10. 


89 


68 


457 


R95913 


Homo sapiens 


Neural thread protein. 


114 


48 


*tJO 


T TAOl \C 


riomo sapiens 


OKr 1, encodes a 40 kDa product 


160 


39 


459 


X92485 


Plasmodium 

• 

vivax 


pval 


99 


52 


46ft 


VJU4Uj j 


riomo sapiens 


riuman secretea protem, 5bQ ID NO: 

ft 177 
ol IJ.. 


1 AA 

100 


52 


461 


com iQ 
ooui ly 


JtvaUUS sp. 


reverse transcriptase nomoiog 


1 JO 


/I o 

48 


462 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoaec oy gene z oty tu rssj.z du. 


211 


67 


463 


X97707 


Pongo pygmaeus 

aOclll 


stopcodon created by 
posnranscripnonai poiyaoenyiaiion 


229 




464 


AF010144 

ATvlvl'rf 


numu sapiens 


neuronal tnreaa protem ad /c-in ir 


i in 

iiy 


0 / 


46S 


I 7747R 


riomo sapiens 


tA«AA Amna API *Aj ^ il M aV 

reverse transcriptase 


1j4 


y< A 

40 


466 


AK000496 


Homo saoiens 


unnamed nrotein nrnduct 


140 


6Q 


467 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


61 


468 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


146 


69 


469 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


286 


79 


470 


D38112 


Homo sapiens 


NAJDH dehydrogenase subunit 4 


448 


86 


471 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


296 


79 



141 
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SEQ ID 
NO: 


\ Accession No. 


Species 

■ 


Description 


fcmftb- 

Waterman 

Score 


V. 

Identit 

y 


472 


Y]&6248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


102 


48 


473 


AL080253 


Arabidopsis . 
thaliana 


putative snRNP protein 


103 

• ■ | — | 7 


42 


474 


X99452 


Lycopersicon 

t 4.'. 

esculentum 


extensin-bke protein Dif54 


108 


25 


475 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


68 


34 


in/ 

476 


ABO 1 2223 


Cams famihans 


ORF2 


78 


66 


477. 


AF 13 0089 


Homo sapiens 


PRO2550 


113 


71 


478 


G03652 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7733. 


390 


97 


479 


AF210651 


Homo sapiens 


NAG 18 


146 


80 


480 


AB029309 


Homo sapiens 


Npw38-bindmg protein NpwBP 


103 


40 


481 


AF 194537 


Homo sapiens 


NAG 13 


118 


31 


482 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


115 


45 


483 


L27428 


Homo sapiens 


reverse transcriptase 


184 


47 


484 


U93570 


Homo sapiens 


putative pi 50 


101 


50 


485 


AF194537 . 


Homo sapiens 


NAG 13 


213 


52 


486 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


162 


82 


487 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


204 


86 


488 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


140 


53 


489 


U93574 


Homo sapiens 


putative pi 50 


86 


54 


490 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


83 


56 


491 

■ 


AJ27I872 


Nicotiana 
sylvestris 


extensin 


220 


47 


492 


U11288 


Drosophila 
melanogaster 


diaphanous protein 


113 


33 


493 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


184 


70 


494 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


68 


495 


AF1 19900 


Homo sapiens 


PR02822 


148 


65 


496 


AB026542 


Homo sapiens 


WASP-famiiy protein 


96 


38 


497 


D86853 


Catharanthus 
roseus 


extensin 


104 


34 


498 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protem K10 (NID:g8148) 


109 


47 


499 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


109 


67 


500 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


112 


62 


501 


AF 119901 


Homo sapiens 


PR0283 1 


116 


82 


502 


AF238235 


Entamoeba 
histolytica 


diaphanous protein 


120 


41 


503 


M22332 


Homo sapiens 


unknown protein 


123 


49 


504 


AF119851 


Homo sapiens 


PR01722 


204 


52 


505 


X61296 


Rattus 
norvegicus 


open reading name 2 


107 


"Aft *? \ 

45 


506 


AF 118082 


Homo sapiens 1 


PRO 1902 


145 


52 


507 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


103 


68 


508 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


175 


39 


509 


X92485 


Plasmodium 


pval 


117 


52 



142 



t 
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NO: 


Accession No. 


Species 

*** 


Description 


Smith- 
Waterman 
Score 


Identit 

y 






vivax 








510 


AF090942 


Homo sapiens 


PRO0657 


95 


77 


511 


U93569 


Homo sapiens 


putative pi 50 


120 


54 


512 


U93574 


Homo sapiens 


putative pi 50 


140 


49 


513 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


196 


63 


514 


L27428 


Homo sapiens 


reverse transcriptase 


132 


37 


515 


U93565 


Homo sapiens 


putative.plSO 


101 


45 


516 


U93574 


Homo sapiens 


putative pi 50 


178 


35 


517 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


81 


27 


518 


AF053538 


Aivinella 
pompejana 


fibrillar collagen chain FApl alpha 


112 


36 


519 


X52235 


Homo sapiens 


ORFII 


148 


35 


520 


AF130051 


Homo sapiens 


PRO0898 


98 


61 


521 


L02106 


Drosophila 
melanogaster 


ribonucleoprotein 

• 


143 


40 


522 

* 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


202 


70 


523 


U93570 


Homo sapiens 


putative pi 50 


159 


43 


524 


Y86248 


Homo sapiens 

• 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


210 


100 

• 


525 


L27428 


Homo sapiens 


reverse transcriptase 


128 


38 


526 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


96 


65 


527 


X53581 


Rattus 
norvegicus 


ORF4 


130 

ii 


42 


528 


U93570 


Homo sapiens 


putative pi 50 


195 


35 


529 


AF130089 


Homo sapiens 


PRO2550 


132 


43 


530 


AK024455 


Homo sapiens 


FLJ00047 protein 


126 


54 


531 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


123 


61 


532 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


210 


44 


533 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


109 


42 


534 


G03133 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


120 


62 


535 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


124 


37 


536 


R13556 


Homo sapiens 


Protein encoded downstream of hhc.M 
oncoprotein. 


122 


50 


537 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


118 


43 


538 


A23786 


Beta vulgaris 


chitinase 1 


91 


33 


539 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


39 


540 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


102 


67 


541 


S80119 


Rattus sp. 


reverse transcriptase homolog 


191 


50 


542 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


155 


77 


543 


G01828 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


125 




544 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


146 


62 


545 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


74 


45 


546 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


94 


75 


547 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


510 


86 


548 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


146 


68 



143 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
v 








RHAla (AF078683) 






549 


Y08061 


Homo sapiens 


Human c-mvb protein fragment. 


128 


82 


550 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


153 


77 


551 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7077. 


121 


56 


.552 • 


X92485 


Plasmodium 
vivax 


pval 


103 


50 


553 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


129 


46 


554 


L27428 


Homo sapiens 


reverse transcriptase 


149 


44 


555 


AF194537 


Homo sapiens 


NAG13 


157 


45 


556 


Y 13247 

A A W mm T m 


Homo sapiens 


FBI 9 protein 

A, A^m* A 0* m***.M.A 


106 

tA W W 


42 


557 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


100 


54 


558 


G0J984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


176 


68 


559 

W mf 


S80119 


Rattus so 

A \i* Hill fcj Wf'* 


reverse transcriptase homoloer 


113 

A A 


43 


560 


AY008270 


Homo sapiens 


cholestervl ester transfer orotein 


107 

BV W ( 


95 


561 


AF010144 


Homo sapiens 


neuronal thread orotein AD7c-NTP 


140 


63 


562 


U49973 

1 


Homo saoiens 

A. IVUlV JUL/ IVIM 


ORF1* MER37* putative transoosase 
similar to do&o element 


347 


68 


563 


AF025467 


Caenorhabditis 
elecans 


contains similarity to drosophila DNA- 
bindine protein K10 fNID*E8148) 


93 


40 


564 


D38114 


Gorilla corilla 


cvtochrome c oxidase subunit 3 (COllD 


329 


74 


565 


Y36156 


Homo sapiens 


Human secreted protein #28. 


153 


56 


566 

ml \J \J 


R96800 

ivy w u w 


Kninn ^anien^ 

llvluv uuU IwilO 


Human histiocvte-Qecreted factor HSF 


131 


48 


567 


D38112 


Homo saniens 


cvtocfirome c oxidase subunit 


406 


94 


568 


AF 130079 

dm a A w Ww / ✓ 


Homo saoiens 


PR02852 

A ft\,%/ v W W 


101 


55 


569 


AF081114 


Mus musculus 
domesticus 


ORF2 


123 

A A~*J 


40 


570 


L22030 


Glycine max 


hvdroxvoroline-rich clvcoorotein 


65 


45 


571 


D86853 


Catharanthus 
roseus 


extensin 

i 


168 


39 


572 


AF104415 


Mus musculus 


gene trap locus- 13 


179 


66 


573 


AF130089 


Homo sapiens 


PRO2550 


114 

A A W 


56 


574 


X67863 


Mus musculus 


T2 


115 


42 


575 


S80119 

Www A 1 ^ 


Rattus <m 


reverse trflnscriotese homolo^ 


101 


28 


576 


S80119 


Rattus 


reverse trarKPrintase homnlop 


150 


57 


577 

m* $ 9 


U49973 

^^^V m? m* 9 mw 


Homo saoiens 


ORF1 * MER37* nutative transnosase 
similar to poeo element 

V B^^* i f^^^ * will %^ 


142 


74 


578 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


106 


57 


579 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


46 


580 


L24521 


Homo sapiens 


transformation-related protein 


110 


38 


581 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


537 


84 


582 


AF090895 


Homo sapiens 


PRO0117 


127 


80 


583 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


145 


70 


584 


X55681 


Lvcooersicon 
esculentum 


extensin f class Y\ 


112 

A A*+ 


38 


585 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


473 


60 


586 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


144 


68 


587 


U47855 


Araneus 
diadematus 


fibroin-3 


124 


30 


588 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


150 


75 


589 


U93567 


Homo sapiens 


putative p!50 


225 


47 



144 
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SEQ ID 
NO: 


Accession No. 


Species 

• 


Description 

- 


Smith- 
Waterman 
Score 


% 

Ideotlt. 

y 


590 


X71602 


Nicotiana 
tabacum 


extensin 


147 


33 


591 


X57527 


Homo sapiens 


alpha 1 (VHl) collagen 


103 


42 


592 


G0->1 12 


Homo sapiens 


Human secreted protein, 5>JEQ ID NO: 
7193. 


fJ 


An . 

47 


593 




Homo sapiens 


i ype 111 procolJagen (prior sat). 


1 10 . 


48 


594 


R95913 


Homo sapiens 


Neural thread protein. 


116 


37 


595 


U11880 


Petromyzon 
m annus 


cytochrome oxidase subunit I 


127 . 


52 


596 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


O A 

84 


62 


597 


L27428 


Homo sapiens 


reverse transcriptase 


1 CO 

158 


40 


598 


M55251 


Cams famiharis 


glycoprotein 80 


559 


86 


599 


AP000616 


Oryza sativa 


similar to JRJNG-H2 finger protein 
RHAla(AF078683) 


132 


100 

* 


600 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


143 


33 


601 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


154 


54 


602 


X7348I 


Drosophila hydei 


mstl01(2) 


107 


42 ' 


603 


M81321 


Macaca 
fascicularis 


proline-rich protein 


114 


39 


604 


X05561 


Homo sapiens 


alpha- 1 chain precursor (AA -27 to 
917) (2953 is 2nd base in codon) 


109 


42 


605 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


82 


62 


606 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


S3 


64 


607 


AL359782 


Trypanosoma 
brucei 


probable similar to nng-h2 finger 
protein rha la. 


107 


44 


608 


L27428 


Homo sapiens 


reverse transcriptase 


147 


43 


609 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


113 


61 


610 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


151 


82 


611 


U93568 


Homo sapiens 


putative p!50 


144 


32 


612 


AB022223 


Arabidopsis 
thaiiana 


extensin protein-like 


186 


58 


613 


Z70684 


Caenoihabditis 
elegans 


F28D1.8 


108 


49 


614 


M11901 


Rattus 
norvegicus 


proline-nch salivary protem 


133 


a9* aW 

36 


615 


X92485 


Plasmodium 
vivax 


pval 


120 


At 

42 


616 


Y86248 


Homo sapiens 


Human secreted protem HCHPF68, 
SEQ ID NO: 163. 


175 


AT* XV ' 

89 


a** *M §wm 

617 


U83280 


Leishmania 
donovani 


39 kDa antigen 


111 


51 


618 


AT « f l\ A #\»* 

AL 160493 


Leishmania 

• 

major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


137 


67 


619 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

001 j. 


120 


53 


620 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


92 


45 


621 


AF130089 


Homo sapiens 


PRO2550 


123 


34 


622 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


133 


59 


623 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


127 


45 


624 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


205 


66 



145 



SEQ1D 
NO: 


\ Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Ideotit 

y 


625 


XQ7882 


Homo, sapiens 


Po protein 


119 


39 


626 


Y86248 


Homo sapiens 


Human secreted protem HCHPF68, 
SEQIDNO:163. 


70 


100 


621 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


141 


mt m 

51 


628 


G00637 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
4718. 


1.41 


mm A 

54 


629 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


115 


46 


630 


X63368 


Homo sapiens 


HSJlb 


151 


52 


^ Am. m 

631 


A m Am. «v #^ #m 4^ 

AF1 30089 


Homo sapiens 


PRO2550 


155 


47 


632 


X92485 


Plasmodium 

• 

vivax 


pval 


102 


61 


633 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


102 


39 


634 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


171 


74 


635 


X92485 


Plasmodium 
vivax 


pval 


95 


73 


636 


S80119 


Rattus sp. 


reverse transcriptase homolog 


114 


58 


637 


U 15647 


Mus museums 


reverse transcriptase 


170 


42 


638 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


76 


639 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


83 


36 


640 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


96 


34 


641 


X61296 


Rattus 
norvegicus 


open reading frame 2 

* 


166 


35 


642 


G00352 

_ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


127 

• 


35 


643 


U23172 


Caenorhabditis 
elegans 

■ 

• 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


115 

• 


35 


m+ A 

644 


AF08 1 1 1 1 


Mus musculus 
domesticus 


ORF2 


168 


33 


645 


AK027208 


Homo sapiens 


unnamed protein product 


90 


51 


m** A A* 

646 


AFO 16099 


Mus musculus 


m a * ■ 

endonuclease/reverse transcriptase 


101 


59 


641 


Y86248 


Homo sapiens 


Human secreted protem HCHPF68, 
SEQ ED NO: 163. 


149 


76 


a*~ » r\ 

648 


AF273441 


Pongo pygmaeus 


NADH dehydrogenase subunit 3 


121 


58 


aw a 

649 


L27428 


Homo sapiens 


reverse transcriptase 


173 


58 


650 


AF1 19851 


Homo sapiens 


PR01722 


176 


53 


651 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:125. 


168 


68 


At* AWf*\ 

652 


AF 130089 


WW • 

Homo sapiens 


PRO2550 


m Am A*m. 

130 


36 


653 


W50192 


Homo sapiens 


Ammo acid sequence of salivary 
protem CON-1. 


■* #% m+* 

126 


m m 

41 


654 


AK000385 


Homo sapiens 


unnamed protem product 


195 


63 


r- c C 

655 


AB041881 


Rattus 
norvegicus 


cytoplasmic dynem heavy chain 


m aw gm 

158 


100 


656 j 


^//l £\ APW 

X61047 


Hydra sp. 


mmi-coilagen 


60 i 


mm **% 

38 


Dj / 




riomo sapiens 


unxnown protem 


inn I 


cn 


658 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


62 


659 


AF 194537 


Homo sapiens 


NAG13 


95 


48 


660 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


120 


50 


661 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


86 


32 


662 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


169 


53 


663 


W48351 


Homo sapiens 


Human breast cancer related protein 


120 


72 



146 
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SEQ1D 
NO: 


Accession No. 


Species 

> 

■ 


Description 


Smith- 
Waterman 
Score 


Identit 

y 








BCRB2. 






664 


U 15647 


Mus musculus 


reverse transcriptase 


148 


50 


665 


R95913 


Homo sapiens 


Neural thread protein. 


161 


59 


666 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


134 


80 


667 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7348. 


H7 . 


44 


668 


AB018705 


Mus musculus 


ORF2 


115 


32 


669 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


280 


75 


670 


X53581 


Rattus 
norvegicus 


ORF4 


71 


39 


671 


U49973 


Homo sapiens 

. 


ORF1; MER37; putative transposase 
similar to pogo element 


201 

• 


66 


672 


R95913 


Homo sapiens 


Neural thread protein. 


144 


78 


673 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


164 


46 


674 


AF118082 


Homo sapiens 


PRO1902 


137 


49 


675 

■ 


Q03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


■ 


43 


676 


D00570 , 


Mus musculus 


open reading frame (25 1 AA) 


112 


72 


677 


AF194537 


Homo sapiens 


NAG 1 3 


238 


56 


678 


M13100 


Rattus 
norvegicus 


unknown protein 


146 


51 


679 


U15647 


Mus musculus 


reverse transcriptase 


123 


54 


680 


R95913 


Homo sapiens 


Neural thread protein. 


145 


55 


681 


R59842 


Homo sapiens 


ApoE4Ll protease. 


107 


63 


682 


AF130089 


Homo sapiens 


PRO2550 


94 


51 


683 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


129 


69 


684 


AF093748 


Homo sapiens 


KH type splicing regulatory protein; 
KSRP 


93 


50 


685 


U93569 


Homo sapiens 


putative pi 50 


133 


58 


686 


G03095 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7176. 


117 


64 


687 

• 


Y36243 


Homo sapiens 


Human secreted protein encoded by 
gene 20. 


69 


73 


688 


AF116712 


Homo sapiens 


PRQ2738 


133 


56 


689 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


134 


53 


690 


U93563 


Homo sapiens 


putative pi 50 


132 


49 


691 


U49973 


Homo sapiens 

• 


ORF1; MER37; putative transposase 
similar to pogo element 


349 


70 


692 


AF090895 


Homo sapiens 


PRO0117 


115 


63 


693 


AF130089 


Homo sapiens 


PRO2550 


132 


80 


694 


S80119 


Rattus sp. 


reverse transcriptase homolog 


101 


43 


695 


U15647 


Mus musculus 


reverse transcriptase 


120 


64 


696 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


132 


59 


697 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


128 


72 


698 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


182 


47 


699 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


459 


83 


700 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


148 


73 


701 


AF003535 


Homo sapiens 


ORF2-Iike protein 


125 


49 


702 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophiia DNA- 
binding protein K10 (NID:g8148) 


89 


41 


703 


L27428 


Homo sapiens 


reverse transcriptase 


255 


50 


704 


AF130089 


Homo sapiens 


PRO2550 


87 


55 


705 


G03052 | Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


52 



147 
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~S£"QID 
NO: 


Accession No. 


Species 


Dtscrinrion 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 








7133. 






706 


Y79140 


Homo sapiens 


Human haemopoietic stem cell 
regulatory protein SCM3. 


211 


88 


707 


U15647 


Mus musculus 


reverse transcriptase 


94 


47 


708 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


153 


66 


709 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


107 

• 


50 


710 


U49973 


Homo sapiens . 


ORF1 ; MER37; putative transposase 
similar to pogo element 


163 


64 


711 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


146 


65 


712 


U93565 


Homo sapiens 


putative pi 50 


108 


33 


713 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


108 


37 


714 


M24732 


Homo sapiens 


jamin-like protein 


92 


35 


715 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


306 


79 


716 


U97674 


Mesocricetus , 
auratus , 


cytochrome c oxidase chain I 


462 


85 


717 


AF004715 


Homo sapiens 


jerky gene product homoiog 


100 


42 


718 


X92485 


Plasmodium 
vivax 


pval 


84 


48 


719 


AF130089 


Homo sapiens 


PRO2550 


132 


74 


720 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


125 


43 


721 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


68 


722 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


146 


46 


723 


X83413 


Human 
herpesvirus 6 


U88 


269 


41 


724 


X92485 


Plasmodium 
vivax 


pval 


117 


43 


725 


X92485 


Plasmodium 
vivax 


pval 


97 


41 


726 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6639. 


102 


73 


727 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


118 


43 


728 


A23786 


Beta vulgaris 


chitinase 1 


91 


33 


729 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


49 


730 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


131 


59 


731 


W49717 


Homo sapiens 


Protein polymer adhesive substrate 
PPAS1-C. 


148 


29 


732 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


95 


45 


733 


X96731 


Ostertagia 
circumcincta 


articular collagen 


104 


37 


734 


AF130089 


Homo sapiens 


PRO2550 


118 


40 


735 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


736 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


494 


86 


737 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


188 


69 


738 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


593 


91 


739 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homoiog 


102 


62 



148 
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* * • * 



SEQ ID 
NO: 


\ Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Identit 

y 


740 


GP3203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


ill 


54 


.743 


A23786 


Beta vulgaris 


chitinase 1 


106 


36 


I 42 

• 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 

■ 


72 


743 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


184 


61 


744 


AE003629 


Drosophila 
melanogaster 


CG 17 1 08 gene product 


76 


36 


745 


U93563 


Homo sapiens 


putative pi 50. 


145 


50 


746 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


68 


66 


747 


AF2 17973 


Homo sapiens 


unknown 


113. 


79 


748 


AC006161 

• 


Arabidopsis 
thaliana 


putative CENP-B/ARS-binding 
protein-like protein 


112 


31 


749 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID "NO: 
7077. 


106 


47 


750 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


115 


54 


751 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


99 


72 

• 


752 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


85 


48 


753 


X92485 


Plasmodium 
vivax 


pval 

• 


89 

■ 


73 


754 


U93563 


Homo sapiens 


putative pi 50 


186 


68 


755 


AP000616 

■ 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


116 


50 


756 


AF194537 


Homo sapiens 


NAG 13 


138 


40 


757 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


112 


79 


758 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


119 


61 


759 


AF 130079 


Homo sapiens 


PR02852 


138 


40 


760 

■ 


X92485 

■ 


Plasmodium 
vivax 


pval 


88 


77 


761 


U93050 


Mus musculus 


pory(A) binding protein II 


95 


35 


762 


V01555 

• 


Human 
herpesvirus 4 


BYRF1, encodes EBNA-2 (Dambaugn 
etal, 1984;DillneretaI, 1984) 


80 


41 


763 


V00662 


Homo sapiens 


ATPase 6 


337 


83 


764 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


117 


41 


765 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


423 


81 


mm ^ ^ 

766 


R95913 


Homo sapiens 


Neural thread protein. 


114 


66 


767 


V00662 


Homo sapiens 


cytochrome oxidase I 


223 


83 


768 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


268 


83 


769 


V00662 


Homo sapiens 


cytochrome oxidase I 


357 


81 


770 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


296 


71 


771 


AF026211 


Caenorhabditis 
elegans 


Similar to cuticular collagen 

■ ■ - 




39 


772 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


53 


773 


X92485 


Plasmodium 
vivax 


pval 


95 


39 


774 


AF1 30051 


Homo sapiens 


PRO0898 


123 


38 


775 


ABO 12223 


Canis familiaris 


ORF2 


174 


51 


776 


AB028664 


Paralichthys 
olivaceus 


cytochrome oxidase subunit-3 


268 


57 


777 


V00662 


Homo sapiens 


cytochrome oxidase I 


436 


85 


778 


X52235 


Homo sapiens 


ORFn 


125 


47 


779 | 


U93569 


Homo sapiens 


putative pi 50 


235 


49 
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t 



t 

k - 

WO 01/64835 PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


"Description 


Smith- 
Waterman - 
Score 


94 

Identit 

y 


780 


U49973 

♦ 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


262 


76 


781 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
. similar to pogo element 


239 


70 


782 


AF090942 


Homo sapiens 


PRO0657 


146 


69 


783 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


85 


66 


784 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


209 


62 


785 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


184 


60 


786 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


158 


60 


787 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


293 


86 


788 


AF130089 


Homo sapiens 


PRO2550 


153 


64 


789 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


131 


57 


790 


G00354 


Homo sapiens 

i 


Human secreted protein, SEQ ID NO: 
4435. 


100 


51 


791 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


116 


80 


792 


Y36203 


Homo sapiens 


Human secreted protein #75. 


104 


77 


793 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


117 


58 


794 


AF090930 


Homo sapiens 


PRO0478 


130 


68 


795 


AKQ24455 


Homo sapiens 


FU00047 protein 


124 


61 


796 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


MM 


70 


797 


AF 130089 


Homo sapiens 


PRO2550 


273 


85 


798 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


134 


66 


799 


AF130051 


Homo sapiens 


PRO0898 


162 


80 


800 


G03560 

r 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7641. 


179 


70 


801 


Y14479 

■ 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


75 


51 


802 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


103 


34 


803 


D38484 


Hylobates 
syndactytus 


Cytochrome C oxidase subunit 1 
(COXI) 


263 


70 


804 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


112 


41 


805 


X92485 


Plasmodium 
vivax 


pval 


103 


38 


806 


AF194537 


Homo sapiens 


NAG 13 


285 


51 


807 


AF121360 

• 


Drosophila 
melanogaster 


DNZDHHC/NEW1 zinc finger protein 
11 


179 


47 


808 


X92485 


Plasmodium 
vivax 


pval 


131 


46 


809 


D13951 


Nicotiana 
tabacum 


extensin precursor 


88 


41 


810 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


174 


73 


811 


U93565 


Homo sapiens 


putative pi 50 


118 


38 


812 


AF1 18082 


Homo sapiens 


PRO1902 


88 


54 


813 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


132 


42 


814 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


114 


50 


815 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


113 


43 


816 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


148 


40 
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SEQ ID 

NO: 


^Accession No. 


Species 

• 


Description 


Smith- 
Waterman 
Score 


Idcntit 

y 


817 


AF2 17449 


Schistosoma 
mekongi 


NADH dehydrogenase subunit 6 


102 


37 


.818 


ABO 12223 


Canis familiaris 


ORF2 


101 


54 


819 

't 


X71602' 


Nicotiana 
tabacum 


extensin 


162 


45 


820 


G04091 

• 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
8172. 


154 

- 


36 


821 


R95913 


Homo sapiens 


Neural thread protein. 


153 


61 


822 

- 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


125 

• 


61 


823 


U49973 


Homo sapiens 

* * 


ORF1; MER37; putative transposase 
similar to pogo element 


364 


57 


824 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


121 


48 


825 


AF194537 


Homo sapiens 


NAG 13 


224 


58 


826 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain 1 


437 


72 


827 


AF051782 


Homo sapiens 


diaphanous 1 


108 


38 


828 


AF194537 , 


Homo sapiens 


NAG 13 


92 


45 


829 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


492 


75 


830 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


110 


46 


831 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


108 


31 


832 | Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


117 


34 


833 


U93564 


Homo sapiens 


putative pi 50 


84 


40 


834 


U93563 


Homo sapiens 


putative pi 50 


262 


54 


835 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


214 


80 


836 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


107 


48 


837 

* 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


158 


76 


838 


AF194537 


Homo sapiens 


NAG 13 


153 


60 


839 


U52077 


Homo sapiens 


mariner transposase 


344 


67 


840 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


104 


46 


841 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


130 


92 


842 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


114 


36 


843 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


196 


43 


844 


X92485 


Plasmodium 
vivax 


pval 


102 

- 


73 


845 


X61048 


Hydra sp. 


mini-collagen 


106 


41 


846 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


48 


847 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


54 


848 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


128 


68 


849 


AF 194537 


Homo sapiens 


NAG 13 


141 


46 


850 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


121 


45 


851 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


153 


58 


852 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


220 


60 



151 
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~§EO ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 


853 


X92485 


Plasmodium 
vivax 


pval 


127 

• 


45 


854 


AF 134305 


Homo sapiens 


Scar3 


99 


38 


855 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


343 


68 


856 


S80119 


Rattus sp. 


reverse transcriptase homolog 


159 


56 


857 


AF1 30089 


Homo sapiens 


PRO2550 


112 


40 


858 


AK024372 


Homo sapiens 


unnamed protein product 


129 


50 


859 


G03790 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7871. 


103 


76 


860 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


279 


80 


861 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


96 


44 


862 


U49973 


Homo sapiens 


0RF1; MER37; putative transposase 
similar to pogo element 


83 


40 


863 


AF210651 


Homo sapiens 


NAG 18 


122 


63 


864 


AFO 16099 


Mus museums 


endonuclease/reverse transcriptase 


109 


51 


865 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


115 


34 


866 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


107 


80 


867 


X92485 


Plasmodium 
vivax 


pval 


105 


41 


868 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


127 


62 


869 


X92485 


Plasmodium 
vivax 


pval 


105 


38 


870 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 




56 


871 


Y27571 


Homo sapiens 


Human secreted protein encoded by 
gene No. 5. 


128 


72 


872 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


123 


36 


873 


AF130089 


Homo sapiens 


PRO2550 


160 


82 


874 


AF1 18082 


Homo sapiens 


PRO1902 


143 


65 


875 


U93564 


Homo sapiens 


putative pi 50 


180 


44 


876 


M10546 


Homo sapiens 


cytochrome oxidase I 


248 


75 


877 


U83303 


Homo sapiens 


line-! reverse transcriptase 


127 


52 


878 


G03790 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


106 

■ 


35 


879 


U11288 


Drosophila 
melanogaster 


diaphanous protein 


93 


46 


880 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 ringer 
protein rhala. 


71 


4 ? 


881 


B08942 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 1 8 SEQ ID NO:99. 


95 


40 


882 


AF130089 


Homo sapiens 


PRO2550 


137 


44 


883 


AF090942 


Homo sapiens 


PRO0657 


142 


73 


884 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


97 


60 


885 


X61296 


Rattus 
norvegicus 


open reading frame 2 


106 


43 


886 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243 . 


91 


57 


887 


X14963 


Homo sapiens 


collagen-like protein (447 AA) 


130 


51 


888 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


117 


55 


889 


L25616 


Homo sapiens 


CGI protein 


150 


62 


890 


G03786 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7867. 


171 


66 


891 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 


107 


40 



152 
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■ ■ 



SEO ID 


t Accession No. 


Snecies 


Descriotion 


Smith- 




NO: 






Waterman 


Ideotit 










Score 


y 




I 


* 


fragment, SEQ ID NO:387. 




■ 


892 


X52318 


Bos taurus 


histone H2A.Z (AA 1-127) 


356 


79 


893 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subumt) 


102 


46 


894 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


173 


75 


895 


X92485 


Plasmodium 


pval 


109 


47 






vivax 


■ 






896 


L76I59 


Homosapiens 


FRGl gene product 


100 


35 


897 


D50926 


Homo sapiens 


The KIAA0136 gene product is novel 


280 


89 


898 


X04011 


Homo sapiens 


precursor polypeptide 


114 


95 


899 


M90656 


Homo sapiens 


gamma-gJutamyJcysteine synthetase 


101 


90 


900 


R95913 


Homo sapiens 


Neural thread protein. 


95 


75 


901 


L27428 


Homo sapiens 


reverse transcriptase 


81 


47 


902 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


104 


53 


903 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


76 








4493. 


- 




904 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


126 


37 


905 


AF130089 


Homo sapiens 


PRO2550 


88 


82 


906 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


230 


50 


• 


* 




similar to pogo element 






907 


L27428 , 


Homo sapiens 


reverse transcriptase 


114 


64 


908 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


71 








4458. 




• 


909 


U93570 


Homo sapiens 


putative pl50 


130 


41 


910 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


91 


63 


911 


L27428 


Homo sapiens 


reverse transcriptase 


168 


46 


912 


X92485 


Plasmodium 


pval 


91 


60 




- 


vivax 


i 






913 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


64 








7870. 


■ 




914 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


64 








7870. 




■ 


915 


Y64890 


Homo sapiens 


Human 5* EST related polypeptide SEQ 


121 


51 








ID NO: 1051. 


* 


- 


916 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


77 


■ 


• 


6582. 






917 


M12099 


Mus musculus 


proline-rich protein 


129 


44 


918 


M15530 


Homo sapiens 


B-cell growth factor 


88 


51 


919 


AF130079 


Homo sapiens 


PR02852 


158 


88 


920 


G03793 


Homosapiens 


Human secreted protein, SEQ ID NO: 


103 


77 








7874. 






921 


X53581 


Rattus 


ORF4 


124 


32 




■ 


norvegicus 








922 


G04067 


Homosapiens 


Human secreted protein, SEQ ID NO: 


110 


50 






- 


8148. 


_ 




923 


G04091 


Homosapiens 


Human secreted protein, SEQ ID NO: 


99 


36 






■ 


8172. 






924 


W50193 


Homo sapiens 


Amino acid sequence of salivary 


80 


47 




■ 


- 


protein CON-2. 






925 


AC006127 


Homo sapiens 


BRG-1 -HUMAN ; nuclear protein 


442 


92 






GRB1 ; homeotic gene regulator, SNF2- 




- 








BETA; MITOTIC GROWTH AND 












TRANSCRIPTION ACTIVATOR; 












POSSIBLE GLOBAL 












TRANSCRIPTION ACTIVATOR 












SNF2L4 






926 


L17318 


Rattus 


proline-rich proteoglycan 


126 


39 






norvegicus 








927 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


321 










similar to ppgo element 







153 



> ■ • 
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SEQ ID 


Accession No. 


Species 

■ 


Description 


Smith- 




NO: 




• 


Waterman . 


Identit . 










Score 


y 


928 


AC006161 


Arabidopsis 


putative CENP-B/ARS-bmdmg 


97 


35 






thaliana 


protein-like protein 






929 


X02873 


Daucus carota 


put precursor 


112 


38 


930 


U93563 


Homo sapiens 


putative pi 50 


IOC 


70 


931 


ABO 12223 


Cams familians 


ORF2 


202 


50 


932 


AF053538 


Alvinella 


fibrillar collagen cham FApl alpha 


114 


37 






pompejana 








933 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


57 






■ 


7871. 


■ 




934 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


283 


54 


• 






similar to pogo element 






935 


U41017 


Caenorhabditis 


repetitive region; weakly similar to E. 


107 


33 






elegans 


gracilis major membrane skeletal 












protem (PIR:A43417) 






936 


U47855 


Araneus 


fibroin-3 


109 


33 


- 




diadematus 


- 


• 




937 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


43 




■ 




7214. 






938 


W48351 


Homo sapiens 


Human breast cancer related protein 


108 


50 








BCRB2. 

V ' - - 


. 


. ■ 


939 


M13101 


Rattus 


unknown protein 


121 


40 






norvegicus 






■ 


940 


G00689 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


117 


46 






- 


4770. 


* 




941 


AF010144 


Homo sapiens 


neuronal thread protem AD7c-NTP 


88 


62 


942 


L27428 


Homo sapiens 


reverse transcriptase 


86 


45 


943 


U93564 


Homo sapiens 


putative pi 50 


379 


40 


944 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 


110 


48 




■ 


■ 


IDNO:i051. 






945 


L27428 


Homo sapiens 


reverse transcriptase 


238 


66 


946 


AF1 94537 


Homo sapiens 


NAG 13 


146 


47 


947 


G02538 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


81 


55 






■ 


6619. 


• .. . 




948 


AC006161 


Arabidopsis 


putative CENP-B/ARS-bmding 


113 


37 




■ 


thaliana 


protein-like protein 






949 


AF 194537 


Homo sapiens 


NAG 13 


106 


66 


950 


Y14437 


Homo sapiens 


Human secreted protein encoded by 


74 


75 








gene 27 clone HSAWA27. 






951 


G04067 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


78 


66 




• 




8148. 


- - 


- 


952 


Y19767 


Homo sapiens ! 


SEQ ID NO 485 from W09922243. 


85 


72 


953 


U44838 


Glycme max 


• 

extensm 


145 


39 


954 


R13556 


Homo sapiens 


Protem encoded downstream of hhcJM 


131 


39 








oncoprotem. 






955 


G03801 


V IT * 

Homo sapiens 


Human secreted protem, SEQ ID NO: 


128 


53 








7882. 





... 


956 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


74 








4435. 






957 


AF 130089 


Homo sapiens 


PRO2550 


120 


37 


958 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


71 








7124. 






joy 




Homo sapiens 


PKUU057 




oi 


960 


L27428 


Homo sapiens 


reverse transcriptase 


121 


30 


961 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


68 








7795. 






962 


AF090942 


Homo sapiens 


PRO0657 


138 


61 


963 


U83280 


Leishmania 


39 kDa antigen 


101 


53 






donovani 






964 


U93566 


Homo sapiens 


p40 


102 


31 



154 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: 


s Accession No. 


Species 

< 


Description 

• 


Smith- 
Waterman 



ocore 


% " 
Ideatft 

y 


965 


Gp3786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

. 7Ro7 
loot. 


104 


63 


966 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


40 


967 


AF090930 


Homo sapiens 


PRO0478 


158 . 


80 






i^anis iamuians . 


\Jt\rz 




Jo ! 


969 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


127 


54 


970 


X535ol 


Kattus 
norvegicus 


URF4 


163 . 


43 


fV71 

971 . 


(j03453 


Homo sapiens 


Human secreted protem, SfaQ ID NO:- 

/5J4. 


1 Aft 

109 


56 


972 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
71 yo. 


82 


47 


973 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transterase protein. 


9 5 


80 


974 


G03095 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7176. 


74 


51 


975 


1 Tft^ CI it 

U93574 


Homo sapiens 


putative pi 50 


1 A ft 

140 


43 


976 . 


W48351 

♦ 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


79 


65 


977 


Y08062 * 


Homo sapiens 


Human PR0245 protein fragment 
derived from DIM AJ5oJo. 


116 


55 


978 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


148 


63 


979 


At 1301 14 


Homo sapiens 


PRU2459 


121 


61 


980 


K03202 


Homo sapiens 


salivary pronne-ncn protem precursor 


ft ft. 


>ift 

40 


981 


AF 116909 


Homo sapiens 


unknown 


115 


42 


ft no 

982 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


106 


75 


983 


Y 14674 


Plasmodium 
falciparum 


glutam ate— cysteine ligase 


106 


AT AT 

66 


984 


AF229067 


Homo sapiens 


ft A Y"\I tJ 

PADI-H protein 


152 


^ft 

60 


985 


AF 1 19900 


Homo sapiens 


T»T» A\1 Oil 

PR02822 


142 


48 


986 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7/y5. 


lift 
119 


70 


987 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


106 


60 


QQO 


/~>fto lift 

ooj i iy 


riomo sapiens 


i lam an secreted protem, bcQ 1L? NO. 

/ZUU. 


ol 


51 


990 


U35730 


Mus musculus 


jerky 


133 


29 


OOI 

771 


ATI 1 

At I loOoj 


riomo sapiens 


rKvJUy/4 


iJD 


Oi 


992 


U52077 


Homo sapiens 


mariner transposase 


497 


77 


99j 


.697211 


ocmzosaccnarom 
yces pomoe 


Kinesuwike protem 


197 


47 


994 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


124 


68 




T TOO <X2 


riomo sapiens 


putative pi 50 


137 


CA 

50 


OQ£ 


ADA1 cono 


Acetooacter 
xylinus 


similar to cellulose complementing 
protein of A. xylinum ATCC23869 


14U 


54 




uUZJoJ 


riomo sapiens 


Human secreted protem, obvj iu nu: 
6444. 


144 


£A 
oU 


HOC 


Z?3ol Jo 


Pan pamscus 


cytochrome c oxidase subunit 1 


352 


75 


QQO 


Rfni4R 


nomo sapiens 


Unman nAiirnnnl rHfrf*l*Pn'tlIltlftT^ fa/^ftf— 

1 (NDF-1). 




7R 
/o 


1000 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


377 


70 


1001 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


109 


50 


1002 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


249 


61 


1003 


M14702 


Murine leukemia 


pol polyprotein 


206 


48 



155 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


^mittv 
Waterman 
Score 


% 

Identit 

y 






virus 








1004 


Z21507 


Homo sapiens 


human elongation factor-1 -delta 


511 


85 


1005 


L27428 


Homo sapiens 


reverse transcriptase 


176 


63 


1006 


D38112 


WW * 

Homo sapiens 


NADH dehydrogenase subunit 5 


+y a a 

332 


77 


1007 


AF090895 


Homo sapiens 


m*^^ s^\. t% am m mm 

PRO0117 


162 


66 


1008 


G03102 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: , 

mm 4 /s *h 

7183. 


140 


6 5 


1009 


U44838 


M m^ At • 

Glycine max 


extensm 


166 . 


33 


1010 


AF251290 


Plasmodium 
falciparum 


glutamic acid-rich protein 


m At A 

114 


52 


1011 


L27428 


Homo sapiens 


reverse transcriptase 


m m A 

114 


52 


1012 


AF130089 


Homo sapiens 


PRO2550 


114 


77 


1013 

> 


G04092 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8173. 


81 


44 


1014 


AF090895 


Homo sapiens 


PRO0117 


97 


65 


1015 


AF130089 


Homo sapiens 


PRO2550 


168 


83 


1016 


AF079367 


Mesocricetus 
auratus 


cytochrome c oxidase subunit III 


276 


52 


1017 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


94 

• 


44 


1018 


Y36366 

■ 


Homo sapiens 

* 


Fragment of human secreted protein 
encoded by gene 3. 


114 


46 


1019 


AF090944 


Homo sapiens 


PRO0663 


137 


50 


1020 


Y21166 


Homo sapiens 


Human bcl2 proto-oncogene mutant 
protein fragment 14. 


84 


36 


1021 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


140 


37 


1022 


AL049608 


Arabidopsis 
thaliana 


extensin-like protein 


IQ5 


34 


1023 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


152 


52 

* 


1024 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


66 


1025 


M12140 


Homo sapiens 


envelope protein 


143 


62 


1026 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


200 


58 


1027 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


266 


67 


1028 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


97 


31 


1029 


G02950 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7031. 


102 


56 


1030 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


76 


63 


1031 


G00637 

> * 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


74 


56 


1032 


X90568 


Homo sapiens 


Protein sequence and annotation 
available soon via Swiss-Prot; available 
at present via e-mail from 
LABEIT@EMBL~Heidelberg.DE 


389 


100 


1033 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

M»mm*w m 

6735. 


96 


40 


1034 


AF010144 


Homo sapiens 


neuronal thread protem AD/c-NTP 


316 


60 


1035 


AF 194537 


Homo sapiens 


NAG 13 


208 


52 


1036 


L27428 


Homo sapiens 


reverse transcriptase 


166 


51 


1037 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


44 


1038 


AF1 16638 


Homo sapiens 


PR01546 


56 


61 


1039 


U93570 


Homo sapiens 


putative pi 50 


138 


40 


1040 


AF130089 


Homo sapiens 


PRO2550 


150 


91 


1041 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


68 



156 



WO 01/64835 PCT/US01/04927 

• * r 

I 



NO: 


> Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


1042 


Y36156 


Homo sapiens 


Human secreted protein #28. 


97 


41 


1043 


U93568 


Homo sapiens 


putative pl50 


124 


34 


1044 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


208 


71 


1045 


U93563 


Homo sapiens 


putative pi 50 


246 


54 


1046 


G00416 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4497. 


131 


64 


1047 


U93563 


Homo sapiens . 


putative pi 50 


127 


30 


1048 


AF130114 


Homo sapiens 


PR02459 


117 


67 


1049 


U12919 


Mus museums 


adenyryl cyclase type Vll 


170 


68 


1050 


AC008054 


Leishmania 
major 


L8453.1 


129 


30 


1051 


X99467 


Medicago 
tnincatula 


ENOD20 


110 


38 


1052 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


150 


76 


1053 


AF1 16712 


Homo sapiens 


PR02738 


109 


44 


1054 


M96256 


Homo sapiens 


rapamycin binding protein 


168 


56 


1055 


U15647 


Mus museums 


reverse transcriptase 


86 


37 


1056 

■ 


AL024498 


Homo sapiens 


dJ417M14.2 (novel serine/threonine- 
protein kinase (ortholog of mouse and 
rat MAK (male germ cell-associated 
kinase)) 


190 

■ 


72 


1057 


AF090942 


Homo sapiens 


PRO0657 


103 


63 


1058 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


156 


68 


1059 


AF081114 


Mus mus cuius 
domesticus 


ORF2 


134 

■ 


47 


1060 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


180 


83 


1061 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


126 


45 


1062 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


84 


55 

- 


1063 


U15647 


Mus musculus 


reverse transcriptase 


95 


38 


1064 


U93567 


Homo sapiens 


putative pi 50 


128 


58 


1065 


Y14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


130 


66 


1066 


X92485 


Plasmodium 
vivax 


pval 


119 


62 


1067 


U93567 


Homo sapiens 


p40 


161 


48 


1068 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


540 


84 


1069 


U93570 


Homo sapiens 


putative pi 50 


107 


59 


1070 


AF321051 


Chalinolobus 
tuberculatus 


cytochrome c oxidase subunit III 


333 


71 


1071 


U93567 


Homo sapiens 


p40 


99 


28 


1072 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


105 


66 


1073 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


136 


59 


1074 


U93572 


Homo sapiens 


putative pl50 


140 


53 


1075 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


66 


43 


1076 


AL049608 


Arabidopsis 
thaliana 


extensin-Iike protein 


105 


37 


1077 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


94 


66 


1078 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


81 


57 


1079 


G04091 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


83 


35 



157 



SEQID 


Accession No. 


Sdccics 


DescriDtion 


Smith- 


% 


NO: 






• 


Waterman 


Identit 






H' 


- 


Score 


y 








8172. 






1080 


AF162149 


Mycoplasma 


variable surface lipoprotein 


103 


41 






• ■ 

D0Y1S 


* 






1081 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


75 






■ 


7253. 






1082 


U43360 


Peromyscus 


reverse transcriptase 


121 


42 




■ 


mamculatus 








1083 


U93564 


Homo sapiens 


p40 


97 


42 


1084 


AF229067 


Homo sapiens 


PADI-H protein 


145 


61 


1085 


U49973 


Homo sapiens 


ORF1 ; MER37; putative trahsposase 


221 


60 


- 






similar to pogo element 






1086 


U88573 


Homo sapiens 


NBR2 


165 


67 


1087 


Y36156 


Homo sapiens 


Human secreted protein #28. 


93 


72 


1088 


AF1 94537 


Homo sapiens 


NAG 13 


142 


62 


1089 


B08976 


Homo sapiens 


Human secreted protein sequence 


93 


62 


■ 






encoded by gene 28 SEQ ID NO: 133. 






1090 


AF194537 


Homo sapiens 


NAG 13 


155 


40 


1091 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


35 








8172. 


• 




1092 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


195 


40 


1093 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


91 


46 






■ 


similar to pogo element 




• 


1094 


X53581 


Kattus 


ORF4 


106 


62 






norvegicus 








1095 


AL 160493 


Leishmania 


probable (hhv-6) ul 102, variant a 


129 


51 






major 


DNA, complete virion genome 






1096 


Y86473 


Homo sapiens 


Human gene 52-encoded protein 


72 . 


33 








fragment, SEQ ID NO:388. 






1097 


U40342 


Mus musculus 


ninein 


152 


44 


1098 


M24732 


Homo sapiens 


lamin-like protein 


92 


37 


1099 


X92485 


Plasmodium 


pval 


111 


67 






vivax 


■ 






1100 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


156 


86 








gene No. 107. 






1101 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


116 


90 








RHAla(AF078683) 






1102 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


71 








4518. 






1103 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


36 


_ 






7214. 






1104 


U93572 


Homo sapiens 


putative pi 50 


168 


56 


1105 


U93570 


Homo sapiens 


putative pi 50 


96 


40 


1106 


, L27428 


Homo sapiens 


reverse transcriptase 


188 


43 


1107 


X53581 


Ramis 


ORF4 


141 


43 






norvegicus 








1108 


Y91577 


Homo sapiens 


Human secreted protein sequence 


344 


77 








encoded by gene 2 SEQ ID NO:250. 


- 




1109 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


68 








6571. 


■ 




1110 


U93569 


Homo sapiens 


putative pi 50 


156 


38 


1111 


AF 118086 


Homo sapiens 


PR01992 


135 


54 


1112 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


71 


63 








gene 22 clone HMSJW18. 






1113 


AFO 16099 


Mus musculus 


endonuclease/re verse transcriptase 


124 


62 


1114 


L27428 


Homo sapiens 


reverse transcriptase 


200 


40 


1115 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


50 








7077. 






1116 


L27428 


Homo sapiens 


reverse transcriptase 


122 


70 


1117 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


65 



158 



WO 01/64835 PCT/US01/04927 

• ■ 



SEQ to 
NO: 


* Accession No. 


Species 


Description 


Smitb- 

Wsterman 

Score 


% 

Identit 

y 




* 




4718. 






1118 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2, 


116 


65 


1119 

'i 


D38484 


Hylobates 
syndactylus 


Cytochrome C oxidase subunit 1 
(COXI) 


315 


89 


1120 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


115 


76 


1121 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


136 


68 


1122 


AFO 13990 


Homo sapiens 


ubiquitin C-terminal hydrolase 


163 


50 


1123 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


94 


55 


1124 


G02490 

• 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6571. 


154 


65 


1125 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


53 


1126 


AF130089 


Homo sapiens 


PRO2550 


76 


72 


1127 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


92 


59 


1128 


Y14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


80 


59 


1129 


AF1 19855 


Homo sapiens 


PR01847 


146 


70 


1130 


AF 194537 


Homo sapiens 


NAG 13 


182 


66 


1131 


L27428 


Homo sapiens 


reverse transcriptase 


173 


38 


1132 


U93570 


Homo sapiens 


putative pi 50 


119 


34 


1133 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


157 


68 


1134 


AJ0048IO 


Zea mays 


cytochrome P450 monooxygenase 


79 


87 


1135 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


70 


1136 


G01502 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5583. 


81 


73 


1137 


ABO 18705 


Mus musculus 


ORF2 


138 


36 


1138 


L20321 


Homo sapiens 


protein serine/threonine kinase 


150 


63 


1139 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


60 


1140 


AF194537 


Homo sapiens 


NAG 13 


115 


33 


1141 


U93564 


Homo sapiens 


putative pi 50 


135 


51 


1142 


D86853 


Catharantbus 
roseus 


extensin 

• 

■ 


142 


37 


1143 


D00570 


Mus musculus 


open reading frame (251 AA) 


213 


50 


1144 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


62 


38 


1145 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


105 

■ 


32 

• 


1146 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


79 


54 


1147 


Y36156 


Homo sapiens 


Human secreted protein #28. 


151 


62 


1148 


A23786 


Beta vulgaris 


chitinase 1 


98 


.37 


1149 


AF129756 


Homo sapiens 


BAT2 


177 


52 


1150 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


147 


73 


1151 


L27428 


Homo sapiens 


reverse transcriptase 


77 


31 


1152 


U34044 


Homo sapiens 


selenium donor protein 


238 


48 


1153 


AK024455 


Homo sapiens 


FLJ00047 protein 


78 


78 


1154 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


197 


67 


1155 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


120 


64 


1156 


U25281 


Rattus 
norvegicus 


SIB domain binding protein 


98 


34 


1157 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


54 



159 



I 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Jdentir 

y 


1158 


AF 194537 


Homo sapiens 


NAG 13 


106 


48 


1159 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


357 


69 


1160 


U93572- 


Homo sapiens 


p40 


89 


47 


1161 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


184 


68 


1162 


G03787 . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


123 


57 


1163 


AF1 16712 


Homo sapiens 


PR02738 


129 


69 


1164 


V00662 


Homo sapiens 


cytochrome oxidase I 


465 


69 


1165 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


118 


57 


1166 


L26163 


Mus museums 


histone Hie 


104 


37 


1167 


X70343 


Nicotiana 
sylvestris 


extensin 


95 


33 


1168 


AF130051 


Homo sapiens 


PRO0898 


117 


43 


1169 


G03714 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


104 


48 


1170 


L27428 


Homo sapiens 


reverse transcriptase 


149 


33 


1171 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4578. 


108 


62 


1172 


AF030277 


Tragelaphus 
spekii 


cytochrome oxidase subunit III 

* 


266 

• 


54 


1173 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


87 


38 


1174 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


168 


45 


1175 


U43627 

• 


Arabidopsis 
thaliana 


extensin 


111 

■ 


42 


1176 


U43627 


Arabidopsis 
thaliana 


extensin 

■ 


98 


29 


1177 


U93565 


Homo sapiens 


putative pi 50 


89 


58 


1178 


J01047 


Caenorhabditis 
elegans 


collagen 


108 


39 


1179 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


170 


68 


1180 


AFO 16099 


Mus museums 


endonuclease/reverse transcriptase 


113 


36 


1181 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


134 


65 


1182 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


62 


1183 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


124 


45 


1184 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


418 


86 


1185 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


106 


41 


1186 


Y86573 


Homo sapiens 

* 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


381 

• 


75 


1187 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


434 


79 


1188 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


75 


35 


1189 


G03789 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7870. 


159 


68 


1190 


G02490 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6571. 


146 


68 


1191 


AF 118086 


Homo sapiens 


PR01992 


A A £ 

146 


81 


1192 


W12842 


Homo sapiens 


Truncated pro-alpha 1(111) chain. 


106 


35 


1193 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


67 


34 


1194 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


116 


74 


1195 


Y30681 


Homo sapiens 


Splice variant ZAP- 1 B protein of the 
human tumor suppressor gene ZAP-1. 


132 


82 



160 



SEQ ID 
NO: 


Accession No. 


Species 

- 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1196 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


75 


48 


1197 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


254 


73 


1198. 


U93570 


Homo sapiens 


p40 


103 


30 


1199 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 

• 


145 


39 


1200 


AK024455 


Homo sapiens 


FLJ00047 protein 


115 


56 


1201 


AF090942 


Homo sapiens 


PRO0657 


88 


64 


1202 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


124 


70 


1203 


Y30822 


Homo sapiens 

• 


Human secreted protein encoded from 
gene 12. 


»3 

• 


46 


1204 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


81 


41 


1205 


U35730 


Mus museums 


jerky 


107 


27 


1206 


U 15647 


Mus museums 


reverse transcriptase 


191 


45 


1207 


U15647 


Mus museums 


reverse transcriptase 


124 


50 


1208 


G0341 1 


Homo sapiens 

* * 


Human secreted protein, SEQ ID NO: 
7492. 


140 


58 


1209 


AF010144 


Homo, sapiens 


neuronal thread protein AD7c-NTP 


91 


54 


1210 


AF1 19900 


Homo sapiens 


PR02822 


160 


81 


1211 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


68 


1212 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 

* 


142 


45 


1213 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


125 


61 


1214 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


102 


61 


1215 


G03560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7641. 


101 


46 


1216 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


270 


58 


1217 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ CD NO:3$7. 


105 


46 


1218 


AC002483 


Homo sapiens 


putative product from mRNA sequence 
CG003 from BRGA2 region; match to 
U50534 (NID:gl685103) 


378 


97 


1219 


AF090895 


Homo sapiens 


PRO0117 


130 


58 


1220 


API 13685 


Homo sapiens 


PRO0974 


117 


60 


1221 


X61295 


Rattus 
norvegicus 


LI retroposon, a portion of its ORF2 
sequence 


126 


.50 


1222 


G03789 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


148 • 


70 


1223 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


99 


56 


1224 


U93574 


Homo sapiens 


putative pl50 


93 


44 


1225 


AF130051 


Homo sapiens 


PRO0898 


133 


69 


1226 


U93563 


Homo sapiens 


putative pi 50 


125 


47 


1227 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase k 
similar to pogo element 


220 


47 


1228 


U93564 


Homo sapiens 


putative pi 50 


116 


47 


1229 


W21733 


Homo sapiens 


NIP- 1 encoded by clone 59. 


138 


63 


1230 


U15647 


Mus museums 


reverse transcriptase 


105 


42 


1231 


U93563 


Homo sapiens 


putative pl50 j 


299 


54 


1232 


R95913 


Homo sapiens 


Neural thread protein. 


138 


51 


1233 


AF 130079 


Homo sapiens 


PR02852 


203 


70 


1234 


X53581 


Rattus 
norvegicus 


ORF3 


106 


60 
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SEQID 
NO: 

, 


Accession No. 


Species 

Ml 


Description 

• 


Smitb- 

Waterman 

Score 


% 

Identit 

y 


1235 


AF1 18086 


Homo sapiens 


PR01992 


144 


81 


1236 


X92485 


Plasmodium 
vivax 


pval 


125 


71 


1237 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


116 


75 


1238 


U93572 


Homo sapiens 


putative pi 50 


133 


40 


1239 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


69 


56 


1240 


AF130089 


Homo sapiens 


PRO2550 


136 


41 


1241 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


109 


53 


1242 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


75 


1243 


G02752 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6833. 


87 


45 


1244 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


128 


'58 


1245 


U93570 


Homo sapiens 


putative pi 50 


161 


50 


1246 


Z70684 


Caenorhabditis 
eiegans * 


F28D1.8 


121 


45 


1247 


AF257305 


Homo sapiens 


ASH1 


576 


89 


1248 


G03263 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7344. 


98 


68 

■ 


1249 


AF025467 


Caenorhabditis 
eiegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


107 


43 


1250 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 

• 


91 


1251 


U63542 


Homo sapiens 


FAP protein 


116 


61 


1252 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


109 


53 


1253 


AF068294 


Homo sapiens 


HDCMB45P 


251 


63 


1254 


AF090895 


Homo sapiens 


PRO0117 


111 


60 


1255 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


201 


75 


1256 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


51 


1257 


AB033032 


Homo sapiens 


KIAA1206 protein 


115 


80 


1258 


• G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


122 


60 


1259 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


137 


68 


1260 


AB032906 


Hylobates 
pileatus 


dopamine receptor D4 


96 


35 


1261 


AF022985 


Caenorhabditis 
eiegans 

- 


Similar to collagen; coded for by C. 
eiegans cDNA yk55£3.3; coded for by 
C. eiegans cDNA yk66d5.3; coded for 
by C. eiegans cDNA yk71e43; coded 
for by C. eiegans cDNA yk55f3 5; 
coded for by C. eiegans cDNA 
yk66d5.5; coded for by C. eiegans 
cDNAyk71e4.5 


106 

•• 

* 


38 

• 

• 


1262 


U93566 


Homo sapiens 


p40 


182 


39 


1263 


L20096 


Manduca sexta 


ribosomal protein s7 


227 


59 


1264 


AF1 19901 


Homo sapiens 


PR02831 


103 


71 


1265 


D87446 


Homo sapiens 


Similar to a C.elegans protein encoded 
in cosmid C27F2 (U40419) 


219 


97 


1266 


D90279 


Homo sapiens 


collagen alpha 1(V) chain precursor 


120 


42 


1267 


L27428 


Homo sapiens 


reverse transcriptase 


111 


41 


1268 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


126 


68 


1269 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


102 


34 



162 



NO: 


Accession i>o» 


opccics 


jli esc n p n vq 


Smith. 

Waterman 
Score 


Idcntit 

y 


1270 


AF 130089 


Homo sapiens 


PRO2550 


96 


69 


1271 


U 12707 


Homo sapiens 


Wiskott-Aldrich syndrome protein 


121 


45 


1272 


AF165310 


Homo sapiens 


ATP cassette binding transporter 1 


243 


100 


1273 


R95913 


Homo sapiens 


Neural thread protein. 


110 


70 


1274 


X92485 


Plasmodium 
vivax 


pval 


106 

* 


65 


1275 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


104 


56 


1276 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


53 


36 


1277 


X03725 


Mus musculus 


ORF 2 (466 aa) 


103 


41 


1278 


U93570 


Homo sapiens 


putative pi 50 


98 


43 


1279 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


159 


83 


1280 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


167 


7 * 


1281 


AJ271871 


Nicotiana 
sylvestris 


putative extensin 


105 


36 


1282 


K03205 


Homo sapiens 


salivary proiine-rich protein precursor 


119 


32 


1283 


AF010144 


Homo sapiens 


neuronal thread protein AD7c~NTP 


124 


66 


1284 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


135 


50 


1285 


A3 1039 


Nicotiana alata 


PRP3 


1 12 


36 


1286 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


116 


72 


1287 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


289 

>< 


67 


1288 


S80119 


Rattus sp. 


reverse transcriptase homolog 


112 . 


33 


1289 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


203 


76 


1290 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


53 


1291 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


128 


82 


1292 


AF130089 


Homo sapiens 


PRO2550 


127 


62 


1293 


AF003535 


Homo sapiens 


ORF2-like protein 


101 


48 


1294 


Y19610 


Homo sapiens 


SEQ ID NO 328 from W09922243. 


100 


42 


1295 


L27428 


Homo sapiens 


reverse transcriptase 


126 


36 


1296 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


140 


71 


1297 


L24433 


Oncorhynchus 
mykiss 


complement component C3 


359 


31 


1298 


AC004381 


Homo sapiens 


SA gene 


443 


55 


1299 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


180 


64 


1300 


Y14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


135 


66 


1301 


D84391 


Mus musculus 


reverse transcriptase 


106 


48 


1302 


D13951 


Nicotiana 
tabacum 


extensin precursor 


134 


42 


1303 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


73 


1304 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


93 


38 


1305 


Y13620 


Homo sapiens 


BCL9 


102 


39 


1306 


U93567 


Homo sapiens 


putative pi 50 


245 


63 


1307 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


130 


70 


1308 


AL355774 


Streptomyces 
coelicolor A3(2) 


putative integral membrane protein 


136 


40 


1309 


W54966 


Homo sapiens 


Synthetic human type III collagen 


124 


41 



163 
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SEQDD 
NO: 


1 Accession No* 


Species 


Description 


Sraitb- 
W a terra an 
Score 


Identit 

y 




i 


i 


SYN-C3. 






1310 


M20670 


Plasmodium 
vivax 


circumsporozoite protein 


107 


34 


1311- 

■| 


AF151366 


Arabidopsis 
tbaliana 


arginine/serine-rich protein 


114 


36 


1312 


G03099 

■ 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7180. 


76 


43 


1313 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
.7339. 


109 


85 


1314 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


111 


37 


1315 


G02485 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6566. 


123 


62 


1316 


R95913 


Homo sapiens 


Neural thread protein. 


98 


58 


1317 


API 13685 


Homo sapiens 


PRO0974 


170 


47 


1318 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


122 


61 


1319 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


99 


68 


1320 


AJ242540 

* 


Vol vox carteri f. 

• 

nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


270 


5* 


1321 


U93569 


Homo sapiens 


putative pi 50 


124 


37 


1322 


AF090931 


Homo sapiens 


PRO0483 


111 


85 


1323 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


85 


44 


1324 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 

. « 


131 


43 


1325 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


144 


54 


1326 


AF194537 


Homo sapiens 


NAG 13 


125 


49 


1327 


L27428 


Homo sapiens 


reverse transcriptase 


111 


45 


1328 


U93568 


Homo sapiens 


putative pi 50 


112 


30 


1329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


110 


51 


1330 


L27428 


Homo sapiens 


reverse transcriptase 


142 


53 


1331 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


98 


88 


1332 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


55 


1333 


X71602 


Nicotiana 
tabacura 


extensin — 


113 


35 


1334 


G03133 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
7214. 


129 


41 


1335 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: . 
6913. 


112 


50 


1336 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


102 


68 


1337 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


95 


51 


1338 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


114 


60 


1339 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


142 


55 


1340 


X55685 

• 


Lycopersicon 
esculentum 


extensin (class I) 


123 


31 


1341 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


118 


70 


1342 


X71629 


Mus museums 


msgl 


106 


57 


1343 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


105 


79 
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SEOID 
NO: 


Accession No. 


Species 


D esc notion 


SmiuV 

Waterman 

Score 


% 

Identit 

y 


1344 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7516. 


116 


85 


1345 


AF161356 


Homo sapiens 


HSPC093 


88 


88 


1346 


JO 1435 


Rattus 
norvegicus 


ATPase 


348 


66- 


1347 


U93563 


Homo sapiens 


putative pi 50 


142 


36 


1348 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


87 

- 


66 


1349 


AF090942 


Homo sapiens 


PRO0657 


152 


54 


1350 

• 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


143 


63 


1351 


U93572 


Homo sapiens 


putative pi 50 


113 


84 


1352 


X92485 


Plasmodium 
vivax 


pval 


130 


70 


1353 


X61047 


Hydra sp. 


mini-collagen 


105 


36 


1354 


AFU5549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


128 


46 


1355 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


91 


39 


1356 


AF194537 


Homo sapiens 


NAG 13 


148 


63 


1357 


AF 130079 


Homo sapiens 


PR02852 


139 


73 


1358 


X53581 


Rattus 
norvegicus 


ORF4 


208 


43 


1359 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


195 


50 


1360 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


328 

* 


69 


1361 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


126 


65 


1362 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


97 


47 


1363 


X53581 


Rattus 
norvegicus 


ORF4 


110 


35 


1364 


U93569 


Homo sapiens 


putative pi 50 


123 


.41 


1365 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


114 


63 


1366 


X61294 


Rattus 
norvegicus 


L 1 retroposon, a portion of its ORF2 
sequence 


153 


43 


1367 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


69 


80 


1368 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


80 


46 


1369 


G04067 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
8148. 


113 


26 

• 


1370 


X92485 


Plasmodium 
vivax 


pval 


106 


46 


1371 


U90946 


Dictyostelium 
discoideum 


myosin heavy chain kinase B 


114 


62 


1372 


L27428 


Homo sapiens 


reverse transcriptase 


98 


61 


1373 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


124 


80 

. 


1374 


U49974 


Homo sapiens 


mariner transposase 


137 


57 


1375 


AC004891 


Homo sapiens 


contacthvlike; similar to U87224 
(PID:gl857710) 


234 


58 


1376 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


132 


60 


1377 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


317 


51 


1378 


AF138082 


Homo sapiens 


PRO1902 


102 


42 


1379 


AF1 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


186 


45 


1380 


U93567 


Homo sapiens 


putativepl50 


116 


38 


1381 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


218 


47 
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SEQ ED 
NO: 


Accession No. 


Species 

•** 


Description 


Smith- 

Waterman 

Score 


% 

Identit • 

y 




• 




similar to pogo element 






1382 


B08918 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:75. 


87 . 


66 


1383 


AF090895 


Homo sapiens 


PRO0117 


68 


82 


1384 


U93570 


Homo sapiens 


putativepl50 


178 


39 


1385 


G03133 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


132 


43 


1386 


AF130089 


Homo sapiens 


PRO2550 


142 . 


35 


1387 


L27428 


Homo sapiens . 


reverse transcriptase 


163 


49 


1388 


X61296 


Rattus 
norvegicus 


open reading frame 2 


123 


44 


1389 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


138 


67 


1390 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


147 


57 


1391 


U93570 


Homo sapiens 


putative pi 50 


110 


28 


1392 


R95913 


Homo sapiens 


Neural thread protein. 


104 


35 


1393 


L27428 


Homo sapiens 


reverse transcriptase 


101 


39 


1394 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


147 


47 


1395 


AF2 16972 


Homo sapiens . 


p8 protein 


118 


49 


1396 


U49973 


Homo sapiens 

■ 


ORF1 ; MER37; putative transposase 
similar to pogo element 


160 

- 


58 


1397 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


312 


62 


1398 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


?7 


62 


1399 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


54 


1400 


X92485 


Plasmodium 
vivax 


pval 


124 


39 


1401 


U93563 


Homo sapiens 


putative pi 50 


131 


36 


1402 


X67863 


Mus musculus 


T2 


160 


48 


1403 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


94 


39 


1404 


G03714 

- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


84 


73 


1405 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 


71 


1406 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


138 


31 


1407 


AF 134304 


Homo sapiens 


Scar2 


118 


40 


1408 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


121 


82 


1409 


U93574 


Homo sapiens 


putative pi 50 


179 


43 


1410 


U93563 


Homo sapiens 


putative pi 50 


98 


43 


1411 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


124 


46 


1412 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


149 


56 


1413 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


148 


48 


1414 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


155 


82 


1415 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7348. 


92 


52 


1416 


AF1 19855 


Homo sapiens 


PR01847 


82 


70 


1417 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


34 


1418 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


99 


39 


1419 


AF1 30079 


Homo sapiens 


PR02852 


114 


69 


1420 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


133 


52 
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SEQ ID 
NO: 


\ Accession No. 


Species 


Description 

• 


SraiuV 

Wflterman 

Score 


% 

Identit 

y 


4 A\ 0^ 4 

1421 


RJ3556 


Homo. salens 


Protein encoded downstream of hhc_M 
oncoprotein. 


115 


44 


1422 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


111 


11 


1423 




Homo sapiens 


Neural thread protein. 


128 


80 


1424 


L26953 


Homo sapiens 


chromosomal protein 


1 A /I 

104 


34 


1425 


Uo32oU 


Leisnmama 
donovani 


39 KDa antigen 


1 AC 

105 


51 


1426 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


98 


15 


1427 


U83303 


IT " _ 

Homo sapiens 


line-1 reverse transenptase 


149 


A% /\ 

40 


1428 


AF090895 


Homo sapiens 


PRO0117 


111 


75 


4 At r\ f\ 

1429 


AF1 19855 


Homo sapiens 


PRO 1847 


O. ft 

88 


56 


1430 


AF229067 


Homo sapiens 


PADI-H protein 


I57 


51 


1431 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


509 


79 


1432 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


333 


62 


1433 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


149 


57 


1434 


AF161356 


Homo sapiens 


HSPC093 


1 80 


46 


1435 


U93570 


Homo sapiens 


putative pi 50 


1 16 


48 


1436 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


171 


73 


1437 


U83280 


Leishmania 
donovani 


39 kDa antigen 


106 


80 


1438 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


103 


73 


1439 


U 15647 


Mus musculus 


reverse transcriptase 


233 


44 


1440 


L27428 


Homo sapiens 


reverse transcriptase 


78 


40 


1441 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


115 


68 


1442 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


77 


63 


1443 


M22332 


Homo sapiens 


unknown protein 


153 


62 


1444 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


40 


1445 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


101 


66 


1446 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


125 


74 


1447 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


90 


66 


1448 


X76208 


Drosophila 
melanogaster 


protein 33-specific exons 


123 


48 


1449 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


178 


79 


1450 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


109 


45 


1451 


S80119 


Rattus sp. 


reverse transcriptase homolog 


115 


54 


1452 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


70 


63 


1453 


U42471 


Mus musculus 

■ 


Wiscott-Aldrich Syndrome protein 
homolog 


105 


40 


1 AC A 

1454 


A T/TIA A ACC 

AK024455 


Homo sapiens 


1UUU047 protein 


1 AA 

109 


53 


1455 


AC007258 


Arabidopsis 
thaliana 


Hypothetical protein 


105 


37 


1456 


AF1 94537 


Homo sapiens 


NAG 13 


208 


52 


1457 


U63542 


Homo sapiens 


FAP protein 


111 


84 


1458 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


111 


65 


1459 


AF09093 1 


Homo sapiens 


PRO0483 


84 


43 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Ideotit 

y 


1460 


tine n i «■> 

R95913 


Homo sapiens 


Neural thread protein. 


106 


69 


1461 


R95913 


Homo sapiens 


Neural thread protem. 


109 


A\ #\ 

40 


1462 


U93564 


Homo sapiens 


putative pi 50 


237 


42 


1463 


AB029309 


Homo sapiens 


Npw38-binding protein NpwBP 


97 


37 


« At A*~ A 

1464 


U44838 


Glycine max 


A * 

extensm 


g** 

97 


33 


1465 


AL050341 

■ 


Homo sapiens 

* 


dJ39G22. 1 (rearranged L-myc fusion ,. 
sequence (ZN-15 related zmc finger 
protein)) 


121 

• 


45 


1466 


L27428 


Homo sapiens 


reverse transcriptase 


94 


34 


a a a* mm 

1467 


AF010144 


Homo sapiens 


neuronal thread protem AD7c-NTP 


117 


58 


1468 


Y17221 


Homo sapiens 


Human secreted protein (clone flc317~ 
3). 


98 


48 


1469 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


123 


55 


1470 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


147 


67 


1471 


AF 109907 


Homo sapiens 


S164 


133 


35 


1472 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


142 


36 


1473 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


152 


90 


1474 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


157 


45 


1475 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


398 


58 


1476 


U49974 


Homo sapiens 


mariner transposase 


201 


59 


1477 


U49974 


Homo sapiens 


mariner transposase 


206 


60 


1478 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


117 


72 


1479 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


146 


82 


1480 


U93567 


Homo sapiens 


putative pi 50 


202 


42 


1481 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


101 


46 


1482 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


100 


37 


1483 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


124 


75 


1484 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


108 


62 


1485 


U 15647 


Mus musculus 


reverse transcriptase 


115 


73 


1486 


AF 194537 


Homo sapiens 


NAG 13 


132 


42 


1487 


Ml 1902 


Mus musculus 


proline-rich salivary protein 


118 


40 


1488 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


102 


64 


1489 


AF009668 

■ 


multiple 

sclerosis 

• * 
associated 

retrovirus 


polyprotein 

• 


110 


48 

• 


1490 


AK023542 


Homo sapiens 


unnamed protem product 


114 


37 


1491 


G03786 


WW • 

Homo sapiens 


Human secreted protem, SEQ ID NO: 
7867. 


133 


50 


I4y2 




Homo sapiens 


UKr i ; MbK37; putative transposase 
similar to pogo element 


id a 

184 


A*W 

47 


1493 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


222 


53 


1494 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


50 


1495 


AF109907 


Homo sapiens 


S164 


259 


45 


1496 


R95913 


Homo sapiens 


Neural thread protein. 


110 


51 


1497 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


299 


80 
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NO: 


Accession No. 


Species 


Description 


Smith- 

Watcnnan 

Score 


% 

Identit 

y 


1498 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


158 


65 


1499 


L26953 


Homo sapiens 


chromosomal protein 


104 


67 


1500 


AF090895 


Homo sapiens 


PRO0117 


145 


68 


1501 


U93572 


Homo sapiens 


p40 


115 


42 


1502 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


133 


50 


1503 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


143 


56 


1504 


G00328 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


119 


69 

■ 


1505 


W00838 


Homo sapiens 

* 


Tumour necrosis factor-related gene 
product 


105 


52 


1506 


AF109907 


Homo sapiens 


S164 


184 


43 


1507 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


288 


67 


1508 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


131 


69 


1509 


AK000241 


Homo sapiens 


unnamed protein product 


167 


72 


1510 


G02532 , 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6613. 


153 

. 


60 

* 


1511 


S62928 


Homo sapiens 


PRB1M protein precursor 


157 


39 


1512 


ABO 12223 


Canis familiaris 


ORF2 


116 


40 


1513 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


109 


59 


1514 


AF220264 


Homo sapiens 


MOST-1 


108 


80 


1515 


X53581 


Rattus 
norvegicus 


ORF4 


96 


44 


1516 


V00662 


Homo sapiens 


cytochrome oxidase III 


433 


74 


1517 


U49973 


Homo sapiens 

• 


ORF1; MER37; putative transposase 
similar to pogo element 


263 


70 

• 


1518 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


266 


70 


1519 


M24732 


Homo sapiens 


lam in-like protein 


107 


44 


1520 


G03133 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


131 

- • 


51 

• 


1521 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


81 


68 


1522 


AF194537 


Homo sapiens 


NAG13 


116 


42 


1523 


X92485 


Plasmodium , 
vivax 


pval 


85 


42 


1524 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


125 


50 


1525 


AB041228 


Homo sapiens 


G protein-coupled receptor TGR-1 


220 


100 


1526 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


70 


1527 


U52077 


Homo sapiens 


mariner transposase 


237 


56 


1528 


L27428 


Homo sapiens 


reverse transcriptase 


189 


40 


1529 


Y86573 

• 


Homo sapiens 

• 


, Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


355 

- 


78 


1530 


L13610 


Mus museums 


IFN-response element binding factor 2 


90 


37 


1531 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


90 


64 


1532 


U11288 


Drosophila 
melanogaster 


diaphanous protein 


138 


38 


1533 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


429 


63 


1534 


L27428 


Homo sapiens 


reverse transcriptase 


249 


55 


1535 


U93570 


Homo sapiens 


putative pi 50 


114 


31 


1536 


AF130089 


Homo sapiens 


PRO2550 


111 


47 
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Smith- 

Waterman 

Score 


% 1 

Identit 

y 


1537 


X55687 


Lycopersicon 
esculentum 


extcnsin (class II) 


63 


28 


1538 


U 15647 


Mus musculus 


reverse transcriptase 


no 


42 


1539 


AK024455 


Homo sapiens 


FLJ00047 protein- 


139 


80 


1540 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


128 


48 


1541 . 


AC024788 


Gaenorhabditis 
elegans 


Hypothetical protein Y46E12A.d 


80 

• 


46 


1542 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


132 


73 


1543 


W50192 

• 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


90 


32 


1544 


G02971 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7052. 


71 


63 


1545 


R95913 


Homo sapiens 


Neural thread protein. 


118 


55 


1546 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


218 


78 


1547 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


370 


71 


1548 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


119 


65 


1549 


AJ004810 


Zea mays 


cytochrome P450 monooxygenase 


140 


70 


1550 


AF113685 


Homo sapiens 


PRO0974 


115 


47 


1551 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


172 

■ 


75 


1552 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


366 


94 


1553 


W40113 


Homo sapiens 


Human alpha-2(IV) collagen protein. 


117 


59 


1554 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


138 


76 


1555 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


113 


40 


1556 


D38U3 


Pan troglodytes 


cytochrome c oxidase subunit 1 


502 


89 


1557 


AF1 30089 


Homo sapiens 


PRO2550 


98 


55 


1558 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA3563 8. 


109 


43 


1559 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


183 


52 


1560 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


180 

•- 


52 


1561 


AF202893 


Mus musculus 


KiGlb 


254 


85 


1562 


M63421 


Drosophila 
melanogaster 


csp32 


104 

- 


39 


1563 


G03652 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7733. 


129 

• 


69 


1564 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


98 


59 


1565 


AJ132106 


Bostaurus 


SCO-spondin 


114 


40 


1566 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 

■ 


119 


66 


1567 


AF161356 


Homo sapiens 


HSPC093 


100 


38 


1568 


AF1 19851 


Homo sapiens 


PR01722 


94 


72 


1569 


L27428 


Homo sapiens 


reverse transcriptase 


107 


48 


1570 


X99451 


Lycopersicon 
esculentum 


extensm-like protem DiflO 


104 


32 


1571 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


126 


59 


1572 


X73481 


Drosophila hydei 


mstl01(2) 


105 


41 


1573 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein Kl 0 (NID:g8 148) 


133 


50 


1574 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


51 



170 



SEO ID 
NO: 


* Accession No 


Soecies 

* 


vcscnpuuu 


Smith- 

Waterman 

Score 


Identit 

y 




i 


1 


8144. 






1575 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


323 


83 


1576 


AF062008 


Caenorhabditis 
elegans — 


unknown 


111 


54 


1577 


X92485 


Plasmodium 
vivax 


pval 


81 


57 


1578 


U93570 


Homo sapiens 


p40 


102 


33 


1579 


AF090944 


Homo sapiens 


PRO0663 


132 


59 


1580 


AL 137798 


Homo sapiens 


dJl 182A14.5.1 (novel gene (isoform 
1)) 


182 


53 


1581 


X92485 


Plasmodium 
vivax 


pval 


120 

* 


44 


1582 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


160 


47 


1583 


G02532 

- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


135 


54 


1584 

• 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


119 


52 

• 


1585 


Y02749 

• 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


82 


55 


1586 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


152 


60 


1587 


D28482 


Homo sapiens 


SCR2 


390 


83 


1588 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


122 . 


59 


1589 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


117 


50 


1590 


AF 118078 


Homo sapiens 


PR01848 


118 


59 


1591 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


56 


78 


1592 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


118 


80 


1593 


U93571 


Homo sapiens 


p40 


170 


77 


1594 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


230 


51 


1595 


L27428 


Homo sapiens 


reverse transcriptase 


138 


45 


1596 

- 


X97707 


Pongopygmaeus 
abelii 


stopcodon created by 
posttranscriptional polyadenylation 


139 


61 


1597 


X98710 


Homo sapiens 


COL1A1 and PDGFB fusion transcript 


107 


31 


1598 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


97 


58 


1599 


AF210651 


Homo sapiens 


NAG 18 


86 


89 


1600 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


148 


48 


1601 


J03770 


Mus musculus 


homeobox protein 


99 


35 


1602 


AFH9901 


Homo sapiens 


PR02831 


119 


56 


1603 


AL031673 


Homo sapiens 


dJ694B14.1 (PUTATIVE novel KRAB 
box protein with 18 C2H2 type Zinc 
finger domains) 


233 


44 


1604 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


66 


1605 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7348. 


89 

• 


44 

■ * 


1606 


D88461 


Rattus rattus 


N-WASP 


123 


43 


1607 


AF090942 


Homo sapiens 


PRO0657 


107 


61 


1608 


U35730 


Mus musculus 


jerky 


105 


34 


1609 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


108 


45 


1610 


B06334 


Homo sapiens 


Human subtilisin-kexin isoenzyme 1. 


474 


84 


1611 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


101 


90 


1612 


AK024455 


Homo sapiens 


FLJ00047 protein 


83 


55 


1613 


D86853 


Catharanthus 


extensin 


123 


39 
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Species 


bescri prion 


Smith- 
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% 

Identit 

y 




t 


roseus. 








1614 


API 19851 


Homo sapiens 


PR01722 


135 


59 


1615 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


142 


57 


1616 


AF1 94537 


Homo sapiens 


NAG 13 


154 


63 


1617 


AF 1 19851 


Homo sapiens 


PR01722 


91 


62 


1618 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 

• 


48 


1619 


AK024455 


Homo sapiens 


FLJ00047 protein 


147 


60 


1620 


AF2 17973 


Homo sapiens 


unknown 


116 


67 


1621 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


102 

* 


41 


1622 


K02576 


Homo sapiens 


salivary proline-rich protein I 


108 


40 


1623 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
S148. 


135 


44 


1624 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


322 


62 


1625 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


356 


72 


1626 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


113 


65 


1627 


X92485 


Plasmodium 
vivax 


pval 


90 


45 


1628 


AF090895 


Homo sapiens 


PRO0117 


156 


61 


1629 


AF1 16661 


Homo sapiens 


PR01438 


87 


54 


1630 


M13100 


Rattus 
norvegicus 


unknown protein 

* 


109 


76 


1631 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


147 


60 


1632 


AF119851 


Homo sapiens 


PR01722 


107 


70 


1633 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


109 


46 


1634 


L27428 


Homo sapiens 


reverse transcriptase 


109 


38 


1635 


AF 11 8082 


Homo sapiens 


PRO1902 


80 


40 


1636 


R95913 


Homo sapiens 


Neural thread protein. 


118 


88 


1637 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


79 


60 


1638 


U93570 


Homo sapiens 


putative pi 50 


128 


54 


1639 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


127 


69 


1640 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


171 


68 


1641 


AF194537 


Homo sapiens 


NAG 13 


140 


66 


1642 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


101 


50 


1643 


M64793 

■ 


Rattus 
norvegicus 


salivary proline-rich protein 


117 


33 


1644 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


50 


1645 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


70 


1646 


L27428 


Homo sapiens 


reverse transcriptase 


86 


84 


1647 


X92485 


Plasmodium 
vivax 


pval 


137 


40 


1648 


U15647 


Mus musculus 


reverse transcriptase 


93 


68 


1649 


KQ2576 


Homo sapiens 


salivary proline-rich protein 1 


131 


41 


1650 


AF1 16712 


Homo sapiens 


PR02738 


107 


57 


1651 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


139 


72 . 


1652 


X05472 


Rattus 


ORF3 


84 


51 



172 



1 
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Species 
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Smitfa- 
W a term an 
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Identit 

y 






■ 

norvegicus 








1653 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


122 


71 


1654 


U93566 


Homo sapiens 


p40 


117 


52 


1655 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


141 


70 


1656 


AF090895 


Homo sapiens 


PRO0U7 


125 


60 


1657 


X92485 


Plasmodium 
vivax 


pval 


114 


45 


1658 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


192 


61 


1659 


Y86473 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:388. 


73 


30 


1660 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


331 


74 


1661 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


127 


69 


1662 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


312 


67 


1663 


S80119 


Rattus sp. 


reverse transcriptase homolog 


99 


59 


1664 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


106 


45 


1665 


M76729 


Homo sapiens 


pro-alpha- 1 type V collagen 


172 


47 


1666 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


97 


54 


1667 


AF169388 


Mus musculus 


alpha 4 collagen TV 


84 


38 


1668 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


84 

, 


66 


1669 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


428 


80 


1670 


M29622 


Mus musculus 


open reading frame 2 


74 


46 


1671 


W90838 


Homo sapiens 


Human lymphocyte targeted peptide 
#6. 


98 

... 


100 

■ 


1672 : 


G00328 

i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


55 


58 


1673 


AP090931 


Homo sapiens 


PRO0483 


72 


39 


1674 


AF051782 


Homo sapiens 


diaphanous 1 


116 


49 


1675 


U57361 


Rattus 
norvegicus 


collagen XII alpha 1 

... 


108 


48 


1676 


AF1 82844 


Homo sapiens 


VPS28 protein 


395 


95 


1677 


L27428 


Homo sapiens 


reverse transcriptase 


189 


47 


1678 


G02532 


Homo sapiens 

• - 


Human secreted protein, SEQ ID NO: 
6613. 


142 


53 


1679 


U93565 


Homo sapiens 


putative pi 50 


214 


40 


1680 


. AK002129 


Homo sapiens 


unnamed protein product 


128 


57 


1681 


X03145 


Homo sapiens 


potORFV 


93 


48 


1682 


X63005 


Mus musculus 


proline-rich protein 


108 


38 


1683 


AF 118082 


Homo sapiens 


PRO1902 


117 


42 


1684 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


157 


58 


1685 


R95913 


Homo sapiens 


Neural thread protein. 


92 


66 


1686 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


111 


*1 


1687 


X61296 


Rattus 
norvegicus 


open reading frame 2 


104 


38 


1688 


AB012223 


Canis farailiaris 


ORF2 


98 


39 


1689 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


278 


72 


1690 


U52077 


Homo sapiens 


mariner transposase 


175 


56 


1691 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


83 


46 



173 



SEOlD 
NO: 


s Accession No* 


Sdccmb 


D csc notion 


Smith- 

Waterman 
Score 


w 

Idcntit 

y 1 


1692 


AF061128 


Plasmodium 
falciparum 


merozoite surface protein 1 


85 


44 


1693 


X77722 


Homo sapiens 


interferon alpha/beta receptor 


89 


60 


1694 

» 


M13100 


Rattus 
norvegicus 


unknown protein 


94 


40 


1695 


AF202635 


Homo sapiens 


PP1200 


114 


60 


1696 


AK001116 


Homo sapiens 


unnamed protein product 


127 


53 


1697 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


162 


53 


1698 


AF 11 8078 


Homo sapiens 


PRO 1848 


93 


42 


1699 


X92485 


Plasmodium 
vivax 


pval 


147 


49 


1700 


M63819 


Plasmodium 
falciparum 


malaria antigen 

• 


101 


64 


1701 


AF090930 


Homo sapiens 


PRO0478 


146 


76 


1702 


AB009993 


Mus musculus 


collagen al(V) 


94 


40 


1703 


AL3901 14 


Leishrnania 
major 


extremely cysteine/valine rich protein 


169 


66 


1704 


AF 130089 


Homo sapiens 


PRO2550 


145 


38 


1705 


X83413 , 


Human 
herpesvirus 6 


U88 


113 


58 


1706 


S60088 


Homo sapiens 


putative adhesion molecule=ADMLX 


151 


86 


1707 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


176 


52 


1708 


AF130079 


Homo sapiens 


PR02852 


174 


41 


1709 


Y28682 


Homo sapiens 


Human pp392_3 secreted protein. 


557 


99 


1710 


M14423 


Mus musculus 


pro-alpha- 1 type I collagen 


112 


34 


1711 


D13623 


Rattus sp. 


p34 protein 


128 


45 


1712 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


84 


41 


1713 


X97675 


Homo sapiens 


plakophiiin 2b 


121 


60 


1714 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


102 


51 


1715 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element . 


171 


48 


1716 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


99 


63 


1717 


S80119 . 


Rattus sp. 


reverse transcriptase homolog 


140 


57 


1718 


L27428 


Homo sapiens 


reverse transcriptase 


103 


46 


1719 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


154 


78 


1720 


AF130089 


Homo sapiens 


PRO2550 


106 


35 


1721 


G03652 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7733. 


147 


49 


1722 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


111 


72 


1723 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


132 


60 


1724 

• 


X92485 


Plasmodium 
vivax 


pval 

• 


96 

- 


62 

- 


1725 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


249 


52 


1726 


M81321 


Macaca 
fascicular is 


proline-rich protein 


132 


45 


1727 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


119 


69 


1728 


U93564 


Homo sapiens 


p40 


129 


58 


1729 


U93574 


Homo sapiens 


putative pl50 


113 


76 


1730 


AF 130089 


Homo sapiens 


PRO2550 


136 


61 


1731 


L05608 


Cercopithecine 


glycoprotein gl 


100 


46 



174 



WO 01/64835 PCT/US01/04927 

» 



NO: 


1 Accession No. 


Species 

* 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 




i 


herpesvirus 2 








1732 


U15647 


Mus musculus 


reverse transcriptase 


138 


36 


1733 


U93574 


Homo sapiens 


putative pi 50 


187 


43 


1734 

< 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


155 


64 


1735 


Y02999 


Homo sapiens 

• 


Fragment of human secreted protein 
encoded by gene 121. 


102 


64 


1736 


Y02999 


Homo sapiens . 


Fragment of human secreted protein 
encoded by gene 121. 


118 


64 


1737 


G00490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4571. 


110 


58 


1738 


AF090942 


Homo sapiens 


PRO0657 


163 


55 


1739 


U11288 


Drosophila 
melanogaster 


diaphanous protein 


149 


46 


1740 


L27428 


Homo sapiens 


reverse transcriptase 


108 


33 


1741 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. • 

* 


76 


54 


1742 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


165 


59 


1743 


AC003682 


Homo sapiens 


R28830 1 


179 


63 


1744 


X65165, 


Volvox carteri 


extensin 


173 


49 


1745 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


114 


62 


1746 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720, 


89 


41 


1747 


G03787 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


115 


42 


1748 


U93565 


Homo sapiens 


putative pl50 


125 


37 


1749 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


121 


45 


1750 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


139 


53 


1751 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


87 


80 


1752 


L27428 


Homo sapiens 


reverse transcriptase 


133 


42 


1753 


U93570 


Homo sapiens 


putative pi SO 


126 


57 


1754 


U22376 


Homo sapiens 


alternatively spliced product using exon 
13A 


134 


50 


1755 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


175 

* 


76 


1756 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


113 


60 


1757 


U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


158 


44 


1758 


Y86579 


Homo sapiens 


Human gene 92-encoded protein 
fragment, SEQ ID NO:496. 


109 


88 


1759 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


123 

• 


73 


1760 


Y86579 


Homo sapiens 


Human gene 92-encoded protein 
fragment, SEQ ID NO:496. 


113 


92 


1761 


Y86579 


Homo sapiens 


Human gene 92-encoded protein 
fragment, SEQ ID NO:496. 


110 


81 


1762 


U08020 


Mus musculus 


collagen pro-alpha-1 type I chain 


105 


34 


1763 


GO3790 


Homo sapiens • 


Human secreted protein, SEQ ID NO: 

MAM 4 

7871. 


90 


45 


1764 


U93569 


Homo sapiens 


putative pl50 


148 


36 


1765 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO.163. 


116 


45 


1766 


X02873 


Daucus carota 


put precursor 


112 


47 


1767 


X92485 


Plasmodium 
vivax 


pval 


100 


45 


1768 


R95913 


Homo sapiens 


Neural thread protein. 


96 


65 



175 



WO 01/64835 PCT/US01/04927 

• 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1769 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


51 


1770 


X97675 


Homo sapiens 


plakophilm 2b 


115 


70 


1771 


G00637 


Homo sapiens 


Human secreted protem, SEQ ID NO: 

Any O 

4718. 




58 


1772 


G02532 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6613. 


131 


85 


1773 


AF 13 0089 


Homo sapiens 


PRO2550 


ICO 

158 


69 


* ***** > 

1774 


U23552 


Ailuropoda 
melanoleuca 


cytochrome b 


206 


78 


1775 


AF010144 


Homo sapiens 


neuronal thread protem AD7C-NTP 


98 


53 


1776 


AF1I67I5 


Homo sapiens 


PR02829 


134 


67 


1777 


AC008054 


Leishmania 

• 

major 


L8453.1 


114 


28 


1778 


AF037364 


Homo sapiens 


paraneoplastic neuronal antigen MAI 


397 


73 


1779 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


60 


1780 


G02480 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6561. 


168 


64 


1781 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


80 


36 


1782 


AF174482 


Homo sapiens 


polycomb 3 


133 


46 


1783 


U93563 


Homo sapiens t 


putative pi 50 


196 


70 


1784 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


59 


1785 


AF194537 


Homo sapiens 


NAG13 j 


114 


38 


1786 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


120 


44 


1787 


AF200187 


cercopithicine 
herpesvirus 15 


EBNA2-like protein 


108 


30 


1788 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


97 


56 


1789 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


128 


40 


1790 


X92485 


Plasmodium 
vivax 


pval 


98 


51 


1791 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1 14 


34 


1792 


U49973 

■ 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


197 

- 


81 

... . . 


1793 


G00328 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
4409. 


145 


71 


1794 


AF 104923 


Homo sapiens 


putative transcnption factor 


142 


59 


1795 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


133 


60 


1796 


AC003113 

■ 


Arabidopsis 
thaliana 


F240L6 


57 


62 


1797 


M22332 


Homo sapiens 


unknown protem 


118 


29 


1798 


Y02693 


Homo sapiens 


Human secreted protem encoded by 
gene 44 clone HTDAD22. 


100 


71 


1799 


AL390114 


Leishmania 
major 


extremely cysteme/valine rich protem 


154 


37 


1800 


U93570 


— — — : 

Homo sapiens 


p40 


103 


56 


1801 


X99452 


Lycopersicon 
escuientum 


extensin-like protein Dif54 


101 


28 


1802 


L27428 


Homo sapiens 


reverse transcriptase 


102 


34 


1803 


L27428 


Homo sapiens 


reverse transcriptase 


141 


43 


1804 


Ml 8933 


Mus musculus 


alpha- 1 type-Ill collagen precursor 


118 


30 


1805 


X92485 


Plasmodium 
vivax 


pval 


106 


67 


1806 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


118 


65 



176 



WO01/6483S PCT/US01/04927 



SEQID 
NO: 


* Accession No. 


Species 


Description 


&nith- 

Waterman 

Score 


Identit 

y 


1807 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


105 


33 


1808 


X97675 


Homo sapiens 


plakophilin 2b 


154 


61 


1809 


G03790 


Homo sapiens 

>• ■ 


Human secreted protein* SEQ ID NO: 
7871. 


130 


53 


1810 


X92485 


Plasmodium 
vivax 


pval 


133 


54 


1811 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


121 


66 


1812 

• 


Y17833 

■ 


Human 
endogenous 
retrovirus K 


env protein 


119 

- 


81 

■ 


1813 


AF1 19851 


Homo sapiens 


PRO 1722 


130 


58 


1814 


X53581 


Rattus 
norvegicus 


ORF4 


158 


50 


1815 


G03473 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7554. 


111 


74 


1816 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


153 


68 


1817 


M19155 


Plasmodium 
falciparum 


S-antigen precursor 


164 


50 


1818 


AF1 18082 


Homo sapiens 


PRO1902 


90 


75 


1819 


W40353 


Homo sapiens 

• 


Human unspecified protein from 
US5702907. 


110 


52 


1820 


U93563 


Homo sapiens 


putative pi 50 


114 


35 


1821 


U41538 


Caenorhabditis 
elegans 


proline rich 


95 


52 


1822 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


154 


45 


1823 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


53 


1824 


AF130089 


Homo sapiens 


PRO2550 


128 


40 


1825 


AF090944 


Homo sapiens 


PRO0663 


103 


45 


1826 


AC003113 


Arabidopsis 
thaliana 


F2401.18 


107 


40 


1827 


AF1 94537 


Homo sapiens 


NAG 13 


85 


28 


1828 


AF009668 


multiple 
sclerosis 
associated 
retrovirus 


polyprotein 


185 

• 


41 

* 


1829 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


155 


42 


1830 


X69465 


Sus scrofa 


ryanodine receptor 1 


516 


86 


1831 

• 


G00454 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


108 


.40 

- 


1832 


U88966 


Homo sapiens 


rapamycin associated protein FRAP2 


434 


89 


1833 


M19155 


Plasmodium 
falciparum 


S-antigen precursor 

■ 


105 

- • 


32 


1834 


AF085809 


Mus musculus 


synapsin lb 


98 


33 


1835 


AK023003 


Homo sapiens 


unnamed protein product 


393 


81 


1836 


Y41740 


Homo sapiens 


Human PRO701 protein sequence. 


429 


78 


1837 


M36913 


Zeamays 


cell wall protein (put.); putative 


72 


35 


1838 


X63005 


Mus musculus 


proiine-rich protein 


98 


40 


1839 


X83413 


Human 
herpesvirus 6 


U88 


149 


45 


1840 


AF134304 


Homo sapiens 


Scar2 


87 


37 


1841 


AC024772 


Caenorhabditis 
elegans 


contains similarity to Mus musculus 
alpha-NAC, muscle-specific form 
(GB:U48363) 


131 


25 


1842 


AB002366 


Homo sapiens 


KIAA0368 


153 


75 


1843 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


140 


47 



177 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman • 
Score 


Identit 

y 


1844 


AL035526 


Arabidopsis 
thaliana 


extensin-like protein 


93 


33 


1845 


D26156 


Homo sapiens 


hSNF2b 


91 


34 


1846 


M14228 


Gallus gailus 


c-beta-3 beta-tubulin 


598 


83 


1847 


AK022217 


Homo sapiens 


unnamed protein product 


97 


56 


1848 


AJ250042 


Homo sapiens 


Rab5 GDP/GTP exchange factor 
homologue 


174 


83 


1849 


W23949 

■ • 


Homo sapiens 

■ 


Human phosphoinositide 30H-kinase 
pi 01 subunit. 


143 


28 


1850 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


101 


63 


1851 


AB017114 


Homo sapiens 


AD 3 


113 


100 


1852 


D00570 


Mus museums 


open reading frame (251 AA) 


174 


55 


1853 


A67510 


Mus musculus 


MUS MUSCULUS GENOMIC DNA 
CONTAINING N ALLELE OF FV1 
GENE. 


133 


42 


1854 


U49974 


Homo sapiens 


mariner transposase 


214 


82 


1855 


U93569 


Homo sapiens 


p40 


95 


31 


1856 


D89729 


Homo sapiens 


CRM1 protein 


475 


90 


1857 


AF090895 


Homo sapiens 


PRO0117 


89 


36 


1858 


AFO 15926 


Homo sapiens 


ezrin-radixin-moesin binding 
phosphoprotein-50 


117 


73 


1859 j 


D13721 


Gallus gallus 


NF-kB p65 subunit 


223 


56 


1860 


K03204 


Homo sapiens 


salivary proline-rich protein precursor 


111 


32 


1861 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


195 

+ 


52 


1862 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


331 


69 


1863 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


80 


35 


1864 


L27428 


Homo sapiens 


reverse transcriptase 


84 


51 


1865 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


106 


46 


1866 


U53585 


Mycobacterium 
avium 


fibronectin attachment protein 


86 


36 


1867 


AF255446 

- 


Crypthecodinium 
cohnii 


Dip 1 -associated protein C 


134 


45 


1868 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


81 


63 


1869 


AP000001 


Pyrococcus 
horikoshii 


235aa long hypothetical protein 


108 


40 


1870 


M13100 


Rattus 
norvegicus 


unknown protein 


121 

* • 


53 


1871 


X67863 


Mus musculus 


T2 


101 


35 


1872 


S80119 


Rattus sp. 


reverse transcriptase homolog 


151 


43 


1873 


W73633 


Homo sapiens 


Human secreted protein clone. 


140 


44 


1874 


U57053 


Homo sapiens 


myosin-ID 


203 


82 


1875 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


54 


1876 


M21097 


Homo sapiens 


CD 19 differentiation antigen 


432 


79 


1877 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


150 


52 


1878 


U25281 


Rattus 
norvegicus 


SH3 domain binding protein 


108 


36 


1879 


G04091 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
8172. 


117 


48 


1880 


X73113 


Homo sapiens 


fast MyBP-C 


599 


77 


1881 


AX028128 


Homo sapiens 


unnamed protein product 


162 


43 


1882 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


62 



178 



NO: 


* Accession No. 


Species 


Descrfotion 


Smith- 
Waterman 
Score 


Ideotit 

y 




♦ 


» 


7870. 






1883 


Y00664 


Homo sapiens 


open reading frame t (AA 1-86) 


74 


34 


1884 


U86587 


Mus musculus 

• 


phospharjdylinositol 3-kinase catalytic 
subunit pi 10 delta 


204 


77 

• 


1885 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor I 68 
kDa subunit 


148 


37 


1886 


Y13829 


Homo sapiens 


MBNL protein 


117 


63 


1887 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


119 


80 


1888 


R95913 


Homo sapiens 


Neural thread protein. 


91 


64 


1889 


L39059 


Homo sapiens 


transcription factor SL 1 


92 


34 


1890 


M69297 


Homo sapiens 


ORF3 


169 


48 


1891 


R95913 


Homo sapiens 


Neural thread protein. 


91 


62 


1892 


Z28201 


Saccharomyces 
cerevisiae 


ORF YKL202W 


95 


51 


1893 


G00427 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4508. 


90 


51 


1894 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


183 


73 


1895 


Y10055, 


Homo sapiens 


phosphoinositide 3-kinase 


564 


84 


1896 


AF093775 


Mus musculus 


alpha-actinin 3 


375 


85 


1897 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


82 


66 


1898 


M21904 


Homo sapiens 


4F2 heavy chain antigen 


386 


80 


1899 


AB028997 


Homo sapiens 


KIAA1074 protein 


145 


38 


1900 


U97553 


murid 

herpesvirus 4 


unknown 


98 


41 


1901 


AB011142 


Homo sapiens 


KIAA0570 protein 


209 


95 


1902 


AF194537 


Homo sapiens 


NAG 13 


161 


55 


1903 


U93564 


Homo sapiens 


putative pi 50 


142 


33 


1904 


M29399 


Homo sapiens 


erythrocyte membrane protein band 4.2 


413 


90 


1905 


Y27400 


Homo sapiens 


Human P450 reductase functional 
fragment sequence. 


294 


67 


1906 

■ 


Y28503 


Homo sapiens 


HGFH3 Human Growth Factor 
Homologue 3. 


167 


100 


1907 


AF128625 


Homo sapiens 


CDC42-binding protein kinase beta 


509 


89 


1908 


AB029147 


Cucumis sativus 


expressed in cucumber hypocotyls 


98 


41 


1909 


X13783 


Homo sapiens 


alpha-1 type 2 collagen (714 AA) 


87 


43 


1910 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 
protein 1 


330 


48 

* 


1911 


AF129075 


Homo sapiens 


T-COMPLEX PROTEIN 1, THETA 
SUBUNIT (TCP- 1 -THETA) 


511 


90 


1912 


AF011450 


Mus musculus 


type XV collagen 


87 


28 


1913 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


124 

■ 


75 


1914 


AF171230 


Vigna 
unguiculata 


phosphatidic acid phosphatase beta 


112 


53 


1915 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


100 


47 


1916 


M94131 


Homo sapiens 


mucin 


97 


37 


1917 


X13885 


Nicotiana 
tabacum 


extensin (AA 1-620) 


120 


34 


1918 


AF 186605 


Homo sapiens 


MLL2 protein 


115 


29 


1919 


M12130 


Mus musculus 


RNA polymerase II 


498 


83 


1920 


AL049794 


Homo sapiens 


dJ777L9. 1 (novel protein similar to 
mouse kinesin-like proteins KJF1A and 
KIF1B) 


514 


90 


1921 


D83703 


Homo sapiens 


peroxisome assembly factor-2 


233 


64 


1922 


Y11922 


Homo sapiens 


Human 5' EST secreted protein SEQ ID 
No: 522. 


162 


77 
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~5eo id 

NO: 


Accession No 






Smith- 

Waterman 
Score 


*/• 

Ideotit" 

y 


1923 


AC002481 


Homo sapiens 


similar to nitrogen permease regulator, 
similar to P39923 (PID:g730170), 
match to AA233630 (NID:gl 856833) 
and AA399402 (NID:g2053147) 


223 


79 


1924 


D82060 


Homo sapiens 


membrane protein with histidine rich 
charge clusters 


115 


40 


1925 


U49974 


Homo sapiens 


mariner transposase 


195 


61 


1926 


S80119 


Rattussp. 


reverse transcriptase homolog 


104 


40 


1927 


G00354 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


79 


68 

■ 


1928 


S79639 


Homo sapiens 

■ 


EXT Imputative tumour 
suppressor/hereditary multiple 
exostoses candidate gene 


430 


88 


1929 


X58063 


Brugia pahangi . 

■ 


major protein component of the 
micoiilarial sheath 


104 


43 


1930 


AF1 19855 


Homo sapiens 


PR01847 


132 


64 


1931 


AJ005577 


Homo sapiens 


6-phosphofructo-2-kinase 


326 


90 


1932 


AF020261 


Santalum album 


proline rich protein 


93 


33 


1933 


AJ272204 


Homo sapiens 


hypothetical protein 


321 


52 

— — - — ■ — i 


1934 


AL451017 


Neurospora 
crass a 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


116 


39 


1935 


A61971 


unidentified 


MCSP 


328 


79 


1936 


U93564 


Homo sapiens 


putative pi 50 


217 


54 


1937 

■ 


U97553 


murid 

herpesvirus 4 


unknown 


113 


38 


1938 


AF194537 


Homo sapiens 


NAG 13 


90 


37 


1939 


AF226044 


Homo sapiens 


HSNFRK 


403 


85, 


1940 


X83413 


Human 
herpesvirus 6 


U88 

■ 


303 


50 


1941 


AJ007628 


Rattus 
norvegicus 


ELK channel 1 


112 


38 


1942 


AF 104328 


Arabidopsis 
thaliana 


cell wall-plasma membrane linker 
protein homolog 


135 


35 


1943 


AF 130089 


Homo sapiens 


PRO2550 


93 


63 


1944 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


102 


34 


1945 


M19419 


Mus musculus 


proline-rich salivary protein 


121 


40 


1946 


U57316 


Homo sapiens 


histone acetyltransferase 


132 


73 


1947 


AL163302 


Homo sapiens 


human type XVIII collagen 


79 


34 


1948 


AB020746 


Arabidopsis 
thaliana 


protein kinase-like protein 


117 


36 


1949 


AJ011738 


Homo sapiens 


Inilb 


209 


85 


1950 


AF010144 


Homo sapiens 


neuronal thread protem AD7c-NTP 


155 


48 


1951 


AF 119569 


Homo sapiens 


patched 2 


166 


89 


1952 


AB002107 


Homo sapiens 


hPer 


118 


39 


1953 


X98834 


Homo sapiens 


zinc finger protem Hsal2 


430 


70 


1954 


Y19641 


Homo sapiens 


SEQ ID NO 359 from W09922243. 


96 


64 


1955 


U93569 


Homo sapiens 


putative pi 50 


157 


44 


1956 


G04091 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
8172. 


85 


53 


1957 


AB013729 


Mus musculus 


semaphorin Y 


106 


38 


1958 


AC005360 


TV ' 

Homo sapiens 


FAA 


338 


68 


1959 


AJ223075 


Homo sapiens 


TRIP protein 


598 


95 


1960 


AC004022 


Homo sapiens 


serum paraoxonasearylesterase 3 


147 


62 


1961 


AF076776 


Drosophila 
melanogaster 


helicase DOMINO A 


157 


45 


1962 


AF265555 


Homo sapiens 


ubiquitin-conjugating BIR-domain 
enzyme APOLLON 


587 


77 


1963 


AF229642 


Mus musculus 


DXImx46e protein 


127 


91 
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SEQID 
NU: 


Accession No. 


Species 

■ 


Description 


Smith- 

warerwan 

Score 


% 

Identic 

y 


1964 


U72520 


Mus musculus 


mena protein 


89 


31 


' 1965 


W40309 


Homo sapiens 


Human ITAK protein. . 


179 


32 


1966 


AJ388557 


Canis familiaris 


zinc finger protein 


826 


56 


1967 


Y92515 


Homo sapiens 


Human OXRE-12. 


224 


53 


1968 


Y17832 


Human 
endogenous 
retrovirus K 


pol protein 


187 

- 


49 


1969 


Y41245 


Homo sapiens 


Human Y2 1 8 protein. 


220 


78 


1970 


AB052738 


Sus scrofa 


Smad3 


366 


85 


1971 


AB007644 


Arabidopsis 
thaliana 


contains similarity to phytocyanin/early 
nodulin-like protein~geneJd:K19P173 


106 

4 


32 


1972 


G00333 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4414. 


122 


57 


1973 


AL357472 


Homo sapiens 


VPS33B 


112 


31 


1974 


AF151902 


Homo sapiens 


CGI- 1 44 protein 


112 


95 


1975 


AL137260 


Homo sapiens 


hypothetical protein 


148 


92 


1976 


U89505 


Homo sapiens 


Hiark 


408 


89 


1977 


U67328 


Mus musculus 


NIPI-Hke protein 


168 


73 


1978 


AK026435 


Homo sapiens 


unnamed protein product 


601 


94 


1979 


Y14318 . 


Homo sapiens 


peroxisomal ABC-transporter 


507 


96 


1980 


U63630 


Homo sapiens 


MCM4 


570 


90 


1981 


AF1 18090 


Homo sapiens 


PRO2044 


154 


84 


1982 


AF016370 


Homo sapiens 


U4/U6 small nuclear ribonucleoprotein 
hPrp3 

mt\m» * 


422 


63 


1983 


AB011154 


Homo sapiens 


KJAA0582 protein 


420 


80 


1984 


AB011422 


Homo sapiens 


Trad 


201 


67 


1985 


AL390156 


Homo sapiens 


hypothetical protein 


230 


97 


1986 


AJ242540 


Volvox carteri f 
nagariensis 


hvdroxvDroline-rich {? lvcoDrotein DZ- 
HRGP 


163 


44 


1987 


M12523 


Homo sapiens 


alloalbumin Venezia 


411 


91 


1988 


G02996 


Homo sapiens 


Human secreted protein SEO ID NO* 
7077. 


145 


63 


1989 


Y36156 


Homo sapiens 


Human secreted protein #28. 


107 


80 


1990 


AF161356 


Homo sapiens 


HSPC093 


156 


57 


1991 


G03443 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7524. 


132 


72 


1992 


AF119851 


Homo sapiens 


PR01722 


126 


52 


1993 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


228 


60 


1994 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


113 


40 


1995 


G02469 


Homo sapiens 


Human secreted protein. SEO ID NO" 
6550. 


118 


58 


1996 


G00328 


Homo sapiens 


Human secreted protein. SEO ID NO: 
4409. 


137 


57 


1997 


X75068 


Bos taurus 


plasmalemmal porin 


85 


85 


1998 


Y64869 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
IDNO:1030. 


92 


80 


1999 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


275 


79 


2000 


AF1 18078 


Homo sapiens 


PR01848 


111 


57 


2001 


S80119 


Rattus sp. 


reverse transcriptase homolog 


120 


61 


2002 


AB011U0 


Homo sapiens 


KIAA0538 protein 


146 


73 


2003 


Ml 5530 


Homo sapiens 


B-cell growth factor 


94 


64 


2004 


AF225918 


Mus musculus 


intestinal cell kinase 


216 


77 


2005 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


85 


2006 


D84391 


Mus musculus 


reverse transcriptase 


135 


38 


2007 


AF090930 


Homo sapiens 


PRO0478 


101 


86 



181 
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* 



NO: 






DescrfniiATi 


Smith- 

Waterman 

Score 


% 

Mentit 

y 


2008 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


156 


66 


2009 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


165 


49 


2010 


AF1 13685 


Homo sapiens 


PRO0974 


124 


63 


2011 


AF 130079 


Homo sapiens 


PR02852 


141 


78 


2012 . 


AF010144 j 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


51 


2013 


AF090944 


Homo sapiens 


PRO0663 


67 


53 


2014 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


151 


58 


2015 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


223 


75 


2016 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


137 1 


78 


2017 


L26953 


Homo sapiens 


chromosomal protein 


120 


74 


2018 


G03646 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7727. 


68 


57 


2019 


G03646 


Homo sapiens 

i 


Human secreted protein, SEQ ID NO: 
7727. 


64 


42 


2020 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


334 


63 


2021 


L26953 


Homo sapiens 


chromosomal protein 


112 


64 


2022 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


139 


69 


2023 


U93569 


Homo sapiens 


putative pi 50 


187 


89 


2024 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


§0 

i' 


72 


2025 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


142 


71 


2026 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


77 


66 


2027 


AJ242540 

i 


Volvox carteri £ 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


372 


73 


2028 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


159 


80 


2029 


AF194537 


Homo sapiens 


NAG 13 


158 


57 


2030 


L26251 


Trypanosoma 
brucei 


CR5 


110 


41 


2031 


AF197913 


Helicoverpa 
armigera nuclear 
polyhedrosis 
virus 


basic DNA-binding protein BDBP 

. 


149 


55 


2032 


, GQ0344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


115 


80 


2033 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


157 


82 


2034 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


133 


75 


2035 


G03646 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7727. 


104 


67 


2036 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


137 


71 


2037 


AF130089 


Homo sapiens 


PRO2550 


146 


55 


2038 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


81 


73 


2039 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


66 


2040 

- 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


159 


68 
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SEOIO 

NO: 


'Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2041 


B01372 


Homo sapiens 


Neuron-associated protein. 


95 


79 


2042 


V01 555 


Human 
herpesvirus 4 


BYRF1, encodes EBNA-2 (Dambaugh 
etal, 1 984; Dillner etal, 1984) 


108 


77 


2043 


G02515 


.Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


125 


65 


2044 


AF 11 8086 


Homo sapiens 


PR01992 


103 


64 


2045 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


64 


2046 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


108 


51 


2047 


Y08062 


Homo sapiens 

* 


Human PR0245 protein fragment 
derived from DNA35638. 


133 


68 


2048 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


155 


64 


2049 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


125 


53 


2050 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


112 


56 


2051 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


70 


62 


2052 


X61045 


Hydra sp. 


mini-collagen 


99 


46 


2053 


X83413 


Human 
herpesvirus 6 


U88 


148 


73 

• 


2054 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


75 


63 


2055 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


104 


75 


2056 


X16524. 


Dictyostelium 
discoideum 


coding region (AA 1 - 437) 


104 


51 


2057 


R95913 


Homo sapiens 


Neural thread protein. 


83 


77 


2058 


AF1 18080 


Homo sapiens 


PRO1880 


126 


61 


2059 


AJ276003 


Homo sapiens 


GAR 1 protein 


136 


59 


2060 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


120 


70 


2061 


X01918 


DrosophiJa 
melanogaster 


salivary gland glue protein 

• 


179 

• ■ 


40 


2062 


U00029 


Saccharomyces 
cerevisiae 


Yhr217cp 


110 


46 


2063 


AF130051 


Homo sapiens 


PRO0898 


143 


80 


2064 


AF006061 


Homo sapiens 


placental growth hormone isoform 
hGH-V3 


121 


85 


2065 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


129 


66 


2066 


U82303 


Homo sapiens 


unknown 


92 


68 


2067 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


368 


79 


2068 


L27428 


Homo sapiens 


reverse transcriptase 


186 


58 


2069 


AF130051 


Homo sapiens 


PRO0898 


164 


73 


2070 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


271 


73 


2071 


AF 130051 


Homo sapiens 


PRO0898 


110 


56 


2072 


AF010144 


Homo sapiens 


neuronal thread protem AD7c-NTP 


117 


67 


2073 


AF157706 


Human 

herpesvirus 6B 


B4 


104 


49 


2074 


AL049608 


Arabidopsis 
thaiiana 


extensin-like protein 


120 


41 


2075 


AP002460 


Arabidopsis 
thaiiana 


geneJcLFl D9.26~unknown protein 


150 


27 


2076 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


85 
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a&Kl If 
NO: 


Accession no. 


Species 


Description 


1 gT^TTiT — 

ouiiin- 

Watemaan * 
Score 


0/ 

Identit 

y 


2077 


Mil 897 


Mus musculus 


proline-rich salivary protein 


125 


42 


2078 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


66 


75 


2079 


AF04I330 


Bodo saltans 


NADH dehydrogenase subunit 5 


185 


47 


2080 


AJ006470 


Homo sapiens 


cartilage-associated protein (CASP) 


139 


84 


2081 


AK024509 


Homo sapiens 


unnamed protein product 


132 • 


83 


2082 


U93564 


Homo sapiens 


putative p!50 


137 


67 


2083 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


99 


71 


2084 


B01372 


Homo sapiens 


Neuron-associated protein. 


148 


81 


2085 


AL1 60371 


Leishmania 
major . 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


114 


76 


2086 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


255 


63 

■ 


2087 


AF 194537 


Homo sapiens 


NAG 13 


122 


78 


2088 


G02532 


Homo sapiens 


Human secreted protein* SEQ ID NO: 
6613. 


108 


74 


2089 


AF130079 


Homo sapiens 


PR02852 


149 


77 


2090 


U93574 


Homo sapiens 


p40 


260 


96 


2091 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


99 


76 


2092 


AF 194537 


Homo sapiens 


NAG13 


125 


51 


2093 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


319 


72 


2094 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


159 


56 


2095 


U93572 


Homo sapiens 


putative pl50 


195 


57 


2096 


X83413 


Human 
herpesvirus 6 


U88 


132 


57 


2097 


AF010400 


Homo sapiens 


transaldolase-related protein 


463 


89 


2098 


U93563 


Homo sapiens 


putative pi 50 


128 


35 


2099 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


160 


43 


2100 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


133 


43 


2101 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6613. 


97 


71 


2102 


K02401 


Homo sapiens 


chorionic somatomammotropin 


628 


90 


2103 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68j 
SEQIDNO:163. 


141 


80 


2104 


U49973 

• 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


309 


63 


2105 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


192 


37 


2106 


Y59772 


Homo sapiens 


Human normal ovarian tissue derived 
protein 49. 


261 


89 


2107 


AF202051 


Homo sapiens 


NM23-H8 


680 


100 


2108 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


127 


54 


2109 


L22029 


Glycine max 


hydroxyproline-rich glycoprotein 


121 


36 


2110 


D26135 


Homo sapiens 


diacylglycerol kinase gamma 


172 


100 


2111 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


159 


56 


2112 


X65488 


Homo sapiens 


hnRNP U protein 


117 


70 


2113 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


116 


35 


2114 


AF130051 


Homo sapiens 


PRO0898 


93 


62 


2115 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


260 


53 


2116 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


148 


70 



184 
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NO: 


Accession wo. 


opectes 

• 


is cscri prion 


WatermaD 
Score 


57 

Identit 

y 


2117 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


304 


71 


2118 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


70 


66 


2119 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


118 


84 


2120 


S79410 


Mus musculus 


nuclear Localization signal binding 
protein. 


94 


94 


2121 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


304 


73 


2122 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


83 


32 


2123 


AK025047 


Homo sapiens 


unnamed protein product 


116 


48 


2124 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


95 


72 


2125 


AF138883 


Bos tanrus 

- 


type 11 collogen cyanogen bromide 
fragment CB10 


103 


40 


2126 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


186 


100 


2127 


G03114 

■ ■ » 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


117 


52 


2128 


AF 194537 


Homo sapiens 


NAG 13 


126 


66 


2129 


AB015802 


Acetobacter 
xylinus 


similar to cellulose complementing 
protein of A. xylinum ATCC23869 


109 


80 


2130 


AF187147 


Mus musculus 


drebrinA 


110 


38 


2131 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


148 

I* 


42 


2132 


L36341 


Aspergillus 
nidulans 


regulatory protein 


130 


49 


2133 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


119 


58 


2134 


AF 116689 


Homo sapiens 


PR02168 


113 


81 


2135 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


191 


38 


2136 


M81321 


Macaca 
fascicularis 


proline-rich protein 

- -- - 


125 


43 


2137 


A18812 


Brassica napus 


extensin 


106 


34 


2138 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


104 


41 


2139 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


191 


92 


2140 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


196 


90 


2141 


Y57285 


Homo sapiens 


Human GPCR protein (HGPRP) 
sequence (clone ED 2214673). 


507 


78 


2142 


AJ242540 


Volvox carteri £ 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


201 


47 


2143 


AF090944 


Homo sapiens 


PRO0663 


164 


53 


2144 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


390 


76 

' 


2145 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


198 


70 


2146 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


119 


82 


2147 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


194 


45 


2148 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


91 


60 


2149 


AF090942 


Homo sapiens 


PRO0657 


129 


66 


2150 


AF3 30089 


Homo sapiens 


PRO2550 


372 


82 


2151 


AC009991 


Arabidopsis 


unknown protein 


81 


59 
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Species 

* 
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Smith- 
Water nun 
Score 


Identit 

y 




i 


thaliana 








2152 


AF090942 


Homo sapiens 


PRO0657 


93 


53 


2153 


R133J9 


Homo sapiens 


Partial Human Natural Killer receptor. 


215 


89 


2154 


AC008075 


Arabidopsis— 
thaliana. 


F24J5.4 


139 


36 


2155 


AL390114 

• 


Leishmania 
major 


extremely cysteine/valine rich protein 


148 


50 


2156 


S79410 

■ 


Mus musculus 


nuclear localization signal binding 
protein 


112 


58 


2157 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


66 


2158 


G00376 


Homo sapiens 

> 


Human secreted protein, SEQ ID NO: 
4457. 


'85 


80 


2159 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


165 


75 


2160 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


145 


45 


2161 


AF1 19900 


Homo sapiens 


PR02822 


138 


71 


2162 


AJ223953 


Homo sapiens 


hPTTG 


106 


62 


2163 


AK023542 


Homo sapiens 


unnamed protein product 


76 


52 


2164 


G00344 ( 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


118 


84 


2165 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


137 


65 


2166 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


348 


88 

• 


2167 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. • 


108 


84 


2168 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


114 


48 


2169 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


125 


67 


2170 


AF130089 


Homo sapiens 


PRO2550 


142 


65 


2171 


Y 19609 


Homo sapiens 


SEQ ID NO 327 from W09922243. 


103 


65 


2172 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


39 


2173 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


91 


50 


2174 


D38435 


Homo sapiens 


homologue of yeast PMS1 


314 


96 


2175 


AF119851 


Homo sapiens 


PR01722 


230 


69 


2176 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


100 


63 


2177 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


137 


44 


2178 


U23172 


Caenorhabditis 
elegans 

* 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


139 


42 


2179 


AL049608 


Arabidopsis 
thaliana 


extensin-like protein 


286 


57 


2180 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


148 


40 


2181 


AK024455 


Homo sapiens 


FLJ00047 protein 


97 


63 


2182 


R95913 


Homo sapiens 


Neural thread protein. 


100 


69 


2183 


AF266479 


Homo sapiens 


rectachrome l 


148 


81 


2184 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


64 


70 


2185 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


107 


40 


2186 

i 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


110 


84 
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SEQ ID 

NO: 


Accession No. 
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Smith- 

Waterman 

Score 


% 

Identit 

y 


2187 


AF137273 


GaNus gallus 


aJpha 1 (V) collagen 


103 


43 


2188 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


115 


36 


2189 


D90064 


Homo sapiens 


NCA-W272 


271 


100 


2190 


AF1 30089 


Homo sapiens 


PRO2550 


137 


68 


2191 


X65165 


Volvox carteri 


extensin 


113 


62 


2192 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase . 
similar to pogo element 


154 


59 . 


2193 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


148 


59 


2194 


U52077 


Homo sapiens 


mariner transposase 


257 


53 


2195 


B12310 

■ 


Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 


98 


79 


2196 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


180 


77 


2197 


Y02693 


Homo sapiens 

• 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


121 


65 


2198 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


126 


64 


2199 


G03807 


; Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


101 


67 


2200 


AJ007042 


Homo sapiens 


TRX5 protein 


264 


75 


2201 


AF130051 


Homo sapiens 


PRO0898 


71 


61 


2202 


AK024455 


Homo sapiens 


FU00047 protein 


153 


62 


2203 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


95 


52 


2204 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


128 


82 


2205 


AF090895 


Homo sapiens 


PRO0117 


163 


69 


2206 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 


154 


69 


2207 


R95913 


Homo sapiens 


Neural thread protein. 


103 


86 


2208 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


144 


78 


2209 


W48351 


Homo sapiens 

■ 


Human breast cancer related protein 
BCRB2. 


100 


65 


2210 


G03790 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


111 


45 


2211 


AF090931 


Homo sapiens 


PRO0483 


63 


90 


2212 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


100 


74 


2213 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


98 


41 


2214 


M76671 


Lycopersicon 
esculentum 


extensin (class II) 


137 


35 

- 


2215 


. X03717 


Homo sapiens 


pot. unidentified reading frame 


98 


54 


2216 


R95913 


Homo sapiens 


Neural thread protein. 


109 


48 


2217 


AF 118086 


Homo sapiens 


PR01992 


138 


79 


2218 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


343 


95 


2219 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


157 


84 , 


2220 


AF141347 


Homo sapiens 


alpha-tubulin 


571 


94 


2221 


AF071172 


Homo sapiens . 


HERC2 


187 


84 


2222 


M17783 


Homo sapiens 


glia-derived nexin precursor 


529. 


83 


2223 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


588 


85 


2224 


AK026072 


Homo sapiens 


unnamed protein product 


199 


57 


2225 


G03790 • 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


161 


61 


2226 


Z70684 


Caenomabditis 
elegans 


F28D1.8 


126 


41 


2227 


G03807 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7888. 


124 


63 
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or-Q ID 

NO: 


' Accession No. 


Species 


Description 


dtnitn- 

Waterman 

Score 


OS 

Identit 

y 


2228 


AF1 19855 


Homo sapiens 


PRO 1847 


108 


74 


2229 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


122 


55 


2230 


AF090944 


Homo sapiens 


PRO0663 


146 


61 


2231 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


152 


74 


2232 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


80 


59 


2233 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


129 


58 


2234 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


143 


60 


2235 


AK024455 


Homo sapiens 


FLJ00047 protein 


133 


71 


2236 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 142. 


99 

• 


59 


2237 


U87607 


Rattus 
norvegicus 


putative RNA binding protein I 


111 


38 


2238 


G00397 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


119 


63 


2239 


G00357 , 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


75 


65 


2240 


L27428 


Homo sapiens 


reverse transcriptase 


136 


40 


2241 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


122 


82 


2242 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


105 


95 


2243 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


79 


48 


2244 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


145 


75 


2245 


U09116 


Homo sapiens 


ORF1, encodes a 40 kDa product 


346 


87 


2246 


U82303 


Homo sapiens 


unknown 


155 


79 


2247 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


105 

- 


45 


2248 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18, 


105 


47 


2249 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 


63 


2250 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


124 


62 


2251 


X81206 


Drosophiia hydei 


histoneH3.3 


101 


71 


2252 


AF155581 


Danio rerio 


proteasome subunit beta 7 


92 


52 


2253 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


172 


60 


2254 


AF084225 


Homo sapiens 


cytochrome P4502E1 


114 


46 


2255 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


136 


64 


2256 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


147 


90 


2257 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


297 


59 


2258 


U93563 


Homo sapiens 


putative pi 50 


218 


66 


2259 


L22650 


Homo sapiens 


early lymphoid activation protein 


82 


55 


2260 


AF 194537 


Homo sapiens 


NAG 13 


117 


56 


2261 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


130 


82 


2262 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4447. 


86 


66 


2263 


AL021918 


Homo sapiens 


b34I8. 1 (Kruppel related Zinc Finger 
protein 184) 


372 


61 


2264 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


122 


71 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smitb- 

Waterman 

Score 


% 

Identit 

y 


2265 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


123 


79 


2266 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. — 


204 


95 


2267 

• 


L29029 

• 


Chlamydomonas 
reinhardtii 


amino acid feature: Rod protein 
domain, aa266 .. 468; amino acid 
feature: globular protein domain, aa 32 
..265 


141 

• 


58 


2268 


Y91577 


Homo sapiens. 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


499 


84 


2269 


A11693 


Homo sapiens 


start codon not included 


594 


87 


2270 


AB014554 


Homo sapiens 


KIAA0654 protein 


141 


63 


2271 


G04067 

■ * * 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


112 


60 


2272 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


118 


54 


2273 


Y20852 


Homo sapiens 


Human neurofilament-H mutant protein 
fragment 1 1. 


125 


35 


2274 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


92 


66 


2275 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


198 


64 

■ 


2276 


U15647 


Mus musculus 


reverse transcriptase 


100 


72 


2277 


AJ004810 


Zea mays 


cytochrome P450 monooxygenase 


129 


43 


2278 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


111 


67 


2279 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. _ 


1,12 


68 


2280 


L27428 


Homo sapiens 


reverse transcriptase 


150 


60 


2281 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


.142 


40 


2282 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


321 


54 


2283 


D00570 


Mus musculus 


open reading frame (251 AA) 


169 


56 


2284 


Y02887 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


86 


54 


2285 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


144 


50 


2286 


AF1 19901 


Homo sapiens 


PR02831 


116 


82 


2287 


G0379? 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


126 


73 


2288 


AF130089 


Homo sapiens 


PRO2550 


102 


75 


2289 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: { 
4484. 


113 


70 


2290 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


127 


64 


2291 


L26953 


Homo sapiens 


chromosomal protein 


137 


53 


2292 


AF161356 


Homo sapiens 


HSPC093 


100 


57 


2293 


G00328 


Homo sapiens- 


Human secreted protein, SEQ ID NO: 
4409. 


138 


68 


2294 


X03145 


Homo sapiens 


pot ORF I 


120 


43 


2295 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


83 


52 


2296 


AK024455 


Homo sapiens 


FLJ00047 protein 


98 


66 


2297 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


123 


73 


2298 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


349 


73 


2299 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


109 


61 
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crn ill 


A *cci on Nn 




X*Otl l|)UUIl 




/o 


NO: 






Waterman 


Identit ' 










Score 


y 


2300 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


81 






• 


5327. 






2301 


AF251290 


Plasmodium 


glutamic acid-rich protein 


112 


40 






falciparum 








2302 


AF 155232 


Pisum sativum 


extensin 


89 


36 


2303 


AF130089. 


Homo sapiens 


PRO2550 


83 


50 


2304 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 


84 


51 








homolog 


• 




2305 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


86 


30 


2306 


AP002031 


Arabidopsis 


genejd:K3D20.3~ 


168 


44 


■ 




thai i ana 








2307 


AF157321 


Homo sapiens 


30 kDa protein 


309 


64 


2308 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


57 








6613. 




- 


2309 


M76546 


Helianthus 


hydroxyproline-rich protein 


198 


42 






annuus 








2310 


Y27607 


Homo sapiens 


Human secreted protein encoded by 


207 


100 








gene No. 41. 






2311 


AF 130089 


Homo sapiens 


PRO2550 


106 


66 


2312 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


60 








4518. 


■ 




2313 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 


114 


80 








antigenic protein encoded by 












phageclone lambda HC2533. 






2314 


AB032910 


Hylobates 


dopamine receptor D4 , 


108 


40 






muelleri 


- 


j 




2315 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


L59 


59 








gene No. 107. 






2316 


AK024455 


Homo sapiens 


FLJ00047 protein 


134 


58 


2317 


AF238235 


Entamoeba 


diaphanous protein 


103 


51 






histolytica 








2318 


U15647 


Mus musculus 


reverse transcriptase 


101 


37 


2319 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


117 


51 


2320 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


163 


78 




• 




SEQIDNO:163. 






2321 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


48 


* 


. 




7871. 






2322 


D00570 


Mus musculus 


open reading frame (196 AA) 


122 


39 


2323 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


122 


75 








RHAla(AF078683) 






2324 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


79 








6720. 




• 


2325 


U44838 


Glycine max 


extensin 


126 


36 


2326 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 


99 


59 








IDNO:142. 






2327 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 


408 


79 






■ 


fragment, SEQ ID NO:490. 






2328 


AF1 94537 


Homo sapiens 


NAG13 


181 


66 


2329 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


405 










similar to pogo element 




• 


2330 


X53581 


Rattus 


ORF4 


116 


45 






norvegicus 








2331 


L26953 


Homo sapiens 


chromosomal protein 


108 


67 


2332 


L27428 


Homo sapiens 


reverse transcriptase 


117 


76 


.2333 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


50 








4718. 






2334 


AF191687 


Homo sapiens 


alanine-glyoxylate aminotransferase 


100 


37 








homolog 






2335 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


84 


68 
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i 

• 
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% 
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y 


2336 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


106 


53 


2337 

* 


Y86248 


Homo sapiens 


Human secreted protein HGHPF68, 
SEQID NO: 163. 


168 


65 


2338 


R95913 


Homo sapiens 


Neural thread protein. 


164 


67 


2339 


AF090942 


Homo sapiens 


PRO0657 


122 


81 


2340 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


107 


90 


2341 


AF090942 


Homo sapiens 


PRO0657 


124 


60 


2342 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


126 


83 


2343 


ABO 13454 


Rattus 
norvegicus 


NaPi-2 beta 


143 

• 


77 


2344 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


127 


73 


2345 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


146 


45 


2346 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


120 


40 


2347 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


55 


2348 


Y36203 


Homo sapiens 


Human secreted protein #75. 


138 


78 


2349 


L26953 


Homo sapiens 


chromosomal protein 


131 


57 


2350 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


120 


65 


2351 


S58722 


Homo sapiens 


X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 


144 


58 


2352 


AF130051 


Homo sapiens 


PRO0898 


155 


70 


2353 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


134 


76 


2354 


L27428 


Homo sapiens 


reverse transcriptase 


141 


71 


2355 


L26953 


Homo sapiens 

mm* mm ^m' m ^» W Wm' w t mm *^mW m> ^^m*9^m* 


chromosomal protein 

^^^mmmm^ m>^ — mmr *mr m* mm.mm WB mm* M wrmm*m+ 


128 


66 


2356 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


148 


75 


2357 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


115 


62 


2358 


B12310 


Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 


115 


50 


2359 


M76546 


HeJianthus 
annuus 


hydroxyproline-rich protein 


103 


43 


2360 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


82 


80 


2361 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


95 


72 


2362 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


117 


41 


2363 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 
sequence. 


98 


80 


2364 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


152 


45 


2365 


Y02671 


Homo sapiens 

• 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


85 


55 


2366 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


101 


77 


2367 


AF093O97 


Homo sapiens 


putative RNA-binding protein Q99 


248 


97 


2368 


U52077 


Homo sapiens 


mariner transposase 


227 


74 


2369 


X07882 


Homo sapiens 


Po protein 


102 


38 


2370 


U44838 


Glycine max 


extensin 


102 


32 


2371 


AB012223 


Canis familiaris 


ORF2 


158 


60 


2372 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


104 


42 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 






elegans 


binding protein K10 (NID:g8I48) 






2373 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


107 


40 


2374 


G04078 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
8159. 


106. — 


67 


2375 


AF1 18086 


Homo sapiens 

* 


PR01992 


169 


65 


2376 


W48351 


Homo sapiens 


Human, breast cancer related protein 
BCRB2. 


95 


55 


2377 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


121 


59 


2378 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


141 


70 


2379 


R95913 


Homo sapiens 


Neural thread protein. 

* 


120 


50 


2380 


U93572 


Homo sapiens 


putative pi 50 


124 


53 


2381 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


195 


66 


2382 


U52077 


Homo sapiens 


mariner transposase 


282 


67 


2383 


G04067 


Homo sapiens 

A A\ \0* AAA %^*Aj V ■ ^FAAAr 

! 


Human secreted protein, SEO ID NO: 
8148. 


113 


43 


2384 


AJ242540 

* 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


112 


37 


2385 


G03714 


Homo sapiens 


Human secreted protein, SEO ID NO: 
7795. 


88 


75 


2386 


AF130089 


Homo sapiens 


PRO2550 


160 


62 


2387 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


166 

i 


61 


2388 


AF130089 


Homo sapiens 


PRO2550 


103 


71 


2389 


AB027890 


Schizosaccharom 
yces pombe 


Hypothetical protein 


116 


100 


2390 


U93570 


Homo sapiens 


putative pi 50 


151 


54 


2391 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


no 


71 


2392 


GQ3797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


109 


67 


2393 


AE001381 


Plasmodium 
falciparum 


hypothetical protein 


94 


34 


2394 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO;163. 


156 


62 


2395 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


57 


56 


2396 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


107 


43 


2397 


AF2 10651 


Homo sapiens 


NAG 18 


162 


54 


2398 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 

* 


116 


34 


2399 


S80864 


Homo sapiens 


cytochrome c-like polypeptide 


115 


68 


2400 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


141 


54 


2401 


M13100 


Rattus 
norvegicus 


unknown protein 


148 


46 


2402 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


133 


70 


2403 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


366 


94 


2404 


AFH3685 


Homo sapiens 


PRO0974 


112 


56 


2405 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


98 


& 


2406 


R41001 


Homo sapiens 


Human myotonic dystrophy gene 
protein. 


207 


68 


2407 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


127 


63 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 






major 


DNA, complete virion genome 






2408 

■ ■ 


G03101 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7182. 


378 


95 


2409 

* 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


250 

• 


86 


2410 


U72520 


Mus musculus 


mena protein 


115 


40 


2411 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


44 


2412 

• 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


109 


85 


2413 


AF119901 


Homo sapiens 


PR02831 


113 


66 


2414 


R95913 


Homo sapiens 


Neural thread protein. 


93 


57 


2415 


B01372 


Homo sapiens 


Neuron-associated protein. 


104 


83 


2416 


G02832 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
6913. 


130 


50 


2417 


J02459 


bacteriophage 

^r*m*rm^rm m 

lambda 


K (tail component: 199) 


720 

' mm 


92 

m 7 mm 


2418 


J04694 


Mus musculus 


aloha- 1 tvoe IV collasen 


103 


43 

^w*m 


2419 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463 


145 


61 


2420 


W26496 


Homo sapiens 


CD2 associated intracellular protein 
CAIP LS02-21. 


115 


80 


2421 


AF000298 


Caenorhabditis 
eleeans 


weak similarity to collagens; glycine- 
and oroline-rich 


145 


38 


2422 


G00442 


Homo saoiens 


Human secreted Drotein SEO ID NO* 
4523. 


158 

* %m 


73 1 


2423 


Y86248 


Homo saDiens 


Human secreted Drotein HCHPF68 
SEQIDNO:163. 


156 


70 


2424 


G02480 


Homo sapiens 


Human secreted protein. SEO ID NO: 
6561. 


122 


61 

^m • 


2425 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


95 


59 


2426 


U93565 


Homo sapiens 


putative pi 50 


147 


75 


2427 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


246 


53 


2428 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


138 


61 


2429 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


112 


75 


2430 


D13892 


Homo sapiens 


carboxyl methyltransferase 


181 


68 


2431 


X97675 


Homo sapiens 


plakophilin 2b 


131 


65 


2432 


G02896 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6977. 


108 


70 


2433 


X03145 


Homo sapiens 


pot ORF III 

* 


180 


82 


2434 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


77 


86 


2435 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


421 


67 


2436 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


111 


38 


2437 


AE210651 


Homo sapiens 


NAG18 


128 


72 


2438 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


232 


46 


2439 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


122 


53 


2440 


GO0376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


136 


80 


2441 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


139 


54 


2442 


AF090895 


Homo sapiens 


PRO01I7 


115 


61 


2443 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


125 


70 
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MA. 


Accession No. 


Species 


Description 


Smith- 
w a rerman 
Score 


% 

laentic 

y 


2444 


G00594 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4675. 


71 


80 


2445 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


.81 


37 


2446 


AF090895 


Homo sapiens 


PRO0U7 


86 


64 


2447 


G00442 

♦ 


Homo sapiens 


Human secreted protein, SEQ ID NO: . 
4523. 


103 


86 


2448 


W48351 


Homo sapiens 

* 


Human breast cancer related protein 
BCRB2. 


94 

• 


61 


2449 


L21936 


Homo sapiens 

Ml 


succinate dehydrogenase flavoprotein 
subunit 


168 


91 


2450 


AF090942 


Homo sapiens 


PRO0657 


134 


71 


2451 


G03258 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7339. 


126 


87 


2452 


K03179 


Homo sapiens 


pro-alpha- 1 type-I collagen 


120 


44 


2453 


AF 118082 


Homo sapiens 


PRO1902 


143 


46 


2454 


G00328 


Homo saoiens 


Human secreted orotein. SEO ID NO: 
4409. 


114 


68 


2455 


AF130089 


Homo sapiens 


PRO2550 


125 


89 


2456 


G00332 

V V ^ ^* mm 


Homo saoiens 


Human secreted orotein SEO ID NO: 
4413. 


134 


71 


2457 


Y36243 


Homo sapiens 


Human secreted protein encoded by 
eene 20. 


93 


75 


2458 


Y09945 


Rattus 
norvegicus 


putative integral membrane transport 
protein 


166 


46 


2459 


AF 130052 


Homo sapiens 


PRO0956 


75 


48 


2460 

if ■ W 


G00328 


Homo saoiens 


Human secreted nrotein SEO ID NO* 
4409. 


162 


76 


2461 


R96800 


Homo saoiens 


Human histiocyte- secreted factor HSF. 


123 


69 


2462 


AL390114 


Leishmania 
maior 


probable (hhv-6) ul 102, variant a 
DNA. complete virion senome 


157 


48 


2463 


X83413 


Human 
herpesvirus 6 


U88 


236 


50 


2464 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4598. 


181 


53 


2465 


X83413 


Human 
herpesvirus 6 


U88 


218 


50 


2466 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


147 


57 


2467 


X83413 


Human 
herpesvirus 6 


U88 


196 


53 


2468 


Y86248 


Homo sapiens 

• 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


165 


61 


2469 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


101 


46 


2470 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


221 


56 


2471 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


136 


63 


2472 


AL359782 


Trypanosoma 
brucei 


possible ( hhv-6) ul 102, variant a dna. 
complete virion genome. 


122 


94 


2473 


AF161361 


Homo sapiens 


HSPC098 


120 


60 


2474 


M76546 


Helianthus 
annuus 


hy droxyproline-rich protein 


142 


40 


2475 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


130 


58 


2476 


G02515 


Homo sapiens 


Human secreted protein/SEQ ID NO: 
6596. 


78 


48 


2477 


AL160371 


Leishmania 


probable (hhv-6) ul 102, variant a 


86 


46 
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SEQ ID 


Accession wo. 


sptctes 


.Description 


OIDIID- 


"ml 


NO: 




* 




Watermao 


Identit 










Score 


y 




i 


major 


DNA, complete virion genome 




♦ 


2478 


G01478 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


66 


j ■ 






5559. 






2479 


AL033545 


Arabidopsis 


extensin-like protein 


114 


40 






thaliana 


■ 


- 




2480 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


142 


59 




• 


brucei 


complete virion genome. 


- 




2481 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


146 


62 




• 


major 


DNA, complete virion genome 






2482 


AL 132902 


Caenorhabditis 


Y71A12B.4 


160 


37 




- 


elegans 






• 


2483 


AF 1545 02 


Homo sapiens 


quiescent cell proline dipeptidase 


439 


88 


2484 


AF010326 


Drosophila 


short form of CHIP 


56 


42 




• 


melanogaster 








2485 


W80406 


Homo sapiens 


A secreted protein encoded by clone 


119 


50 








dh40 3. 






2486 


W31186 


Homo sapiens 


Human pi 60 polypeptide 160.2. 


115 


60 


2487 


W48351 


Homo sapiens 


Human breast cancer related protein 


103 


74 


- 


i 




BCRB2. 






2488 


L26953 , 


Homo sapiens 


chromosomal protein 


129 


68 


2489 


Y21418 


Homo sapiens 

* 


Human high mobility group protein 


96 


46 








HMGI-C mutant fragment 2. 






2490 


Y91577 


Homo sapiens 


Human secreted protein sequence 


366 


94 








encoded by gene 2 SEQ ID NO:250. 






2491 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


76 








4508. 






2492 


U12707 


Homo sapiens 


Wiskott-Aldrich syndrome protein 


144 


47 


2493 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


117 


81 








encoded by gene 1 7. 


• 


- 


2494 


X92485 


Plasmodium 


pval 


116 


64 






vivax 








2495 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


64 








7795. 


- 




2497 


AF161483 


Homo sapiens 


HSPC134 


211 


78 


2498 


X05006 


Homo sapiens 


S-protein 


298 


92 


2499 


AB032911 


Hylobates agilis 


dopamine receptor D4 


89 


42 


2500 


M98502 


Mus museums 


pMLZ^ 


325 


90 


2501 


Y94920 


Homo sapiens 


Human secreted protein clone 


569 


85 








pm412 12 protein sequence SEQ ID 




• 








NO:46. 






2502 


AB049054 


Homo sapiens 


brain link protein- 1 


224 


90 


2503 


AF064604 


Homo sapiens 


KE03 protein 


224 


37 


2504 


Z69727 


Schizosaccharom 


putative dna-directed rna polymerase iii 


384 


61 




• 


yces pombe 


130 kd polypeptide (ec 2.7.7.6) 






2505 


Ml 1901 


Rattus 


proline-rich salivary protein 


147 


43 






norvegicus 








2506 


AJ010604 


Mus musculus 


L-Sox5 protein 


366 


87 


2507 


X83413 


Human 


U88 


203 


46 






herpesvirus 6 








2508 


AJ277425 


Globodera 


putative cuticular collagen 


125 


40 






pallida 






■ 


2509 


AB017919 


Homo sapiens 


peptidylarginine deiminase type V 


148 


81 


2510 


AF001947 


Homo sapiens 


U4/U6-associated RNA splicing factor 


583 


97 


2511 


AJ238520 


Homo sapiens 


putative transcription factor-like 


722 


100 








nuclear regulator 






2512 


AL390736 


Homo sapiens 


DA209J19.1.1 (GW112 protein) 


557 


84 


2513 


D84223 


Homo sapiens 


leucyl tRNA synthetase 


1113 


100 


2514 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


174 


56 








clone HPMBQ32. 
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SEQID 
NO: 


Accession No. 


S pedes 


Description 


Smith- 
Waterman 
Score 


% 

Identit 1 

y 


2515 


U60803 


Homo sapiens 


clathrin heavy chain 2 


111 


92 


2516 


AJ388557 


1 Canis familiaris 


zinc finger protein 


826 


56 


2517 


AB027251 


Homo sapiens 


zinc finger protein (ZFD25) 


631 


85 


2518 


AK023160 


Homo sapiens 


unnamed protein product 


168 


54 


2519 


D87326 


Mus musculus 


GSG2 


575 


73 


2520 ' 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


100 


42 


2521 . 




Homo sapiens 


transketolase 


120 


66 


2522 


AF074086 


Homo sapiens 


protease 


390 


86 


2523 

• 


AF220509 


Homo sapiens 


transcription associated factor 
TAFI131L 


801 


99 


2524 


AL078463 


Homo sapiens 


dJ365I19.1 (KIAA0456) 


374 


92 


2525 


AF038995 


Mus musculus 


putative RNA helicase RCK 


160 


93 


2526 


M60618 


Homo sapiens 


nuclear autoantigen 


116 


75 


2527 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


101 


57 


2528 


X68790 

» 


Homo sapiens 


Bactericidal /Permeability Increasing 
Protein 


136 


90 


2529 


AO 1592 


Homo sapiens 


haemoglobin A beta chain 


508 


94 


2530 


U89277 


Homo sapiens 


polyhomeotic 1 homolog 


404 


79 


2531 


X90845 


Rattus 
norvegicus 


alphall spectrin 


507 


87 


2532 


AL137081 


Arabidopsis 
thaliana 


phenylalanine-tRNA synthetase-like 
protein 


178 


33 


2533 


Y 18046 


Homo sapiens 


FGFR1 oncogene partner (FOP) 


363 


100 


2534 


W54966 

* 


Homo sapiens 

* 


Synthetic human type III collagen 
SYN-C3. 


113 

»*. 


42 


2535 


X61047 


Hydra sp. 


mini-collagen 


102 


38 


2536 


AB006330 


Mus musculus 


SOX5 


559 


94 


2537 


AC006283 


Arabidopsis 

• 

thaliana 


En/Spm-Jike transposon protein 


149 


33 


2538 


AF 196779 


Homo sapiens 


JM11 protein 


139 


57 


2539 


AK000741 


Homo sapiens 


unnamed protein product 


233 


47 


2540 


Y 18046 


Homo sapiens 


FGFR1 oncogene partner (FOP) 


315 


84 


2541 


U54996 


Homo sapiens 


HZW10 


371 


81 


2542 


G02631 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6712. 


99 


45 


2543 


M95610 


Homo sapiens 


alpha-2 IX collagen 


93 


34 


2544 


AF071173 


Mus musculus 


Herc2 


349 


82 


2545 


AB000516 


Homo sapiens 


DSIFpl60 


600 


93 


2546 


AL355178 


Homo sapiens 


dJ947L8.1.6 (novel CUB and Sushi 
(SCR repeat) domain protein) 


395 


90 
* 


2547 


U93574 


Homo sapiens 


putative pi 50 


235 


76 


2548 


X80035 


Oryctolagus 
cuniculus 


cysteine rich hair keratin associated 
protein 


99 


31 


2549 


U58088 


Homo sapiens 


Hs-CUL-2 


592 


86 


2550 


L26953 


Homo sapiens 


chromosomal protein 


131 


63 


2551 


G00354 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4435. 


99 


61 


2552 


U39412 


Homo sapiens 


alpha SNAP 


159 


54 


2553 


AF248651 


Homo sapiens 


RNA-binding protein BRUNOL4 


569 


93 


2554 


AJ006519 


Rattus 
norvegicus 


ASIC-beta 


164 


73 


2555 


U93570 


Homo sapiens 


putative pi 50 


169 


52 


2556 


AF1 04328 


Arabidopsis 
thaliana 


cell wall-plasma membrane linker 
protein homolog 


105 


29 


2557 


AB028975 


Homo sapiens 


KJAA 1052 protein 


610 


83 


2558 


AJ245621 


Homo sapiens 


CTL2 protein 


286 


47 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


2559 


AF090944 


Homo sapiens 


PRO0663 


174 


78 


2560 

• ' 


Y 17832 


Human 
endogenous 
retrovirus K 


pol protein 


411 

■ 


63 


2561 


AF 109907 ' 


Homo sapiens 


S164 


279 


44 


2562 


A0956I 

■ 


synthetic 
construct . 


human serum albumin 


680 


91 


2563 


U48359 


Gallus galius 


kinesin light chain 


575 


81 


2564 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


435 


88 


2565 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


623 


89 


2566 


AF 154916 


Chlamydomonas 
reinhardtii 


variable flagellar number protein 


117 


36 


2567 


AF 18 1657 


Drosophila 
meianogaster 


BcDNA.LD34475 


261 


42 


256S 


M12530 


Homo sapiens 


transferrin precursor 


693 


87 


2569 


U96915 


Homo sapiens 


sin3 associated polypeptide pi 8 


651 


93 


2570 


R74205 

■ 


Homo sapiens 

T 


Human death associated protein DAP- 
2. 


285 


27 

■ 


2571 


AL078593 


Homo sapiens 


dJ210Bl.l (KIAA0680) 


107 


37 


2572 

■ 


AF2934Q5 


Phaseolus 
coccineus 


seeoVmicropylar-endothelium-specirlc 
protein 


88 


39 

■ 


2573 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


110 


49 


2574 


AB024601 


Pseudomonas 
aeruginosa 


uridylyl transferase 


120 


37 


2575 


AB050893 


Anadara nodifera 


cytochrome c oxidase subunit I 


111 


79 


2576 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


130 


72 


2577 


M17697 


Homo sapiens 


glutamate dehydrogenase 


369 


61 


2578 


AF045640 


Caenorhabditis 
elegans 


CI 1D2.4 gene product 


264 


43 

- 


2579 


U71382 


Homo sapiens 


OB binding protein- 1 


273 


86 


2580 


AF1 16661 


Homo sapiens 


PR01438 


114 


61 


2581 


AK000496 


Homo sapiens 


unnamed protein product 


153 


55 


2582 


A67510 

■ 


Mus musculus 


MUS MUSCULUS GENOMIC DNA 
CONTAINING N ALLELE OF FV 1 
GENE. 


190 

■ ■ 


38 


2583 


Y44851 


Homo sapiens 


Human CD39-L66 protein. 


205 


97 


2584 


AB029151 


Homo sapiens 


D29 


253 


73 


2585 


AF039023 


Homo sapiens 


Ran-GTP binding protein; RanBP6 


765 


94 


2586 


AF054180 


Homo sapiens 


hematopoietic cell derived zinc finger 
protein 


116 


34 


2587 


AF283645 


Homo sapiens 


folate transporter/carrier 


580 


82 


2588 


AF090930 


Homo sapiens . 


PRO0478 


132 


67 


2589 


AC002339 


Arabidopsis 
thaliana 


putative ABC transporter 


238 


34 


2590 


M34668 


Homo sapiens 

- 


protein tyrosine phosphatase (EC 
3.1.3.48) 


627 


86 


2591 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


141 


63 


2592 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


142 


63 


2593 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


146 


54 


2594 


AF047437 


Homo sapiens 


sperm acrosomal protein 


526 


74 


2595 


U28789 


Mus musculus 


PACT 


528 


71 


2596 


AB002366 


Homo sapiens 


KIAA0368 


615 


83 


2597 


AL049610 


Homo sapiens 


dJ1055C14.2 (KIAA0026 
(transcription factor-like protein 


704 


93 
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- 


JUCSL riJHMJll 
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Score 
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y 






> 


MRGX)) 






2598 


G01279 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5360. 


385 


79 


2599 


AF090895 


Homo sapiens 


PRO0117 


142 


61 


2600 


D29763 


Mus musculus 


seizure-related gene product 6 
precursor 


119 


76 


2601 


U20158 


Homo sapiens 


SLP-76 


108 


53 


2602 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


242 


78 


2603 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


233 


73 


2604 


L24804 


Homo sapiens 


p23 


259 


62 


2605 

■ 


Y12102 


Homo sapiens 


Human 5* EST secreted protein SEQ ID 
NO: 415. 


132 


92 


2606 


U79284 


Homo sapiens 


SEC HI- 


614 


81 


2607 


AF202635 


Homo sapiens 


PP 1200 


105 


47 


2608 


AF277374 


Homo sapiens 


enhancer of polycomb 


254 


85 


2609 


X03484 


Homo sapiens 


raf protein (aa 1-648) 


600 


82 


2610 


U 15637 


Homo sapiens 


CD40 binding protien 


465 


82 


2611 


U40265 

* 


Trypanosoma 
cruzi 


ATPase subunit 6 


98 


31 


2612 


AF090942 


Homo sapiens 


PRO0657 


125 


48 


2613 


V00662 


Homo sapiens 


cytochrome oxidase I 


605 


82 


2614 


U46920 


Homo sapiens 


metaxin 


748 


94 


2615 


Y19757 


Homo sapiens 


SEQ ID NO 475 from W09922243. 


113 


87 


2616 


AK025047 


Homo sapiens 


unnamed protein product 


173 


58 


2617 


D13866 


Homo sapiens 


alpha-catenin 


569 


96 


2618 


U20536 


Homo sapiens 


cysteine protease Mch2 isoform alpha 


588 


87 


2619 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


608 


86 


2620 


AF190168 


Homo sapiens 


serum albumin precursor 


522 


78 


2621 


A06977 


Homo sapiens 


albumin 


607 


93 


2622 


R14584 


Homo sapiens 


TGF beta 1 binding protein encoded by 
clone BPA 13. 


339 


81 


2623 


X56698 


Xenopus laevis 


42Sp48 


117 


47 


2624 


W48351 


Homo sapiens 

• 


Human breast cancer related protein 
BCRB2. 


131 


57 


2625 . 


X07881 


Homo sapiens 


proline-rich protein G 1 


116 


32 


2626 


W48351 

■ 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


89 


58 


2627 


A00279 


synthetic 
construct 


Human serum albumin 


564 


95 


2628 


AK021613 


Homo sapiens 


unnamed protein product 


214 


70 


2629 


M23613 


Homo sapiens 


nucleophosmin 


486 


82 


2630 


AF243424 


Homo sapiens 


SG2NA beta isoform 


256 


98 


2631 


G00506 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4587. 


81 


60 


2632 


A06977 


Homo sapiens 


albumin 


457 


74 


2633 


A06977 


Homo sapiens 


albumin 


563 


93 


2634 


M81088 


Rattus 
norvegicus 


EF-l-alpha 


165 

- 


68 


2635 


AL359587 


Homo sapiens 


hypothetical protein 


496 


82 


2636 


A03758 


Homo sapiens 


serum albumin 


576 


91 


2637 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


127 


39 


2638 


U93563 


Homo sapiens 


putative pl50 


186 


38 


2639 


U68729 


Meloidogyne 
incognita 


cuticle preprocollagen 


113 


34 


2640 


AB033056 


Homo sapiens 


KIAA1230 protein 


269 


94 


2641 


AF116712 


Homo sapiens 


PR02738 


115 


61 


2642 


W48353 


Homo sapiens 


Human breast cancer related protein 


124 


72 
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BCFLT2. 






2643 


G02455 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6536. 


98 


54 


2644 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

mm my « *v 

7519. 


127 


64 


2645 


AK025116 


Homo sapiens 


unnamed protein product 


104 


64 


2646 


X92485 


Plasmodium 
vivax 


pval 

i ■ 


100 

■ 


47 


2647 


AF130089 


Homo sapiens 


PRO2550 


122 


51 


2648 


AF194537 


Homo sapiens 


NAG 13 


279 


83 


2649 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


90 


66 


2650 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


148 


63 


2651 


AF1 16661 


Homo sapiens 


PR01438 


112 


46 


2652 


U63542 


Homo sapiens 


FAP protein 


128 


79 


2653 


AL359782 

i 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


133 


46 


2654 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


139 


72 


2655 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


72 


2656 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct 


145 


51 


2657 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


166 


64 


2658 


B25722 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 9 SEQ ID NO: 1 1 1. 




88 


2659 . 


X92485 

a 


Plasmodium 
. vivax 


pval 

9 


141 


60 


2660 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


68 


68 


2661 

* 


Y30822 

■ 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


96 


59 


2662 


AF090931 


Homo sapiens 


PRO0483 


114 


74 


2663 


G03807 


Homo sapiens 


Human secreted protein; SEQ ID NO: 
7888. 


129 

- - - 


57 


2664 


AF090930 


Homo sapiens 


PRO0478 


170 


78 


2665 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


460 


77 


2666 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


368 


67 


2667 


AF090931 


Homo sapiens 


PRO0483 


90 


88 


2668 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


121 


66 


2669 


G00397 

F i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


169 

■ 


73 


2670 


AF 194537 


Homo sapiens 


NAG 13 


170 


42 


2671 


X86003 


Rattus 
norvegicus 


neuron-derived orphan receptor 

• 


104 

... 


50 


2672 


U63542 


Homo sapiens 


FAP protein 


139 


75 


2673 


AF090930 


Homo sapiens 


PRO0478 


130 


69 


2674 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


161 


75 


2675 


& w- M A m+ ma 

AK024455 


—=rz c 

Homo sapiens 


-+mfm ■ w m. «V M mm • 

FLJ00047 protein 


105 


mi m\ ■ 

53 


2676 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


142 


67 


2677 


AK000496 


Homo sapiens 


unnamed protein product 


110 


67 


2678 


AF1 16715 


Homo sapiens 


PR02829 


151 


78 


2679 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


160 


72 


2680 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


81 
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6613. 






2681 , 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


131 


75 


2682 


U76604 


Homo sapiens 


1 80 kDa bullous pemphigoid antigen 
2/type XVII collagen 


96 


37 


2683 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


85 


55 


2684 


G00407 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
4488. 


127 


80 


2685 


AF090895 


Homo sapiens 


PRO0117 


81 


51 


2686 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


119 


52 


2687 


M64793 


Rattus. 
norvegicus 


salivary proline-rich protein 


122 


41 


2688 


AFO 10144 


Homo sapiens 

* 


neuronal thread protein AD7c-NTP 


145 


71 


2689 


AF 130089 


Homo sapiens 


PRO2550 


149 


78 


2690 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


102 


47 


2691 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


116 


74 


2692 


G03064 


Homo sapiens 


Human secreted protein. SEO ID NO: 
7145. 


130 


60 


2693 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


106 


26 


2694 


AE001158 


Borrelia 
burgdorferi 


conserved hvDOthetical integral 
membrane protein 


81 


37 


2695 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


162 

♦ 


16 


2696 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


100 


2697 


AF194537 


Homo sapiens 


NAG 13 


126 


90 


2698 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


69 


2699 


U93563 


Homo sapiens 


putative p!50 


138 


37 


2700 


AF090928 


Homo sapiens 


PRO0470 


177 


57 


2701 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


75 


2702 


X92485 


Plasmodium 
vivax 


pval 


116 


50 


2703 


AK024455 


Homo sapiens 


FLJ00047 protein 


103 


57 


2704 


D38114 


Gorilla gorilla 


NADH dehydrogenase subunit 2 (ND2) 


137 


81 


2705 


AF090895 


Homo sapiens 


PRO0117 


105 


51 


2706 


AF090944 


Homo sapiens 


PRO0663 


121 


45 


2707 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


104 

• 


57 


2708 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


157 


85 


2709 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


283 


84 


2710 


AF130079 


Homo sapiens 


PR02852 


125 


48 


2711 


M22334 


Homo sapiens 


unknown protein 


131 


50 


2712 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


123 


58 


2713 


S80119 


Rattus sp. 


reverse transcriptase bomolog 


230 


57 


2714 


AF090931 


Homo sapiens 


PRO0483 


98 


85 


2715 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


104 


79 


2716 


L00016 


Homo sapiens 


urf5 


295 


93 


2717 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


135 


82 


2718 


M81321 


Macaca 
fascicularis 


proline-rich protein 


117 


38 
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2719 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


124 


57 


2720 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


147 


56 


2721 


Y02749 


Homo sapiens 

■ 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


132 

■ 


67 

• 


2722 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


129 

- 


56 


2723 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


392 


68 


2724 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


182 


79 


2725 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


195 


92 


2726 


W97293 


Homo sapiens 


An annex in binding protein (NABP-1). 


193 


40 


2727 


U12690 


Homo sapiens 


cytochrome oxidase subunit 11 


547 


85 


2728 


V00662 


Homo sapiens 


cytochrome oxidase I 


500 


92 


2729 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 
sequence. 


110 


56 


2730 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


124 


75 


2731 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


113 


57 


2732 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


149 


51 


2733 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


101 


44 


2734 


Y9I577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


288 


81 


2735 

• 


Z74036 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01391 (Collagen triple helix repeat 
(20 copies)), Score=58.9,E- 
value=3.5e-14,N=3 


117 


37 


2736 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


154 


62 


2737 


AK024455 


Homo sapiens 


FU00047 protein 


136 


60 


2738 


G00577 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


110 


63 


2739 


AP002460 


Arabidopsis 
thaliana 


geneJcLFlD9.26~uiuaiown protein 


363 


90 


2740 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


358 


69 


2741 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


143 


64 


2742 


L27428 


Homo sapiens 


reverse transcriptase 


119 


45 


2743 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


76 


63 


2744 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


155 


79 


2745 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


114 


51 


2746 


Y86248 


Homo sapiens 

• 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


159 


75 


2747 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


162 


79 


2748 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


49 


2749 


AK024455 


Homo sapiens 


FU00047 protein 


165 


69 


2750 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


167 


72 


2751 


M22332 


Homo sapiens 


unknown protein 


208 


46 
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2752 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


125 


56 


2753 


R13556 


Homo sapiens 

• 


Protein encoded downstream of hhc_M 
oncoprotein. 


150 


65 


2754 


AF220264 


Homo sapiens 


MOST-1 


130 


52 


2755 


AK024455 


Homo sapiens 


FLJ00047 protein 


128 


58 


2756 . 


AF130051 


Homo sapiens 


PRO0898 


162 


73 


2757 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


259 


86 


2758 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


124 


58 


2759 


AJ243666 


Homo sapiens 


NICE-5 protein 


118 


84 


2760 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


533 


90 


2761 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


110 


39 


2762 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


257 


88 


2763 


Y73345 


Homo sapiens 


HTRM clone 438283 protein sequence. 


624 


67 


2764 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


156 


45 


2765 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


58 


2766 


X92485 


Plasmodium 
vivax 


pval 


149 


53 


2767 


AF 113685 


Homo sapiens 


PRO0974 


131 


63 


2768 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


109 


56 


2769 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


154 


71 


2770 


X97707 . 


Pongo pygmaeus 
abelii 


stopcodon created by 
posttranscriptionaJ polyadenylation 


185 


87 


2771 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


110 


59 


2772 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


145 


60 


2773 


AF090944 


Homo sapiens 


PRO0663 


118 


62 


2774 


L26953 


Homo sapiens 


chromosomal protein 


112 


58 


2775 


AF090930 


Homo sapiens 


PRO0478 


127 


59 


2776 


AK024455 


Homo sapiens 


FLJ00047 protein 


154 


56 


2777 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


103 


42 


2778 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


101 


55 


2779 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI8. 


131 


64 


2780 


G03172 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7253. 


89 


56 


2782 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


194 


97 


2783 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


124 


68 


2784 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


49 


2785 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


175 


50 


2786 


W58700 


Homo sapiens 


Human ST-1 partial sequence. 


219 


100 


2787 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


137 


90 


2788 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


519 


76 


2789 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


104 


44 


2790 


AF130089 


Homo sapiens 


PRO2550 


121 


39 


2791 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


135 


63 


2792 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


50 
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• * - 



seq n> 

NO* 


Accession r»o. 


Species 


Description 


Smith- 

Wo term o n 

Score 


% 

lUCllUl 

y 








7133. 






2793 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


153 


so 


2794 


D38112 


Homo sapiens 

* 


cytochrome c oxidase subunit 1 


461 


84 


2795 


AF090931 


Homo sapiens 


PRO0483 


114 


76 


2796 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


107 


53 


2797 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 2 


114 


78 


2798 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


115 


71 


2799 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


113 


79 


2800 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


117 


48 


2801 


X92485 


Plasmodium 
vivax 


pval 


97 


80 


2802 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


471 


80 


2803 


AF130051 


Homo sapiens 


PRO0898 


118 


78 


2804 


G03356 

« 


Homo sapiens 


Human secreted protein. SEO ID NO: 
7437. 


78 


73 


2805 


Y02749 , 


Homo sapiens 


Human secreted protein encoded bv 
gene 100 clone HNFIU96. 


143 


69 


2806 


G03415 


Homo sapiens 


Human secreted protein SEO ID NO: 
7496. 


102 


62 


2807 


K02401 


Homo sapiens 


chorionic somatomammotropin 


543 


88 


2808 


Y18522 


Corvus 
frugilegus 


cvtochrome oxidase subunit I 


382 


60 


2809 


G01194 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5275. 


555 


93 


2810 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-l). 


399 


77 


2811 


AF007826 


Homo sapiens 


bax epsilon 


94 


39 


2812 


AF010144 


Homo sapiens 

_ — !Z 


neuronal thread protein AD7c-NTP 


220 


67 


2813 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


495 


81 


2814 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


112 


48 


2815 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


102 


75 


2816 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


437 


83 


2817 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


143 


66 


2818 


X55733 


Homo sapiens 


initation factor 4B 


458 


80 


2819 


G03801 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7882. 


132 


65 


2820 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


347 


68 


2821 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


62 


2822 


R13556 

m 


Homo sapiens 


Protein encoded downstream of hhcJM 
oncoprotein. 


118 


54 


2823 


AF 130079 


Homo sapiens 


PR02852 


143 


54 


2824 


G00328 


Homo sapiens 

r 


Human secreted protein, SEQ ID NO: 
4409. 


140 


52 


2825 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


161 


67 


2826 


AF1 19851 


Homo sapiens 


PR01722 


104 


70 


2827 


D49489 


Homo sapiens 


human P5 


523 


91 


2828 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


206 


81 


2829 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


138 


42 


2830 


AF090931 


Homo sapiens 


PRO0483 


97 


80 


2831 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID T^O: 


144 


65 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 
Score 


% 

Idcntit 

y 








7871. 






2832 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


112 


35 


2833 


X92485 


Plasmodium 
vivax 


pval 

• • 


103 


4 3 


2834 


R32010 


Homo sapiens 


Rpl5-TIA-L 


152 


54 


2835 


G00357 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. . 


153 


59 


2836 


W48351 


Homo sapiens 

• 


Human breast cancer related protein 
BCRB2. 


148 

- 


59 


2837 


AF090931 


Homo sapiens 


PRO0483 


149 


81 


2838 


AF130089 


Homo sapiens 


PRO2550 


123 


49 


2839 


AF090942 


Homo sapiens 


PRO0657 


135 


54 


2840 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


124 


54 


2841 


AF090895 


Homo sapiens 


PRO0117 


146 


55 


2842 


AF132200 


Homo sapiens 


PR01751 


98 


74 


2843 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


194 


75 


2844 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


61 


59 


2845 


AL390114 

. i 


Leishmania 
major 


extremely cysteine/valine rich protein 


171 

- 


37 


2846 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


155 


69 


2847 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


141 


58 


2848 


G02532 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


108 

it 


65 


2849 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


441 


77 


2850 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


490 


88 


2851 


K02401 


Homo sapiens 


chorionic somatomammotropin 


488 


90 


2852 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


110 


49 


2853 


U83280 


Leishmania 
donovani 


39 kDa antigen 


131 


47 


2854 


AF090944 


Homo sapiens 


PRO0663 


173 


72 


2855 


V00662 


Homo sapiens 


ATPase 6 


392 


71 


2856 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


147 


50 


2857 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


113 


76 


2858 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


178 


84 


2859 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


192 


92 


2860 


Z81068 

4 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF00 1 02 (Protein-tyrosine 
phosphatase), Score=232.1, E- 
value=2.6e-66, N=l~cDNA EST 
yk299h63 comes from mis 
gene-cDNA EST yk420b4.3 comes 
from this gene-cDNA EST yk439g6.3 
comes from this gene-cDNA EST 
yk299h6.5 comes from this 
gene-cDNA EST yk420b4.5 comes 
from this gene-cDNA EST yk439g6.5 
comes from this gene 


104 

- 


36 

• 

• 


2861 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


444 


85 


2862 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


168 


61 


2863 


U93564 


Homo sapiens 


putative pi 50 


132 


34 
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i. ■ 



SEQ ID 


' Accession No. 


Species 


Description 


bmitn- 




NO: 




•* 




Wa term an 


Ideotit 






• 




Score 


y 


2864 


W80406 


Homosapiens 


A secreted protein encoded by clone 


152 


50 








dh40 3. 






2865 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


76 


■ 






7878. — 


■ 




2866 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


46 








7253. 






2867 


AF090930 


Homosapiens 


PRO0478 


161 


61 


2868 


Y30822 


Homo sapiens 


Human secreted protein encoded from 


117 


48 




• 




gene 12. 






2869 


U15647 


Mus musculus 


reverse transcriptase 


197 


40 


2870 


U09500 


Homo sapiens 


cytochrome b 


394 


68 


2871 


U09500 


Homo sapiens 


cytochrome b 


614 


92 


2872 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


516 


87 


2873 


D38112 


Homosapiens 


NADH dehydrogenase subunit 4 


458 


84 


2874 


D38112 


Homosapiens 


NADH dehydrogenase subunit 4 


516 


92 


2875 


Y30822 


Homo sapiens 

• 


Human secreted protein encoded from 


87 


55 








gene 12. 






2876 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


624 


96 


2877 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


58 








6619. 


- 




2878 


U32174 


Dictyostelium 


non-receptor tyrosine kinase 


120 


33 






dzscoideum 








2879 


D38U2 


Homo sapiens 


ATPase subunit 6 


141 


54 


2880 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


84 








7124. 


- 




2881 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


93 


79 








encoded by gene 17. 






2882 


V00662 


Hdmo sapiens 


ATPase 6 


130 


87 


2883 


AL390114 


Leishmania 


probable proteophosphoglycan 


117 


50 


- 




major 


- 


- 




2884 


AF1 19851 


Homosapiens 


PR01722 


79 


55 


2885 


X92485 


Plasmodium 


pval 


106 


48 






vivax 




- 




2886 


V00662 


Homo sapiens 


ATPase 6 


127 


87 


2887 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


191 


83 


2888 


G03789 


Homo sapiens ^ 

* 


Human secreted protein, SEQ ID NO: . 


138 


58 




• 


• 


7870. 






2889 


X92485 


Plasmodium 


pval 


107 


63 






vivax 








2890 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


163 


89 


2891 


AF090942 


Homo sapiens 


PRO0657 


72 


37 


2892 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


56 






■ 


4497. 






2893 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


66 








7871. 






2894 


AK024455 


Homo sapiens 


FLJ00047 protein 


139 


57 


2895 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


111 


44 




- 


brucei 


complete virion genome. [ 






2896 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


182 


62 








6613. 


• 


* 


2897 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


73 








7795. 






2898 


D38112 


Homosapiens 


cytochrome c oxidase subunit 3 


248 


53 


2899 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase, subunit) 


366 


68 


2900 


X07882 


Homo sapiens 


Po protein 


146 


41 


2901 


V00662 


Homo sapiens 


cytochrome B 


517 


91 


2902 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


378 


67 


2903 


AF090931 


Homo sapiens 


PRO0483 


137 


67 


2904 


AF003540 


Homo sapiens 


Krueppel family zinc finger protein 


99 


54 



205 



NO: 


Accession pick 




1/ ascription 


Waterman 
Score 


_, — ,. 

/o 

Identit 

y 


2905 


AF130079 


Homo sapiens 


PR02852 


111 


55 


2906 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


427 


79 


2907 


AF202635 


Homo sapiens 


PP1200 


140 


66 


2908 


AF041330 


Bodosaltans 


NADH dehydrogenase subunit 5 


116 


41 


2909 


U63542 


Homo sapiens 


FAP protein 


155 


75 


2910 


D38114 


Gorilla gorilla 


cytochrome c oxidase subunit 1 (COT) , 


446 


72 


2911 


AP002031 


Arabidopsis 
thaliana 


geneJd:K3D20.3~ 

■ 


118 

• 


35 


2912 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


435 


80 


2913 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


469 


83 


2914 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


480 


80 


2915 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


488 


86 


2916 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


519 


80 


2917 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


172 


80 


2918 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


512 


75 


2919 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


458 


79 


2920 


G03797 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


152 


65 


2921 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


324 


87 


2922 


D381L2 


Homo sapiens 


cytochrome c oxidase subunit 3 


530 


84 


2923 


AF157321 


Homo sapiens 


30kDa protein 


370 


68 


2924 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


174 


71 


2925 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


503 


88 


2926 


U47624 


Xenopus laevis 


alpha(E>catenin 


243 


94 


2927 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


135 


47 


2928 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


476 


78 


2929 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


465 


81 


2930 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


151 


58 


2931 

. 


D16480 


Homo sapiens 


enoyl-CoA hydratase/3-hydroxyacyl- 
CoA dehydrogenase alpha-subunit of 
trifunctional protein 


345 


67 


2932 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


212 


95 


2933 


AP000616 


Oryzasativa 

* 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


69 


2934 


X83427 


Ornithorhynchus 
anatinus 


cytochrome c oxidase subunit 1 


460 


82 

• 


2935 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


527 


83 


2936 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


448 


92 


2937 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


183 


77 

• 


2938 


X15081 


Crithidia 
fasciculata 


MURF2 protein (AA 1-348) 

• 


105 


42 


2939 


AF194537 


Homo sapiens 


NAG13 


151 


52 


2940 


AJ271872 


Nicotiana 
sylvestris 


extensin 


165 


37 


2941 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 


91 


2942 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


387 


69 


2943 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


509 


79 


2944 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


108 


95 


2945 


J01415 


Homo sapiens 


MTND4 


429 


76 


2946 


AC010793 


Arabidopsis 
thaliana 


F20B17.16 


99- 


41 


2947 


AF116712 


Homo sapiens 


PR02738 


132 


60 


2948 


X89658 


Homo sapiens CAP- 18 protein 


136 


49 
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SEQ ID 
NO: 


* Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Identit 

y 


2949 


V00662 


Homo sapiens 


cytochrome oxidase I 


449 


86 


2950 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


438 


82 


2951 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


363 


76 


2952 

*• 


AP000616 


Qryza sativa 

- 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


92 


2953 


X56015 


Crithidia 
oncopelti 


NADH dehydrogenase subunit 5 


102 


32 


2954 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


110 


68 


2955 


Y76287 

• 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 20. 


345 

■ 


78 

• 


2956 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


340 


71 


2957 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


456 


83 


2958 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


102 


76 


2959 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


96 


84 


2960 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


400 


70 


2961 


X69978 


Homo sapiens 


XP-G factor 


539 


88 


2962 


AF2036?7 


Homo sapiens 


prolactin regulatory element-binding 
protein 


453 


77 


2963 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


561 


83 


2964 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


187 


92 


2965 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


119 


63 


2966 


U52077 


Homo sapiens 


mariner transposase 


469 


83 


2967 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


130 


53 


2968 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


103 


42 


2969 


X92485 


Plasmodium 
vivax 


pval 


144 


42 


2970 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


127 


63 


2971 


G03714 

• 


Homo sapiens 

. 


Human secreted protein, SEQ ID NO: 
7795. 


129 


63 


2972 


Y17221 


Homo sapiens 


Human secreted protein (clone fk317- 
3). 


91 


56 


2973 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


101 


72 


2974 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


130 

• 


68 


2975 


AF130089 


Homo sapiens 


PRO2550 


167 


46 


2976 


V00662 


Homo sapiens 


cytochrome oxidase I 


539 


79 


2977 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


121 


44 


2978 


Y86248 


Homo sapiens _ 

- 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


126 


72 


2979 


U93568 


Homo sapiens 


putative pl50 


122 


30 


2980 


M12099 


Mus musculus 


proline-rich protein 


119 


39 


2981 


Z38128 


Mus musculus 


histone HI 


117 


36 


2982 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


172 


51 


2983 


AC002291 


Arabidopsis 
thaliana 


extensin 


110 


35 


2984 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


295 


92 


2985 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


171 


65 
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SEQ1D 


Accession No. 


Species 


Description 


omitn- 


A> 


NO: 








Wattrman 


Identit 










Score 


y 


2986 


U 18985 


Homo sapiens 


triad in 


416 


83 


2987 


AF 155232 


Pisum sativum 


ex tens in 


159 


41 


2988 


X03145 


Homo sapiens 


pot ORF V 


133 


44 


2989 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


565 


~84 


2990 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


53 








7870. 


• 




2991 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


538 


83 


2992 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


471 


75 


2993 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


511 


72 


2994 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


301 


79 


2995 


D38112 


Homo sapiens 

* 


cytochrome c oxidase subunit 1 


526 


78 


2996 


U97674 


Mesocricetus 


cytochrome c oxidase chain I 


634 


82 






auratus 








2997 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68,. 


158 


76 








SEQIDNO:163. 




* 


2998 

• 


X77816 


Rattus 


PR-Vbetal 


103 


50 






norvegicus 








2999 


V00662 


Homo sapiens 


cytochrome oxidase I 


535 


78 


3000 


L38908 


Nicotiana 


extensin 


146 


38 






tabacum 








3001 


D38112 . 


Homo sapiens 


NADH dehydrogenase subunit 2 


494 


87 


3002 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


111 


66 








SEQIDNO:163. 






3003 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


117 


46 


3004 


AL359782 


Trypanosoma 


probable similar to ring-h2 ringer 


93 


47 






brucei 


protein rhala. 


i 




3005 

■mr \f \j ^0 


U43627 


Arabidopsis 


extensin 


118 


37 






thaliana 








3006 


AF1 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


150 


44 


3007 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


73 








7637. 






3008 


W54966 


Homo sapiens 


Synthetic human type III collagen 


111 


39 








SYN-C3. 






3009 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


84 




■ 




7124. 






3010 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


212 


77 








SEQ ID NO: 163. 






3011 


X65718 


Prunus dulcis 


extensin 


129 


42 


3012 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


71 


76 








4435. 






3013 


Z38128 


Mus mus cuius 


histoneHl 


104 


32 


3014 


X56010 


Sorghum bicolor 


hydroxyproline-rich glycoprotein 


102 


34 


3015 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


128 


80 








SEQIDNO:163. 






3016 


Y34068 


Homo sapiens 


Histone HI isoform H1.5 pANCA- 


99 


34 








reactive fragment (residues 69-226). 






3017 


X55685 


Lycopersicon 


extensin (class I) 


142 


32 






esculentum 








3018 


AJ133050 


Panulirus argus 


cytochrome c oxidase subunit I 


125 


71 


3019 


G03597 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 


175 


68 








7678. 






3020 


U93564 


Homo sapiens 


putative p 150 


97 


57 


3021 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


555 


82 


3022 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


76 








4446. 






3023 


U97674 


Mesocricetus 


cytochrome c oxidase chain I 


632 


82 






auratus 








3024 


AF155232 


Pisum sativum 


extensin 


181 


42 


3025 


V00662 


Homo sapiens 


cytochrome oxidase I j 


567 


85 
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• 



SEQU> 
NO: 


Accession No. 


Species 

• 


Description 


Smith. 

Waterman 

Score 


% 

1UCUUI 

y 


3026 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


130 


58 


3027 

- 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 
product 


141 


67 

. 


3028 


AF130089 


Homo sapiens 


PRO2550 


149 


55 


3029 


AF090944 


; Homo sapiens 


PRO0663 


168 


76 


3030 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


122 


92 


3031 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


507 


91 


3032 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


351 


87 


3033 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


99 


75 


3034 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


113 


85 


3035 


U93567 


Homo sapiens 


putative pl50 


173 


41 


3036 


X92485 


Plasmodium 
vivax 


pval 


122 


41 


3037 


AF202635 


Homo sapiens 


PP1200 


128 


67 


3038 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. . 


151 


83 


3039 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


415 


74 




D86853 


Catharanthus 
roseus 


extensin 


198 


40 


3041 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


110 


47 


3042 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


377 


95 


3043 


S80119 


Rattus sp. 

* 


reverse transcriptase homolog 

Mr w 


106 


50 


3044 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


137 


55 


3045 


Y36156 


Homo sapiens 


Human secreted protein #28. 


91 


58 


3046 


U93574 


Homo sapiens 


putative pi 50 


177 


49 


3047 


AF063866 


Melanoplus 
sanguinipes 
entomopoxvirus 


ORF MSV233 hypothetical protein 

J Mr Xr . 


98 


37 


3048 


D38U2 


Homo sapiens 

At 


cytochrome c oxidase subunit 3 


568 


87 


3049 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


144 


63 


3050 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


80 


3051 


AK027208 


Homo sapiens 


unnamed protein product 


145 


72 


3052 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


166 


75 

■ 


3053 


AF 118082 


Homo sapiens 


PRO1902 


84 


50 


3054 


G04072 


Homo sapiens ^ 

* 


Human secreted protein, SEQ ID NO: 
8153. 


118 


41 


3055 


X92485 


Plasmodium 
vivax 


pval 


132 


52 


3056 


G00403 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4484. 


117 


74 


3057 


AL390114 

• 


Leishmania 
major 


extremely cysteine/valine rich protein 


99 


72 


3058 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


627 


89 


3059 


G00332 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4413. 


98 


57 


3060 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


100 


38 


3061 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


80 


3062 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


529 


80 


3063 


AF 13005 1 


Homo sapiens 


PRO0898 


159 


71 


3064 


AF195418 


Mus museums 


OD23 


386 


94 


3065 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


109 


51 
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NO: 


1 Accession No. 
cession No. 


Snecies 


D£&£ri ii ti nn 


Smith- 

Waterman 

Score 


Ideotit 

y 








clone HCACJ81. 






3066 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


338 


80 


3067 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
-protein rhala. 


120 


50 


3068 


U15647 


Mus musculus 


reverse transcriptase 


134 


41 


3069 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


397 


92 


3070 


AF130089 


Homo sapiens 


PRO2550 


149 


55 


3071 


U93565 


Homo sapiens 


putative p 150 


143 


37 


3072 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


112 


45 


3073 


AF090930 


Homo sapiens 


PRO0478 


135 


78 


3074 


AF1 30089 


Homo sapiens 


PRO2550 


156 


58 


3075 


Y86472 

■ ■ 


Homo sapiens 

- 


Human gene 52-encoded protein 
fragment, SEQ ID NO:3 87. 


267 


65 


3076 


U83280 


Leishmania 
donovani 


39 kDa antigen 


98 


51 


3077 


D381L2 


Homo sapiens 


cytochrome c oxidase subunit 3 


626 


86 


3078 


AF1 16712 


Homo sapiens 


PR02738 


1 14 


59 


3079 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


112 


46 , 


3080 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


492 


80 


3081 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


523 


86 


3082 


D381I2 


Homo sapiens 


cytochrome c oxidase subunit 3 


515 


79 


3083 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


150 


41 


3084 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


191 

»' 


97 


3085 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


537 


78 


3086 


D38112 


Homo sapiens 


ATPase subunit 6 


300 


58 


3087 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


104 


37 


3088 


K02247 


Rattus 
norvegicus 


proline-rich protein 


108 


41 

■ 


3089 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


130 


48 


3090 


V00662 


Homo sapiens 


! cytochrome oxidase I 


597 


84 


3091 


M18094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


221 


43 


3092 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


392 


59 


3093 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


593 


84 


3094 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


471 


76 


3095 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


369 


98 


3096 


L26953 


Homo sapiens 


chromosomal protein 


115 


57 


3097 


G00591 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4672. - 


124 


57 


3098 


Y86248 


Homo sapiens 

- 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


152 


91 


3099 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


153 


96 


3100 


AC006233 


Arabidopsis 
thaliana 


hypothetical protein 


65 


44 


3101 


AF289098 


Cladrastis 
kentukea 


ENOD2 


107 


35 


3102 


AF155232 


Pisum sativum 


extensin 


142 


43 


3103 


AF130089 


Homo sapiens 


PRO2550 


122 


81 


3104 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


143 


80 


3105 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


104 


59 
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SEQID 
NO: 


Accession No. 


Species 


Description 

• 


Smith- 

Waterman 

Score 


Identit 

y 


3106 


Y86248 


Homo sapiens 

• 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


112 

- 


76 


3107 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


117 


72 


3108 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


334 


85 


3109 


U43627 

■ 


Xrabidopsis 
thaliana 


extensin 


112 


30 


3110 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


118 


100 


3111 


AF130O89 


Homo sapiens 


PRO2550 


137 


37 


3112 


Y86248 

- 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


220 


93 


3113 


A18812 


Brassica napus 


extensin 


128 


32 


3114 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


176 


63 


3115 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


92 


69 


3116 


X92485 


Plasmodium 
vivax i 


pval 

■ 


148 


45 


3117 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


574 


80 


3118 


U23172 

• 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


125 

• 


41 

• 


3119 


U43627 


Arabidopsis 
thaliana 


extensin 


129 

i ■ 


32 


3120 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


171 


82 


3121 


Y12950 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


90 


59 


3122 


AF130O51 


Homo sapiens 


PRO0898 


134 


82 


3123 


M81321 


Macaca 
fascicularis 


proline-rich protein 


158 


48 


3124 


U93563 


Homo sapiens 


putative pi 50 


126 


32 


3125 


D38114 


Gorilla gorilla 


NADH dehydrogenase subunit 1 (NDI) 


337 . 


92 


3126 


AF130089 


Homo sapiens 


PRO2550 


155 


77 


3127 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:I63. 


162 


76 


3128 


AF090901 


Homo sapiens 


PRO0195 


109 


53 


3129 


AF003736 


Murine leukemia 
virus 


reverse transcriptase 


164 


40 


3130 


U25281 


Rattus 
norvegicus 


SH3 domain binding protein 


88 


39 


3131 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


116 


71 


3132 


D86853 


Catharanthus 
roseus 


extensin 


110 


33 


3133 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


506 


76 


3134 


V00662 


Homo sapiens 


cytochrome oxidase I 


583 


88 


3135 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


68 


3136 


AJ242540 


Volvox carted £ 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


282 


60 


3137 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


119 


54 


3138 


AF 118082 


Homo sapiens 


PRO1902 


118 


51 


3139 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


270 


87 


3140 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


103 


48 
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SEQ IB 
NO: 


^Accession No. 


> 


Dtscrtntinn 


Smith- 

w »* mm UJ 

Waterman 
Score 


IdentJt 

y 


3141 


D38112 


Homo sapiens 


NADH dehydrogenase sub unit 5 


353 


74 


3142 

. ■ 


G02485 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
6566. 


126 


77 


3143 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


"115 


72 


3144 


AF090930 


Homo sapiens 


PRO0478 


138 


73 


3145 


AF 155232 


Pisum sativum 


extensin 


110 


34 


3146 


M69008 


Homo sapiens 


alpha- 1 typeXIII collagen 


107 


37 


3147 

■ 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


103 


63 


3148 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


505 


84 


3149 


M77194 


Rat leukemia 
virus 


polymerase 


167 

• 


35 


3150 


AP002543 


Arabidopsis 
thaliana 


gb|AAD23015. l~genejd:F15M7. 1 6~si 
milar to unknown protein 


105 


31 


3151 


AK024455 


Homo sapiens 


FLJ00047 protein 


109 


59 


3152 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


89 


39 


3153 


J04794 


Homo sapiens 


aldehyde reductase (EC 1 . 1 . 1 .2) 


137 


58 


3154 


G03714 

* 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
7795. 


115 


70 


3155 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


160 


53 


3156 


X92485 


Plasmodium 
vivax 


pval 


104 


50 


3157 


AF 194537 


Homo sapiens 


NAG13 


207 


60 


3158 


AF063693 


Mus musculus 


type XIII collagen 


104 


37 


3159 


AP000616 


Oryza sativa 

* 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


130 


91 


3160 


AF020191 


Mus musculus 


proline-rich protein 13 


107 


51 


3161 


AF1 94537 


Homo sapiens 


NAG13 


170 


76 


3162 


G02535 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6616. 


103 


41 

■ 


3163 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


161 


58 


3164 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


139 


46 


3165 


U93570 


Homo sapiens 


putative pi 50 


151 


68 


3166 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


144 


71 


3167 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


147 


66 


3168 


L27428 


Homo sapiens 


reverse transcriptase 


94 


51 


3169 


X97675 


Homo sapiens 


plakophilin 2b 


103 


76 


3170 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


54 . 


3171 


AK024455 


Homo sapiens 


FLJ00047 protein 


98 


61 


3172 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


146 


38 


3173 


D38116 


Pan paniscus 


NADH dehydrogenase subunit 4 


467 


81 


3174 


X92485 

■ 


Plasmodium 
vivax 


pval 


88 

, 


69 


3175 


L26953 


Homo sapiens 


chromosomal protein 


124 


78 


3176 


AF042169 


Homo sapiens 


putative ATP-dependent mitochondrial 
RNA helicase 


223 


95 


3177 


AF130079 


Homo sapiens 


PR02852 


96 


44 


3178 


AF273217 


Mus musculus 


cell proliferation related protein CAP 


97 


36 


3179 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


90 


69 


3180 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


146 


78 


3181 


AF1 18082 


Homo sapiens 


PRO1902 


150 


58 


3182 


U60315 


Molluscum 
contagiosum 


MC107L 


101 


44 



212 



WO 01/64835 PCT/US01/04927 



NO: 




* 




O I'll ill™ 

Waterman 
Score 


/© 

Identit 

y 






virus subtype 1 








3183 


U49973 


Homo sapiens 

* 


ORF1; MER37; putative transposase 
similar to pogo element 


148 

- 


61 


3184 


U93565 


Homo sapiens 


putative pl50— 


265 


58 


3185 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


117 


79 


3186 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


167 


39 


3187 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


160 


63 


3188 


S80U9 


Rattus sp. 


reverse transcriptase homolog 


152 


49 


3189 


X05300 


Rattus 
norvegicus 


ribophorin I 


122 


46 


3190 


B01372 


Homo sapiens 


Neuron-associated protein. 


106 


80 


3191 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


83 


3192 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


135 


80 


3193 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


98 


34 


3194 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


140 


87 


3195 


AF022985 


Caenorhabditis 
elegans 


Similar to collagen 


97 


40 


3196 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


452 


81 


3197 


G03411 

- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


110 

i>. 


58 


3198 


X99452 


Lycopersicon 
esculentum 


extensin-like protein Dif54 


100 


36 


3199 


D86853 


Catharanthus 
roseus 


extensin 


119 


36 


3200 


AF025424 

i 


Rattus 
norvegicus 


RNA polymerase 1 127 kDa subunit 


215 


74 


3201 


U93563 


Homo sapiens 


putative pi 50 


359 


77 


3202 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


112 


37 


3203 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


153 


84 


3204 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


128 


39 


3205 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


154 


76 


3206 


Y0115S 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


127 


64 


3207 


X97675 


Homo sapiens 


plakophilin 2b 


142 


51 


3208 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


91 


58 


3209 

* 


AB026512 


Ecnomiosa sp. 
Ecnl 


cytochrome c oxidase subunit I 


254 


62 


3210 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


163 


59 


3211 


D38112 


Homo sapiens 


ATPase subunit 6 


447 


78 


3212 


V00662 


Homo sapiens 


ATPase 6 


482 


83 


3213 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


151 


66 


3214 


D38112 


Homo sapiens 


ATPase subunit 6 


400 


72 


3215 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


569 


88 


3216 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


472 


82 


3217 


AF090930 


Homo sapiens 


PRO0478 


96 


64 


3218 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


46 
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SEQ IB 

NO: 


' Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 

• 




< 




7253. 






3219 


AF006061 


Homo sapiens 

* 


placental growth hormone isoform 
bGH-V3 


290 


68 


3220 


G00365 


Homosapiens 
* 


Human secreted protein; SEQ ID NO: 
4446. 


115 


63 


3221 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


441 


71 


3222 


G00354 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO. 
4435. 


111 


59 


3223 


D38112 


Homo sapiens 

* 


cytochrome c oxidase subunit 3 


497 


81 


3224 


L26953 


Homo sapiens 


chromosomal protein 

* 


95 


59 


3225 


U83280 


Leishmania 
donovani 


39 kDa antigen 


117 


90 


3226 


AF239615 


Manihot 
esculenta 


CRANTZ hydroxyproline-rich 
glycoprotein 


124 


43 


3227 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


442 


74 


3228 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


172 


67 


3229 


X92591 


Mus musculus 


transcription factor 


109 


43 


3230 


G02480 


Homo sapiens 


Human secreted protein, SEQ JD NO: 
6561. 


81 


54 


3231 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 1 


476 


72 


3232 


Y3 63 66 


Homo s aniens 


Fragment of human secreted n note in 
encoded by gene 3. 


100 

A W 


54 


3233 


U96416 

■. 


Dennvus 

distinctus 

timjonesi 


cytochrome b 

p 


95 


40 


3234 


Y86248 

* 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


161 


86 


3235 


M76546 


He li an thus 
annuus 


hydroxvproline-rich protein 


101 


40 


3236 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


149 


75 


3237 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


532 


76 


3238 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


576 


89 


3239 


U93567 


Homo sapiens 


p40 


155 


32 


3240 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


79 


3241 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


145 


65 


3242 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


76 


62 


3243 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


138 


69 


3244 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


115 


39 


3245 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


138 


54 


3246 


G02211 


Homo sapiens 

JT 


Human secreted protein, SEQ ID NO: 
6292. 


153 


69 


3247 


U58736 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


107 


37 


3248 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


148 


45 


3249 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


113 


85 


3250 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


166 


61 


3251 


U15183 


Mycobacterium 
leprae 


proline-rich antigen 


109 


40 


3252 


AF130089 


Homo sapiens 


PRO2550 | 151 


73 
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SEQ ID 
NO: 


Accession No. 


Species 

* 


Description 


Smitb* 

Waterman 

Score 


% 

Id en tit 

y 


3253 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


58 


3254 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


162 


80 


3255 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


108 


57 


3256 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


156 


88 


3257 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


206 


93 


3258 


Y36203 


Homo sapiens 


Human secreted protein #75. 


III 


77 


3259 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


98 


57 


3260 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


345 


67 


3261 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


97 


56 


3262 


L38908 


Nicotiana 
tabacum 


extensin 


155 


40 


3263 


Y36366 

* 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


95 


70 


3264 


Y86248 , 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


198 


87 


3265 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


115 


69 


3266 


G03807 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7888. 


158 


59 


3267 


Y36366 


Homo sapiens 
* 


Fragment of human secreted protein 
encoded by gene 3. 


105 


56 


3268 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


159 


72 


3269 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


200 


95 


3270 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


138 


73 


3271 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. . 


151 


90 


3272 


Y86248 


Homo sapiens 

■ 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


193 


95 


3273 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


111 


52 


3274 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


87 


3275 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 


86 


3276 


S62928 


Homo sapiens 


PRB1M protein precursor 


104 


34 


3277 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


154 


55 


3278 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


3279 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


212 


91 


3280 


Y86248 


Homo sapiens 

• 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


118 


50 

- 


3281 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


130 


46 


3282 


M10546 


Homo sapiens 


cytochrome oxidase I 


303 


95 


3283 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


455 


81 


3284 


U97674 


Mesocricetus 
auraius 


cytochrome c oxidase chain I 


502 


83 


3285 


AF1 18086 


Homo sapiens 


PRO 1992 


99 


88 


3286 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


489 


83 
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Accession No. 


Species 

.«> 


Description 


Smitb- 

Wfltfrnan 

tTUISI UlttU 

Score 


% 

Identit ' 

EU CI1UI 

y 


3287 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


485 


88 


3288 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


73 


3289 


AL359782 


Trypanosoma 
brucei 


possible (hh v-6) u 1 1 02, variant a dna, 
complete virion genome. 


158 


60 


3290 


X92485 

■ 


Plasmodium 
vivax 


pval 


99 


58 


3291 


AF1 16712 


Homo sapiens 


PR02738 


102 


51 


3292 


AF119851 


Homo sapiens 


PR01722 


99 


86 


3293 


AF090930 


Homo sapiens 


PRO0478 


141 


72 


3294 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


151 


66 


3295 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


107 


41 


3296 


AK024455 


Homo sapiens 


FLJ00047 protein 


97 


53 


3297 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


156 


60 


3298 


AC007654 


Arabidopsis 
thaliana > 


T19E23.7 


97 


49 


3299 


Y17221 

1 


Homo saoiens 


Human secreted protein (clone fk317- 
3). 


96 


44 


3300 


AJ243905 


Caenorhabditis 
elegans 


SF1 protein 


104 


42 


3301 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


96 


41 


3302 


X89453 


Rattus 
norvegicus 


DRPLA 


83 

n 


65 


3303 


U 10099 


Homo sapiens 


POM-ZP3 


227 


52 


3304 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


553 


89 


3305 


L27428 


Homo sapiens 


reverse transcriptase 


144 


43 


3306 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


388 


71 


3307 


AF217374 

* 


Acanthaster 
planci 


cytochrome oxidase subunit I 


439 


78 


3308. 


D38112 


Homo sapiens 

* 


cytochrome c oxidase subunit 1 


439 


82 


3309 


D38112 


Homo sapiens 

* 


cytochrome c oxidase subunit 1 


492 


88 


3310 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


81 


65 


3311 


X53581 


Rattus 
norvegicus 


ORF4 


140 


48 


3312 


V00662 


Homo sapiens 


cytochrome B 


487 


88 


3313 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


451 


80 


3314 


V00662 


Homo sapiens 


cytochrome oxidase I 


524 


88 


3315 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


518 


87 


3316 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


115 


50 


3317 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


356 


69 


3318 


U12693 


Homo sapiens 


cytochrome oxidase subunit II 


477 


78 


3319 


Z67990 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01484 (Nematode cuticle collagen 
N-terminal domain), Score=27.5, E- 
value=0.0001,N=l 


75 


40 

■ 


3320 


GO 1931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


126 


75 


3321 


AF090930 


Homo sapiens 


PRO0478 


166 


65 


3322 


X92485 


Plasmodium 
vivax 


pval 


104 


44 


3323 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 j 


513 


80 


3324 


B03148 


Homo sapiens 


Human neuronal differentiation factor- [ 
1 (NDF-1). 


483 


79 
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SKAJ XI* 

NO: 


^ A, HAAM»lAf1 Ma 

Accession no. 


opecies 

- 




Smith. 

Waterman 
Score 


Ideotit 

y 


3325 


AF130089 


Homo sapiens 


PRO2550 


121 


88 


3326 


AC026234 


Unknown 


Contains weak similarity to an 
unknown protein T23E1 8.5 


191 


66 


3327 


V00662 


Homo sapiens 


cytochrome B 


439 


72 


3328 


U12690 


Homo sapiens 


cytochrome oxidase subunit 11 


316 


75 


3329 


AF091711 


Homo sapiens 


splice variant AKAP350 


106 


52 


3330 


U93565 


Homo sapiens 


putative pi 50 


125 


45 


3331 


AF1 19851 


Homo sapiens 


PR01722 


88 


72 


3332 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


489 


81 


3333 


V00662 


Homo sapiens 


cytochrome oxidase I 


544 


90 


3334 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


475 


80 


3335 


Y14482 


Homo sapiens 

■ 


Fragment of human secreted protein 
encoded by gene 17. 


168 

• 


66 


3336 


AF146688 


Takifugu 
nibripes 


serine-threonine kinase 9 


101 


43 


3337 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


101 


50 


3338 

• 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


110 


50 


3339 


V00662 , 


Homo sapiens 


cytochrome oxidase I 


458 


80 


3340 


K02401 


Homo sapiens 


chorionic somatomammotropin 


209 


97 


3341 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


112 


52 


3342 


X65165 


Volvox carteri 


extensin 


146 


41 


3343 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


119 


51 


3344 


V00662 


Homo sapiens 


URF 1 (NADU dehydrogenase subunit) 


559 


84 


3345 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


508 


89 


3346 


Ml 1897 


Mus musculus 


proline-rich salivary protein 


96 


41 


3347 


U92455 • 


Mus musculus 


WW domain binding protein 7; WBP7 


113 


38 


3348 


U83280 


Leishmania 
donovani 


39 kDa antigen 


105 


40 


3349 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


113 


38 


3350 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


542 


79 


3351 


B03148 


Homo sapiens 


Human neuronal differentiation fector- 
1 (NDF-1). 


664 

• ■ 


88 


3352 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


117 


85 


3353 


A18812 


Brassica napus 


extensin 


103 


31 


3354 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


125 


78 


3355 


M81321 


Macaca 
fascicularis 


proline-rich protein 


107 


44 . 


3356 


M 10546 


Homo sapiens 


cytochrome oxidase I 


284 


90 


3357 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


513 


80 


3358 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


541 


79 


3359 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


543 


87 


3360 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


563 


86 


3361 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


179 


57 


3362 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


125 


62 


3363 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


151 


46 


3364 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


569 


82 


3365 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


102 


40 


3366 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


143 


76 


3367 

■ 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


145 


84 
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NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

IdenHt 

y 


3368 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


587 


87 


3369 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


549 


82 


3370 


AL035526 


Arabidopsis 
tha Liana 


extensih-like protein 


130 


39 


3371 


J04543 


Homo sapiens 


synexin 


101 


40 


3372 


Z29573 


Didelpbis 
virginiana 


cytochrome c oxidase subunit 3 


154 


71 


3373 


U93568 


Homo sapiens 


putative pi 50 


176 


59 


3374 


L28748 


Bos taurus 


putative 


146 


67 


3375 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


512 


75 


3376 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


509 


78 


3377 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


135 


51 


3378 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


557 

• 


82 


3379 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


120 


35 


3380 


X92485 


Plasmodium 
vivax < 


pval 


124 


38- 


3381 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


110 


78 


3382 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF.. 


108 


67 


3383 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


115 


86 


3384 


X98296 


Homo sapiens 


ubiquitin hydrolase 


238 


83 


3385 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain 1 


494 

n. 


75 


3386 


Y86472 

■ 


Homo sapiens n 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


155 


66 


3387 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein thai a. 


116 


65 


3388 


W73624 


Homo sapiens 


Human secreted protein clone aw92_l. 


327 


62 


3389 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


141 


76 


3390 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


73 


61 


3391 


U35730 


Mus musculus 


jerky 


152 


30 


3392 


U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


100 


41 


3393 


U97674 


Mesocricetus . 
auratus 


cytochrome c oxidase chain I 


483 


77 


3394 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


539 


81 


3395 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


608 


87 


3396 


X71602 

■ 


Nicotiana 
tabacum 


extensin 

■ 


104 


35 


3397 


L26953 


Homo sapiens 


chromosomal protein 


100 


60 


3398 


AF1 97832 


Cyanocitta 
cristata 


cytochrome oxidase I 

■ 


488 


76 


3399 


X53581 


Rattus 
norvegicus 


ORF4 


140 


42 


3400 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


597 


85 


3401 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


656 


89 


3402 


L26953 


Homo sapiens 


chromosomal protein 


317 


63 


3403 


AF052298 


Drosophila 
silvestris 


Y box protein 


114 


30 


3404 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


133 


53 


3405 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


65 


3406 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


277 


82 


3407 


U01849 


Trypanosoma j ORF2 


94 


30 



218 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


'Accession No. 
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Description 
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Score 


% 
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y 






brucei * 








3408 

■ 


G00328 


Homo sapiens 

• • 


Human secreted protein, SEQ ID NO: 
4409. 


115 


65 


3409 

• 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ1DN0:163. 


189 


86 


3410 


U93570 


Homo sapiens 


putative pi 50 


233 


52 


3411 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


104 


44 


3412 


AF090944 


Homo sapiens 


PRO0663 


97 


56 


3413 


X61048 


Hydra sp. 


mini-collagen 


128 


44 


3414 


AFU8086 


Homo sapiens 

* 


PR01992 


128 


81 


3415 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


145 


61 


3416 


AF1 16695 


Homo sapiens 


PR02221 


169 


69 


3417 


U31778 


Human 

papillomavirus 
type 20 


putative 

* 


107 


50 


3418 


X92485 


Plasmodium 
vivax 


pval 

* 


97 

< 


54 


3419 


U12690 , 


Homo sapiens 


cytochrome oxidase subunit II 


195 


94 


3420 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


144 


69 


3421 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


139 


71 


3422 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


111 


63 


3423 


X15917 

< 


Paramecium 
aurelia 


ORF4 protein (AA 1-156) 


93 


41 


3424 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


99 


69 


3425 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


115 


60 


3426 


Y14486 


Homo sapiens 


cytosolic serine 
hydroxymethyltransferase 


196 


61 


3427 


U93569 


Homo sapiens 


putative p 1 50 


110 


44 


3428 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


120 


67 


3429 


ABO 12223 


Canis familiaris 


ORF2 


169 


44 


3430 


X97675 


Homo sapiens 


plakophiiin 2b 


118 


60 


3431 


AF220264 


Homo sapiens 


MOST-1 


153 


72 


3432 


X64173 

■ 


Zea 

diploperennis 


hydroxyproline-rich glycoprotein 


104 


• 


3433 


AF1 13685 


Homo sapiens 


PRO0974 


104 


72 


3434 


AB002306 


Homo sapiens 


KIAA0308 


282 


83 


3435 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


113 


83 


3436 


AB012223 


Canis familiaris 


ORF2 


108 


43 


3437 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


65 


3438 


U52077 


Homo sapiens 


mariner transposase 


381 


64 


3439 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


425 


84 


3440 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


63 


3441 


G02624 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6705. 


90 


78 


3442 


Y36156 


Homo sapiens 


Human secreted protein #28. 


126 


'71 


3443 


AF191032 


Myxine 
glutinosa 


RING3 


84 


63 


■# ■ I I 


W48927 


Homo sapiens 


SchwannomiD-binding protein C- 
terminal fragment 


141 


57 


3445 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


88 


65 



219 



WO 01/64835 PCT7US01/04927 



cca in 


Af*f*0CStAt1 Na 






Smith- 




NO: 






Waterman 


Identit 










Score 


y 


3446 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


157 


67 








clone HPMBQ32. 


■ 




3447 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


281 


86 


3448 


AF1 12481 


Homo sapiens 


RAD54B protein 


392 


87 


344S> 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


55 








4578. 


■ 




3450 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


133 


90 








SEQIDN0.163. 


• 




3451 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


418 


82 


3452 


AF130079 


Homo sapiens 


PR02852 


110 


80 


3453 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


143 


73 




- 




gene No. 107. 






3454 


AB032254 


Homo sapiens 


bromodomain adjacent to zinc finger 


447 


85 








domain 2A 




• 


3455 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


62 








6561. 






3456 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


68 








7868. 






3457 


D38112 


Homo sapiens 1 


cytochrome c oxidase subunit 3 


545 


84 


3458 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


83 








7124. 




• 


3459 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


277 


80 


3460 


AK000867 


Homo sapiens 


unnamed protein product 


474 


98 


3461 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


123 


52 








gene No. 101. 






3462 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


89 


30 


3463 


U93568 


Homo sapiens 


putative pi 50 


100 


46 


3464 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


156 


43 






major 








3465 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


340 


88 


3466 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


119 


100 






• 


SEQ IDNO:163. 






3467 


B01372 


Homo sapiens 


Neuron-associated protein. 


142 


75 


3468 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


103 


40 




■ 




SEQIDNO:163. 






3469 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


80 




- 




4438. 






3470 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 


207 


89 








haemoglobin gene contained in pSSl. 






3471 


AF1 16661 


Homo sapiens 


PR01438 


119 


47 


3472 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


59 






■ 


4497. 




. 


3473 


AF 130089 


Homo sapiens 


PRO2550 


108 


86 


3474 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


80 








4484. 


• 


• 


3475 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


590 


79 


3476 


AF 11 8086 


Homo sapiens 


PR01992 


126 


69 


3477 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


47 








4770. 






3478 


J01415 


Homo sapiens 


MTND4 


482 


78 


3479 


X03404 


Bos taurus 


alpha subunit (aa 1-394) 


583 


89 


3480 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


514 


84 


3481 


G03628 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


242 


80 








7709. 






3482 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


61 








4488. 






3483 


X83427 


Ornithorhynchus 


cytochrome c oxidase subunit 1 


508 


78 






anatinus 








3484 


Y07754 


Homo sapiens 


Human secreted protein fragment 


549 


93 
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» 
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* ■ ■ ■ 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 




i 


* 


encoded from gene 1 1 . 






3485 


X97567 


Homo sapiens 


porl 


388 


69 


3486 


X97567 


Homo sapiens 


porl 
* 


608 


84 


3487 


K02401 


Homo sapiens 


chorionic somatomammotropin 


641 


93 


3488 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


135 


71 


3489 


S62941 

• 


Homo sapiens 

• 


Ps 2=basic proline-rich protein(PRBlL 
precursor protein=basic proline-rich 
proteins (Ps, PmF, PmS, and Pe) 
precursor) {C-terminal} 


116 

• 


36 


3490 


S74728 


Homo sapiens 


antiquitin=26g turgor protein homolog 


549 


84 


3491 


L13329 


Homo sapiens 


iduronate-2-sulfatase 


564 


85 


3492 


X79535 


Homo sapiens 


beta tubulin 


620 


88 


3493 


AF081484 


Homo saoiens 


aloha-tubulin isoform 1 


578 


87 


3494 


U09823 

■ 


Oryctolagus 
cuniculus 


elongation factor 1 alpha 


631 


89 


3495 


AF081484 


Homo saoiens 


aloha-tubulin isoform 1 


616 


90 


3496 


M12140 


Homo saniens 


envelone Drotein 


430 


53 


3497 


003714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 


123 


70 


3498 


W48352 


ffnmn <tfinien<s 

XXUIXXV OClL/lwUd 


Human breast cancer related rtrntein 

XAUAAMUA wlvudl VOUvvl 1 vlHIwU Ul VWlii 

BCFLT1. 


104 


51 

mj X 


3499 


R13556 


Homo saoiens 


Protein encoded downstream of hhc NI 
oncoprotein. 


134 


54 


3500 


Y36156 


Homo saoiens 


Human secreted nrotein #28 


143 


75 


3501 


AFl 13685 


Homo sapiens 


PRO0974 


77 


81 


3502 


G00365 


Worn a <mniftn< 


Human secreted nrotein SFO ID NO - 

1AIXAJACUJ ObvtulbU L/l V IV LLLy U-Lj\£ US I1V/1 

4446 

iff %j • 


157 

X <^ / 


62 


3503 


AF090944 


Homo sapiens 


PRO0663 


164 


81 


3504 


AF090931 


Homo saoieTK 


PRO0483 


103 


70 


3505 


U93570 


Homo sapiens 


p40 


258 


47 


3506 


A R0 16601 


TirfMinrib i 1 n 

alpina 


rvtnchmme c AYidace Qtibimir I 


108 

X w 


82 


3507 


A R0 16601 


T)roQAnh i 1 a 

l~r\ \Jj\J\Jx\ 1 lu 

alpina 


rvtochrome c ovida*^ otihunit 1 

V J IvwiUUulw \« UAlUQdv pUl/IUUb X 


108 


8^ 


3508 


AFl 94537 


Hnmo *ymieni 


NAG13 


449 


78 


3509 


W88627 


Homo canient; 


Secreted nrotein encoded bv pene 04 

clone HPMBQ32. 


153 


60 


3510 


Z70684 


Caenorhabditi «» 

r • 

elegans 


F28D1 8 


102 


37 


3511 


AC008113 


Arabidoosis 

xVJ WlwUUwW 

thaliana 


F12A21 10 

A> A ^*xV-£# X * A V 


90 


48 


3512 


M64791 


Rattus 
norvegicus 


salivary nroline-rich orotein 

MAI ▼ 1M y %0 4k V4UXV A 1>X1 prX V^rlU 


104 


33 


3513 


AF090944 


Homo sapiens 


PRO0663 


115 


52 - 


3514 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 
protein 1 


152 


67 


3515 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded bv cene 90. 


123 


48 


3516 


AF090942 


Homo sapiens 


PRO0657 


128 


47 


3517 


M10546 


Homo sapiens 


cvtochrome oxidase I 


286 


56 


3518 


AF255661 


Crypthecodinium 
cohnii 


Dinapl -interacting protein 5; Dip5 


108 


36 


3519 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


209 


93 


3520 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


38 


3521 


Q02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


131 


64 


3522 


AL359782 


Trypanosoma possible (hhv-6) ul 102, variant a dna, 


105 


42 
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SEQ lb 

NO: 


Accession No. 


Species 


Description 


Smttfa- 

Wa term in ' 

Score 


% 

ldentit 

y 






brucei 


complete virion genome. 






3523 


R95913 


Homo sapiens 


Neural thread protein. 


118 


72 


3524 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 3 63. 


115 


59 


3525 


L27428 


Homo sapiens 


reverse transcriptase 


162 


50 


3526 


Y02749 

■ 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


118 


58 


3527 


AF090930 


Homo sapiens 


•PRO0478 


146 


66 


3528 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4609. 


169 


50 


3529 


AF194537 


Homo sapiens 


NAG13 


119 


88 


3530 


U93564 


Homo sapiens 


putative pi 50 

* * 


234 


84 


3531 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


208 


63 


3532 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


275 


92 


3533 


AF090931 


Homo sapiens 


PRO0483 


126 


66 


3534 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 
product CL6.5.40. 


117 


85 


3535 


G03076 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7157. 


173 


77 


3536 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


117 


45 


3537 


AF209061 


Eubranchipus sp. 


cytochrome c oxidase I 


127 


80 


3538 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


174 


85 


3539 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


118 


45 


3540 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


88 

«. 


50 


3541 


G00403 


Homo sapiens 

m\ mm Mw 4 m*mt mm 1 mmpmmr mt mm# 


Human secreted protein, SEO ID NO: 
4484. 


128 


71 


3542 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/fork head 
homolog 1 1 A 


111 


77 


3543 


G03714 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


149 


69 


3544 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


470 


80 


3545 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


120 


54 


3546 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


106 


42 


3547 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


584 


83 


3548 


B03148 


Homo sapiens 


Human neuronal difFerentiation fector- 
1 (NDF-1). 


635 


95 


3549 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


661 


91 


3550 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


607 


86 


3551 


AK024455 


Homo sapiens 


FLJ00047 protein 


88 


51 


3552 


W88957 


Homo sapiens 


Polypeptide fragment encoded by gene 
128. 


500 


86 


3553 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


228 


95 


3554 


AF130092 


Homo sapiens 


PRO2620 


165 


93 


3555 


AL121845 


Homo sapiens 


dJ583P15.5.1 (novel protein (isoform 

> A ^ 

1)) 


659 

■ 


91 

• 


3556 

• 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


170 


72 


3557 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


599 


88 


3558 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


578 


84 


3559 


AK000385 


Homo sapiens 


unnamed protein product 


138 


47 


3560 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


112 


61 


3561 


X97675 


Homo sapiens 


plakophilin 2b 


123 


65 


3562 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 94 


62 
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> • * 



MA. 


'Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Ideutit 

y 




* 




6595. 






3563 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


198 


90 


3564 


X86029 


Vigna 
unguiculata 


extensin-like protein 


141 


36 

» 


3565 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


77. 


60 


3566 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


110 


94 


3567 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


203 


93 


3568 


Y86248 


Homo sapiens 

■ 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


146 


72 


3569 


S62928 


Homo sapiens 

* 


PRB 1M protein precursor 


102 


44 


3570 


D86853 


Catharanthus 
roseus 


extensin 


123 


40 


3571 


U93574 


Homo sapiens 


putative pi 50 


279 


56 


3572 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 5 1 clone HUKEX85. 


94 


82 


3573 


U45964 , 


Herpesvirus 
papio 


LMP1 


127 


35 


3574 


Y08319 


Homo sapiens 


kinesin-2 


254 


55 


3575 


X97675 


Homo sapiens 


plakophilin 2b 


134 


54 


3576 


AF003151 


Caenorhabditis 
elegans 


contains similarity to an RNA 
recognition motif 


137 


43 


3577 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


126 


66 


3578 


U80447 


Caenorhabditis 
eie&ans 


coded for by C. elegans cDNA 
vkl 87f6.5: coded for bv C. elegans 
cDNA ykl 87f6.3; coded for by C. 
elegans cDNA ykl46fl 1 .5 


117 

■ 


50 


3579 


U23172 

■ 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24 HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


126 

• 


31 


3580 


AJ249395 


Globodera 
pallida 


NADH-ubiquinone oxidoreductase 
subunit 4 


85 


33 


3581 


U091 16 


Homo sapiens 


ORF 1 , encodes a 40 kDa product 


139 


55 


3582 


X97675 


Homo sapiens 


plakophilin 2b 


125 


73 


3583 


L26953 


Homo sapiens 


chromosomal protein 


101 


55 


3584 


S80343 


Homo sapiens 


arginyl-tRNA synthetase, ArgRS 


no 


67 


3585 


X97675 


Homo sapiens 


plakophilin 2b 


116 


84 


3586 


X92485 

■ 


Plasmodium 
vivax 


pval 


124 


50 


3587 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


56 


3588 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


136 


56 


3589 


U63542 


Homo sapiens 


FAP protein 


128 


75 


3590 


D38112- 


Homo sapiens 


cytochrome c oxidase subunit 3 


642 


88 


3591 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


612 


87 


3592 


K02401 


Homo sapiens 


chorionic somatomammotropin 


576 


86 


3593 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


108 


40 


3594 


V00662 


Homo sapiens 


cytochrome oxidase I 


459 


76 


3595 


AC008262 


Arabidopsis 
thaliana 


F4N2.10 


136 


31 


3596 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


126 


52 



223 



NO: 


s Accession No. 


Species 

* 


Description 


Smith- 
Waterman 
Score 


Merit | 

y 


3597 


AF1 16645 


Homo sapiens 


PRO 1708 


623 


91 


3598 

* 


M2302a 


Human 
herpesvirus 4 


nuclear antigen precursor 

- 


121 


35 


3599 


W88816 


Homo sapiens 


Polypeptide fragment encoded by gene 
58. 


100 

• 


33 


3600 


X92485 


Plasmodium 
vivax 


pval 


156 


36 


3601 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


78 


3602 


AK000496 


Homo sapiens 


unnamed protein product 


150 


68 


3603 


AF083929 


Mus musculus 


ES18 


98 


40 


3604 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


537 


83 


3605 


U12690 


Homo sapiens 


cytochrome oxidase subunit n 


548 


85 


3606 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


158 

• 


57 


3607 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


429 


75 


3608 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


122 


65 


3609 


Y86248 

1 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


88 


3610 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


224 


80 


3611 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


481 


70 


3612 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


166 


92 


3613 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


582 


83 


3614 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


167 


91 


3615 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


582 


77 


3616 

■ 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIRiJH0162) 


101 

* 


40 


3617 


D38112 


Homo sapiens 


cytochrome c oxidase subunit. 1 


591 


85 


3618 


AB037275 


Cynomolgus 
Epstein-Barr 
Virus TsB-B6 


EBNA-1 


119 


42 


3619 


AF061944 


Homo sapiens 


kinase deficient protein KDP 


581 


91 


3620 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 
reactive fragment (residues 69-226). 


100 


33 


3621 


AL049608 


Arabidopsis 
thaliana 


extensin-Iike protein 


105 


27 


3622 


AP000616 

• 


Oryza sativa 

• 


similar to RING-H2 finger protein 
RHAla(AF078683) 


127 

• 


69 


3623 


S71569 


Neocallimastix 
patritiarum, 
Peptide, 860 aa 


Xylanase B, XYLB {EC 3.2.1.8} 

■ 

• 


108 

■ 


43 


3624 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24 JIUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


101 

• 


37 

• 


3625 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


390 


83 


3626 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


582 


84 


3627 


D38312 


Homo sapiens 


cytochrome c oxidase subunit 3 


637 


90 


3628 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


169 


39 


3629 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


142 


77 


3630 


U93564 


Homo sapiens 


putative pi 50 


330 


90 


3631 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


42 



224 



CPA V¥\ 

NO: 


* Accession No. 


bpectes 


vesenpnou 


OlUllD- 

Watmnao 
Score 


IV 

Ideotit 

y 


3632 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID WO: 
6613. 


100 


80 


3633 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6441. . 


139 


65 


3634 


U74071 


Phascolosoma 
sp. 'California* 


cytochrome c oxidase subunit I 


248 


72 


3635 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


121 

• 


87 


3636 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


128 


65 


3637 


Y0267I 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


124 


63 


3638 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


159 


51 


3639 


X65582 


Mus musculus 


alpha-2 collagen 


102 


53 


3640 


AF090942 


Homo sapiens 


PRO0657 


97 


76 


3641 


W88627 

• 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


146 


69 


3642 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


118 


55 


3643 


X14576 


Murine leukemia 
virus 


gagpl5 protein 


134 


44 


3644 


AF1 30051 


Homo sapiens 


PRO0898 


177 


48 


3646 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7145. 


120 


59 


3647 


AK000385 


Homo sapiens 


unnamed protein product 


105 


40 


3648 


V00662 


Homo sapiens 


cytochrome oxidase I 


533 


83 


3649 


AF090944 


Homo sapiens 


PRO0663 


129 


73 


3650 


AF090944 


Homo sapiens 


PRO0663 


129 


73 


3651 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


88 


60 


3652 


AF090930 


Homo sapiens 


PRO0478 


92 


57 


3653 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


148 


73 


3654 


ABO 18440 


Echinococcus 
multilocularis 


NADH dehydrogenase subunit 2 


100 


32 


3655 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


165 


68 


3656 


AF161356 


Homo sapiens 


HSPC093 


106 


48 


3657 


AF090930 


Homo sapiens 


PRO0478 


158 


85 


3658 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


128 


82 


3659 


U93574 


Homo sapiens 


putative pi 50 


117 


44 


3660 


Y 02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


149 


51 


3661 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


67 


70 


3662 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


99 


62 


3663 

- 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


154 


45 


3664 


Y27571 


Homo sapiens 


Human secreted protein encoded by 
gene No. 5. 


136 


77 


3665 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


128 


59 


3666 


U47855 


Araneus 
diadematus 


fibroin-3 


109 


39 


3667 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


604 


88 


3668 


K02401 


Homo sapiens | 


chorionic somatomammotropin 


636 


93 



225 



NO: 


Accession wo. 


species 


j/esenpoon 


OullUr 

Waterman 
Score 


so 

Identit 

y 


3669 


AF083929 


Mus museums 


ES18 


133 


46 


3670 


J01415 


Homo sapiens 


MTND4 


620 


87 


3671 


AK025047 


Homo sapiens 


unnamed protein product 


160 


60 


3672 


G03793 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7874. 


106 


31 


3673 


AB019038 


Homo sapiens 


beta-1,4 mannosyltransferase 


766 


94 


3674 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/forkhead 
homolog 1 1 A 


113 


69 


3675 


U80447 

* 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
ykl 87f6.5; coded for by C. elegans 
cDNA ykl87f63; coded for by C. 
elegans cDNA ykl46fl 1.5 


113 


39 


3676 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


197 


68 


3677 


U52077 


Homo sapiens 


mariner transposase 


500 


74 


3678 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


174 


97 


3679 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


191 


86 


3680 


AC003113 


Arabidopsis 
thaliana 


F2401.6 


102 


83 - 


3681 


AF005370 


Alcelaphine 
herpesvirus 1 


putative immediate early protein 

* 


153 


42 


3682 


G01502 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5583. 




62 


3683 


AP002543 


Arabidopsis 

* 

thaliana 


gb|AAD23015.1~gene id;F15M7.16~si 
milar to unknown protein 


131 


40 


3684 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


146 


64 


3685 


U93572 


Homo sapiens 


putative pi 50 


158 


43 


3686 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 
reactive fragment (residues 69-226). 


98 


43 


3687 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


106 


54 


3688 


L26953 


Homo sapiens 


chromosomal protein 


128 


76 


3689 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


563 


87 


3690 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


109 


67 


3691 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163 . 


122 

■ 


75 


3692 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


165 


85 


3693 


M20789 


Homo sapiens 


alpha- 1 type I collagen 


141 


43 


3694 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


162 


50 


3695 


AF210651 


Homo sapiens 


NAG 18 


124 


67 


3696 


AF130079 


Homo sapiens 


PR02852 


94 


53 


3697 


Y19192 


Talpa europaea 


cytochrome oxidase subunit I 


618 


85 


3698 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


414 


86 


3699 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


487 


76 


3700 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


109 


42 


3701 


AF217374 


Acanth aster 
planci 


cytochrome oxidase subunit I 


563 


85 


3702 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


106 


48 


3703 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


522 


79 


3704 


D67066 


Bos taurus 


N-WASP 


219 


42 


3705 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


51 


3706 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


44 
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y 








7879. 






3707 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


105 


37 


3708 


G03790 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7871. 


125— 


74 


3709 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


558 


88 


3710 


AF217374 


Acantbaster. 
planci 


cytochrome oxidase subunit ] 


512 


*5 


37J1 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


186 


80 


3712 


AF197854 : 


Melithreptus 
lunatus 


cytochrome oxidase I 


189 


83 


3713 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


129 


95 


3714 


AB029027 


Homo sapiens 


KIAA1 104 protein 


141 


81 


3715 


L26251 

« 


Trypanosoma 
brucei 


CR5 

■ 


73 


31 


3716 


R95913 


Homo sapiens 


Neural thread protein. 


117 


44 


3717 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


95 


67 


3718 


Y36112 , 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


108 


81 

• 


3719 


R07057 

* 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct 


132 


55 


3720 


M64923 


Bos taurus 


C10 protein 


194 


94 


3721 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


115 


75 


3722 


G03043 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


118 


95 

• 


3723 


AF302773 


Homo sapiens 


ninein-Lm isoform 


157 


57 


3724 


AK024455 


Homo sapiens 


FLJ00047 protein 


134 


59 


3725 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


161 


80 


3726 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


121 


52 


3727 


AF220264 


Homo sapiens 


MOST-1 


86 


88 


3728 


L27428 


Homo sapiens 


reverse transcriptase 


316 


62 


3729 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


112 


46 


3730 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


145 


68 


3731 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


118 


56 


3732 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


131 


65 


3733 


AF090930 


Homo sapiens 


PRO0478 


148 


71 


3734 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW 18. 


140 


72 


3735 


U93569 


Homo sapiens 


putative pi 50 


270 


59 


3736 


U93568 


Homo sapiens 


putative pi 50 


151 


38 


3737 


G02493 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6574. 


124 


82 


3738 


X92485 


Plasmodium 
vivax 


pval 


101 


44 


3739 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


146 


51 


3740 


AF090944 


Homo sapiens 


PRO0663 


124 


56 


3741 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


94 


47 


3742 


AF 118086 


Homo sapiens 


PR01992 


124 


61 


3743 


AF083929 


Mus musculus 


ES18 


108 


35 


3744 


L34685 


Arabidopsis 


cell wall protein 


120 


34 
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SEQ ID 
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Species 
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• 
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% 

Identit ' 

Mil mm %M m> 

y 






thaliana 




» 




3745 


AF090931 


Homo sapiens 


PRO0483 


117 


54 


3746 


AF090930 


Homo sapiens 


PRO0478 


136 


75 


3747 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


154 


59 


3748 


G00368 


Homo sapiens 


Human secreted protein, SEQ ED NO: . 
4449. 


92 


44 


3749 


AP000616 . 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


98 


42 


3750 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


165 


73 


3751 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


113 


87 


3752 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


58 


3753 


U93574 


Homo sapiens 


putative pi 50 


157 


60 


3754 


X56015 

i 


Crithidia 
oncopelti 

* 


NADH dehydrogenase subunit 5 


103 


30 


3755 


M80613 


Homo sapiens . 


putative 


114 


26 


3756 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


119 


54 


3757 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


210 


89 


3758 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


91 


58 


3759 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 

w r x ..... . w * - 


no 


46 


3760 


M13100 


Rattus 
norvegicus 


unknown protein 


119 


46 


3761 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


135 


52 


3762 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 1 


84 


3763 


Ml 1900 


Mus musculus 


15-kDa proline-rich salivary protein 


125 


40 


3764 


U63542 


Homo sapiens 


FAP protein 


131 


72 


3765 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO 
4493. 


143 


76 


3766 


X92485 


Plasmodium 
vivax 


pval 


119 


54 


3767 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 

* .... 


81 


56 


3768 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


98 


65 


3769 


AB005216 


Homo sapiens 


Nek, Ash and phospholipase C gamma- 
binding protein NAP4 


487 


90 


3770 


AF02621 1 


Caenorhabditis 
elegans 


Similar to cuticular collagen 

■ 


102 


37 


3771 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


91 


55 


3772 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


128 


76 


3773 


AK024455 


Homo sapiens 


FLJ00047 protein 


138 


66 


3774 


AF090895 


Homo sapiens 


PRO0117 


152 


60 


3775 


AF 130051 


Homo sapiens 


PRO0898 


122 


78 


3776 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


100 


63 


3777 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


187 


75 


3778 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


130 


56 
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> 
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% 

Identit 

y 


3779 


X55685 


Lycopersicon 
esculentum 


extensin (class 0 


132 

• 


38 


3780 


L27428 


Homo sapiens 


reverse transcriptase 


116 


53 


3781 


Y71066 


Homo sapiens 


Human membrane transport protein, 
MTRP-11. 


188 


86 


3782 


AL390114 

■ 


Leishmania 
major 


extremely cysteine/valine rich protein 


122 

- 


67 


3783 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


462 


75 


3784 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO:' 
4414. 


127 


67 


3785 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


129 


36 


3786 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


152 


78 


3787 


S80119 


Rattus sp. 


reverse transcriptase homolog 


143 


40 


3788 


AF090942 


Homo sapiens 


PRO0657 


134 


67 


3789 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


63 


3790 


AF1 16661 


Homo sapiens 

* 


PR01438 


128 


45 


3791 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


130 


38 


3792 


G00403 , 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


127 


74 


3793 


W50192 

■ 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 

* 


101 


48 


3794 


AF000298 


Caenorhabdhis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


175 


46 


3795 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


117 


40 


3796 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


185 


88 


3797 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


610 


84 


3798 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


156 


56 


3799 


AL359782 


Trypanosoma, 
brucei 


probable similar to ring-h2 ringer 
protein rhala. 


71 


52 


3800 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


127 


77 


3801 


AF 130079 


Homo sapiens 


PR02852 


164 


54 


3802 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


118 


45 


3803 


U42580 


Paramecium 
bursaria 

Chiorella virus 1 


Pro-rich, PAPK (20X); simUar to 
Arabidopsis anter-specific Pro-rich 
protein, corresponds to Swiss-Prot 
Accession Number P40602 


101 


36 


3804 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


173 


87 


3805 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


126 


72 


3806 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


225 


85 


3807 


U88587 


Nicotiana alata 


120 kDa style glycoprotein 


118 


38 


3808 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 . 


129 


82 


3809 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


67 


3810 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


213 


97 


3811 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


106 


63 


3812 


AF1 69974 


Homo sapiens 


serine raceraase 


153 


56 


3813 


L27428 


Homo sapiens 


reverse transcriptase 


175 


43 


3814 


W50193 


Homo sapiens 


Amino acid sequence of salivary . 
protein CON-2. 


109 

• 


43 
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Smith- 

Waterman 

Score 


% 

Identit 

y 


3815 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


176 


78 


3816 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


112 


65 


3817 


AFI 19901 


Homo sapiens 


PR0283 1 


113 


55 


3818 


W48351 

* 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


99 


89 


3819 


AL1 60371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


91 


66 


3820 


X92485 


Plasmodium 
vivax 


pval 


108 


46 


3821 


AK024455 


Homo sapiens 


FLJ00047 protein 


79 


59 


3822 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


178 


87 


3823 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


99 


50 


3824 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCS 1 1 037. 


86 


94 


3825 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


138 


53 

. 


3826 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


88 


3827 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


77 


61 


3828 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


139 

» 


51 


3829 


K03208 


Homo sapiens 


salivary proline-rich protein precursor 


161 


41 


3830 


U44838 . 


Glycine max 


extensin 


108 


47 


3831 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


136 


71 


3832 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


205 


84 


3833 


AL163912 


Arabidopsis 
thaliana 


glycine-rich protein atGRP-7 


117 

■ 


37 


3834 


AFI 30089 


Homo sapiens 


PRO2550 


138 


45 


3835 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


92 


76 


3836 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


70 


3837 


X56010 


Sorghum bicolor 


hydroxyproline-rich glycoprotein 


98 


37 


3838 


X68249 


Xenopus laevis 


Proline rich protein 


92 


66 


3839 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


262 


92 


3840 


, R63235 . 


Homo sapiens 


CNS neural thread protein HB4. 


186 


100 


3841 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


103 


64 


3842 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


116 


62 


3843 


AP164614 


Homo sapiens 


envelope protein 


508 


77 


3844 


AFI 30051 


Homo sapiens 


PRO0898 


130 


73 


3845 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


144 


74 


3846 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


119 


68 


3847 


M13100 


Rattus 
norvegicus 


unknown protein 


129 


46 


3848 


X97675 


Homo sapiens 


plakophilin 2b 


145 


67 


3849 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


141 


65 


3850 


K03206 


Homo sapiens 


salivary proline-rich protein precursor 


114 


42 
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% 

Ideotit 
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3851 


U93574 


Homo sapiens 


putative pi 50 


165 


82 


3852 


AF090931 


Homo sapiens 


PRO0483 


97 


78 


3853 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


279 


80 


3854 


U93570 


Homo sapiens 


putative pi 50 


135 


38 


3855 


M13100 


Rattus 
norvegicus 


unknown protein 


100 


57 


3856 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


166 


63 


3857 


Y48576 


Homo sapiens 


Human breast tumour-associated 
protein 37. 


117 


49 


3858 


G03807 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7888. 


147 


45 


3859 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


150 


71 


3860 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. . 


165 


69 


3861 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


131 


83 


3862 


Y02671 , 


Homo sapiens 

• 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


171 


81 


3863 


AK00Q241 


Homo sapiens 


unnamed protein product 


100 


48 


3864 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


121 

> 


80 


3865 


AK024455 


Homo sapiens 


FLJ00047 protein 


124 


69 


3866 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


87 


64 


3867 


AF1 16712 


Homo sapiens 


PR02738 


92 


48 


3S6S 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


165 


62 


3869 


G00416 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4497. 


128 


62 


3870 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


94 


75 


3871 


AK024455 


Homo sapiens 


FLJ00047 protein 


83 


72 


3872 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


147 


44 


3873 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


113 


51 


3874 


AF090930 


Homo sapiens 


PRO0478 


120 


63 


3875 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


145 


70 


3876 


D88674 


Homo sapiens 


antizyme inhibitor 


147 


75 


3877 


AF263744 


Homo sapiens 


erbb2-interacting protein ERBIN 


212 


37 


3878 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


140 


82 


3879 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


82 


35 


3880 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


144 


60 

• 


3881 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


122 


60 


3882 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


134 


75 


3883 


M81321 


Macaca 
fascicularis 


proline-rich protein 


101 


41 


3884 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


109 


56 


3885 


AF013214 


Bos taurus 


acidic ribosomal phosphoproteinPO 


177 


94 • 


3886 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


77 
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•54 

Identit 

y 








4484. 






3887 


Y59860 


Homo sapiens 


Human normal uterus tissue derived 
protein 23. 


137 


72 


3888 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


147 


72 


3889 


M64792 


Rattus b 
norvegicus 


salivary proline-rich protein 


104 


40 


3890 

* 


AF156228 

. ■ 


Drosophila 
melanogaster 


salivary gland secretion protein 


104 


35 

• 


3891 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


90 


47 


3892 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


138 


38 


3893 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


146 


65 


3894 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


34 


3895 


X02873 


Daucus carota 


put. precursor 


104 


48 


3896 


AF090942 


Homo sapiens 


PRO0657 


135 


50 


3897 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


195 


77 


3898 


X92485 


.Plasmodium 
vivax 


pval 


111 


62 

■ 


3899 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


107 


70 


3900 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


94 


93 


3901 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


163 


83 


3902 


AF090942 


Homo sapiens 


PRO0657 


134 


65 


3903 


AF220264 


Homo sapiens 


MOST-1 


74 


59 


3904 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


131 


53 


3905 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


129 


85 


3906 


AK025047 


Homo sapiens 


unnamed protein product 


127 


73 


3907 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


135 


86 


3908 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


134 


72 


3909 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


104 


62 


3910 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6735. 


114 


70 


3911 


AK024455 


Homo sapiens 


FLJ00047 protein 


135 


61 


3912 


U93567 


Homo sapiens 


putative pi 50 


233 


63 


3913 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


96 


40 


3914 


X94976 


Brassica napus 


cell wall-plasma membrane linker 
protein 


104 


36 


3915 


P92219 


Homo sapiens 
(human) 


CRI protein. 


125 


73 


3916 


X92485 


Plasmodium 
vivax 


pval 


94 


64 


3917 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 


52 


3918 


AF118086 


Homo sapiens 


PR01992 


130 


87 


3919 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


80 


3920 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


79 
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3921 


AF090930 


Homo sapiens 


PRO0478 


128 


64 


3922 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


75 


3923 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


130 


65 


3924 . 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


134 

■ 


71 


3925 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


102 


58 


3926 


D63487 


Homo sapiens 


The KIAA0153 gene product is related 
to a putative C.elegans gene encoded in 
cosmid F42A8. 


141 


100 


3927 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


117 


37 


3928 


X58438 


Mus musculus 


proline rich protein 


136 


36 


3929 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


152 


71 


3930 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


132 


73 


3931 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


86 


48 


3932 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


110 


65 


3933 


X92485 


Plasmodium 
vivax 


pval 


116 

i ... 


36 


3934 


X97675 


Homo sapiens 


plakophilin 2b 


129 


82 


3935 


G03258 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


84 


58 


3937 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


107 


56 


3938 


AF090895 


Homo sapiens 


PRO0117 


166 


68 


3939 


K02550 


Oncorhynchus 
raykiss 


70-kilodalton heat shock protein 


104 


38 


3940 


AK024455 


Homo sapiens 


FU00047 protein 


112 


74 


3941 


L26953 


Homo sapiens 


chromosomal protein 


92 


67 


3942 


U63542 


Homo sapiens 


FAP protein 


135 


75 


3943 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


139 


80 


3944 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


141 


67 


3945 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


76 


3946 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


92 


52 


3947 


W48927 


Homo sapiens 


Schwannomin-binding protein O- 
terminal fragment. 


103 


56 


3948 


AF252293 


Homo sapiens 


PAR3 


161 


44 


3949 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


125 


47 


3950 


Y86248 

i 

•> 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


128 


59 


3951 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


174 


87 


3952 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRBZ 


152 


75 


3953 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


168 


64 


3954 


R95913 


Homo sapiens 


Neural thread protein. 


110 


54 


3955 


AF130089 


Homo sapiens 


PRO2550 


122 


63 


3956 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


48 
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% 

Ideotit 

y 


3957 


Y36203 


Homo sapiens 


Human secreted protein #75. 


121 


75 


3958 


Y86248 


Homo sapiens 


Human secreted" protein HCHPF68, 

• * 

SEQ ID NO: 163. 


177 


61 


3959 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


131 


69 


3960 


U93563 


Homo sapiens 


putative pi 50 


227 


62 


3961 


D86853 


Catharanthus 
roseus 


extensin 


100 


40 


3962 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


101 


41 


3963 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


107 

* 


80 


3964 


G00397 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


121 


42 


3965 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


66 


3966 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


101 


34 


3967 


AF229126 . 


Homo sapiens 


acetylcholinesterase collagen-like tail 
subunit isofbrm VII 


108 


36 


3968 


AJ006770 


Cicer arietinum 


extensin 


86 


29 


3969 


AF 130089 


Homo sapiens 


PRO2550 


128 


82 


3970 


U93563 


Homo sapiens 


putative pi 50 


99 


48 


3971 


AK000496 


Homo sapiens 


unnamed protein product 


134 


71 


3972 


Z70684 


Caenorhabdhis 
elegans 


F28D1.8 


108 


42 


3973 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


137 


72 


3974 


AK024455 


Homo sapiens 


FLJ00047 protein 


132 


65 


3975 


AK024455 


Homo sapiens 


FLJ00047 protein 


128 


85 


3976 


X01918 


Drosophila 
melanogaster 


salivary gland glue protein 


99 


40 


3977 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


136 


46 


3978 


AF229067 


Homo sapiens 


PADI-H protein 


107 


87 


3979 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


189 


48 


3980 


S80119 


Rattus sp. 


reverse transcriptase homolog 


134 


49 


3981 


AF090901 


Homo sapiens 


PRO0195 


106 


94 


3982 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


111 


42 


3983 

■ 


X92485 

• 


Plasmodium 
vivax 


pval 


102 


39 


3984 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


127 


64 


3985 


AB012162 


Homo sapiens 


APCL protein 


190 


42 


3986 


AFO 16099 


Mus muscuius 


endonuc lease/reverse transcriptase 


139 


65 


3987 


U93570 


Homo sapiens 


putative pl50 


170 


71 


3988 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


202 


73 


3989 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


244 


85 


3990 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


420 


92 


3991 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


272 


85 


3992 


L38908 


Nicotiana 
tabacum 


extensin 


114 


'A f\ 

40 


3993 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


64 


3994 


AF090944 


Homo sapiens 


PRO0663 


99 


,39 


3995 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


150 


75 


3996 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


574 


87 


3997 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


488 


76 ' 
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% 
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3998 


U52077 


Homo sapiens 


manner transposase 


475 


70 


3999 


U93570 


Homo sapiens 


p40 


ill 


34 


4000 


U15647 


Mus musculus 


reverse transcriptase 


137 


43 


4001 


AF1 16712 


Homo sapiens 


PR02738 


105 *~ 


52 


4002 


AF1 13685 


Homo sapiens 


PRO0974 


125 


53 


4003 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


78 


4004 


AF1 16715 


Homo sapiens 


PR02829 


115 


73 


4005 


D38112 


Homo sapiens 


NADH dehydrogenase summit 2 


288 


80 


4006 


D38112 


Homo sapiens 


cytochrome c oxidase summit 1 


614 


86 


4007 


G03787 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7868. 


106 


45 


4008 


U90268 


Homo sapiens 


Kritl 


152 


52 


4009 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


472 


96 


4010 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1(NDF-1). 


304 


100 


4011 


B03148 

i 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


472 


98 


4012 


W50193 


Homo sapiens 

ft 


Amino acid sequence of salivary 
protein CON-2. 


102 

■ 


50 

. 


4013 


G00262 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
4343. 


124 


73 

• 


4014 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


118 


47 


4015 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


522 


81 


4016 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


147 

i» 


52 


4017 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


57 


4018 


AB003476 


Homo sapiens 


gravin 


549 


91 


4019 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


391 


93 


4020 


U44949 


Xenopus iaevis 


zona pellucida A glycoprotein homolog 


143 


32 


4021 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


571 


88 


4022 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit I 


554 


87 


4023 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


165 


88 


4024 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


131 


38 


4025 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


583 


86 


4026 


AF090895 


Homo sapiens 


PRO0117 


104 


49 


4027 


AFI30056 


Homo sapiens 


PRO 1 367 


80 


60 


4028 


X97675 


Homo sapiens 


plakophilin 2b 


157 


83 


4029 


X67337 


Homo sapiens 

* 


Human pre-mRNA cleavage factor 1 68 
kDa subunit 


114 


35 


4030 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


110 


74 


4031 


D38112 


Homo sapiens 


ATPase subunit 6 


395 


80 


4032 


S70718 

* 


Hemicentrotus 
pulcherrimus=se 
a urchins, tests, 
Peptide, 632 aa 


fibrillar collagen alpha 120 and 140 
chains 


104 


33 


4033 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


355 


66 


4034 


AC004497 


Homo sapiens 


MX2 


464 


58 


4035 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


151 


61 


4036 


AF1 17888 


Homo sapiens 


myosin-IXa 


113 


34 


4038 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


100 


48 


4039 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


153 


73 
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MA. 
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* 
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Smith- 

waicruiQQ 

Score 


% 

10CDDI 

y 


4040 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


338 


86 


4041 


AF1 16661 


Homo sapiens 


PR01438 


136 


53 


4042 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


120 

• 


84 


4043 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 

A* 


493 


85 


4044 


M90516 


Homo sapiens 


ghitamine:rructose-6-phosphate 
amidotransferase 


261 


72 


4045 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


102 


64 


4046 


AF1 30089 


Homo sapiens 


PRO2550 


143 


78 


4047 


AF090942 


Homo sapiens 


PRO0657 


119 


41 


4048 


R95913 


Homo sapiens 


Neural thread protein. 


138 


47 


4049 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


111 


59 


4050 


X77962 


Tetrahymena '. 
thennophila 


fibrillarin 


105 


49 


4051 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

^ m A>^^A«A ^^AA *m^^ A A AA AP" A ^» A A A m A A » • 

4419. 


144 


71 


4052 


X02794 


Friend murine 

^» a A^^A*»^^» »«AA^A A aa^^ 

leukemia virus 


Pr65 


107 


37 


4053 


Y00994 , 


Homo sapiens 


Human CSR3 protein sequenced 


109 


38 


4054 


AL 12 1585 


Homo sapiens 


DA504H3. 1 CSNX5 (sorting nexin 5)) 

^» A Ai Ar ~ A> A>^^ ■ A ■ Ajr A ▼ A A> A^ • Mw A vaaaaa A*^^A A A A A A^ MM 


299 


74 


4055 


V00662 


Homo sapiens 

^» A> AAA AV^MAjA' A»»^ 


URF 1 (NADH dehydrogenase subunit) 


314 


80 


4056 


G03438 


> Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519 

• A^ A A* V < 


65 


54 


4057 


Y86248 


Homo sapiens 1 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


67 


4058 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


202 


89 


4059 


Y86248 

a> Waa^T %J 


Homo saoiens 


Human secreted orotein HCHPF68^ 
SEQ ID NO: 163. 


187 


100 


4060 


U63542 


Homo saoiens 


FAP orotein 


142 

A ¥ A* 


73 

t AW 


4061 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 

9 \J A^ A* A 


158 


50 


4062 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO: 128. 

A ' * A Al A* • 


123 


44 


4063 


G00361 


Homo saoiens 


Human secreted Drotein. SEO ID NO: 
4442. 


76 ~* 


65 


4064 


AF000298 


Caenorhabditis 

aa W aA^^a an a Aa<W ^A p »<V 

elegans 


weak simiiaritv to colla&ens: elvcine- 
and proline-rich 


165 


46 


4065 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


147 


93 


4066 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


76 


4067 


AB035523 


Gallus gallus 


avenalU 


101 


48 


4068 


AC079829 


Arabidopsis 
thaliana 


Pto kinase interactor, putative 


105 


43 


4069 


AF21065I 


Homo sapiens 


NAG18 


97 


77 


4070 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


111 


43 


4071 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


158 


65 


4072 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


43 


4073 


Af 03 8007 


Homo sapiens 


FIC1 


153 


96 


4074 


M33228 


Trypanosoma 
brucei 


ATPase6 


103 


37 


4075 


U54788 


Mus museums 


Wiskott-Aldrich Syndrome Protein 


119 


46 


4076 


M81321 


Macaca 
rascicularis 


proline-rich protein 


104 


36 


4077 


X52235 


Homo sapiens 


ORFII 


192 


39 


4078 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


83 
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4079 


AB026542 


Homo sapiens 


WASP-family protein 


98 


41 


4080 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


105 


42 


4081. 


AF130051 


Homo sapiens 


PRO0898 


117 


42 


4082 


AF 130089 


Homo sapiens 


PRO2550 


85 


79 


4083 


G03807 


Homo sapiens 


Human, secreted protein, SEQ ID NO: 
7888. 


117 


48 


4084 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON- 1. 


89 


40 


4085 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


163 


80 


4086 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


144 


74 


4087 


AF085691 


Homo sapiens 


multidrug resistance-associated protein 
3A 


197 


49 


4088 


G02490 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
657 1. 


109 


62 

* 


4089 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


139 


51 


4090 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


103 


65 


4091 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: . 
4718. 


159 


68 


4092 


X92485 


Plasmodium 
vivax 


pval 


95 


38 


4093 


U93569 


Homo sapiens 


p40 


205 


40 


4094 


L16864 


African swine 
fever virus 


cd2 homologue 


98 


45 


4095 


X71413 


Homo sapiens 


ELE1 


675 


98 


4096 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


149 


60 


4097 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


116 


70 


4098 


AF090930 


Homo sapiens 


PRO0478 


149 


78 


4099 


AF090942 


Homo sapiens 


PRO0657 


124 


56 


4100 

• 


G03564 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


139 


62 


4101 


L26953 


Homo sapiens 


chromosomal protein 


104 


54 


4102 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


128 


70 


4103 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


94 


58 


4104 


U73199 


Mus muscuius 


Rho-guanine nucleotide exchange 
factor 


370 


56 


4105 


AF090931 


Homo sapiens 


PRO0483 


104 


90 


4106 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


133 


72 


4107 


AF1 18082 


Homo sapiens 


PRO1902 


145 


49 


4108 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


116 


50 


4109 


U93564 


Homo sapiens 


p40 


539 


91 


4110 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


144 


54 


4111 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


138 


65 


4112 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


91 


73 


4113 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


120 


36 


4114 


Ml 7522 


Paracoccus 


cytochrome cl precursor (EC 1.10.2.2) 


101 


41 
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4115 


G03714 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7795. 


121 

. 


59 


4116 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 1 
6613. 


148 


73 


4117 


G01931 

+ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


80 


85 


4118 


X04758 


Homo sapiens 


pro- alpha (V)collagen (A A 1099) 


106 


39 


4119 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


122 


47 


4120 


AF130089 


Homo sapiens 


PRO2550 


132 


68 


4121 


AF090942 


Homo sapiens 


PRO0657 


136 


48 


4122 


Y01400 


rHomo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


115 

• 


66 


4123 


W50193 


Homo sapiens 

• 


Amino acid sequence of salivary 
protein CON-2. 


72 


48 


4124 


X92485 


Plasmodium 
vivax 


pval 

* 


104 


54 


4125 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


59 


4126 


AK024455 


Homo sapiens 


FLJ00047 protein 


86 


61 


4127 


AF1 19855 


Homo sapiens 


PR01847 


99 


68 


4128 


S80119 


Rattus sp. 


reverse transcriptase homolog 


129 


39 


4129 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


156 


62 


4130 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


135 


72 


4131 


R86406 


Homo sapiens 


Human matrix metalloprotease 
MMPmla. 


108 


83 


4132 


L16461 


Chlamydomonas 
reinhardtii 


structural wall protein 
• 


87 


37 


4133 


Y30713 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


232 


95 


4134 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


121 


69 


4135 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


135 


70 

■ 


4136 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


134 


42 


4137 


Y15173 


Human 

papillomavirus 
type 75 


E4 protein 

• 


101 

■ 


38 


4138 


AF1 30089 


Homo sapiens 


PRO2550 


130 


35 


4139 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:I63. 


111 


38 


4140 


Y36203 


Homo sapiens 


Human secreted protein #75. 


120 


48 


4141 


AF130089 


Homo sapiens 


PRO2550 


139 


36 


4142 


S80119 


Rattus sp. 


reverse transcriptase homolog 


170 


47 


4143 


U54636 


Staphylococcus 
aureus 


protein A 


135 


35 


4144 


AL390114 

T 


Leishmania 
major 


extremely cysteine/valine rich protein 


157 


55 


4145 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


112 


54 


4146 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


117 


37 


4147 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct 


98 


61 


4148 


AF130051 


Homo sapiens 


PRO0898 


124 


88 


4149 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


67 
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4150 


Z29481 


Homo sapiens 


3-hydroxyanthranilic acid dioxygenase 


244 


62 


4151 


M81321 


Macaca 
fascicularis 


proline-ricb protein 


172 


50 


4152 


G00416 , 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


129 


66 


4153 


AF1 19851 


Homo sapiens 


PR01722 


128 


62 


4154 


AB037826 


Homo sapiens 


KIAA1405 protein 


194 


60 


4155 


M12099 


Mus musculus 


proline-rich protein 


124 


37 


4156 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO:' 
7795. 


122 


46 


4157 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


124 


41 


4158 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


134 


83 


4159 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


141 


74 


4160 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


135 


39 


4161 


U35730 , 


Mus musculus 


jerky 


116 


31 


4162 


281525 


Caenorfaabditis 
eJegans 


contains similarity to Pfam domain: 
PF01391 (Collagen triple helix repeat 
(20 copies)), Score=35.5, E-value=4e- 
07, N=2 


130 


50 


4163 


AF1 13685 


Homo sapiens 


PRO0974 


129 


45 


4164 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


168 


71 


4165 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


137 


70 


4166 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


175 


72 


4167 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


86 


45 


4168 


AB000505 


Daucus carota 


unnamed protein product 


101 


43 


4169 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


113 


30 


4170 


AF 119900 


Homo sapiens 


PR02822 


154 


59 


4171 


AE001381 


Plasmodium 
falciparum 


hypothetical protein 


106 


33 


4172 


AE000034 

• 


Mycoplasma 
pneumoniae 


Afunctional threonine dehydrogenase; 
galactosyltransferase 


100 


28 


4173 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


100 


47 


4174 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


142 


68 


4175 


AK024455 


Homo sapiens 


FLJ00047 protein 


116 


60 


4176 


Mi 1759 

. 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


94 


44 


4177 


AK024455 


Homo sapiens 


FLJ00047 protein 


102 


59 


4178 


AY007557 

■ 


Mycobacterium 
avium subsp. 
paratuberculosis 


fibronecun-attachrnent protein FAP-P 


98 

■ 


42 


4179 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


115 


69 


4180 


M12100 


Mus musculus 


proline-rich protein MP-3 


132 


44 


4181 


AF090930 


Homo sapiens 


PRO0478 


138 


63 


4182 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


152 


75 


4183 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


184 


*4 
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4184 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


150 


43 


4185 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


128 

- 


51 


4186 


AF090944 


Homo sapiens 


PRO0663 


124 


49 


4187 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


93 

* * 


62 


4188 


D38112 


Homo sapiens 


ATPase subunit 6 


439 


78 


4189 


AF130051 


Homo sapiens 


PRO0898 


89 


57 


4190 


AF090930 


Homo sapiens 


PRO0478 


137 


83 


4191 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


100 


35 


4192 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


212 


91 


4193 


Y13247 


Homo sapiens 


FBI 9 protein 


142 


46 


4194 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


268 


93 


4195 


AF124729 


Mus museums 


acinusS' 


140 


42 


4196 

• 


AJ277425 


Globodera 
pallida 


putative cuticular collagen 


156 


43 


4197 


Y86248 

* 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 


77 


4198 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


97 


45 


4199 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


119 


36 


4200 


X62379 


Mus musculus 


form in, isoform IV 


115 


48 


4201 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


224 


93 


4202 


M14820 


Trypanosoma 
brucei 


ORF2 bases 1807-2850; first start 
codon at 2032; putative 


103 

* 


29 


4203 


U93572 


Homo sapiens 


p40 


1«2 


34 


4204 


D82026 


Silene latifolia 


glycine-rich protein 


98 


49 


4205 


AF003151 


Caenorhabditis 
elegans 


contains similarity to an RNA 
recognition motif 


107 


40 


4206 


L07545 


Leishmania 
tarentolae 


A 'c* was inserted after nt 369 (=nt 
10459 in genomic sequence (M10126)) 
to correct -1 frameshift probably due to 
gel compression 


101 


35 

■ 


4207 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


104 


38 


4208 


AF229126 


Homo sapiens 

*> 


acetylcholinesterase collagen-like tail 
subunit isoform VII 


96 


75 


4209 


AF090895 


Homo sapiens 


PRO0117 


142 


68 


4210 


X92485 


Plasmodium 
vivax 


pval 


104 

■ 


35 


4211 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6909. 


173 


73 


4212 


G02828 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6909. 


151 

■ 


69 


.4213 


AF 115549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


122 • 


43 


4214 


AF090944 


Homo sapiens 


PRO0663 


97 


39 


4215 


X12544 


Homo sapiens 


3 HLA-DR B protein precursor (AA - 
29 to 267) 


Ml 


60 


4216 


M69065 


human 
herpesvirus 2 


ORF1 


89 


38 


4217 


AF090895 . 


Homo sapiens 


PRO0117 


133 


78 


4218 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


135 


46 


4219 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


71 


47 


4220 


U93567 


Homo sapiens 


p40 


170 


35 


4221 


D00570 


Mus musculus 


open reading frame (251 AA) 


108 


28 


4222 


X97675 


Homo sapiens 


plak()philin 2b 


122 


47 


4223 


AF270937 


Plutella 
xylosteUa 


PxORF73 peptide 


99 


54 
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^ 


granulovirus 








4224 


AF1 30089 


Homo sapiens 


PRO2550 


144 


69 


4225 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


125 


82 


4226 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


99 


62 


4227 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoducL 


172 


5 * 


4228 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


161 


74 


4229 


AL049537 


Homo sapiens 


dJl 164110.1 (brefeldin A-inhibited 
guanine nucleoude-exchange protein 2) 


212 


90 


4230 


AF 118086 


Homo sapiens 


PR01992 


150 


69 


4231 


M81321 

> 


Macaca 
fascicularis 


proline-rich protein 


130 


41 


4232 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


138 


68 


4233 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


132 


56 


4234 


G03798 t 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


110 


52 


4235 

• 


R13556 


Homo sapiens 

• 


Protein encoded downstream of hhc_M 
oncoprotein. 


124 


32 


4236 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


119 


42 


4237 


X93498 


Homo sapiens 


21 -Glutamic Acid-Rich Protein 


368 


56 


4238 


R95913 


Homo sapiens 


Neural thread protein. 


139 


55 


4239 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


146 


63 


4240 


X97675 


Homo sapiens 


plakophilin 2b 


108 


78 • 


4241 


Y36156 


Homo sapiens 


Human secreted protein #28. 


137 


65 


4242 


AK000385 


Homo sapiens 


unnamed protein product 


131 


33 


4243 


AJ252253 


human 
herpesvirus 2 


glycoprotein G-2 


107 


36 


4244 


Y99447 


Homo sapiens 


Human PR01556 (UNQ764) amino 
acid sequence SEQ ID NO:372. 


597 


100 


4245 

■ 


Y05398 


Homo sapiens 


Human TIE ligand NL8 protein 
sequence. 


424 


84 


4246 


L00016 


Homo sapiens 


urf4 


222 


90 


4247 


AF1 34579 


Zea mays 


arabinogalactan protein 


134 


39 


4248 


Z34465 


Zea mays 


extens in-like protein 


123 


33 


4249 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


128 


62 


4250 


J02695 

' 


Plasmodium 
yoelii 


circumsporozoite protein 

■ 


110 


33 


4251 


AK024455 


Homo sapiens 


FLJ00047 protein 


136 


65 


4252 


U02570 


Homo sapiens 


CDC42 GTPase-activating protein 


566 


93 


4253 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


449 


85 


4254 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


111 


63 


4255 

■ 


M61143 


Bovine 
herpesvirus 1 


latency-related open reading frame 2; 
putative 


101 


42 


4256 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


43 


4257 


Y06294 


Homo sapiens 


Human transcription regulator MOP6 
partial sequence. 


410 


79 


4258 


Y36495 

■ 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


112 


61 


4259 


D90252 


Human 

papillomavirus 
type 5b 


E4 protein 

» 


110 


32 
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4260 


X90569 


Homo sapiens 


elastic titin 


660 


95 


4261 


AF090930 


Homo sapiens 


PRO0478 


114 


71 


4262 


AF 132209 


Homo sapiens 


prepro-rhajor basic protein homolog 


422 


72 


-4263 


AF043102 


Pneumocystis 
carinii 


surface glycoprotein A 


121 


29 


4264 


G03790 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


132 


55 


4265 


AF090895 


Homo sapiens 


PRO0117 


159 


62 


4266 


AF090930 


Homo sapiens 


PRO0478 


143 


50 


4267 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


119 


34 


4268 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


106 


61 


4269 


D23660 


Homo sapiens 


ribosomal protein 


493 


73 


4270 


AF 130089 


Homo sapiens 


PRO2550 


119 


39 


4271 


U93564 


Homo sapiens 


putative pi 50 


121 


37 


4272 


AB033615 


Mus musculus 


phospholipase C-L2 


485 


79 


4273 


AF006082 


Homo sapiens 


Arp2 


488 


85 


4274 


Y12293 


Mus musculus 


lun 


104 


33 


4275 


X54289 


Bos taurus 


cGMP-dependent protein kinase 
(isoform I beta) 


561 


81 


4276 


AF1 19855 


Homo sapiens 


PR01847 


155 


71 


4277 


J01415 


Homo sapiens 


MTND4 


372 


63 


4278 


AB021078 


Plasmid Collb- 
P9 


ybbA 


101 


30 


4279 


G00328 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
4409. 


M0 

H. 


73 


4280 


G00328 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


no 


73 


4281 


AF1 19851 


Homo sapiens 


PR01722 


110 


60 


4282 


D38116 


Pan paniscus 


NADH dehydrogenase subunit 1 


346 


92 


4283 


Y15908 


Homo sapiens 


DIA-12C protein 


109 


52 


4284 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 
human tumor suppressor gene ZAP- 1 . 


122 


76 


4285 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


121 


42 


4286 


AK024455 


Homo sapiens 


FU00O47 protein 


135 


57 


4287 


Y11525 


Homo sapiens 


CCAAT/enhancer binding protein 

alpha 

i ..... 


100 


32 


4288 


AF033122 

- 


Homo sapiens 


non-<)53 regulated PA26-T1 nuclear 
protein 


128 


81 


4289 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


111 


39 


4290 


W48351 


. Homo sapiens 


Human breast cancer related protein 
BCRB2. 


140 


58 


4291 


AJ242540 


Vorvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


132 


34 


4292 


X60432 


Zeamays 


prolin rich protein 


118 


42 


4293 


U09367 


Homo sapiens 


zinc finger protein ZNF136 


461 


60 


4294 


AF038960 


Homo sapiens 


SKD1 homolog 


146 


82 


4295 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


128 


54 


4296 


U66561 


Homo sapiens 


kruppel-related zinc finger protein 


512 


89 


4297 


AF043706 


Caenorhabditis 
elegans 


contains similarity to granulins 


104 


55 


4298 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


65 


4299 


AF152510 


Homo sapiens 


protocadherin gamma A3 short form 
protein 


520 


87 
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■ 


uescnpuoo 


c m uh 

Waterman 
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Ideotit 

y 


4300 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


142 


72 


4301 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


109 


50 


4302 


AL035440 


Arabidopsis 
thaliana 


putative protein 


103 


31 


4303 


L05500 


Homo sapiens 


adenylyl cyclase 


655 


96 


4304 


AF1 16712 


Homo sapiens 


PR02738 


140 


54 


4305 


D13757 


Homo sapiens 


amidophosphoribosyltransferase 


182 


92 


4306 


L20450 


Mus musculus 


DNA-binding protein 


470 


64 


4307 


AF213386 


Mus musculus 


ATP-binding cassette protein 


175 


97 


4308 


X76850 


Mus musculus 


MAP kinase-activated protein kinase 2 


154 


69 


4309 


S801 19 


Rattus sp. 


reverse transcriptase homolog 


123 


55 


4310 

• 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor I 68 
kDa subunit 


110 


32 


4311 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


58 


4312 


Z46236 


Ovis aries 


keratinocyte growth factor 


210 


75 


4313 


AB020700 


Homo sapiens 


KIAA0893 protein 


569 


87 


4314 


AF124727 


Homo sapiens 


acinusS 


498 


88 


4315 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


88 


32 

■ 


4316 


AL 160493 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


122 


53 


4317 


AF 119900 


Homo sapiens 


PR02822 


135 


58 


4318 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/fork head 
homolog 1 1 A 


104 


70 


4319 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


117 


51 


4320 


AB026054 


Homo sapiens 


brain finger protein 


412 


90 


4321 


AF090944 


Homo sapiens 


PRO0663 


133 


71 


4322 


AF165513 


Homo sapiens 


vacuolar protein sorting 45 isoform 


712 


97 


4323 


Y99418 


Homo sapiens 

■ 


Human PR01317 (UNQ783) amino 
acid sequence SEQ ID NO:277. 


532 


97 


4324 


AF 163772 


Leishmania 
major 


7138.7 

■ 


142 


37 


4325 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


128 

• 


59 


4326 


W40054 


Homo sapiens 


P300/CBP-associated transcriptional 
coractor P/CAF C-terminus. 


175 


94 


4327 


AF217411 


Homo sapiens 


neuroligin 3 isoform HNL3 


549 


84 


4328 


U94836 


Homo sapiens 


ERPROT 213-21 


175 


87 


4329 


Y20763 


Homo sapiens 


Human neurofilament-M mutant 
protein fragment 45. 


501 


87 


4330 


AB037745 


Homo sapiens 


KIAA1324 protein 


1014 


99 


4331 


M91563 


Rattus 
norvegicus 


NMDA receptor subtype 2C 


116 


39 


4332 


G03704 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7785. 


286 


76 


4333 


AC006841 


Arabidopsis 
thaliana 


Mutator-like transposase 


130 


70 


4334 


AP000373 


Arabidopsis 
thaliana 


jasmonate inducible protein; 
myrosinase binding protein-like 


130 


53 


4335 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


377 


90 


4336 


AJ277426 


Globodera 
pallida 


putative cuticular collagen 


112 


37 
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4337 


U 16802 


Rattus 
norvegicus 


Ca2+-dependent activator protein; 
calcium-dependent actin-binding 
protein 


560 


79 


4338 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


118 

■ 


56 


4339 


AB024520 


Homo sapiens 


notcb4 


93 


33 


4340 


AF 128406 


Homo sapiens 


nuclear prelamin A recognition factor 


241 


100 


4341 


W19771 


Homo sapiens 


Beta-1 integrin modulator B 171. 


169 


100 


4342 


AL096828 

- 


Homo sapiens 


dJ963E22. 1 (Novel protein similar to 
NY-REN-2 Antigen) 


666 


88 


4343 


D50312 


Homo sapiens 


uKATP-1 


542 


80 


4344 


U84487 


Homo sapiens 


CX3C chemokine precursor 


371 


78 


4345 


U93569 


Homo sapiens 


putative pi 50 


111 


47 


4346 


AL161755 


Streptomyces 
coelicolor A3(2) 


putative serine/threonine protein kinase 

- 


102 


30 


4347 


AK026162 


Homo sapiens 


unnamed protein product 


377 


98 


4348 


AF221759 


Homo sapiens 


Maml 


314 


47 


4349 


AF 165926 


Homo sapiens 


NUP155 


147 


84 


4351 


U50185 


Rattus ' 
norvegicus 


PP-1M 


144 


52 


4352 


D50455 


Rattus 
norvegicus 


phodpholipase C delta4 


196 


70 


4353 


X54131 


Homo sapiens 


protein-tyrosine phosphatase 


261 


87 


4354 


AF151850 


Homo sapiens 

A. 


CGI-92 protein 


255 


92 


4355 


G03996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8077. 


105 

* 


95 


4356 


U93564 


Homo sapiens 


putative pi 50 


171 


72 


4357 


AP001507 


Bacillus 
halodurans 


unknown conserved protein 


159 

• 


34 


4358 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


153 


60 


4359 


AF072697 


Musmusculus 


SHYC 


177 


97 


4360 

• 


G03789 

> 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


129 


69 


4361 


M98776 


Homo sapiens 


keratin 1 


449 


77 


4362 


AK024436 


Homo sapiens 


FLJ00026 protein 


671 


91 


4363 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


155 


49 


4364 


D13896 


Rattus 
norvegicus 


cytoplasmic dynein heavy chain 


327 


92 


4365 


X79389 


Homo sapiens 


glutathione transferase Tl 


164 


96 


4366 


AB006458 


Mus musculus 


alpha-D-mannosidase 


177 


56 


4367 


X01060 


Homo sapiens 


put. transferrin receptor (aa 1-760) 


160 


96 


4368 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


103 


54 


4369 


AL353715 


Homo sapiens 


DK3184A7.3.1 (helicase-like protein 
NHL) 


485 


100 


4370 


AF001631 


Oryctolagus 
cuni cuius 


glucose-regulated protein GRP94 


118 


92 


4371 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


113 


80 


4372 


AL121741 


Schizosaccharom 
yces pombe 


putative negative regulator of vesicle ' 
formation 


200 


41 


4373 


M96982 


Homo sapiens 


U2 snRNP auxiliary factor small 
subunit 


279 


70 


4374 


U47741 


Homo sapiens 


CREB-binding protein 


123 


96 


4375 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


127 


75 


4376 


Y 12781 , 


Homo sapiens 


transducin (beta) like 1 protein 


1056 


90 
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* 


Descnption 
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% 

Identit 

y 


4377 


W69424 


Homo sapiens 


Human secreted protein bg!4(M . 


183 


97 


4378 

• 


AL096700 


Homo sapiens 

• 


dJ499B10.2 (phosphorylase kinase, 
alpha2(Uver)(PYK)) 


639 


86 


4379 


AFi 77390 


Manduca sexta— 


antennal specific membrane protein 
AMP 


378 


51 


4380 


X97675 


Homo sapiens 


plakophilin 2b 


156 


75 


4381 


R33713 


Homo sapiens 


PgllOl. 


104 


100 


4382 


AB015473 

■ 


Arabidopsis 
thaliana 


gene_id:MCM23.1-unknown protein 


113 


61 

• 


4383 


AFI 16715 


Homo sapiens 


PR02829 


133 


50 


4384 


AL357472 


Homo sapiens 


VPS33B 


676 


99 


4385 


AF090931 


Homo sapiens 


PRO0483 


155 


58 


4386 


U92645 

« » 


Gecarcinus 
lateralis 


alpha* 1 -tubulin 


511 


75 


4387 


Y36095 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 480. 


202 


90 


4388 


M64923 


Bostaurus 


CIO protein 


218 


95 


4389 


X98264 


Homo sapiens 


M-phase phosphoprotein 4 


197 


100 


4390 


AK000264 


Homo sapiens 


unnamed protein product 


239 


73 


4391 


M 12937 , 


Homo sapiens 


ferritin heavy subunit 


662 


84 


4392 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


116 


80 


4393 


X16135 


Homo sapiens 


L protein (AA 1-558) 


759 


99 


4394 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


58 


4395 


AC005591 


Homo sapiens 


PkB-iike 


170 


97 


4396 


AFI 6 1426 


Homo sapiens 


HSPC308 


313 


77 


4397 


Y02671 


Homo sapiens 

* 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


146 


61 


4398 


AFI 04921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


679 


88 


4399 


AF257330 


Homo sapiens 


COB W-like protein 


586 


90 


4400 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


116 


65 


4401 


AF119851 


Homo sapiens 


PR01722 


101 


79 


4402 


M55542 


Homo sapiens 


guanylate binding protein isoform I 


230 


76 


4403 


Y07752 


Volvox carteri 


pherophorin-S 


459 


88 


4404 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


113 


44 


4405 


U07786 


Sus scrofe 


beta actin 


519 


90 


4406 


A06977 


Homo sapiens 


albumin 


586 


88 


4407 


L20755 


Cuscuta reflexa 


hybrid proline-rich protein;cytokinh> 
induced;haustoria 


112 


41 


4408 


AB002299 


Homo sapiens 


KIAA0301 


612 


98 


4409 


AB001424 


Mus museums 


KIF17 


104 


47 


4410 


M88108 


Homo sapiens 


p62 


574 


92 


4411 


AL121673 


Homo sapiens 


bA305P22.2 (novel protein) 


415 


71 


4412 


AF064553 


Mus musculus 


NSD1 protein 


224 


64 


4413 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


107 


36 


4414 


A06977 


Homo sapiens 


albumin 


505 


82 


4415 


A06977 


Homo sapiens 


albumin 


596 


90 


4416 


Y87116 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:155. 


123 


50 


4417 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


103 


85 


4418 


G03053 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7134. 


100 


54 


4419 


AF049606 


Mus musculus 


transcription factor NF-ATc isoform b 


114 


90 
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O tit 
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% 

y 


4420 


Z11190_cdl 


Homo sapiens 


1 l-DEC-1998 lnterleukiir-3 coding 
sequence from b2HFLS20 W cDNA 
library. 


354 

■ 


97 

■ 


4421 


W64469 


Homo sapiens 


Human secreted protein from clone 
CW795 2. 


203 


100 


4422 


M12523 


Homo sapiens 


alloalbumin Venezia 


350 


94 


4423 


AF130077 


Homo sapiens 


PR02619 


561 


89 


4424 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


161 


64 


4425 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


393 


91 


4426 


A09561 


synthetic 
construct 


human serum albumin 


651 


96 


4427 


AL133215 


Homo sapiens 


bA108L7.5 (novel protein similar to 
Plasmodium POM1 and C. elegans 
F46G11.1 (Tr:Q20485)) 


392 


100 


4428 


W33663 


Homo sapiens 


Human puromycin-sensitive 
aminopeptidase (PSA)-68. 


172 


* 


4429 


AB021654 


Homo sapiens 

m 


DD2/biie acid-binding 
protem/AKRlC2/3alpha- 
hydroxysteroid dehydrogenase type 3 


184 


81 


4430 


W63683 


Homo sapiens 


Human secreted protein 3. 


114 


42 


4431 


AY008763 


Homo sapiens 


sentrin/SUMO-specific protease 


447 


96 


4432 


U52965 


Homo sapiens 


ENX-1 


176 


94 


4433 


AF 180470 


Mus musculus 


Kiaa0575 


423 


77 


4434 


X17206 


Homo sapiens 


put LLRep3 protein (AA 1-22 1) 


581 


99 


4435 


Y70929 


Homo sapiens 


Human zilla4 splice variant protein. 


621 


100 


4436 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


135 


71 


4437 


A06977 


Homo sapiens 


albumin 


610 


97 


4438 


A00279 


synthetic 
construct 


Human serum albumin 


621 


94 


4439 


L29028 


Chlamydomonas 
eugametos 

*> 


amino acid feature: N-giycosylation 
sites, aa 41 .. 43, 46 .. 48, 51 .. 53, 72 
74, 107 .. 109, 128 130,132 134, 
158 .. 160, 163 .. 165; amino acid 
feature: Rod protein domain, aa 169 .. 
340; amino acid feature: globular 
protein domain, aa 32 ..168 


105 


36 


4440 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


65 


4441 


U49973 


Homo sapiens 

■ 


ORF1 ; MER37; putative transposase 
similar to pogo element 


133 


69 


4442 


G02872 

• 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6953. 


142 


60 


4443 


AF1 13685 


Homo sapiens 


PRO0974 


101 


54 


4444 


W03627 


Homo sapiens 


Human follicle stimulating hormone 
GPR N-terminal sequence. 


181 


42 


4445 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


132 

* 


76 


4446 


G00412 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4493. 


141 

• 


54 

- 


4447 


L02867 


Homo sapiens 


paraneoplastic antigen 


136 


78 


4448 


D38435 


Homo sapiens 


homologue of yeast PMS1 


108 


66 


4449 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


122 


65 


4450 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


68 


4451 


AF220264 


Homo sapiens 


MOST-1 


101 


40 


4452 


AF130079 


Homo sapiens 


PR02852 


97 


59 


4453 


AF116715 


Homo sapiens 


PR02829 


139 


69 J 
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y 


4454 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


144 


53 


4455 


AF130087 


Homo sapiens 


PR02411 


108 


73 


4456 


AF279891 


Homo sapiens 


dead box protein 15 

* 


108 


58 


4457 


L08258 


Strongylocentrot 
us purpuratus 

* * 


kinesin light chain isoform 4 


176 


94 


4458 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


116 

• 


78 


4460 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


146 


74 


4461 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


156 


52 


4462 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


118 


72 


4463 


AF 119900 


Homo sapiens 


PR02822 


144 


80 


4464 


M15530 


Homo sapiens 


B-cell growth factor 


92 


76 


4465 


Y 17833 


Human 
endogenous 
retrovirus K 


env protein 


107 


62 


4466 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


78 


4467 


AF090895 


Homo sapiens 


PRO0117 


112 


79 


4468 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


146 


64 


4469 


X97675 


Homo sapiens 


plakophilin 2b 


119 


78 


4470 


AF1 18081 


Homo sapiens 


PRO 1900 


119 


74 


4471 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


121 


60 


4472 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


63 


4473 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


205 


73 


4474 


AF130114 


Homo sapiens 


PR02459 


121 


67 


4475 


AF178534 


Homo sapiens 


talin 

• 


213 


67 


4476 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
cene No. 119. 


122 


92 


4477 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


99 


66 


4478 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


142 


62 


4479 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


153 


55 


4480 


AF2 17374 


A can th aster 
planci 


cytochrome oxidase subunit I 


130 


100 


4481 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. • 


110 


57 


4482 


Y20983 


Homo sapiens 


Human glial fibrillary acidic protein 
GFAP wild type fragment 9. 


110 


63 


4483 


AF221552 


Oryza sativa 


proline-rich protein RiP- 15 


119 


33 


4484 


L25941 


Homo sapiens 


integral nuclear envelope inner 
membrane protein 


110 


84 


4485 


AF130089 


Homo sapiens 


PRO2550 


161 


81 


4486 


U39742 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
yk25e5.3; coded for by C. elegans 
cDNA yk25e5.5; similar to repeat 
guanylate kinase domain of D. 
melanogaster Iethal(l) discs large- 1 
tumor suppressor protein 
(SP:DLGl_DROME, P31007) and R 
norvegicus postsynaptic density protein 


98 


72 
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95 (PSD-95) (SP:PSD9_RAT, P31016) 






4487 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


89 


81 


4488 


X97675 


Homo sapiens 


plakophilin 2b 


127 


88 


4489 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


105 

• 


71 


4490 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


160 


64 


4491 

• 


G03043 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
7124. 


122 

■ 


77 


4492 

■ 


U83280 


Leishmania 
donovani 


39 kDa antigen 


107 

■ 


76 


4493 


AF023142 


Homo sapiens 


pre-mRNA splicing SR protein rA4 


141 


40 


4494 


K02576 


Homo sapiens 


salivary proline-rich protein 1 . 


119 


40 


4495 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


98 


45 


4496 


AF130089 


Homo sapiens 


PRO2550 


132 


75 


4497 


AF090930 


Homo sapiens 


PRO0478 


160 


76 


4498 


AF1 19900 


Homo sapiens 


PR02822 


158 


55 


4499 


AF1 16661 


Homo sapiens 


PR01438 


126 


44 


4500 


AF1 16661 


Homo sapiens 

* 


PR01438 


118 


42 


4501 


AL1 60371 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


113 


48 


4502 


M81321 


Macaca 
fascicularis 


proline-rich protein 


154 


44 


4503 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


154 


96 


4504 


D44596 


Saccharomyces 
cerevisiae 


Mdjlp heat shock protein 


93 


42 


4505 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


163 


76 


4506 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


207 


93 


4507 


U93564 


Homo sapiens 


p40 


520 


86 


4508 


Y02999 


Homo sapiens 

* 


Fragment of human secreted protein 
encoded by gene 121 . 


135 


55 


4509 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


77 


4510 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


146 


66 


4511 


L27428 


Homo sapiens 

* 


reverse transcriptase 


143 


87 


4512 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


105 


80 


4513 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


97 


75 


4514 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


60 

* 


4515 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


73 


4516 


AF130051 


Homo sapiens 


PRO0898 


141 


86 


4517 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


185 


94 


4518 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


130 


100 


4519 


AK024455 


Homo sapiens 


FU00047 protein 


152 


68 


4520 


AF210651 


Homo sapiens 


NAG 18 


16J 


88 


4521 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


140 


68 


4522 


AF217374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


131 


76 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


4523 


U72520 


Mus musculus 


mena protein 


102 


42 


4524 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


129 


75 


4525 


AL 160493 


Leishmania 
major 


probable (hh v-6) u 1 1 02, variant a 
DNA, complete virion genome 


108 


84 


4526 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


139 


76 


4527 


G03043 


Homo sapiens 


Human, secreted protein, SEQ ID NO: 
7124. 


138 


89 


4528 


AF090931 


Homo sapiens 


PRO0483 


91 


88 


4529 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


111 


51 


4530 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


130 


92 


4531 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


123 


64 


4532 


L26953 


Homo sapiens 


chromosomal protein 


102 


79 


4533 


G02832 

i 


Homo saoiens 


Human secreted protein SEO ID NO* 
6913. 


88 


54 


4534 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284.. 


122 


40 


4535 


AF090944 


Homo saoiens 


PRO0663 


113 


53 


4536 


U15647 

- 


Mus musculus 


expressed in select embryonal 
carcinoma cells and testicular cells* 
similar to Swiss-Prot Accession 
Number PI 1260 


119 

* 


35 


4537 


JL13858 


Homo sapiens 


guanine nucleotide exchange factor 


150 


... 

81 


4538 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


138 


68 


4539 


AF090930 


Homo sapiens 


PRO0478 v 


104 


53 


4540 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


112 


64 


4541 


K02401 


Homo sapiens 


chorionic somatomammotropin 


672 


96 


4542 


W88627 


Homo saoiens 


Secreted protein encoded bv sene 94 
clone HPMBQ32. 


171 


67 


4543 


J04208 


Homo sapiens 


inosine-5-monophosphate 
dehydrogenase (EG 1.1.1 205) 


455 


92 


4544 


K02401 


Homo sapiens 


chorionic somatomammotropin 


397 


88 


4545 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


155 

• 


66 


4546 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


161 


68 


4547 


R24056 


Homo sapiens 


hGH variant #8 - 172Arg 174Thr 
176Tyr 178His. 


117 


62 


4548 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


111 


68 


4549 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


124 


65 


4550 


AF1 19855 


Homo sapiens 


PR01847 


152 


68 


4551 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


153 


70 


4552 


M59217 


Homo sapiens 


alpha- 1 type XIII collagen 


98 


57 


4553 


R83119 


Homo sapiens 


Human cisplatin resistance protein. 


136 


96 


4554 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


72 


4555 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


91 


60 


4556 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


70 


4557 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


101 


44 
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Accession wo. 


apecies 


Description 


oiniin- 


OjZT 


NO: 




Watennan 


Identit : 




• 






Score 


y 




■ 




isoform 






4558 


AF130089 


Homo sapiens 


PRQ2550 


150 


85 


4559 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


161 


56 




■ 


1 


clone HPMBQ32. 




* 


4560 


W48351 


Homo sapiens 


Human breast cancer related protein 


81 


64 


- 




BCRB2. 






4561 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


55 








6619. 


■ 




4562 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


82 








7195. 




■ 


4563 


AF090931 


Homo sapiens 


PRO0483 


145 


73 


4564 


AJ242540 


Volvox carteri f. 


hydroxyproline-rich glycoprotein DZ- 


358 


77 


• - 




nagariensis 


HRGP 






4565 


AF130089 


Homo sapiens 


PRO2550 


175 


85 


4566 


293244 


Homo sapiens 


bKl 16F5.2 (PUTATIVE RhoGAP 


125 


61 








(CDC42 GTPAse Activating Protein) 












LIKE protein) 






4567 


G00397 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 


142 


56 








4478. 






4568 


AF130089 


Homo sapiens 


PRO2550 


125 


75 


4569 


AF130089 


Homo sapiens 
* 


PRO2550 


172 


82 


4570 


AF225961 


Rattus 


RhoGEF glutamate transport modulator 


149 


78 






norvegicus 


GTRAP48 


■ 




4571 


W48351 


Homo sapiens 


Human breast cancer related protein 


143 


64 








BCRB2. 






4572 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


70 


- 


- 




6721. 






4573 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


73 








7867. 






4574 


X92485 


Plasmodium 


pval 

* 


111 


52 


* 




vivax 








4575 


U93567 


Homo sapiens 


p40 


151 


71 


4576 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 


90 


57 








clone HNHF029. 






4577 


AC007508 


Arabidopsis 


F1K23.4 


143 


42 






thaliana 








4578 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


53 








7888. 






4579 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


175 


73 








7874. 


. 




4580 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


76 








4419. 




• 


4581 


AF1 30079 


Homo sapiens 


PR02852 


117 


70 


4582 


AF090930 


Homo sapiens 


PRO0478 


109 


57 


4583 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


134 


79 


4584 


G03172 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


149 


62 








7253. 






4585 


AF1 18090 


Homo sapiens 


PRO2044 


175 


97 


4586 


AL080243 


Homo sapiens 


E 1 A binding protein p300; match: 


144 


93 








proteins: Sw:Q09472 Sw:Q92793 


* . 










Sw:P45481 Wp:CE00571 












Wp:CE21 1 17 TrO01368 Wp:CE08856 












Wp:CE00570 Wp:CE08453 Tr:O44076 






4587 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 


140 


62 








homologue (ICEL) protein sequence. 






4588 


AJ131245 


Homo sapiens 


Sec24B protein 


131 


81 


4589 


W02105 


Homo sapiens 


Human L-asparaginase. 


559 


100 


4590 


L26953 


Homo sapiens 


chromosomal protein 


146 


71 


4591 


Ml 6976 


Glycine max 


N-75 


125 


36 



» 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitn- 
Wa terra aa 
Score 


% 

Ideotit 

y 


4592 


AFH9851 


Homo sapiens 


PR01722 


118 


61 


4593 


AF090931 


Homo sapiens 


PRO0483 


99 


85 


4594 


M20789 


Homo sapiens 


alpha-] type I collagen 


103 


41 


4595 

"i 


G00381 


Homo sapiens 


Human secreted proteinrSEQ ID NO: 
4462. 


97 


72 


4596 


AF1 30089 


Homo sapiens 


PRO2550 


124 


71 


4597 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


143 


65 


4598 


AF 118082 


Homo sapiens 


PRO1902 


122 


64 


4599 


AK023034 


Homo sapiens 


unnamed protein product 


156 


100 


4600 


AF 118082 


Homo sapiens 


PRO1902 


148 


62 


4601 


AF 119855 


Homo sapiens 


PR01847 


130 


50 


4602 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


99 


77 


4603 


G02994 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7075. 


162 


65 


4604 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


135 


54 


4605 


AL1 60371 

i 


Leishmania 
major 


probable (hhv-6) u l 102, variant a 
DNA, complete virion genome 


123 


75 


4606 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


127 


74 


4607 


V00672 


Pan troglodytes 


reading frame protein 4 


194 


72 


4608 


X86791 


Sus scrofa 


beta-giobin 


156 


87 


4609 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


130 


88 


4610 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


146 


90 


4611 


G03789 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7870. 


149 


68 


4612 


AB029309 


Homo sapiens 


Npw38-binding protein NpwBP 


120 


38 


4613 


W48351 


Homo sapiens 

* 


Human breast cancer related protein 
BCRB2. 


148 


53 


4614 


AK024372 


Homo sapiens 


unnamed protein product 


124 


63 


4615 


AC005360 


Homo sapiens 


FAA 


175 


91 


4616 


D38113 


Pan troglodytes 


ATPase subunit 6 


204 


69 


4617 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


122 


45 


4618 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


118 


81 


4619 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


99 


90 


4620 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


50 


4621 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


107 


45 


4622 


AF126163 


Homo sapiens 


HHLA3 protein 


124 


75 


4623 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


109 


69 


4624 


U38184 


Trypanosoma 
cruzi 


ATPase subunit 6 


102 


45 

. 


4625 


U01849 


Trypanosoma 
brucei 


ORF1 


97 


36 


4626 


G03058 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7139. 


95 


57 


4627 


M33228 


Trypanosoma 
brucei 


ATPase 6 


126 


37 


4628 


AF1 18086 


Homo sapiens 


PR01992 


113 


60 


4629 1 AF041330* 


Bodo saltans 


NADH dehydrogenase subunit 5 


149 


41 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
WatermaD ' 
Score 


as 

Identit 

y 


4630 


XI 2544 


Homo sapiens 


3 HLA-DR B protein precursor (AA - 
29 to 267) 


108 


48 


4631 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 1 63. 


111 


46 


4632 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: J 63. 


124 


60 


4633 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


151 


60 


4634 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


138 


100 


4635 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


135 


56 


4636 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


100 


4637 


X67320 


Rattus 
norvegicus 


Hld-Histone 


122 


40 


4638 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


181 


76 


4639 


Y36366 


Homo sapiens 

1 


Fragment of human secreted protein 
encoded by gene 3. 


96 


73 


4640 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


136 


54 


4641 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


70 


4642 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


190 


56 


4643 


M29580 


Homo sapiens 


zinc finger protein 7 (ZFP7) 


103 


38 


4645 


U47741 


Homo sapiens 


CREB-bmding protein 


207 


94 


4646 


AL035461 


Homo sapiens 


dJ967N21.5 (novel MCM2/3/5 family 
member) 


213 


80 


4647 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7516. 


76 


60 


4648 


AF227948 


Homo sapiens 


HBV pX associated protein-8; XAP-8 


402 


86 


4649 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


143 


86 


4650 


U93574 


Homo sapiens 


putative pi 50 


121 


33 


4651 


A02739 


synthetic 
construct 


chloramphenicol acetyltransferase 


622 


99 


4652 


AF 190449 


Mus musculus 


putative transcription factor ALF-4 


274 


83 


4653 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


108 


76 


4654 


X15187 


Homo sapiens 


precursor polypeptide (AA -21 to 782) 


763 


98 


4655 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


128 


91 


4656 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


169 


77 


4657 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


135 


83 


4658 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


60 


4659 


AC004850 


Homo sapiens 


vacuolar assembly protein VPS41 
homolog (S53) | 


187 


92 


4660 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


181 


92 


4661 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


138 


58 


4662 


Y 14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


154 


78 


4663 


M69180 


Homo sapiens 


nonmuscle myosin heavy chain-A 


106 


84 


4664 


AF 130089 


Homo sapiens 


PRO2550 


117 • 


65 


4665 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


91 
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• • • 



CPA f TV 

NO: 


Accession i~o. 


opccics 

* 




Smith. 

Waterman 
Score 


Ideotit 

y 


4666 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


62 


4667 


AP000616 


Oryza sativa 


1 similar to RING-H2 finger protein 
RHAla(AF078683) 


111 


78 


4668 


AF 113695 


Homo sapiens 


PR01365 


82 


71 


4669 


AF003540 


Homo sapiens 


KrueppeJ family zinc ringer protein 


111 


64 


4670 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 

M. 


49 


4671 


AL 10965 8 


Homo sapiens 


dJ776F14.1 (ortholog of mouse P47) 


110 


100 


4672 


G03789 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: ' 
7870. 


129 


62 

• 


4673 


AP000616 


Oryza sativa 

• 


similar to RING-H2 finger protein 
RHAla(AF078683) 


138 


60 


4674 


X51591 


Homo sapiens 


beta-myosin heavy chain (1151 AA) 


442 


97 


4675 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


148 


76 


4676 


D86971 


Homo sapiens 


no similarities to reported gene 
products 


133 


100 

■ 


4677 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


142 


43 


4678 


AF044205 


Gossypiurn 
hirsutum 


proline-rich protein precursor 


117 


39 


4679 


D38116 


Pan paniscus 


NADH dehydrogenase subunit 4 


243 


98 


4680 


AF1 13685 


Homo sapiens 


PRO0974 


117 


74 


4681 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


193 


90 


4682 


L26163 


Mus musculus 


histone Hie 


111 


37 


4683 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


J45 


78 


4684 

■ 


Y86248 

■ 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


112 

■ 


53 

- 


4685 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


110 


60 


4686 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


153 


72 


4687 


D38112 


Homo sapiens 


NADH 'dehydrogenase subunit 6 


165 


100 


4688 


W50922 


Homo sapiens 


Amino acid sequence of a heterogenous 
ribonucleotide protein. 


139 


100 


4689 


Y00281 


Homo sapiens 


precursor 


789 


100 


4690 


AF210651 


Homo sapiens 


NAG 18 


142 


65 


4691 


L26953 


Homo sapiens 


chromosomal protein 


124 


72 


4692 


L26953 


Homo sapiens 


chromosomal protein 


124 


72 


4693 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


166 


80 


4694 


L27428 


Homo sapiens 


reverse transcriptase 


144 


87 


4695 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


165 


74 


4696 


W29474 


Homo sapiens 


Human histone HI isoform H1S-1. 


93 


41 


4697 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


126 


56 


4698 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


138 


62 


4699 

• 


G00413 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
4494. 


141 


54 


4700 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


162 


70 


4701 


AF003540 


Homo sapiens 


Krueppel family zinc finger protein 


112 


88 


4702 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


159 


59 


4703 


AF241242 


Mus musculus 


T-cell-specific T-box transcription 
factor T-bet 


294 


75 


4704 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


152 


80 
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CPA Vtv 

NO: 


Accession NO. 


species 


Description 


iJUIIUI* 

Waterman 
Score 


0/ 

/o 

Identit 

y 


• 






SEQ ID NO: 1 63. 






4705 


AF038963 


Homo sapiens 


RNA helicase 


102 


70 


4706 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


148 


63 


4707 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


145 


70 


4708 


AF289099 


Maackia 
amurensis 


ENOD2f 


113 


42 


4709 


AF130087 


Homo sapiens 


PR02411 


124 


57 


4710 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


188 


86 


4711 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


86 


4712 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


143 

■ 


78 


4713 


AF273133 


Ochotona 
macrotis 


NADH dehydrogenase subunit4 

* 


154 


76 


4714 


G03453 


Homo sapiens 

! 


Human secreted protein, SEQ ID NO: 
7534. 


104 


48 


4715 


Z18361 


Ovis aries 


trichohyalin 


120 


33 


4716 


AF1 30089 


Homo sapiens 


PRO2550 


152 


75 


AIM 


AF1 19851 


Homo sapiens 


PR01722 


143 


60 


4718 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


145 


45 


4719 


K02401 


Homo sapiens 


chorionic somatomammotropin 


619 


96 


4720 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


114 


52 


4721 


J03756 


Homo sapiens 


hGH-V2 


515 


74 


4722 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


194 


93 


4723 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


128 


56 


4724 

* 


D49490 

■ 


Homo sapiens 


protein disulfide isomerase-related 
protein (PDIR) 


213 


55 


4725 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


138 


66 


4726 


K02401 


Homo sapiens 


chorionic somatomammotropin 


657 


98 


4727 


D87459 


Homo sapiens 


Similar to Voibox carteri extensin 
(S22697) 


162 

■ 


80 


4728 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) u 1 1 02, variant a dna, 
complete virion genome. 


99 


34 


4729 


AF090944 


Homo sapiens 


PRO0663 


151 


63 


4730 


M32305 

■ # ■ 


Human 

papillomavirus 
type 47 


E1/E4 fusion protein 

• 


106 


40 


4731 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


133 


57 


4732 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


76 


4733 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


114 


43 


4734 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


133 


72 


4735 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


113 


43 


4736 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


99 


59 


4737 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


92 


47 


J738 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


133 


73 
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1 SEQ ID 
NO: 


' Accession No. 


Species 

• 


Description 


Smith- 
Watermao 

Score 


% 

IdeoHt 

y 






i 


RHAla (AF078683) 






4739 


AF044205 


Gossypiura 
hirsutum 


proline-rich protein precursor 


105 


42 


4740 


S78854 


Oryctolagus 
cuniculus 


alpha-tropomyosin 


159 


87 


4741 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


151 


79 


4742 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


1 59 


85 


4743 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


122 


100 


4744 


Y86248 


Homo sapiens 

i * 


Human secreted protein HCHPF68, 
SEQ1DN0:163. 


176 


82 


4745 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


154 


78 


4746 . 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


164 


80 


4747 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


120 


100 


4748 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-b2 finger 
protein rhala. 

* 


117 


53 


4749 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


208 


93 


4750 


AF1 55232 


Pisum sativum 


extensin 


172 


40 


4751 


AP000616 


Oryza. sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


117 




4752 


AF130079 


Homo sapiens 


PR02852 


104 


70 


4753 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


154 


50 


4754 


G04072 

- 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
8153. 


167 


50 


4755 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


138 


100 


4756 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


163 


66 


4757 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


45 


4758 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


242 


94 


4759 


AF046935 


Homo sapiens 


PCFllphomolog 


244 


94 


4760 


M76546 


Heltanthus 
annuus 


hydroxyproline-rich protein 


165 


41 


4761 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 
human tumor suppressor gene ZAP- 1 . 


115 


60 


4762 


Y27868 

• 


Homo sapiens 

■ 


Human secreted protein encoded by 
gene No. 107. 


178 


84 


4763 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


134 


84 


4764 


R95913 


Homo sapiens 


Neural thread protein. 


136 


70 


4765 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


139 


69 


4766 

. 


Y86248 

• 


Homo sapiens 

■ 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


102 


56 

• 


4767 


U63159 


Mus musculus 


transaldolase 


218 


100 


4768 


AF230279 


Caenorhabditis 
elegans 


SW13-like protein; PSA-1 


106 


38 


4769 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


116 


56 


4770 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


136 


67 


4771 


X92485 


Plasmodium 


pval 


85 


64 



255 



( 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: 


Accession No. 
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% 

Identit 

y 






vivax 








4772 


S58722 


Homo sapiens 


X-lmked retinopathy protein {C- 
terminai, clone XEH.8c} 


95 


80 


4773 


AC002398 


Homo sapiens 


F25965 3 ~* 


96 


39 


4774 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


no 


75 


4775 . 


AF120151 


Homo sapiens 


cytokine receptor-like molecule 9 


112 


100 


4776 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


141 


74 


4777 


G01828 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


108 


75 


4778 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


317 


55 


4779 


AF265575 


Homo sapiens 


ubiquitous TPR-mouf protem Y 
isoform 


101 


55 


4780 


AB007860 


Homo sapiens 


KIAA0400 


119 


88 


4781 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


122 


88 


4782 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


98 


84 

. • 


4783 


M22334 


Homo sapiens 


unknown protein 


611 


93 


4784 


Y36203 


Homo sapiens 


Human secreted protein #75. 


350 


76 


4785 


K02401 


Homo sapiens 


chorionic somatomammotropin 


188 


97 


4786 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


152 


64 


4787 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


102 


67 


4788 


M38451 


Homo sapiens 


growth hormone 


264 ! 


94 


4789 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


146 


77 


4790 


K02401 


Homo sapiens 


chorionic somatomammotropin 


250 


96 


4791 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


127 


88 


4792 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


125 


78 


4793 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


131 


51 


4794 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


124 


57 


4795 


M18095 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


103 


34 


4796 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


131 


69 


4797 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


160 


77 


4798 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


120 


51 


4799 


AL009I46 


Drosophila 
melanogaster 


alternatively spliced form 


79 


94 


4800 


S74221 


Homo sapiens 


IK factor=cytokine down-regulating 
HLA class II 


108 


80 


4801 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protem (nuclear rna-binding 
protein) 


121 


100 


4802 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


94 


90 


4803 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


101 


90 
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Species 


Description 


omtin- 

Waterman 

Score 


Identit 

y 


4804 


R95913 


Homo sapiens 


Neural thread protein. 


139 


71 


4805 


X61045 


Hydra sp. 


mini-collagen 


126 


68 


4806 


X12544 


Homo sapiens 


3 HLA-DR B protein precursor (AA - 
29 to 267) 


114 


56 


4807 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


160 


84 


4808 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


277 


94 


4809 


L26953 


Homo sapiens 


chromosomal protein 


90 


60 


4810 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


108 


76 


4811 


L26953 


Homo sapiens 


chromosomal protein 


121 


62 


4812 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


124 


71 


4813 


G03787 


Homo sapiens 


Human secreted protein, SEQID NO: 
7868. 


78 


46 


4814 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


100 


78 


4815 


AF152961 


Homo sapiens 

* 


chromatin-specific transcription 
elongation factor FACT 140 kDa 
subunit 


152 

* 


96 


4816 


U93569 


Homo sapiens 


putative pi 50 


334 


82 


4817 


AB014575 


Homo sapiens 


KIAA0675 protein 


125 


67 


4818 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


114 


90 


4819 


G00354 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
4435. 


129 


45 


4820 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


134 


48 


4821 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


129 


61 


4822 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


137 


83 


4823 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


126 


55 


4824 


AF1 13685 


Homo sapiens 


PRO0974 


no 


54 


4825 


U63542 


Homo sapiens 


FAP protein 


136 


70 


4826 


X83413 


Human 
herpesvirus 6 


U88 


130 


54 


4827 

• 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


152 


76 


4828 


U76609 


Homo sapiens 


ribosomal L5 protein 


134 


89 


4829 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


110 


57 


4830 


, AC006161 


Arabidopsis 
thaliana 


putative CENP-B/ARS-binding 
protein-like protein 


141 


31 „ 


4831 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


143 


71 


4832 


U94189 


Rattus 
norvegicus 


Duo 


158 


79 


4833 


J04031 


Homo sapiens 


MDMCSF (EC 1.5.1.5; EC 3.5.4.9; EC 
6.3.43) 


193 


97 


4834 


Y13397 


Homo sapiens 


Amino acid sequence of protein 
PR0334. 


654 


97 


4835 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


107 


53 


4836 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


464 


93 


4837 


AF272833 


Homo sapiens 


misato 


413 


87 


4838 


AF130089 


Homo sapiens 


PRO2550 


132 


71 


4839 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


54 
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SEO ID 

NO: 


''Accession No. 


Snecics 

• 


Descriotion 


Smith- 

Waterman 

Score 


% 

Identit 

y 




I 




4414. 






4840 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


89 


80 


484) 


W77354 


Homo sapiens 


Human telomere repeat binding factor 
2 truncated protein. 


106 


100 


4842 


AF090942 


Homo sapiens 


PRO0657 


160 


75 


4843 


AF090944 


Homo sapiens 


PRO0663 


202 


80 


4844 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


167 


81 


4845 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, . 
SEQIDNO:163. 


166 


74 


4846 


Y86248 


Homo sapiens 


Human secreted protein HGHPF68, 
SEQIDNO:163. 


228 


97 


4847 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


198 


95 


4848 


AB023485 


Mus musculus 


transcription factor CA150b 


118 


42 


4849 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


98 


42 


4850 


AF025467 

* 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein Kl 0 (NID:g8 148) 


106 


45 


4851 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


179 


92 


4852 


Y86248 

+ 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


145 


82 


4853 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


123 


77 


4854 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


112 


55 


4855 


U66464 


Homo sapiens 


hematopoietic progenitor kinase 


110 


81 


4856 


AC0 11708 


Arabidopsis 
thaliana 


putative RING zinc finger protein 


121 


43 


4857 


M81321 


Macaca 
fascicularis 


proline-rich protein 


126 


35 


4858 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


104 


40 


4859 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


195 


75 


4860 


X65165 


Volvox carteri 


extensin 


248 


55 


4861 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


106 


79 


4862 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


185 


92 


4863 

• 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


142 


75 


4864 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


4865 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


163 


63 


4866 


AB023229 


Homo sapiens 


KIAA1012 protein 


208 


92 


4867 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


144 


80 


4868 


L26953 


Homo sapiens 


chromosomal protein 


126 


70 


4869 


U63542 


Homo sapiens 


FAP protein 


134 


40 


4870 


D86853 


Catharanthus 
roseus 


extensin 


102 


48 


4871 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


152 


69 


4872 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


136 


55 


4873 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


156 


84 
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»> 
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% 
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y 


4874 


Y 17832 


Human 
endogenous 
retrovirus K 


env protein 

* 


219 

- 


67 


4875 


AF2I7517 


Homo sapiens 


un characterized bone marrow protein 
BM041 


81 


38 


4876 


L27428 


Homo sapiens 


reverse transcriptase 


105 


91 


4877 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


146 


46 


4878 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


99 


38 


4879 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


105 


77 


4880 


R58816 


Homo sapiens 


Human c-myc far upstream element 
(FUSE) binding protein (FBP)variant 
from PBL clone 31-10. 


87 


39 


4881 

1 X* \af m 


G03258 

Xr mm dtmWmm %m 


Homo sapiens 

mm, ^ ^ m a\ am-m ^a* m^arama>mtm mm ^mr at ■ mm 


Human secreted protein, SEQ ID NO: 
7339. 


150 


67 


4882 


AF 175223 

m mm m M am* a\amaaa+mm 


Drosophila 
melanogaster 


SANT domain protein SMRTER 

^r* t mm ^mW amw mt am am mmm ^mr ww^mW m a*mm mmm my * mmm^m —a» mm MVv* v 


84 


89 


4883 


AF 174482 

A m 9 ■ • ^a* mm 


Homo sapiens 

mm ^*^mu*mm ^» vmv Avaav 


polvcomb 3 

^m ^m m j ^mr mmmm am mmm mm 7 


136 


57 


4884 


Y54324 


Homo sapiens 


Amino acid sequence of a human 
gastric cancer anti&en protein. 


147 


66 


4885 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: . 
7001. 


141 


71 


4886 

TvvV 


U55376 


Caenorhabditis 
elegans 


F 1 6H 1 1 2. eene product 

* * \J m m m m mm* UVltW Wr fc- 


95 

a* 9 mar* 


100 


4887 


AF261918 

m mm mm ^m m mm amy *mm 


Homo sapiens 


disintegrin metalloproteinase with 

^a^aymtamw am ar am w ™aw ^ammwam mmmam m am m ^m w*^mww am mm ^a* Bp aw ^aw -aw^a* mama-ma mammmma ^mw mm^mmam 

thrombospondin repeats 


262 


72 


4888 


J02459 


bacteriophage 
lambda 


K (tail component; 1 99) 


610 


100 


4889 


Y95435 


Homo sapiens 


Human calcium channel SOC- 
2/CRAC-l. 


219 


89 


4890 


B01372 


Homo sapiens 


Neuron-associated protein. 

* 


134 


80 


4891 


X64712 


Gallus gallus 


collagen-alpha-3 type IX 


112 


41 


4892 


K02401 


Homo sapiens 


chorionic somatomammotropin 


556 


87 


4893 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


108 


55 


4894 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


62 


4895 


L26953 


Homo sapiens 


chromosomal protein 


93 


81 


4896 


AJ277740 


Homo sapiens 


RPBllblbeta protein 


102 


66 


4897 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


145 


87 


4898 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


131 


67 


4899 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


153 


64 


4900 


U90304 


Homo sapiens 


iroquois-class homeodomain protein 
IRX-2a 


109 


41 


4901 


D29956 


Homo sapiens 


This gene is similar to tre 
oncogene(X63547). 


120 


73 


4902 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


116 


70 


4903 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase protein. 


102 


82 


4904 


AF1 19900 


Homo sapiens 


PR02822 


157 


57 


4905 


U93565 


Homo sapiens 


putative pi 50 


146 


55 


4906 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


120 


67 


4907 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


160 


76 


4908 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


136 


55 



t 
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SFft ID 
NO: 


* AcrMtinn Na 






Smith. 

Waterman 

Score 


Identit 

y 




t 


i 


SEQIDNO:163. 






4909 


AF1 55232 


Pisum sativum 


extensin 


98 


43 


4910 


U15647 


Mus musculus 


reverse transcriptase 


196 


50 


4911 


G02994 


Homo sapiens 


-Human secreted protein, SEQ ID NO: 
7075. 


144 


56 


4912 


AF2254I9 


Homo sapiens 


HSCARG 


248 


61 


4913 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


110 


4 ? 


4914 


AL1 17382 


Homo sapiens 


dJSS 1L22.2 (novel protein) 


312 


100 


4915 


AB037814 


Homo sapiens 


KIAA1393 protein 


192 


68 


4916 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


169 


89 


4917 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


105 

- 


90 


4918 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


116 


86 


4919 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


132 


100 


4920 


AJ242540 

• 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


183 


65 


4921 


AF090944 


Homo sapiens 


PRO0663 


141 


57 


4922 


AF090931 


Homo sapiens 


PRO0483 


159 


55 


4923 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


128 


81 


4924 


U09500 


Homo sapiens 


cytochrome b 


276 


94 


4925 


AB002377 


Homo sapiens 


KIAA0379 protein 


304 


98 


4926 


M22334 


Homo sapiens 


unknown protein 


201 


62 


4927 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


65 


4928 


AF 130089 


Homo sapiens 


PRO2550 


116 


82 


4929 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


94 


41 


4930 


W75159 


Homo sapiens 


Human secreted protein encoded by 
gene 45 clone HJTDSS4. 


294 


82 


4931 


G00354 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
4435. 


96 


74 


4932 


G02460 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6541. 


101 


57 


4933 


Y02671 


Homo sapiens 

• 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


142 


70 


4934 


AF130087 


Homo sapiens 


PR02411 


137 


80 


4935 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


132 


56 


4936 


AB032604 


Mus musculus 


MIWI(piwi) 


313 


50. 


.4937 


X74987 


Homo sapiens 


2 t -5' oligoadenylate binding protein 


144 


96 


4938 


AF054658 


Homo sapiens 


kappa 1 immunoglobulin light chain 
variable region 


120 


87 


4939 


R96800 


Homo sapiens 


Human hisuocyte-secreted factor HSF. 


134 


67 


4940 


U22230 


Felis catus 


ribosomal protein SI 7 


118 


100 


4941 


AB028956 


Homo sapiens 


KIAA1033 protein 


527 


88 


4942 


G00429 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4510. 


281 

• 


86 

* 


4943 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


71 


4944 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1 037. 


108 


95 


4945 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


170 


76 


4946 


AF090895 


Homo sapiens 


PRO01 17 


125 


65 


4947 


D13866 


Homo sapiens 


alpha-catenin 


575 


100 


4948 


L01664 


Homo sapiens 


lysophospholipase 


166 


96 
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SEQm 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman " 
Score 


Identit ' 

y 


4949 


AFI1 8085 


Homo sapiens 


PRO 1975 


572 


96 


4950 


AK024455 


Homo sapiens 


FLJ00047 protein 


95 


81 


4951 


AP000616 

— 


Oryza sativa 

■ 


similar to RING-H2 finger protein 
RHAla(AF078683) 


113 


58 


4952 


AB011142 


Homo sapiens 


KIAA0570 protein 


120 


88 


4953 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


108 


39 


4954 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


175 


69 


4955 


AK024455 


Homo sapiens 


FLJ00047 protein 


119 


64 


4956 


AF1 16715 


Homo sapiens 


PR02829 


142 


75 


4957 


M31520 


Homo sapiens 


ribosomal protein S24 


158 


100 


4958 


D38113 


Pan troglodytes 


ATPase subunit 8 


98 


64 


4959 


AF 130089 


Homo sapiens 


PRO2550 


156 


75 


4960 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


136 


95 


4961 


S61950 


Oryctolagus 
cunicuius 


alpha 2(1) procollagen, alpha 2(1) {C- 
terminal} 


159 


93 


4962 


AF070540 


Homo sapiens 


putative nuclear protein 


513 


93 


4963 

• 


S61950 


Oryctolagus 
cunicuius 


alpha 2(1) procollagen, alpha 2(1) {C- 
terminal} 


159 


93 


4964 


AF146191 


Homo sapiens 


FRG1 


343 


92 


4965 


Y16624 


Homo sapiens 
* 


Human phosphoprotein 32 (pp32) 
protein sequence. 


114 


100 


4966 


X67813 


Canis familiaris 


signal recognition particle,72 kDa 
subunit 


127 


100 


4967 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


127 


63 


4968 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


117 


69 


4969 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


101 


80 


4970 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


173 


69 


4971 


D38112 


Homo sapiens 


ATPase subunit 6 


112 


91 


4972 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


209 


97 


4973 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


162 


80 


4974 


AL050399 


Arabidopsis 
thaliana 


putative proline-rich protein 


75 


91 


4975 


LI 1668 


Bos taurus 


cyclophilin-40 


134 


100 


4976 


M55169 


Homo sapiens 


tripeptidyl peptidase II 


385 


90 


4977 


AF090944 


Homo sapiens 


PRO0663 


154 


88 


4978 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


102 


62 


4979 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


128 


70 


4980 


G00721 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4802. 


104 


95 


4981 


R54656 


Homo sapiens 


Prostaglandin 12 production stimulating 
protein B. 


101 


100 


4982 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


124 


38 


4983 


AF273053 


Homo sapiens 


CTCL tumor antigen se89-l 


281 


96 


4984 


M15530 


Homo sapiens 


B-celJ growth factor 


117 


58 


4985 


D13641 


Homo sapiens 


mitochondrial outer membrane protein 
19 


534 


94 


4986 


L11316 


Mus musculus 


ect2 


528 


92 


4987 


X51755 


Homo sapiens 


open reading frame (458 AA) 


108 


90 


4988 


M61877 


Homo sapiens 


erythroid alpha spectrin 


129 


87 
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SEQ ID 


' Accession No. 


Species 

* 


Description 


Smith- 

v» uicruittO 

Score 


% 

luCRuI 

y 


4989 


X57316 


Saccharomyces 
cerevisiae 


carboxypeptidase s 


134 


50 


4990 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4419. 


151 

-• 


78 

. 


4991 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


173 


61 


4992 


M34019 


Bos taurus 


beta-adrenergic receptor kinase 


172 


100 


4993 


L26953 


Homo sapiens 


chromosomal protein 

* 


135 


86 


4994 


X97675 


Homo sapiens 


plakophilin 2b 


124 


63 


4995 


Y36112 

• 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


153 


77 


4996 


AF130089 


Homo sapiens 


PRO2550 


175 


48 


4997 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


95 


68 


4998 


Y*6214 


Homo sapiens 


Nuclear transport protein clone hfb341 
protein sequence. 

mm ^ W ^^m m mm- mm m^ ^» mw m 


312 


98 


4999 


G036S1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7762. 


100 


95 


5000 


AF152961 

I 


Homo sapiens 


chromatin-specific transcriDtion 
elongation factor FACT 140 kDa 
subunit j 


127 


96 


5001 


U70063 


Homo sapiens 


acid ceramidase 


315 


96 


5002 

mm \m \fA^ 


AF041373 

.1 * B> V • A mm / if 


Rattus 
norvegicus 


clathrin assembly orotein short form 

Vimi mm AAA Uyw¥UlVI J Mr w*+AAA. m AmmA AAA 


163 


63 


5003 


AK001841 

A A A.w±*f A ■ X. 


Homo saoiens 


unnamed Drotein Droduct ' 


468 


83 


5004 


G00333 


Homo saoiens 


Human secreted protein. SEQ ID NO: 
4414. 


143 


61 


5005 


S79410 


Miis musculus 


nuclear localization signal binding 
protein 


116 


47 


5006 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. . 


107 


55 


5007 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4419. 


151 


76 


5008 


Z15005 


Homo sapiens 


CENP-E 


281 


84 


5009 

mm 


AF130089 

« JbA A mm W m 


Homo saDiens 


PRO2550 

A\ JL m±^^ mmmm U V 


122 


75 


5010 


AL049698 

• 


Homo sapiens 

• 


dJ470B24.1.1 (myeloid/lymphoid or 
mixed- lineage leukemia Ctrithorax 
(Drosophila) homolog); translocated to, 
4 (AF-6) (isoform 1)) 


77 


33 


5011 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


90 


62 


5012 


AL023780 


Schizosaccharom 
ycespombe 


DNA binding protein 


145 


36 


5013 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


93 


46 


5014 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


104 


43 


5015 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


55 


5016 

r 


U15647 

- 


Mus musculus 

• 


expressed in select embryonal 
carcinoma cells and testicular cells; 
similar to Swiss-Prot Accession 
Number PI 1260 


119 


35 


5017 


B10545 


Homo sapiens 


Human aspartate protease psn-like4 
protein. 


649 


98 


5018 


AF089745 


Homo sapiens 


FK506-binding protein 


656 


98 


5019 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


634 


97 


5020 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


603 


92 


5021 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


623 


94 


5022 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


671 


100 
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SEQ ID 


Accession (No. 


Species 


Description 


o nil ut- 


OA 


NO: 




Waterman 


Identit 










Score 


y 


5023 


AF 130089 


Homo sapiens 


PRO2550 


109 


71 


5024 


AB013452 


Homo sapiens 


ATPasell 


208 


77 


5025 


R27654 


Homo sapiens 


Human calcium channel 27980/16. 


183 


100 


5026 


Y36156 


Homo sapiens 


Human secreted protein #28. 


'114 


64 


5027 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


74 








7795. 






5028 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID MO: 


149 


59 








4658. 


• 




5029 


S74728 


Homo sapiens 


antiquitin=26g turgor protein homolog 


652 


98 


5030 


L41498 


Homo sapiens 


longation factor 1 -alpha 1 


592 


87 


5031 


AB040672 


Homo sapiens 


UDP-GaENAc: polypeptide N- 


454 


70 








acetylgalactosarainyltransferase 






5032 


AJ223948 


Homo sapiens 


RNA helicase 


574 


84 


5033 


AF021935 


Rattus 


mytonic dystrophy kinase-related 


670 


97 






norvegicus 


Cdc42-binding kinase 






5034 


AF1 50755 


Mus musculus 


microtubule-actin crosslinking factor 


526 


80 


5035 


AF 150755 


Mus musculus 


microtubule-actin crosslinking factor 


602 


89 


5036 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


598 


89 


5037 


U09823 


Oryctolagus 


elongation factor 1 alpha 

**** M 


708 


96 






cuniculus 




• 




5038 


AE003621 


Drosophila 


CG13384 gene product 


279 


45 






melanogaster 






■ 


5039 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


68 








4460. 






5040 


AF047469 


Homo sapiens 


arsenite translocating ATPase 


650 


94 


5041 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


41 




* 




7214. 






5042 


AJ007798 


Homo sapiens 


stromal antigen 3, (STAG3) 


288 


100 


5043 


L26953 


Homo sapiens 


chromosomal protein 


124 


67 


5044 


AB005047 


Homo sapiens 


SH3 binding protein 


105 


88 


5045 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


78 








7124. 






5046 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


164 


83 








7519. 


• 




5047 


W48351 


Homo sapiens 


Human breast cancer related protein 


121 


63 








BCRB2. 






5048 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


64 








7874. 






5049 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


120 


59 








clone HCACJ81. 






5050 


Z25535 


Homo sapiens 


nuclear pore complex protein hnupl53 


115 


88 


5051 


J03176 


Bradyrhizobium 


cytochrome b/cl precursor 


97 


35 






japonicum 








5052 


X85995 


Homo sapiens 


immunoglobulin light chain 


200 


100 


5053 


G02994 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 


164 


76 




■ 




7075. 






5054 


B01372 


Homo sapiens 


Neuron-associated protein. 


109 


80 


5055 


AJ242540 


Volvox carteri f. 


hydroxyproline-rich glycoprotein DZ- 


209 


54 






nagariensis 


HRGP 






5056 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


51 








7452. 






5057 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


39 








7871. 






5058 


X92474 


Homo sapiens 


ch-TOG 


147 


100 


5059 


L27428 


Homo sapiens 


reverse transcriptase 


264 


75 


5060 


S79410 


Mus musculus 


nuclear localization signal binding 


119 


44 








protein 






5061 


L25616 


Homo sapiens 


CGI protein-' 


119 


77 


5062 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


62 
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1 
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* 



SJfiVl U) 

NO: 


Accession No. 


£> pec res 


Description 


omitD- 
Wa term an 
Score 


Identit 

y 








7871. 






5063 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


164 


59 


5064 


AF 11 8082 


Homo sapiens 


PRO 1902 


175 


53 


5065 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


84 


51 


5066 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


108 


100 


5067 


AF155232 


Pisum sativum 


extensin 


137 


45 


5068 


X63005 


Mus musculus 


proline-rich protein 


98 


35 


5069 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


307 


56 


5070 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


173 


89 


5071 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


153 


68 


5072 


Y86248 

- 


Homo sapiens 

* 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


164 


73 

■ 


5073 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


109 


73 

♦ 
♦ 


5074 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


121 


80 


5075 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


113 


72 


5076 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


151 


88 


5077 


AF090944 


Homo sapiens 


PRO0663 


138 


78 


5078 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


150 


74 


5079 


X97675 


Homo sapiens 


plakophilin 2b 


179 


93 


5080 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


123 


76 


5081 


AF240630 

• 


Mus musculus 


IQ motif containing GTPase activating 
protein 1 


176 


59 


5082 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


100 


54 


5083 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


99 


59 


5084 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


101 


57 


5085 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 1 19.. 


118 


77 


5086 


AB0476O0 


Macaca 
fascicularis 


hypothetical protein 


93 


69 


5087 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


88 


80 


5088 


AF000298 


CaenorhabdirJs 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


112 


57 


5089 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
'4484. 


87 


61 


5090 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


74 


61 


5091 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121 . 


83 


78 


5092 


AF065484 


Homo sapiens 


sorting nexin 1 A 


172 


81 


5093 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


139 


90 


5094 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


78 


5095 


U02313 


Mus musculus 


protein kinase 


147 


84 



264 



SEQ 1U 
NO; 

• 


Accession wo. 


species 


B t riiir fr tr~u\ ■ mm} 2 am 

uescnpuon 


OHJIII*- 

Waterman 
Score 


-yr 

Idea tit 

y 


5096 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJM 
oncoprotein. 


129 


67 


5097 

* 


U64849 


Caenorhabditis 
elegans 


Contains similarity to Pfam domain: 
PF00646 (F-box), Score=28.7, E- 
value=4.3e-05, N=l 


197 


39 


5098 


AF006010 


Homo sapiens 


progestin induced protein 


174 


86 


5099 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


119 


55 


5100 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


132 


82 


5101 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


159 


77 


5102 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


99 


70 


5103 


J02459 


bacteriophage 
lambda 


J (tailihost specificity; 1 1 32) 


485 


96 


5104 


J02459 


bacteriophage 
lambda 


H (tail component;853) 


823 


98 


5105 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


106 


42 


5106 


AF 130089 


Homo sapiens 


PRO2550 


121 


35 


5107 

• 


G04078 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
8159. 


135 


81 


5108 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


123 


88 


5109 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


83 


78 


5110 


U41751 


Mus musculus 


EI24 


560 


83 


5111 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


69 


5112 


AF130089 


Homo sapiens 


PRO2550 


164 


88 


5113 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7165. 


160 


72 


5114 


AF052831 


Trypanosoma 
cmzi 


unknown 


107 


70 


5115 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna; 
complete virion genome. 


128 


68 


5116 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


60 


29 


5117 


W804O6 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


64 


5118 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


115 


60 


5119 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


112 


79 

» 


5120 


X65718 


Prunus dulcis 


extensin 


94 


33 


5121 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


5122 


AJ007714 


Homo sapiens 


lysine-ketoglutarate reductase 
/saccharopine dehydrogenase 


154 


96 


5123 


AJ007714 


Homo sapiens 


lysine-ketoglutarate reductase 
/saccharopine dehydrogenase 


158 


80 


5124 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


109 


44 


5125 


AF 130089 


Homo sapiens 


PRO2550 


112 


46 


5126 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


129 


60 


5127 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


99 


68 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 








ww ucermau 


1QCDUI 






** * 




Score 


y 


5128 


G00403 • 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 

• 


57 








4484. 






5129 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


140 


55 




• 




clone HPMBQ32. 




■«*--" 


5130 


J02459 


bacteriophage 


H (tail component;853) 


378 


100 






lambda 




* ■ 




5131 

> 


G00328 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 


123 


78 








4409. 




• 


5132 


J04088 


Homo sapiens 


DNA topoisomerase II (EC 5.99.13) 


163 


96 


5133 


AL132841 


Caenorhabditis 


Y15E3A.3 


147 


85 






elegans 








5134 


D90827 


Escherichia coli 


Serine/Threonine protein phosphatase 


329 


46 








(EC 3.1.3.16). 






5135 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


62 








7871. 






5136 


G00407 


Homo sapiens 


Human secreted protein, SEO ID NO: 


161 


91 








4488. 






5137 


G04078 


Homo sapiens 


Human secreted protein. SEO ID NO: 


136 


96 








8159. 






5138 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


67 




. " * 




7124. 






5139 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


31 








7878. 






5140 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


46 








7253. 






5141 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


166 


67 






gene 22 clone HMSJW1 8. 


►» 




5142 


AF003535 


Homo sapiens 


ORF2-like protein 


168 


50 


5143 


AF259792 


Homo sapiens 


SWHLbeta protein 


684 


98 


5144 


Y36203 


Homo sapiens 


Human secreted protein #75. 


130 


63 


5145 


AF130089 


Homo sapiens 


PRO2550 


151 


76 


5146 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


128 


52 








RHAla(AF078683) 






5147 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


85 








7253. 






5148 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


101 


67 






brucei 


pn»teinri,ala. 






5149 


W48351 


Homo sapiens . 

* 


Human breast cancer related protein 


118 


56 








BCRB2. 






5150 


AF130051 . 


Homo sapiens 


PRO0898 


80 


72 


5151 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 


165 


63 








(hhcm) gene preoduct 






5152 


X56832 


Homo sapiens 


muscle specific enolase 


573 


90 


5153 


X76930 


Homo sapiens 


hepatocyte nuclear factor 4 

X •* 


544 


91 


5154 


U12919 


Mus musculus 


adenylyl cyclase type VII 


156 


75 


5155 


G03172 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 

XT * ^ 


91 


76 








7253. 






5156 


G04072 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 


133 


50 








8153. 






5157 


Y08387 


Homo sapiens 


mu-adaptin-related protein 2 


252 


86 


5158 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


126 


66 








oncoprotein. 






5159 


AF090942 


Homo sapiens 


PRO0657 


139 


77 


5160 


AF090942 


Homo sapiens 


PRO0657 


120 


70 


5161 


D67066 


Bos taurus 


N-WASP 


135 


42 


5162 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


73 . 








7124. 






5163 


AF044496 


Canis familiaris 


acidic ribosomal phosphoprotein P0 


112 


78 


5164 


W48351 


Homo sapiens 


Human breast cancer related protein 


106 


58 



266 



SEQ ID 
NO* 


Accession No. 


Species 

• 


Description 


Smith- 
Waterman 
Score 


Identit 

y 




i 




BCRB2. 






5165 


AJ250095 


Bordetella 
bronchiseptica 


pertactin (P.68) 


119 


64 


5166 


U96411 


Mus musculus 


otogelin; MLEMP — 


609 


90 


5167 


AF 11 6238 


Homo sapiens 


pseudouridine synthase I 


237 


87 


5168 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


151 


78 


5169 


AL050342 


Homo sapiens 


dJ655K7. 1 (novel protein) 


263 


100 


5170 


AK024455 


Homo sapiens 


FLJ00047 protein 


177 


72 


5171 


Y86248 


Homo sapiens 


Human secreted protein UCHPF6S, 
SEQ ID NO: 163. 


94 


100 


5172 


L38908 


Nicotiana 
tabacum 


extensin 


101 


47 


5173 


AF055904 


Myxococcus 
xanthus 


unknown 


107 


36 


5174 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. , 


143 


59 


5175 


Y00337 


Homo sapiens 


Human secreted protein encoded by 
gene 81. 


123 


86 


5176 


A JO 10957 


Hippopotamus 
amphibius 


NADH1 


212 


90 


5177 


G03438 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


84 


76 


5178 


U83280 


Leishmania 
donovani 


39 kDa antigen 


128 


90 


5179 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein Kl 0 (NID:g8148) 


183 


45 


5180 


L27428 


Homo sapiens 


reverse transcriptase 


106 


46 


5181 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


144 


45 


5182 


R83119 


Homo sapiens 

* 


Human cisplatin resistance protein. 


142 


96 


5183 


R95330 


Homo sapiens 
* 


Tumor necrosis factor receptor 1 death 
domain ligand (clone ITU). 


475 


71 


5184 


AF090930 


Homo sapiens 


PRO0478 


121 


57 


5185 


AF193613 


Homo sapiens 


cell recognition molecule Caspr2 


338 


50 


5186 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


102 


36 


5187 


AL031118 


Homo sapiens 


dJ153G14.3 (novel C2H2 type Zinc 
Finger protein) 


465 


62 


5188 


AF183144 


Mus musculus 


left-right dynein 


199 


83 


5189 


J02459 


bacteriophage 
lambda 


J (taihhost specificity;l 132) 


748 


95 


5190 


AF193613 


Homo sapiens 


cell recognition molecule Caspr2 


229 


50 


5191 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


135 


65 


5192 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


84 


5193 


Y00311 


Homo sapiens 


Human secreted protein encoded by 
gene 54. 


112 


49 


5194 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


126 


56 


5195 


AF104261 


Mus musculus 


Pax transcription activation domain 
interacting protein PTIP 


487 


72 


5196 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


132 


65 


5197 


G03043 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7124. 


130 


80 


5198 


X55777 


Homo sapiens 


52kD protein 


106 


30 


5199 


AF092170 


Homo sapiens 


polycystic kidney disease 2 related 


122 


91 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 




Wfltprmfln 

if iiici ujhu 


Identit 








Score 


y 








protein 






5200 


AF092170 


Homo sapiens 
* 


polycystic kidney disease 2 related 


138 


96 








protein 






5201 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


67 








7879. 


* 




5202 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 


, 1 10 


81 








RHAla(AF078683) 


* 


• 


5203 


R95913 


Homo sapiens 


Neural thread protein. 


96 


61 


5204 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


55 








6588. 






5205 


AF3 19476 


Homo sapiens 


GKAP42 


705 


100 


5206 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


62 








6913. 






5207 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


75 








4446. 




• 


5208 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


50 








7871. 






5209 


G00689 


Homo saoiens 


Human secreted protein, SEQ ID NO: 

m mm mm ■ ■ M mmmYm V mw mrmm mr^m mw m mm m ^ m mm m m ^^r_ ^ ' 


139 


39 






i 


4770. 






5210 


AF09093 1 


Homo saoiens 


PRO0483 


94 


90 


5211 


G00357 


Homo saoiens 


Human secreted protein. SEO ID NO: 


114 


74 








4438. 






5212 


GO 1246 


Homo saoiens 


Human secreted protein. SEO ID NO: 


116 

* 


70 








5327. 






5213 


U53209 


Homo sapiens 


trans fbrmer-2 alpha 


124 


95 


5214 


G02485 


Homo saoiens 


Human secreted protein. SEO ID NO: 


130 


58 








6566. 


t> 




5215 


Y02671 


Homo saoiens 


Human secreted protein encoded by 


151 


49 








gene 22 clone HMSJW1 8. 






5216 


L26953 


Homo saoiens 


chromosomal protein 


95 


52 


5217 


G02532 


Homo saoiens 


Human secreted protein, SEQ ID NO: 


149 


66 








6613. 






5218 


AF220264 


Homo saoiens 


MOST-1 


151 


65 


5219 


G03133 


Homo saoiens 


Human secreted protein, SEQ ID NO: 


125 


53 








7214. 






5220 


G00637 

VJ Www f 


Homo saoiens 


Human secreted protein. SEQ ID NO: 


138 


65 








4718. 






5221 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


66 








7888. 






5222 


AF194537 


Homo sapiens 


NAG 13 


120 


60 


5223 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


60 








4457. 






5224 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


144 


58 








encoded by gene 17. % 






5225 


AF000996 


Homo sapiens 


ubiquitous TPR motif, Y isoform 


164 


70 


5226 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


73 








4413. 






5227 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


164 


69 








7519. 






5228 


M34451 


Caenorhabditis 


collagen (roI-6) 


100 


41 






elegans 








5229 


X16396 


Homo sapiens 


precursor polypeptide (AA -29 to 3 15) 


306 


88 


5230 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


112 


69 






brucei 


complete virion genome. 






5231 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


75 








7879. 






5232 


D86982 


Homo sapiens 


similar to human ankyrin 1(S08275) 


201 


100 


5233 


AF1 16712 


Homo sapiens 


PR02738 


135 


49 


5234 


AL049758 


Homo sapiens 


dJ437M2 1 . 1 (novel protein) 


184 


70 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Water man 
Score 


% 

Identit 

y 


5235 


AB014527 


Homo sapiens 


KIAA0627 protein 


164 


100 


5236 


AF1 94537 


Homo sapiens 


NAG 13 


183 


64 


5237 


R07057 

• 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct 


170 


43 


5238 


AF1 35439 


Mus musculus 


form in binding protein 1 1 


230 


100 


5239 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


137 


69 


5240 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


168 


63 


5241 


AF1 16661 


Homo sapiens 


PR01438 


152 


58 


5242 


L27428 


Homo sapiens 


reverse transcriptase 


193 


53 


5243 


AF130089 


Homo sapiens 


PRO2550 


141 


60 


5244 


AF1 18086 


Homo sapiens 


PR01992 


135 


71 


5245 


AJ228863 


Mus musculus 


CLIP-115 


195 


97 


5246 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


81 


5247 


AC007887 


Arabidopsis 
thaliana 


F1504.29 


76 


93 


5248 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


151 


75 


5249 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


176 


66 


5250 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


136 


70 


5251. 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


162 

» 


80 


5252 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


112 


54 


5253 


AF090944 


Homo sapiens 


PRO0663 


177 


62 


5254 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


132 


57 


5255 


Y02749 

i 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


129 


74 


5256 


D38112 


Homo sapiens 


NADH dehydrogenase summit 2 


157 


91 


5257 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


160 


78 


5258 


U16296 


Homo sapiens 


TIAMt protein 


133 


100 


5259 


AL160371 


Leisomania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


92 

• 


93 


5260 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


324 


78 


5261 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


50 


5262 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


217 


100 


5263 


AB047600 


Macaca 
fascicular** 


hypothetical protein 


125 


57 


5264 


U83246 


Homo sapiens 


copine I 


199 


86 


5265 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


150 


72 


5266 


U91329 


Homo sapiens 


kines in-like motor protein KIF1C 


132 


93 


5267 


J00672 


Oryctolagus 
cuniculus 


beta-myosin 


109 


100 


5268 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


74 


43 


5269 


AF1 19851 


Homo sapiens 


PRO 1722 


96 


61 


5270 


AF090930 


Homo sapiens 


PRO0478 


113 


91 


5271 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


123 


62 


5272 


B01372 


Homo sapiens 


Neuron-associated protein. 


122 


60 
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SEQ ID 
NO: 


'Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Identit 

y 


5273 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


116 


78 


5274 


AF090931 


Homo sapiens 


PRO0483 


144 


67 


5275 


AL 1 60493 


Leishmania 


probable (hhv-6) u 1 1 02, variant a 


117' 


68 






major 


DNA, complete virion genome 






5276 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


111 


58 


5277 


G03714 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


147 


67 

■ 


5278 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


67 


5279 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


69 


5280 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


196 


45 


5281 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


115 


73 


5282 


A18812 


Brassica napus 


extensin 


120 


38 


5283 ■ 


Z68747 


Homo sapiens 


imogen38 


291 


75 


5284 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


97 


37 


5285 


AF130089 


Homo sapiens 


PRO2550 


181 


73 


5286 


AF1 30089 


Homo sapiens 


PRO2550 


147 


67 


5287 


AF090942 


Homo sapiens 


PRO0657 


90 


50 


5288 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


116 


74 


5289 

• 


G03714 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7795. 


143 


64 


5290 


D86972 


Homo sapiens 


similar to E.coli hypothetical 29.6 KD 
protein(P 1 : YIG W_ECOLI) 


111 


100 


5291 


W48351 


Homo sapiens 


Human breast cancer related protein . 
BCRB2. 


145 


57 


5292 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


131 


65 


5293 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


213 


76 


5294 


AF090895 


Homo sapiens 


PRO0117 


134 


52 


5295 


AF017635 


Homo sapiens 


DCHT 


145 


100 


5296 


G02558 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
6639. 


136 


60 


5297 


AF000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


118 


67 


5298 


AK024455 . 


Homo sapiens 


FLJ00047 protein 


67 


80 


5299 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


164 


75 


5300 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


129 


55 


5301 


R14643 


Homo sapiens 


Gamma interferon receptor. 


107 - 


82 


5302 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) * 


238 


90 


5303 


Y08062 


Homo sapiens 

* 


Human PR0245 protein fragment 
derived from DNA35638. 


205 


68 


5304 


AF 130089 


Homo sapiens 


PRO2550 


100 


77 


5305 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


117 


44 


5306 


X92485 


Plasmodium 
vivax 


pval 


128 


61 


5307 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


77 


57 


5308 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


140 


42 


5309 


Y91577 


Homo sapiens 


Human secreted protein sequence 


489 


78 



270 



qfh in 
NO: 




OpCCIC9 


uescnpuun 

• 


Waterman • 
Score 


Identit 

y 








encoded by gene 2 SEQ ID NO:250. 






5310 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


65 


5311 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


129 

■ 


65 


5312 


AF1 19851 


Homo sapiens 


PR01722 


97 


65 


5313 . 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


160 


90 


5314 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


113 


81 


5315 


AF071310 


Mus musculus 


OPA-containing protein 1 


228 


53 


5316 


AF090942 


Homo sapiens 


PRO0657 


162 


70 


5317 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


109 


83 


5318 


AF090894 


Homo sapiens 


PRO0113 


125 


63 


5319 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


129 


66 


5320 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


75. 


5321 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. , 


138 


81 


5322 


L27428 


Homo sapiens 


reverse transcriptase 


139 


52 


5323 . 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


109 

• 


83 


5324 


AF1 18082 


Homo sapiens 


PRO1902 


107 


70 


5325 


M81321 


Macaca 
fascicularis 


proline-rich protein 


114 


37 


5326 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


U2 


79 


5327 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


99 


72 


5328 


AF090930 


Homo sapiens 


PRO0478 


152 


73 


5329 


L27428 


Homo sapiens 


reverse transcriptase 


267 


65 


5330 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


140 


82 


5331 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


164 


54 


5332 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


135 


52 


5333 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


124 


76 


5334 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


97 


56 


5335 


AP000060 


Aeropyrum 
pernix 


103aa iong hypothetical protein 


71 


50 


5336 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


75 


5337 


Z93784 


Homo sapiens 


dJ398C22. 1 (novel protein, ortholog of. 
mouse brain protein E46) 


280 


68 


5338 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


132 


64 


5339 


AF210651 


Homo sapiens 


NAG 18 


160 


56 


5340 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


133 


80 


5341 


G03172 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7253. 


151 


75 


5342 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


105 


50 


5343 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


103 


75 


5344 


G03021 


1 Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


65 
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SEOID 
NO: 


^ Accession No. 


Species 


Description 


Smitfa- 

Waterman 

Score 


IdentSt 

y 




1 




7102. 






5345 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 142. 


92 


48 


5346 


AL121891 


Homo sapiens 


dJ 1 1 87M1 72 (KI AA0552 protein) 


148 


43 


5347 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


130 


33 


5348 


AF090944 


Homo sapiens 


PRO0663 


205 


72 


5349 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


135 


57 


5350 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-b2 finger 
protein rha la. 


101 


37 


5351 


AF265578 


Homo sapiens 


orphan neurotransmitter transporter 
NTT5 


121 


39 


5352 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


105 


75 


5353 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


167 


& 


5354 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


97 


60 


5355 


G00590 

t 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4671. 


113 


80 


5356 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


123 


67 


5357 


W88609 


Homo sapiens 


Secreted protein encoded by gene 76 
clone HKIXL73. 


94 


79 


5358 


AC004450 


Arabidopsis 
thaliana 


putative extensin 


110 


37 


5359 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


79 


5360 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


199 


74 


5361 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


174 


97 


5362 


AB017437 


Galius gallus 


avena 


98 


44 


5363 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


155 


93 


5364 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


181 


70 


5365 


G02538 . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


159 


68 

■ 


5366 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


94 


60 


5367 


X52022 


Homo sapiens 


collagen type VI, alpha 3 chaui 


150 


33 


5368 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


120 


77 


5369 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


145 


75 


5370 


X97675 


Homo sapiens 


plakophilin 2b 


170 


85 


5371 


AF 130079 


Homo sapiens 


PR02852 


103 


95 


5372 


L26953 


Homo sapiens 


chromosomal protein 


117 


54 


5373 


AF210651 


Homo sapiens 


NAG 1 8 


137 


67 


5374 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 
human tumor suppressor gene ZAP-1. 


106 


76 


5375 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


109 


70 


5376 


AL109819 


Arabidopsis 
thaliana 


extens in-like protein 


123 


34 


5377 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


127 

* 


79 


5378 


AF036170 


Dictyostelium 


homeobox-containing protein Wariai 


93 


83 
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NO: 






Waterman 
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y 




' « 


discoideum 


♦ 






5379 


AF130089 


Homo sapiens 


PRO2550 


134 


75 


5380 


AF161384 


Homo sapiens 


HSPC266 


663 


96 


5381 ~ 


AF130051 


Homo sapiens 


PRO0898 


147 


77 


5382 


AF090944 


Homo sapiens 


PRO0663 


118 


83 


5383 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


55 . 


• 


> 




7214. , 






5384 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


110 


68 








gene 22 clone HMSJW18. 






5385 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


165 


53 








SEQ ID NO: 163. 






5386 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


145 


96 


5387 


U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


108 


51 


5388 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


89 








5909. 






5389 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


125 


70 








SEQIDNO:163. 






5390 


G01828 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 


103 


72 








5909. 


- 




5391 


AL359782 

y 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


119 


85 






brucet 


complete virion genome. 






5392 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


65 








4535. 






5393 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


78 








7253. 






5394 


G01552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


90 








5633. 






5395 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


65 




■ 




7124. 






5396 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


85 


59 








encoded by gene 17. 






5397 


X86681 


Homo sapiens 


HNP36 protein 


248 


71 


5398 


L27428 


Homo sapiens 


reverse transcriptase 


133 


62 


5399. 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


132 


54 




• 




oncoprotein. 






5400 


AL080196 


Homo sapiens 


hypothetical protein 


164 


96 


5401 | 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


119 


75 


5402 


AF090944 


Homo sapiens 


PRO0663 


99 


64 


5403 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


58 








4433. 






5404 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 


103 


37 








homolog 






5405 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 


137 


80 




• 




product CL6.5.40. 






5406 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


69 








4484. 






5407 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


69 








4484. 


■ 




5408 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


6? 








4409. 




■ 


5409 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


63 








7519. 






5410 


Y12950 


Homo sapiens 


Amino acid sequence of a human 


65 


47 








secreted peptide. 






5411 


AF181626 


Drosophila 


BcDNA.GH03694 


171 


53 






melanogaster 








5412 


U93565 


Homo sapiens 


putative pi 50 


195 


70 


5413 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


64 








7253. 
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SEQ ID 
NO: 


'Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Identit 

y 


5414 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


126 

■ 


61 


5415 


X83572 


Homo sapiens 


ARSD 


133 


100 


5416 


D86853 


Catharanthus 
roseus 


extensin 


1 13 


41— 


5417 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


139 


60 


5418 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7553. 


175 


77 


5419 


R32010 


Homo sapiens 


Rpl5-TIA-1. 


140 


83 


5420 


D67066 


Bos taunis 


N-WASP 


153 

* 


41 


5421 


U42391 


Homo sapiens 


myosin-IXb 


203 


100 


5422 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


116 


74 


5423 


AF220264 


Homo sapiens 


MOST-1 


94 


78 


5424 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


128 


63 


5425 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


131 


57 


5426 


AF130089 


Homo sapiens 


PRO2550 


120 


58 


5427 


270204 , 

i 


Caenorhabditis 
elegans 

> 

i 


contains similarity to Pfam domain: 
PF00628 (PHT>-fmger), Score=63.1, Br 
value=1.9e-15, N=l-cDNA EST 
EMBL:T00556 comes fixmi this 
gene-cDNA EST yk344f7.5 comes 
from this gene 


131 


55 

• 


5428 


G03172 


Homo sapiens 
* 

« 


Human secreted protein, SEQ ID NO: 
7253. 


158 


51 


5429 


AF1 30089 


Homo sapiens 


PRO2550 


112 


73 


5430 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


161 


52 


5431 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


131 


72 


5432 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


152 


79 


5433 


AF090942 


Homo sapiens 


PRO0657 


109 


67 


5434 


AE003588 


Drosophiia 
melanogaster 


CG2839 gene product 


141 


50 


5435 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


154 


79 


5436 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


128 


41 


5437 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


143 


63 


5438 


G03172 


Homo sapiens ; 


Human secreted protein, SEQ ID NO: 
7253. 


117 


84 


5439 


G03793 


Homo sapiens | 


Human secreted protein, SEQ ID NO: 
7874. 


112 


76 


5440 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAIa(AF078683) 


110 


42 


5441 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NlP 


124 


64 


5442 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


193 


56 


5443 


X07495 


Homo sapiens 


translated region (AA 1-264) 


108 


41 


5444 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


135 


62 


5445 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


130 


41 


5446 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


136 


71 
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* 


Species 

* 
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Smith- 
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I % 
IdenHt 
y 


5447 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


123 


62 


5448 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


101 


45 


5449 


U 13 866 


unidentified 
cloning vector 


non-functional lacZ alpha peptide 

• * * 


106 


57 


5450 


X01469 


Plasmodium 
Jophurae 


histidine-rich protein 


162 

- 


56 


5451 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


130 


88 


5452 


Y 14482 


Homo sapiens 

m 

* 


Fragment of human secreted protein 
encoded by gene 17. 


155 


68 


5453 


AF1 19900 


Homo sapiens 


PR02822 


162 


63 


5454 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


120 


51 


5455 


AC007204 


Homo sapiens 


BC273239 1 


1485 


69 


5456 


L27428 


Homo sapiens 


reverse transcriptase 


122 


41 


5457 


Ml 8094 

i 


Phaseolus 
vulgaris 


hydroxypro line-rich glycoprotein 


141 


39 


5458 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


118 


68 


5459 


AF090930 


Homo sapiens 


PRO0478 


144 


64 


5460 


AF130089 


Homo sapiens 


PRO2550 


124 


60 


5461 


G00377 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4458. 


135 


69 


5462 


X97675 


Homo sapiens 


plakophilin 2b 


103 


72 


5463 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A\ -9 

7253. 


137 


69 

■ 


5464 


AF229067 


Homo sapiens 


PADI-H protein 


146 


65 


5465 


AF116715 


Homo sapiens 


PR02829 


83 


71 


5466 


AF090930 


Homo sapiens 


PRO0478 


146 


52 


5467 


U93574 


Homo sapiens 


putative p i 50 


203 


52 


5468 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


89 


50 


5469 


AF191838 


Homo sapiens 


TANK binding kinase TBK1 


121 


95 


5470 


AF 130079 


Homo sapiens 


PR02852 


146 


69 


5471 


AF090944 


Homo sapiens 


PRO0663 


90 


66 


5472 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


106 


63 


5473 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


127 


60 


5474 


AF064553 


Mus musculus . 


NSD1 protein 


269 


100 


5475 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


132 


65 


5476 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


130 


86 


5477 


U11271 


Homo sapiens 


thromboxane A2 receptor 


118 


67 


5478 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


143 


60 


5479 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


113 


75 


5480 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


119 


74 


5481 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


103 


85 


5482 


AF238235 


Entamoeba 
histolytica 


diaphanous protein 


108 


35 


5483 


AF090930 


Homo sapiens 


PRO0478 


120 


55 


5484 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


111 


84 
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Score 


% " 

Identit 

y 


5485 


M61883 


Sus scrofa 


apomucin 


252 


55 


5486 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


107 


77 


5487 


G03043 — 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


110 


46 


5488 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


88 


62 


5489 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


124 

* 


47 


5490 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


220 


71 


5491 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0:163. 


195 


65 


5492 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


116 


38 


5493 


AF1 18086 


Homo sapiens 


PR01992 


135 


82 


5494 


AF090930 


Homo sapiens 


PRO0478 


105 


74 


5495 


Y94451 


Homo sapiens 

i 


Human inflammation associated 
protein #8. 


170 


61 


5496 


G03465 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7546. 


72 


65 


5497 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


152 


78 


5498 


X56123 


Mus musculus 


talin 


594 


88 


5499 


AL035652 


Homo sapiens 


dJU6.l (topoisomerase (DNA) I) 


105 


86 


5500 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 

»t 


81 


5501 


Y45389 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 31. 


161 


93 


5502 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


119 


60 


5503 


X03145 


Homo sapiens 


pot. ORF I 


215 


79 


5504 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 
protein 1 


180 


61 

■ 


5505 


AL390212 


Homo sapiens 


dJ29M10B.1.2 (protein kinase C 
binding protein 1 (isoform 2)) 


1282 


93 


5506 


AF118086 


Homo sapiens 


PRO 1992 


146 


61 


5507 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


116 


74 


5508 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


165 


50 


5509 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


211 


95 


5510 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


114 


69 


5511 


AF090944 


Homo sapiens 


PRO0663 


125 


58 


5512 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


125 


53 


5513 


Y36156 


Homo sapiens 


Human secreted protein #28. 


110 


68 


5514 


AF 118086 


Homo sapiens 


PR01992 


128 


76 


5515 


222866 


Mus musculus 


skelemin 


218 


95 


5516 


222866 


Mus musculus 


skelemin 


218 


95 


5517 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


127 


71 


5518 


AL1 33380 


Homo sapiens 


dJ862P83 (Similar to MAP3K10 
(mitogen-activated protein kinase 
kinase kinase 10)) 


603 


77 


5519 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


187 


78 


5520 


AF090901 


Homo sapiens 


PRO0195 


115 


60 
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SEQ ID 

NO: 


'Accession No. 


Species 

* 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


5521 

■ 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


180 


S3 


5522 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


i55 


73 


5523 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


149 


100 


5524 


X97675 


Homo sapiens 


plakophitin 2b 


153 


70 


5525 


D37793 


Mus museums 


synaptotagminII/IP4BP 


651 


84 


5526 


M32319 


Homo sapiens 


HLA protein allele B5 1 


754 


94 


5527 


AF130051 


Homo sapiens 


PRO0898 


93 


64 


5528 


X92485 


Plasmodium 
vivax 


pval 


88 


66 


5529 


AJ131890 


Homo sapiens 


DNA polymerase lambda 


205 


100 


5530 


AF265575 


Homo sapiens 

- 


ubiquitous TPR-motif protein Y 
isoform 


132 

• 


68 


5531 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


87 


59 


5532 


AF090931 


Homo sapiens 


PRO0483 


133 


75 


5533 


AF090942 


Homo sapiens 


PRO0657 


122 


66 


5534 


R13556 


Homo sapiens 
* 


Protein encoded downstream of hhc_M 
oncoprotein. 


145 


56 


5535 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


569 


91 


5536 


K02401 


Homo sapiens 


chorionic somatomammotropin 


375 


98 


5537 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


136 


68 


5538 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


130 


44 


5539 


S62928 


Homo sapiens 


PRB1M protein precursor 

* - A 


104 


40 


5540 


G03258 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7339. 


127 


70 


5541 


AE000463 


Escherichia coli 
K12 


putative glycosidase 


254 


100 


5542 


J02459 


bacteriophage 
lambda 


H (tail component;853) 


761 

• 


100 


5543 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


119 


58 


5544 


U59185 


Homo sapiens 


putative monocarboxylate transporter 


233 


87 


5545 


G02832 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6913. 


187 


65 


5546 


D87446 


Homo sapiens 


Similar to a C.elegans protein encoded 
in cosmid C27F2 (U4041 9) 


556 

« 


84 


5547 


X92485 


Plasmodium 
vivax 


pval 


110 




5548 


AB023233 


Homo sapiens 


KIAA1016 protein 


246 


97 


5549 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


95 


64 


5550 


AC002044 


Homo sapiens 


Alpha-fetoprotein enhancer binding 
protein (3 f partial) 


111 


38 


5551 


AF026802 


Homo sapiens 


alpha-3 type DC collagen 


1 17 


47 


5552 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


85 


88 


5553 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


91 


88 


5554 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


104 


71 


5555 


AF 130089 


Homo sapiens 


PRO2550 


172 


86 


5556 


G02902 


Homosapiens 


Human secreted protein, SEQ ID NO: 
6983. 


105 


72 


5557 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


160 


61 
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Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


K 

Identit 

y 


5558 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-NTP 


131 


68 


5559 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


112 


67 


5560 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


159— 


73 


5561 


Y02749 

i 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


141 


67 


5562 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


150 

* 


71 


5563 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


146 


80 


5564 


AF090942 


Homo sapiens 


PRO0657 


157 


73 


5565 

• 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


169 


43 

- 


5566 


AL035526 


Arabidopsis 
thaliana 


extensin-like protein 


103 


40 


5567 


AF130089 


Homo sapiens 


PRO2550 


132 


67 


5568 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


129 


83 


5569 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor 1 68 
kDa subunit 


153 

* 


42 


5570 


AF068266 


Homo sapiens 


EHT protein 


169 


91 


5571 


W48927 


Homo sapiens 


Schwannom in-binding protein C- 
terminal fragment 


113 


42 


5572 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


68 


5573 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


1?2 


71 


5574 


L20450 


Musmusculus 


DNA-binding protein 


283 


61 


5575 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


95 


5576 


Y01400 

■ 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


140 


71 


5577 


Y20648 


Homo sapiens 


Human neurofilament-L wild type 
protein fragment 38. 


66 


37 


5578 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


112 


41 


5579 


AL359782 


Trypanosoma 
brucei 


possible (nhv-6) ul 102, variant a dna, 
complete virion genome. 


142 


74 


5580 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


142 


77 


5581 


AF 1 53606 


Homo sapiens 


angiopoietin-related protein 


737 


95 


5582 


AF090930 


Homo sapiens 


PRO0478 


149 


77 


5583 


X52493 


Glycine max 


DNA -directed RNA polymerase 


140 


46. 


5584 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


101 


51 


5585 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


66 


47 


5586 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


130 


72 


5587 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


148 


56 

* 


5588 


G00357 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4438. 


148 


66 


5589 


AC006014 


Homo sapiens 


similar to KIAA0618 and nuclear 
envelope protein POM 12 1 ; . 
alternatively spliced product; similar to 
PID:g3327050andP52591 
(PED:gl709213) 


108 


84 


5590 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


122 


84 
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jLrescnpiioQ 


OUllUH 
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~BT 

Identit 

y 




» 




oncoprotein. 






5591 


AB001684 


Chlorella 
vulgaris 


ORF74 


61 


47 


5592 


AF090930 


Homo sapiens 


PRO0478 — 


123 


70 


5593 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


120 


57 


5594 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


124 


59 


5595 


L26953 


Homo sapiens 


chromosomal protein 


116 


62 


5596 


L26953 


Homo sapiens 


chromosomal protein 


115 


67 


5597 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


157 


55 


5598 


Y71213 


Homo sapiens 


Human irritable bowel disease related 
polypeptide IMX39. 


131 


44 


5599 


GO0376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


125 


45 


5600 


AF130089 


Homo sapiens 


PRO2550 


142 


62 


5601 


AY014404 


Homo sapiens 


kinesin-like protein RBKIN2 


297 


98 


5602 


AF210651 ■ 


Homo sapiens 


NAG 18 


113 


71 


5603 


AF090931 


Homo sapiens 


PRO0483 


129 


71 


5604 


X01060 


Homo sapiens 


put. transferrin receptor (aa 1-760) 


226 


80 


5605 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


455 


75 


5606 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 

< 


119 


.42 


5607 


A3 1039 


Nicotiana alata 


PRP3 


92 


36 


5608 


AF090944 


Homo sapiens 


PRO0663 


147 


64 


5609 


G00552 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4633. 


64 


46 


5610 


X01065 


Cairina moschata 


histoneH5 ~" 


88 


40 


5611 


U03413 


Dictyostelium 
discoideum 


calcium binding protein 


104 


40 


5612 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


144 


37 


5613 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


127 

. ■ 


65 

■ 


5614 


G01218 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5299. 


196 


97 


5615 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


122 


68 


5616 


AB029309 


Homo sapiens 


Npw38-binding protein NpwBP 


312 


78 


5617 


X03145 


Homo sapiens 


pot ORF III 


101 


80 


5618 


AF090930 


Homo sapiens 


PRO0478 


159 


78 


5619 


D88461 


Rattus rattus 


N-WASP 


107 


40 


5620 


W48927 


Homo sapiens 

i 


Schwamiomin-binding protein C- 
terminal fragment. 


104 


38 


5621 


Y36156 


Homo sapiens 


Human secreted protein #28. 


134 


50 


5622 


AF090894 


Homo sapiens 


PRO0113 


119 


65 


5623 


AF130089 


Homo sapiens 


PRO2550 


130 


69 


5624 


AF130089 


Homo sapiens 


PRO2550 


140 


78 


5625 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


162 


49 


5626 


AF130089 


Homo sapiens 


PRO2550 


152 


80 


5627 


AF151837 


Homo sapiens 


CGI-79 protein 


127 


78 


5628 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


122 


84 


5629 


AJ237981 


Vitis vinifera 


putative proline-rich cell wall protein 


93 


50 


5630 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


1 10 


52 



« 
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NO: 
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Species 

< 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 


5631 


AF 130089 


Homo sapiens 


PRO2550 


151 


62 


5632 


U47661 


Lupin us luteus 


proline-rich protein PRP2 precursor 


67 


28 


5633 


Y08061 


WW * 

Homo sapiens 


Human c-myb protein fragment 


93 


61 


5634 

• 


AL133215 

■ 


Homo sapiens 

< 


bA108L7.6 (semaphorin 4G (sema 
domain, immunoglobulin domain (Ig), 
transmembrane domain (TM) and short 
cytoplasmic domain)) 


323 

. 


100 

■ 


5635 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


138 


67 


******** 

5636 


AF090942 


Homo sapiens 


PRO0657 


184 


67 


5637 

• 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 

• 


80 


5638 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


139 


83 


5639 


AF 119900 


Homo sapiens 


PR02822 


143 


62 


5640 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


151 


61 


5641 . 


Y36243 


Homo sapiens 


Human secreted protein encoded by 
gene 20. 


118 


72 


5642 


AF090895 


Homo sapiens 


PRO0117 


140 


56 


5643 


Y51611 


Homo sapiens 


Human HSGT1 protein. 


611 


96 


5644 


AF 162692 


Homo sapiens 


putative voltage-gated calcium channel 
gamma-4 subunit 


172 


73 


5645 


Z98883 


Homo sapiens 


C407A10.1 (GPU (N- 
acetylglucosaminyl transferase 
component)) 


229 


100 

■ 


5646 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


117 


80 


5647 


G03 172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


159 


78 


5648 


AF217517 


Homo sapiens 


un characterized bone marrow protein 
BM041 


97 


49 


5649 


AF210651 


Homo sapiens 


NAG 18 


149 


58 


5650 


AK000385 


Homo sapiens 


unnamed protein product 


125 


36 


5651 


U023 13 


Mils museums 


protein kinase 


143 


96 


5652 


AF090944 


Homo sapiens 


PRO0663 


95 


43. 


5653 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


131 


61 


5654 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


100 


58 


5655 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


157 


70 


5656 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


100 


5657 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


123 


47 


5658 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


159 


46 


5659 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


163 


60 


5660 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


105 


46 


5661 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


125 


71 


5662 


D87744 


Mus musculus 


DRPLA protein 


89 


40 


5663 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


65 


5664 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


87 


65 
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^ 

NO: 


Accession No. 


Species 
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Waterman 
Score 


% 

Identit 

y 


5665 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-nch antigen 


71 


44 


5666 


A188I2 


Brassica napus 


extensin 


98 


41 


5667 


AF1 30089 


Homo sapiens 


PRO2550 


128 


74 ~ 


5668 


AF090930 


Homo sapiens 


PRO0478 


136 


54 


5669 


X52634 


Mus musculus 


tlm protein 


169 


59 


5670 


AB001684 


Chlorella 
vulgaris 


ORF49b 

i 


70 

• 


62 


5671 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


153 

4 


66 


5672 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


123 


78 


5673 


AF090942 


Homo sapiens 


PRO0657 


150 


60 


5674 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


123 


78 


5675 


M22332 


Homo sapiens 


unknown protein 


121 


43 


5676 


W40353 

i 


Homo sapiens 


i Human unspecified protein from 
US5702907. 


128 


72 


5677 


AL049868 


Homo sapiens 


dJ927M242(KIAA1219) 


403 


95 


5678 


U31785 


Human 

papillomavirus 
type 36 


putative 


103 


44 


5679 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


101 


73 


5680 


AF068229 


Homo sapiens 


lysyl hydroxylase 3 


148 


92 


5681 


Y00358 

* 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


209 

»i 


81 


5682 


AB008515 


Homo sapiens 


RanBPM 


142 


100 


5683 


ABO 10692 

• 


Arabidopsis 
thaliana 


contains similarity to endo-beta-N- 
acetylglucosaminidase-gene id:K18I2 
3.27 


148 


45 


5684 


J02459 


bacteriophage 
lambda 


J (tail:host specificity; 1 132) 


746 


100 


5685 


AL359782 


Trypanosoma 
brucci 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


194 


69 


5686 


AB049054 


Homo sapiens 


brain link protein-1 


160 


100 


5687 


AF229067 


Homo sapiens 


PAD1-H protein 


144 


68 


5688 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


120 


70 


5689 


AP001313 


Arabidopsis 
thaliana 


contains similarity to 
pherophorin-gene_id:T5M7.14 


106 


47 


5690 


M76976 


Drosophila 
melanogaster 


dorsal-ventral patterning protein 


167 


37 


5691 


L24559 


Homo sapiens 


DNA polymerase alpha 


175 


100 


5692 


AF071777 


Mus musculus 


IRE1 


604 


85 


5693 


AF061741 


Homo sapiens 


retinal short-chain 
dehydrogenase/reductase retSDRl 


704 


99 


5694 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


178 


66 


5695 


AF092092 


Homo sapiens 


AP-3 adaptor complex mu3 A subunit 


653 


92 


5696 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


106 


86 


5697 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


132 


75 


5698 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


116 


40 


5699 


AF161361 


Homo sapiens 


HSPC098 


128 


56 


5700 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


106 


57 
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Species 
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% i 
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5701 


AF1 19901 


Homo sapiens 


PR02831 


113 


76 


5702 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


101 j 


49 


5703 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


105 


78 


5704 


AF130089 


Homo sapiens 


PRO2550 


122 


75 


5705 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


136 


72 


5706 


G02455 

- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


113 


75 

■ 


5707 


AF229067 


Homo sapiens 


PADI-H protein 


134 


81 


5708 


W40353 


Homo sapiens 


Human unspecified protein from 
US5702907. 


129 


65 


5709 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


137 


81 


5710 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


54 


5711 


Y76381 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 75. 


247 


93 


5712 


AF1 16715 


Homo sapiens 


PR02829 


118 


75 


5713 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


154 


71 


5714 


G04078 , 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
8159. 


141 


79 

* 


5715 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


91 


80 


5716 


AF090944 


Homo sapiens 


PRO0663 


136 


56 


5717 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


153 


65 


5718 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


109 


70 


5719 


AF130089 


Homo sapiens 


PRO2550 


140 


86 


5720 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


144 


67 


5721 


AF130089 


Homo sapiens 


PRO2550 


146 


58 


5722 


AF 118086 


Homo sapiens 


PR01992 


130 


58 


5723 


D21230 


Homo sapiens 


alpha 1 (XV) collagen chain 


390 


78 


5724 


AL049569 


Homo sapiens 


dJ37C10.5 (KIAA0445) , 


266 


93 


5725 


X04412 


Homo sapiens 


plasma gelsolin 


761 


95 


5726 


A21853 


synthetic 
construct 


serine protease inhibitor 


206 


100 


5727 


Y16610 


Homo sapiens 


paraplegin 


218 


95 


5728 


AF132021 


Homo sapiens 


myosin X 


613 


87 


5729 


AL1 63491 


Arabidopsis 
thaliana 


putative protein 


236 


46 


5730 


LI 9704 


Homo sapiens 


alternative first exon 


561 


86 


5731 


D83703 


Homo sapiens 


peroxisome assembly factor-2 


240 


84 


5732 


AB007936 


Homo sapiens 


KIAA0467 protein 


189 


77 


5733 


L38622 


Mus musculus 


mSin3B gene product 


280 


94 


5734 


AF051944 


Gallus gallus 


Xin J 


460 


68 


5735 


B07857 


Homo sapiens 


Amino acid sequence of Smadl 
interactor protein clone S I +12-5. 


172 


67 


5736 


Z14020 


Nicotiana 
tabacum 


Pistil extensin like protein, partial CDS 
only 


105 


44 


5737 


LI 7308 


Gossypium 
hirsutum 


proline-rich cell wall protein 


145 


34 


5738 


Y76141 


Homo sapiens 


Human secreted protein encoded by 
gene 18. 


109 


72 


5739 


AF178534 


Homo sapiens 


talin 


583 


82 


5740 


D13435 


Homo sapiens 


PIG-F 


143 


100 


5741 


AC005578 


Homo sapiens 


F20887 1, partial CDS 


584 


86 
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Watermao 

Score 


% 

Identit 

y 


5742 


AC005578 


Homo sapiens 


F20887_l, partial CDS 


470 


85 


5743 


AJ010479 


Homo sapiens 


kinesin-like protein 


102 


23 


5744 


M19419 


Mus musculus 


proline-rich salivary protein 


107 


42 


5745 


U29377 


Caenorhabditis 
elegans 


lysine-rich 


97 

■i 


44 


5746 


G00377 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


90 


67 


5747 


D84223 


Homo sapiens 


leucyl tRNA synthetase 


165 


83 


5748 


J04990 


Homo sapiens 


cathepsin G 


165 


100 


5749 


Y36203 


Homo sapiens 


Human secreted protein #75 . 


122 


77 


5750 


V00488 


Homo sapiens 


alpha globin 


181 


94 


5751 


M18642 


Homo sapiens 


lactoferrin 


122 


96 


5752 


W15286 


Homo sapiens 


Soluble type I insulin-like growth 
factor receptor. 


481 


87 


5753 


Z98595 


Schizosaccharom 
yces pombe 


coronin-like protein 


192 


38 


5754 


AL035683 


Homo sapiens 


dJ1063B2.1(Beta-l,4- 
galactosyltransferase) 


239 


100 


5755 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


170 


61 


5756 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


105 


61 


5757 


D49490 


Homo sapiens 


protein disulfide isomerase-related 
protein (PDIR) 


669 


96 


5758 


W88627 


Homo sapiens 

• 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


175 


72 


5759 


X61451 j 


Mus musculus 


F41 


130 


100 


5760 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


91 


61 


5761 

• 


AL 1 60493 £ 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


130 


76 


5762 


AF 130079 


Homo sapiens 


PR02852 


126 


61 


5763 


AF143946 


Homo sapiens 


transcriptional activator SRCAP 


288 


40 


5764 


M80627 


Homo sapiens 


helix-loop-helix protein 


127 


88 


5765 


AF038007 


Homo sapiens 


FIC1 


290 


47 


5766 


G01881 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5962. 


427 


97 


5767 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


172 


75 


5768 


B12884 


Homo sapiens 


Human alpha-bradeion protein 
sequence. 


465 


70 


5769 


Y66645 


Homo sapiens 


Membrane-bound protein PROl 3 10. 


347 


100 


5770 


AB033168 


Mus musculus 


nuclear protein ZAP 


115 


39 


5771 


X83575 


Cricetulus 
griseus 


CHOI antigen 


113 


57 


5772 


AF135422 


Homo sapiens 


GDP-mannose pyrophosphorylase A 


764 


91 


5773 


L38644 


Rattus 
norvegicus 


karyopherin beta 


121 


68 


5774 


AB020746 


Arabidopsis 
thaliana 


protein kinase-like protein 


107 

■ ■ 


33 


5775 


Y17920 


Drosophila 
melanogaster 


C ALO protein 


128 


31 


5776 


AK000309 


Homo sapiens 


unnamed protein product 


988 


98 


5777 


AFO 10404 


Homo sapiens 


ALR 


469 


97 


5778 


AB045131 


Homo sapiens 


NADPH-dependent retinol 
dehydrogenase/reductase 


126 


92 


5779 


AF197927 


Homo sapiens 


AF5q31 protein 


530 


95 


5780 


X79198 


Homo sapiens 


host cell factor 


119 


74 


5781 


X69838 


Homo sapiens 


G9a 


157 


41 
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vucnpuun 
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Waterman 
Score 


Identit 

y 


5782 


Y94450 


Homo sapiens 


Human inflammation associated 
protein #7. 


227 


44 


5783 


AF2 16804 


Rattus 


nuclear matrix transcription factor 


1539 


98 






-norvegicus 








5784 


R06463 


Homo sapiens 


Derived protein of clone ICA13 
(ATCC 40553). 


638 


91 


5785 


AF000149 


Mus musculus 


ATP-binding cassette transporter 


292 


41 


5786 


D89974 


Homo sapiens 


grycosylphosphatidyl inositol-anchored 
protein GP1-80 


490 


78 


5787 


AF090942 


Homo sapiens 


PRO0657 


132 


63 


5788 


X56932 


Homo sapiens 


23 kD highly basic protein 


286 


100 


5789 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
.7253. 


114 


63 


5790 


297211 


Schizosaccharom 
yces pombe 


kinesin-like protein 


346 

* 


60 


5791 


AF134707 


Homo sapiens 


disintegrin and metalloproteinase 
domain 19 


144 

• 


95 


5792 

- 


Z19550 

i 


Homo sapiens 


N-acetyllactosaminide beta-l,6-N- 
acetylglucosaminyltransferase 


547 

- 


76 


5793 


AF090930 _, 


Homo sapiens 


PRO0478 


103 


80 


5794 

• 


Y59879 


Homo sapiens 


Human normal uterus tissue derived 
protein 42. 


407 


97 


5795 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


149 


72 


5796 


AF077208 


Homo sapiens 


HSPC022 


682 


100 


5797 


D42138 


Homo sapiens 


PIG-B 


597 


86 


5798 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


75 


5799 


M29487 


Homo sapiens 


integrin alpha subunit precursor 


160 


80 


5800 


D87438 


Homo sapiens 
• 


Similar to a C.elegans protein in 
cosmidC14H10 


354 


85 


5801 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 
haemoglobin gene contained inpSSl. 


195 


97 


5802 


AB030176 


Homo sapiens 


peptidylargmine deiminase type II 


107 


100 


5803 


D38441 


Homo sapiens 


acylamino acid-releasing enzyme 


156 


71 


5804 


Y55939 


Homo sapiens 


Human GEK2 protein. 


157 


96 


5805 


X89626 


Bos taunts 


240K protein of rod photoreceptor cng- 
channel 


92 


42 


5806 


AC006963 


Homo sapiens 


similar to Kelch proteins; similar to 
BAA77027 (PID:g4650844) 


177 


38 


5807 


Y86275 


Homo sapiens 


Human secreted protein HSRGW16, 
SEQIDNO:190. 


436 


77 


5808 


D50577 


Mesocricetus 
auratus 


carboxylesterase precursor 


328 


55 - 


5809 


AF272148 


Homo sapiens 


KRAB zinc finger protein 


593 


68 


5810 


W57260 


Homo sapiens 


Human semaphorin Y. 


651 


96 


5811 


AF202893 


Mus musculus 


Kifclb 


491 


82 


5812 


AC006963 


Homo sapiens 


similar to Kelch proteins; similar to 
BAA77027 (PlD:g4650844) 


219 


38 


5813 


AB017114 


Homo sapiens 


AD 3 


142 


57 


5814 


AC007954 


Homo sapiens 


unknown 


678 


100 


5815 


AF124726 


Homo sapiens 


acinusL 


790 


93 


5816. 


AB015330 


Homo sapiens 


HRIHFB2007 


167 


36 


5817 


AB020684 


Homo sapiens 


KIAA0877 protein 


699 


57 


5818 


Z15025 


Homo sapiens 


Bat2 


172 


49 


5819 


X95761 


Mus musculus 


new:Rhobin 


549 


79 


5820 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


166 


68 


5821 


X55126 


Mus musculus 


Zfp-29 


633 


93 


5822 | Y99366 


Homo sapiens 


Human PR01475 (UNQ746) amino 


452 


98 
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To 
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Identit ' 
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y 








acid sequence SEQ ID NO:88. 






5823 


AB040801 


Homo sapiens 


SREB3 


611 


95 


5824 


M27878 


Homo sapiens 


DNA binding protein 


462 


64 


5825 


Y13620 


Homo sapiens 


BCL9 


140 


40 


5826 


AF152510 


Homo sapiens 


protocadherin gamma A3 short form 


525 


92 








protein 


■ 




5827 


M23451 


Achlya 


steroid receptor 


337 ' 


61 






ambisexualis 


■ 


• 




5828 


AJ011305 


Homo sapiens 


guanine nucleotide exchange factor, 


132 


100 




■ 




delta subunit 






5829 


Y85565 


Homo sapiens 


Human homologue of UNC-53 (Hs- 


710 


96 








UNO-53/2) sequence. 






5830 

• 


U03277 


Drosophila 


cloned by ability to arrest the cell cycle 


258 


46 






melanogaster 


when expressed in the fission yeast 




- 






- 


Schizosaccharomyces pombe 






5831 . 


AF193342 


Mus musculus 


GCN2alpha 


469 


85 


5832 


AF279265 


Homo sapiens 


putative anion transporter 1 


240 


91 


5833 


AK023998 


Homo sapiens 


unnamed protein product 


687 


97 


5834 


AK023998 


Homo sapiens 


unnamed protein product 


674 


96 


5835 


AF071059 


Mus musculus 


zinc finger RNA binding protein 


243 


100 


5836 


AF178534 


Homo sapiens 


talin 


463 


62 


5837 


J04569 


Homo sapiens 


glial fibrillary acidic protein 


102 


55 


5838 


AF068625 


Mus musculus 


DNA cytosine-5 methyltransf erase 3A 


130 


89 


5839 


AB037739 


Homo sapiens 


KIAA13 18 protein 


644 


97 


5840 


R86865 


Homo sapiens 

ft 


Human protocadherin pc3. 

ft ft 


660 


96 


5841 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


84 








7124. 


D 




5842 


X51486 


Gallus gallus 


cNFI-Al protein (AA 1-522) 


590 


86 


5843 


U65785 


Homo sapiens 


1 50 kDa oxygen-regulated protein 

m %J ft 


200 


95 








ORP150 






5844 


U16126 


Homo sapiens 

My 


EAA4 


679 


97 


5845 


AL035071 


Homo sapiens 


dJ1085F17.L3 (ortholog of mouse 


717 


100 








DNMT3B (DNA Cytosine-5 






• 






Methyltransferase 3B3) (isoform 3)) 


• 


• 


5846 


Y13620 


Homo sapiens 


BCL9 


505 


81 


5847 


AC004812 


Homo sapiens 


similar to human HsGCN 1 U77700 


316 


100 






(PID:g2282576); similar to yeast 












translation activator GCN1 












(PID:g462168) 






5848 


M77246 


Rattus 


beta-chain clathrin associated protein 


611 


89 






norvegicus 


complex AP-2 






5849 


AF007833 


Homo sapiens 


kruppel-related zinc finger protein 


569 


79 








hcKrox 


• 




5850 


AF056617 


Homo sapiens 


BWSCR2 associated zinc-finger 


175 


32 






t 


protein BAZ1 






5851 


D38548 


Homo sapiens 


The ha093 6 gene product is novel. 


409 


65 


5852 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


55 








4609. 


■ 




5853 


D87071 


Homo sapiens 


similar to C.elegans protein encoded in 


188 


76 








cosmid T20D3 (Z68220). 


- . 




5854 


M64979 


Homo sapiens 


glial factor- 1 


557 


83 


5855 


G01480 


Homo sapiens 


Human' secreted protein, SEQ ID NO: 


400 


82 








5561. 






5856 


X06345 


Xenopus laevis 


lamin A (AA 1-265) 


109 


68 


5857 


AB017499 


Oryctolagus 


Lipoprotein Receptor Related Protein 5 


196 


40 






cuniculus 








5858 


D43633 


Oryzias latipes 


G protein-coupled seven- 


281 


50 








transmembrane receptor 






5859 


AF067I65 


Homo sapiens 


zinc finger protein 3 


669 


87 
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y 


5860 


AL031118 


Homo sapiens 


dJ153G14.3 (novel C2H2 type Zinc 
Finger protein) 


218 


38 


5861 


U13897 


Homo sapiens 


homolog of Drosophila discs large 
protein, isoform 1 


130 


35 


5862 


M74002 


Homo sapiens 


arginine-rich nuclear protein 


80 


23 


5863 


X67855 


Oryctolagus 
cuniculus 


brain calcium channel BII-1 


147 


66 


5864 


U35376 


Homo sapiens 


repressor transcriptional factor 


475 


67 


5865 


AF251038 


Homo sapiens 


GAP-like protein 


325 


58 


5866 


L46815 


Mus musculus 


DNA binding protein Rc 


409 


66 


5867 


S68944 


Rattus sp. 


Na+/Cl(-)-dependent neurotransmitter 
transporter 


582 


95 


5868 


X97675 


Homo sapiens 


plakophilin 2b 


159 


93 


5869 


AF1 52478 


Homo sapiens 


protocadherin alpha 13 short form 
protein 


680 

■ 


100 


5870 


AF1 18082 


Homo sapiens 


PRO1902 


130 


48 


5871 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


149 


49 


5872 


A61971 


unidentified 


MCSP 


621 


82 


5873 


254206 , 


Bovine 
herpesvirus 1 


UL51 


104 


33 


5874 


AF069765 


Homo sapiens 

• 


signal recognition particle 72 


674 


97 


5875 


AF1 79428 


Homo sapiens 


DNA polymerase zeta catalytic subunit 
variant I 


692 


100 


5876 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


119 


48 


5877 


Y94990 


Homo sapiens 


Human secreted protein vh21 3, SEQ 
IDNO:20. 


446 


98 


5878 

■ 


L08811 

■ 


Drosophila 
melanogaster 


adherin 


264 

- 


42 


5879 


D86983 


Homo sapiens 


similar to D.melanogaster 
peroxidasin(U11052) 


429 


100 


5880 


AF 1 52508 


Homo sapiens 


protocadherin gamma A2 short form 
protein 


664 


100 


5881 


AF152508 


Homo sapiens 


protocadherin gamma A2 short form 
protein 


660 


98 


5882 


AF190018 


Mus musculus 


K-Cl cotrahsporter 


199 


88 


5883 


AB023229 


Homo sapiens 


K1AA1012 protein 


107 


83 


5884 


U53420 


Rattus 
norvegicus 


sodium-calcium exchanger form 3 


595 


98 


5885 


AF039019 


Homo sapiens 


zinc finger DNA binding protein 89 
kDa 


608 . 


92 


5886 


AF207661 

• 


Homo sapiens 

- 


sodium bicarbonate cotransporter-like 
protein i 


701 


100 


5887 


AF130051 


Homo sapiens 


PRO0898 


115 


74 


5888 


AF 152508 


Homo sapiens 


protocadherin gamma A2 short form 
protein 


720 


100 


5889 


AF251079 


Homo sapiens 


heptacellular carcinoma novel gene-3 
protein 


741 


98 


5890 


R13556 


Homo sapiens 

- 


Protein encoded downstream of hhcJM 
oncoprotein. 


118 


51 


5891 


AF1 57326 


Homo sapiens 


T1P120 protein 


571 


85 


5892 


AF129170 


Homo sapiens 


apolipoprotein E receptor 2 


630 


99 


5893 


X05561 


Homo sapiens 


alpha- 1 chain precursor (AA -27 to 
917) (2953 is 2nd base in oodon) 


197 


45 


5894 


AJ238374 


Homo sapiens 


putative protein TH1 


641 


96 


5895 


X63526 


Homo sapiens 


homologue to elongation factor 1- 
gamma from A.salina 


709 


97 


5896 


Y11651 


Homo sapiens 


phosphate cyclase 


521 


80 
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% 
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5897 


Y 18448 


Homo sapiens 


Bassoon protein 


681 


100 


5898 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


138 


63 


5899 


AF 152473 


Homo sapiens 


protocadherin alpha Gl short form 
protein 


733 

■ 


100 


5900 


G03790 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


133 


66 


5901 


R06463 


Homo sapiens 


Derived protein of clone ICA 1 3 
(ATCC 40553). 


522 


99 


5902 


X12928 


Triticum 
aestivum 


HMW glutenin subunit 5 (AA 1-848) 


169 


25 


5903 


X61048 


Hydra sp. 


mini-collagen 


88 


33 


5904 


AF 118082 


Homo sapiens 


PRO1902 


247 


60 


5905 


A00279 


synthetic 
construct 


Human serum albumin 


482 


84 


5906 


T14722_cdl 


Homo sapiens 


01-DEC-1992 Human CD27 antigen 
cDNA. 


431 


94 


5907 


Y73475 


Homo sapiens 


Human secreted protein clone ye7_l 
protein sequence SEQ ID NO:172. 


271 


94 


5908 


AF175265 


Homo sapiens 


vacuolar sorting protein 35 


645 


92 


5909 


Y12713 


Mus musculus 


Pro-PoI-dUTPase polyprotein 


125 


50 


5910 


GO036O 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4441. 


115 


58 


5911 


W88762 


Homo sapiens 


Polypeptide fragment encoded by gene 
20. 


152 


50 


5912 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


154 

»» 


68 


5913 

* 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


209 


71 


5914 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


143 


49 


5915 


S80644 


Sus scrofa 


glutaminase; GA 


338 


85 


5916 AF213393 


Mus musculus 


ATP-binding cassette protein 


92 


95 


5917 


X52622 


Mus musculus 


integrase (409 AA) 


425 


45 


5918 


U49973 


Homo sapiens 


ORF2: function unknown 


114 


64 


5919 


S71403 


Mus musculus 


m33-A isoform 


120 


42 


5920 


AF090931 


Homo sapiens 


PRO0483 


94 


89 


5921 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


99 

■ 


36 


5922 


AJ006770 


Cicer arietinum 


extensin 


96 


32 


5923 


AJ250193 


Mus musculus 


muscle protein 637 


730 


72 


5924 


AF229067 


Homo sapiens 


PADI-H protein 


278 


63 


5925 


X92485 


Plasmodium 
vivax 


pva| 


99 


50 


5926 


G03714 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
7795. 


84 

* 


66 


5927 


AF130089 


Homo sapiens 


PRO2550 


114 


64 


5928 


J00126 


Homo sapiens 


MHCfactorB 


237 


100 


5929 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


251 


79 


5930 


AF090895 


Homo sapiens 


PRO0117 


63 


62 


5931 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


147 


37 


5932 


U83857 


Homo sapiens 


Aacll 


209 


80 


5933 


AF068294 


Homo sapiens 


HDCMB45P 


144 


50 


5934 


X92485 


Plasmodium 
vivax 


pval 


135 


69 


5935 

* 


L07545 


Leishmania 
tarentolae 


A V was inserted after nt 369 (=nt 
10459 in genomic sequence (Ml 0126)) 
to correct -1 firameshift probably due to 


127 


36 
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■ 


gel compression 






5936 


R95913 


Homo sapiens 


Neural thread protein. 


276 


58 


5937 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


130 

■ 


65 


5938 


Y16781 


Homo sapiens 


Human secreted protein (clone 
bh!57_7). 


1258 


100 


5939 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


167 


68 


5940 


U93570 


Homo sapiens 


putative pi 50 


103 


41 


5941 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO.' 
7553. 


189 


74 


5942 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


110 


47 


5943 


AL390114 

■ 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


122 


90 


5944 


AF130089 


Homo sapiens 


PRO2550 


101 


48 


5945 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


162 


86 


5946 


X92485 


Plasmodium 
vivax 


pval 


104 


70 


5947 


M29622, 


Mus musculus 


open reading frame 2 


132 


64 


5948 


AF1 30089 


Homo sapiens 


PRO2550 


131 


60 


5949 


AF130089 


Homo sapiens 


PRO2550 


151 


60 


5950 


AF130079 


Homo sapiens 


PR02852 


117 


52 


5951 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


185 


62 


5952 


U49973 


Homo sapiens 


ORF2: function unknown 


160 


62 


5953 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
geneNo. 107. 


145 


55 


5954 


M22332 


Homo sapiens 


unknown protein 


169 


41 


5955 


Y02671 


Homo sapiens 

- 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


133 


72 


5956 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


100 


36 


5957 

■ 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


105 


# 


5958 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


112 


48 


5959 


Z49944 


Canis familiaris 


CDC42 GTP-binding protein 


120 


40 


5960 


AF1 18082 


Homo sapiens 


PRO1902 


158 


61 • 


5961 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


91 


65 


5962 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


121 


62 


5963 


AF 118082 


Homo sapiens 


PRO1902 


113 


60 


5964 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


65 


5965 

■ 


U49973 


Homo sapiens 

- 


ORF I ; MER37; putative transposase 
similar to pogo element 


120 


57 


5966 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


306 


77 


5967 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7305. 


80 


41 


5968 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


109 


75 


5969 


AF1 19855 


Homo sapiens 


PR01847 


162 


72 


5970 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


146 


46 


5971 


M19155 


Plasmodium 
falciparum 


S-antigen precursor 


122 


37 


5972 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


121 


66 
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NO: 


ApfKtinn Kin 


species 
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omitD- 
W a term an 
Score 


% 

Id en tit 

y 


5973 


ALI60371 


Leishmania 
major 


jjrobable (hhv-6)ul 102, variant a 
DNA, complete virion genome 


107 


60 


5974 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


83 


68 


5975 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4578. 


111 


47 


5976 


U93572 


Homo sapiens 


p40 


112 


33 


5977 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


107 


58 


5978 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


70 


5979 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


106 


54 


5980 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


105 


31 


5981 


X66285 


Mus musculus 


HC1 ORF 


86 


36 


5982 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


110 


43 


5983 


Y36366 


Homo sapiens 

■ 


Fragment of human secreted protein 
encoded by gene 3. 


131 


60 


5984 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


110 


60 


5985 


G02753 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


107 


47 


5986 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


160 


77 


5987 


AF130079 


Homo sapiens 


PR02852 


272 


77 


5988 


U93570 


Homo sapiens 


putative pi 50 


87 


46 


5989 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


202 


89 


5990 


AF 130089 


Homo sapiens 


PRO2550 


238 


74 


5991 


AF 118082 


Homo sapiens 


PRO1902 


119 


41 


5992 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


60 

• 


5993 


L27428 


Homo sapiens 


reverse transcriptase 


110 


60 


5994 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


98 


45 


5995 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


90 


5996 


X58236 


Homo sapiens 


36/8-8 fusion protein with epitope for 
anti-Iectin antibody 


78 


68 


5997 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


97 


63 


5998 


AP002460 

i 


Arabidopsis 
thaliana 


gene_id:FlD9,26~unknown protein 


100 


38 


5999 


U93569 


Homo sapiens 


p40 


103 


52 




Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


100 


63 


6001 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


121 


51 


6002 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


64 


6003 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


105 


40 


6004 


AF130089 


Homo sapiens 


PRO2550 


122 


63 


6005 


R95913 


Homo sapiens 


Neural thread protein. 


235 


56 


6006 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


173 


58 


6007 


X92485 


Plasmodium 
vivax 


pval 


96 


53 
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aty ID 

NO: 


Accession No. 


species 


Description 


ami in- 

Waterman 

Score 


T— ST 1 

Identit 

y 


6008 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


236 


62 


6009 

i 


Y 17832 

• 


Human 
endogenous 
retrovirus K 


env protein 


103 


66 


6010 


AF1 19900 


Homo sapiens 


PR02822 


91 


50 


6011 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


155 

• 


55 


6012 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


201 


91 

• 


6013 


Y36156 


Homo sapiens 


Human secreted protein #28. 


253 


70 


6014 


AF130079 


Homo sapiens 


PR02852 


119 


36 


6015 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


137 


63 


6016 


AF1 30089 


Homo sapiens 


PRO2550 


110 


40 


6017 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


57 


601S 


AF1 16661 


Homo sapiens 


PR01438 


147 


69 


6019 


AF130089 


Homo sapiens 


PRO2550 


169 


71 


6020 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


208 


60 


6021 


AF1 19851 


Homo sapiens 


PR01722 


147 


65 


6022 


G02507 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6588. 


139 

■ 


47 


6023 


AF130089 


Homo sapiens 


PRO2550 


142 


66 


6024 


G03133 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7214. 


91 

i 

1 


76 


6025 


Ml 8093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


103 


48 


6026 


D86853 


Catharanthus 
roseus 


extensin 


97 


46 


6027 


AF130089 


Homo sapiens 


PRO2550 


95 


47 


6028 


X92485 


Plasmodium 
vivax 


pval 


102 


61 


6029 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


149 


53 


6030 


G00643 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4724. 


96 


76 


6031 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


144 


44 


6032 


AB011483 


Arabidopsis 
thaliana 


contains similarity to root cap 
protein-genejd:MUF9. 16 


117 


37 


6033 


X92485 


Plasmodium 
vivax 


pval 


85 


40 


6034 


AF 11 8082 


Homo sapiens 


PRO1902 


112 


58 


6035 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


121 


59 


6036 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


65 


6037 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


107 


71 


6038 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


140 


65 


6039 


X92485 


Plasmodium 
vivax 


pval 


84 


78 


6040 


AF130051 


Homo sapiens 


PRO0898 


107 


69 


6041 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


113 


58 


6042 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


209 


75 
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'Accession lyo* 


Species 


i/escnpnon 


1 Smith. 1 

O Hi (tlx- 


' " 


NO: 








Waterman 


Identit 






• 




Score 


y 


6043 


Y36495 


Homo sapiens 


Fragment of human secreted protein 


98 


56 








encoded by gene 27. 


- 




6044 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


43 








7870. — 


• 




6045 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


65 








7795. 


- 


• 


6046 


Y36156 


Homo sapiens 


Human secreted protein #28. 


93 


62 


6047 


AF130079 


Homo sapiens 


PR02852 


205 


83 


6048 


R95913 


Homo sapiens 


Neural thread protein. 


148 


45 


6049 


D38112 


Homo sapiens 


cytochrome c oxidase subunh 3 


234 


86 


6050 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 5 


403 


61 


6051 


Ml 1901 


Rattus 


proline-rich salivary protein 


96 


42 






norvegicus 








6052 


AF 130089 


Homo sapiens 


PRO2550 


348 


73 


6053 


AF1 19851 


Homo sapiens 


PRO 1722 


109 


68 


6054 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


66 

• 








7874. 






6055 


AF161356 


Homo sapiens 


HSPC093 


83 


58 


6056 


X92485 


Plasmodium 


pval 


155 


55 






vivax 








6057 


G04000 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


520 


100 








8081. 






6058 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


65 


6059 


W03626 


Homo sapiens 


Human thyrotropin GPR N-terminal 


163 


53 








sequence. 






6060 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


300 


68 






■ 


6953. 






6061 


U93563 


Homo sapiens 


putative pi 50 


389 


37 


6062 


AF090895 


Homo sapiens 


PRO0117 


66 


78 


6063 


X92485 


Plasmodium 




110 


86 






vivax 






• 


6064 


AF130089 


Homo sapiens 


PRO2550 


166 


71 


6065 


M22332 


Homo sapiens 


unknown protein 


188 


48 


6066 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 


101 


52 








derived from DNA35638. 






6067 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


41 


6068 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


75 






4609. 






6069 


M22332 


Homo sapiens 


unknown protein 


153 


80 


6070 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


109 


82 


- 






RHAla(AF078683) 






6071 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


73 








4484. 






6072 


AF 130079 


Homo sapiens 


PR02852 


143 


60 


6073 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


55 






• 


4478. 






6074 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


65 








7795. 






6075 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


68 








6613. 




• 


6076 


AF1 16712 


Homo sapiens 


PR02738 


102 


41 


6077 


W48351 


Homo sapiens 


Human breast cancer related protein 


102 


69 






BCRB2. 






6078 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


60 








4435. 






6079 


AF1 18082 


Homo sapiens 


PRO1902 


196 


61 


6080 


AF1 18082 


Homo sapiens 


PRO1902 


165 


59 


6081 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


66 








4478. 
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SEO ID 


^Accession No. 




Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score , 


y 


6082 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


188 


71 


6083 


Y86248 


Homo sapiens 


_ _ mm mm W ^tolH mw ta.B|k _m> A 

Human secreted protein HCHPF68, 


109 


60 






■ • 


m™m Wmm\ mWm^ W» mmm< ^m. 9 mmm^ mm m* m9± 

SEQ ID NO: 163. 






6084 


V00672 


Pan troglodytes 


reading frame protein 4 


185 


85 


6085 


AF130051 


Homo sapiens 


PRO0898 


mm. A+m. mm 

307 


67 


6086 


AF130079 


Homo sapiens 


PR02852 


96 


72 


6087 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


171 


82 


6088 


AF090895 


Homo sapiens 


PRO0117 


no 


67 


6089 


U43360 


Peromyscus 


reverse transcriptase 


101 


46 


■ 




maniculatus 


• 






6090 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


36 






* 


7284. 






6091 


Y02925 


Homo sapiens 


Fragment of human secreted protein 


111 


46 




• 




encoded by gene 101. 






6092 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


54 








4447. 






6093 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


59 




• 




7870. 




■ 


6094 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


199 


50 








gene 44 clone HTDAD22. 






6095 


U70935 


Peromyscus 


reverse transcriptase 


216 


44 






maniculatus 






■ 


6096 


X71442 


Rattus 


ORF 1; putative 


109 


53 






norvegicus 


■ 




* 


6097 


AF119855 


Homo sapiens 


PR01847 


88 


71 


6098 


L27428 


Homo sapiens 


reverse transcriptase 


104 


45 


6099 


AF090895 


Homo sapiens 


PRO0117 


119 


61 


6100 


U52077 


Homo sapiens 


mariner transposase 


295 


67 


6101 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


73 


■ 






7795. 






6102 


M22332 


Homo sapiens 


unknown protein 


132 


44 


6103 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


56 








4478. 


■ 




6104 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


64 


47 


■ 






6720. 






6105 


M64791 


Rattus 


salivary proline-rich protein 


123 


41 






norvegicus 








6106 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


139 


69 








SEQIDNO:163. 






6107 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


75 


58 








4478. 






6108 


AF068294 


Homo sapiens 


HDCMB45P 


141 


43 


6109 


AF1 18086 


Homo sapiens 


PR01992 


160 


77 


6110 


M63819 


Plasmodium 


malaria antigen 


62 


46 






falciparum 


- 






6111 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


65 




• 


• 


7174. 


- 




6112 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


282 


70 








7871; 


- 




6113 


AF1 19851 


Homo sapiens 


PR01722 


156 


63 


6114 


AF229067 


Homo sapiens 


PADI-H protein 


134 


48 


6115 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


44 


6116 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


201 


80 


6117 


AF285567 


Microcebus 


cytochrome b 


185 


58 






griseorufus 








6118 


AL1 38656 


Arabidopsis 


ex tens in precursor-like protein 


141 


28 






thaliana 


> 






6119 


X92485 


Plasmodium 


pval 


165 


61 






vivax 
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I 



t 
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> 



NO: 


Accession No* 


Soecies 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


6120 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


164 


84 


6121 


M13100 


Rattus 

• 

norvcgicus 


unknown protein 


176 


49 


6122 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


177 


50 


6123 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


83 


71 


6124 


AF 130089 


Homo sapiens 


PRO2550 


109 


85 


6125 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


129 


70 


6126 


AF144054 


Homo sapiens 


apoptosis related protein APR-4 


149 


54 


6127 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


138 


70 


6128 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


236 


77 


6129 


U93564 


Homo sapiens 


putative pl50 


152 


53 


6130 


Y02999 


Homo sapiens 

i 


Fragment of human secreted protein 
encoded by gene 121. 


129 


57 


6131 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


127 


56 


6132 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


195 


67 


6133 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


92 


42 


6134 


AK000496 : 


Homo sapiens 


unnamed protein product 


158 


59 


6135 


X92485 


Plasmodium 
vivax 


pval 


146 


55 


6136 


Y36203 


Homo sapiens 


Human secreted protein #75. 


133 


57 


6137 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


148 


68 


6138 


Y02693 


Homo sapiens 

_ 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


106 


36 


6139 


AF130051 


Homo sapiens 


PRO0898 


77 


65 


6140 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. , 


222 


73 


6141 


AF119851 


Homo sapiens 


PR01722 


107 


63 


6142 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


117 

• 


57 


6143 


R95913 


Homo sapiens 


Neural thread protein. 


114 


70 


6144 


AF068294 


Homo sapiens 


HDCMB45P 


123 


56 


6145 


AF 118082 


Homo sapiens 


PRO1902 


113 


32 


6146 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


105 


42 


6147 


Y30731 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


938 


100 


6148 


U93574 


Homo sapiens 


putative pl50 


99 


47 


6149 


AF090894 


Homo sapiens 


PRO0113 


145 


55 


6150 


AF1 30079 


Homo sapiens 


PR02852 


125 


49 


6151 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


91 


48 


6152 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7305. 


89 


60 


6153 


Y01398 


Homo sapiens 


Secreted protein encoded by gene 16 
clone HNGFR75. 


90 


82 


6154 


X92485 


Plasmodium 
vivax 


pval 


110 


42 


6155 


S79410 


Mus museums 


nuclear localization signal binding 
protein 


118 


48 


6156 


G01495 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


170 


89 
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SFO ID 


* Accession JVn 


UlNiLILa 


Dtscrintinn 

mJ IUUVU 


Smftfi- 


% 


NO: 








Watennan 


ldentit 










Score 


y 




i 


» 


5576. 






6157 


X92485 


Plasmodium 


pval 


157 


52 


■ 




vivax 


• 






6158 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


58 


'i 






6953. 






6159 


U36794 


Coturnix 


Cytochrome oxidase subunit I 


70 


73 




■ 


japonica . 


* 






6160 


AF1 18082 


Homo sapiens 


PRO1902 


119 


66 


6161 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


64 


• 


- 


. 


4718. 






6162 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


37 








6065. 


■ 




6163 


M22332 


Homo sapiens 


unknown protein 


125 


51 


6164 


W48351 


Homo sapiens 


Human breast cancer related protein 


163 


59 




> 




BCRB2. 


■ 




6165 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


235 


64 








6953. 






6166 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


36 








4770. 


- 




6167 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


139 


59 


6168 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


562 


85 


6169 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


296 


51 


6170 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


85 








6613. 






6171 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


329 


73 








7867. 






6172 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


53 




* 




6953. 






6173 


K03206 


Homo sapiens 


salivary proline-rich protein precursor 


104 


36 


6174 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


136 


75 








SEQIDNOM63. 






6175 


U93569 


Homo sapiens 


p40 


192 


40 


6176 


X70343 


Nicotiana 


extensin 


94 


35 






sylvestris 








6177 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


140 


81 








SEQIDNO:163. 


■ • 


■ 


6178 


AF1 19855 


Homo sapiens 


PRO 1847 


114 


43 


6179 


Y36366 


Homo sapiens 


Fragment of human secreted protein 


98 


68 






. 


encoded by gene 3. 






6180 


X03717 


Homo sapiens 


pot unidentified reading frame 


112 


42 


6181 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 


204 


76 




• 


• ■ 


similar to pogo element 






6182 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


110 


73 




• 


brucei 


complete virion genome. 






6183 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


240 


88 


6184 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


102 


47 








gene No. 107. 






6185 


X86031 


Sus scrofa 


homologue of prolme/argimne-rich 


99 


49 






■ 


antibacterial protein 






6186 


Y36156 


Homo sapiens 


Human secreted protein #28. 


196 


75 


6187 


AF 169387 


Mus museums 


alpha 3 collagen IV 


121 


38 


6188 


AF1 94537 


Homo sapiens 


NAG 13 


138 


37 


6189 


X91836 


Vigna 


extensin class 1 protein 


167 


33 






unguiculata 








6190 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


175 


63 






■ 


8153. 






6191 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


123 


46 








gene No. 107. 






6192 


X92485 


Plasmodium 


pval 


115 


54 
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• 
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Waterman 

Score 


% 

Identit 

y 






vivax 








6193 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


345 


65 


6194 


W88627 


Homo sapiens — 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


82 

* 


54 


6195 


G03790 


Homo sapiens 1 


Human secreted protein, SEQ ID NO: 
7871. 


214 


56 


6196 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


87 


79 


6197 

* 


Y08062 - 

■ 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


118 


41 

■ 


6198 


X62691 


Homo sapiens 


ribosomal protein homologous to yeast 
S24 


215 


73 


6199 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


100 


46 

•* 


6200 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


143 


73 


6201 


V00711 


Mus musculus 


cytochrome oxidase I 


435 


92 


6202 


U93572 


Homo sapiens 


putative pi 50 


99 


63 


6203 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


109 


45 


6204 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


163 


39 


6205 


Ml 5530 


Homo sapiens 


B-cell growth factor 


113 


60 


6206 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


128 


50 


6207 


W88627 


Homo sapiens 

• 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


181 


72 


6208 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1?3 


75 


6209 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


120 


62 


6210 


Y36203 


Homo sapiens 


Human secreted protein #75. 


169 


81 


6211 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


52 


6212 


G03295 

i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


123 


48 


6213 


D86854 


Catharanthus 
roseus 


extensin 


120 


44 


6214 


G02754 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6835. 


117 


100 


6215 


W33698 


Homo sapiens 


AL-2-long (AL-21) protein. 


262 


100 


6216 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


102 


62 


6217 


U40342 


Mus musculus 


ninein 


77 


72 


6218 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


174 


49 


6219 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


234 


54 


6220 


X92485 


Plasmodium 
vivax 


pval 


107 


46 


6221 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


104 


40 


6222 


X86029 


Vigna 
unguiculata 


extensin-like protein 

* 


140 


35 . 


6223 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


216 


59 


6224 


AF068294 


Homo sapiens 


HDCMB45P 


164 


60 


6225 


Y50797 


Homo sapiens 


Human caimodulin/NEF protein 
binding inhibitor 2. 


135 


50 


6226 


AE003859 


Xylella 
fasti diosa 


hypothetical protein 


139 


41 


6227 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


58 
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i « ■ 



~5E6 ID 
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Snectes 


D&scrintinn 




% 


NO: 








Waterman 


Identit 










Score 


y 




i 


1 


4718. 






, 6228 


AF130089 


Homo sapiens 


PRO2550 


126 


43 


6229 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


190 


88 








SEQ ID NO: 163. 


• 




6230 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


124 


77 






• 


RHAla(AF078683) 


■ 




6231 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


174 


79 








SEQ ID NO: 163. 


• 




6232 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


168 


78 




• 




SEQIDN0.163. 


■ 




6233 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


164 


67 


■ 






similar to pogo element 


■ 




6234 


Y91577 


Homo sapiens 


Human secreted protein sequence 


288 


94 




■ 




encoded by gene 2 SEQ ID NO:250. 






6235 


AF1 13685 


Homo sapiens 


PRO0974 


114 


46 


6236 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


40 








7214. 






6237 


AL096844 


Streptomyces 


putative integral membrane protein 


88 


39 






coelicolor A3(2) 




■ 




6238 


AF1 30079 


Homo sapiens 


PR02852 


167 


50 


6239 


AF130089 


Homo sapiens 


PRQ2550 


365 


85 


6240 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


212 


69 


6241 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


102 


54 








gene No. 107. 






6242 


AF1 09907 


Homo sapiens 


S164 


146 


90 


6243 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


132 


52 








gene 22 clone HMSJW18. 






6244 


AF010I44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


52 


6245 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


32 




• 




7871. 






6246 


AF130089 


Homo sapiens 


PRO2550 


154 


77 


6247 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


91 


40 


6248 


Y36156 


Homo sapiens 


Human secreted protein #28. 


105 


43 


6249 


AF 116661 


Homo sapiens 


PR01438 


131 


60 


6250 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


68 








7795. 


. • 


■ 


6251 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


70 


73 








4462. 






6252 


Y64890 


Homo sapiens 


Human S EST related polypeptide SEQ 


106 


42 




- 




IDNO:1051. 






6253 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


247 


68 


6254 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


117 


75 






a 


SEQ ID NO: 163. 






6255 


V00662 


Homo sapiens 


cytochrome oxidase I 


355 


78 


6256 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


131 


65 






• 


encoded by gene 121. 






6257 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


176 , 


86 


6258 


AF061340 


Artibeus 


cytochrome c oxidase subunit III 


308 


63 






jamaicensis 




* 




6259 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


114 


43 


6260 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


179 


78 








6908. 






6261 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


192 


75 








similar to pogo element 






6262 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


71 








7795. 






6263 


U09116 


Homo sapiens 


ORFl, encodes a 40 kDa product 


114 


37 


6264 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


60 








7344. 







296 



WO 01/64835 PCT/US01/04927 



SEQ fi> 
NO: 


Accession No. 
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% 

Identit 

y 


6265 


U49974 


Homo sapiens 


mariner transposase 


206 


71 


6266 


Y08062 


Homo sapiens 

* 


Human PR0245 protein fragment 
derived from DNA35638. 


71 


72 


6267 


AF097473 


Mus musculus 


ORF1 


86 


40 


6268 


X70343 


Nicotiana 
sylvestris 


extensin 


95 


33 


6269 . 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 

• 


84 


6270 


L27428 


Homo sapiens 


reverse transcriptase 


132 


48 


6271 


AF 118082 


Homo sapiens 


PRO 1902 


153 


53 


6272 


U12690 


Homo sapiens 


cytochrome oxidase subunit 11 


222 


91 


6273 


X03 145 


Homo sapiens 


pot ORFV 


89 


35 


6274 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


90 


71 


6275 


U93572 


Homo sapiens 


p40 


83 


43 


6276 


AF058700 


Bos taums 


ubiquitin-S27a fusion protein 


148 . 


70 


6277 


AF016099 


Mus musculus 


endonuc lease/reverse transcriptase 


96 


65 


6278 


AF130079 


Homo sapiens 


PR02852 


179 


86 


6279 


AF161356 


Homo sapiens 


HSPC093 


108 


57 


6280 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


109 


51 


6281 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


130 


58 


6282 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


81 


69 


6283 


AF068294 


Homo sapiens 


HDCMB45P 


9,8 


37 


6284 


AF1 16715 


Homo sapiens 


PR02829 


112 


68 


6285 


X92485 


Plasmodium 
vivax 


pval 


92 


64 


6286 


U93570 


Homo sapiens 


p40 


90 


38 


6287 


Y36203 


Homo sapiens 


Human secreted protein #75. 


225 


62 


6288 


Z11793 


Homo sapiens 


selenoprotein P 


220 


65 


6289 

• 


X92485 


Plasmodium 
vivax 


pval 


125 


70 


6290 


X52634 


Mus musculus 


tlm protein 


114 


45 


6291 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


94 


36 


6292 


AF1 16661 


Homo sapiens 


PR01438 


144 


46 


6293 


L27428 


Homo sapiens 


reverse transcriptase 


81 


44 


6294 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: j 
4478. 


152 


51 


6295 


AF130089 


Homo sapiens 


PRO2550 


137 


35 


6296 


L27428 . 


Homo sapiens 


reverse transcriptase 


96 


53 


6297 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


134 


69 


6298 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


110 


58 


6299 


AF1 30089 


Homo sapiens 


PRO2550 


110 


61 


6300 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


136 


63 


6301 


AF220264 


Homo sapiens 


MOST-1 


172 


90 


6302 


AF 130079 


Homo sapiens 


PR02852 


311 


71 


6303 


AF068294 


Homo sapiens 


HDCMB45P 


136 


53 


6304 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


110 


62 


6305 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


113 


45 


6306 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


166 


58 



297 



SEOID 
NO: 


t Accession No 




I Dcs£ rinHnn 
i i/vovi ij;uvii 


i Smith- 
Waterman 
Score 


— 
Identit 

y 


6307 


S85655 


Homo sapiens 


prohibitin 


664 


91 


6308 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


155 


64 


6309 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: - 
7795. 


151 


73 


6310 


AF1 19851 


Homo sapiens 


PR01722 


160 


67 


6311 


L27428 


Homo sapiens 


reverse transcriptase 


91 


50 


6312 


AF 130089 


Homo sapiens 


PRO2550 


140 


55 


6313 


AF 130089 


Homo sapiens 


PRO2550 


211 


68 


6314 


X159I7 


Paramecium 
aurelia 


ORF4 protein (AA 1-156) 


97 


38 


6315 


Y08062 


Homo sapiens 

• 


Human PR0245 protein fragment 
derived from DNA35638. 


94 


51 


6316 


Y36203 


Homo sapiens 


Human secreted protein #75 . 


124 


65 


6317 


L27428 


Homo sapiens 


reverse transcriptase 


117 


56 


6318 


Ml 1900 


Mus muscuius 


1 5-kDa proline-rich salivary protein 


105 


41 


6319 


G02532 

■ 


Homo sapiens 


Human secreted protein, SEQ tt) NO: 
6613. 


175 


69 

• 


6320 


U92673 


Eurythenes sp. 
'AUS-3F* 


cytochrome oxidase I 


59 


37 


6321 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


283 


72 


6322 


AK000496 


Homo sapiens 


unnamed protein product 


206 


70 


6323 


AB047934 


Macaca 
fascicularis 


hypothetical protein 


173 


68 


6324 


AB023485 


Mus muscuius 


transcription factor CA150b 


131 


43 


6325 


AF020262 


Bos tanrus 


generaJ protein transport factor pi 6 


76 


67 


6326 


U22376 


Homo sapiens 


alternatively spliced product using exon 
13A 


98 


57 


6327 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


115 


60 


6328 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


100 


57 


6329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
"7870. 


115 


67 


6330 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 

• 


82 

• • 


63 

- 


6331 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


138 


57 


6332 


K01664 

■ 


Drosophila 
melanogaster 


B km -like protein 


112 


37 


6333 


M69297 


Homo sapiens 


ORF3 


70 


43 


6334 


U 15647 


Mus muscuius 


reverse transcriptase 


146 


53 


6335 


AF090894 


Homo sapiens 


PRO0113 


107 


48 


6336 


AF130089 


Homo sapiens 


PRO2550 


359 


75 


6337 


AJ271872 


Nicotiana 
sylvestris 


extensin 


134 


32 


6338 


AJ132694 


Homo sapiens 


small GTPaseraclb 


99 


100 


6339 


AF119851 


Homo sapiens 


PR01722 


123 


60 


6340 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


215 


56 


6341 


U83774 


Pernis apivorus 


cytochrome oxidase I 


193 


80 


6342 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


139 


62 


6343 


X92485 


Plasmodium 
vivax 


pval 


104 


81 


6344 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


110 


64 


6345 


U93572 


Homo sapiens 


p40 


79 


48 


6346 


X92485 


Plasmodium 


pval 


156 


55 
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Description 

• 
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% 

Identit 

y 






vivax 








6347 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


104 


85 


6348 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO; 163. 


158 


77 


6349 


G03786 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7867. 


89 


64 


6350 


AF130089 


Homo sapiens 


PRO2550 


187 . 


69 


6351 


U56732 


Rattus 
norvegicus 


KRAB/zinc finger suppressor protein 1 


161 


64 


6352 


U70932 


Peromyscus 
leucopus 


reverse transcriptase 


126 


61 


6353 


X55687 


Lycopersicon 
esculentum 


extensin (class IT) 


89 


44 


6354 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


107 


57 


6355 

■ 


G00357 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


88 


76 ; 

■ 


6356 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


98 


61 


6357 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


165 


51 


6358 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


214 

• 


58 


6359 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


75 


88 


6360 


X07882 


Homo sapiens 


Po protein 


151 


47 


6361 


Z11592 


Phytophthora 
parasitica 


cytochrome c oxidase subunit 3 




76 


6362 


AF090944 


Homo sapiens 


PRO0663 


130 


46 


6363 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


39 


6364 


AF229067 


Homo sapiens 


PADI-H protein 


155 


75 


6365 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


100 


55 


6366 


AK024501 


Homo sapiens 


FLJ001 10 protein 


282 


100 


6367 


AF 118082 


Homo sapiens 


PRO 1902 


157 


57 


6368 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO:. 
8153. 


151 


63 


6369 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


161 

■ 


96 


6370 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


124 


63 


6371 


AF130079 


Homo sapiens 


PR02852 


297 


77 


6372 


X92485 


Plasmodium 
vivax 


pval 


129 


42 


6373 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


122 


63 


6374 


AF119851 


Homo sapiens 


PR01722 


251 


65 


6375 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


139 


73 


6376 


X70343 


Nicotiana 
sylvestris 


extensin 


104 


35 . 


6377 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7305. 


70 


52 


6378 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


97 


69 


6379 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


109 


51 


6380 


Z38128 


Mus museums 


histone HI 


113 


36 


6381 


U93573 


Homo sapiens 


p40 


212 


43 



299 



"sEqid 

NO: 


t Accession No. 


species 

■ 


besenptfon 


Smith- 

Waterman 

Score 


Ideatit 


6382 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


109 




6383 


AL096744 


Homo sapiens 


hypothetical protein j 


164 


49 


6384 


AF220264 


Homo sapiens 


MOST-1 


119 


56 


6385 


X92485 


Plasmodium 
vivax 


pval 


108 


65 


6386 


X53581 


Rattus 
norvegicus 


ORF4 


148 


36 


6387 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


116 


52 


6388 


M13100 

- 


Rattus 
norvegicus 


unknown protein 


119 


48 


6389 


AB047948 


Macaca 
fascicularis 


hypothetical protein 


80 


33 


6390 


X53581 


Rattus 
norvegicus 


ORF4 


139 ' 


55 


6391 


AK025947 


Homo sapiens 


unnamed protein product 


166 


52 


6392 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


244 


75 


6393 


M22332 


Homo sapiens 


unknown protein 


166 


39 


6394 


AF229067 


Homo sapiens 


PADI-H protein 


128 


59 


6395 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


80 


70 


6396 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


95 


44 


6397 


M26361 


Mus museums 


LINE/lg H-chain fusion protein 


97 


61 


6398 


AF 1 18082 


Homo sapiens 


PRO1902 


93 


48 


6399 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


116 


89 


6400 


U93570 


Homo sapiens 


putative pi 50 


88 


43 


6401 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


147 


61 


6402 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


77 


56 


6403 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


156 


47 


6404 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


155. 


56 


6405 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
IDNO:1051. 


88 


39 


6406 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


84 


68 


6407 


G00560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4641. 


244 


90 


6408 


L27428 


Homo sapiens 


reverse transcriptase 


108 


42 


6409 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


136 


58 


6410 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


106 


44 


6411 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


154 


40 


6412 


AF090931 


Homo sapiens 


PRO0483 


99 


85 


6413 


AL03U86 


Homo sapiens 


bK984Gl.l (supported by FGENES) 


249 


78 


6414 


AF130089 


Homo sapiens 


PRO2550 


100 


55 


6415 


AB047963 


Macaca 
fascicularis 


hypothetical protein 


165 


57 


6416 


M22332 


Homo sapiens 


unknown protein 


111 


42 


6417 


G02832 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6913. 


140 


50 


6418 


AF016099 


Mus museums 


endonuclease/reverse transcriptase 


147 | 60 
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6419 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


194 


84 


6420 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, . 
SEQIDNO:163. 


144 


81 


6421 


U49973 


Homo sapiens 


ORF2: function unknown- 


91 


62 


6422 


G02754 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6835. 


135 


96 


6423 


AF072508 


Homo sapiens 


envelope protein 


128 


49 


6424 


AK000385 


Homo sapiens 


unnamed protein product 


161 


54 


6425 


U49973 

• 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


148 


54 


6426 


X92485 


Plasmodium 
vivax 


pval 


103 


75 


6427 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


183 


80 


6428 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


97 


46 


6429 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


100 


51 


6430 


Y36203 


Homo sapiens 


Human secreted protein #75. 


139 


58 


6431 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


112 


47 


6432 


X92485 


Plasmodium 
vivax 


pval 


83 


72 


6433 


Y35984 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 233. 


121 

< 


59 


6434 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


66 


43 


6435 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


1?2 


51 


6436 


D89501 


Homo sapiens 


similar to salivary prolirie-rich protein 
P-B 


88 


38 


6437 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


113 


58 


6438 


X99467 


Medicago 
truncatula 


ENOD20 


110 


37 


6439 


Y86574 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:491 . 


129 


58 


6440 


X92485 


Plasmodium 
vivax 


pval 

• 


114 


46 


6441 


AK024455 


Homo sapiens 


FLJ00047 protein 


173 


57 


6442 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


85 


51 


6443 


Y36203 


Homo sapiens 


Human secreted protein #75. 


104 


56 


6444 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


138 


50 


6445 


M24732 


Homo sapiens 


lamin-like protein 


71 


45 


6446 


AF119851 


Homo sapiens 


PRO 1722 


163 


69 


6447 


AF126163 


Homo sapiens 


HHLA3 protein 


98 


53 


6448 


G00577 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
4658. 


151 


58 


6449 


AF200187 


cercopithicine 
herpesvirus 15 


EBNA2-like protein 


125 


38 


6450 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


107 


52 


6451 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


114 


54 


6452 


AF090942 


Homo sapiens 


PRO0657 


129 


65 


6453 


M19503 


Homo sapiens 


ORF1; putative 


86 


39 


6455 


X92485 


Plasmodium 
vivax 


pval 


99 


40 
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6456 


X86030 


Vigna . 
unguiculata 


extensin-like protem 


115 


36 


6457 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


113 


61 


6458 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


90 


37 


6459 


AF1 19851 


Homo sapiens 


PR01722 


243 


66 


6460 


J05497 


Rattus 
norvegicus 


snRNP-associated polypeptide N 


101 


31 


6461 


AF1 30089 


Homo sapiens 


PRO2550 


183 


76 


6462 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protem CON-1. 


93 


53 


6463 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


122 


72 


6464 


Y36156 


Homo sapiens 


Human secreted protein #28. 


85 


52 


6465 


AL 160493 


Leishmania 
major 


probable (hhv-6) u 1 102, variant a 
DNA, complete virion genome 


120 


40 


6466 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


94 


36 


6467 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


96 


58 


6468 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


121 


67 


6469 


X92485 


Plasmodium 
vivax 


pval 


89 


72 

-• 


6470 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


128 


56 


6471 


AF130089 


Homo sapiens 


PRO2550 


155 


83 


6472 


U93569 


Homo sapiens 


putative pi 50 


131 


42 


6473 


AF130079 


Homo sapiens 


PR02852 


111 


54 


6474 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


115 


51 


6475 


AF220264 


Homo sapiens 


MOST-1 


111 


58 


6476 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


137 


45 


6477 


AF016099 


Mus museums 


endonuclease/reverse transcriptase 


106 


35 


6478 


X92485 


Plasmodium 
vivax 


pval 


220 


55 


6480 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


112 


61 


6481 


U49973 


Homo sapiens 


ORF2: function unknown 


96 


42 


6482 


U 15647 


Mus museums 


reverse transcriptase 


111 


61 


6483 


X92485 


Plasmodium 
vivax 


pval . 


181 


49 


6484 


L27428 


Homo sapiens 


reverse transcriptase 


135 


50 


6485 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


144 


52 


6486 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


174 


63 


6487 


U93569 


Homo sapiens 


p40 


176 


53 


6488 


Y64890 


Homo sapiens 

■ 


Human 5* EST related polypeptide SEQ 
IDNO.1051. 


72 




42 


6489 


L26953 


Homo sapiens 


chromosomal protein 


93 


33 


6490 


G04072 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
8153. 


125 


57 


6491 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


155 


70 


6492 


U49973 


Homo sapiens 
• 


ORF1 ; MER37; putative transposase 
similar to pogo element 


251 


75 


6493 


U15647 


Mus musculus 


reverse transcriptase 


100 


40 
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6494 


AK027208 


Homo sapiens 


unnamed protein product 


109 


68 


6495 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


52 


6496 


AF1 30089 


Homo sapiens 


PRO2550 


230 


69 


6497 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


237 


62 


6498 


AF090894 


Homo sapiens 


PRO0113 


140 


68 


6499 


Y03000 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 12 1 . 


138 


84 


6500 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


123 


69 


6501 


M69297 


Homo sapiens 


ORF3 


87 


58 


6502 


M22332 


Homo sapiens 


unknown protein 


81 


44 


6503 


X92485 


Plasmodium 
vivax 


pval 


111 


52 


6504 


AF090895 


Homo sapiens 


PRO0117 


131 


60 


6505 


Y73932 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #119. 


97 


69 


6506 


U93568 


Homo sapiens 


putative pi 50 


122 


32 


6507 


X86029 


Vigna 
unguiculata 


extensin-like protein 


177 


32 


6508 


G03790 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
7871. 


282 

• 


60 


6509 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


142 


61 


6510 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


93 

i 


35 


6511 


U52077 


Homosapiens 


mariner transposase 


372 


73 


6512 


G02639 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
6720. 


81 


50 


6513 


AF090928 


Homo sapiens 


PRO0470 


97 


39 


6514 


AF134827 


Monodelphis 
domestica 


small nuclear ribonucieoprotein B' 


108 


47 


6515 


G02832 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
69131 


82 


44 


6516 


AF068754 


Homo sapiens 


heat shock factor binding protein 1 
HSBP1 


118 


69 


6517 


AF1 30089 


Homo sapiens 


PRO2550 


207 


69 


6518 


M22332 


Homosapiens 


unknown protein 


154 


55 


6519 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


95 


39 


6520 


G031I9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7200. 


65 


37 


6521 


AK023563 


Homo sapiens 


unnamed protein product 


268 


58 


6522 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


181 


61 


6523 


AF210651 


Homo sapiens 


NAG 18 


85 


85 


6524 


AF130079 


Homo sapiens 


PR02852 


127 


51 


6525 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


113 


51 


6526 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


85 


62 


6527 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


115 


48 


6528 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


86 


60 


6529 


R59842 


Homo sapiens 


ApoE4Ll protease. 


92 


58 


6530 


R59842 


Homo sapiens 


ApoE4Ll protease. 


135 


77 


6531 


W48351 


Homo sapiens 


Human breast cancer related protein 


88 


60 ' 
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BCRB2. 






6532 


AF119851 


Homo sapiens 


PR01722 


121 


61 


6533 


AF063866 


Melanoplus 

* * 

sanguinipcs 
entoraopoxvirus 


ORF MSV233 hypothetical protein 


85 


35 

1 


6534 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6550. 


123 


59 


6535 


R95913 


Homo sapiens 


Neural thread protem. 


132 


67 


6536 


AF 13 0079 


Homo sapiens 


PR02852 


113 


58 


6537 


AB047600 


Macaca 
fasciculans 


hypothetical protem 


159 


60 


6538 


G03789 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7870. 


279 


68 


6539 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


112 


55 


6540 


AF 130089 


Homo sapiens 


PRO2550 


124 


56 


6541 


U93574 


Homo sapiens 


putative pi 50 


159 


39 


6542 


AF1 18082 


Homo sapiens 


PRO1902 


114 


45 


6543 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


78 


50 


6544 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


118 


44 


6545 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


138 


62 


6546 


AF1 16661 


Homo sapiens 


PR01438 


162 


59 


6547 


S62928 


Homo sapiens 


PRB1M protein precursor 


116 


48 


6548 


AB030192 


Mus musculus 


homologue of S70447 FIFo-ATP 
synthase complex Fo membrane 
domain f subunit 


174 


63 


6549 


AK025116 


Homo sapiens 


unnamed protein product 


136 


56 


6550 


X92485 


Plasmodium 
vivax 


pval 


105 


56 


6551 


AF130079 


Homo sapiens 


PR02852 


172 


91 


6552 


U70935 


Peromyscus 
maniculatus • 


reverse transcriptase 


84 


42 


6553 


S79410 


Mus musculus 

* 


nuclear localization signal binding 
protein 


100 


55 


6554 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


no 


39 


6555 


AF130089 


Homo sapiens 


PRO2550 


205 


64 


6556 


AF130079 


Homo sapiens 


PR02852 


165 


48 


6557 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct 


160 


74 


6558 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


133 


49 


6559 


X92485 


Plasmodium 
vivax 


pval 


121 


78 


6560 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


76 


45 


6561 


AF217374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


234 


85 


6562 


G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7795. 


150 


61 | 


6563 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO: 128. 


134 


84 


6564 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


66 


47 


6565 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


180 


49 
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6566 


AF265575 


Homo sapiens 

• 


ubiquitous TPR-motif protein Y 
isoform 


96 

• 


57 


6567 


X92485 


Plasmodium 
vivax 


pval 


96 


58 


6568 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


103 


73 


6569 


AF119851 


Homo sapiens 


PR01722 


81 


60 


6570 


AF289098 


Cladrastis 
kentukea 


ENOD2 . 


110 


36 


6571 


AF1 18086 


Homo sapiens 


PR01992 


115 


65 


6572 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


142 


57 


6573 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


89 


41 


6574 


M22332 


Homo sapiens 


unknown protein 


95 


34 


6575 

• 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


98 


64 

* 


6576 


AK021848 


Homo sapiens 


unnamed protein product 


131 


41 


6577 


AF161356 


Homo sapiens 


HSPC093 


155 


57 


6578 


AF1 16909 


Homo sapiens 


unknown 


75 


76 


6579 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


95 


53 


6580 


AF1 30089 


Homo sapiens 


PRO2550 


276 


74 


6581 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


119 


33 


6582 


U49973 


Homo sapiens 


ORF2: function unknown 


94 


62 


6583 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


90 


65 


6584 


AJ249395 


Globodera 
pallida 


NADH-ubiquinone oxidoreductase 
subunit 4 


99 


41 


6585 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 1 19. 


75 


62 


6586 


U49973 


Homo sapiens 


ORF2: function unknown 


134 


58 


6587 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


112 


47 


6588 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


217 


53 


6589 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


147 


41 


6590 


U43627 

» 


Arabidopsis 
thaliana 


extensin 


103 " 


31 


6591 


X62677 


Oryctolagus 
cunicuius 


retrovirus related reverse transcriptase 


71 


34 


6592 


Y91577 


Homo sapiens 

< 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


177 


66 


6593 


, G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


100 


72 


6594 


M69297 


Homo sapiens 


ORF3 


137 


43 


6595 


X68101 


Rattus 
norvegicus 


trg 


372 


89 


6596 


G00354 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4435. 


81 


62 


6597 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


99 

....... 


43 


659% 


Z99714 


Homo sapiens 


bKl 048E9.5 (novel protein) 


1642 


98 


6599 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


162 


59 


6600 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


132 


65 


6601 


AK000385 


Homo sapiens 


unnamed protein product 


187 


74 


6602 


AF130051 


Homo sapiens 


PRO0898 


137 


54 


6603 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


88 


48 



305 



WO 01/64835 PCT/US01/04927 



NO: 


i Accession Nn 


* 


Yl*Af*vi ntinn 
jLFcscnpuvu 


Smith- 
Waterman 

Score 


Identif 

y 






i 


terminal, clone XEH.8c} 






6604 


X61047 


Hydra sp. 


mini-collagen 


116 


45 


6605 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. ■ — 


109 


71 


6606 


G02872 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
6953. 


212 


66 


6607 


AF232673 


Homo sapiens 


B17 long form 


1272 


99 


6608 


AF1 18082 


Homo sapiens 


PRO1902 


77 


52 


6609 


R13556 


Homo sapiens 


Protein encoded downstream of hhcjM 
oncoprotein. 


132 


74 


6620 

• 


Y08062 


Homo sapiens 

* * 


Human PRQ245 protein fragment 
derived from DNA35638. 


70 


61 


6611 


Y36203 


Homo sapiens 


Human secreted protein #75. 


140 


62 


6612 


X92485 


Plasmodium 
vivax 


pval 


.157 


72 

• 


6613 


U52077 


Homo sapiens 


mariner transposase 


260 


67 


6614 


AF16I392 


Homo sapiens 


HSPC274 


138 


45 


6615 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


93 


65 


6616 


AF1 19851 


Homo sapiens 


PRO 1722 


147 


49 


6617 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


101 


63 


6618 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


117 

■ 


39 


6619 


Y64890 


Homo sapiens 


Human 5* EST related polypeptide SEQ 
IDNO:1051. 


105 


58 


6620 


U15647 


Mus museums 


reverse transcriptase 


106 


38 


6621 


AF130089 


Homo sapiens 


PRO2550 


171 


52 


6622 


X92485 

■ 


Plasmodium 
vivax 


pval 

■ 


139 


44 


6623 


AF090895 


Homo sapiens. 


PRO0117 


161 


58 


6624 


M63730 


Homo sapiens 


BPAG2 


107 


32 


6625 


G0325S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


383 


61 


6626 


AF078860 


Homo sapiens 


PTD007 


69 


73 


6627 


AL080253 


Arabidopsis 
thaliana 


putative snRNP protein 


101 


37 


6628 


AK000385 


Homo sapiens 


unnamed protein product 


140 


57 


6629 


W60652 


Homo sapiens 


Human DVic-1 (alternative transcript). 


218 


51 


6630 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


152 


66 


6631 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


118 


65 


6632 


X92485 


Plasmodium 
vivax 


pval 

...... 


119 

• 


88 


6633 


X03145 


Homo sapiens 


pot ORF V 


98 


39 


6634 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


165 


41 


6635 


X59448 


Drosophila' 
melanogaster 


dynamin 


111 


39 


6636 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


162 


50 


6637 


AF 130089 


Homo sapiens 


PRO2550 


141 


75 


6638 


AF1 18082 


Homo sapiens 


PRO1902 


108 


63 


6639 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


144 


60 


6640 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


104 


47 


6641 


AF 11 8082 


Homo sapiens 


PRO1902 


121 


53 


6642 


AF1 18082 


Homo sapiens 


PRO1902 


195 


68 


6643 


R95913 


Homo sapiens 


Neural thread protein. 


212 


65 



306 



t • 



WO 01/64835 PCT/US01/04927 



X 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


•A 

Identit 

y 


6644 


AF090928 


Homo sapiens 


PRO0470 


81 


65 


6645 


AF130079 


Homo sapiens 


PR02852 


151 


69 


6646 


AF003535 


Homo sapiens 


ORF2-like protein 


140 


35 


6647 


G03787 


Homosapiens 


Human secreted protein, SEQ ID NO: 
7868. 


169 

■ 


61 


6648 


AF010144 


Homo sapiens 


neuronal thread protem AD7c-NTP 


100 


42 


6649 . 


M15530 


Homo sapiens 


B-cell growth factor 


121 


76 


6650 


AF090942 


Homo sapiens 


PRO0657 


119 


53 


6651 


AF1 18082 


Homo sapiens 


PRO 1 902 


130 


52 


6652 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


143 


87 


6653 


AF1 30089 


Homo sapiens 


PRO2550 


143 


63 


6654 


AF1 19851 


Homo sapiens 


PR01722 


285 


70 


6655 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


103 


75 


6656 


AF1 19855 


Homo sapiens 


PR01847 


61 


60 


6657 


G04072 

i 


Homo sapiens 


Human secreted protein, SEQ ID NO:' 
8153. 


107 


36 


6658 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


84 

■ 


61 

. 


6659 


AF1 19851 


Homo sapiens 


PRO 1722 


177 


54 


6660 


X92485 


Plasmodium 
vivax 


pval 


104 


55 


6661 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


116 


57 


6662 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 

1 * 

>». 


68 


6663 


L22029 


Glycine max 


hydroxyproltne-rich glycoprotein 


115 


35 


6664 


D00570 


Mus musculus 


open reading frame (251 AA) 


126 


35 


6665 


AF130051 


Homo sapiens 


PRO0898 


129 


77 


6666 


G03133 


Homosapiens 


Human secreted protein, SEQ ID NO: 
7214. 


108 


42 


6661 


G02851 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6932. 


71 


41 


6668 


AF130079 


Homo sapiens 


PR02852 


110 


59 


6669 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


113 


66 


6670 


G00527 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4608. 


98 


73 


6671 


G03790 


Homosapiens 


Human secreted protein, SEQ ID NO: 
7871. 


80 


51 


6672 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


113 


36 


6673 


V01201 


Simian sarcoma 
virus 


coding sequence of pol 


143 


47 


6674 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


159 


78 


6675 


G00648 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4729. 


64 


56 


6676 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


96 


48 

. — ._ . . . 


6677 


AF130089 


Homo sapiens 


PRO2550 


100 


75 


6678 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


94 


68 


6679 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. | 

• - - 


105 


71 


6680 


G03812 


Homosapiens 


Human secreted protein, SEQ ID NO: 
7893. 


218 


53 


6682 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


53 
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4433. 






6682 


AF161356 


Homo sapiens 


HSPC093 


163 


47 


6683 


G02538 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6619. 


78 


72 


6684 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


87 


42 


6685 


S79410 


Mus musculus 


nuclear localization signal binding 
protem 


98 


43 


6686 


U93564 


Homo sapiens 


p40 


96 


41 


6687 


AF1 19855 


Homo sapiens 


PR01847 


137 


67 


6688 


U52077 


Homo sapiens 


mariner transposase 


268 


81 


6689 


X92485 


Plasmodium 
vivax 


pval 


97 


58 


6690 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


133 


56 


6691 


M63274 


Plasmodium 
falciparum 


malaria antigen 


61 


46 


6692 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


80 


6693 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


115 


40 


6694 


G02451 


Homo sapiens 1 


Human secreted protein, SEQ ID NO: 
6532. 


131 


50 


6695 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


174 


66 


6696 


AF068294 


Homo sapiens 


HDCMB45P 


154 


59 


6697 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


112 


41 


6698 


M83293 


Simian 

immunodeficienc 
y virus 


vpx protein 


85 


41 


6699 


GO 1246 

< 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


169 


88 


6700 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


121 


58 


6701 


AF1 18080 


Homo sapiens 


PRO1880 


, 127 


59 


6702 


X92485 


Plasmodium 
vivax 


pval 


106 


58 


6703 


AF161356 


Homo sapiens 


HSPC093 


155 


53 


6704 


X92485 


Plasmodium 
vivax 


pval 


164 

_ 


54 


6705 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


172 


59 


6706 


AK025326 


Homo sapiens 


unnamed protein product 


146 


50 


6707 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


112 


43 . 


6708 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


119 


71 


6709 


AF176531 


Mus musculus 


F-box protein FBX16 


324 


60 


6710 


X92485 


Plasmodium 
vivax 


pval 


96 

♦ 


64 


6711 


ATtlAAOn 

AF 13 0089 


Homo sapiens 


PRO2550 


203 


67 


6712 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


150 


59 


6713 


AF090944 


Homo sapiens 


PRO0663 


1 14 


43 


6714 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


110 


47 


6715 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


131 


60 
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6716 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7795. 


103 


65 


6717 


G04072 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
8153. 


115 


46 


6718 


G01222 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5303. 


54 

• 


100 


6719 


Y36203 


Homo sapiens 


Human secreted protein #75. 


129 


78 


6720 


G03789 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7870. 


A AMI 

107 


mW -M 

64 


6721 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


130 


—m « 

61 


6722 


Y36203 


Homo sapiens 


Human secreted protein #75. 


m. ^mmt 

167 


72 


6723 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


46 


6724 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


133 


42 


6725 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


88 


76 


6726 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


218 


61 


6727 


Y91577 

* 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


399 


81 


6728 


U01877 


Homo sapiens 


p300 protein 


119 


27 


6729 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


51 


6730 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


130 


58 


6731 


X66285 


Mus musculus 


HCIORF 


79 


38 


6732 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


114 


38 


6733 


AF057352 


Homo sapiens 


hepatocellular carcinoma autoantigen 


148 


81 


6734 


AF090895 


Homo sapiens 


PRO0U7 


152 


57 


6735 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


61 


6736 


X92485 


Plasmodium 
vivax 


pval 


92 


89 


6737 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


112 


49 


6738 


Y02693 


Homo sapiens 


Human secreted protein encodec 
gene 44 clone HTDAD22. 


I by 


278 


72 


6739 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


191 


47 


6740 


AF 118082 


Homo sapiens 


PRO1902 


83 


49 


6741 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


83 


43 


6742 


Y36708 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 62. 


71 


64 


6743 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


177 


70 


6744 


Y91475 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 25 SEQ ID NO: 148. 


159 


50 


6745 


X92485 


Plasmodium 
vivax 


pval 


112 


39 


6746 


Z82268 


Caenorhabditis 
elegans 


predicted using Geneftnder-contains 
similarity to Pfam domain: PF01391 
(Collagen triple helix repeat (20 
copies)), 8001^73.8, E-value»1.2e-18, 
N=2; PF01484 (Nematode cuticle 
collagen N-terminal domain), 
Score=37.5, E-value=le-07, 
N=l~cDNA EST yk92fl .5 comes from 
this gene-cDNA EST yk55f6.5 comes 


108 


36 


from this gene-cDNA EST yk65b7.3 
comes from mis gene-cDNA EST 
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yk58G.5 comes from mis gene-cDNA 
EST yk72h5.5 comes from mis 
gene-cDNA EST yk55c4.5 comes from 
Jhis gene-cDNA EST yk58hl.5 comes 
from this gene-cDNA EST yk65al2 5 
comes from this gene-cDNA EST 
yk94f4.5 comes from this gene 


■ 

- 


• 


6747 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID "NO: 
7214. 


113 


38 


6748 


R95913 


Homo sapiens 


Neural thread protein. 


108 


56 


6749 


U57747 

- 


Mus museums 


platelet-activating factor 
acetylhydrolase isoform lb beta subunit 


209 


69 


6750 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


159 


65 


6751 


G00637 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
4718. 


96 


48 


6752 


AF090895 


Homo sapiens 


PRO0117 


115 


56 


6753 


AF217973 


Homo sapiens 


unknown 


105 


66 


6754 


AF1 19851 


Homo sapiens 


PRO 1722 


156 


55 


6755 


AF1 19855 


Homo sapiens 


PRO 1847 


189 


66 


6756 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


163 


55 


6757 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


123 


57 


6758 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


156 


70 


6759 


AF130051 


Homo sapiens 


PRO0898 


98 


68 


6760 


M36914 


Zea mays 


cell wall protein (put); putative 


75 


28 


6761 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


98 


85 


6762 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


88 


62 


6763 


Ml 9756 


Homo sapiens 


MHC HLA-Bw47 precursor 


580 


92 


6764 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


115 


52 


6765 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


321 


51 


6766 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


124 


59 


6767 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


303 . 


64 


6768 


AF061340 


Artibeus 
jamaicensis 


cytochrome c oxidase subunit II 


213 


67 


6769 


T71265_cdl 

i 


Homo sapiens 


19-JAN-1996 Neuronal apoptosis 
mhibrtor protem (NAIP) cDNA. 


188 


59 


6770 


B01390 


Homo sapiens 


Neuron-associated protein. 


92 


55 


6771 


AF017777 


Drosophila 
melanogaster 


la costa 


114 

- 


50 


6772 


Z34955 

• 


Sus scrofa 

- 


protein phosphatase 2A 65 kDa 
regulatory subunit, alpha isoform 


331 


93 

* 


6773 


X61296 


Rattus 

* 

norvegicus 


open reading frame 2 


101 


37 


0774 


A T 1 1 T3 A C 

AL1 17345 


Homo sapiens 


aJ4lor2J^ (novel protein) 


oiy 




6775 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


110 


61 


6776 


AJ007798 


Homo sapiens 


stromal antigen 3, (STAG3) 


238 


16 


6777 


AB029065 


Mus museums 


Ser/Thr kinase KKIAMRE-beta 


165 


66 


6778 


M37400 


Homo sapiens 


aspartate aminotransferase 


208 


68 


6779 


AF130089 


Homo sapiens 


PRO2550 


127 


52 


6780 


AF130079 


Homo sapiens 


PR02852 


332 


81 



310 



NO: 


I A <>MtCW\n AIsv 

v\XCc&5tOD llO* 




jLPcscnpnon 


&mifti_ 

Waterman 
Score 


% 

ldentit 

y 


6781 


AL049748 


Homo sapiens 


dJ41P2.2 (RNA binding motif protein 9 
(isofonn 1)) 


264 


60 


6782 


LI 1007 


Rattus 
norvegicus 


cyclin-dependent kinase 4 

■ 


239 

■ 


75 


6783 


AF319938 


Thermus 
thermophilus 


competence factor ComEA 


115 


36 


6784 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 


44 


6785 


L16864 


African swine 
fever virus 


cd2 homologue 


152 


34 


6786 


Y36203 


Homo sapiens 


Human secreted protein #75. 


99 


44 


6787 


AF 130087 


Homo sapiens 


PR02411 


172 


65 


6788 


AF208846 


Homo sapiens 


BM-004 


664 


100 


6789 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


90 


83 


6790 


AP000059 


Aeropyrum 
pernix 


120aa long hypothetical protein 


81 


43 

» 


6791 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


53 


6792 


Y12961 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


175 


65 


6793 


Y73442 

• 


Homo sapiens 


Human secreted protein clone ya66_l 
protein sequence SEQ ID NO: 106. 


565 


93 


6794 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


136 


50 


6795 


AC004022 


Homo sapiens 


serum paraoxonasearylesterase 3 


137 


93 


6796 


AF009668 


multiple 
sclerosis 
associated 
retrovirus 


polyprotein 

i 


194 


60 

• 


6797 


R59842 


Homo sapiens 


ApoE4Ll protease. 


121 


92 


6798 


U88167 


Caenorhabditis 
elegans 


contains similarity to C2 domains 


171 


73 


6799 


AF293024 


Homo sapiens 


steroid receptor RNA activator isofonn 
1 


767 • 


97 


6800 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


65 


6801 


BO 1203 


Homo sapiens 


Human GTPase associated protein-28. 


313 


87 


6802 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


125 


47 


6803 


AF078036 


Homo sapiens 


RelA associated inhibitor, NF-kappaB 
subunit p65 binding protein 


1275 


95 


6804 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


131 


53 


6805 


Y57946 

• 


Homo sapiens 

• 


Human transmembrane protein 
HTMPN-70. 


1128 


100 


6806 


G03714 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


81 


81 


6807 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


129 


52 


6808 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


273 


63 


6809 


K02401 


Homo sapiens 


chorionic somatomammotropin 


446 


95 


6810 


AF130089 


Homo sapiens 


PRO2550 


150 


56 


6811 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


123 


65 


6812 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


93 


38 


6813 


AF090930 


Homo sapiens 


PRO0478 


103 


65 


6814 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


138 


54 
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6815 


B24423 


Homo sapiens 


Human PR01244 protein sequence 
SEQIDNO:193. 


484 


95 


6816 


Ml 1901 


Rattus 
nbrvegicus 


proline-rich salivary protein 


105 


43 


6817 


AF0I0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


96 


56 


6818 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


102 


48 


6819 


U82303 


Homo sapiens 


unknown 


105 


57 


6820 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


113 


45 


6821 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


97 


39 


6822 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


146 


86 


6823 


U52077 


Homo sapiens 


mariner transposase 


192 


78 


6824 


AC004237 


Homo sapiens 


KIF3 1 


129 


86 


6825 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


138 


66 


6826 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


112 


64 


6827 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


108 


68 


6828 


U49974 


Homo sapiens 


mariner transposase 


103 


86 


6829 


AF119851 


Homo sapiens 


PR01722 


115 


53 


6830 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


82 


62 


6831 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


94 

it 


73 


6832 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


131 


53 


6833 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


74 


50 


6834 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


168 


61 


6835 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


140 


60 


6836 


X92485 


Plasmodium 
vivax 


pval 


99 


58 


6837 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


147 


42 


6838 


AB047600 


Macaca 
fascicularis 


hypothetical protein 

_ ... 


134 


52 


6839 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


60 


6840 


G00376 


Homo sapiens 

» 


Human secreted protein, SEQ ID NO: 
4457. 


126 


73 


6841 


AF130089 


Homo sapiens 


PRO2550 


167 


63 


6842 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


140 


56 


6843 


AF090894 


Homo sapiens 


PRO0113 


148 


62 


6844 


G00397 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
4478. 


119 


51 


6845 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


75 


61 


6846 


G03295 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


65 


44 


6847 


AF130089 


Homo sapiens 


PRO2550 


160 


76 


6848 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


132 


68 


6849 


AF1 16715 


Homo sapiens 


PR02829 


123 


67 


6850 


Y01437 


Homo sapiens 


Secreted protein encoded by gene 55 


142 


96 



312 



"t it A Vn 
NO: 


kArreeciftn Nn 




JLPCSCripUUQ 


Waterman 
Score 


Idea tit 

y 




t 


* 


cloneHLMBP18. 






6851 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


110 


76 


6852 


AF068294 


Homo sapiens 


HDCMB45P 


161 


56 


6853 


G02532 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
6613. 


111 


57 


6854 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


165 


72 


6855 


AF1 30051 


Homo sapiens 


PRO0898 


117 


61 


6856 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO:* 
7879. 


138 


72 


6857 


X55685 


Lycopersicori 
esculentum 


extensin (class I) 


190 


45 


6858 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) u 1 1 02, variant a dna, 
complete virion genome. 


164 


69 


6859 


AF030144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


90 


52 


6860 


AF 119851 


Homo sapiens 


PRO 1722 


331 


73 


6861 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4499. 


130 


67 


6862 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


114 


53 


6863 

- 


X92485 


Plasmodium 
vivax 


pval 

■ 


105 


63 


6864 


AF068294 


Homo sapiens 


HDCMB45P 


206 


6\ 


6865 


AF068294 


Homo sapiens 


HDCMB45P 


128 


46 


6866 


L00016 


Homo sapiens 


urf4 


157 


58 


6867 


X92485 


Plasmodium 
vivax 


pval 

* 


95 


89 


6868 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


148 


77 


6869 


AF130089 


Homo sapiens 


PRO2550 


162 


67 


6870 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


257 


65 


6871 


X71972 


Mus museums 


ribosomal protein S24 


211 


85 


6872 


M28016 


Homo sapiens 


cytochromeb 


72 


93 


6873 


M24069 


Homo sapiens 


DNA-binding protein A 


1662 


79 


6874 


AF130051 


Homo sapiens 


PRO0898 


163 


76 


6875 

♦ 


AF006061 


Homo sapiens 


placental growth hormone isoform 
hGH-V3 


263 


90 


6876 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


94 


80 


6877 


AF 130079 


Homo sapiens 


PR02852 


130 


76 


6878 


S80905 


Homo sapiens 


Conl 


148 


38 


6879 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


70 


35 


6880 


U16359 


Rattus 

■ 

norvegicus 


nitric oxide synthase 


106 


94 


6881 


AK025047 


Homo sapiens 


unnamed protein product 


167 


50 


6882 


X92485 


Plasmodium 
vivax 


pval 


no 


63 


6883 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


185 


46 


6884 


AK024455 


Homo sapiens 


FLJ00047 protein 


81 


64 


6885 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


113 


47 


6886 


AF014008 


Bostaurus 


myocardial vascular inhibition factor 


158 


96 


6887 


AF265575 


Homo sapiens 


. ubiquitous TPR-motif protein Y 
isoform 


183 


83 


6888 


Y45382 


Homo sapiens | 


Human secreted protein fragment 


199 


49 
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SEQ U> 
NO: 


Accession No. 


Specie* 


Description 


Smith- 
Waterman . 
Score 


Identit 

y 


■ 


• 




encoded from gene 28. 






6889 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


140 


67 


6890 


AF130051 


Homo sapiens 


PRO0898 


135 


65 


6891 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


147 


65 


6892 


X72963 


Nicotiana 
tabacum 


pAP8 product 

• 


80 


44 


6893 


AF073957 


Homo sapiens 


CXC chemokine BRAK 


399 


82 


6894 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


160 


80 


6895 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


150 


70 


6896 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


178 


85 


6897 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


142 


62 


6898 


AF1 18082 


Homo sapiens 


PRO1902 


144 


53 


6899 


AF130051 


Homo sapiens 


PRO0898 


187 


72 


6900 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


121 


85 


6901 


AF130089 


Homo sapiens 


PRO2550 


100 


63 


6902 


Z29446 


Hepatitis C virus 


core region 


60 


36 


6903 


AF 107406 


Homo sapiens 


GW128 


71 


63 


6904 


W03627 


Homo sapiens 


Human follicle stimulating hormone 
GPR N-terminal sequence. 


173 


64 


6905 


AF 130089 


Homo sapiens 


PRO2550 


8§ 


65 . 


6906 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


78 


84 


6907 


U93569 


Homo sapiens 


p40 


145 


48 


6908 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


186 


50 


6909 


AF130089 


Homo sapiens 


PRO2550 


244 


77 


6910 


Ml 8093 


Phaseolus 
vulgaris 


h ydroxyproline-rich glycoprotein 


119 


38 


6911 


AF 130089 


Homo sapiens 


PRO2550 


139 


47 


6922 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


146 

• 


60 


6913 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


131 


70 


6914 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


106 


74 


6915 


AF161361 


Homo sapiens 


HSPC098 


79 


66 


6916 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


207 

- 


60 


6917 


AF1 30051 


Homo sapiens 


PRO0898 


258 


61 


6918 


G03800 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


108 


65 


6919 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


58 


6920 


G02635 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6716. 


168 


80 


6921 


AF1 16661 


Homo sapiens 


PR01438 


71 


48 


6922 


AF090940 


Homo sapiens 


PRO0644 


72 


85 


6923 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


186 


70 


6924 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


89 


55 


6925 


AF229067 


Homo sapiens 


PADI-H protein 


122 


72 
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* 
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seO Id 

NO: 


Accession No. 


Soecics 


bescrintion 

Wlvl m mm mm %fmM 


Smith- 
Waterman . 
Score 


% 

Idcntit . 

y 


6926 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


105 

• 


37 


6927 


G03800 


Homo sapiens 


Human secreted protein, SEQ BO NO: 
7881. 


173 


60 


6928 


X56603 


Mus musculus 


mouse 57-KD Calcium-binding protein 
(MCaBP) 


103 


52 


6929 


AF1 19851 


Homo sapiens 


PRO 1 722 


148 


76 


6930 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


61 


6931 


AF077208 


Homo sapiens 


HSPC022 . 


155 


72 


6932 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


111 


62 


6933 


M22332 


Homo sapiens 


unknown protein 


154 


77 


6934 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


146 


60 


6935 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


168 


63 


6936 


G00376 

• 


Homo sapiens 

V 


Human secreted protein, SEQ ID NO: 
4457. 


124 


50 

- 


6937 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


99 


77 


6938 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


111 


68 


6939 


AF1 18082 


Homo sapiens - 


PRO1902 


120 


54 


6940 


AF090942 


Homo sapiens 


PRO0657 


149 


72 


6941 


M22332 


Homo sapiens 


unknown protein 


96 


51 


6942 


AF130087 


Homo sapiens 


PR02411 


132 


58 


6943 


Ml 5530 


Homo sapiens 


B-cell growth factor 


V 


52 


6944 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


87 


52 


6945 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


73 


66 


6946 


Z10561_cdi 

• 


Homo sapiens 


12-MAR-1998 DNA encoding G 
protein-coupled receptor protein 
designated SREB2. 


330 


96 


6947 


W53869 


Homo sapiens 


Gravin polypeptide fragment, residues 
1526-1780. 


170 


89 


6948 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


164 


86 


6949 


AF1 18086 


Homo sapiens 


PR01992 


117 


86 


6950 


AF090928 


Homo sapiens 


PRO0470 


115 


58 


6951 


AF130089 


Homo sapiens 


PRO2550 


124 


69 


6952 


Y38397 


Homo sapiens 


Human secreted protein encoded by 
gene No. 12: 


94 


64 


6953 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


132 


55 


6954 


X92485 


Plasmodium 
vivax 


pval 


108 


64 


6955 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


117 


85 


6956 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


93 


53 


6957 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


106 


50 


6958 


AF130089 


Homo sapiens 


PRO2550 


127 


67 


6959 


Y02749 

• 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


119 


49 


6960 


J02459 


bacteriophage 
lambda 


U (tail component; 131) 


492 


92 


6961 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


180 


68 
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SEQ ID 

NO: 


\ Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identft 

y 


6962 


i 

Z66490 


• 

Salmo tratta 


7214. 

slow myotomal muscle tropomyosin 


182 


72 


6963 


AF 130089 


Homo sapiens 


PRO2550 


108 


73 


6964 


W21733 


Homo sapiens 


NIP-1 encoded by clone 59. 


131 


69- 


6965 


AF161356 


Homo sapiens 


HSPC093 


58 


48 


6966 


AB047934 


Macaca 
fascicularis 


hypothetical protein 


146 


58 


6967 


X92485 


Plasmodium 
vivax 


pval 


116 


55 


6968 


AF 118082 


Homo sapiens 


PRO1902 


85 


65 


6969 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


176 


63 


6970 


AF1 16712 


Homo sapiens 


PR02738 


101 


56 


6971 


X96667 

• 


Homo sapiens 


beta-galactoside alpha-2,3- 
sialyltransferase 


172 


97 


6972 


Y86574 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:491. 


150 


82 


6973 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


176 


58 


6974 


AF162149 


Mycoplasma . ' 
bovis 


variable surface lipoprotein 


143 


40 

■ 


6975 

- 


W17971 

< 


Homo sapiens 


RAC protein kinase C-terminal binding 
protein C-terminal region. 


135 


81 


6976 


AF 130089 


Homo sapiens 


PRO2550 


345 


78 


6977 


AF 130089 


Homo sapiens 


PRO2550 


195 


63 


6978 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


75 


6979 


M60510 


Mus musculus 


beta-D-galactosidase fusion protein 


96 


47 


6980 


AF 118086 


Homo sapiens 


PR01992 


131 


64 


6981 


AF130089 


Homo sapiens 


PRO2550 


132 


64 


6982 


L27428 


Homo sapiens 


reverse transcriptase 


128 


35 


6983 


U93566 


Homo sapiens 


p40 


98 


3! 


6984 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


138 


78 


6985 


L27428 


Homo sapiens 


reverse transcriptase 


133 


67 


6986 


AF090895 


Homo sapiens 


PRO0117 


139 


62 


6987 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4609. 


212 


60 


6988 


Y36203 


Homo sapiens 


Human secreted protein #75. 


189 


73 


6989 


AF1 16712 


Homo sapiens 


PR02738 


87 


42 


6990 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


63 


6991 


AF090930 


Homo sapiens 


PRO0478 


154 


52 


6992 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


102 


76 


6993 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


151 


48 


6994 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


113 


47 


6995 

• 


J02974 


Acanthamoeba 
castellanii 


myosin IB heavy chain 


120 

ii i 


43 


6996 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


151 


69 


6997 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


153 


60 


6998 


Y12961 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


114 


66 


6999 


AF1 13685 


Homo sapiens 


PRO0974 


88 


58 


7000 


AF 130089 


Homo sapiens 


PRO2550 


146 


58 
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SEO ID 


Accession No* 


1 Snecies 


Dtscriotioa 

m\r V*r • * mm ™ • 


Smith- 




NO: 






Waterman 


Identit 










Score 


y 


7001 


U93565 


Homo sapiens 


putative pi 50 


96 


46 


7002 


Y02749 


Homo sapiens 


Human secreted protein encoded by . 


146 


57 






■ 


gene 100 clone HNFIU96. 






7003 


Y91577 


Homo sapiens 


Human secreted protein sequence 


206 


92 








encoded by gene 2 SEQ ID NO:250. 






7004 


AF130089 


Homo sapiens 


PRO2550 


104 


50 


7005 


AL 137472 


Homo sapiens 


hypothetical protein 


179 


55 


7006 


Y91577 


Homo sapiens 


Human secreted protein sequence 


379 


98 








encoded by gene 2 SEQ ID NO:250. • 




■ 


7007 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


38 


- 






7871. 






7008 


AF1 19851 


Homo sapiens 


PR01722 


136 


63 


7009 


AF2 18028 


Homo sapiens 


unknown 


104 


52 


7010 


AF130079 


Homo sapiens 


PR02852 


129 


60 


7011 


Y64890 


Homo sapiens 


Human 5* EST related polypeptide SEQ 


110 


54 








EDNO:1051. 






7012 


X03145 


Homo sapiens 


pot ORF VI 


130 


65 


7013 


AF090944 


Homo sapiens 


PRO0663 


171 


75 


7014 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


122 


65 








gene 22 clone HMSJW1 8. . . 


■ 


. ■ 


7015 


M26460 


Homo sapiens 


retinoblastoma 1 


113 . 


37 


7016 


X92485 


Plasmodium 


pval 


112 


56 






vivax 








7017 


X92485 


Plasmodium 


pval 


144 


57 






vivax 


■ 






7018 


AF130079 


Homo sapiens 


PR02852 


224 


74 


7019 


Y02785 


Homo sapiens 


Human secreted protein encoded by 


129 


43 








gene 51 clone HUKEX85. 






7020 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


58 






• 


7795. 






7021 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


106 


69 






■ 


encoded by gene 90. 






7022 


AF090901 


Homo sapiens 


PRO0195 


103 


88 


7023 


AF1 16661 


Homo sapiens 


PR01438 


120 


47 


7024 


AP002460 


Arabidopsis 


gene_i±FlD9.26~unknown protein 


325 


60 






thaliana 


1 " r 




■ 


7025 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


126 


63 








derived protein #153. 


■ 




7026 


AF 119900 


Homo sapiens 


PR02822 


111 


70 


7027 


AF130051 


Homo sapiens 


PRO0898 


133 


44 


7028 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


179 


68 








derived protein #153. 




■ 


7029 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


72 


60 


7030 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


50 








6613. 






7031 


J03941 


Mus musculus 


ferritin heavy chain 


124 


57 


7032 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


59 








7795. 






7033 


P70494 


Homo sapiens 


Sequence of human B-cell growth 


120 


65 








factor (BCGF). 


• 




7034 


Y36156 


Homo sapiens 


Human secreted protein #28. 


164 


48 


7035 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


116 


48 


7036 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


130 


52 








clone HPMBQ32. 






7037 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


55 








6953. 






7038 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


68 








6619. 






7039 


AF130089 


Homo sapiens 


PRO2550 


121 


65 
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7040 


AF1 19855 


Homo sapiens 


PR01847 


158 


65 


7041 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


116 


62 


7042 

*> 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


117 


54 


7043 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


123 


80 


7044 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144. 


63 


7045 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


86 


44 


7046 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


80 


72 


7047 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


100 


57 


7048 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619! 


152 


66 


7049 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


94 


72 


7050 


AF176323 


Macropus 
eugenii 


small nuclear ribonucleoprotein B' 


103 . 


41 


7051 


AF255446 


Crypthecodinium 
cohnii 


Dip 1 -associated protein C 


127 


38 


7052 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


149 


42 


7053 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


147 


69 


7054 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


109 


54 


7055 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


185 


86 


7056 


U60269 


Homo sapiens 


putative envelope protein; orf similar to 
env of Type A and Type B retroviruses 
and to class II HERVs 


125 

■ 


65 


7057 


AF068294 


Homo sapiens 


HDCMB45P 


264 


73 


7058 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


190 


95 


7059 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


140 


62 


7060 


AF090894 


Homo sapiens 


PRO0113 


185 


62 


7061 


AF107406 


Homo sapiens 


GW128 


66 


80 


7062 


W50193 


Homo sapiens 

■ 


Amino acid sequence of salivary 
protein CON-2. 


86 

■ 


46 


7063 


AF119851 


Homo sapiens 


PR01722 


117 


74 


7064 


X92485 


Plasmodium 
vivax 


pval 


123 


66 


7065 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


79 


59 


7066 


L23545 


Homo sapiens 


putative 


135 


44 


7067 


AF1 19851 


Homo sapiens 


PR01722 


301 


69 


7068 


AF130079 


Homo sapiens 


PR02852 


118 


71 


7069 


AF119851 


Homo sapiens 


PR01722 


150 


64 


7070 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


138 


48 


7071 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyprolinevrich 
glycoprotein 


94 


39 


7072 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


145 


62 


7073 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


44 



318 



, . t 

, 1 • 

■ 

* , J 

* * ' 

WO 01/64835 PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman • 
Score 


•A 

Identit 

y 








4478. 






7074 


AF 130089 


Homo sapiens 


PRO2550 


146 


60 


7075 


U49974 


Homo sapiens 


manner transposase 


198 


83 


7076 


AF23608fL 


Homo sapiens 


CYP4F1 1 


427 


79 


7077 


AF 130089 


Homo sapiens 


PRO2550 


159 


52 


7078 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


206 


85 


7079 


D32202 


Homo sapiens 


alpha 1 C adrenergic receptor isoform 2 


152 


71 


7080 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


88 


62 


7081 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


135 


46 


7082 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


142 


75 


7083 


X53581 


Rattus 
norvegicus 


ORF4 


119 


64 


7084 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


111 


62 


7085 


G037I4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


64 


7086 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


71 


66 


7087 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


81 


54 


7088 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


148 


50 


7089 


U08021 


Homo sapiens 


nicotinamide N-methyltransferase 


M« 


100 


7090 


U93569 


Homo sapiens 


p40 


199 


82 


7091 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


124 


56 


7092 


X92485 


Plasmodium 
vivax 


pval 


125 


60 


7093 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


109 

■ 


62 

■ 


7094 


AF1 18082 


Homo sapiens 


PRO1902 


191 


50 


7095 


X92485 


Plasmodium 
vivax 


pval , 


129 


67 


7096 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


80 


66 


7097 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


162 


67 


7098 


AF1 16715 


Homo sapiens 


PR02829 


123 


63 


7099 


L27428 


Homo sapiens 


reverse transcriptase 


189 


50 


7100 


G02532 


Homo sapiens : - 


Human secreted protein, SEQ ID NO: 
6613. 


84 


48 


7101 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


65 


7102 


D85030 


Schizosaccharom 
yces pombe 


ribosomal protein S33 homolog 


139 


85 


7103 


G03021 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7102. 


177 


77 


7104 


U72543 


Sus scrofa 


ubiquirin~like/S30 ribosomal fusion 
protein 


129 


100 


7105 


AC007787 


AA 187-502 


NFI-X3=transcription factor 


190 


84 


7106 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


110 


38 


7107 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


157 ■ 


56 


7108 

< 


Z69381 


Saccharomyces 
cerevisiae 


N1114 


450 


91 
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7109 


V49572_cdl 


Homo sapiens 


13-NOV-1996 Human stomach cancer 
clone HP10122cDNA#l. 


353 


mm 0^ 

70 


7110 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON- 1. 


99 


45 


7111 


AF068294 


Homo sapiens 


HDCMB45P 


132 


55 


7112 


AB033020 


Homo sapiens 


KIAA1 194 protein 


A ^ m A 

850 


100 


7113 


AB019692 


Homo sapiens 


protein kinase PKNbeta 


792 


87 


7114 


M12140 


Homo sapiens 


envelope protein 


154 


79 


7115 


Y27854 

■ 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


88. 


62 


7116 


AF 124491 


Homo sapiens 


ARF GTPase-activating protein GIT2 


1216 


100 


7117 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


153 


54 


7118 


AF130089 


Homo sapiens 


PRO2550 


184 


64 


7119 


AF1 30079 


Homo sapiens 


PR02852 


144 


68 


7120 


AL3 59782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


112 


51 


7121 


AF032384 


Xenopus laevis 


metaBoprotease-disintegrin 


304 


76 


7122 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


99 


71 


7123 


AJ238706 


Drosophila 
melanogaster 


monocarboxylate transporter 1 
homologue 


161 


38 


7124 


X92485 


Plasmodium 
vivax 


pval 

- 


127 


54 


7125 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


80 


65 


7126 


AF1 16712 


Homo sapiens 


PR02738 


U8 


53 


7127 


G03790 

• 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7871. 


115 


64 


7128 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


90 


62 


7129 


AF1 18090 


Homo sapiens 


PRO2044 


280 


80 


7130 


AF 129085 


Homo sapiens 


carboxy terminus of Hsp70-interacting 
protein 


593 


91 


7131 


AF054840 


Homo sapiens 


tetraspan TM4SF; Tspan-3 


513 


83 


7132 


L27428 


Homo sapiens 


reverse transcriptase 


156 


51 


7133 


AK001647 


Homo sapiens 


unnamed protein product 


2334 


99 


7134 


AF130051 


Homo sapiens 


PRO0898 


142 


84 


7135 


X53581 


Rattus 
norvegicus 


ORF4 


213 


46 


7136 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


101 


86 

• 


7137 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


194 


68 


7138 


U21634 


Onychomys 
torridus 


cytochrome c oxidase subunit III 


111 


84 


7139 


X60592 


Homo sapiens 


CDw40 


201 


95 


7140 

■ 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


63 


7141 


M76979 


Homo sapiens 


pigment epithelial-differentiating factor 


458 


93 


7142 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


142 


54 


7143 


Y27677 


Homo sapiens 


Human secreted protein encoded by 
gene No. 111. 


323 


90 


7144 


AF116712 


Homo sapiens 


PR02738 


132 


60 


7145 


AB024057 


Homo sapiens 


vascular Rab-GAP ABC-containing 
protein 


121 


45 


7146 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


144 


65 
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7147 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


141 


76 


.7148 


R06400 


Homo sapiens 


Lambda gtl0cb2 encoded C-terminal 
alpha amidating enzyme. 


337 


85 


7149 


S87759 


Homo sapiens 


protein phosphatase 2C alpha, 
PP2Calpha 


427 


78 


7150 


M12140 


Homo sapiens 


envelope protein 


541 


64 


7151 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


106 


85 


7152 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


70 


38 


7153 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


194 


61 


7154 


AF130089 


Homo sapiens 


PRO2550 


89 


60 


7155 


AF 100742 


Homo sapiens 


M-phase phosphoprotein homolog 


714 


94 


7156 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


143 


78 


7157 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


174 


79 


7158 


AL031230 


Homo sapiens 


dJ73M23.3(KJAA0319) 


214 


45 


7159 


AF041330 


Bodo saltans 


NADH dehydrogenase summit 5 


155 


37 


7160 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


129 


46 


7161 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


150 


53 


7162 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


107 


55 


7163 


M13043 


Mus musculus 


alpha-l (IV) collagen 


80 


42 


7164 


AB0M004 


Homo sapiens 


UDP-N-acetylglucosam ine 
pyrophosphorylase 


257 


90 


7165 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1& 


111 


47 


7166 


AF1 19851 


Homo sapiens 


PR01722 


279 


61 


7167 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


136 


51 


7168 


W73327 

■ 


Homo sapiens 


Bipolar affective disorder associated 
gene feh 16 product. 


287 


63 


7169 


Y35946 

* 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 195. 


1231 


99 


7170 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


132 


76 


7171 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


184 


60 


7172 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


141 


64 


7173 


AF129507 


Homo sapiens 


transcription factor ICBP90 


245 


55 


7174 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


131 


56 


7175 


AF161356 


Homo sapiens 


HSPC093 


107 


55 


7176 


AF2 18028 


Homo sapiens 


unknown 


116 


55 


7177 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6735. 


117 


52 


7178 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


94 


38 


7179 


AF130089 


Homo sapiens 


PRO2550 


124 


47 


7180 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


109 


66 


7181 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


65 


7182 


Y36156 


Homo sapiens 


Human secreted protein #28. 


127 


62 
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7183 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

M* m+ m% A 

6688. 


121 


75 


7184 


X92485 


Plasmodium 
vivax. 


pval 


129 


60 


7185 


m ^N>B m mm. mm. 0m. mm. mm. 

AF 130089 


Homo sapiens 


W% m+^ mm* mm mm mm 

PRO2550 


116 


53 


mm ^% m* 

7186 


AF130089 


Homo sapiens 


"V^-WK jp*h —mx Amy mU m± 

PRO2550 


137 


64 


7187 


X51760 


Homo sapiens 


zinc finger protein (583 AA) 


236 


56 


7188 


X92485 


Plasmodium 
vivax 


pval 


131 


50 


7189 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


152 


57 


7190 


AF068294 


Homo sapiens 


HDCMB45P 


163 


50 


7191 


AF1 19855 


Homo sapiens 


PRO 1847 


55 


50 


7192 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


142 


57 


7193 


M15530 


Homo sapiens 


B-cell growth factor 


106 


53 


7194 


G03411 

i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


121 


41 


7195 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


124 


60 


7196 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


94 


68 


7197 


U60315 


Molluscum 
contagiosum 
virus subtype 1 


MC054L 

- 


116 

* 


36 


7198 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


116 

i 


67 


7199 


Ml 5530 


Homo sapiens 


B-celi growth factor 


146 


59 


7200 


AF090894 


Homo sapiens 


PRO0113 


165 


65 


7201 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


160 


57 


7202 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


129 


53 


7203 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


92 


65 


7204 


AF126163 


Homo sapiens 


HHLA3 protein 


131 


78 


7205 


X92485 


Plasmodium 
vivax 


pval 


116 


42 


7206 


AF090931 


Homo sapiens 


PRO0483 


136 


83 


7207 


Y36156 


Homo sapiens 


Human secreted protein #28. 


127 


49 


7208 


Y12961 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


158 


55 

■ 


7209 


Y36203 


Homo sapiens 


Human secreted protein #75. 


124 


56 


7210 


M69008 


Homo sapiens 


alpha- 1 type XIII collagen 


90 


34 


7211 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


71 


7212 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


133 


60 


7213 


AF 130089 


Homo sapiens 


PRO2550 


163 


86 


7214 


AF130089 


Homo sapiens 


PRO2550 


111 


52 


7215 


AF1 18082 


Homo sapiens 


PRO1902 


150 


63 


mmjm^ « ^ 

7216 


w m* mm mm. +m m+ mm. 

Y53663 


Homo sapiens 


A human B-cell growth factor related 
protein (BGFRP). 


mm. 

83 


^m j j 

66 


7217 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


68 


7218 


AF220264 


Homo sapiens 


MOST-1 


123 


66 


7219 


AF229067 


Homo sapiens 


PADI-H protein 


121 


52 


7220 


AF1 18086 


Homo sapiens 


PRO 1 992 


130 


61 


7221 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


179 


100 
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* 


SEQ ID N0.163. 






, 7222 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


154 




7223 _ 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


200 


95 


7224 


L27428 


Homo sapiens 


reverse transcriptase 


127 


55 


7225 


AB008227 


Adiantum 
capillus-veneris 


Extensin 


121 


43 


7226 


D38112 


Homo sapiens 


. cytochrome c oxidase subunit 1 


113 


78 


7227 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 1 
SEQIDNO:163. 


153 


96 


7228 

■ 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


112 


74 


7229 


Ml 5530 


Homo sapiens 


B-cell growth factor 


137 


44 


7230 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit IE 


140 


87 


7231 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


162 


46 


7232 

- 


AP000616 

• 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


123 


90 


7233 


XI 5769 


Musmusculus 


U 1 RNA-associated 70-kDa protein 


105 


29 


7234 


Ml 5530 


Homo sapiens 


B-celJ growth factor 


113 


61 


7235 


AF194537 


Homo sapiens 


NAG 13 


249 


65 


7236 


R20313 


Homo sapiens 


Gly(5) beta-globin mutant 


296 


80 


7237 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


123 

• 


50 

■ 


7238 


Y36156 


Homo sapiens 


Human secreted protein #28. 


97 


80 


7239 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


395 


81 


7240 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


122 


70 


7241 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


123 


90 


7242 


AF090942 


Homo sapiens 


PRO0657 


118 


47 


7243 


AF119851 


Homo sapiens 


PRO 1722 


112 


62 


7244 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


100 


53 


7245 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


144 


66 


7246 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


129 


78 


7248 


X92485 

.... . 


Plasmodium 
vivax 


pval 


108 


58 


7249 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


170 


74 


7250 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


153. 


70 


7251 


U01849 


Trypanosoma 
bmcei 


ORF2 


93 


39 


7252 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


113 


54 


7253 


G03473 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7554. 


110 


83 


7254 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


167 


66 


7255 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


179 


83 


7256 


AF130079 


Homo sapiens 


PR02852 


147 


70 


7257 


AF 118082 


Homo sapiens 


PRO1902 


80 


45 


7258 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


137 


55 


7259 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


90 


65 
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DpcClcS 


jjcscnpuon 


o uii in* 


OA 
SO 


NO: 






Waterman 


Identit 










Score 


y 


7260 


X92485 


Plasmodium 


pval 


105 


85 






vivax 


* 


• 




7261 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


76 




■ 


• 


7879. 






7262 


AF 130089 


Homo sapiens 


PRO2550 


98 


43 


7263 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


114 


60 




* 




gene 22 clone HMSJW18. 






7264 


AFl 16661 


Homo sapiens 


PRO 143 8 


139 . 


67 


7265 


AF104021 


Bovine viral 


polyprotein 


161 


40 




* 


diarrhea virus-2 








7266 


AL1 60493 


Leishmania 


probable (hhv-6) ul 102, variant a 


116 


43 






major 


DNA, complete virion genome 






7267 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


112 


62 




■ 




gene 44 clone HTDAD22. 




■ 


7268 


Ml 5530 


Homo sapiens 


B-cell growth factor 


154 


60 


7269 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


54 






■ 


7437. 




* 


7270 


AFU8082 


Homo sapiens 


PRO 1902 


139 


62 


7271 


S52010 


Mus sp. , 


orfl 5' of EpoR 


93 


33 


7272 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


50 








6913. 


- 




7273 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


61 








7795. 






7274 


AF130089 


Homo sapiens 


PRO2550 


124 


43 


7275 


X92485 


Plasmodium 


pval 


147 


44 






vivax 


- 






7276 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


»J 


59 








4438. 






7277 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


67 








4488. 






7278 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


67 






• 


7253. 






7279 


AF130087 


Homo sapiens 


PR02411 


72 


65 


7280 


Y40039 


Homo sapiens 


Peptide sequence derived from a 


130 


69 




■ 




human secreted protein. 






7281 


J03507 


Homo sapiens 


complement protein C7 precursor 


357 


81 


7282 


AFl 16661 


Homo sapiens 


PR01438 


117 


48 


7284 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


61 








6613. 






7285 


AFl 16715 


Homo sapiens 


PR02829 


128 


75 


7286 


AF130079 


Homo sapiens 


PR02852 


122 


52 


7287 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


120 


50 








oncoprotein. 






7288 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


338 


75 








7870. 






7289 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


114 


72 








encoded by gene 17. 






7290 


W48351 


Homo sapiens 


Human breast cancer related protein 


146 


59 








BCRB2. 




■ 


7291 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


58 








6619. 






7292 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


167 


61 








6613. 






7293 


X92485 


Plasmodium 


pval 


122 


71 






vivax 








7294 


AF130089 


Homo sapiens 


PRO2550 


87 


62 


7295 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


50 








7870. 






7296 


X92485 


Plasmodium 


pval 


152 


75 
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l/^OU IUIIUU 


Smith- 
Wafer man . 
Score 


Identft 

y 






vivax 








7297 


AF090894 


Homo sapiens 


PRO0113 


172 


65 


7298 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


121 


61 

♦ 


7299 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 1 02, variant a dna, 
complete virion genome. 


114 


66 


7300 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


228 


80 


7301 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


111 


73 


7302 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


119 


46 


7303 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


168 


76 


7304 


U58658 


Homo sapiens 


unknown 


102 


48 


7305 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


118 


53 


7306 


G03790 


Homo sapiens 

i 


Human secreted protein, SEQ ID NO: 
7871. 


116 

■ 


63 


7307 


G00377 

< 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


149 


58 


7308 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


112 


52 


7309 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


115 


54 


7310 


M15530 


Homo sapiens 


B-cell growth factor 


131 


54 


7311 


AF 118082 


Homo sapiens 


PRO 1902 


1<?8 


46 


7312 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


412 


92 


7313 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


326 


74 


7314 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


149 


59 


7315 


AF289099 


Maackia 
amurensis 


ENOD2f 


107 


32 


7316 


U17009 


Phytophthora 
infestans 


SecY-independent transporter protein 


135 


41 


7317 


AF220264 


Homo sapiens 


MOST T l 


133 


72 


7318 


U49973 


Homo sapiens 


ORF2: function unknown 


111 


53 


7319 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


274 


80 


7320 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


79 


56 ' 


7321 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


205 


95 


7322 


AC005175 

■ 


Homo sapiens 


TA2R HUMAN, BETA ISOFORM; 
TXA2-R; PROSTANOID TP 
RECEPTOR 


127 


57 


7323 


R48296 


Homo sapiens 


Human PGF-2/NT-3. 


89 


93 


7324 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJV/18. 


157 


60 


7325 


AF1 19855 


Homo sapiens 


PR01847 


172 


66 


7326 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


141 


50 


7327 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


56 


7328 


Y40039 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


98 


56 


7329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


161 


64 


7330 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


HI 


72 
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NO: 


< Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Identit 

y 


7331 


AF126163 


Homo sapiens 


HHLA3 protein 


98 


47 


, 7332 


y—H yv% y% £■ f% 

G03258 


Homo sapjens 


V Y » « y*w^ y^, «l ^y^. 

Human secreted protein, SEQ ID NO: 
7339. 


122 


61 


7333 


AF130079_ 


Homo sapiens 


PR02852 


136 


54 


7334 


AK023563 


Homo sapiens 


unnamed protein product 


265 


73 


7335 


G02532 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6613. 


113 


61 


7336 


Ml 5530 


Homo sapiens 


B-cell growth factor 


136 


75 


7337 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


168 


81 


7338 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


339 


80 


7339 


Y86248 


Homo sapiens 

i 


Human secreted protem HCHPF68, 
SEQIDNO:163. 


170 


74 


7340 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


400 


82 


7341 


L00016 


Homo sapiens 


urf4 


295 


72 


7342 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


350 


86 


7343 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


286 


82 


7344 


X15917 

- 


Paramecium . 
aurelia 


ND2 protem (AA 1-193) (unusual start 
codon) 


132 


34 

- 


7345 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


136 


82 


7346 


Y14482 


Homo sapiens 


Fragment of human secreted protem 
encoded by gene 17. 


8? 


69 


7347 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


133 


69 


7348 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


169 


71 


7349 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


347 


83 


7350 


V00662 


Homo sapiens 


cytochrome oxidase I 


325 


85 


y^*% ^ 4 

7351 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


406 


80 


7352 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


224 


79 


7353 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


270 


86 


7354 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


167 


90 


7355 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


120 


87 


7356 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


362 


84 


7357 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


188 


72 


7358 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


335 


72 


" ^t ^ y* y* 

7359 


* V 4V y* y» y*. A 

U09500 


Homo sapiens 


cytochrome b 


275 


80 


7360 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


209 


73 


y*y* ^ 

7361 


AF1 18078 


Homo sapiens 


PROI848 


95 


58 


7362 


AB017116 


Homo sapiens 


AD 1 


148 


89 


7363 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


146 


42 


7364 


D38M2 


— - „, ,..,.„. v ,.„. 
Homo sapiens 


NADH dehydrogenase subunit 1 


203 


82 


7365 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


206 


84 


7366 


D38H2 


Homo sapiens 


NADH dehydrogenase subunit 3 


408 


82 


7367 


D38H2 


Homo sapiens 


NADH dehydrogenase subunit 3 


456 


86 


7368 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 3 


481 


85 


7369 


X86031 


Sus scrofa 


homologue of prolme/arginine-rich 
antibacterial protein 


150 


53 


7370 


AB017H6 


Homo sapiens 


AD 1 


145 


91 


7371 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


173 


85 


7372 


AFH8082 


Homo sapiens 


PRO 1902 


122 


54 


7373 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


163 


62 


7374 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


53 


7375 


AF090894 


Homo sapiens 


PRO01 13 


128 


64 


7376 


AK000241 


Homo sapiens 


unnamed protein product 


131 


67 
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Score 


% 

Identit 

y 


7377 


AF 119855 


Homo sapiens 


PRO 1847 


1 ^ A ' 

164 


62 


7379 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase . 


114 


37 


7380 


U57362 


Rattus 
norvegicus 


collagen XII alpha 1 


148 


47 


7381 


D38112 


Homo sapiens 


ATPase subunit 6 


423 


76 


7382 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


135 


58 


7383 


AB040938 


Homo sapiens 


KIAA1 505 protein 


983 


76 


7384 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


102 


48 


7385 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO;163. 


122 


75 


7386 


U63295 


Homo sapiens 


seven in absentia homolog 


260 


85 


7387 


AE003499 


Drosophila 
melanogaster 


CG 12706 gene product 


103 


32 


7388 


X03342 


Homo sapiens 


rpL32 (aa 1-135) 


123 


51 


7389 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


129 


58 


7390 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


154 


100 


7391 


AK026800 


Homo sapiens 


unnamed protein product 


137 


52 


7392 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


117 


73 


7393 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


89 


70 


7394 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 

n - 


80 


7395 


Ml 5530 


Homo sapiens 


B-ceJJ growth factor 


98 


57 


7396 


L10908 


Mus musculus 


Gcapl gene product 


105 


51 


7397 


AF1 19851 


Homo sapiens 


PR01722 


127 


55 


7398 


X99256 


Hylobates bar 


cytochrome oxidase subunit III 


319 


73 


7399 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


497 


77 


7400 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


399 


80 


7401 


L78671 


Homo sapiens 


CoxII/D-Joop DNA fusion protein 


170 


83 


7402 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


183 


91 


7403 


Y86248 


Homo sapiens 

t 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


88 


7404 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


115 


40 


7405 


D83006 


Saccharomyces 
cerevisiae 


gene required for phosphoylation of 
oligosaccharides/ has high homology 
with YJR061w 


171 


38 


7406 


AF 144636 


Gailus gallus 


osteoblast regulatory factor 3A 


108 


44 


7407 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


70 


7408 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


407 


93 


7409 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit HI 


140 


87 


7410 


AP000616 


Oryza saliva 


similar to RING-H2 finger protein 
RHAla(AF078683) 


111 


59 


7411 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit in 


138 


82 


7412 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


431 


82 


7413 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


127 


56 


7414 


AB036396 

* 


Rana 

nigromaculata 


cytochrome b 


109 


87 


7415 


AF090930 


Homo sapiens 


PRO0478 


121 


76 


7416 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


171 


96 


7417 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


446 


91 
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Desr rintinn 


Waterman 
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% 

Identit 

y 


7418 

* 


AEO 18440 


Echinococcus 
multilocularis 


NADH dehydrogenase subunit 2 


112 


33 


7419 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


123 


66 


7420 


Y86248 


Homo sapiens 

■ 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


116 

- 


50 

• 


742 J 


AF 155232 


Pisum sativum 


extenstn 


128 


37 


7422 


AF161356 


Homo sapiens 


HSPC093 


89 


48 


7423 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


162 


81 


7424 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, ' 
SEQIDNO:163. 


171 


79 


7425 


AF275953 


synthetic 
construct 


Cox8ND6gfp fusion protein 


214 

■ 


93 


7426 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1. 


156 


73 


7427 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


130 


59 


7428 


AK024455 


Homo sapiens 


FLJ00047 protein 


82 


66 


7429 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


212 


82 


7430 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


124 


48 


7431 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


195 


90 


7432 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


217 


84 


7433 


U87145 


Toxoplasma 
gondii 


ORFD 


89 


36 


7434 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


213 


72 


7435 


Y27868 

• 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


135 


57 


7436 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


151 


82 


7437 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


190 


88 


7438 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


141 


78 


7439 


AF250284 


Amsacta moorei 
entomopoxvirus 


AMV012 


88 


31 


7440 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


149 


85 


7441 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


114 


52 


7442 


AF275807 


Homo sapiens 


PNAS-110 


157 


48 


7443 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


126 


56 


7444 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


475 


95 


7445 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


146 


71 


7446 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


130 


60 


7447 


X92485 


Plasmodium 
vivax 


pval 


153 


45 


7448 


AF1 19851 


Homo sapiens 


PR01722 


96 


55 


7449 


AF1 30079 


Homo sapiens 


PR02852 


153 


73 


7450 


AF119851 


Homo sapiens 


PR01722 


238 


52 


7451 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


113 


38 


7452 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


147 


57 


7453 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


136 


63 


7454 


R14163 


Homo sapiens 


Cellular DNA-binding protein RSI. 


270 


42 


7455 


X63526 


Homo sapiens 


homologue to elongation factor 1- 


387 


80 
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gamma from A.salma 






7456 


G03I72 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


I40 


80 


7457 


AF119851 


Homo sapiens 


PR01722 


102 - 


55 


7458 


G02756 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6837. 


83 


55 


7459 


AF1 19851 


Homo sapiens 


PRO 1722 


132 


64 


7460 


L10908 


Mus musculus 


Gcapl gene product 


106 


64 


746! 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


124 


54 


7462 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


142 


51 


7463 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 
reactive fragment (residues 69-226). 


96 


35 


7464 


AB046048 


Macaca 
fascicularis 


unnamed portein product 

• 


302 


62 


7465 


X92485 


Plasmodium 
vivax 


pval 


115 


47 


7466 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


116 


65 


7467 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


91 


35 


7468 


AF161361 


Homo sapiens 


HSPC098 


92 


57 


7469 


AF216389 


Homo sapiens 


semaphorin Rs 


288 


88 


7470 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


153 


73 


7471 


AF 130089 


Homo sapiens 


PRO2550 


108 


48 


7472 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 




58 


7473 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


115 


71 


7474 


X61296 


Rattus 
norvegicus 


open reading frame 2 


134 


49 


7475 


U22376 


Homo sapiens 


alternatively spliced product using exon 
13A 


165 


46 


7476 


AF119851 


Homo sapiens 


PR01722 


133 


46 


7477 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


127 


69 


7478 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


533 


87 


7479 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


122 

i 


59 


7480 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


168 


84 


7481 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


181 


80 


7482 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


122 


56 


7483 


G03793 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
7874. 


132 


37 


7484 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


7485 


D38112 


Homo sapiens 


cytochrome c oxidase subunit I 


277 


72 


7486 


M29294 


Rattus 

• 

norvegicus 


small nuclear nbonucleoparticle- 
associated protein 


132 


42 


74 0/ 


YQ2671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


140 


50 


7488 


AF090901 


Homo sapiens 


PRO0195 


101 


62 


7489 
7490 


X97249 
D38112 


Homo sapiens 
Homo sapiens 


leucine-rich primary response protein 
NADH dehydrogenase subunit 3 


304 
447 


93 
84 


7491 


AK024455 


Homo sapiens 


FLJ00<M7 protein 


122 


63 


7492 


U87269 


Homo sapiens 


p!20E4F transcription factor 


244 


66 


7493 


A3 1036 


Nicotiana alata 


PRP2 

- 


87 


31 



329 



WO 01/64835 PCTYUS01/04927 

i » - 



NO: 


1 \ Accession pto« 




nMPiififiAn 

j^cscnpiiou 


OHllUn 

Waterman 
Score 


4£ 

Idcntit 

y 


7494 


Y?6248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


206 


97 


7495 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


166 


91 


7496 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 

■ - 


66 


7497 


AK024455 


Homo sapiens 


FU00047 protein 


67 


61 


7498 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


174 


84 * 


7499 


X75438 


Susscrofa 


homologue of proline/arginine rich ■ 
antibacterial peptides 


116 


44 


7500 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


166 


89 


7501 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


137 


61 


7502 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQK>NO:163. 


213 


60 


7503 


AB042770 

i 


Pteropus 
dasymallus 


cytochrome c oxidase suburiit 3 

* ' 


169 


89 


7504 


AF126163 


Homo sapiens 


HHLA3 protein 


110 


62 


7505 


D38112' 


Homo sapiens 


cytochrome c oxidase subunit 3 


299 


90 


7506 


D381J2 


Homo sapiens 


cytochrome c oxidase subunit 3 


194 


88 


7507 


W48351 


Homo sapiens 


Human breast cancer related protein . 
BCRB2. 


128 


72 


7508 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


162 


86 


7509 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


116 


65 


7510 


L38908 


Nicotiana 
tabacum 


extensin 


114 

• 


36 


7512 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


273 


73 


7513 


Y36156 


Homo sapiens 


Human secreted protein #28. 


168 


44 


7514 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


171 


96 


7515 . 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


110 


46 


7516 


D3SU5 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


170 


84 


7517 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7344. 


105 


43 


7518 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


350 


82 


7519 


D38114 


Gorilla gorilla 


NADH dehydrogenase subunit 3 (ND3) 


141 


84 


7520 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


410 


85 


7521 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


1 12 


71 


7522 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


*6 


46 


7523 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


142 


57 


7524 


AF220264 


Homo sapiens 


MOST-1 


86 


62 


7525 


AF130089 


Homo sapiens 


PRO2550 


109 


35 


7526 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


106 


79 

■ 


7527 


U49973 


Homo sapiens 


ORF2: function unknown 


105 


68 


7528 


Y14482 


Homo sapiens 

■ 


Fragment of human secreted protein 
encoded by gene 17. 


122 


62 


7529 


X05831 


Rattus 
norvegicus 


fibronectin precursor 


100 


64 


7530 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


175 


60 


7531 


X92485 


Plasmodium 


pval 


120 


59 
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< 



SeQID 
NO: 


Accession No. 

■ 


Species 

m 

\ 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 




i 


vivax . 








7532 


D382J5 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


3 S3 


75 


7533 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


321 


67 


7534 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, vanant a dna, 
complete virion genome. 


126 

M * 


72 — 


7535 


AF119851 


Homo sapiens 


PR01722 


110 


67 


mm m* **** ^ 

7536 


D38112 


WW • 

Homo sapiens 


cytochrome c oxidase subunit 3 


mm m* « 

531 


81 


7537 


G03714 


my mm * 

Homo sapiens 


Human secreted protem, SEQ ID NO: 
7795. 


146 


75 


7538 


X92485 


Plasmodium 
vivax 


pval 

T T 


141 


48 


7539 


AF1 19900 


Homo sapiens 


PR02822 


A m* 

86 


42 


7540 


G02532 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6613. 


126 


50 


7541 


M15530 


Homo sapiens 


B-cell growth factor 


136 


53 


7542 


X53581 


Rattus 
norvegicus 


ORF4 


174 


46 


7543 


Y86248 

» 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


138 


74 


7544 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


m) m. m* 

116 


mfm. m+ 

86 


7545 


M15530 


Homo sapiens 


B-cell growth factor 


96 


47 


7546 

- 


X92485 

• 


Plasmodium 
vivax 


pval 


149 


58 


7547 


AF113685 


Homo sapiens 


PRO0974 


110 


48 


7548 


U49973 


Homo sapiens 


ORJF1; MER37; putative transposase 
similar to pogo element 


226 


73 


7549 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


316 


73 


7550 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


107 


70 


7551 


AF130089 


Homo sapiens 


PRO2550 


- mm mm, tf\. 

130 


45 


7552 


R95913 


Homo sapiens 


Neural thread protein. 


99 


56 


7553 


U93569 


Homo sapiens 


putative pi 50 


171 


47 


7554 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


111 


75 


7555 


L78671 


Homo sapiens 


Coxil/D-loop DNA fusion protein 


207 


56 


7556 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 

... 


59 


7557 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


125 


43 


7558 


Y86445 


Homo sapiens 


Human gene 42-encoded protein 
fragment, SEQ ID NO:360. 


95 


71 


7559 


X83413 


Human 
herpesvirus 6 


U88 


108 


32 


7560 


A1SS12 


Brassica napus 


extensin 


125 


38 


7561 


D38112 


Homo sapiens 


ATPase subunit 6 


495 


87 


7562 


AF 130089 


Homo sapiens 


PRO2550 


141 


46 


7563 


AF265575 


Homo sapiens 

... 


ubiquitous TPR-motif protem Y 
isoform 


106 

- 


61 


7564 


G00361 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
4442. 


128 


62 


mm mm —w _ 

7565 


+mf T mr A. m m\ 

U50191 


Caenorhabditis 
elegans 


C. elegans collagen dpy-10 gene (Levy, 
A.u. t Yang, J. ana Kramer, j.m. jvioi. 
Biol Cell 4, 803-17, 1993) 


■ ^ mf\ « 

101 


#\ mm 

37 


7566 


V00672 


Pan troglodytes 


reading frame protein 4 


294 


73 


7567 


A18812 


Brassica napus 


extensin 


145 


48 


7568 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


241 


69 


7569 


L00016 


Homo sapiens 


urf4 


284 


71 


7570 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


115 


50 


7571 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 

- - 


65 
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NO: 


Accession No. 


Species 


toescnptTon 

- 


Smith- 
Waterman . 
Score 


% 

ldentit 

y 








4754. 






7572 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


7573 


X92485 


Plasmodium 
vivax 


pval — 


157 


49 


7574 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


151 


54 


7575 


AF 130089 


Homo sapiens 


PRO2550 


348 


70 


7576 


R95913 


Homo sapiens 


Neural thread protein. 1 


113 


45 


7577 


L26953 


Homo sapiens 


chromosomal protein 


148 


38 


7578 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


116 


46 


7579 


AB047948 


Macaca 
fascicularis 


hypothetical protein 


173 


67 


7580 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


149 


57 


7581 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


95 


51 


7582 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


125 


85 


7583 


AC004079 


Homo sapiens 


human HOXA3; 95% similarity to 
e307530(PID:gl 888441) 


127 


100 


7584 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


171 

> 


71 


7585 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


141 


80 


7586 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


222 


100 


7587 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


194 


71 


7588 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 1 


115 


80 


7589 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


127 


77 


7590 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


112 


60 


7591 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 

* 


133 


72 


7592 


A49807 


Neisseria 
meningitidis 


IGA-LPHA3 


93 


33 


7593 


Z67990 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01484 (Nematode cuticle collagen 
N- terminal domain), Score=27.5, E- 
value=O.0001,N=l 


102 


42 


7594 


U43627 


Arabidopsis 
thaliana 


extensin 


99 


44 


7595 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


74 


7596 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


49 


7597 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


190 


92 


7598 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ED NO: 163. 


148 


96 


7599 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


117 


95 


7600 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


192 


85 


7601 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


91 


7602 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


333 


71 



332 



SEQID 
NO: 


^Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Ideotit 

y 


7603 


D?8U2 


Homo sapiens 


cytochrome c oxidase subunit 3 


190 


86 


7604 


U09500 


Homo sapiens 


cytochrome b 


279 


93 


.7605 


Z14016 


Nicotiana 
tabacum 


pistil extensin like protein, partial CDS 


84 

• 


40 


7606 


G03801 


Homo sapiens . 


Human secreted protem, SEQ ID NO: 
7882. 


108 


71 


7607 


G03636 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7717. 


55 


52 


7608 


G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7795. 


114 


54 


7609 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


97 


62 


7610 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


434 


87 


7611 


Y21418 


Homo sapiens 


Human high mobility group protein 
HMGI-C mutant fragment 2. 


97 


46 


7613 


X92485 


Plasmodium 
vivax 


pval 


106 


58 


7614 


AC003058 

• 


Arabidopsis 
thaliana 


unknown protein 

• 


191 


88 

• 


7615 


S74221 


Homo sapiens 


IK factor=cytokine down-regulating 
HLA class II 


153 


65 


7616 


U28971 


Caenorhabdkis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


148 


74 


7617 - 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 

. . 1 


169 


56 


7618 


L35013 


Homo sapiens 


spliceosomal protem 


95 


55 


7619 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


441 


85 


7620 


AF1 16638 


Homo sapiens 


PR01546 


172 


42 


7621 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


146 


46 


7622 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


145 


72 


7623 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


144 


67 


7624 


U41031 


Caenorhabditis 
elegans 


prolirie-rich 


92 


59 


7625 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


130 


55 


7626 


L19098 


Choristoneura 
fumiferana 


cytochrome oxidase I 


102 


66 


7627 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


99 


76 


7628 


AF126163 


Homo sapiens 


HHLA3 protein 


117 


67 


7629 


G00361 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


69 


53 


7630 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


83 


55 


7631 


G00377 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4458. 


87 


64 


7632 


Y27907 


Homo sapiens 


Human secreted protem encoded by 
gene No. 119. 


120 


48 


7633 


R95913 


Homo sapiens 


Neural thread protein. 


126 


61 


7634 


V00662 


Homo sapiens 


cytochrome oxidase I 


396 


85 


7635 


Y86248 


Homo sapiens 


Human secreted protem HCHPF68, 
SEQIDNO:163. 


205 


84 


7636 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


464 


85 


7637 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


165 


94 


7638 


AF 118086 


Homo sapiens 


PR01992 


120 


68 


7639 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163; 


176 


84 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman . 
Score 


Identit . 

y 


7640 


W88627 


Homo sapiens 


% * flit 4% A 

Secreted protein encoded by gene 94 
clone P1PMBQ32. 


95 


39 


7641 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


143 


68 


7642 


D38112 


Homo sapiens 


NADH dehydrogenase subunrt 3 


192 


91 


7643 


D38112 


Homo sapiens 


NADH dehydrogenase summit 3 


416 


80 


7644 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


133 


84 


7645 


U93570 


Homo sapiens 


putative pl50 


206 


42 


7646 


D38112 


Homo sapiens 


ATPase subunit 6 


192 


80 


7647 


AF090942 


Homo sapiens 


PRO0657 


128 


71 


7648 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


134 

* 


73 


7649 


G60397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


115 


52 


7650 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


130 


61 


7651 


X71442 


Rattus 
, norvegicus 


ORF 1; putative 


105 


52 


7652 


L07545 


Leishmania 
tarentolae 


A 'c* was inserted after nt 369 (=nt 
10459 in genomic sequence (M10126)) 
to correct -1 frameshift probably due to 
gel compression 


111 


32 


7653 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


89 


53 


7654 


G02473 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6554. 


291 


49 


7655 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


165 


63 


7656 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 3 


470 


88 


7657 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


367 


80 


7658 

• 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


108 


95 


7659 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


109 


50 


7660 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF6 8, 
SEQIDNO:163. 


112 


77 

- 


7661 


AP002460 


Arabidopsis 
thaliana 


geneJd:FlD9.26~unknown protein 


121 

■ 


40 


7662 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


137 


96 


7663 


AF2 17973 


Homo sapiens 


unknown 


124 


51 


7664 


X92485 


Plasmodium 
vivax 


pval 


118 


59 


7665 


AF1 18082 


Homo sapiens 


PRO 1902 


123 


44 


7666 


AB014563 


Homo sapiens 


KIAA0663 protein 


267 


90 


7667 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


49 


7668 


AF1 19855 


Homo sapiens 


PR01847 


152 


72 


7669 


AK025116 


Homo sapiens 


unnamed protein product 


127 


65 


7670 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


140 


55 


7671 


R14847 


Homo sapiens 


Protein associated with biochemical 
pathway involving cAMP. 


164 


89 


7672 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


93 


68 


7673 


U37426 


Homo sapiens 


kinesin-like spindle protein HKSP 


247 


81 


7674 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


125 


75 


7675 


S67633 


Homo sapiens 


class I major histocompatibility antigen 


103 


59 
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Smith- 

Waterman 

Score 


% 

Idcntit 

y 


7676 


AJF126163 


Homosapiens 


HHLA3 protein 


146 


72 


7677 

• • 


AC005192 


Homosapiens 

• ■ 


- mm - • /* * . m 

similar to mouse lnterferon-related 
protein PC4; 96% identical to P19182 
(PID:gl35861) 


143 

■ 


87 


7678 


X07882 


Homo sapiens 


Po protein 


126 


39 


7679 


Y14482 


Homo sapiens . 


Fragment of human secreted protein 
encoded by gene 17. 


* A A 

148 


M+0m\ 

60 


7680 


AJ242540 


Volvox carteri f. 

• • 

naganensis 


hydroxyproline-rich glycoprotein ,DZ- 
HRGP 


177. 


37 


7681 


AF09O930 


Homo sapiens 


PRO0478 


80 


58 


7682 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


115 


65 


7683 


AB046048 


Macaca 
fascicular^ 


unnamed portein product 


m\ A m* 

145 


67 


7684 


AF130079 


WW * 

Homo sapiens 


PR02852 


m « mm 

117 


60 


7685 


M13101 


Rattus 
norvegicus 


unknown protein 

■ 


114 


53 

«. 


7686 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


134 


74 


7687 


AF126163 , 


Homo sapiens 


HHLA3 protein 


141 


84 


7688 


Y36156 


Homo sapiens 


Human secreted protein #28. 


120 


61 


7689 


M37197 


Homo sapiens 


CCAAT-box-binding factor 


104 


95 


7690 


AL390114 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


97 


54 


7691 


AF220264 


Homo sapiens 


MOST-1 


117 


76 


7692 


AF063866 


Melanoplus 
sanguinipes 
entomopoxvirus 


ORF MSV233 hypothetical protein 

■ * 


94 


31 


7693 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


98 


40 


7694 


AF118082 


Homo sapiens 


PRO1902 


115 


55 


7695 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4598. 


116 


56 


7696 


Y15163 


Mus musculus 


putative splice variant containing 
region C 


133 


51 


7697 


Y64890 


Homo sapiens 


Human 5* EST related polypeptide SEQ 
IDNO:105L 


132 

- • 


52 


7698 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


155 


77 


7699 


Ml 5530 


Homo sapiens 


B-ceU growth factor 


106 


79 


7700 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


108 


70 


7701 


G02538 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6619. 


166 


62 


mm mm 4m, sm. 

7702 


AF090931 


Homo sapiens 


PRO0483 


125 


68 


7703 


G03432 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7513. 


114 


79 


7704 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


175 


63 


7705 


AF041330 


Bpdo saltans 


NADH dehydrogenase subunit 5 


141 


35 


7706 


G02532 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6613. 


149 


58 


7707 


X92485 


Plasmodium 
vivax 


pval 


i o £ 
126 


42 


7708 


AK023563 


Homo sapiens 


unnamed protein product 


217 


72 


7709 - 


AP000616 


Oryza sativa ! 


similar to RING-H2 finger protein 
RHAla(AF078683) 


124 


58 


7710 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


124 


56 


7711 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


62 
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SEQ ID 
NO: 


^Accession No. 


Species 

„ ♦ 


Description 


Sroitb- 

Waternian 

Score 


Identit 
J 




i 


i 


7795. 


■ 




7712 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


116 


72 


7713 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


194 


58 


7714 

t 


X92485 


Plasmodium 
vivax 


pval 


123 


68 


7715 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


137 


48 


7716 


AF1 19851 


Homo sapiens 


PR01722 


168 


60 


7717 


U93570 


Homo sapiens 


putative pi 50 


133 


35 


7718 


AFU6712 


Homo sapiens 


PR02738 ; 


94 


68 


7719 


S58722 


Homo sapiens 


X-linked retinopathy protein {O 
terminal, clone XEH.8c} 


107 


55 


7720 


U49973 


Homo sapiens 


ORF2: function unknown 


200 


85 


7721 


U49973 

» 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


234 


73 


7722 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


155 


76 . 


7723 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


140 


65 


7724 


Y02671 , 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


70 


73 


7725 


G03789 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


94 


55 


7726 


S58722 


Homo sapiens' 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


130 


75 


7727 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


94 


78 


7728 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


163 


65 


7729 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ W 1 8. 


111 


63 


7730 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


119 


69 


7731 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


113 


84 


7732 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


94 


38 


7733 


U09500 


Homo sapiens 


cytochrome b 


463 


84 


7734 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


334 


92 


7735 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


399 


92 


7736 


Y12950 


Homo sapiens 

* 


Amino acid sequence of a human 
secreted peptide. 


128 


55 


7737 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


167 


66 


7738 


W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


86 

■ 


70 


7739 

- 


D86854 

■ 


Catharanthus 
roseus 


extensin 


154 


40 


7740 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


122 


57 


7741 


X86031 


Sus scrofa 


homologue of prolme/arginine-rich 
antibacterial protein 


99 


4? 


7742 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


117 


35 


7743 


V00662 


Homo sapiens 


cytochrome oxidase I 


568 


89 


7744 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


145 


93 


7745 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


154 


100 


7746 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


61 
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NO: 


Accession No. 


Soecies 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


7747 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


126 


71 


7748 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


327 


92 


7749 


G02827 


Homo sapiens- 


Human secreted protein, SEQ ID NO: 
6908. 


150 


71 


7750 


AF1 30079 


Homo sapiens 


PR02852 


m a* 

147 




7751 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0:163. 


104 


49 


7752 


U43627 


Arabidopsis 
thahana 


extensin 


116 


37' 


7753 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


71 


7754 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 1 


551 


83 


7755 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


88 


56 


7756 


AF119851 


Homo sapiens 


PR01722 


97 


56 


7757 


U80955 


Caenorhabditis 
elegans , 


similar to RE repeat region of 
PIR:S27806 D. melanogaster homeotic 
protein BarH2 protein (dual bar 
protein) 


89 

■ 


100 


7758 


AC002045 


Homo sapiens 


Unknown protein product CIT987SK- 
A-589H1J splice form 2 


97 


94 


7759 


Y02775 


Homo sapiens 


Human secreted protein encoded by 
gene 12 clone HFTCU1 9. 


288 


98 


7760 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


94 

i 

_ ji 


80 


7761 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


199 


76 


7762 


AF090931 


Homo sapiens 


PRO0483 


153 


84 


7763 


X92485 


Plasmodium 
vivax 


pval 


114 


62 


7764 


AF090931 


Homo sapiens 


PRO0483 


107 


86 


7765 


G02752 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6833. 


103 


43 


7766 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


168 


73 


7767 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


148 


72 


7768 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


222 


83 


7769 


Y40039 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


68 


72 


7770 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


62 


7771 


U93569 


Homo sapiens 


p40 


188 


43 


7772 


AL049934 


Homo sapiens 


hypothetical protein 


113 


47 


7773 


AF090931 


Homo sapiens 


PRO0483 


86 


65 


7774 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


137 


92 


7775 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A w\r\ 

4409. 


143 


65 


7776 


AF126163 


Homo sapiens 


HHLA3 protein 


141 


71 


7777 


G00423 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4504. 


93 


47 


7778 


AF090942 


Homo sapiens 


PRO0657 


148 


74 


7779 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


194 


83 


7780 


U 14966 


Homo sapiens 


ribosomal protein L5 


315 


81 
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SEQll> 
NO: 


^Accession No. 


Species 


Description 


Waterman 
Score 


Identit 

y 


7781 


R59843 


Homo sapiens 


Apo£4Lx2 protease. 


126 


47 


■ 7782 


AF 119855 


Homo sapiens 


PR01847 


131 


51 


.7783 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


79 


53 


7784 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


405 


82 


7785 


G00331 

■ 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
4412. 


64 

• 


42 


7786 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


141 


59 


7787 


AF118082 


Homo sapiens 


PRO1902 


131 


48 


7788 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


88 


64 


7789 


J03807 


Strongylocentrot 
us purpuratus 


histone Hl-delta 


111 


43 


7790 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


139 


85 


7791 


V00662 


Homo sapiens 


cytochrome oxidase I 


522 


80 


7792 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


153 


72 


7793 


G02832 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
6913. 


84 


53 


7794 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


82 


46 


7795 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


156 


76 


7796 


D38112 


Homo sapiens 


ATPase subunit 8 


166 


77 


7797 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


159 


48 


7798 


AF090931 


Homo sapiens 


PRO0483 


132 


75 


7799 


G03714 

- 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7795. 


97 


57 


7800 


D21261 


Homo sapiens 


similar to human 22kDa, SM22 mRNA 
(HUM22SM). 


99 


84 


7801 


X64707 


Homo sapiens 


BBCi 


232 


77 


7802 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


104 


33 


7803 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


112 


50 


7804 


Y36203 


Homo sapiens 


Human secreted protein #75. 


142 


56 


7805 


AFi 19851 


Homo sapiens 


PR01722 


88 


46 


7806 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


149 


57 


7807 


X68600 


Hordeum 
vulgare 


pZE40 

■ 


90 


37 


7808 


AK025047 


Homo sapiens 


unnamed protein product 


138 


54 


7809 


AF130089 


Homo sapiens 


PRO2550 


375 


71 


7810 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


124 


65 


7811 


R59842 


Homo sapiens 


ApoE4Ll protease. | 


156 


58 


7812 


AF016446 


Caenorhabditis 
elegans 


similar to C. elegans cuticulin precursor 
CUT-2 (SP:P34682) 


95 


42 


7813 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


154 


65 


7814 


AK025047 


Homo sapiens 


unnamed protein product 


165 


71 


7835 


W34499 


Homo sapiens 


Obesity receptor C protein. 


309 


89 


7816 


AJ222673 


Hepatitis C virus 
type 2c 


core protein 


95 


42 


7817 


AB01 1483 


Arabidopsis 
tfaaliana 


contains similarity to root cap 
protein-gene Jd:MUF9. 1 6 


104 


61 


7818 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


154 


74 
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SEOID 
NO: 


Accession No 


Sneries 


Descriotion 

- 


Smith- 
Waterman . 
Score 


% " 

Idcntit 

y 


7819 


X92485 


Plasmodium 
vivax 


pval 


112 


61 


7820 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


103 


60 


7821 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


174 


65 


7822 


AF 130089 


Homo sapiens 


PRO2550 


107 


67 


7823 


AF1 18086 


Homo sapiens 


PR01992 


143 


73 


7824 


G02493 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6574. 


132 


63 


7825 


AL390114 


Leishraania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


142 


59 


7826 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


42 


7827 


G00352 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
4433. 


128 


65 


7828 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


104 


58 


7829 


AF 132961 


Homo sapiens 


CGI-27 protein 


154 


49 


7830 


X70343 


Nicotiana 
sylvestris 


extensin 


105 


30 


7831 


AL1 60371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


100 


62 


7832 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


129 


51 


7833 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


99 


46 


7834 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6550. 


74 


65 


7835 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


163 


55 


7836 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


72 


52 


7837 


AF090942 


Homo sapiens 


PRO0657 


126 


59 


7838 


Z33426 


Human 
respiratory 
syncytial virus 


glycoprotein 


110 


34 


7839 


D63480 


Homo sapiens 


The KIAA0 1 46 gene product is novel. 


158 


89 


7840 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLT1. 


88 


63 


7841 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


156 


60 


7842 


W90847 

1 M. 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


145 

_ 


100 


7843 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


127 


66 


7844 


M28016 


Homo sapiens 


cytochrome b 


214 


78 


7845 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


283 


82 


7846 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


94 


73 


7847 


D38112 


Homo sapiens 


cytochrome b 


525 


83 


7848 


L27428 


Homo sapiens 


reverse transcriptase 


210 


67 


7849 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


156 


96 


7850 


X92485 


Plasmodium 
vivax 


pval 


121 


54 


7851 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


59 


7852 


D38112 ' 


Homo sapiens 


NADH dehydrogenase subunit 2 


191 


93 
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SEQID 
NO: 


'Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


7853 


W48808 


Homo sapiens 


Homo sapiens clone CG109_1 protein. 


72 


69 


7854 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


142 


76 


7855 

■| 


U43627 


Arabidopsis 
thaliana 


extensin 


124 

■ 


35 


7856 


AP000616 


Oryza saliva 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


95 


7857 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


95 


57 


7858 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, ' 
SEQIDNO:163. 


157 


91 


7859 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


108 


45 


7860 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


159 

- 


54 


7861 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 


63 


7862 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


265 


77 


7863 


AP000616 . 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


145 


56 


7864 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


222 


100 


7865 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


409 


87 


7866 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


112 


50 


7867 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


169 


74 


7868 


AL031004 


Arabidopsis 
thaliana 


RSZp22 splicing factor 


127 


35 


7869 


U83771 


Accipiter 
superciliosus 


cytochrome oxidase I 

• * 


168 


73 


7870 


U83771 


Accipiter 
superciliosus 


cytochrome oxidase I 


186 

■ 


81 


7871 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


203 


56 


7872 


Y08062 


Homo sapiens 


Human PRQ245 protein fragment 
derived from DNA35638. 


130 


71 


7873 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


44 


7874 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


138 


69 


7875 


AF1 19855 


Homo sapiens 


PR01847 


174 


70 


7876 


AF210651 


Homo sapiens 


NAG 18 


164 


76 


7877 


Z93244 


Homo sapiens 


bKl 1 6F5.2 (PUTATIVE RhoGAP 
(CDC42 GTPAse Activating Protein) 
LIKE protein) 


111 


45 


7878 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


227 


79 


7879 


AK024455 


Homo sapiens 


FLJ00047 protein 


120 


68 


7880 


Y36156 


Homo sapiens 


Human secreted protein #28. 


85 


68 


7881 


AF 116661 


Homo sapiens 


PR01438 


138 


61 


7882 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


141 


55 


7883 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


111 


63 


7884 


AP000616 


Oryza sativa 


similar to RJNG-H2 ringer protein 
RHAla(AF078683) 


106 


64 


7885 


AK022821 


Homo sapiens 


unnamed protein product 


175 


72 


7886 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


117 


59 
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SEQID 
NO: 


s Accession No. 


I Species 


Descrintinn 


Smith- 

Waterman 

Score 


Idcntit 

y ! 


7887 


L27428 


Homo sapiens 


reverse transcriptase 


165 


63 


, 7888 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


139 


& 


7889 


MI5530 


Homo sapiens 


B-cell growth factor 


132 


74 


7890 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


81 


64 


7891 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


150 


61 


7892 


GO 1249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


154 


73 


7893 


AF130089 


Homo sapiens 


PRO2550 


224 


61 


7894 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


127 


69 


7895 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


142 

• 


62 


7896 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


50 


7897 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


108 


68 


7898 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


144 


65 

■ 


7899 

■ 


G03043 

* 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
7124. 


105 


83 


7900 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


104 


57 


7901 


X03717 


Homo sapiens 


pot. unidentified reading frame 


105 


46 


7902 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


94 


67 


7903 


X6I046 


Hydra sp. 


mini-collagen 


107 


46 


7904 


AF124726 


Homo sapiens 


acinusL 


111 


44 


7905 


U77049 


Ovis aries 


bactinecin 1 1 


116 


38 


7906 


AF 118086 


Homo sapiens 


PR01992 


130 


73 


7907 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


321 


48 


7908 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


324 


83 


7909 


AC0Q4450 


Arabidopsis 
thaliana 


putative extensin 


167 


34 


7910 


AF130079 


Homo sapiens 


PR02852 


110 


42 


7911 


AF090942 


Homo sapiens 


PRO0657 


144 


66 


7912 


AF090931 


Homo sapiens 


PRO0483 


96 


80 


7913 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


128 


46 


7914 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


87 


68 


7915 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


105 


75 


7916 


X92485 


Plasmodium 
vivax 


pval 


102 


67 


7917 


AF090942 


Homo sapiens 


PRO0657 


126 


76 


7918 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


108 


47 


7919 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


156 


63 


7920 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 1 
6953. 


138 


61 


7921 


L26251 


Trypanosoma 
brucei 


CR5 


95 


37 


7922 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


176 


73 
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7923 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


174 


61 


7924 


L27428 


Homo sapiens 


reverse transcriptase 


121 


33 


-7925 


Y36156 


Homo sapiens 


Human secreted protein #28. 


137 


67 


7926 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


119 


73 


7927 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


138 


65 


7928 


Y02693 . 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


138 


60 


7929 


G02832 

- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


166 


61 


7930 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


144 


87 


7931 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


176 


53 


7932 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


179 


60 


7933 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


279 


92 


7934 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


132 


63 


7935 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


101 


64 


7936 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


159 


55 


7937 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


320 


80 


7938 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


159 

F 

ll 


71 


7939 


AF 144054 


Homo sapiens 


apoptosis related protein APR-4 


127 


65 


7940 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


314 


73 


7941 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


121 


57 


7942 


AB040936 


Homo sapiens 


KIAA 1503 protein 


149 


71 


7943 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


107 


83 


7944 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


74 


60 


7945 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


68 


35 


7946 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


126 


50 


7947 


AP000735 


Arabidopsis 
thaliana 


extensin protein-like 


214 


40 


7948 


AF161356 


Homo sapiens 


HSPC093 


134 


75 


7949 


AF090942 


Homo sapiens 


PRO0657 


110 


48 


7950 


AJ249395 


Globodera 
pallida 


cytochrome b 


117 


36 


7951 


AF 116661 


Homo sapiens 


PR01438 


135 


49 


7952 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


189 


92 


7953 


Ml 5530 


Homo sapiens 


B-ceJl growth factor 


143 


65 


7954 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


137 


68 


7955 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


130 


62 


7956 


AF130089 


Homo sapiens 


PRO2550 


153 


62 


7957 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


99 


48 


7958 


AF161356 


Homo sapiens 


HSPC093 


154 


56 
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7959 


GQ2832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


131 


57 


7960 


X92485 

i 


Plasmodium - 

• 

vivax 


pval 

* 


107 


53 


7961 


AF210651 


Homo sapiens 


NAG 18 


113 


77 


7962 


AF130051 


Homo sapiens 


PRO0898 


84 


71 


7963 


R59842 


Homo sapiens 


ApoE4Ll protease. 


116 


68 


7964 


AF090931 


Homo sapiens 


PRO0483 


129 . 


56 


7965 

• 


Y64890 


Homo sapiens 

• 


. Human 5' EST related polypeptide SEQ 
IDNO:1051. 


151 

■ 


51 


7966 


R95913 


Homo sapiens 


Neural thread protein. 


110 


40 


7967 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


180 


50 


7968 


AF090894 


Homo sapiens 


PRO0113 


104 


58 


7969 


Y21040 


Homo sapiens 


Human glial fibrillary acidic protein 
GFAP mutant fragment 49. 


285 


100 


7970 


AK001452 


Homo sapiens 


unnamed protein product 


346 


56 


7971 . 


V00672 


Pan troglodytes 


reading frame protein 4 


134 


90 


7972 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


165 


48 


7973 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


104 


84 


7974 


AF1 16661 


Homo sapiens 


PR01438 


140 


62 


7975 


AF 119882 


Homo sapiens 


PR02492 


112 


48 


7976 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


80 


65 


7977 


S58722 


Homosapiens 


X-linked retinopathy protein (C- 
terminal, clone XEH.8c} 


124 


89 


7978 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


125 


36 


7979 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


149 


68 


7980 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


289 


w 


7981 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


59 


7982 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


109 


69 


7983 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


149 


73 


7984 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


128 


72 


7985 


R59842 


Homo sapiens 


ApoE4Ll protease. 


130 


82 


7986 


AF090895 


Homo sapiens 


PRO0117 


83 


58 


7987 


X07881 


Homo sapiens 


proline-rich protein Gl 


131 


42 


7988 


G03790 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7871. 


123 


53 


7989 


AF090894 


Homo sapiens 


PRO0113 


104 


61 


7990 


Z93244 


Homo sapiens 


bKl I6F5.2 (PUTATIVE RhoGAP 
(CDG42 GTPAse Activating Protein) 
LIKE protein) 


100 


61 


7991 


AK024455 . 


Homo sapiens 


FLJ00047 protein 


131 


56 


7992 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


136 


61 


7993 


AF210651 


Homo sapiens 


NAG 1 8 


133 


61 


7994 


Y14482 


Homosapiens 


Fragment of human secreted protein 
encoded by gene 17. 


92 


42 


7995 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


171 


82 


7996 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


54 
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7997 


AF090928 


Homo sapiens 


PRO0470 


86 


76 


7998 


AJ251579 


Arabidopsis 
thaliana 


cef protein 


113 


37 


7999 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


119 


60 


8000 


R95913 


Homo sapiens 


Neural thread protein. 


105 . 


70 


8001 


AF229067 


Homo sapiens 


PADI-H protein 


118 


61 


8002 

* 


W88609 


Homo sapiens 

• 


Secreted protein encoded by gene 76 
clone HKIXL73. 


83 


65 

- 


8003 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


85 


61 


8004 


S79410 


Mus wusculas 


nuclear localization signal binding 
protein 


123 


43 


8005 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


93 


68 


8006 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


147 


64 


8007 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


147 


38 


8008 


G03714 


Homo sapiens - 

i 


Human secreted protein, SEQ ID NO: 
7795. 


150 


72 


8009 


AP000616 

* 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


71 


8010 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


92 


54 


8011 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


152 


59 


8012 


G02832 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
6913. 


123 


66 


8013 


AF161356 


Homo sapiens 


HSPC093 


185 


64 


8014 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


107 


54 


8015 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4494. 


71 


65 


8016 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


139 


66 


8017 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


94 


77 


8018 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


108 


46 


8019 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


178 


61 


8020 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7720. 


85 


65 


8021 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


51 


8022 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


202 


77 


8023 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


195 


68 


8024 


G03786 


Homo sapiens 


Human secreted protein, SEQ. ID NO: 
7867. 


158 


53 


8025 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


133 


58 


8026 


X92485 


Plasmodium 
vivax 


pval 


128 


56 


8027 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


135 


61 


8028 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


105 


66 


8029 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


126 


59 



344 



WO 01/64835 PCT/US01/04927 

. t ■ ■ . 



NO: 






IWfifrinfion 


Smith- 

Waterman 

Score 


Identif 

y 


8030 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


117 


40 


8031 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


126 1 


72 


8032 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


125 


41 


8033 


Y40039 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


113 


69 


8034 

- 


Y14482 

• 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


102 

■ 


66 


8035 


Y86248 


Homo sapiens 

> 


Human secreted protein HCHPF6S, 
SEQIDNO:163. 


221 


95 


8036 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


106 


36 


8037 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


113 


55 


8038 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


108 


64 


8039 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


110 


68 


8040 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


169 


86 


8041 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


170 


56 


8042 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


95 


58 


8043 


AK024455 


Homo sapiens 


FU00047 protein 


152 


68 


8044 


AF220264 


Homo sapiens 


MOST-1 


83 


68 


8045 


AK024455 


Homo sapiens 


FLJ00047 protein 


131 


55 


8046 


Y73966 

• 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


181 

• 


78 


8047 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


147 


61 


8048 


W03642 


Homo sapiens 


Human cannabinoid GPR N-tenninal 
sequence. j 


102 


43 


8049 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


59 


8050 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


154 


56 


8051 


D00570 


Mus musculus 


open reading frame (196 AA) 


127 


59 


8052 


AF040964 


Homo sapiens 


unknown protein 1T1 


143 


50 


8053 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


168 


83 


8054 


G03787 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7868. 


108 


70 


8055 


G02538 

■ 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
6619. 


134 


62 


8056 


Y86248 


Homo sapiens 

■ 


Human secreted protein HCHPF6S, 
SEQ ID NO: 163. 


112 


65 


8057 


AF090895 


Homo sapiens 


PRO01 17 


110 


77 


8058 


AP000616 


Oryza sat iva 


similar to RING-H2 finger protein 
RHAla(AF078683) 


124 


90 


8059 


AF1 18082 


Homo sapiens 


PRO 1902 


117 


50 


8060 


AK022217 


Homo sapiens 


unnamed protein product 


145 


53 


8061 


AF130089 


Homo sapiens 


PRO2550 


160 


76 


8062 


AF119851 


Homo sapiens 


PR01722 


177 


54 


8063 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


362 


73 


8064 


D38112 


Homo sapiens 


7<IADH dehydrogenase subunit 3 


426 


85 


8065 


D38U2 


Homo sapiens 


NADH dehydrogenase subunft 3 


344 


90 


8066 


AF068294 


Homo sapiens 


HDCMB45P 


146 


60 


8067 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


461 


87 


8068 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 4 


504 


90 
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8069- 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


97 


40 


8070 


R59842 


Homo sapiens 


ApoE4Ll protease. 


147 


47 


8071 


AFU9851 


Homo sapiens 


PR01722 


139 


59 


8072 


X92485 


Plasmodium 
vivax 


pval 


183 


45 


8073 . 


AF116712 


Homo sapiens 


PR02738 


95 


52 


8074 


AF213465 


Homo sapiens 


dual oxidase 


184 


80 


8075 


AF090895 


Homo sapiens . 


PRO0117 


153 


57 


8076 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


162 


73 


8077 


T85403_cdl 


Homo sapiens 


19-APR-1993 Human cadherin-1 1 
coding sequence. 


284 


98 


8078 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
IDNO:1051. 


99 


63 


8079 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


162 


73 


8080 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


142 


52 


8081 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


37 


8082 


X05285 


Drosophila 
melanogaster 


fibrillarin 


100 


34 


8083 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


142 


65 


8084 


G02216 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6297. 


380 

it 


82 


8085 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163, 


146 


64 


8086 


X92485 


Plasmodium 
vivax 


pval 


138 


64 


8087 


AP000616 

• 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


117 


82 


8088 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


180 


79 


8089 


U27143 


Homo sapiens 


protein kinase C inhibitor-I 


119 


41 


8090 


AF1 19851 


Homo sapiens 


PR01722 


85 


47 


8091 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


112 


73 


8092 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


133 


64 


8093 


AF 11 9854 


Homo sapiens 


PR01843 


522 


100 


8094 


X92485 


Plasmodium 
vivax 


pval 


115 


82 


8095 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


598 


86 


8096 


AF097473 


Mus musculus 


ORFl 


109 


46 


8097 


AF097473 


Mus musculus 


ORF1 


109 


46 


8098 


Y17833 


Human 
endogenous 
retrovirus K 


pol protein 

* 


281 


73 


8099 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


156 


69 


8100 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


126 


42 


8101 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


8102 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


145 


70 


8103 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


163 


71 
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A 4 A A 

8104 


AF 130089 


Homo sapiens 


PRO2550 


| 130 


42 


, 8105 


G0341 1 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


4 m) A 9 

116 


57 








7492. 






8106 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


4. 

132 


mi 

61 








derived protem #153. 






8107 


AF090942 


Homo sapiens 


PRO0657 


123 


41 


8108 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


134 


38 


8109 


AL160371 


Leishmania 


probable (hhv-6) u 1 1 02, variant a 


101 


77 




■ 


major 


DNA, complete virion genome 






8110 


D38112 


Homo sapiens 


cytochrome c oxidase summit 1 


340 


90 


8111 


AF130089 


Homo sapiens 


PRO2550 


123 


86 


8112 


AP000616 


Oryzasariva 


similar to RING-H2 finger protein 


122 


90 






* 


RHAla(AF078683) 


• 




8113 


G02872 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


182 


47 








6953. 


- 




8114 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


77 








4435. . 






8115 


G00452 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


72 


50 








4533. 


- 




8116 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 3 


459 


88 


8117- 


L17318 


Rattus 


proline-rich proteoglycan 


110 


45 






norvegicus 








8118 


S79410 


Mus musculus 


nuclear localization signal binding 


125 


62 






■ 


protein 






8119 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


78 




■ 




7720. 






8120 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


106 


84 


8121 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


no 


45 


■ 






6571. 






8122 


D86854 


Catharanthus 


extensin 


142 


35 






roseus 






. 


8123 


X92485 


Plasmodium 


pval 


144 


53 . 


» 




vivax 


• 






8124 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


59 


■ 






7870. 






8125 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


67 




■ 




6613. 






8126 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


145 


62 








gene 22 clone HMSJW1 8, 






8127 


X07881 


Homo sapiens 


proline-rich protein Gl 


142 


37 


8128 


X70343 


Nicotiana 


extensin 


102 


29 




■ 


sylvestris 






- ■ 


8129 


W48351 


Homo sapiens 


Human breast cancer related protein 


113 


48 








BCRB2. 






8130 


AF090895 


Homo sapiens 


PRO0117 


150 


71 


8131 


D00570 


Mus musculus 


open reading frame (251 AA) 


130 


41 


8132 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


61 




* 




6623. 






8133 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


128 


61 








encoded by gene 17. 




• 


8134 


AF130079 


Homo sapiens 


PR02852 


181 


77 


8135 


/-I /v*t nil 

G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


109 


58 








7795. 






8136 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


64 








7437. 






8137 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


112 


43 








protein CON-1. 






8138 


AF 119900 


Homo sapiens 


PR02822 


113 


57 


8139 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 3 


427 


86 



347 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


8140 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


155 


88 


8141 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


163 


58 


8142 


D70836 


Homo sapiens 


Zinc-finger protein 


156 


47 


8143 


AC002291 


Arabidopsis 
thaliana 


extensin 


153 


37 


8144 




Lycopersicon 
esculentum 


extensin (class I) 


85 


34 


8145 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


100 


76 


8146 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


108 


63 


8147 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


110 


79 


8148 


M22332 


Homo sapiens 


unknown protein 


156 


40 


8149 


U23455 


Caenorhabdkis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


92 

■ 


76 


8150 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


131 


66 


8151 


U49973 

* 4 1 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


441 


76 


8152 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 

- 


67 


8153 


AF 130089 


Homo sapiens 


PRO2550 


129 


42 


8154 


AF130089 


Homo sapiens 


PRO2550 


173 


50 


8155 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


64 


8156 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


171 


82 


8157 


L27428 


Homo sapiens 


reverse transcriptase 


181 


50 


8158 


U11271 


Homo sapiens 


thromboxane A2 receptor ~^ 


138 


73 


8159 


G02538 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


145 


53 


8160 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


139 


58 


8161 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


155 


59 


8162 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


64 


8163 


AF161361 


Homo sapiens 


HSPC098 


89 


66 


8164 


G03797 


Homo sapiens 


Human secreted protein, SEQ WNO: 
7878. 


117 


58 


8165 


AF130089 


Homo sapiens 


PRO2550 


121 


63 


8166 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


148 


85 


8167 


X92485 


Plasmodium 
vivax 


pval 

• 


97 


56 


8168 


X92485 


Plasmodium 
vivax 


pval 


115 


57 


8169 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


124 


61 


8170 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


60 


8171 


X92485 


Plasmodium 
vivax 


pval 


105 


44 


8172 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


75 


64 


8173 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


106 


65 


8174 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


108 


44 
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Smith- 
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Score 


% 

Ideotit . 

y 


8175 


Y36203 


Homo sapiens 


Human secreted protein #75. 


150 


51 


8176 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


114 


41 


8177 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


135 


68 


8178 


AF090928 


Homo sapiens 


PRO0470 


119 . 


72 


8179 


Y02671 


1 Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


141 


75 


8180 


AF220264 


Homo sapiens 


MOST-1 


94 


61 


8181 


AF1 19851 


Homo sapiens 


PR01722 


143 


73 


8182 


W48351 

- 


Homo sapiens 

* 


Human breast cancer related protein 
BCRB2. 


134 


64 


8183 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


143 


59 


8184 


X92485 


Plasmodium 
vivax 


pval 


127 


72 


8185 


X92485 

* 


Plasmodium 
vivax 


pval 


130 


65 


8186 


AF1 19851 


Homo sapiens 


PR01722 


285 


64 


8187 


AF090942 


Homo sapiens 


PRO0657 ! 137 


38 


8188 


AF090931 


Homo sapiens 


JrKUU4oj 


1 j4 


66' 


8189 


Y02693 


Homo sapiens 


riuman secreieo protein encoded oy 
gene 44 clone HTDAD22. 


14<S 


43 


8190 


U49973 


Homo sapiens 


ukt j , iyXcKj /, putative transposase 

a mi liar lu pugu cicmcm 


ZOO 


79 


8191 


G02639 


Homo sapiens 


Unman ce*r*rr*t(*r\ nrntpin Ct7f^ TT~\ WfV 

nuinnn scucicu pruicui, ady xu iskj. 

6720. 


1 1 n 

1 


43 


8192 


Y02671 


Homo sapiens 


Human cp/TPtpH nrntPtn mipaHpH V\\r 

eene 22 clone HMSJW18. 


170 


78 


8193 


AF1 13685 


Homo sapiens 


PRO0974 


86 


52 


8194 


Y36203 


Homo sapiens 


Human secreted protein #75. 


132 


61 


8195 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


225 


77 


8196 


S80119 


Rattus sp. 


reverse transcriptase homolog 


105 


40 


8197 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


124 


39 


8198 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


121 


60 


8199 • 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


91 


68 


8200 


Y36203 


Homo sapiens 


Human secreted protein #75. 


131 


56 


8201 


Y36203 


Homo sapiens 


Human secreted protein #75. 


136 


44 


8202 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


166 


48 


8203 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


50 


8204 


Y36203 


Homo sapiens 


Human secreted protein #75. 


127 


58 


8205 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


139 


61 


8206 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


190 


92 


8207 


AF090931 


Homo sapiens 


PRO0483 


160 


87 


8208 


AF126163 


Homo sapiens 


HHLA3 protein 


107 


67 


8209 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


93 


41 


8210 


Y36156 


Homo sapiens 


Human secreted protein #28. 


100 


62 


8211 


AF161356 


Homo sapiens 


HSPC093 


73 


60 


8212 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


107 


34 


8213 [ AB046048 


Macaca 


unnamed portein product 


146 


61 
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TEQ ID 

NO: 


Accession too. 


Species 

>** 


Description 

* 


Smith. 

Waterman 

Score 


% 

Idcntit 

y 






fascial laris 








8224 


AF130079 


Homo sapiens 


PRG2852 


121 


59 


8215 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
denved protem #153. 


109 

• * 


50 


8216 


AF090930 


Homo sapiens 


PRO0478 


153 


79 


8217 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


105 


60 


8218 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


135 . 


56 


8219 


Ml 1901 


Ramos 
norvegicus 


proline-rich salivary protein 


102 


42 


8220 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


95 


32 


8221 


X52235 


Homo sapiens 


ORFII 


174 


39 


8222 


AF130089 


Homo sapiens 


PRO2550 


159 


67 


8223 


AF068294 


Homo sapiens 


HDCMB45P 


161 


58 


8224 


M62387 


Oryctolagus 
cuniculus 


ubiquitin conjugating-protein 


219 


90 

» 


8225 


AFU3685 


Homo sapiens 


PRO0974 


136 


65 


8226 


D38112 


Homo sapiens 


MADH dehydrogenase subunit 3 


430 


85 ' 


8227 


Y02886 

■ ■ 


Homo sapiens 

• 


Fragment of human secreted protein 
encoded by gene 90. 


115 


58 


8228 


Y36156 


Homo sapiens 


Human secreted protem #28. 


117 


50 


8229 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


44 


8230 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


145 

* 


50 


8231 


AP000616 


Oryzasativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


105 


46 


8232 


S62928 


Homo sapiens 


PRB 1 M protein precursor 


113 


37 


8233 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


142 


62 


8234 


G02872 

i ! 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


104 


48 


8235 


U12695 


Pan paniscus 


cytochrome oxidase subunit II 


259 


84 


8236 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


116 


61 


8237 


AF1 16661 


Homo sapiens 


PR01438 


135 


56 


8238 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


107 


53 


8239 


U93572 


Homo sapiens 


p40 


168 


49 


8240 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


100 


72 


8241 


Z70292 


Homo sapiens 


chemokine CCM 


231 


83 


8242 


AL160493 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


119 


82 


8243 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


97 


83 


8244 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


114 


78 


8245 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4419. 


81 


65 


8246 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protem fragment 59. 


74 


60 


8247 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


115 . 


58 


8248 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


127 


58 


8249 


G03789 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7870. 


107 


84 


8250 


AF1 1 8082 | Homo sapiens 


PRO 1902 


136 


50 
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1 k£6tD 
NO: 


i Accession Na. 


oumw 


A 


Smith- 

Waterman 

Score 


% 

Identtt 

y 


8251 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


196 


59 


8252 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


106 


81 


8253 


AF130089 


Homo sapiens 


PRO2550 


143 


53 


8254 


AK024455 


Homo sapiens 


FU00047 protein 


151 


68 


8255 


AK022821 


Homo sapiens 


unnamed protein product 


140 


68 


8256 


G03714 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
7795. 


143 


53 


8257 


M24509 


Mus musculus 


ferritin heavy chain 


347 


69 


8258 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


142 


57 


8259 


U93570 


Homo sapiens 


p40 


254 


57 


8260 


AF218028 


Homo sapiens 


unknown 


129 


71 


8261 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


77 


68 


8262 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


604 


90 


8263 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


60 


8264 


AL 139298 


Streptomyces 
coelicolor A3(2) 


putative membrane protein 


92 


34 


8265 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


138 


70 


8266 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


100 


64 


8267 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


113 


38 


8268 


X92485 . 


Plasmodium 
vivax 


pval 


102 


63 


8269 


X61048 


Hydra sp. 


mini-collagen 


110 


46 


8270 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


88 


82 


8271 


G00397 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
4478. 


121 


75 


8272 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


48 


8273 


AF090944 


Homo sapiens 


PRO0663 


122 


42 


8274 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


81 


85 


8275 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


140 


54 


8276 


AF130089 


Homo sapiens 


PRO2550 


275 


56 


8277 


L27428 


Homo sapiens 


reverse transcriptase 


105 


27 


8278 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


112 


54 


8279 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


159 

• 


73 


8280 


G04072 


Homo sapiens 

> 


Human secreted protein, SEQ ID NO: 
8153. 


109 


71 


8281 


AF104921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


188 


42 


8282 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


117 


39 


8283 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


96 


36 


8284 


AF225910 


Mus musculus 


DAZ-associated protein I 


117 


33 


8285 


AF 130089 


Homo sapiens 


PRO2550 


347 


75 


8286 


AB046100 


Macaca 
fascicular is 


unnamed protein product 


140 


45 


8287 


X55684 


Lycopersicon 


extensin (class I) 


67 


45 
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Score 


% 

Identit 

y 






esculentuin 








8288 


AF029670 


Homo sapiens 


Tfti J Am ft A • J ^ * ♦ . _ 

Rad5 1 C truncated protein 


186 


69 


8289 


«rA^ An /" 

Y02886 


Homo sapiens 

r m 


Fragment of human secreted protein 
,.encoded by gene 90. 


149 


50 


8290 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


119 


67 


8291 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A *% O 

4438. 


ft AW*\ 

152 


50 


8292 


D89999 


Gallus gallus 


muscle derived protein 


1134 


66 


8293 


LI 1566 


Homo sapiens 


nbosomal protein L 1 8 


A*\ *± r 

335 


69 


8294 


W61170 


Homo sapiens 


v\ . • ft « « • f r» 

Partial polypeptide sequence from 
CP140 partial cDNA. 


J 4 4 

441 


95 


8295 


Awm AmjM 

S62077 


Homo sapiens 


HPlHs alpha=25 kda chromosomal 
autoantigen 


175 


94 


8296 


AF 119882 


Homo sapiens 


PR02492 


m A**. AW 

125 


48 


8297 


G02532 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6613. 


^ AW at* 

156 


54 


8298 


AKQ25116 


Homo sapiens 


unnamed protein product 


259 


82 


8300 


X92485 


Plasmodium 

• 

vivax 


pvaJ 


162 


47 


8301 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


157 


76 


8302 


AF092176 


Homo sapiens 


MHC class II antigen 


280 


84 


8303 


AF 11 8082 


Homo sapiens 


PRO1902 


174 


52 


8304 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


105 


50 


8305 


AK023563 


Homo sapiens 


unnamed protein product 


2*4 


60 


8306 


G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7795. 


136 


66 


8307 


U15647 


Mus musculus 


reverse transenptase 


107 


38 


8308 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


110 


40 


8309 


AB005540 


Rattusrattus 


PCTAIRE2 


166 


A9 A*\. 

59 


8310 


m at mm, *v _p~ j~l 

M37679 


m ft 

Mus musculus 


Ig heavy chain precursor 


95 


64 


8311 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


137 


82 


8312 


W29643 


Homo sapiens 


Human secreted protem CW383_1 . 


312 


AM Af*A A*k, 

100 


8313 


AFl 13685 


Homo sapiens 


PRO0974 


116 


39 


8314 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


128 


69 


8315 


Z28407 


Homo sapiens 


ribosomal protein L8 


497 


mm tf\. 

78 


8316 


A # > a*— Am a*%> a*" 

AE003536 


Drosophila 
melanogaster 


A* At A^ ■% « 1 

CG6451 gene product 


237 


m ft 

51 


8317 


M19419 


Mus musculus 


prolme-nch salivary protem 


115 


/ft /V 

39 


8318 


G03786 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7867. 


92 


53 


8319 


Y29862 


Homo sapiens 


Human secreted protein clone 
du515 21. 


399 


81 


8320 


Z34278 


Homo sapiens 


mucin 


103 


25 


8321 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


142 


91 


8322 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


82 


62 


8323 


Z46791 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01391 (Collagen triple helix repeat 
(20 copies)), Scokf67.1, E- 
value=1.2e-16, N=3; PF01484 
(Nematode cuticle collagen N-terminal 
domain), Score=82.0, E-value=3.9e-21, 
N=l 


122 


37 

« 


8324 


AF021232 


Homo sapiens 


TRAIL-R4-A 


113 


100 
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NO: 


'Accession No. 


"Species 


bescrtDtion 


Smith- 

Waterman 

Score 


'% 
Identit 

y 


8325 


AP046085 


Macaca 
fasciculans 


unnamed protein product 


292 


75 


8326 


AF1 13944 


Muntiacus 
muntjak 


prion protein 


96 


39 

• 


8327 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


125 


63 


8328 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


123 


62 


8329 


AJ005897 


Homo sapiens 


JM5 


275 


mm mm 

73 


8330 


AF1 19900 


Homo sapiens 


PR02822 ■ 


138 


76 


8331 


AF163573 


Homo sapiens 


CARKL 


359 


84 


8332 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


103 


37 


8333 


AF026246 


Homo sapiens 


HERV-E integrase 


354 


78 


8334 


J03756 


Homo sapiens 


hGH-V2 


362 


82 


8335 


AB001517 


Homo sapiens 


KNP-I alpha protein 


126 


76 


8336 


X97490 


Mus m us cuius 


PNG protein 


469 


98 


8337 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


74 


8338 


V18354_cdl 


Homo sapiens 


17-NOV-1992 Human R20 seven 
transmembrane (7TM) receptor 
genomic DNA. 


379 

- 


92 


8339 

• 


V18354_cdl 

■ 


Homo sapiens 

- 


17-NOV-1992 Human R20 seven 
transmembrane (7TM) receptor 
genomic DNA. 


402 


93 


8340 


Z11502 


Homo sapiens 


intestine-specific annexin 


385 


89 


8341 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


110 


54 


8342 


AC003113 


Arabidopsis 
thaliana 


F240I.6 


68 


50 


8343 


X12517 


Homo sapiens 


C protein (AA 1-159) 


321 / 


82 


8344 


D86549 


Homo sapiens 


p97 homologous protein 


279 


96 


8345 


Y73384 


Homo sapiens 


HTRM clone 2284580 protein 
sequence. 


1513 


100 


8346 


AP001745 


Homo sapiens 


human cDNA DKFZp586F0422, 
Accession No. AL050 173 


118 


40 


8347 

.. 


G02832 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


121 

. 


49 


8348 


AF091090 


Homo sapiens 


unknown 


125 


44 


8349 


AF106542 


Homo sapiens 


pregnancy-specific beta-1 glycoprotein 
1 


145 


100 


8350 


AF090942 


Homo sapiens 


PRO0657 


252 


65 


8351 ■ 


L23116 


Homo sapiens 


galctocerebrosidase 


185 


85 


8352 


AJ243803 


Strep tomyces 
coelicolorA3(2) 


hypothetical protein 


97 


36 


8353 


Y11339 


Homo sapiens 


GauVAc aJpha-2,6-sialyltransferase I 


306 


100 


8354 


U09477 


Homo sapiens 


p53 -binding protein 


388 


100 


8355 


L27584 


Homo sapiens 


Ca channel B3 subunit 


197 


100 


8356 


AF071569 


Homo sapiens 


multifunctional calcium/calmodulin- 
dependent protein kinase U delta2 
isoform 


574 

• 


100 


8357 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 
haemoglobin gene contained m pSSI. 


321 


81 


8358 


Y12916 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


759 


100 


8359 


AF038961 


Homo sapiens 


SL15 protein 


166 


100 


8360 


W75093 


Homo sapiens 


Human secreted protein encoded by 
gene 37 clone HFVGS85. 


328 


90 


8361 


R95326 


Homo sapiens 


Tumor necrosis factor receptor 1 death 
domain ligand (clone 2DD). 


386 


97 
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PCT/US01/04927 



SEQ ID 
NO: 



Accession No. 



Species 



Description 



Smith- 

Waterman 

Score 



Identit 

y 



8362 



AF026954 



Bos taurus 



pyruvate dehydrogenase phosphatase 
regulatory subunit precursor; PDPr 



269 



88 



8363 



X07855 



Homo sapiens 



G-Protein 



289 



98 



8364 



AL1 10247 



Homo sapiens 



hypothetical protein 



265 



97 



8365 



AB046774 



Homo sapiens 



K1AA1554 protein 



3619 



99 



8366 



M10050 



Homo sapiens 



fatty acid binding protein 



97 



100 



8367 



M20882 



Homo sapiens 



pregnancy-specific beta-glycoprotein e 



1097 



99 



8368 



AB051901 



Homo sapiens 



VDUPl 



662 



98 



8369 



U72678 



Mus musculus. 



EF-9 



176 



65 



8370 



AL 117237 



Homo sapiens 



hypothetical protein 



478 



60 



8371 



M10617 



Homo sapiens 



L-FABP 



104 



84 



8372 



AJ249980 



Homo sapiens 



hypothetical protein 



1527 



97 



8373 



AF003136 



Caenorhabditis 
elegans 



contains weak similarity to an AMP- 
binding motif 



242 



55 



8374 



8375 



X91638 



Gallus gallus 



BRM protein 



225 



W75093 



Homo sapiens 



Human secreted protein encoded by 
gene 37 clone HFVGS85. 



369 



90 



100 



8376 



8377 



Z36714 



Homo sapiens 



cyclinF 



519 



X86401 



Homo sapiens 



L-arginine: glycine amidinotransferase 



200 



95 



97 



8378 



M12413 



Homo sapiens 



apolipoprotein B100 



1278 



99 



8379 



G00328 



Homo sapiens 



Human secreted protein, SEQ ID NO: 
4409. 



159 



66 



8380 



8381 



AF151860 



Homo sapiens 



Y91577 



Homo sapiens 



CGI- 102 protein 



202 



Human secreted protein sequence 
encoded by gene 2 SEQ ID NO250. 



255 



77 



92 



8382 



AF017807 



Homo sapiens 



Arp2/3 complex 16kDa subunit 



418 



87 



8383 



U28727 



Homo sapiens 



pregnancy-associated plasma protein-A 
preproform 



196 



97 



8384 



G03800 



Homo sapiens 



Human secreted protein, SEQ ID NO: 
7881. 



128 



85 



8385 



G02827 



Homo sapiens 



Human secreted protein, SEQ ID NO: 
6908. 



203 



69 



8386 



AF090931 



Homo sapiens 



PRO0483 



145 



77 



8387 



S63758 



Mus sp. 



metallothionein-I gene transcription 
activator 



324 



98 



8388 



W89030 



Homo sapiens 



Polypeptide fragment encoded by gene 
165. 



174 



66 



8389 



G00500 



Homo sapiens 



Human secreted protein, SEQ ID NO; 
4581. 



93 



70 



8390 



G02532 



Homo sapiens 



Human secreted protein, SEQ ID NO: 
6613. 



157 



46 



8391 



V29245 cdl 



Homo sapiens 



15-NOV-1996 Nucleotide sequence of 
the human selenium-binding protein. 



421 



82 



8392 



D38112 



Homo sapiens 



NADH dehydrogenase subunit 4L 



184 



77 



8393 



Y20717 



Homo sapiens 



Human neurofilament-M wild type 
protein fragment 59. 



99 



77 



8394 



AK023563 



Homo sapiens 



unnamed protein product 



285 



75 



8395 



G01984 



Homo sapiens 



Human secreted protein, SEQ ID NO: 
6065. 



161 



75 



8396 



G00407 



Homo sapiens 



Human secreted protein, SEQ ID NO: 
4488. 



121 



61 



8397 



P70494 



Homo sapiens 



Sequence of human B-cell growth 
factor (BCGF). 



108 



70 



8398 



G03714 



Homo sapiens 



Human secreted protein, SEQ ID NO: 
7795. 



150 



57 



8399 



G03714 



Homo sapiens 



Human secreted protein, SEQ ID NO: 
7795. 



108 



63 



8400 



W88607 



Homo sapiens 



Secreted protein encoded by gene 74 
clone HHSEG23. 



146 



55 



354 



NO: 








Smith- 

Waterman 

Score 


Ideatft 

y 


8401 


GQ0407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


90 


69 


8402 


AF1 19851 


Homo sapiens 


PR01722 


133 


65 


8403 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


^67 


76 


8404 


L06498 


Homo sapiens 


ribosomal protein S20 


170 


94 


8405 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


58 


8406 


AF1 18086 


Homo sapiens 


PR01992 


142 


80 


8407 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO:' 
7881. 


153 


81 

■ 


8408 


AF090919 


Homo sapiens 


PRO0327 


96 


74 


8409 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


127 


62 


8410 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


150 


100 


8411 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


103 


75 


8412 


AFM6715 . 


Homo sapiens 


PR02829 


158 


76 


8413 


AF229067 


Homo sapiens 


PADI-H protein , 


137 


56 


8414 


G03793 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


131 


77 


8415 


AF068294 


Homo sapiens 


HDCMB45P 


132 


54 


8416 


W34499 


Homo sapiens 


Obesity receptor C protein. 


251 


77 


8417 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


272 


57 


8418 


S79410 


Mus museums 


nuclear localization signal binding 
protein 


113 


54 


8419 


AK025047 


Homo sapiens 


unnamed protein product 


128 


58 


8420 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


130 


46 


8421 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


135 


56 


8422 


AF130051 


Homo sapiens 


PRO0898 


164 


72 


8423 


G00092 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4173. 


161 


96 


8424 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


108 


100 


8425 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


134 


75 


8426 

• 


G03798 


Homo sapiens 

■ . 


Human secreted protein, SEQ ID NO : 
7879. 


91 


48 

■ 


8427 


AF1 94641 


Homo sapiens 


immunoglobulin lambda light chain 
variable region 


174 


69 


8428 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


146 


68 


8429 


M64322 


Homo sapiens 


protein tyrosine phosphatase 


201 


100 


8430 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


135 


55 


8431 


AF090931 


Homo sapiens 


PRO0483 


158 


87 


8432 


J04495 


Macaca mulatta 


alpha-globra 


150 


100 


8433 


1 Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


150 


78 


8434 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


137 


48 


8435 


AJ272324 


Homo sapiens 


adaptor molecule-1 


333 


100 


8436 

* 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


91 


48 


8437 


Ml 8085 


Homo sapiens 


glycoprotein lib 


154 


100 
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CPA If) 

NO: 


Arpp&cinn KIa 






Smith- 

Waterman 

Score 


% 

Identit 

y 


8438 


X92485 


Plasmodium 
vivax 


pval 


138 


64 


8439 


G03136 

• 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7217. 


126 


56 


8440 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


105 


72 


8441 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


122 


55 


8442 


G02872 : 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


139 


69 


8443 


Y36203 


Homo sapiens 


Human secreted protein #75. 


103 


70 


8444 


AB034633 


Mus musculus 


deubiquitinarJng enzyme 


129 


100 


8445 


U01849 


Trypanosoma 
brucei 


ORF2 


95 


41 


8446 


J05497 


Rattus 
norvegicus 


snRNP-associated polypeptide N 


115 


37 


8447 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


137 


50 


8448 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


222 


100 


8449 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


168 


94 


8450 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQJDV(0:163. 


152 


63 


8451 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


162 


91 


8452 


W78226 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1. 


8<H 


97 


8453 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


109 


100 


8454 


G03 172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


138 


88 


8455 


AF099505 


Homo sapiens 


colon carcinoma related protein 


100 


63 


8456 


AF217511 


Homo sapiens 


uncharacterized bone marrow protein 
BM034 


130 


96 


8457 


Y86248 

■ 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


155 


70 


8458 


LI 7007 


Didelphis 
marsupialis 


protamine 1 

• 


101 

■ 


80 


8459 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


145 


62 


8460 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


121 


75 


8461 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


110 


81 


8462 


AF016371 


Homo sapiens 


U-snRNP-associated cyclophilin 


108 


70 


8463 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


80 


93 


8464 


AF130089 


Homo sapiens 


PRO2550 


190 


65 


8465 


AF197913 


Helicoverpa 
armigera nuclear 
polyhedrosis 
virus 


basic DNA-binding protein BDBP 

r 


159 


55 

• 


8466 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


183 


70 


8467 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


97 


60 


8468 . 


AF1 18082 


Homo sapiens 


PRO1902 


72 


42 


8469 


AF217449 . 


Schistosoma 
mekongi 


NADH dehydrogenase subunit 6 


107 


40 


8470 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


75 
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SEQID 
NO: 


\ Accession No. 


Species 

• 


Description 


Smith. 

Waterman 

Score 


% 

Identft 

y 








mm m* » 

7339. 






8471 


AF090931 


Homo sapiens 


PRO0483 


161 


79 


8472 


Y14482 


Homo sapiens 


^m^ mm m a # 

Fragment of human secreted protein 

111 4 *m 

encoded by gene 17. 


127 


56 


8473 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


50 


8474 


AF022789 


Homo sapiens 


ubiquitin hydrolyzing enzyme 1 


140 


100 


8475 


X03145 


Homo sapiens 


pot. ORF VI 


349 


81 


8476 

■ 


G03714 


Homo sapiens 

. 


Human secreted protein, SEQ ID NO: 
7795. 


152 

• 


65 


8477 


R59842 


Homo sapiens 


ApoE4Ll protease. 


115 


69 


8478 


G03714 


Homo sapiens . 

• 


Human secreted protein, SEQ ID NO: 
7795. 


112 


56 


8479 


K01664 

• 


Drosophila 
melanogaster 


Bkm-Iike protein 


140 


44 


8480 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536; 


95 


70 


8481 


AF090931 


Homo sapiens 


PRO0483 


111 


67 


8482 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


122 


92 


8483 


M69297 


Homo sapiens 


ORF 3 


102 


39 


8484 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


148 


72 


8485 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


136 


65 


8486 


AF032906 


Homo sapiens 


cathepsin Z precursor; CTSZ 


299 


100 


8487 


X15917 

♦ 


Paramecium 
aurelia 


ND2 protein (AA 1-193) (unusual start 
codon) 


94 


35 


8488 


U93574 


Homo sapiens 


putative pi 50 


109 


35 


8489 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


142 


72 


8490 


X92485 


Plasmodium 
vivax 


pval 

* 


120 

■ 


85 


8491 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


109 


42 


8492 


Ml 5530 


Homo sapiens 


B-cell growth factor 


126 


64 


8493 


AF090895 


Homo sapiens 


PRO0117 


123 


60 


8494 


G02403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6484. 


166 


100 


8495 


AF 116661 


Homo sapiens 


PR01438 


126 


63 


8496 


AF130089 


Homo sapiens 


PRO2550 


298 


70 


8497 


AF090895 


Homo sapiens 


PRO0317 


113 


75 


8498 


AF090930 


Homo sapiens 


PRO0478 


142 


68 


8499 


AF126163 


Homo sapiens 


HHLA3 protein 


139 


72 


8500 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6441. 


109 


75 


8501 


S79410 


Mus museums 


nuclear localization signal binding 
protein 


98 


48 


8502 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


132 


63 


8503 


AF119851 


Homo sapiens 


PR01722 


107 


53 


8504 


G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7795. 


158 


67 


8505 


AF090931 


Homo sapiens 


PRO0483 


128 


50 


8506 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


126 


56 


8507 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


154 


82 


8508 


X92485 


Plasmodium 


pval 


83 


70 



357 



WO 01/64835 PCT/US01/04927 

i 

i 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman . 
Score 


% 

Uentit 

y 






' vivax 








S509 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


144 


50 


8510 


AL160493 


Leishmania 

* 

major 


probable (hhv-6) ul 102, variant a 
DNA, complete vinon genome 


124 


86 


8511 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


141 


51 


8512 


AF229067 


Homo sapiens 


PADI-H protein 


155 


76 


8513 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


170 


68 


8514 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


157 


66 


8515 


AF090930 


Homo sapiens 


PRO0478 


88 


80 


8516 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


134 

■ 


59 


8517 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


102 


92 


8518 


M31145 


Homo sapiens 


insulin-like growth factor binding 
protein precursor 


460 


96 


8519 


M18193 


Homo sapiens 


inter-alpha-trypsin inhibitor heavy 
chain old gene name 'ITT 


118 


41 


8520 


AF1 16689 


Homo sapiens 


PR02168 


127 


92 


8521 


AF130051 


Homo sapiens 


PRO0898 


133 


85 


8522 


L27428 


Homo sapiens 


reverse transcriptase 


201 


72 


8523 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


108 


74 


8524 


AF161356 


Homo sapiens 


HSPC093 


1?7 


60 


8525 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


95 


83 


8526 


AF 118086 


Homo sapiens 


PR01992 


95 


64 


8527 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


108 


50 


8528 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


154 

■ 


62 


8529 


AF 130089 


Homo sapiens 


PRO2550 


150 


70 


8530 


Y03000 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


111 


55 


8531 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


108 


65 


8532 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


172 


59 


8533 


D38112 


Homo sapiens 


NADH dehydrogenase subumt 4L 


395 


91 


8534 


S79410 

i ■ 


Mus musculus 


nuclear localization signal binding 
protein 


100 


45 


8535 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


107 


95 


8536 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


128 


89 

. . 


8537 


Z28407 


Homo sapiens 


ribosomal protein L8 


242 


79 


8538 


AF130089 


Homo sapiens 


PRO2550 


139 


71 


8539 


Q55625_cdl 


Homo sapiens 


22-JUN- 1 992 Human beta globin 5'- 
UTR-CDS-3-UTR. 


274 


76 


8540 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


154 


61 


8541 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


69 


8542 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


64 


8543 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


240 


61 
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i ■ 



CPA If) 

NO: 


/tikcaaiuij no. 


Qnn^M 
^VCUw 


1\Mfri ntintt 
jLrovrijJuun 


Smith. 

Waterman 
Score 


Identit 

y 




I 




6953. 






8544 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


117 




8545 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


110 


'95 


8546 


AF1 19851 


Homo sapiens 


PR01722 


146 


80 


8547 


U49973 


Homo sapiens 


ORF2: function unknown 


105 


86 


8548 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


71 


8549 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO:' 
6619. 


152 


53 


8550 


G02832 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
6913. 


80 


40 


8551 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


.140 


74 


8552 


V00672 


Pan troglodytes 


reading frame protein 4 


142 


92 


8553 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


164 


63 


8554 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


148 


71 


8555 


AB046100 


Macaca 
fascicularis 


unnamed protein product 


144 


58 


8556 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


322 


73 


8557 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


140 


62 


8558 


U30888 


Homo sapiens 


tRNA-Guanine Transglycosylase 


276 


86 


8559 

i 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


114 


71 


8560 

- 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


108 


72 


8561 


G03136 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7217. 


no 


57 


8562 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


140 


83 


8563 


W34499 


Homo sapiens 


Obesity receptor C protein. 


188 


92 


8564 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


124 


56 


8565 


L07946 


Volvox carteri 


histoneHl-I 


108 


38 


8566 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


132 


77 


8567 


M13941 


Human 
herpesvirus 4 


nuclear antigen 1 


100 


43 


8568 


Y36156 


Homo sapiens 


Human secreted protein #28. 


136 


67 


8569 


X70343 


Nicotiana 
sylvestris 


extensin 


92 


34 


8570 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


381 


87 


8571 


V00672 


Pan troglodytes 


reading ftame protein 4 


146 


93 


8572 


D38116 


Pan paniscus 


ATPase subunit 6 


116 


96 


8573 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


132 


52 


8574 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


217 


100 


8575 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 


271 


84 


8576 


D38112 


Homo sapiens 


ATPase subunit 6 


117 


96 


8577 


D38112 


Homo sapiens 


ATPase subunit 6 


180 


93 


8578 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


179 


91 


8579 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


189 


88 
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SEQ ID 


Accession No. 


S pedes 


Description 


Smith- 


% 


NO: 






Waterman 


Identit ' 










Score 


y 


8580 


M15530 


Homo sapiens 


B-cell growth factor 


109 


71 


8581 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF68, 


127 


77 








SEQIDNO:163. 






8582 


AL359782 


Trypanosoma 


probable similar to ring-h2 ringer 


109 


47 






brucei 


protein rhala. 






8583 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


131 


71 




* 




SEQIDNO:163. 


* 




8584 


Y91577 


Homo sapiens 


Human secreted protein sequence 


241 


97 


• 


. ■ 




encoded by gene 2 SEQ ID NO:250. 


- 




8585 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


185 


80 


- 






SEQIDNO:163. 


* 




8586 


M28016 


Homo sapiens 


cytochrome b 


129 


92 


8587 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 


153 


96 


8588 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


176 


90 






SEQIDNO:163. 






8589 


Y08769 


Rattus 


microvascular endothelial 


109 


51 






norvegicus 


differentiation gene 2 






8590 


LO0OI6 


Homo sapiens 


urf4 


176 


94 


8591 


AF150105 


Homo sapiens 


small zinc finger-like protein 


523 


100 


8592 


U15306 


Homo sapiens 


NFX1 


123 


95 


8594 


R44519 


Homo sapiens 


Adult human alpha-globin VI 07S. 


222 


78 


8595 


Y03000 


Homo sapiens 


Fragment of human secreted protein 


128 


70 








encoded by gene 121. 






8596 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


151 


90 


8597 


X67688 


Homo sapiens 

XT 


transketolase 


123 


84 


8598 


L00016 


Homo sapiens 


urf4 


130 


85 


8599 


Y36366 


Homo sapiens 


Fragment of human secreted protein 


209 


54 








encoded by gene 3. 






8600 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


159 


89 








SEQIDNO:163. 






8601 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


54 








6953. 






8602 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


115 


43 


8603 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


120 


95 








RHAla(AF078683) 






8604 


S79980 


Bos taurus 


ribosomal protein L37 


109 


84 


8605 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


48 








7888. 






8606 


D38112 


Homo sapiens 


ATPase subunit 6 


202 


100 


8607 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


150 


42 






SEQIDNO:163. 


< 




8608 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


58 








4414. 






8609 


AF229067 


Homo sapiens 


PADI-H protein 


112 


78 


8610 


AF130089 


Homo sapiens 


PRO2550 


125 


86 


8611 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-5 


710 


97 


8612 


AL390114 


Leishraania 


extremely cysteine/valine rich protein 


104 


41 






major 




• 




86J3 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


112 


84 








SEQIDNO:163. 




■ 


8614 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


116 


70 








RHAla(AF078683) 






8615 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


123 


91 








RHAla(AF078683) 






8616 


M28016 


Homo sapiens 


cytochrome b 


148 


93 


8617 


AF1 18082 


Homo sapiens 


PRO1902 


94 


66 


8618 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


182 


82 








SEQIDNO:163. 






8619 


X02585 


Xenopus laevis 


unidentified open reading frame 1 (166 


170 


72 
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SEQID 
NO* 


Accession No. 


Species 


Description 

• 


Smith- 
Wotcrnifln ' 
Score 


% 

Identft ' 

y 




■ 




aa) 






8620 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


125 


57 


8621 


AC004450 


Arabidopsis 
thaliana 


putative extensin 


140 


48 


8622 

- 


Y86248. 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


195 


79 

- 


8623 


AF130089 


Homo sapiens 


PRO2550 


150 


78 


8624 


L00016 


Homo sapiens 


urf4 


280 


92 


8625 


AF130051 


Homo sapiens 


PRO0898 


161 


76 


8626 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


123 


49 


8627 


L00016 


Homo sapiens 


urf4 


201 


95 


8628 


D38112 


Homo sapiens 


NADH dehydrogenase subunit4L 


169 


82 


8629 


AF1 19851 


Homo sapiens 


PR01722 


171 


65 


8630 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


120 


92 


8631 


AP000616 


Oryza sativa 


similar to RING-H2 fmger protein 
RHAla (AF078683) 


108 


56 


8632 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


107 


95 


8633 


G03043 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 

7124; 


132 


86 


8634 


AJ224997 


Rattus 
norvegicus 


huntingtin 


73 


59 


8635 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


80 


8636 


M19061 


Ateles geoffroyi 


delta-globin 


100 


40 


8637 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


102 


65 


8638 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


104 


50 


8639 


AF1 16715 


Homo sapiens 


PR02829 


158 


76 


8640 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


142 


62 


8641 


Y09010 


Mus musculus 


Ser/Thr kinase 


193 


95 


8642 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


227 


79 


8643 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


116 


95 


8644 


AP000616 ' 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


134 


88 


8645 


V463U_cdl 


Homo sapiens 


12-JAN-1998 Human secreted protein 
cDNA from clone D157. 


197 


100 


8646 


Y17379 


Homo sapiens 


NADH ubiquinone oxidoreductase 51- 
kD subunit 


240 


97 


8647 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


142 


68 


8648 


AJ388520 


Canis familiaris 


Ribosomal protein 


134 


100 


8649 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


136 


56 


8650 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


116 


80 


8651 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


99 


36 


8652 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


136 


64 


8653 


AF229067 


Homo sapiens 


PADI-H protein 


134 


69 


8654 


AF090942 


Homo sapiens 


PRO0657 


103 


42 


8655 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


184 


84 
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> ■ 



SEQ ID 
NO: 


Accession No. 


Species 

* 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


8656 


U37359 


Homo sapiens 


MRE1 1 homologue hMrel 1 


238 


97 


8657 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


113 


60 


8658 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


83 


8659 


AF161356 


Homo sapiens 


HSPC093 


149 


64 


8660 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


121 


44 


8661 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


107 


37 


8662 


U 19098 


Lycopersicon 
chilense 


proline-rich protein 


72 

k 


48 


8663 


AF1 18082 


Homo sapiens 


PRO1902 


95 


75 


8664 


AF 130089 


Homo sapiens 


PRO2550 


307 


79 


8665 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


115 


40 


8666 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


102 


32 


8667 


AK026841 


Homo sapiens 


unnamed protein product 


276 


100 


8668 


AL1099$3 


Homo sapiens 


dJ1188J21.1 (FSH primary response 
(LRPR1, rat) homolog 1) 


597 


100 


8669 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


123 


58 


8670 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLT1. 


103 


65 


8671 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


121 


61 


8672 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


82 


8673 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


211 


93 


8674 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


126 


61 


8675 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


114 


66 


8676 


X55691 


Lycopersicon 
esculentum 


glycine-rich protein 


85 


38 


8677 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


188 


80 


8678 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


183 


75 


8679 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


71 


8680 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


96 


82 


8681 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


136 


88 


8682 


Y86248 


Homo sapiens 
* 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


137 

9 


93 


8683 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


107 


45 


8684 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


141 


90 


8685 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


127 


55 


8686 


W64486 


Homo sapiens 


Human DR3 protein. 


139 


100 


8687 


M18093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


134 


33 


8688 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


119 


35 


8689 


U36448 


Homo sapiens 


Ca2+-dependent activator protein for 


107 


91 
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* 



SEQ1D 
NO- 


Accession No. 


Species 

*> 


Description 


Smith- 

Wat»rnn>» 
fYMtCJ iuBu 

Score 


% 

Irian tit 

y 




• 




secretion 






8690 


G02409 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6490. 


80 


43 


8691 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


1-47. 


74 


8692 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


126 


95 


8693 


AF130089 


Homo sapiens 


PRO2550 


126 


35 


8694 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 


233 


97 


8695 


M11902 


Mus musculus 


proline-rich salivary protein 


98 


35 


8696 


W34499 


Homo sapiens 


Obesity receptor C protein. 


263 


83 


8697 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163., 


181 


73 


8698 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


48 


8699 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


88 


8700 


Y86248 


Homo sapiens 

i 


Human secreted protein HCHPF68. 
SEQ ID NO: 163. 


111 


51 


8701 


Y86248 

1 


Homo saoiens 


Human secreted nrotein HCHPF68. 
SEQ ID NO: 163. 


165 


73 


8702 

§ **** 


Y36366 


Homo saoiens 


Frasment of human secreted orotein 
encoded by gene 3. 


133 

m mw *r 


62 


8703 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


92 


8704 


D38112 


Homo sapiens 


ATPase subunit 6 


117 


96 


8705 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


97 


58 


8706 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


115 


79 


8707 


AF 118086 


Homo sapiens 


PR01992 


167 


81 


8708 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


114 


51 


8709 


LI 0908 


Mus musculus 


Gcapl gene product 


179 


38 


8710 


S79410 


Mus musculus 


nuclear localization signal binding 
protein- 


135 


69 


8711 


AF130089 


Homo sapiens 


PRO2550 


399 


78 


8712 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


107 


95 


8713 


AF 11 8082 


Homo sapiens 


PRO1902 


123 


58 


8714 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


173 


81 


8715 


V35554_cdl 


Homo sapiens 


2 1 -NOV- 1 996 Human tumour antigen 
protein (clone 3) encoding cDNA. 


258 


83 


8716 


AF130089 


Homo sapiens 


PRO2550 


143 


51 


8717 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


92 


52 


8718 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


147 


71 


8719 


AF153208 


Homo sapiens 


GC-rich sequence DNA-bindin e factor 

^mmr mt d^mrmt mr w ^» ^^m m^ ^Wmlm* ^m' ^mr mmmw ^m . m> mm ^mw mmm*^~ ^^—-^m^m^~ ^^^m ^m ^mw^w w* 

candidate 


115 


70 


8720 


AF090931 


Homo sapiens 

* 


PRO0483 


145 


87 


8721 


X92485 


Plasmodium 
vivax 


pval 


125 


79 


8722 


U72543 


Sus scrota 


ubiquitin-like/S30 ribosomal fusion 
protein 


201 


97 


8723 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


223 


85 


8724 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


121 


60 
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SEQID 
NO: 


* Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Ideotir 

y 


8725 


AF130051 


Homo sapiens 


PRO0898 


102 


72 


8726 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


185 


78 


8727 

't 


R24750 


Homo sapiens 


hGH variant #38- 174S 176Y 10F 14S 
18F21L 167S 171D 175T 179T. 


116 


82 


8728 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


92 


72 


8729 


X53375 


Helianthus 
animus 


anther-specific protein SF 1 8 


114 


39 


8730 


S74221 


Homo sapiens 


IK factor=cytokine down-regulating ' 
HLA class II 


159 


42 


8731 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


166 


89 


8732 


AB027508 


Homo sapiens 


h VPS 11 


194 


50 


8733 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


144 


93 


8734 


AF266479 


Homo sapiens 


rectachrome 1 


116 


74 


8735 


AL132841 . 


Caenorhabditis 
elegans 


Y15E3A.3 


125 


60 


8736 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


152 


65 


8737 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


79 

• 


56 


8738 


U93565 


Homo sapiens 


putative pi 50 


190 


52 


8739 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


117 


57 


8740 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


109 


70 


8741 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


165 


61 


8742 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 


213 


97 


8743 


M17887 


Homo sapiens 


acidic ribosomal phosphoprotein (P2) 


106 


74 


8744 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


116 


49 


8745 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


134 


68 


8746 


G03356 


^Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


135 


71 


8747 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


94 


40 


8748 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


173 


76 


8749 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


67 


8750 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


272 


57 

■ 


8751 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


131 


71 


8752 


AL451017 


Neurospora 
crassa 


related to U 1 SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


123 


37 


8753 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


180 


55 


8754 


Y27868 


Homo sapiens 


Human secreted protein encoded by . 
gene No. 107. 


105 


44 


8755 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 

• 


78 


8756 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


284 


87 


8757 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


190 


89 



364 



NO: 


nvwssstvij nil. 


ODCCIC9 

• 


A/C9vnpilUIl 

- 


Cmith- 

Waterman 
Score 


% "1 
Idcntit 

y 


8758 


G00689 


Homosapiens 


Human secreted protein, SEQ ID NO: 
4770. 


103 


40 


8759 


G03797 


Homo sapiens 


Human secreted protein, SEQ JD NO: 
7878. 


120 


61 


8760 


AF132984 


Homosapiens 


nuclear pore complex interacting 
protein NPIP 


196 


94 


8761 


AF130079 


Homo sapiens 


PR02852 


125 


65 


8762 


AF 1 55232 


Pisum sativum 


extensin 


177 


42 


8763 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


128 


95 


8764 


X75438 


Susscrofa 


homologue of proline/arginine rich 
antibacterial peptides. 


96 

<* 


38 


8765 


Y86248 

■ 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


157 


96 


8766 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


137 


75 


8767 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


207 


95 


8768 


AL161543 . 


Arabidopsis 
thaliana 


extensin like protein 


100 


38 


8769 


K03202 


Homosapiens 


salivary proline-rich protein precursor 


no ■ 


47 


8770 


D38112 


Homo sapiens 


NADH dehydrogenase subunrt 4L 


178 


78 


8771 


D83407 


Homo sapiens 


1 a thyroid hormone responsive gene in 
human skin fibroblasts 


133 


100 


8772 


V00662 


Homo sapiens 


URF A6L (NADH dehydrogenase 
subunit) 


172 


73 


8773 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 
sequence. 


107 


66 


8774 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


117 


100 


8775 


AF090931 


Homo sapiens 


PRO0483 


175 


88 


8776 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


125 


58 


8777 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


126 


73 


8778 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


154 


70 


8779 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 1 02, variant a dna, 
complete virion genome. , 


149 


50 


8780 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


170 


91 


8781 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


103 


65 


8782 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


184 


70 


8783 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


169 


97 


8784 


U55376 


Caenorhabditis 
elegans 


F16H1 1 2 gene product 


141 


90 


8785 


U55376 


Caenorhabditis 
elegans 


F16H1 12 gene product 


150 


100 

• 


8786, 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


127 


96 


8787 


AF266479 


Homo sapiens 


rectachrome 1 


148 


61 


8788 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein Barffi (PIR: A4 1 726) 


106 


95 


8789 


AL132841 


Caenorhabditis 
elegans 


Y15E3A3 


162 


96 


8790 


AC006693 


Caenorhabditis 


Hypothetical protein W02H5.e 


180 


94 



365 
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SEQ1D 
NO: 


Accession No. 


Species 


DescriptioD 


Smith- 

Waterman 

Score 


% 

Identit 

y 






elegans 








8791 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


159 

. 


80 


8792 


AF150105 


Homo sapiens 


small zinc finger-like protein 


505 


98 


8793 


AL390935 


Leishmania 
major 


probable proO 195 


85 


31 


8794 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


115 


55 


8795 


AF118082 


Homo sapiens . 


PRO 1902 


123 


75 


8796 


D38112 


Homo sapiens 


ATPase subunit6 


438 


82 


8797 


AF1 18086 


Homo sapiens 


PR01992 


156 


81 


8798 


Y53037 


Homo sapiens 


Human secreted protein clone 
dx219 13 protein sequence SEQ ID 
NO:80. 


196 


100 


8799 


X92485 


Plasmodium 
vivax 


pval 


132 


63 


8800 


G02532 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


170 


65 


8801 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


123 


58 


8802 


AF1 16909 


Homo sapiens 


unknown 


116 


63 


8803 


X92485 


Plasmodium 
vivax 


pval 


127 


82 


8804 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


140 


67 


8805 


Z95114 


Homo sapiens 


bK212A2.2 (apolipoprotein L, 2) 


294 


100 


8806 


AF1 43369 


Mus musculus 


msgl -related protein 2 


348 


83 


8807 


G00500 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


110 


76 


8808 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


91 


8809 


AF230279 

* 


Caenorhabditis 
elegans 


SWI3-like protein; PSA-1 


126 


37 


8810 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


168 


82 


8811 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


122 


54 

■ 


8812 


D38112 


Homo sapiens 


ATPase subunit 6 


207 


100 


8813 


W99376 


Homo sapiens 


Human fibroblast growth factor 2 24 
kD isoform N-terrainus. 


93 


41 


8814 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


157 


57 


8815 


X54978 


Bos taurus 


17,000 dalton myosin light chain 


133 


96 


8816 


M58459 


Homo sapiens 


ribosomal protein S4Y isoform 


260 


100 


8817 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 1 


256 


90 


8818 


U23947 


Mycoplasma 
pulmonis 


VsaDI 

• 


132 


31 


8819 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


140 


69 


8820 


L00016 


Homo sapiens 


urf4 


205 


90 


8821 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


111 


43 


8822 


AL1 60493 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


94 


43 


8823 


X66285 


Mus musculus 


HC1 ORF 


94 


53 


8824 


AF061340 


Artibeus 
jamaicensis 


Fl ATPase subunit 6 


97 


90 


8825 


Ml 8093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


123 


36 


8826 


AF1 19855 


Homo sapiens 


PR01847 


113 


52 
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WO 01/64835 PCT/US01/04927 

• ■ » 



SEO ID 
NO: 


* Accession Nn 


0|JCUC9 


i 


Smith, 
dill i m- 

Waterman 
Score 


Idcntit 

y 


8827 


M28016 


Homo sapiens 


cytochrome b 


140 


93 


8828 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


142 


56 


8829 


AK024455 


Homo sapiens 


FLI00047 protein 


145 


59 


8830 


D38H2 


Homo sapiens 


ATPase subunit 6 


194 


95 


8831 


D38112 


Homo sapiens 


ATPasesubunit6 


184 


90 


8832 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


134 


60 


8833 


AF1 30079 


Homo sapiens 


PR02852 


115 


88 


8834 


AB003784 


Drosophila 
simulans 


HistoneH3 


135 


100 


8835 


Y91577 


Homo sapiens 

r 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


173 

• 


82 


8836 


X70343 

• 


Nicotiana 
sylvestris 


extensin 


112 


37 


8837 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 

^^^^ 


142 


38 


8838 


Y36366 

» 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


148 


74 


8839 


W34625, 


Homo sapiens 


Human C3 protein mutant FT-3. 


277 


100 


8840 


AF1 16638 


Homo sapiens 


PR01546 


78 


41 


8841 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


138 


61 


8842 


AF1 30087 


Homo sapiens 


PR02411 


131 


67 


8843 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


150 


72 


8844 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


155 


60 


8845 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


66 


8846 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


68 


8847 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


146 


70 


8848 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:I63. 


131 


68 


8849 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


273 


79 


8850 


K02120 


Bovine leukemia 
virus 


gag 


201 


54 


8851 


AF229067 


Homo sapiens 


PADI-H protein 


162 


57 


8852 


AF09093 1 


Homo sapiens 


PRO0483 


125 


82 


8853 


R14584 


Homo sapiens 


TGF beta 1 binding protein encoded by 
clone BPA 13. 


189 


100 


8854 


U52197 


Mus musculus 


pory(A) polymerase III 


183 


85 


8855 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


193 


72 


8856 


G00549 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4630. 


104 


66 


8857 

■ 


Y14482 


Homo sapiens 

• 


Fragment of human secreted protein 
encoded by gene 17. 


93 


64 

• 


8858 


Q55625jxil 


Homo sapiens 


22-JUN-1992 Human beta globin 5'- 
UTR-CDS-3'-UTR. 


412 


98 


8859 
• 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


152 


67 


8860 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6909. 


112 


72 


8861 


AF130079 


Homo sapiens 


PR02852 


160 


64 


8862 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: j 


101 


61 
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WO 01/64835 PCT/US01/04927 



CI? it TT\ 

NO: 


Accession no. 


opccics 


ucsenpnon 


oitlitD- 

Wnterman 

Score 


7b 

Idcntit 

y 








4488. 






8863 


X64707 


Homo sapiens 


BBC1 


216 


100 


8864 


G00328 

- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


127 


65 


8865 


G00092 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4173. 


210 


100 


8866 


AL137351 


Homo sapiens 


hypothetical protein 


327 


100 


8867 


AF090942 


Homo sapiens 


PRO0657 


105 


38 


8868 


AK023050 


Homo sapiens 


unnamed protein product 


137 


60 


8869 ' 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


101 


65 


8870 


AF216751 


Homo sapiens 


CDA14 


282 


96 


8871 


AL1 33262 


Homo sapiens 


d J3 1 9D22. 1 (CDC5-!ike protein) 


158 


90 


8872 


V32779_cdl 


Homo sapiens 


1 1 -MAR- 1997 Novel human F0 ATP 
synthase subunit encoding DNA 
sequence. 


267 

- 

• 


81 


8873 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


118 


58 


8874 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


89 


43 


8875 


G02532 

» * 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


154 


63 


8876 


AK001873 


Homo sapiens 


unnamed protein product 


373 


100 


8877 


X52138 


Homo sapiens 


L7a protein 


259 


82 


8878 


X52138 


Homo sapiens 


L7a protein 

• 


273 


87 


8879 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


147 


57 


8880 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


128 


57 


8881 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


159 


71 


8882 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


153 


67 


8883 


AF130089 


Homo sapiens 


PRO2550 


134 


79 


8884 


U36898 


Rattus 
norvegicus 


pheromone receptor VN6 


143 


53 


8885 


S79410 


Mus museums 


nuclear localization signal binding 
protein 


143 


64 


8886 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


138 


92 


8887 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


124 


72 


8888 


R48296 


Homo sapiens 


Human PGF-2/NT-3. 


118 


66 


8889 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


81 


77 


8890 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


120 


50 


8891 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


121 


77 


8892 


X53375 


Helianthus 
annuus 


anther-specific protein SF18 


109 

* 


64 


8893 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


119 


54 


8894 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


136 


68 


8895 


AF090894 


Homo sapiens 


PRO0113 


114 


60 


8896 


AF161356 


Homo sapiens 


HSPC093 


148 


51 


8897 


AF1 16638 


Homo sapiens 


PR01546 


91 


45 


8898 


U22231 


Felis catus 


ribosomal protein S3a 


201 


100 
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SEQID 
NO: 


'Accession No. 


Species 

» 


Description 


Smith- 
Waterman 
Score 


Id en tit 

y 


8899 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


114 


75 


.8900 


AF090894 


Homo sapiens 


PRO0U3 


114 


61 


8901 


G03714._- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


63 


8902 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


126 


95 


8903 


R90288 


Homo sapiens 


Modified pigment epithelium-derived 
factor (rPEDF). 


258 


88 


8904 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 1 
SEQIDN0.163. 


155 


86 

• 


8905 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


118 


66 


8906 


X70343 


Nicotiana 
sylvestris 


extensin 


122 


37 


8907 


G02386 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6467! 


162 


70 


8908 


AF130089 


Homo sapiens 


PRO2550 


125 


85 


8909 


AF220264 


Homo sapiens 


MOST-1 


113 


47 


8910 


Y25769 


Homo sapiens 


Human secreted protein encoded from 
gene 59. 


119 - 


77 


8911 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


77 


52 

• 


8912 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


162 


83 


8913 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


104 


77 


8914 : 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


138 


64 


8915 


AF09D944 


Homo sapiens 


PRO0663 


73 


58 


8916 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


.151 


76 


8917 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


84 


47 


8918 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


64 


8919 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


254 


82 


8920 


AJ245416 


Homo sapiens 


G7b protein 


360 


86 


8921 


D90228 


Homo sapiens 


mitochondrial acetoacetyl-CoA thiolase 
precursor 


190 


100 


8922 


AF229067 


Homo sapiens 


PADI-H protein 


157 


65 


8923 


G02832 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6913. 


132 


65 


8924 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


62 


68 


8925 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


116 


63 


8926 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


65 


8927 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


59 


8928 


AF287892 


Homo sapiens 


sialic acid binding immunoglobulin- 
like lectin 8 long splice variant 


994 


76 


8929 


AF030131 


Mus musculus 


Plenty of SH3s; POSH 


181 


77 


8930 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 

* 


814 


98 


8931 


U13866 


unidentified 
cloning vector 


non-functional lacZ alpha peptide 


112 


59 



369 



SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Vo 

Wen tit 

y 


8932 


U67540 


Methanococcus 
jannaschii 


conserved hypothetical protein 


98 


56 


8933 


Z74472 

i 

* 

* 


Caenorhabditis 
elegans 

> 

• 

i 

7 
* 

■ 

.* 

•# 

! 


predicted using Genefinder-contains 
similarity to Pfam domain: PF01391 
(Collagen triple helix repeat (20 
copies)), Score°84.3, E-value^S.le^, 
N=*2; PF0I484 (Nematode cuticle 
collagen N-terminal domain), 
Score=34.2, E-value=9.9e-07, 
• N=l~cDNA EST yklOOdl 0.3 comes 
from this gene; cDNA EST- 
EMBL:D65979 comes from this 
gene-cDNA EST ykl00d4.3 comes 
from mis gene; cDNA EST ykl23g7.3 
comes from this gene~cDNA EST 
yk58e6.3 comes from this gene; cDNA 
EST yk67a5.3 comes from this 
gene-cDNA EST EMBL:T01590 
comes from this gene; cDNA EST 
yk60cl 1.3 comes from this 
gene-cDNA EST yk89c!0.3 comes 
from mis gene; cDNA EST yk85g9.3 
comes from this gene-cDNA EST 
yk83e2.5 comes from mis gene; cDNA 
EST yk75Q.3 comes from this 
gene-cDNA EST CEESK90F comes 
from mis gene; cDNA EST yk79f5.5 
comes from this gene~cDNA EST 
yk64bl2.3 comes from this gene; 
cDNA EST yk64h9.3 comes from this 
gene-cDNA EST yk74dl.3 comes 
from this gene; cDNA EST yk75e4.3 
comes from this geno-cDNA EST 
yk83e2 3 comes from this gene; cDNA 
EST yk91cl2.3 comes from this 
gene-cDNA EST yk98C3 comes from 
mis gene; cDNA EST yk60h4.5 comes 
from this gene-cDNA EST yk60cl 1 .5 
comes from this gene; cDNA EST 
yk64h9.5 comes from this gene~cDNA 
EST yk64a9.5 comes from this gene; 
cDNA EST yk64bl2.5 comes from mis 
gene-cDNA EST yk67a5.5 comes from 
this gene; cDNA EST yk74dl.5 comes 
from this geno-cDNA EST yk75e4.5 
comes from this gene; cDNA EST 
yk75C5 comes from this gene-cDNA 
EST yk79eo\5 comes from this gene; 
cDNA ESTyk80d2.5 comes from this 
gene-cDNA EST yk85g9.5 comes 
from mis gene; cDNA EST yk89cl0.5 
comes from this gene-cDNA EST 
yk89c4i> comes from this gene; cDNA 
EST yk90el 1.5 comes from this 
gene-cDNA EST yk91cl2.5 comes 
from this gene; cDNA EST yk96e2.5 
comes from this gene-cDNA EST 
yk98c2.5 comes from this gene; cDNA 
EST yk98£2.5 comes from this 
gene-cDNA EST yk98hl 1.5 comes 


101 

t 

1 


38 

* 

■ 
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% 
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y 


p 


• 




from this gene; cDNA EST ykl07e7.5 
comes from this gene~cDNA EST . 
yk313g9.3 comes from this gene; 
cDNA EST yk3 13g9.5 comes from this 
gene-cDNA EST yk312gl.3 comes 
from this gene; cDNA EST yk312gl .5 . 
comes from this gene~cDNA EST 
yk293c4.3 comes from this gene; 
cDNA EST yk293c4.5 comes from this 
gene 


• 

• 

■ 

■ 


• 


8934 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


127 

■ 


63 


8935 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


125 


48 


8936 


X03741 


Homo sapiens 


myosin heavy chain (635 AA) 


1343 


99 


8937 


M29295 


Rattus 
norvegicus 


small nuclear ribonucJeoparticle- . 
associated protein 


106 


52 


8938 


D00097 


Homo sapiens 


serum amyloid P component 


188 


100 


8939 


W34499 


Homo sapiens 


Obesity receptor C protein. 


255 


83 


8940 


L00016 


Homo sapiens 


urf4 


293 


83 


8941 


Y86248 

- 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


164 


94 

■ 


8942 


AB007148 


Homo sapiens 


ribosomal protein S3a 


153 


73 


8943 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


213 


95 


8944 


M10119 


Homo sapiens 


ferritin light subunit 


129 


100 


8945 


M74718 


Homo sapiens 


SEF2-1A protein 




84 


8946 


AL355178 


Homo sapiens 


dJ947L8. 1 .6 (novel CUB and Sushi 
(SCR repeat) domain protein) 


366 


51 


8947 


Y56033 


Homo sapiens 


Polyproline-rich domain from dynamic. 


70 


51 


8948 


X06547 


Homo sapiens 


glutathione S-transferase (GST-Pi) (AA 
1-210) 


172 


97 


8949 


AF1 19851 


Homo sapiens 


PR01722 


154 


78 


8950 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


123 


88 


8951 


X54802 


Homo sapiens 


cytochrome-c oxidase subunit IV 


116 


95 


8952 


AJ011580 


bacteriophage 
PS34 


gp23 


338 


98 


8953 


Y59441 


Homo sapiens 


Human delta3 fragment #5. 


271 


96 


8954 


J02459 


bacteriophage 
lambda 


B (capsid component;533) 


807 


99 


8955 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 ' 


172 

- - 

• 


94 


8956 


W34499 


Homo sapiens 


Obesity receptor C protein. 


166 


89 


8957 


X92485 


Plasmodium 
vivax 


pval 


135 


71 


8958 


AB010340 


Mus musculus 


mszfBl 


122 


60 


8959 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


103 


51 


8960 


X14898 


Cricetulus sp. 


ORF (AA 1-286) (1 is 2nd base in 
codon) 


108 


54 


8961 


AF090931 


Homo sapiens 


PRO0483 


119 


82 


8962 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI8. 


130 


72 


8963 


AF170724 


Homo sapiens 


cell cycle checkpoint protein CHFR 


1364 


99 


8964 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


151 


43 


8965 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


250 


74 
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y 


8966 


Y87233 


Homo sapiens 


Human signal peptide containing 
protein HSPP- 1 0 SEQ ID NO: 1 0. 


445 


100 


8967 


AF127085 


Mus mus cuius 


semaphorin cytoplasmic domain- 
associated protein 3B - 


112 


80 


8968 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


109 


46 


8969 


AF090894 


Homo sapiens 


PRO0113 


153 


61 


8970 


Y19730 


Homo sapiens 


SEQ ID NO 448 from W09922243. 


158 


49 


8971 


U38544 


Mus musculus 


alpha- 1 type 1 collagen 


102 


52 


8972 


X92485 

■ 


Plasmodium 
vivax 


pval 

• - 


113 


50 


8973 


AF241228 


Xenopus laevis 


Friend of GATA 


143 


100 


8974 


AF090930 


Homo sapiens 


PRO0478 


79 


41 


8975 


AF238374 


Homo sapiens 


mutant fibroblast growth factor 
receptor 3; FGFR3 


115 


50 


8976 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 

4 


99 


43 


8977 


AB0O1838 


Homo sapiens 


recovenn 


179 


80 


8978 


AF1 18082 


Homo sapiens 


PRO1902 


73 


70 


8979 


AFi 16715 


Homo sapiens 


PR02829 


102 


76 


8980 


AF090942 


Homo sapiens 


PRO0657 


132 


61 


8981 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


158 


65 


8982 


U28971 


Caenorhabdhis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


108 


84 


8983 


A00127 


Homo sapiens 


melanoma associated antigen P97 


366 


94 


8984 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


158 


88 


8985 


Y59778 


Homo sapiens 


Human norma] ovarian tissue derived 
protein 55. 


71 


61 


8986 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


42 


8987 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


98 


50 


8988 


W50193 

■ 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


80 


51 


8989 


Y24025 


Homo sapiens 


Amino acid sequence of the human 
MMSC1 protein. 


160 


100 


8990 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


130 


67 

■ 


8991 


D38112 


Homo sapiens 


N ADH dehydrogenase subunit 4L 


165 


82 


8992 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein mala. 


111 


62 


8993 


M28016 


Homo sapiens 


cytochrome b 


130 


100 


8994 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


133 


90 


8995 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


298 


89 


8996 


Y16430 


Mus musculus 


ribosomal protein L35a 


101 


65 


8997 


Y45318 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 18. 


96 


76 


8998 


AL049659 


Arabidopsis 
thaliana 


putative protein 


126 


42 


8999 


AE003500 


Drosophila 
meJanogaster 


CG 12379 gene product 


285 


67 


9000 


R37991 


Homo sapiens 


Sequence of a new cytokine which 
inhibites induction by gammainterferon 
of expression of Class 11 
histocompatibility antigens. 


124 


48 
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% 
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9001 


AF090930 


Homo sapiens 


PRO0478 


193 


89 


9002 


AP000381 


Arabidopsis 
thaliana 


gb|AAF57656.1^ene_icLK17E12.13~s 
imilar to unknown protein 


193 


29 


9003 _ 


. M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (P0) 


364 


66 


9004 


D63424 


Homo sapiens 


glycogen synthase kinase 3a]pha 


126 


81 


9005 


AP000616 


Oryzasativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


113 


75 


9006 


AL1 17637 


Homo sapiens 


hypothetical protein 


257 


92 


9007 


AE001381 


Plasmodium 
falciparum 


hypothetical protein 


141 


26 


9008 


W34499 


Homo sapiens 


Obesity receptor C protein. 


152 


86 


9009 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


199 


80 


9010 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


202 


94 


9011 


AF090931 


Homo sapiens 


PRO0483 


135 


68 


9012 


M11717 


Homo sapiens 


heat shock protein 


696 


100 


9013 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


123 


75 


9014 


R59842 


Homo sapiens 


ApoE4Ll protease. 


86 


85 


9015 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


96 


65 


9016 


AF229067 


Homo sapiens 


PADI-H protein 


119 


54 


9017 


AF090944 


Homo sapiens 


PRO0663 


138 


54 


9018 


AL1 60371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


103 


50 


9019 


X92485 


Plasmodium 
vivax 


pval 


101 


73 


9020 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


82 


72 


9021 


AF073519 


Homo sapiens' 


small EDRK-rich factor 1, long isoform 


116 


66 


9022 


AF229067 


Homo sapiens 


PADI-H protein 


127 


45 


9023 


M33112 


Homo sapiens 


amyloid-beta protein 


105 


67 


9024 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


79 


68 


9025 


AF161361 


Homo sapiens 


HSPC098 


130 


55 


9026 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


169 


72 


9027 


AF130051 


Homo sapiens 


PRO0898 


138 


77 


9028 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


132 


49 


9029 


G00354 

i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


117 


58 


9030 


AC002310 


Homo sapiens 


Unknown gene product 


155 


85 


9031 


X92485 


Plasmodium 
vivax 


pval 


112 


42 


9032 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


130 


52 


9033 


G00487 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4568. 


123 


69 


9034 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


138 


61 


9035 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


111 


66 


9036 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


173 


59 


9037 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


100 


54 


9038 


AF220264 


Homo sapiens 


MOST-1 


96 


80 
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Waterman 
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9039 


AF130051 


Homo sapiens 


PRO0898 


119 


68 


9040 


AF116715 


Homo sapiens 


PR02829 


121 


63 


9041 


L76200 


Homo sapiens 


guanylate kinase 


193 


83 


9042 

'i 


G03415 


Homo sapiens 


^« 4 - 0 _mmm% _mm* mm mmmm m_^ m* _mmm 

Human secreted protein, SEQ ID NO: 

-mm, m a _m* 

7496. 


125 


55 — 


9043 


AF152513 


Homo sapiens 


protocadherin gamma A6 short form 
protein 


334 


100 


9044 


AF130079 


Homo sapiens 


PR02852 


125 . 


33 


9045 


W48351 


Homo sapiens 

- 


Human breast cancer related protein 
BCRB2. 


128 


64 


9046 


AF090942 


Homo sapiens 


PRO0657 


mm mWm. m9* 

126 


65 


9047 


AF284223 


Homo sapiens 


terra-like protein 


228 


100 


9048 


AP000616 

■ 


Oryza saliva 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


128 

• 


68 


9049 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


132 


92 


9050 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


123 


75 

■ * 


9051 


AL034344 


Homo sapiens 


dJ118B18.1 (forkhead box CI) 


323 


100 


9052 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


106 


62 


9053 


AF225910 


Mus musculus 


DAZ-associated protein 1 


100 


45 


9054 


AF130087 


Homo sapiens 


PR024U 


147 


70 


9055 


W80400 


Homo sapiens 


A secreted protein encoded by clone 
dd71_2. 


858 


97 


9056 


X70343 


Nicotiana 
sylvestris > 


extensin 


113 


33 


9057 


AP000616 

■ 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


128 


57 


9058 


R95913 


Homo sapiens 


Neural thread protein. 


142 


38 


9059 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


131 


56 


9060 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


131 

■ 


50 


9061 


X70343 

* 


Nicotiana 
sylvestris 


extensin 


103 


42 


9062 


AJ271872 

■ 


Nicotiana 
sylvestris 


extensin 


129 


33 


9063 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


151 


73 


9064 


AF266164 


Rattus 
norvegicus 


densin-1 80 variant D 


144 


96 


9065 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


135 

* 


68 


9066 


M28016 


Homo sapiens 


cytochrome b 


178 


87 


9067 


M28016 . 


Homo sapiens 


cytochrome b 


193 


95 


9068 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


148 


60 


9069 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


111 


100 


9070 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


141 


72 


9071 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


147 


64 


9072 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


109 


69 


9073 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


171 


85 


9074 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


88 


88 


9075 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


137 


62 
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encoded by gene 17. 






9076 


L38941 


Homo sapiens 


ribosomal protein L34 


139 


88 


9077 


GO 1 984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065.~ 


164 


86 


9078 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 

• 


72 


35 

■ 


9079 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


176 


82 


9080 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


229 


59 


9081 


AF1 30089 


Homo sapiens 


PRO2550 


118 


71 


9082 


X92485 


Plasmodium 
vivax. 


pval 


125 


71 


9083 


AF21665Q 


Homo sapiens 


MTAP 


154 


73 


9084 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


124 


78 


9085 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


191 


73 


9086 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


138 


61 


9087 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


130 


75 


9088 


G03793 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


144 


61 


9089 


AK024455 


Homo sapiens 


FLJ00047 protein 


145 


82 


9090 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


181 


60 


9091 


AF090895 


Homo sapiens 


PRO0H7 


87 


66 


9092 


Y86248 

- 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


211 

■ 


86 

■ 


9093 


AF1 30079 


Homo sapiens 


PR02852 


157 


74 


9094 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0:163. 


180 


82 


9095 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


97 


42 


9096 


X65551 , 


Homo sapiens 


antigen of the monoclonal antibody Ki- 
67 


106 


65 


9097 


W34499 


Homo sapiens 


Obesity receptor C protein. 


231 


80 


9098 

• 


Y08769 


Rattus 
norvegicus 


•microvascular endothelial 
differentiation gene 2 


127 


63 


9099 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 12 1 . 


129 


78 


9100 


AF229067 


Homo sapiens 


PADI-H protein 


162 


59 


9101 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4754. ' 


157 


65 


9102 


Y60010 


Homo sapiens 


Human endometrium tumour EST 
encoded protein 70. 


69 


63 


9103 


R95913 


Homo sapiens 


Neural thread protein. 


124 


72 


9104 


S58722 


Homo sapiens 


X-linked retinopathy protein (C- 
tenninal, clone XEH.8c} 


104 


68 


9105 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


156 


88 


9106 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


170 


57 


9107 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


163 


89 


9108 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


123 


41 


9109 


L41944 


Homo sapiens 


interferon receptor 162 


66 
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9110 


AK000496 


Homo sapiens 


unnamed protein product 


138 


78 


9111 


G03797 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7878. 


81 


62 


9112 


G02538 — 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


142 


52 


9113 


M26361 


Mus musculus 


LINE/Ig H-chain fusion protein 


105 


50 


9114 


AF073519 


Homo sapiens 


small EDRK-rich factor 1, long isoform 


119 


56 


9115 


AF 119900 


Homo sapiens 


PR02822 


140 


58 


9116 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


158 


60 


9117 


L00016 


Homo sapiens 


urf5 


171 


97 


9118 


AC006276 


Homo sapiens 


R28379_3 


138 


100 


9119 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


72 


9120 


AF 119855 


Homo sapiens 


PR01847 


155 


81 


9121 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


105 


84 


9122 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:142. 


115 


78 


9123 


G00397 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


134 


53 


9124 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


103 


40 


9125 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


96 


63 


9126 


AF130089 


Homo sapiens 


PRO2550 


125 


78 


9127 


W48351 

■ 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


98 

i' 


56 


9128 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


107 


75 


9129 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


131 


49 


9130 


AK024455 


Homo sapiens 


FLI00047 protein 


130 


78 


9131 


W88435 


Homo sapiens 


Disease associated protein kinase 
DAPK-4. 


336 


100 


9132 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


144 


59 


9133 


Y94922 


Homo sapiens 


Human secreted protein clone pv6_l 
protein sequence SEQ ID NO:50. 


154 


90 


9134 


AF3 17425 


Homo sapiens 


GAC-1 


182 


70 


9135 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


174 


54 


9136 


AF1 16715 


Homo sapiens 


PR02829 


116 


71 


9137 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


122 

* • 


65 


9138 


AF130089 


Homo sapiens 


PRO2550 


140 


62 


9139 


AF161221 


Homo sapiens 


kallikrein-like protein 6 


204 


90 


9140 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


110 


48 


9141 


U27486 


Pseudorabies 
virus 


EPO 


84 


44 


9142 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


150 


96 


9143 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


220 


80 


9144 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, - 
SEQ ID NO: 163. 


163 


84 


9145 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


180 


77 


9146 


U52077 


Homo sapiens 


mariner transposase 


365 


78 


9147 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


143 


67 
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SEQID 
NO: 


I 'Accession No. 


Species 


Description 

* 


Smith- 
Wa term an 
Score 


Ideotit 

y 




i 




clone HPMBQ32. 






9148 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


100 


81 


9149 

'i 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


118 


66 


9150 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


105 


54 


9151 


AL3 59782 


Trypanosoma 
brucei 


possible (hhv-6) u 1 102, variant a dna, 
complete virion genome. 


101 


70 


9152 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: ' 
6613. 


163 


66 


9153 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF68, 
SEQ1DN0:163. 


160 


72 


9154 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


106 


63 


9155 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


95 


80 


9156 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


133 


67 


9157 


G03924 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8005. 


265 


80 

• 


9158 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


131 


72 


9159 


AFl 30089 


Homo sapiens 


PRO2550 


180 


61 


9160 


AFl 16715 


Homo sapiens 


PR02829 


184 


82 


9161 


K01664 

- 


Drosophiia 
melanogaster 


Bkm-like protein 


200 


69 


9162 


D88146 


Homo sapiens 


UDP-galactose transporter 2 


101 


100 


9163 


AF107406 


Homo sapiens 


GW128 


155 


60 


9164 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


97 


51 


9165 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


146 


60 


9166 


AF210651 


Homo sapiens 


NAG 18 


83 


65 


9167 


V00672 


Pan troglodytes 


reading frame protein 4 


119 


95 


9168 


Y01155 


Homo sapiens 


Secreted protein encoded by gene 4 
clone HIBCW32. 


225 

- 


74 


9169 


AF161356 


Homo sapiens 


HSPC093 


100 


75 


9170 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


174 


66 


9171 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


115 


86 


9172 


AFl 19855 


Homo sapiens 


PRO! 847 


178 


59 


9173 


AF130087 


Homo sapiens 


PR02411 


129 


48 


9174 


AF130079 


Homo sapiens 


PR02852 


149 


59 


9175 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
-7214. 


101 


90 

■ 


9176 


AF090894 


Homo sapiens 


PRO0113 


118 


65 


9177 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


105 


46 

• 


9178 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


100 


52 


9179 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161. 


70 


9180 


AFl 16715 


Homo sapiens 


PR02829 


127 


64 


9181 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


144 


71 


9182 


AF229067 


Homo sapiens 


PADI-H protein 


164 


64 


9183 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 1 


80 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman - 
Score 


% 

I den tit 

y 








6613. 






9184 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein . 
RHAla (AF078683) 


106 


64 


9185 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


132— 


57 


9186 


G03786 


Homo sapiens 

i 


Human secreted protein, SEQ ID NO: 
7867. 


113 


*5 


9187 


AF090931 


Homo sapiens 


PRO0483 


117 


68 


9188 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


136 


67 


9189 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


130 


47 


9190 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


269 


85 


9191 


Ml 8094 


Phaseolus 
vulgaris ' 


hydroxyproline-rich glycoprotein 


150 


40 


9192 . 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4633. 


153 


73 


9193 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


62 


9194 


M18093 

* 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


103 


31 " 


9195 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


98 


47 


9196 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


107 


48 


9197 

• 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


111 


68 


9198 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


164 


83 


9199 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


235 


85 


9200 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


117 


65 


9201 


X67640 


Homo sapiens 


HS24/P52 


222 


75 


9202 


X67640 


Homo sapiens 


HS24/P52 


231 


81 


9203 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: '. 
7879. 


128 


66 


9204 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


107 


74 


9206 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


186 


66 


9207 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


56 


9208 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


160 


67 


9209 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


131 


68 


9210 


AF116715 


Homo sapiens 


PR02829 


123 


81 


9211 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


70 


9212 


AF 118086 


Homo sapiens 


PR01992 


84 


57 


9213 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


115 


62 


9214 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


247 


92 


9215 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


74 


76 


9216 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


134 


74 
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NO: 


« Accession No. 


Species 

« » 

* 


Description 


Smith- 

Waterman 

Score 


Identit 

y 


9217 


GQ2832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


137 


53 


9218 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81 


130 


44 


9219 


AP000616 


Oryza sativa 


similar to RING-H2 finger protem 
RHAla(AF078683) 


110 


68 


9220 


L41944 


Homo sapiens 


interferon receptor 


141 


50 


9221 


AF 130079 


Homo sapiens 


PR02852 


170 


70 


9222 


G0O381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


100 


85 


9223 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


142 


56 


9224 

• 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


104 

• 


63 


9225 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


92 


84 


9226 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


157 


65 


9227 


AK027208 . 


Homo sapiens 


unnamed protein product . 


138 


68 


9228 


B25722 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 9 SEQ ID NO: 111. 


125 


87 


9229 


G00381 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


127 


88 


9230 


AK025047 


Homo sapiens 


unnamed protein product 


179 


64 


9231 


AF090894 


Homo sapiens 


PRO0113 


195 


57 


9232 


AB046048 . 


Macaca 
fascicularis 


unnamed portein product 


176 


54 


9233 


AB008227 


Adiantum 
capillus-veneris 


Extensin 


93 


34 


9234 


AF 130089 


Homo sapiens 


FRO2550 


98 


73 


9235 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


135 . 


52 


9236 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


162 


62 


9237 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


59 


9238 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


126 


52 


9239 


AE003499 


Drosopnila 
melanogaster 


CG 12706 gene product 


166 


37 


9240 


G00361 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4442. 


161 


64 


9241 , 


X92485 


Plasmodium 
vivax 


pval 


112 


61 


9242 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


69 


42 


9243 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


140 


80 


9244 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


92 


9245 


M22332 


Homo sapiens 


unknown protein 


117 


40 


9246 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


109 


66 


9247 


AL136173 


Homo sapiens 


dJ914B9.1 (novel protein (HSPC 162) 
similar to Rattus norvegicus 
bithoraxoid-like protein) 


305 


76 


9248 


AF1 16661 


Homo sapiens 


PR01438 


141 


53 


9249 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


144 


76 



379 



NO: 


Accession riO* 


jyCCics 


iscscnpuon 


Smith- 

Waterman 
Score 


/v 

Identit 

y 


9250 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


245 


89 


9251 


J02459 

« 


bacteriophage 
lambda 


E (capsid component;341) 


245 


92 


9252 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


128 


63 


9253 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


124 


72 


9254 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


110 


90 


9255 


Y27868 


Homo sapiens 

• 


Human secreted protein encoded by 
gene No. 107. 


132 


65 


9256 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


106 


62 


9257 


AF217374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


259 


98 


9258 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


64 


9259 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


107 

■ 


54 


9260 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


129 


66 


9261 


M28016 


Homo sapiens 


cytochrome b 


148 


93 


9262 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


112 


68 


9263 


R55749 


Homo sapiens 


Extracellular domain of human high 
affinity IFN-gamma receptor. 


116 


83 


9264 


K02403 


Homo sapiens 


complement component C4A 


9025 


99 


9265 


X92485 


Plasmodium 
vivax 


pval 


161 


55 


9266 


AK024455 


Homo sapiens 


FLJ00047 protein 


98 


83 


9267 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


103 


51 


9268 


AF130089 


Homo sapiens 


PRO2550 


124 


77 


9269 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


94 


54 


9270 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


96 


81 


9271 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


132 


53 


9272 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


155 


72 


9273 


AF229067 


Homo sapiens 


PADI-H protein 


127 


54 


9274 


Y36203 


Homo sapiens 


Human secreted protein #75. 


154 


77 


9275 


X92485 


Plasmodium 
vivax 


pval 


157 


72 


9276 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


132 


60 


9277 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


156 


84 


927S 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


138 


84 


9279 


Y20772 


Homo sapiens 


Human neurofilament-M mutant 
protein fragment 54. 


112 


70 


9280 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


146 


59 


9281 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


103 


63 


9282 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


89 


60 


9283 


X92485 


Plasmodium. 


pval 


124 


79 
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NO: 


Accession no. 


opccics 

* 


JJC5Crlpuun 


Smith* 

Waterman 
Score 


ST 

7# 

Identit 

y 




i 


vivax 


i 






9284 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


95 


75 


9285 


-AF1 30087 


Homo sapiens 


PR0241 1 


138 


78 


9286 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


140 


54 


9287 


AF004340 


Homo sapiens 


ATPase 6/8 


98 


100 


9288 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


68 


9289 


J02459 

• 


bacteriophage 
lambda 


1 (tail component£23) 

* - 


457 


97 

• 


9290 


AC003058 . 


Arabidopsis 
thaliana 


unknown protein 


177 


84 


9291 


U23455 

• 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


107 


87 


9292 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


311 


91 


9293 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


139 


61 


9294 


R28916 


Homo sapiens 


Type III procollagen (prior art). 


117 


33 


9295 


W48351 , 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


126 


77 


9296 


AF130089 


Homo sapiens 


PRO2550 


137 


52 


9297 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


93 


79 


9298 


U22376 


Homo sapiens 


alternatively spliced product using exon 
13A 


239 


77 


9299 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


134 


69 

■ 


9300 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


105 


24 


9301 


D90282 


Homo sapiens 


carbamyl phosphate synthetase 1 (EC 
6.3.4.16) 


276 


96 


9302 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


231 


93 


9303 


M15077 


Photinus pyralis 


Luciferase 


557 


100 


9304 


L00016 


Homo sapiens 


urf4 


303 


93 


9305 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


141 


69 


9306 


L46721 


Homo sapiens 


mucin 


204 


80 


9307 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


110 


84 


9308 


J02459 


bacteriophage 
lambda 


E (capsid component£41) 


218 


93 


9309 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


78 


66 


9310 


AF130075 


Homo sapiens 


PR02532 


92 


69 


9311 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


97 


51 


9312 


J02459 


bacteriophage 
lambda 


E (capsid component£4I) 


201 


79 


9313 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


133 


95 


9314 


AK024455 


Homo sapiens 


FLJ00047 protein 


128 


68 


9315 


D38112 


Homo sapiens 


ATPase subunit 6 


223 


81 


9316 


AL1 60493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


113 


85 


9317 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


117 


88 


9318 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


257 


72 
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ofcQ ID 
NO: 


Accession JNo. 


Species 


Description 


Sin lib* 

Waterman 

Score 


1 a/ I 

Identit 

y 


9319 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


101 


61 


9320 


AK001116 


Homo sapiens 


unnamed protein product 


96 


40 


9321 


G02535 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6616. 


174 


60 - 


9322 


AF037081 


Pan troglodytes 


ribonuclease k6 precursor 


109 


100 


9323 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


149 


63 


9324 


AF1 53056 


Fasciola hepatica 


tegumental antigen 


60 


44 


9325 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


200 


69 


9326 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


80 


57 


9327 


AF1 18086 


Homo sapiens 


PR01992 


139 


60 


9328 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6441. 


110 


73 

* 


9329 


AF1 07406 


Homo sapiens 


GW128 


142 


56 


9330 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


75 


9331 


X92485 


Plasmodium 
vivax 


pval 


121 


56 


9332 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


179 


52 


9333 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


67 


9334 


Y86248 


Homo sapiens 

* 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


108 


46 


9335 


M22332 


Homo sapiens 


unknown protein 


158 


60 


9336 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


95 


46 


9337 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


160 


76 


9338 


Y01400 


Homo sapiens 

* 


Secreted protein encoded by gene 18 
clone HNHF029. 


150 


68 


9339 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


169 


59 


9340 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


154 


64 


9341 


AK024455 


Homo sapiens 


FLJ00047 protein 


117 


79 


9342 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


191 


76 


9343 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


133 


82 


9344 


AF032457 


Homo sapiens 


BimEL 


123 


100 


9345 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


117 


46 


9346 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


153 


75 


9347 


X53020 


Plasmodium 
falciparum 


erythrocyte membrane-associated 
antigen 


68 


75 


9348 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


120 


70 


9349 


AF130089 


Homo sapiens 


PRO2550 


116 


69 


9350 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


165 


72 


9351 

* 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


94 


82 


9352 


AL080242 


Homo sapiens 


bA554C12.1 (RBX1 or ROC 1 (ring- 
box or ring ringer protein 1)) 


184 


60 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


£} »A| 

Smith- 
Waterman 
Score 


% 

Idea tit 

y 


9353 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


125 


64 


9354 


X71442 


Rattus 
norvegicus 


ORF 1; putative 


96 


40 


9355 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


98 


46 


9356 


AF090931 


Homo sapiens 


PRO0483 


148 


76 


9357 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


144 


56 


9358 


AC005175 


Homo sapiens 

. * 


TA2R HUMAN, BETA ISOFORM; ' 
TXA2-R; PROSTANOID TP 
RECEPTOR 


153 


63 


9359 


AF1 16715 


Homo sapiens 


PR02829 


125 


66 


9360 


Y02671 

• 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ W 1 8. 


143 


72 


9361 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


178 


75 


9362 


Y14437 

» 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27; 


90 


57 


9363 


G03356 , 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


86 


76 


9364 


AF229067 


Homo sapiens 


PADI-H protein 


116 


51 


9365 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


248 


64 


9366 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. __j 


104 


76 


9367 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 


76 


9368 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


105 


74 


9369 


R86406 


Homo sapiens 


Human matrix metal loprotease 
MMPmla. 


108 


83 


9370 


AF181977 


Hepatitis GB 
virus C 


E2 protein 


95 


38 


9371 


AF090931 


Homo sapiens 


PRO0483 


151 


62 


9372 


AF090894 


Homo sapiens 


PRO0113 


133 


57 


9373 


AK002154 


Homo sapiens 


unnamed protein product 


226 


55 


9374 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


140 


71 


9375 


U77494 


Homo sapiens 


RANBP8 


166 


100 


9376 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


155 


52 


9377 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


126 


6 * 


9378 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


151 


78 


9379 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


97 


68 


9380 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


117 


62 


9381 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


104 


47 


9382 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


57 


9384 


AB047936 


Macaca 
fascicularis 


hypothetical protein 


95 


41 


9385 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


57 


64 


9386 


U04810 


Homo sapiens 


tastin 


151 


100 


9387 


U79260 


Homo sapiens unknown 


97 


79 
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SEQ ID 
NO: 


'Accession No. 


O ^ mt AM 

bpecies 


Description 


omitn- 

Waterman 

Score 


% 

Ideotit 

y 


9388 


U18339 


Variola virus 


D4L 


103 


65 


, 9389 


GO380O 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


134 


75 


9390 


AL390935 


-Leishmania 
major 


possible dopap 


99 


88 


9391 


AB015633 


Homo sapiens 


type 11 membrane protein 


108 


100 


9392 


AF1 13685 


Homo sapiens 


PRO0974 


98 


62 


9393 


AF064819 


Homo sapiens 


serine protease DESCl 


227 


69 


9394 

• 


G03786 


Homo sapiens 

. 


Human secreted protein, SEQ ID NO: 
7867. 


219 


68 


9395 


G01400 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
5481. 


104 


100 


9396 


AK024455 


Homo sapiens 


FLJ00047 protein 


153 


68 


9397 


AF033260 


porcine 

endogenous type 
C retrovirus 


reverse transcriptase 
* 

- 


155 


53 


9398 


X12517 


Homo sapiens 


C protein (AA 1-159) 


147 


55 


9399 


AB029948 


Homo sapiens 


mitochondrial seryl-tRNA synthetase 


1203 


100 


9400 


Y87075 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:114. 


234 


93 


9401 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


150 

• 


58 


9402 


Ml 5530 


Homo sapiens 


B-cell growth factor 


113 


74 


9403 


X56932 


Homo sapiens 

* 


23 kD highly basic protein 


114 


85 


9404 


AC018748 


Arabidopsis 
thaliana 


Contains similarity to PI 1 protein from 
Drosophila melanogaster gb|X59691 . 


75 


65 


9405 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


114 


88 


9406 


AF220264 


Homo sapiens 


MOST-1 


124 


70 


9407 


AFU9851 


Homo sapiens 

* 


PR01722 


157 


65 


9408 


X92485 

< • 


Plasmodium 
vivax 


pval 


163 


78 


9409 


X92485 


Plasmodium 
vivax 


pval 


165 


77 


9410 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


97 


55 


9411 

> 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


104 


64 


9412 


AF130089 


Homo sapiens 


PRO2550 


147 


55 


9413 


AF 130089 


Homo sapiens 


PRO2550 


175 


65 


9414 


L 14848 


Homo sapiens 


MHC class I-related protein 


225 


97 


9415 


R95913 


Homo sapiens 


Neural thread protein. 


107 


74 


9416 


AB001684 


Chlorella 
vulgaris 


ORF49b 


68 


53 


9417 


G02532 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6613. 


111 


50 


9418 


W03642 


Homo sapiens 


Human cannabinoid GPR N-tenninal 
sequence. 


107 


73 


9419 


AL050285 


Homo sapiens 


hypothetical protein 


194 


100 


9420 


AF090944 


Homo sapiens 


PRO0663 


132 


59 


9421 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


153 


61 


9422 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


140 


70 


9423 


M20030 


Homo sapiens 


small proline rich protein 


62 


41 


9424 


AF116718 


Homo sapiens 


PRO2900 


396 


100 


9425 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


102 


35 


9426 


X61047 


Hydra sp. 


mini-collagen | 


92 


45 
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SF.OID 
NO: 


Accession Nn 




Fl^«^fi nti nn 

vvSvl IJJUUIl 


Smith- 

Waterman 

Score 


/o 

Identit 

y 


9427 


G00637 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4718. 


105 


71 


9428 


L10908 


Mus musculus 


Gcapl gene product 


107 


44 


9429 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


79 


87 


9430 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


123 


61 


9431 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


111 


80 


9432 


X12580 


Medicago sativa 


put. nodulin (soybean N-75 homolog.) 


87 


33 


9433 


AL160371 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


106 


48 


9434 


AC003028 


Arabidopsis 
thaliana 


30S ribosomal protein S31 


79 


42 


9435 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


106 


75 


9436 


G02827 


Homo sapiens 

i 


Human secreted protein, SEQ ID NO: 
6908. 


207 


75 


9437 


Y02671 


Homo sapiens 

» 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


163 


85 


9438 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


84 


57 


9439 


AF1 18086 


Homo sapiens 


PRO 1992 


145 


75 


9440 


AF229067 


Homo sapiens 


PADI-H protein 


140 


64 


9441 


M36913 


Zea mays 


cell wall protein (put); putative 


107 


38 


9442 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


161 


59 


9443 


AF229067 


Homo sapiens 


PADI-H protein 


147 


85 


9444 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


171 


79 


9445 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


155 


57 


9446 


X92485 


Plasmodium 
vivax 


pval 


97 


55 


9447 


AF1 19851 


' Homo sapiens 


PR01722 


123 


55 


9448 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


88 


9449 


AL391688 


Homo sapiens 


bA524D 16A.2. 1 (novel protein similar 
to mouse granuphilin-a) 


258 


97 


9450 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


91 


9451 


X92485 


Plasmodium 
Yivax 


pval 


123 


88 


9452 


X83000 


Myocastor 
coypus 


lipase related protein 2 

- 


168 


43 


9453 


M35603 


Mus musculus 


Hoxr3.1 protein 


447 


78 


9454 


M35520 


Canis familiaris 


GTP-binding protein (rab5) 


271 


98 


9455 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


362 


86 


9456 


A13595 


synthetic 
construct 


immunosuppresive protein PP15 


111 


100 


9457 


X67788 


Rattus 
norvegicus 


ezrin, p81 


185 


97 


9458 


U28068 


Mus musculus 


neurogenic differentiation factor 


132 


89 


9459 


AF1 55 140 


Homo sapiens 


gonadotropin-regulated testicular RNA 
helicase; GRTH 


556 


88 


9460 


AF 155 140 


Homo sapiens 


gonadotropin-regulated testicular RNA 
helicase; GRTH 


379 


94 


9461 


AF169689 


Homo sapiens 


protocadherin alpha 10 alternate 
isoform 


208 


95 


9462 


U73193 


Homo sapiens 


inward rectifier potassium channel 


236 


96 



385 



f 

WO 01/64835 PCT/US01/04927 

■ ■ * 



♦ 



SEQ ID 
NO: 


1 Accession No. 

• 


Species 


Description 


Smith- 
Waterman 
Score 


Identft 

y 




i 


■ 


Kir 1.2 






9463 


U83278 


80s taurus 


neural specific protein CRMP-2 


157 


61 


9464 


W48352 


Homo sapiens 

• 


Human breast cancer related protein 
BCFLT1. 


74 

•■ — 


57 


9465 


AL031673 

• 


Homo sapiens 


dJ694B14.3 (PUTATIVE novel 
haloacid dehalogenase-like hydrolase 
family protein similar to (archaea) 
bacterial proteins) 


444 

• 


too 


9466 


X67247 


Homo sapiens 


ribosomal protein S8 


519 


90 


9467 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


115 


51 


9468 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


95 


84 


9469 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


145 


71 


9470 


API 30056 


Homo sapiens 


PR01367 


98 


76 


9471 


X71354 


Homo sapiens 


vesicular monoamine transporter 


102 


95 


9472 


U66372 


Bos taurus 


ribosomal protein S29 


120 


95 


9473 


D32002 


Homo sapiens 


nuclear cap binding protein 


583 


96 


9474 


W54282 


Homo sapiens 

• 


Protein sequence of the di-alpha 
haemoglobin gene contained in pSSl . 


469 


91 


9475 


V49572_cdl 


Homo sapiens 


13-NOV-1996 Human stomach cancer 
clone HP10122cDNA#l. 


562 


100 


9476 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 4L 


247 


88 


9477 


AB043820 


Homo sapiens 


platelet glycoprotein VI-2 


95 


47 


9478 


AB020236 


Homo sapiens 


ribosomal protein L27A 


437 


86 


9479 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


149 


52 


9480 


AF205633 


Homo sapiens 


ninjurin2 


364 


100 


9481 


D14886 


Homo sapiens 


TFIIA-37 


196 


100 


9482 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


166 


76 


9483 


AF 178842 


Homo sapiens 


Rhesus blood group-associated 
glycoprotein 


128 


75 


9484 


AF 134895 


Homo sapiens 


glyoxylate reductase 


838 


95 


9485 


AC004832 


Homo sapiens 


similar to 45 kDa secretory protein ; 
similar to CAA10644.1 
(PID:g4164418) 


562 


98 


9486 


Z35761 


Homo sapiens 


TEL/ABL 


230 


71 


9487 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO250. 


149 


72 


9488 


AL032631 


Caenorhabditis 
elegans 


predicted using Genefinder 


159 


31 


9489 


AJ271079 


Oenothera elata 
subsp. hookeri 


Ycf2 protein 


113 


36 


9490 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


135 


40 


9491 


Y53005 


Homo sapiens 


Human secreted protein clone pm749_8 
protein sequence SEQ ID NO: 16. 


124 


61 


9492 


Y21091 


Homo sapiens 


Human p53 cellular tumour antigen 
mutant protein fragment 28. 


85 


4* 


9493 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


153 


76 


9494 


AF119851 


Homo sapiens 


PR01722 


94 


54 


9495 


R04932 


Homo sapiens 


Interferon-gamma receptor segment 
from clone 39 responsiblefor binding 
the target 


142 


100 


9496 


U40952 


Caenorhabditis 
elegans 


C03B1.10 gene product 


103 


75 


9497 


M63838 


Homo sapiens 


interferon-gamma induced protein 


128 


92 
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SEQW 
NO: 


< Accession No. 


Species 

> 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 


9498 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RJBONU CLEOPROTEIN C 


147 


44 


9499 


AF1 08831 


Homo sapiens 


K:C1 cotransporter 3 


244 


77 


9500 

- » 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct 


154 


68 


9501 


G03411 

• 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7492. 


139 

* 


63 


9502 


AE003499 


Drosophila 
melanogaster 


CG 1 2706 gene product 


180. 


36 


9503 


X02344 


Homo sapiens 


beta-tubulin 


363 


94 


9504 


D78014 


Homo sapiens 


dihydropynmidinase related protein-3 


189 


84 


9505 


U59446 


Brassica napus 


myrosinase-binding protein related 
protein 


116 


40 


9506 


AB010437 


Rattus rattus 


cadherin-8 


239 


50 


9507 


AF229067 


Homo sapiens 


PADI-H protein 


178 


74 


9508 


AJ224819 


Homo sapiens 


tumor suppressor 


291 


85 


9509 


AF017275 


Mus musculus 


growth factor independence-IB 


242 


40 


9510 


AF177203 


Homo sapiens 


cerebral cell adhesion molecule 


388 


52 


9511 


AY007160 


Homo sapiens 

- 


similar to Homo sapiens CGI-57 
protein mRNA with GenBank 
Accession Number AF1 5 1815.1 


1533 


99 

• 


9512 

■ 


AL160371 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


104 


66 


9513 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


92 


59 


9514 


X80035 


Oryctolagus 
cuniculus 


cysteine rich hair keratin associated 
protein 


96 


40 


9515 


D86491 


Xenopus laevis 


Nfrl 


1777 


76 


9516 


Y70761 


Homo sapiens 


Human beta-amyloid peptide (BAP) 
binding protein, BBP3. 


168 


80 


9517 


AK024500 


Homo sapiens 


FLJ00 109 protein 


1714 


100 


9518 


AF008220 


Bacillus subtilis 


YtaG 


104 


41 


9519 


AF099977 


Mus musculus 


schlafen4 


163 


35 


9520 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA3563 8. 


162 


72 


9521 


AL035068 


Homo sapiens 


dJ116G19.l (novel protein) 


102 


76 


9522 


B24527 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 15 SEQ ID NO:153. 


729 


90 


9523 


AF129756 


Homo sapiens 


MSH5 


182 


100 


9524 


AJ238706 


Drosophila 
melanogaster 


monocarboxyiate transporter 1 
homologue 


126 


32 


9525 


Y76198 

■ 


Homo sapiens 

• 


Human secreted protein encoded by 
gene 75. 


250 


97 

- 


9526 


D86604 


Mus musculus 


Bach2 


200 


92 


9527 . 


AB032918 


Hylobates 
moloch 


dopamine receptor D4 


91 


36 


9528 


AB007830 


Homo sapiens 


CSR2 


995 


99 


9529 


AC004472 


Homo sapiens 


PI. 11659 4 


449 


98 


9530 


AF273047 


Homo sapiens 


CTCL tumor antigen se20-7 


104 


29 


9531 


AF130079 


Homo sapiens 


PR02852 


153 


67 


9532 


X07881 


Homo sapiens 


pro line-rich protein Gl 


105 


43 


9533 


AK001845 


Homo sapiens 


unnamed protein product 


1368 


63 


9534 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


94 


41 


9535 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


185 


86 


9536 


X99583 


Homo sapiens 


CHL1 protein 


635 


98 


9537 


AL022170 


Homo sapiens 


dJ501N12.1 


111 


37 


9538 


X66179 


Xenopus laevis 


p70 S6 kinase 


120 


75 
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err* m 
NO: 






T% A* ¥"1+ ! An 

i/cscn p u oo 


Qmifk- 
OIUlUt a ' 

Waterman 
Score 


7% 

y 


9539 


U68380 


Gallus gallus 


csdp 


289 


98 


. 9540 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


97 


90 


9541 


AB013918 


Homo sapiens 


CAD 


369 


78 


9542 


U29380 


Caenorhabditis 
elegans 


similar to adenylate cyclase 


501 


42 


9543 


AF020312 


Mus musculus 


proline-ricb protein 9-1 


95 


51 


9544 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7516. 


142 


96 


9545 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


142 

■ 


66 


9546 


Y01158 


Homo sapiens 

i 


Secreted protein encoded by gene 18 
clone HCACJ81. 


110 


47 


9547 


AB044546 


Homo sapiens 


mitogen-activated protein kinase kinase 
kinase 


407 


60 


9548 


U52111 


Homo sapiens 


Ca2+/Calmodu tin-dependent protein 
kinase I 


660 


100 


9549 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


129 


68 


9550 


Y87852 


Homo sapiens 


Human FGF-5 protein fragment 


146 


100 


9551 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


109 


50 

- 


9552 


D45131 


Homo sapiens 


basigin 


311 


66 


9553 


A14656 


synthetic 
construct 


protein antigen 


450 


89 


9554 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


133 


52 


9555 


X73458 


Homo sapiens 


protein kinase 


568 


85 


9556 


AJ131890 


Homo sapiens 


DNA polymerase lambda 


1549 


100 


9557 


S56555 


Homo sapiens 


paraoxonase/arylesterase 


224 


85 


9558 


M22332 


Homo sapiens 


unknown protein 


90 


38 


9559 


B24598 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 47 SEQ ID NO:224. 


66 


44 


9560 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


95 


51 


9561 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


77 


76 


9562 


X55687 


Lycopersicon 
escuientum 


extensin (class II) 


69 


60 


9563 


AC002333 


Arabidopsis 
thaliana 


putative SF16 protein {Helianthus 
annuus} 


111 


31 


9564 


AF 11 8082 


Homo sapiens 


PRO1902 


106 


61 


9565 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


85 


52 


9566 


U10323 


Homo sapiens 


NF45 protein 


480 


82 


9567 


AF100757 


Homo sapiens 


COP9 complex subunit 4 


686 


90 


9568 


AL035608 


Homo sapiens 


dJ479J7^ (transmembrane 4 
superfamily member 6) 


293 

• 


90 


9569 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4494. 


118 


63 


9570 


U47856 


Araneus 
diadematus 


fibroin-4 


98 


44 

■ 


9571 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


83 

■ 


54 


9572 


AJ006239 


Homo sapiens 


dfliydropteridine reductase | 


461 


84 


9573 


G00821 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4902. 


248 


75 


9574 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


90 


36 


9575 


V00488 


Homo sapiens 


alpha globin 


508 


84 


9576 


AF116719 


Homo sapiens 


PR02987 


483 


91 
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NO: 


vAccession No. 


Species 

> 


Description 


Smitb- 

Watennaa 
Score 


Identit 

y 


9577 

■ 


L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


62 


38 


9578 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


474 


92 


9579 


P90387 


Homo sapiens 
(Human) 


N-terminal of human serum albumin 
polypeptide. 


274 

■ 


67 


9580 


U22961 


Homo sapiens 

. ■ 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by . 
author 


271 

■ 


88 


9581 


U15779 


Homo sapiens 


p70 


101 


57 


9582 


M19419 


Mus m us cuius 


proline-rich salivary protein 


102 


36 


9583 


L22030 


Glycine max 


hydroxyprolme-rich glycoprotein 


114 


40 


9584 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


105 


32 


9585 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


143 


61 


9586 


U93569 


Homo sapiens 


putative pi 50 


232 


51 


9587 


AF 144054 


Homo sapiens 


apoptosis related protein APR-4 


120 


52 


9588 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


128 


58 


9589 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


90 


61 


9590 


L27428 


Homo sapiens 


reverse transcriptase 


120 


46 


9591 


G03714 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
7795. 


85 


62 

• 


9592 


AJ133489 


Canis famiJiaris 


albumin 


204 


60 


9593 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


294 


68 


9594 


M15386 


Homo sapiens 


gamma-globin 


316 


77 


9595 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


134 


68 


9596 


AF1 16719 


Homo sapiens 


PR02987 


309 


89 


9597 


AF1 16719 


Homo sapiens 


PR02987 


485 


94 


9598 


AF068294 


Homo sapiens 


HDCMB45P 


131 


34 


9599 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


152 


70 


9600 


AF 118082 


Homo sapiens 


PRO1902 


151 


59 


9601 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


186 


68 


9602 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
787L 


176 


65 


9603 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


261 


80 


9604 


AF194537 


Homo sapiens 


NAG 13 


154 


61 


9605 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


96 


67 


9606 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


100 


33 


9607 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


134 


48 


9608 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


129 


67 

■ 


9609 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


101 


32 


9610 


AF 11 8080 


Homo sapiens 


PRO1880 


237 


100 


9611 


AF1 19851 


Homo sapiens 


PRO 1722 


138 


81 


9612 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


178 


55 


9613 


AF09093O 


Homo sapiens 


PRO0478 


88 


71 


9614 


AF068294 


Homo sapiens 


HDCMB45P 


121 


49 


9615 


G01657 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


79 
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SFO ID 
NO: 






I/CdWl I|/UVII 


Smith- 
Waterman . 
Score 


Identit 

y 








5738. 






9616 


AF090931 


Homo sapiens 


PRO0483 


143 


63 


9617 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


92 


44 


9618 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


121 


56 


9619 


AB046061 


Macaca 
fascicularis 


unnamed protein product 


93 


44 


9620 


Y87116 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 155. 


122 


71 


9621 


AF1 13685 


Homo sapiens 


PRO0974 


156 


65 


9622 


AF097178 . 


Equus caballus 


glyceraldehyde-3-phosphate 
dehydrogenase 


119 


75 


9623 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


134 


62 


9624 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


112 


53 


9625 


G02532 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


123 


77 


9626 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


164 


73 


9627 


Y86573 


Homo sapiens 


Human gene 91-encc 
fragment, SEQ ID N< 


ided protein 
0:490. 


236 


71 


9628 


AF130079 


Homo sapiens 


PR02852 


138 


50 


9629 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


130 


60 


9630 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


167 

■A — .. - 


61 


9631 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7183. 


87 


50 


9632 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


128 


41 


9633 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


134 


81 


9634 


AF1 18082 


Homo sapiens 


PRO 1902 


114 


57 


9635 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. . 


133 


54 


9636 


L07543 


Leishmania 
tarentolae 


MURF4 


85 

* 


42 


9637 


U49973 


Homo sapiens 


ORF2: function unkn 


own 


163 


44 


9638 


AF090895 


Homo sapiens 


PRO0117 


77 


41 


9639 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


108 


71 


9640 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


90 


46 


9641 


AF1 18078 


Homo sapiens 


PR01848 


114 


66 


9642 


AK024455 


Homo sapiens 


FLJ00047 protein 


131 


79 


9643 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40J. 


130 


62 


9644 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


141 


42 


9645 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


69 


62 


9646 


AF055985 


Onchocerca . 
volvulus 


pyrrolidone-rich antigen 


69 


35 


9647 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


478 


90 


9648 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


155 


50 


9649 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


60 
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SEQID 
NO: 


i Accession No. 

• 


Species 

i 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
7 






i 


6619. 






9650 


G03I33 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


173 


38 


-9651 

'» 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


118 


36 


9652 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


162 


64 


9653 


D00570 


Mus musculus 


open reading frame (1 96 AA) 


182 


51 


9654 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


138 


54 


9655 


Y02693 


Homo sapiens 

■ * 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


105 


74 


9656 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


82 


70 


9657 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


79 


39 


965S 


M24732 


Homo sapiens 


lam in-like protein 


76 


31 


9659 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


161 


73 


9660 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct 


149 


48 


9661 

■ 


W88627 

• 


Homo sapiens 

• 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


135 • 


62 

• 


9662 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} . 


126 


71 


9663 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element , 


141 


63 


9664 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


153 


66 


9665 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


142 


93 


9666 


AF 113685 


Homo sapiens 


PRO0974 


112 


73 


9667 


G02490 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


133 


62 


9668 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


102 


47 


9669 


B12310 


Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 


80 


41 


9670 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


106 


5 & 


9671 


AF090942 


Homo sapiens 


PRO0657 


118 


66 


9672 


AF16I356 


Homo sapiens 


HSPC093 


94 


63 


9673 


AF090930 


Homo sapiens 


PRO0478 


167 


64 


9674 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


98 


50 


9675 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


87 


70 


9676 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


119 


60 


9677 


X13412 


Rattus rattus 


flk protein 


614 


69 


9678 


AJ006770 


Cicer arietinum 


extensin 


163 


48 


9679 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


120 


67 


9680 


G00494 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4575. 


106 


58 


9681 


U63542 


Homo sapiens 


F AP protein 


120 


68 


9682 


M15530 


Homo sapiens 


B-celi growth factor 


146 


76 


9683 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


180 


62 


9684 


AF321379 


Callicebus 
moloch 


gamma2-globin 


108 


86 
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SEQ ID 

NO: 


Accession No. 


Species 

Hi 


Description 


Smith- 
Waterman • 
Score 


Identit 

y 


9685 


L27428 


Homo sapiens 


reverse transcriptase 


122 


39 


9686 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


77 . 


51 


9687 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


104 


58 


9688 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


123 


58 


9689 


X81713 


Homo sapiens 


smallest subunit of TFIIA 


210 


91 


9690 


AF 193330 

• 


Human 
respiratory 
syncytial virus 


G protein 

- 


100 

■ 


25 


9691 


AF150100 


Homo sapiens 


small zinc finger-like protein 


205 


83 


9692 


AF068294 


Homo sapiens 


HDCMB45P 


122 


54 


9693 


AF 189307 


Homo sapiens 


unknown 


96 


50 


9694 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


165 


50 


9695 


AF 130089 


Homo sapiens 


PRO2550 


198 


68 


9696 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


170 

• 


80 


9697 


AK024455 


Homo sapiens 


FLJ00047 protein 


137 


63 


9698 


M24732 


Homo sapiens 


lamin-like protein 


92 


42 


9699 


AF116715 


Homo sapiens 


PR02829 


119 


75 


9700 


AF1 30089 


Homo sapiens 


PRO2550 


234 


57 


9701 


V00662 


Homo sapiens 


ATPase6 


255 


77 


9702 


LI 0908 


Mus musculus 


Gcapl gene product 


93 


35 


9703 


AF1 30089 


Homo sapiens 


PRO2550 


208 


78 


9704 


AF1 94537 


Homo sapiens 


NAG 13 


89 


51 


9705 


U79260 


Homo sapiens 


unknown 


98 


77 


9706 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


78 


43 


9707 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


70 


92 


9708 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


136 


60 

■ 


9709 


AF 1 94537 


Homo sapiens 


NAG 13 


99 


48 


9710 


AF130089 


Homo sapiens 


PRO2550 


87 


58 


9711 


X03145 


Homo sapiens 


pot ORF in 


135 


41 


9712 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


97 


35 


9713 


AF1 16695 


Homo sapiens 


PR02221 


185 


50 


9714 


AF130051 


Homo sapiens 


PRO0898 


108 


68 


9715 


AF 130089 


Homo sapiens 


PRO2550 


143 


50 


9716 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


124 


49 


9717 


L27428 


Homo sapiens 


reverse transcriptase 


133 


50 


9718 


G03133 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7214. 


88 


43 


9719 


L27428 


Homo sapiens 


reverse transcriptase 


127 


56 


9720 


M74009 


Theropithecus 
gelada 


cytochrome c oxidase subunit II 


283 


67 


9721 


M10546 


Homo sapiens 


cytochrome oxidase I 


292 


84 


9722 


AF130051 


Homo sapiens 


PRO0898 


96 


57 


9723 


U93564 


Homo sapiens 


putative pi 50 


144 


49 


9724 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


373 


76 


9725 


AK000496 


Homo sapiens 


unnamed protein product 


155 


55 


9726 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


94 

• 


62 
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NO: 


Accession No. 


Species 


"description 

• 


Smith- 
Waterman 
Score 


*4 

Ideotit 

y 


9727 


A? 130089 


Homo sapiens 


PRO2550 


154 


40 


9728 


AF 130089 


Homo sapiens 


PRO2550 


127 


mt m 

61 


9729 


M19419 


Mus musculus 


proline-rich salivary protein 


98 


39 


9730 


D90053 


Sus scrota 


destnn 


119 


43 


9731 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

mm mm m 

7871. 


138 

. 


50 


9732 


AF090942 


Homo sapiens 


PRO0657 


72 


45 


9733 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


no 


44 


9734 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


159 


94 


9735 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


150 


85 


9736 


L27428 


Homo sapiens 


reverse transcriptase 


217 


33 


9737 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


89 


65 


9738 


AF007064 


Saccharomyces 
cerevisiae 


basic protein 


108 


31 


9739 . 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


128 


60 


9740 


G03140 

i 
i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7221. 


109 


50 


9741 


AF 194537 


Homo sapiens 


NAG13 


123 


54 


9742 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase protein. 


101 


77 


9743 


X92485 


Plasmodium 
vivax 


pval 


106 


41 


9744 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


122 


76 


9745 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


128 


50 


9746 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


87 


64 


9747 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


163 


69 


9748 . 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


124 


74 


9749 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


97 


66 


9750 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


156 


71 


9751 


G03925 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8006. 


157 


71 


9752 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


97 


51 


9753 


X92485 


Plasmodium 
vivax 


pval 


91 


39 


9754 


Y07786 


Vibrio cholerae 


Rfc-like protein 


92 


32 


9755 


AF068294 


Homo sapiens 


HDCMB45P 


125 


63 


9756 


L27428 


Homo sapiens 


reverse transcriptase 


91 


34 


9757 


AK024455 


Homo sapiens 


FLJ00047 protein 


143 


59 


9758 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


103 


41 


9759 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


96 


67 


9760 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


179 


58 


9761 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


154 


65 


9762 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


78 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 1 
Identit * 

y 








7888. 






9763 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


119 


60 


9764 


Y27868 


Homo sapiens — . 


Human secreted protein encoded by 
gene No. 107. 


111 


85 


9765 


R13556 

• 


Homo sapiens 


Protein encoded downstream ofhhc_M , 
oncoprotein. 


116 


36 


9766 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


165 . 


74 


9767 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


162 


74 


9768 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


117 


52 


9769 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


92 


50 


9770 


AF130089 


Homo sapiens 


PRO2550 


104 


31 


9771 . 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


154 


62 


9772 


U52077 


Homo sapiens 


mariner transposase 


194 


56 


9773 


Y87297 


Homo sapiens 


Human signal peptide containing 
protein HSPP-74 SEQ ID NO:74. 


345 


100 


9774 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


73 


36 


9775 

i » 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


96 

* 


32 


9776 


G03I33 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


124 


53 


9777 


L27428 


Homo sapiens 


reverse transcriptase 


251 


53 


9778 


K02576 


Homo sapiens 


salivary proline-rich protein i 


141 


40 


9779 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


111 


58 


9780 


AJ224997 


Ratms 
norvegicus 


huntingtin 


64 


64 


9781 


AF283769 


Homo sapiens 


similar to GenBank Accession Number 
AC021163 


119 


53 


9782 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. . 


81 


50 


9783 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


120 


68 


9784 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


85 


65 


9785 


AF144054 


Homo sapiens 


apoptosis related protein APR-4 


83 


53 


9786 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


75 


41 


9787 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


122 


45 


9788 


D90053 


Sus scrofa 


destrin . ! 


146 


43 


9789 


A18812 


Brassica napus 


extensin 


104 


34 


9790 


AF1 30089 


Homo sapiens 


PRO2550 


102 


66 


9791 


AF130089 


Homo sapiens 


PRO2550 


326 


70 


9792 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


495 


79 


9793 


M10546 


Homo sapiens 


cytochrome oxidase I 


343 


75 


9794 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


106 


58 


9795 


X92485 


Plasmodium 
vivax 


pval 


108 


48 


9796 


M15530 


Homo sapiens 


B-cell growth factor 


119 


70 


9797 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


148 


65 


9798 


X61046 


Hydra sp. 


mini-collagen 


101 


55 
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SEQlOD 
NO: 


» Accession No. 


Species 

> 


Description 


Smith- 

Waterman 

Score 


Identit 

y 


mm iTK 

9799 


U93571 


Homo sapiens 


p40 


82 


36 


9800 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


170 . 


52 


9801 

• 


X92485 


Plasmodium 

• 

vivax 


pval 


113 

• 


45 


9802 


Y 1 4479 


Homo sapiens 


Fragment of human secreted protein 

Iff * A 

encoded by gene 14. 


63 

w- 


44 


9803 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


583 


89 


9804 


G01736 


Homo sapiens 


Human secreted protem, SEQ ID NO: 

mm f% 4 mmm 

5817. 


m mm /v 

150 


m mm 

47 


9805 


Y01400 


Homo sapiens 

* 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


132 


71 


9806 


X92485 


Plasmodium 
vivax 


pval 


106 


53 


9807 


Y86573 


Homo sapiens 


Human gene 91-encoded protein 
fragment, SEQ ID NO:490. 


163 


67 


9808 


AF11985I 


Homo sapiens 


PR01722 


341 


68 


9809 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


77 


57 


9810 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


72 


9811 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


65 


9812 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


118 


37 


9813 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


121 


72 


9814 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


72 


92 


9815 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


71 


71 


9816 


AF1 18082 


Homo sapiens 


PRO 1902 


93 


45 


9817 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871; 


207 


mm f%. 

59 


9818 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


99 


46 


9819 


Y01400 


Homo sapiens 


Secreted protem encoded by gene 18 
clone HNHF029. 


130 


69 


9820 


LI 0908 


Mus muscujus 


GcapJ gene product 


99 


37 


9821 


A00469 


Homo sapiens 


growth hormone 


254 


81 


9822 


U79260 


Homo sapiens 


unknown 


85 


39 | 


9823 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


125 


62 


9824 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


103 


74 


9825 . 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


126 


51 


9826 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


148 


44 


9827 


Y24789 


Homo sapiens 


Human secreted protem bG77_l. 


265 


100 


9828 


G02363 


Homo sapiens 

• 


Human secreted protem, SEQ ID NO: 
6444. 


72 


60 


9829 


G00689 


Homo sapiens 


Human secreted protem, obQ LU mu. 
4770. 


1 1 Q 




9830 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


104 


51 


9831 


G00437 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
4518. 


118 


74 


9832 


AF151866 


Homo sapiens 


CGI-1 08 protein 


115 


31 


9833 


U93563 


Homo sapiens 


putative pi 50 


127 


61 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman ■ 
Score 


% 

Identit 

y 


9834 


AE003859 


Xylella 
fastidiosa 


hypothetical protein 


97 


30 


9835 


G03172 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


138 


65 


9836 


B01372 


Homo sapiens 


Neuron-associated protein. 


150 


88 


9837 


G03115 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7196. 


115 


57 


9838 


AF068294 


Homo sapiens 


HDCMB45P 


90 . 


65 


9839 

• 


S77772 


Homo sapiens 


aspartylglucosaminidase, AGA {C- 
terminal, alternatively spliced} {EC 
3.5.1.26} 


71 

* 


66 

. 


9840 


AF090942 


Homo sapiens 


PRO0657 


115 


76 


9841 . 


G03564 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


75 


54 

- 


9842 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


156 


67 


9843 


AF 130089 


Homo sapiens 


PRO2550 


216 


75 


9844 


M15530 


Homo sapiens 


B-cell growth factor 


113 


67 


9845 


U93567 


Homo sapiens 


p40 


248 


75 


9846 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


151 


55 


9847 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


64 


9848 


AF1 18082 


Homo sapiens 


PRO1902 


58 


81 


9849 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


129 


65 


9850 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


329 


77 


9851 


AF090944 


Homo sapiens 


PRO0663 


122 


80 


9852 


X71442 


Ratals 
norvegicus 


ORF 1; putative 


109 


48 


9853 


AF130089 


Homo sapiens 


PRO2550 


126 


60 


9854 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


98 


62 

- 


9855 


U93563 


Homo sapiens 


putative pl50 


136 


48 


9856 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


276 


74 


9857 


AF017777 


Drosophila 
melanogaster 


lacosta 


99 


43 


9858 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


99 


37 


9859 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


106 


37 


9860 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


138 


71 

• ■ 
* 


9861 


AF1 16909 


Homo sapiens 


unknown 


103 


43 


9862 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


113 


38 


9863 


Y21811 


Homo sapiens 


CPF polypeptide 36PR0. 


131 


58 


9864 


U93567 


Homo sapiens 


p40 


252 


70 


9865 


AF090895 


Homo sapiens 


PRO0117 


164 


60 


9866 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


109 


47 


9867 


AF090930 


Homo sapiens 


PRO0478 


106 


84 


9868 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


167 


84 


9869 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


83 


60 


9870 


AF1 19900 


Homo sapiens 


PR02822 


134 


48 


9871 


AF1 16715 


Homo sapiens 


PR02829 


111 


81 


9872 


U52077 


Homo sapiens 


mariner transposase 


332 


76 
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SEQID 
NO: 


^Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


1* 
Ideotit 

y 


9873 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase protein. 


95 


85 


9874 


U93570 


Homo sapiens 


putative pi 50 


243 


44 


9875 


L27428 


Homo sapiens 


reverse transcriptase — 


127 


54 


9876 


S80119 


Rattus sp. 


reverse transcriptase homolog 


163 


43 


9877 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


113 


64 


9878 


AF1 87823 


Zea mays 


transposase DOPA 


132 


38 


9879 


W40353 


Homo sapiens 


Human unspecified protein from 
US5702907. 


101 


41 


9880 


G03415 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7496. 


133 


43 


9881 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


101 


59 


9882 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


148 


52 


9883 


GO 1249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


57 


65 


9884 


X92485 


Plasmodium 
vivax 


pval 


129 


43 


9885 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


98 


37 


9886 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


89 


59 


9887 


G04072 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
8153. 


155 


54 


9888 


G04063 

♦ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


121 


40 


9889 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


346 


68 


9890 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


133 


38 


9891 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


143 

■ 


66 


9892 


X53581 


Rattus 
norvegicus 


ORF7 


81 


56 


9893 


G03267 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7348. 


109 


53 


9894 


AE003859 


Xylella 
fastidiosa 


hypothetical protein 


99 


27 


9895 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


157 


58 


9896 


L26953 


Homo sapiens 


chromosomal protein 


107 


68 


9897 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


95 


51 


9898 


U93569 


Homo sapiens 


putative pi 50 


106 


52 


9899 


U15647 


Mus musculus 


reverse transcriptase 


118 


47 


9900 


U49973 


Homo sapiens 

» 


ORF1; MER37; putative transposase 
similar to pogo element 


118 


70 


9901 


U93563 


Homo sapiens 


putative pi 50 


116 


39 


9902 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


153 


81 


9903 


X03145 


Homo sapiens 


potORFV 


148 


54 


9904 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


84 


65 


9905 


AF066073 


Dictyostelium 
discoideum 


SP85; PsB 


113 


52 


9906 


AF090895 


Homo sapiens 


PRO0117 


125 


58 


9907 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


102 


57 


9908 


M24732 


Homo sapiens 


lam in-like protein 


95 


35 
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CCA YTV 

NO: 


Accession IiO. 






•jiiiiitt 

Waterman 
Score 


Idcntit 

y 


9909 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


96 


55 


9910 


AF1 18080 


Homo sapiens 


PRO1880 


120 


66 


9911 


AF116719 


Homo sapiens 


PR02987 


216 


72 


9912 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


150 


75 


9913 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


129 


76 


9914 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


72 


58 


9915 


G00233 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4314. 


103 


71 


9916 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


114 


75 


9917 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


75 


83 


9918 


AF116712 


Homo sapiens 


PR02738 


106 


69 


9919 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


90 


77 


9920 


AF068294 


Homo sapiens 


HDCMB45P 


296 


59 


9921 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


58 


54 


9922 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:103. 


143 


69 


9923 


U38979 


Homo sapiens 


hPMSR3 


135 


46 


9924 


AB002317 


Homo sapiens 


KIAA0319 


294 


75 


9925 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


169 


42 


9926 


U49973 


Homo sapiens 

• 


ORF1; MER37; putative transposase. 
similar to pogo element 


153 


59 


9927 


G00214 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4295. 


261 


97 


9928 


Y14483 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 18. 


863 


76 


9929 


AF130051 


Homo sapiens 


PRO0898 


128 


60 


9930 


AF 130089 


Homo sapiens 


PRO2550 


123 


41 


9931 


AF064597 


Homo sapiens 


LINE-1 like protein 


101 


48 


9932 


AF027144 


Homo sapiens 


zinc finger protein 


76 


42 


9933 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


120 


44 


9934 


U93569 


Homo sapiens 


putative pl50 


105 


68 


9935 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


133 


51 


9936 


Y73344 


Homo sapiens 


HTRM clone 0258181 protein 
sequence. 


145 


46 


9937 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


123 


77 


9938 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


159 


40 


9939 


D63643 


Homo sapiens 


clathrin coat assembly protein-like 


232 


69 


9940 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


75 


65 


9941 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


176 


71 


9942 


X04588 


Homo sapiens 


cytoskeletal tropomyosin (AA 1-248) 


379 


81 


9943 


G03683 


Homo sapiens 


Human secreted protein, SEQ ID NO: , 
7764. 


58 


52 


9944 


AF130114 


Homo sapiens 


PR02459 


117 


67 


9945 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


297 


64 



398 



NO: 


Accession vHvm 


opCCIt-5 




Smith- 

Waterman 

Score 


Identit 

y 








similar to pogo element 






9946 


D00570 


Mus musculus 


open reading frame (25 1 AA) 


214 


63 


9947 


X03717 


Homo sapiens 


pot. unidentified reading frame 


76 


35 


9948 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


127 


64 


9949 


J02621 


Homo sapiens 


high mobility group protein 14 , 


90 


50 


9950 


AF090931 


Homo sapiens 


PRO0483 


114 


75 


9951 


AF164612 . 


Homo sapiens 


Gag protein 


132 


63 


9952 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


112 


65 


9953 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


118 


59 


9954 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81, 


138 


74 


9955 


G00974 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5055. 


94 


38 


9956 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


124 


61 


9957 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


177 


51 


9958 


X92485 


Plasmodium 
vivax 


pval 


137 


44 


9959 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


142 


53 


9960 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


174 


68 


9961 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


123 


38 


9962 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


124 


56 


9963 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


80 


.57 


9964 


AP000616 

i 


Oryza saliva 


similar to RING-H2 finger protein 
RHAla(AF078683) 


136 


82 


9965 


Y27908 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


92 


53 


9966 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


76 


40 


9967 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


136 


38 


9968 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


101 


67 


9969 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


90 


83 


9970 


Y36156 


Homo sapiens 


Human secreted protein #28. 


170 


68 


9971 


U79260 


Homo sapiens 


unknown 


142 


67 


9972 


L27428 


Homo sapiens 


reverse transcriptase 


108 


63 


9973 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


119 


60 


9974 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


78 


85 


9975 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


79 


39 


9976 


AF090944 


Homo sapiens 


PRO0663 


109 


73 


9977 


AF1 16715 


Homo sapiens 


PR02829 


120 


56 


9978 


U93563 


Homo sapiens 


putative pi 50 


132 


57 


9979 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


97 


80 


9980 


U42580 


Paramecium 
bursaria 


A658R 


70 


40 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Tdeotit 

y 






Chlorella virus 1 








9981 


G03371 


Homo.sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


87 


61 


9982 


G02363 


Homo sapiens 


Human secreted'protein, SEQ ID NO: 
6444. 


135. 


70 


9983 


X61047 


Hydra so. 


mini-collagen 


90 


36 


9984 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


98 


40 


9985 


Y41552 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 80. 


90 


42 


9986 


G03411 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7492. 


132 


52 


9987 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


115 


41 


9988 


AF042832 


Homo sapiens 


forkhead-retated transcription factor 
FREAC-9 


89 




9989 


X03717 


Homo sapiens 


pot. unidentified reading frame 


105 


67 


9990 


AL390935 


Leishmania 
major i 


possible dopap 


118 


74 


9991 


AF201949 


Homo sapiens 


60S ribosomal protein L30 isolog 


109 


58 


9992 


AF1301J4 


Homo sapiens 


PR02459 


115 


68 


9993 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


148 


83 


9994 


AF068294 


Homo sapiens 


HDCMB45P 


196 


43 


9995 


Y95028 


Homo sapiens 


Human clone vo7 1 ORF2, SEO ID 
NO:128. 


117 

1 


54 


9996 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


74 


S3 


9997 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


132 


55 


9998 


AF102771 


Homo sapiens 


cardiac-specific ras association 
(RalGDS/AF-6) domain family 1 
protein isoform ID 


148 


100 


9999 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


213 


88 


10000 


D38112 


Homo sapiens 


ATPase subunit 6 


279 


69 


10001 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


97 


39 


10002 


AK000496 


Homo sapiens 


unnamed protein product 


154 


74 


10003 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


125 


74 




B07702 


Homo sapiens 


Protein encoded by the endogenetic 
fragment of HERV-W. 


143 


56 


10005 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


97 


65 


10006 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


154 


72 


10007 


AF021231 


Mus museums 


acetylcholinesterase>associated 
collagen 


109 


41 


10008 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


69 


78 


10009 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


170 


.78 


10010 


AF1 30051 


Homo sapiens 


PRO0898 


134 


67 


10011 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


100 


44 


10012 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


95 


78 


10013 


AF073519 


Homo sapiens 


small EDRK-rich factor 1, long isoform 


90 


52 


10014 


AF130052 


Homo sapiens 


PRO0956 


104 


50 



400 



WO 01/64835 PCT/US01/04927 



NO: 


Accession No. 
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omirn- 

Watcnnan 

Score 


rc 1 

Identft 

y 


10015 


X92485 


Plasmodium 
vivax 


pval 


119 


46 


10016 


AF1 30089 


Homo sapiens 


PRO2550 


171 


56 


10017 


AF130089 


Homo sapiens ~ 


PRO2550 


172 


58 


10018 


G03133 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
7214. 

* 


146 


38 


10019 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


109 


54 


10020 


G02584 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6665. 


109 


43 


10021 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


70 


51 


10022 


AF090942 


Homo sapiens 


PRO0657 


113 


67 


10023 


Ml 1759 

* 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


77 


33 


10024 


X55681 


Lycopersicon 
esculentum 


extensin (class I) 


95 


35 


10025 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


93 


61 


10026 


W88627 


Homo sapiens 

* 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


153 


42 


10027 


Y73377 


Homo sapiens 


HTRM clone 1645941 protein 
sequence. 


1180 


84 


10028 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


108 


59 


10029 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4447. 


112 


65 


10030 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 

*^ .. 


163 


82 


10031 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


131 


66 


10032 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


74 


48 


10033 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


104 


85 


10034 


G03093 


Homo sapiens ~* 


Human secreted protein, SEQ ID NO: 
7174. 


192 


60 


10035 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


153 


55 


10036 


Y14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


128 


76 


10037 


AFt 18086 


Homo sapiens 


PR01992 


148 


68 


10038 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


125 


47 


10039 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


137 


65 


10040 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


139 


45 


10041 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


128 


75 


10042 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


100 


33 


10043 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


102 


64 


10044 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


135 


71 


10045 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


148 


46 


10046 


X92485 


Plasmodium 
vivax 


pval 


98 


58 
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Waterman 
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% 
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10047 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


102 


87 


10048 ■ 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


63 


84 


10049 


U09202 


Homo sapiens 


ornithine decarboxylase antizyrae 


202 


67 


10050 


X92485 

■ 


Plasmodium 
vivax 


pval 


77 


34 


10051 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
8144. 


87 


40 


10052 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


114 


57 


10053 


AC005396 


Arabidopsis 
thaJiana 


putative proline-rich protein 


100 


33 


10054 


G03600 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7681. 


91 


94 


30055 


X73434 


Ovis aries 


KAP5.4 keratin protein 


76 


45 


10056 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: . 
6566. 


82 


62 


10057 


G03789 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7870. 


115 


66 


10058 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


116 


55 


10059 


G00328 

- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


130 


73 


10060 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


100 


42 


10061 


L27428 


Homo sapiens 


reverse transcriptase 


107 


43 


10062 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


92 


47 


10063 


G03112 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


93 


46 


10064 


AF220264 


Homo sapiens 


MOST-1 


135 


48 


10065 


AF130114 


Homo sapiens 

* 


PR02459 


87 


68 


10066 


D86853 


Catharanthus 
roseus 


extensin 


115 


35 


10067 


G00500 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4581. 


79 


71 


10068 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


102 


54 

■ 


10069 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 




93 


77 


10070 


AJ233597 


Mus famulus 


reverse transcriptase 


65 


36 


10071 


G04072 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
8153. 


94 


67 


10072 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


114 


51 


10073 


AK024455 


Homo sapiens 


FLJ00047 protein 


146 


66 


10074 


X92485 


Plasmodium 
vivax 


pval 


121 


36 


10075 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


64 

■ 


10076 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


73 


43 


10077 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


165 


75 


10078 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


106 


72 


10079 


AF130050 


Homo sapiens 


PRO0872 


102 


63 


10080 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


44 
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omttfi- 
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% 
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y 


■ 


* 


i 


8153. 






10081 


Y87156 


Homo sapiens 

• ■ 


Human secreted protein sequence SEQ 
1DN0:195. 


285 

* 


78 


10082 


AF068294 


Homo sapiens 


HDCMB45P 


144 


68 


10083 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 

■ 


98 


48 

■ 


10084 


X92485 


Plasmodium 
vivax 


pval 


108 


53 


10085 


AF1 18082 


Homo sapiens 


PRO1902 


117 


57 


10086 


AF090942 


Homo sapiens 


PRO0657 


88 


53 


10087 


AF1 18086 


Homo sapiens 


PR01992 


172 


71 


10088 


Y01400 


Homo sapiens * 

« 


Secreted protein encoded by gene 18 
clone HNHF029. 


119 


68 


10089 


K02576 


Homo sapiens 


salivary proline-rich protein I 


117 


34 


10090 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


154 


77 


10091 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


138 


56 


10092 


G03119 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7200. 


90 


56 


10093 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


134 


66 


10094 


Ml 5530 


Homo sapiens 


B-cell growth factor 


116 


64 


10095 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


124 


65 


10096 


Ml 5530 


Homo sapiens 


B-cell growth factor 


87 


68 


10097 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


84 


79 


10098 


G00369 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4450. 


150 


66 


10099 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


157 


70 


10100 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. "' 


103 


60 


10101 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


114 


70 


10102 

■ 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


130 


53 


10103 


G03786 " 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


156 


52 


10104 


G00365 


Homo sapiens 


Human secreted protein, SEQID NO: 
4446. 


87 


60 


10105 


L27428 


Homo sapiens 


reverse transcriptase 


220 


38 


10106 


W90834 


Homo sapiens 


Human lymphocyte targeted peptide 
#2. 


91 


43 


10107 


AF161356 


Homo sapiens 


HSPC093 


227 


61 


10108 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


123 


64 


10109 


AF1 30089 


Homo sapiens 


PRO2550 


126 


77 


10110 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


118 


60 


10111 


M63274 


Plasmodium 
falciparum 


malaria antigen 


81 


54 


10112 


AF090944 


Homo sapiens 


PRO0663 


133 


70 


10113 


L27428 


Homo sapiens 


reverse transcriptase 


91 


32 


10114 


Y17221 


Homo sapiens 


Human secreted protein (clone flc3 17- 
3). 


112 


64 


10115 


W88947 


Homo sapiens 

* 


Polypeptide fragment encoded by gene 
118. 


85 


44 
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Smith- 
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% 
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y 


10116 


L27428 


Homo sapiens 


reverse transcriptase 


112 


44 


10117 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


115 


61 


10118 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: - 
8144. 


-79 


34 


10119 


AF1 94537 


Homo sapiens 


NAG13 


295 


51 


10120 


D00570 


Mus musculus 


open reading frame (196 AA) 


97 


40 


10121 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


122 


47 


10122 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


148 

■ 


55 


10123 


G03793 . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


92 


66 


10124 


AF090942 


Homo sapiens 


PRO0657 


127 


78 


10125 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


145 


68 


10126 

.■ 


G03807 

> 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


113 


49 


10127 


U93563 


Homo sapiens 


putative pi 50 


94 


35 


10128 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


146 

■ 


79 


10129 


AF130089 


Homo sapiens 


PRO2550 


208 


75 


10130 


Ml 5530 


Homo sapiens 


B-cell growth factor 


102 


73 


10131 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


122 


30 


10132 


L27428 


Homo sapiens 


reverse transcriptase 


197 


40 


10133 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


9?. 


73 


10134 


AF 194537 


Homo sapiens 


NAG 13 


220 


48 


10135 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


116 


75 

* 


10136 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


117 


61 


10137 


Y86573 


Homo sapiens 


Human gene 91-encoded protein 
fragment, SEQ ID NO:490. 


223 


59 


10138 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


178 


75 


10139 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


170 


52 


10140 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


133 


64 


10141 


AF072164 


Homo sapiens 


HFSE-1 


80 


60 


10142 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


103 


74 


10143 


M15530 


Homo sapiens 


B-cell growth factor 


91 


40 


10144 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


103 


61 


10145 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


162 


65 


10146 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


128 


53 


10147 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:103. 


112 


61 


10148 


X71975 


Drosophila 
melanogaster 


put homologue to S.cerevisiae GAR1 
gene 


118 


39 


10149 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


123 


49 


10150 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


163 


81 
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10151 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


106 


83 


10152 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


117 


52 


10153 


AF1 13685 


Homo sapiens 


PRO0974 


133 


62 


10154 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


129 


67 


10155 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


89 


50 


10156 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


41 


10157 


AB011099. 


Homo sapiens 


KIAA0527 protein 


94 


35 


10158 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


78 

■ 


72 


10159 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


139 


71 


10160 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


161 


68 


10161 

- 


G02872 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


162 

• 


88 


10162 


Ml 5530 , 


Homo sapiens 


B-cell growth factor 


137 


63 


10163 


AF090895 


Homo sapiens 


PRO0117 


93 


67 


10164 


G03043 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7124. 


109 


60 


10165 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


126 


52 


10166 


AF 118082 


Homo sapiens 


PRO 1902 


111 


55 


10167 


AF090895 


Homo sapiens 


PRO0117 


112 


75 


10168 


G00328 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4409. 


135 


69 


10169 


AF090942 


Homo sapiens 


PRO0657 


149 


67 


10170 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


145 


68 


10171 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


85 


51 


10172 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


136 


62 


10173 

■ 


G02532 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6613. 


158 


63 


10174 


AF090942 


Homo sapiens 


PRO0657 


75 


44 


10175 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


97 


47 


10176 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 
WO9901020. 


108 


75 


10177 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


168 


80 


10178 


M22332 


Homo sapiens 


unknown protein 


131 


62 


10179 


U49973 

• 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


160 


60 


30180 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


65 


68 


10181 


AF 119900 


Homo sapiens 


PR02822 


132 


72 


10182 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


114 


56 


10183 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


124 


59 


10184 


AF2 10651 


Homo sapiens 


NAG 18 


150 


65 


10185 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


109 


46 


10186 


X71442 


Rattus 


ORF 1; putative 


91 1 47 
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norvegicus 








10187 


AK024455 


Homo sapiens 


FLJ00047 protein 


122 


54 


10188 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


82 


72 


10189 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


78 


61 


10190 


AF220264 


Homo sapiens 

JT 


MOST-1 


165 


82 


10191 


Y91617 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 19 SEQ ID NO:290. 


54 


44 


10192 

• 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


119 


80 


10193 


X92485 


Plasmodium 
vivax 


pval 
* 


128 


57 


10194 


X61296 


Rattus 
norvegicus 


open reading frame 2 


79 


72 


10195 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


116 


62 


10196 


AF130089 


Homo sapiens 


PRO2550 


144 


62 


10197 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


124 


56 


10198 


AF090942 


Homo sapiens 


PRO0657 


133 


76 


10199 


S79304 


Rattus sp. 


cytochrome oxidase subunit I; COX I 


280 


66 


10200 


S61070 


Homo sapiens 


reverse transcriptase homolog^ol 
{retroviral element} 


188 


49 


10201 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


51 


10202 


AF068294 


Homo sapiens 


HDCMB45P 


174 


65 


10203 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


174 


50 


10204 


U93572 


Homo sapiens 


putative pi 50 


77 


40 


10205 


L27428 


Homo sapiens 

* 


reverse transcriptase 

-_, — ^ — — -r — - — ,, . — , 1 


133 


50 


10206 


J02621 


Homo sapiens 


high mobility group protein 14 


94 


57 


10207 


AF130079 


Homo sapiens 


PR02852 


144 


73 


10208 


R95913 


Homo sapiens 


Neural thread protein. 


134 


51 


10209 


U49973 


Homo sapiens 


ORF 1 ; MER37 ; putative transposase 
similar to pogo element 


154 


76 


10210 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


162 


60 


10211 


AF090931 


Homo sapiens 


PRO0483 


128 


79 


10212 


AF130051 


Homo sapiens 


PRO0898 


144 


76 


10213 


G03798 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
7879. 


95 


54 

■ 


10214 


AF130089 


Homo sapiens 


PRO2550 


158 


70 


10215 


G03564 • 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7645. 


96 


55 


10216 


M15894 


Homo sapiens 

* 


chorionic somatomammotropin 
precursor 


116 


63 


10217 


AK024372 


Homo sapiens 

* 


unnamed protein product 


85 


56 


10218 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


118 


69 


10219 


M37679 


Mus museums 


Ig heavy chain precursor 


64 


83 


10220 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 


91 


69 


10221 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


127 


47 


10222 


Y36505 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 29. 


102 


48 


10223 


AF090940 


Homo sapiens 


PRO0644 


105 


76 


10224 


U49974 


Homo sapiens 


mariner transposase 


155 


76 


10225 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


154 


42 
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• * * 



SEQ ID 

NO: 


Accession No. 


Species 

» 

- 


Description 


Smith- 

Wnfprma n 

wr a(M Mia lj 

Score 


% 
y 


10226 


AF293639 


Homo sapiens 


dioxin receptor repressor 


819 


98 


• 10227 


L27428 


Homo sapiens 


reverse transcriptase 


137 


44 


10228 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


106 


70 


1X)229 


U93566 


Homo sapiens 


p40 


206 


80 


10230 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


114 


66 


10231 


AF 118082 


Homo sapiens 


PRO1902 


76 


52 


10232 


AF1 18086 


Homo sapiens 


PR01992 


99 


75 


10233 


X53375 


Helianthus 
annuus 


anther-specific protein SF1 8 

1 


95 


64 


10234 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


156 


72 


10235 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


100 


35 


10236 


L27428 


Homo sapiens 


reverse transcriptase 


244 


50 


10237 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


93 


70 


10238 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


142 


72 


10239 


Y01400 , 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


149 


80 


10240 


S52010 


Mus sp. 


1st Met is at position 21 


98 

^F* 


42 


10241 


AL1 10147 


Homo sapiens 


hypothetical protein 


125 


35 


10242 


X98485 


Plasmodium 
vivax 


putative 

Br ^FW ^^»^^r* T ^r 

1 


85 


51 


10243 


S58722 


Homo sapiens 


X- linked retinopathy protein IC- 
terminal, clone XEH.8c} ■ 


70 


65 

Vf* 


10244 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


125 


49 


10245 


U49973 


Homo sapiens 

* - 


ORF2: function unknown 


150 


81 


10246 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


113 


56 


10247 


AF 130089 


Homo sapiens 


PRO2550 


139 


73 


10248 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


160 


59 


10249 


U93565 


Homo sapiens 


putative pi 50 


109 


43 


10250 


AF130089 


Homo sapiens 


PRO2550 


104 


54 


10251 


G04063 


Homo sapiens 


Human secreted protein, SEO ID NO: 
8144. 


78 


36 


10252 


G04063 


Homo sapiens 

• 


Human secreted protein. SEO ID NO: 
8144. 


110 


58 


10253 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


123 


54 


10254 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


95 


57 


10255 


X03145 


Homo sapiens 


pot ORF V 


76 


63 


10256 


Y01405 


Homo sapiens 


Secreted protein encoded by gene 23 
clone HDPBA48. 


116 


65 


10257 


G03411 


Homo sapiens 


Human secreted protein* SEO ID NO: 
7492. 


113 


68 


10258 


G00333 


Homo sapiens . 


Human secreted protein. SEO ID NO: 
4414. 


138 


68 


10259 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


135 


60 


10260 


G02528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6609. 


95 


59 


10261 


M86246 


Homo sapiens 


EHS-2 


96 


62 


10262 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


180 


66 


10263 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: j 


102 


62 
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SEQID 
NO* 


Accession No. 


Species 


Description 


Smith- 

Watermnn ' 
Score 


% 

IHpntit ' 

y 








7871. 






10264 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


313 


84 


10265 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


144 


56 


10266 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


150 


61 


10267 


Y91452 


Homo sapiens 

• 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:125. 


408 


89 


10268 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


74 

* 


26 


10269 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


133 


52 


10270 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 


61 


10271 


X61048 


Hydra sp. 


mini-collagen 


121 


40 


10272 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


150 


66 


10273 


L24521 


Homo sapiens 


transformation-related protein 


134 


51 


10274 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 


96 


47 


10275 


U59446 


Brassica napus 


myrosinase-binding protein related 
protein 


115 


39 


10276 


GO 1480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5561. 


482 


98 


10277 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


143 


81 


10278 


U93564 


Homo sapiens 


putative pi 50 


130 


46 


10279 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


123 


44 


10280 


AF090944 


Homo sapiens 


PRO0663 


156 


73 


10281 


AF130079 


Homo sapiens 


PR02852 


146 


75 


10282 


X89401 


Homo sapiens 


ribosomal protein L21 


108 


48 


10283 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


65 


66 


10284 


Y25821 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 41. 


1602 


98 


10285 


AF010144 


Homo sapiens 

* 


neuronal thread protein AD7c-NTP 


308 


43 


10286 


AF090931 


Homo sapiens 


PRO0483 


131 


74 


10287 


Y48480 


Homo sapiens 


Human breast tumour-associated 
protein 25. 


203 


90 


10288 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


116 


44 


10289 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


160 


64 


10290 


X67863 


Mus musculus 


72 


147 


37 


10291 


AF229067 


Homo sapiens 


PADI-H protein 


123 


43 


10292 


AF078844 


Homo sapiens 


hqp0376 protein 


125 


59 


10293 

• 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


123 


37 


10294 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


103 


64 


10295 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


162 


70 


10296 


V00662 


Homo sapiens 


ATPase 6 


268 


88 


10297 


AB007861 


Homo sapiens 


KIAA0401 


424 


92 


10298 


M24732 


Homo sapiens 


lamin-like protein 


183 


40 


10299 


D90053 


Sus scrofa 


destrin 


123 


48 


10300 


X61047 


Hydra sp. 


mini-collagen 


86 


43 
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CCA 1T\ 

NO: 


Accession No. 


Species 


Description 


Smitb- 

WatCJTDflD ' 

Score 


% 
y 


10301 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


180 


50 


10302 


AF1 18078 


Homo sapiens 


PR01848 


157 


63 


10303 


X55681 


Lycopersicon 
esculentum 


extensin (class 1) 


87 


36 


10304 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


105 


62 


10305 


X70944 


Homo sapiens 


PTE-associated splicing factor 


84 


32 


10306 


AF068294 


Homo sapiens 


HDCMB45P 


220 


42 


10307 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


96 


41 


10308 


AJ005559 


Mus musculus 


SPR2A protein 


64 


42 


10309 


AL353995 


Arabidopsis 
thaliana 


AtAGP4 


98 


38 


10310 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


213 

• 


92 

• 


10311 


M15386 


Homo sapiens 


gamma-globin 


614 


88 


10312 


V00488 


Homo sapiens 


alpha globin 


574 


90 


10313 


AL049730 


Arabidopsis 
thaliana - 


pEARLI 1-like protein 


117 


38 


10314 

• 


AL356299 


Homo sapiens 


bK3216D2.1.1 (S- 

adenosy Ihomocysteine hydrolase 

(SAHH), isoform 1) 


256 


70 


10315 


AF1 94537 


Homo sapiens 


NAG13 


113 


63 


10316 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


99 


62 


10317 


AF090931 


Homo sapiens 


PRO0483 


86 


68 


10318 


Y27854 


Homo sapiens 


Human secreted protein encoded by 

Mm • 

gene No. 101. 


102 


77 


10319 


L29219 


Homo sapiens 


clkl; putative 


341 


100 


10320 


V00488 


Homo sapiens 


alpha globin 


314 


89 


10321 


AJ223953 


Homo sapiens 


hPTTG 


354 


89 


10322 


X79389 


Homo sapiens 


glutathione transferase Tl 


78 


75 


10323 

• 


Y45382 

• 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


101 

• 


66 

* 


10324 


AF116715 


Homo sapiens 


PR02829 


128 


73 


10325 


AF1 16619 


Homo sapiens 


PRO1051 


125 


77 


10326 


Y14318 


Homo sapiens 


peroxisomal ABC-transporter 


249 


100 


10327 


U73844 


Homo sapiens 


ESE-la 


185 


90 


10328 


AB051901 


Homo sapiens 


VDUP1 


132 


83 


10329 


AH63300 


Homo sapiens 


human ubiquitin conjugating enzyme 
G2 EC 63.2.19. 


720 


100 


10330 


AF1 17237 


Homo sapiens 


prefoldin subunit2 


.729 


100 


10331 


D42054 

• 


Homo sapiens 


KIAA0092 gene product is distantly 
related to smooth muscle myosin. 


352 


84 


10332 


AF038965 


Homo sapiens 


26S proteasome ATPase subunit 


431 


100 


10333 


Y94960 


Homo sapiens 


Human secreted protein clone mil 17_1 
protein sequence SEQ ID NO: 126. 


399 


96 


10334 


X60376 


Brassica napus 


proline-rich protein 


114 


31 


10335 


M34427 


Homo sapiens 


T-plastin 


343 


91 


10336 


V00488 


Homo sapiens i 


alpha globin 


445 


88 


10337 


M15386 


Homo sapiens 


gamma-globin 


402 


85 


10338 


X61 123 


Homo sapiens 


BTG1 


367 


90 


10339 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


278 


87 


10340 


A03992 


synthetic 
construct 


growth hormone 


145 


61 


10341 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


57 | 
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SEQ ID 
NO- 


1 Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


1 den tit 

y 




i 




4518. 






10342 


U02680 


Homo sapiens 


protein tyrosine kinase 


140 


92 


10343 


AF1 80681 


Homo sapiens 


guanine nucleotide exchange factor 


1425 


88 


10344 

** 


G02994— 


Homo sapiens 


Human secreted protein; SEQ ID NO: 
7075. 


123 


69 


10345 


V00488 


Homo sapiens 


alpha globin 


318 


93 


10346 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (P0) 


329 


98 


10347 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


100 


66 


10348 


U29953 


Homo sapiens 


pigment epithelium-derived factor 


359 


74 


10349 


G01790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5871. 


85 


66 


10350 


V00488 


Homo sapiens 


alpha globin 


361 


90 


10351 


U93567 


Homo sapiens 


p40 


203 


84 


10352 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


253 


85 


10353 


AF1 94537 


Homo sapiens 


NAG 13 


196 


41 


10354 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


297 


81 


10355 


Y01400 , 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


144 


65 


10356 


Y01400 


Homo sapiens 


Secreted orotein encoded bv eene 18 
clone HNHF029. 


94 


54 


10357 


G00333 


Homo saniens 


Human secreted orotein. SEO ID NO: 

A A%AAAAMwA %J %0%0m %<*wv*m U4 v Willi m^ ■m^^ m r ■ ■ ^ » ^ * 

4414. 


153 

* WW 


68 


10358 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7492. 


136 


53 


10359 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


95 


77 


10360 


X61296 


Rattus 
norvegicus 


open reading frame 2 


87 


40 


10361 


L24521 


Homo sapiens 


transformation-related protein 


103 


32 


10362 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


101 


55 


10363 


Y95028 


Homo sapiens 


Human clone vp7 10RF2,SEQID 
NO:128. 


89 


80 


10364 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


101 


58 


10365 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


322 


85 


10366 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


59 


10367 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


125 


53 


10368 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


120 


59 


10369 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


60 


83 


10370 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


169 


81 

• 


10371 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


80 


75 


10372 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


60 


83 


10373 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


89 


77 


103.75 


AF130089 


Homo sapiens 


PRO2550 


139 


78 


10376 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


62 
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SEQ ID 

MA. 

nv. 


Accession No. 


Species 


Description 


Smith- 

Wntrrmon 

Score 


V. 

Irientit 

1UCI1UI 

y 








7874. 






10377 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


96 


51 


10378 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


132 


58 


10379 


W88627 

* 


Homo sapiens 

• 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


117 


71 


10380 


Y01400 


Homo sapiens 
* 


Secreted protein encoded by gene 18 
clone HNHF029. 


125 


54 


10381 


U93569 


Homo sapiens 


putative pl50 


160 


60 


10382 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


140 


77 


10383 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. . 


83 


71 


10384 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


137 


63 


10385 


G00332 

4 


Homo sapiens 


Human secreted protein, SEQ ID NO: . 
4413. 


67 


50 


10386 


X62691 


Homo sapiens 


ribosomal protein homologous to yeast 
S24 


117 


58 


10387 

» 
• 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


141 

• 


45 


10388 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


153 


70 


10389 


U93570 


Homo sapiens 


putative pi 50 


112 


58 


10390 


AF090942 


Homo sapiens 


PRO0657 


126 


68 


10391 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


123 


64 


10392 


AF130089 


Homo sapiens 


PRO2550 


144 


71 


10393 


U40739 


Homo sapiens 


cyclin C 


464 


86 


10394 


G03807 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7888. 


151 


75 


10395 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


137 


55 

r 


10396 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


165 


66 


10397 


AF090930 


Homo sapiens 

* 


PRO0478 


133 


80 


10398 


AK024455 


Homo sapiens 


FLJ00047 protein 


153 


62 


10399 


R96800 


Homo sapiens 

XT 


Human histiocyte-secreted factor HSF. 


142 


69 


10400 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


73 


51 


10401 


AB030033 


Dictyostelium 
discoideum 


AmiB 


82 


33 


10402 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 

* 


113 


63 


10403 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


148 


77 


10404 


AK024455 


Homo sapiens 


FLJ00047 protein 


138 


71 


10405 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


128 


5? 


10406 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561 . 


143 


76 


10407 


AF090942 


Homo sapiens 


PRO0657 


105 


64 


10408 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terrainal, clone XEH.8c} 


93 


54 


10409 


J02621 


Homo sapiens 


high mobility group protein 14 


140 


64 


10410 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


87 


46 


10411 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


56 



411 



WO 01764835 PCT7US01/04927 

* 

i 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO' 




* 

* 




Wfl f PTm on 

" » msi iuhii 












Score 


y 






i 


7879. 






10412 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


79 


66 


■ 






terminal, done XEH.8c} 






10413 


AF130079 


Homo sapiens 


PR02852 


184 


90 


10414 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 


237 


77 








SEQIDNO:220. 


• 




10415 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


122 


64 








gene 44 clone HTDAD22. 


• 




10416 


AF016099 


Mus mus cuius 


endonuclease/reverse transcriptase 


89 


78 


10417 


S80864 


Homo sapiens 


cytochrome c-like polypeptide 


184 


52 


10418 


Y3068I 


Homo sapiens 


Splice variant ZAP-IB protein of the 


131 


75 


• 






human tumor suppressor gene ZAP-1 . 


• 




10419 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


90 




• 




7068. 






10420 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


59 








6588. 






10421 


AF090942 


Homo sapiens 


PRO0657 


J 54 


57 


10423 


L78669 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


106 


90 


10424 


AF095770 


Homo sapiens 

* 


PTH-responsive osteosarcoma Dl 

* 


105 


75 








protein 

* 


- 




10425 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


130 


64 






brucei 


protein rhala. 






10426 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


122 


64 






i 


gene 44 clone HTDAD22. 






10427 


AF068294 


Homo sapiens 


HDCMB45P 


256 


56 


10428 


AF194537 


Homo sapiens 


NAG 13 


182 


54 


10429 


AF090931 


Homo sapiens 


PRO0483 


108 


79 


10430 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


68 








7795. 






10431 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
* 


127 


87 








protein 




. . 


10432 


X67920 


Homo sapiens 


tryptophanyl-tRNA synthetase 


138 


84 


10433 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 


294 


78 








SEQIDNO:220. 






10434 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


127 


87 








protein 






10435 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


75 








6571. 






10436 


S79410 


Mus musculus 


nuclear localization signal binding 


117 


72 




■ 




protein 






10437 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


67 








4658. 




■ 


10438 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 


140 


69 




• 




RHAla(AF078683) 






10439 


AF130089 


Homo sapiens 


PRO2550 


106 


80 


10440 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 


331 


83 








SEQIDNO220. 


. _- 




10441 


W94294 


Homo sapiens 


Human phosphatidylinositol transfer 


143 


96 








protein gamma. 






10442 


AP000616 


Oryza sati va 


similar to RING-H2 finger protein 


121 


87 








RHAla(AF078683) 






10443 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


127 


87 






4 


protein 






10444 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


67 








7519. 






10445 


U93567 


Homo sapiens 


p40 


262 


69 


10446 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


112 


92 








protein 






10447 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


120 


83 



412 
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SEQID 
NO* 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


1 % 
Identit 

y 








protein 






10448 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


104 


68 


10449 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 




80 


10450 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


117 


80 


10451 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


104 


80 


10452 


AF130089 


Homo sapiens 


PRO2550 


315 


68 


10453 


G02530 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6611. 


110 


71 


10454 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


86 


34 


10455 


Y86248 


Homo sapiens 

• 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


113 


85 


10456 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


180 


83 


10457 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


119 


96 


10458 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


133 . 


58 


10459 


Y41710 


Homo sapiens 


Human PR0618 protein sequence. 


894 


71 


10460 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


84 


71 


10461 


K02401 


Homo sapiens 


chorionic somatomammotropin 


471 


88 


10462 


G02113 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6194. 


241 


73 


10463 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


93 


68 


10464 


U15647 


Mus museums 


reverse transcriptase 
* 


110 


47 


10465 


G03790 

i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


97 


57 


10466 


G02754 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6835. 


145 


93 


10467 


G03790 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7871. 


137 


57 


10468 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


62 


55 


10469 


AF095770 


Homo sapiens 

* 


PTH-responsive osteosarcoma Dl 
protein 


93 

■ 


86 


10470 


S58722 

■ 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


119 


63 


10471 


AF068294 


Homo sapiens 


HDCMB45P 


117 


43 


10472 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


335 


72 


30473 


G03790 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7871. 


121 


62 


10474 


X79238 


Homo sapiens 


ribosomal protein L30 


126 


58 


10475 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10476 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


106 


61 


10477 


AC006693 


Caenorhabditis 
elegans 


Hypothetical protein W02H5.e 


119 


41 


10478 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


87 


66 


10479 


AF130079 


Homo sapiens 


PR02852 


97 


54 


10480 


AK024455 


Homo sapiens 


FLJ00047 protein 


135 


69 



413 
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SEQID 


Accession No. 


Species 


Description 

« 


Smith- 

Wflt^rtnon 

TV UlCf lUilU 

Score 


% . 

luCUul 

y 


10481 


K02401 


Homo sapiens 


chorionic somatomammotropin 


463 


80 


10482 


G00398 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4479. 


69 


83 


10483- 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


132 


61 


10484 


R13556 

< 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


119 


80 


10485 


AF095770 


Homo sapiens 

Jr 


PTH-responsive osteosarcoma Dl 
protein 


90 


54 


10486 


U79260 


Homo sapiens 


unknown 


105 


90 


10487 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


243 


53 


10488 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


90 


78 


10489 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


120 


64 


10490 


Y01400 

» 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


123 


60 


10491 


Y48576 


Homo sapiens 


Human breast tumour-associated 
protein 37. 


471 


84 


10492 


G00577 


Homo sapiens 


Human secreted protein. SEO ID NO: 
4658. 


176 


77 

¥ 9 


10493 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


135 


64 


10494 


G02538 


Homo sapiens 


Human secreted protein. SEO ID NO: 
6619. 


123 

1 


44 


10495 


G02538 


Homo saoiens 


Human secreted protein. SEO ID NO: 
6619. 


86 


46 


10496 


Y67470 


Homo sapiens 


Np70 protein carboxv terminal region. 


145 


40 


10497 


AL359782 

A MhMmmm0 9+9 ^ r 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


111 

9* w* m 


50 

90 V 


10498 


S58722 

i 


Homo saoiens 


X- linked retinopathy protein iC- 
terminal, clone XEH.8c} 


96 


73 

V 9W 


10499 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


142 


96 


10500 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


120 


49 


10501 


AF220264 


Homo sapiens 


MOST-1 


93 


83 


10502 


G00437 


Homo sapiens 


Human secreted protein, SEO ID NO: 
4518. 


116 


57 


10503 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


100 


70 


10504 


, AF090931 


Homo sapiens 


PRO0483 


88 


80 


10505 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


89 


72 


10506 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


121 


81 


10507 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


188 


82 


10508 


AF130050 


Homo sapiens 


PRO0872 


101 


74 . 


10509 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


176 


68 


10510 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


120 


96 


10511 


AK026249 


Homo sapiens 


unnamed protein product 


97 


68 


10512 


X95276 


Plasmodium 
falciparum 


ORF91 


75 


29 


10513 


X17093 


Homo sapiens 


leukocyte antigen F 


397 


91 


10514 


W4835 1 Homo sapiens 


Human breast cancer related protein 


115 


66 



414 
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SEQID 
NO: 


Accession No. 


Species 

i 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 

# i 






■ 


BCRB2. 






« 10515 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


154 


68 


10516 

"i 


G03807 


Homo sapiens 

i 


Human secreted protein, SEQ ID NO: 
7888. 


86 


47~ 


10517 


AF090944 


Homo sapiens 


PRO0663 


118 


48 


10518 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone X£H.8c} 


131 


77 


10519 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNOcl03. 


230 


58 


10520 


AF095770 


Homo sapiens 

* * 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10521 


G03807 


Homo sapiens 


Human secreted protein, SEO ID NO: 
7888. 


130 


40 


10522 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10523 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


139 




10524 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma D I 
protein 


125 


100 


10525 


J04495 


Macaca mulatta 


alpha-globin 


144 


90 


10526 


G02535 


Homo saoiens 


Human secreted protein SEO ID NO* 
6616. 


105 


61 


10527 


AFI 30089 


Homo saoiens 


PRO2550 


132 


67 


10528 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO* 1051 


90 


53 


10529 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEO ID NO-220 


112 


75 


10530 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


157 


79 


10531 


G03714 


Homo sapiens 

. y- 


Human secreted protein, SEQ ID NO: 
7795. 


151 


59 


10532 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


121 


63 


10533 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


112 


43 


10534 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


131 


65 


10535 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


95 


60 


10536 


G00328 


Homo sapiens 


Human secreted protein. SEO ID NO: 
4409. 


157 


79 


10537 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


199 

* m* ^ 


90 


10538 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


113 


75 


10539 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


120 


65 


10540 


AF130089 


Homo sapiens 


PRO2550 


195 


67 


10541 


Y60522 


Homo sapiens 


Human normal bladder tissue EST 
encoded protein 194. 


73 


72 


10542 


G01175 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5256. 


83 


100 


10543 


G019S4 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6065. 


81 


54 


10544 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


147 


75 


10545 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


109 


66 
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SEQID 

MA. 


Accession No. 


Species 


Description 


Smith- 

waterman 

Score 


% 

Id en tit 

y 


10546 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


123 


46 


10547 


AF130089 


Homo sapiens 


PRO2550 


133 


63 


10548 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


246 


80 


10549 


G00382 


Homo sapiens 


Human secreted protein,' SEQ ID NO: 
4463. 


122 


51 


10550 


J02459 


bacteriophage 
lambda 


J (tail:host specificity;l 132) 


772 


97 


10551 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


158 


80 

* 


10552 


G00328 


Homo sapiens < 


Human secreted protein. SEO ID NO: 
4409. 


123 


50 

V ^p* 


10553 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


184 


71 


10554 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded bv gene 3. 


107 


60 


10555 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


140 


69 


W5S6 


U49973 


Homo sapiens 


ORF2: function unknown , 


91 


70 


10557 


AF151850 


Homo sapiens 


CGI-92 protein 


459 

pV ✓ 


63 


10558 


X66285 


Mus musculus 


HC1 ORF 


92 

^ pW 


30 


10559 


S58722 

* 


Homo sapiens 


X-linked retinopathy protein {C- 
terminah clone XEH 8c \ 


127 


67 


10560 


AF130050 


Homo sapiens 


PRO0872 


100 


71 


10561 


Y91577 


Homo saoiens 


Human secreted nrotein sentience 

* ALiiXitiJi JWUI wlyu l/4vl>Wui 0VU UVUvV 

encoded bv gene 2 SEO ID NO250. 


335 


80 


10562 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


126 


58 


10563 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


127 


76 


10564 


G00332 


Homo saoiens 


Human secreted protein SEO ID NO* 
4413. 


158 


62 


10565 


G00637 


Homo saoiens 


Human secreted protein. SEO ID NO' 
4718. 


118 


77 


10566 

M \S V.* 


V00488 


Homo sapiens 


aloha globin 


464 


89 


10567 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


162 


81 


10568 


S58722 


Homo sapiens . 


X-linked retinopathy protein {C- 
terminal, clone XEH. 8c) 


113 


72 


10569 


G03040 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7121. 


127 


47 


10570 


G03807 


Homo sapiens 


Human secreted protein, SEO ID NO: 
7888. 


125 


65 


10571 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


79 


88 

^p^ *pT 


10572 


Y02693 


Homo sapiens 


Human secreted protein encoded bv 
gene 44 clone HTDAD22. 


158 


80 


10573 


Y27854 


Homo sapiens 


Human secreted protein encoded bY 
gene No. 101. 


138 


56 


10574 


AF 119900 


Homo saoiens 


PR02822 


154 


55 


10575 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


87 


72 


10576 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


131 


66 


10577 


G03112 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


80 


47 


10578 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


121 


53 


10579 


Y91577 


Homo sapiens 


Human secreted protein sequence 


335 


83 
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_ * 

i 



SEQID 
NO: 


'Accession No. 


Species 

■ 


Description 


Sraith- 

Waterman 

Score 


% 
y 








encoded by gene 2 SEQ ID NO:250. 






10580 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


104 


47 


10581 


U49973 


Homo sapiens 


ORFl;MER37; putative transposase 
similar to pogo element 


324 


68 


10582 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


129 


66 


10583 


AF090895 


Homo sapiens 


PRO0117 


137 


63 


10584 


L27428 


Homo sapiens 


reverse transcriptase 


98 


50 


10585 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4499. 


101 


68 


10586 


G00382 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4463. 


108 

- 


82 

• 


10587 


G03438 

» 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


166 


76 


10588 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


100 


70 


10589 


X03145 


Homo sapiens 


pot ORF I 

MY 


120 


43 


10590 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


139 


70 


10591 


AF 144054 


Homo sapiens 


apoptosis related protein APR-4 


92 


50 


10592 


AK024455 


Homo sapiens 


FLJ00047 protein 


111 


75 


10593 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


143 


58 


10594 


G03636 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7717. 


116 


54 


10595 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


196 


57 


10596 


AF 130089 


Homo sapiens 


PRO2550 


140 


66 


10597 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


115 


52 


10598 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


79 


71 


10599 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


146 


79 


10600 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


119 


67 


10601 


AF090895 


Homo sapiens 


PRO0117 


106 


68 


10602 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


83 


72 


10603 


AF090930 


Homo sapiens 


PRO0478 


119 


70 


10604 


AF095770 


Homo sapiens 


FTH-responsive osteosarcoma Dl 
protein 


120 


83 


30605 


AF 132949 


Homo sapiens 


CGI- 15 protein 


114 


96 


10606 


Y95028 


Homo sapiens 


Human clone vp7 10RF2,SEQID 
NO: 128. 


176 


77 


10607 


G00328 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4409. 


155 


74 


10608 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


127 


63 


10609 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


88 


60 


10610 


AF 130089 


Homo sapiens 


PRO2550 


86 


90 


10611 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


149 


72 


10612 


L27428 


Homo sapiens 


reverse transcriptase 


311 


52 


10613 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


83 


84 


10634 


D00570 


Mus museums 


open reading frame (196 AA) 


116 


69 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


10615 


Y86573 


Homo sapiens 


Human gene 9 1 -encoded protein 
fragment, SEQ ID NO:490. 


290 


62 


10616 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


125 


51 


10617 


AF1 16695 


Homo sapiens 


PR02221 


131 


56 


10618 


M64793 

* 


Rattus 
norvegicus 


salivary proline-rich protein 


101 


48 


10619 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


104 


65 


10620 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


112 


58 


10621 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


336 


85 


10622 


AF 118082 


Homo sapiens 


PRO 1902 


80 


73 


10623 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


136 


50 


10624 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


109 


67 


10625 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


92 


75 


10626 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


121 


62 

. 


10627 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


104 


52 


10628 

t 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


137 


66 


10629 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


1?9 


67 


10630 


U93569 


Homo sapiens 


putative p!50 


104 


52 


10631 


AF090895 


Homo sapiens 


PRO0117 


136 


70 


10632 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


89 


57 


10633 


AF1 13685 


Homo sapiens 


PRO0974 


130 


64 


10634 


AK024455 


Homo sapiens 


FLJ00047 protein 


103 


63 


10635 


M24732 


Homo sapiens 


lamin-like protein 


80 


65 


10636 


U49974 


Homo sapiens 


mariner transposase 


116 


72 


10637 


AKQ24455 


Homo sapiens 


FLJ00047 protein 


158 


67 


10638 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


121 

■ 


64 


10639 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


137 


57 


10640 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


123 


72 


10641 


L27428 


Homo sapiens 


reverse transcriptase 


150 


54 


10642 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


68 


81 


10643 


U93565 


Homo sapiens 


putative pi 50 


282 


57 


10644 


AF090942 


Homo sapiens 


PRO0657 


153 


74 


10645 


Y48333 


Homo sapiens 


Human prostate cancer-associated 
protein 30. 


196 


90 


10646 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


242 


73 


10647 


U79260 


Homo sapiens 


unknown 


95 


77 


10648 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


149 


64 


10649 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


158 


87 


10650 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


339 


89 
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SEQID 
NO: 


'Accession No. 


I Species 


Description 


Smith- 

Waterman 

Score 


V. 

Identir 

y 


10651 


U15183 


Mycobacterium 
leprae 


proline-rich antigen 


102 


37 


10652 


AF2 17536 


Homo sapiens 


truncated mevalonate kinase 


138 


70 


10653 


AF 119855 


Homo sapiens 


PR01847 


74 


100 


10654 


M15530 


Homo sapiens 


B-cell growth factor 


94 


52 


10655 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


144 


54 


10656 


Ml 53 17 


Plasmodium 
lophurae 


histidine-rich protein 


124 


51 


10657 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NOf 
6613. 


143 


66 


10658 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


114 


77 


10659 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


172 

r 


55 


10660 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


150 


78 


10661 


G00328 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10662 


G00328 , 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
4409. 


151 


71 


10663 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10664 


AK000496 


Homo sapiens 


unnamed protein product 


140 


77 


10665 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


147 


75 


.10666 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10667 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


149 


78 


10668 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


303 


78 


10669 


D00526 


Rattus 
norvegicus 


L-gulono-gamma-lactone oxidase 


108 


73 


10670 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


143 


65 


10671 

■ 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


409 


84 


10672 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


110 


65 


10673 


G00423 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4504. 


123 


43 


10674 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


85 


36 


10675 


U93565 


Homo sapiens 


putative pi 50 


115 


45 


10676 


AF109907 


Homo sapiens 


S164 


205 


65 


10677 


G02620 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6701. 


118 


73 


10678 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


106 


62 


10679 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10680 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


248 


54 


10681 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


88 


52 


10682 


S58722 


Homo sapiens 


X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 


126 


77 


10683 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


123 


72 
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SEQ ID 

NO. 


Accession No. 


Species 


Description 


Smith- 
Waterman ' 
Score 


0/ 

% 

Idendt 

y 








terminal, clone XEH.8c} 






10684 


S58722 


Homo sapiens 

* 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


123 


72 


10685 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. j 


99 ~ 


40 


10686 


AF068294 


Homo sapiens 


HDCMB45P 


182 


54 


10687 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


104 


53 


10688 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


277 


76 


10689 


G00233 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4314. 


118 


65 


10690 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


143 


71 


10691 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


97 


81 


10692 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


160 


68 . 


10693 


AF130O89 


Homo sapiens 


PRO2550 


133 


56 


10694 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


129 


71 


10695 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


123 


71 


10696 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


161 


45 


10697 


G00648 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4729. 


58 

ii 


56 


10698 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


75 


71 


10699 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4598. 


92 


38 


10700 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


132 


55 


10701 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


118 


54 


10702 


G03411 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7492. 


66 


73 


10703 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


124 


44 


10704 


G00357 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4438. 


113 


78 


10705 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


119 


50 


10706 


AK025116 


Homo sapiens 


unnamed protein product 


160 


84 


10707 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10708 


G00328 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 

* 


10709 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


230 


76 


10710 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


96 


45 


10711 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


174 


70 


10712 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


127 


50 


10713 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


148 


45 


10714 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


151 


80 
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* * • 

i. - 



SEQ ID 


Accession No. 


Species 


Description 

■ 


Smith- 










Wat enaan 


Identit 










Score 


y 


10715 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


67 


i 






4497. 






10716 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


109 


68 

m mW W 








encoded by gene' 12 1 . 






10717 


Y 14480 


Homo sapiens 

M.mtr+m,^imw » H rw.amA«* ^ a .a haT 


Fragment of human secreted protein 


151 

4 mmm A 


52 

-aV 








encoded by gene 14. 






10718 


G00637 


Homo sapiens 


Human secreted protein, SEO ID NO: 


84 


80 








4718. 






10719 


M19973 


Rattus 


cytochrome PB24 


104 


45 

• »4-T" 






norvegicus 


t 






10720 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


103 


72 

» 4** 






major 


DNA, complete virion genome 






10721 


W48351 


Homo sapiens 


Human breast cancer related protein 


161 

A V A 


52 

****** 








BCRB2. 

• 






10722 


L27428 


Homo sapiens 


reverse transcriptase 


159 


42 


10723 


S58722 


Homo sapiens 


X- linked retinopathy protein (C- 


127 


75 








terminal, clone XEH. 8c} 






10724 


Y00311 


Homo sapiens 


Human secreted protein encoded bv 


84 


50 




* 




gene 54. 






10725 


AF0908$5 


Homo sapiens 


PRO0117 


170 


71 


10726 


Y 14480 


Homo sapiens 

A l>V4'4V N4**j4*44WA*4^AAAl T 


Fraement of human secreted protein 


105 


42 








encoded by gene 14. 






10727 

A ""tr 9 mm 9 


K03205 

A *■* \J mw A*t **•* v 


Homo sapiens 


salivarv oroline-rich nrotein orecursor 


123 

4V mm*** 


38 ~i 

■ mm* W 


10728 


U93563 


Homo sapiens 


putative pISO 


162 

41 *4i#A* 


64 


10729 ~" 

A V 9 AmtmW 


X14576 


Murine leukemia 


ffft? "fiisioTi orotein 


172 


41 

~4 






vims 

V AA Ws/ 








10730 


AF090895 


Homo sapiens 


PRO0117 


151 


60 


10731 

* V 9 a 


AF119851 


Homo saniens 


PRO 1722 


127 


54 


10732 


G00577 


Ufimn sartiens 


MiiTYiati oecneted tirotf»in RPO Tl~i NO* 

XXUU1CUA OvwlVMAJ yiUlCUl^ MXJ 1 ~ \-f • 


157 


61 








4658 






10733 


G00442 


Homo osnien^ 


Human oecreted nrotetn SFO ID NO* 


156 










4523 






10734 


G03438 


Hnmn sanieriQ 


Human ^ecret^l nrotein SEO TO NO* 


135 


73 








7519 






10735 


G03789 


Homo saniens 


Human oecreted nrotein SFO ID NO* 


151 

4 a* 4 


65 








7870. 






10736 


G00442 


Homo sapiens 


Human secreted protein SEO ID NO" 


141 

4V »* 4\ 


75 








4523. 






10737 


G03790 


Homo sapiens 


Human secreted protein. SEO ID NO: 


122 


74 








7871. 






10738 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


124 


69 








isoform 






10739 


G00368 


Homo sapiens 


Human secreted protein. SEO ID NO: 


91 


60 








4449. 






10740 


AF 13 0051 


Homo sapiens 

• * w*4*V aA#i/*Vli4# 


PRO0898 

A A "4>*44^ •%4T VrnWrnT "4j4> 


90 

■IT W 


63 


10741 


AF090895 


Homo sapiens 


PRO0117 


129 


55 


10742 


AF 11 9900 


Homo sapiens 


PR02822 

A A A* VAtA" 


164 


85 


10743 


AJ005560 


Mus musculus 


SPR2B protein 


88 

*4|^ ****' 


35 

*4* t 


10744 


Z79996 


Homo saoiens 


cB33F2 I (PUTATIVE novel protein 

vi-/ J X 4>« L 11 ^/ A lA A> X V 4W Uv f ¥l UlvvvlU 


185 


100 








similar to C-termtnal parts of APOL 

*■» H*l 1 *i**A*4 W m m^ " ^» ** v4 444 444144 *4r <4AB M# A * AA ^mmJ 








■ 


• 


(apolipoprotein L) and TNF-inducible 




■ 








protein CG12-1) 






10745 


G03140 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


88 • 








7221. 






10746 


X82208 


Homo sapiens . 


beta-centractin 


134 


96 


10747 


D49744 


Mus musculus 


farnesyltransferase alpha subunit 


228 


60 


10748 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


50 








7492. 






10749 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


62 1 
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4 

I 



« 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
y 








4523. 






10750 


AF 130089 


Homo sapiens 

• 


PRO2550 


147 


62 


10751 


G00403 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4484. 


108 


71 


10752 


D 17652 


Homo sapiens 

* 


HBpl5/L22 


416 


72 


10753 


AB002389 


Homo sapiens 


KIAA0391 


486 


86 


10754 


G03093 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7174. 


238 


73 


10755 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


103 


57 


10756 


AF 130089 


Homo sapiens 


PRO2550 


104 


72 


10757 


X75421 


Hypocrea 
jecorina 


actin 


197 


66 


10758 


AF298594 


Nicotiana alata 


arabinogalactan protein 


107 


30 


10759 


U94832 


Homo sapiens 


KSRP 


95 


40 


10760 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


179 


82 




I OWJ/J 


i 


Unman npnP 01 —aii/twIaH niVkfAin 

jiuiziaD gene 7i "ciicucicu pruicin 
fragment SFO IF) NfV49f) 


771 

All 


Ol/ 


I 10762 


781528 

. _ i 


llcUJU 1 Lid 

eleeans 


UlwulwlvU UDlllK VJ vllvlJl luvl ^\*\Jll \Ol HO 

similarity to Pfem domain* PF009^Q 
f Sodium "sulfate svmoorter 
transmembrane region), Score=2.8, B- 
value=l;2, N=l: PF01757 (Domain of 
unknown function), Score=556.0, E~ 
value=8.1e-164,N-l 


103 

1 VJ 

■ 


27 


10763 


V00662 


Homo sapiens 


ATPase 6 


593 


83 


10764 


A06977 


Homo sapiens 


albumin 


610 


81 


10765 


X67863 


Mus musculus 


T2 


109 


46 


10766 


AF 137030 


Homo sapiens 


transmembrane protein 2 


620 


96 


10767 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


101 


66 


10768 


X15324 


Homo sapiens 


angiotensinogen 


529 


89 


10769 


M64983 


Homo sapiens 


fibrinogen beta chain 


604 


87 


10770 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


381 


83 


10771 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


133 


73 


10772 


M64110 


Homo sapiens 


caldesmon 


346 


94 


10773 


AB014566 


Homo sapiens 


KIAA0666 protein 


145 


78 


10774 


AL138810 


Homo sapiens 


dJ 179L 1 0.2 (Similar to CGI-29 
protein) 


216 


90 


10775 


AJ223953 


Homo sapiens 


hPTTG 


353 


82 


10776 


U92288 


Human 
herpesvirus 6 


HN1 


113 


39 


10777 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


143 


78 


10778 


X12796 


Bostaurus 


HMG1 protein (AA 1-215) 


435 


96 


10779 


U86782 


Homo sapiens 


26S proteasome-associated padl 
homolog 


477 


89 


10780 


M60047 


Homo sapiens 

* 


heparin binding protein 


514 


81 


10781 


Y14480 

* 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


99 


48 


10782 


AF130079 


Homo sapiens 


PRD2852 


175 


48 


10783 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


115 


64 


10784 


Ml 1147 


Homo sapiens 


ferritin light chain 


439 


83 


10785 


AF090931 


Homo sapiens 


PRO0483 


145 


55 


10786 


AF072441 


Homo sapiens 


calcineurin binding protein cabin 1 


498 


90 


10787 


AF1 16715 


Homo sapiens 


PR02829 


127 


88 



422 



* 
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SEQID 
NO: 


Accession No. 


Spedes 1 


Description 


Sraitb- 
Waterman ' 
Score 


Idea tit 

y 


10788 


AF201949 


Homo sapiens 


60S ribosomal protein 130 isolog 


631 


91 


10789 


AC005253 


Homo sapiens 


R26445 1 


403 


84 


10790 


M15386 


Homo sapiens 


gamma-globin 


617 


82 


10791 


Y95005 


Homo sapiens 


Human secreted protein vc57_l, SEQ 
IDNO:50. 


716 


95 — 


10792 


G01285 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5366. 


165 


52 


10793 


X67863 


Mus musculus 


T2 


128 


50 


10794 


Y82326 


Homo sapiens 


Human arginase I SEQ ID NO: 17. 


507 


80 


10795 

■ 


Y00755 


Homo sapiens 


extracellular matrix protein BM-40 
(AA 1 - 303) 


559 


87 


10796 


AF1 19851 


Homo sapiens 


PR01722 


117 


71 


10797 


AF090930 


Homo sapiens 


PRO0478 


146 


63 


10798 


V00488 


Homo sapiens 


alpha globin 


509 


84 


10799 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


131 


42 


10800 


X83618 


Homo sapiens 


hydroxymethylglutaryl-CoA synthase 


422 


78 


10801 


AF 132952 


Homo sapiens 


CGI-1 8 protein 


341 


88 


10802 


Ml 7375 


Gallus gallus 


type XII collagen 


88 


42 


10803 


K02401 


Homo sapiens 


chorionic somatomammotropin 


436 


85 


10804 


K02401 


Homo sapiens 


chorionic somatomammotropin 


410 


86 


10805 


U61232 


Homo sapiens 


cofactor E 


359 


77 


10806 


D86438 


Homo sapiens 
* 


Ibar (ionized calcium binding adapter 
molecule 1) 


406 


77 


10807 


AF 11 8082 


Homo sapiens 


PRO1902 


105 


38 


10808 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


60 


10809 


AF090942 


Homo sapiens 


PRO0657 


163 


52 


10810 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


116 


86 


10811 


G00637 


Homo sapiens 


Human secreted protein, SEQID NO: 
4718. 


104 


45 


10812 


X61296 


Rattus 
norvegicus 


open reading frame 2 


156 


41 


10813 


U82303 


Homo sapiens 


unknown 


103 


64 


10814 


G01478 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5559. 


75 


77 


10815 


U52077 


Homo sapiens 


mariner transposase 


131 


85 


10816 


D381 12 


Homo sapiens 


ATPase subunit 6 


251 


74 


10817 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


103 


80 


10818 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


102 


52 


10819 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


94 


66 


10820 


Y21811 


Homo sapiens 


CPF polypeptide 36PRO. 


200 


62 


10821 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


161 


60 


10822 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


163 


70 


10823 


U22961 


Homo sapiens 


similar to human albumin, Svyriss-Prot 
Accession Number P02768; Method 
conceptual translation supplied by 
author 


337 


63 


10824 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


122 


61 


10825 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


122 


68 


10826 


J03634 


Homo sapiens 


erythroid differentiation protein 
precursor 


222 


68 
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sEon> 

NO: 


^Accession No. 


Soccics 

i 


Desc fiction 


Smith- 

Waterman 

Score 


V. 

9 W 

Identit 

y 


10827 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 5 


452 


78 


10828 


V00488 


Homo sapiens 


alpha globin 


132 


86 


10829 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. — 


131 


70 


10830 


AF090942 


Homo sapiens 


PRO0657 


122 


43 


10831 


AF090931 


Homo sapiens 


PRO0483 


141 


72 


10832 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


138 




10833 

■ 


G03806 

- 


Homo sapiens 

. 


Human secreted protein, SEQ ID NO: 
7887. 


88 


57 


10834 


AF1 18082 


Homo sapiens 


PRO1902 


109 


62 


10835 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


135 


44 


10836 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


152 

■ 


86 


10837 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


143 


47 


10838 


U52077 


Homo sapiens 


mariner transposase 


259 


67 


10839 


M10546 


Homo sapiens 


cytochrome oxidase I 


341 


83 


10840 


Y44361 


Homo sapiens 


Human cell cycle regulation protein-2. 


144 


60 


10841 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


96 


56 


10842 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


565 


80 


10843 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


118 


100 


10844 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


157 


81 


10845 


AF014889 


Homo sapiens 


NADH dehydrogenase subunit 2 


416 


85 


10846 


AF1 94537 


Homo sapiens 


NAG 13 


177 


70 


10847 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


108 


40 


10848 


Ml 5530 


Homo sapiens 


B-cell growth factor 


92 


67 


10849 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


98 


52 


10850 


AP000616 


Oryza sativa 

■ 


similar to RING-H2 finger protein 
RHAla(AF078683) 


121 

. ■ 


76 


10851 


AF1 16712 


Homo sapiens 


PR02738 


84 


57 


10852 


V00662 


Homo sapiens 


ATTase6 


278 


84 


10853 


U01849 


Trypanosoma 
brucei 


ORF1 


93 


37 


10854 


Y02749 

■ 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


93 


61 


10855 


AF130089 


Homo sapiens 


PRO25S0 


126 


72 


10856 


AF 130079 


Homo sapiens 


PR02852 


154 


75 


10857 


AK025116 


Homo sapiens 


unnamed protein product 


113 


70 


10858 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


114 


71 


10859 


G03703 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7784. 


97 


56 


10860 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


207 


78 


10861 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


83 


50 


10862 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


126 


46 


10863 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


63 


10864 


B01372 


Homo sapiens 


Neuron-associated protein. 


106 


38 


10865 


U38964 


Homo sapiens 


hPMSR2 


152 


68 


10866 


AF1 19855 


Homo sapiens 


PR01847 


142 


67 
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NO: 
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opccies 


T\ ac m+i An 

JLJcJCnpiJOD 


CmItH-> 
OJUIUf** 

Waterman 
Score 


Ideatit 

y 


10867 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


99 


75 


10868 


U79260 


Homo sapiens 


unknown 


94 


47 


10869 


R07057 — 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


151 


52 


10870 


A JO 10025 


Homo sapiens 


unr-interacting protein 


100 


64 


10871 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


110 


44 


10872 

■ 


G03798 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


136 


71 


10873 


Y02749 


Homo sapiens 

* 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


126 


61 


10874 


AF1 30089 


Homo sapiens 


PRO2550 


115 


88 


10875 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


114 


59 


10876 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


83 


48 


10877 


AF1 18086 


Homo sapiens 


PRO 1992 


145 


82 


10878 


AF1 16712 


Homo sapiens 


PR02738 


110 


55 


10879 


G03172 , 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


123 • 


77 


10880 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


HI 


61 


10881 


Y87064 


Homo sapiens 

• 


Human secreted protein sequence SEQ 
ID NO: 103. 


106 


51 


10882 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


107 


67 


10883 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


101 


65 


10884 


U79260 


Homo sapiens 


unknown 


111 


56 


10885 


W88627 


Homo sapiens. 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


153 


60 


10886 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


107 


57 


10887 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


148 


80 


10888 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


142 


58 


10889 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


156 


67 


10890 


G00328 


Homo sapiens 

• ■ 


Human secreted protein, SEQ ID NO: 
4409. 


105 


52 

• 


10891 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


187 . 


65 


10892 


A67510 


Mus musculus 

- 


MUS MUSCULUS GENOMIC DNA 
CONTAINING N ALLELE OF FV1 
GENE. 


120 


37 


10893 


AJ223410 


Homo sapiens 


EBIl-ligand chemokine 


174 


77 


10894 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


475 


83 


10895 


V00662 


Homo sapiens 


ATPase 6 


316 


95 


10896 


V00662 


Homo sapiens 


ATPase6 


427 


82 


10897 


U09500 


Homo sapiens 


cytochrome b i 


403 


92 


10898 


V93564 


Homo sapiens 


p40 


215 


91 


10899 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


69 


77 


10900 


G02538 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: t 
6619. 


159 


60 


10901 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


93 


43 
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opecies 


Description 


oniitQ- 

Watcnnan 

Score 


Identtt 

y 


10902 


M10546 


Homo sapiens 


cytochrome oxidase" I 


302 


66 


10903 


G00447 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4528. 


102 


58 


10904 


AK024455 


Homo sapiens 


FLJ00047 protein 


118 


74 


10905 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


108 


64 


10906 


AF090942 


Homo sapiens 


PRO0657 . 

i 


149 


74 


10907 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


167 


91 


10908 


D38112 


Homo sapiens 


ATPase subunit 6 


355 


68 


10909 

• 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


145 


72 


10910 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


114 


72 


10911 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


144 


53 


10912 


V00662 


Homo sapiens 


ATPase 6 


289 


69 


10913 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


93 


51 


10914 


V00662 


Homo sapiens 


cytochrome oxidase III 


509 


86 


10915 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


459 


80 


10916 


Z28029 


Saccharomyces 
cerevisiae 


ORF YKL030W 


96 


32 


10917 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7183. 


94 


56 


10918 


R13556 


Homo sapiens 


Protein encoded downstream of hhcjvl 
oncoprotein. 


114 


36 


10919 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


in 


88 


10920 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


103 


39 


10921 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


99 


95 


10922 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


75 


63 


10923 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


125 


85 


10924 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


140 


73 


10925 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


77 


38 


10926 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


111 


58 


10927 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


139 


75 


10928 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


128 


78 


10929 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


77 


57 


10930 


K02576 


Homo sapiens 


salivary pro line-rich protein 1 


93 


53 


10931 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


83 


54 


10932 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


262 


84 


10933 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


113 


72 


10934 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


120 


62 


10935 


AP000616 

• 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


116 


69 
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SEQID 
NO: 


Accession No. 


Species 

• 


Description 


Smith- 
Water man 
Score 


% 

Wen tit 

y 


10936 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


120 


56 


10937 


AB028893 


Homo sapiens 


ribosomal protein SI 1 


279 


53 


10939 


Y27868 


Homo sapiens 

- 


Human secreted protein encoded by 
gene No. 107. 


127 


55 


10940 


AF 11 9900 


Homo sapiens 


PR02822 


154 


53 


10941 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


127 


62 


10942 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


109 


41 


10943 

• 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 

■ 


61 


10944 


AF1 16715 


Homo sapiens 


PR02829 


119 


69 


10945 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


303 


95 


10946 


L27428 


Homo sapiens 


reverse transcriptase 


112 


51 


10947 


AP000616 

• 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


118 


90 


10948 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


121 


83 


10949 


AF116715 


Homo sapiens 


PR02829 


134 


67 


10950 


S58722 


Homo sapiens 


X-linxed retinopathy protein {C- 
terminal, clone XEH.8c} 


139 


83 


10952 


AF130079 


Homo sapiens 


PR02852 


119 


79 


10953 


Z38128 


Mus musculus 


bistone HI 


126 


35 


10954 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


135 


53 


10955 


L22029 


Glycine max 


hydroxyproline-rich glycoprotein 


126 


37 


10956 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4581. 


81 


69 


10957 


AF 118082 


Homo sapiens 


PRO1902 


90 


51 


10958 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


116 


53 


10959 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


82 


51 


10960 


U79260 


Homo sapiens 


unknown 


97 


52 


10961 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


194 


66 


10962 


W48351 . 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


124 


48 


10963 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7165. 


66 


73 


10964 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


87 


10965 


Z28029 


Saccharomyces 
cerevisiae 


ORFYKL030W 


99 


70 


10966 


V00672 


Pan troglodytes 


reading frame protein 5 


107 


70 


10967 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


138 


58 


10968 


K0240I 


Homo sapiens 


chorionic somatomammotropin 

* - - - — ■■ ■ 


478 


77 


10969 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


642 


86 


10970 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


156 


57 

■ 


10971 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


126 


56 


10972 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


103 


51 


10973 


K02401 


Homo sapiens 


chorionic somatomammotropin 


491 


82 


10974 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


146 


93 


10975 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


236 


85 


10976 


Y02785 


Homo sapiens 


Human secreted protein encoded by 


108 


50 
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¥i**^ ri nti n n 
i/cstnpuun 


OIUI 1 11- 

Waterraan 
Score 


Identit 

y 








gene 51 clone HUKEX85. 






10977 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


435 


73 


10978 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


119 


74 


1097? 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


137 


73 


10980 . 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


133 


63 


10981 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


90 


65 


10982 


X92485 


Plasmodium 
vivax 


pval 


119 


47 


10983 


Y15913 


Homo sapiens 


COL1A1 and PDGFB fusion transcript 


64 


47 


10984 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


463 


77 


10985 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


309 


65 


10986 


Y86248 

♦ 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


145 


87 


10987 


AF130089 


Homo sapiens 


PRO2550 


88 


31 


10988 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


127 


65 


10989 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


137 


75 


10990 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


76 


10991 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


M8 

»> 


72 


10992 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 

* • 


69 


10993 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


120 


92 


10994 


AJ277092 


Oryctolagus 
cuniculus 


translationally controlled tumor protein 
3 


172 


49 


10995 


AF052831 

• 


Trypanosoma 
cruzi 


unknown 

• 


92 


37 


10996 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


119 


53 


10997 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


123 


43 


10998 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


165 


52 


10999 


S52010 


Mus sp. 


orfl 5* of EpoR 


93 


39 


11000 


AF130079 


Homo sapiens 


PR02852 


131 


81 


11001 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


111 


75 


11002 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


132 


83 


11003 


G03790 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7871. 


151 


60 


11004 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


159 


58 


11005 


K02401 


Homo sapiens 


chorionic somatomammotropin 


579 


89 


11006 


K02401 


Homo sapiens 


chorionic somatomammotropin 


464 


87 


11007 


M15894 


Homo sapiens 


chorionic somatomammotropin 
precursor 


370 


74 


11008 


K02401 


Homo sapiens 


chorionic somatomammotropin 


467 


88 


11009 


K02401 


Homo sapiens 


chorionic somatomammotropin 


537 


91 


11010 


K02401 


Homo sapiens 


chorionic somatomammotropin 


475 


87 


11011 


K02401 


Homo sapiens 


chorionic somatomammotropin 


460 


88 
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Accession INO. 






S mi th- 
ou! lUt- 

Waterman 
Score 


Idcntit 

y 


11012 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7720. 


101 


61 


11013 


AF266166 


Gillichthys 
mirabilis 


r ibosomal protein S 1 3 

» 


223 


54 


H014 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


176 


61 


11015 


AF1 16715 


Homo sapiens 


PR02829 


121 


69 


11016 


S58722 

■ 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


92 


75 


11017 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


113 


78 


11018 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


114 


40 


11019 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


48 


11020 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


151 


51 


11021 


X14963 


Homo sapiens 


collagen-like protein (447 AA) 


107 


47 


11022 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


72 


11023 


AF116715 


Homo sapiens 


PR02829 


140 


75 


11024 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


70 


11025 


AP000001 


Pyrococcus 
horikoshii 


58aa long hypothetical protein 

• 


99 


52 


11026 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


601 


81 


11027 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


434 


74 


11028 


Z97333 


Homo sapiens 


RHCE protein 


149 


51 


11029 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


113 


48 


11030 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


120 


77 


11031 


U61234 


Homo sapiens 


cofactor C 


226 


63 


11032 


M10546 


Homo sapiens 


cytochrome oxidase I 


116 


88 


11033 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


584 


93 


11034 


L10910 


Homo sapiens 


splicing factor 


108 


58 


11035 


M26361 


Mus musculus 


LINE/Ig H-chain fusion protein 


140 


48 


11036 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


79 


11037 


AK024455 


Homo sapiens 


FLJ00047 protein 


122 


48 


11038 


AF126163 


Homo sapiens 


HHLA3 protein 


96 


68 


11039 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


144 


79 


11040 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


62 


11041 


L06498 


Homo sapiens 


ribosomal protein S20 


321 


73 


11042 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


123 


84 


11043 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


125 


68 


11044 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


208 


91 


11045 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4633. 


79 


62 


11046 


D38112 


Homo sapiens 


ATPase subunit 6 


297 


90 


11047 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


416 


73 


11048 


G00491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4572. 


73 


80 


11049 


AF090944 


Homo sapiens 


PRO0663 


96 


70 


11050 


AF130089 


Homo sapiens 


PRO2550 


152 


71 


11051 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


108 


81 
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Species 


Description 


Smith- 

TT HICl mun 

Score 


Vo 

la e nut 

y 








gene 22 clone HMSJW18. 






11052 


W89031 


Homo sapiens . 

* 


Polypeptide fragment encoded by gene 
167. 


254 

„ 


95 


11053 


AF1 09907 


Homo sapiens 


S164 


136. 


45 


11054 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


117 


58 


11055 


X72004 


Halichoems 
grypus 


ATP synthase subunit 6 

♦ 


160 


78 


11056 


V00662 


Homo sapiens 

* 


ATPase 6 


300 j 


71 


11057 


AF228021 


Bos taurus 


cyclophilin I 


209 


73 


11058 


L27428 


Homo sapiens 


reverse transcriptase. 


303 


65 


11059 


U09116 


Homo sapiens 


ORF1, encodes a 40 kDa product 


464 


80 


,11060 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


145 


68 


11061 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


301 


85 


11062 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


119 


64 


11063 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


128 


76 


11064 


U50188 


Cavia oorcellus 


CGRP-recentor comDonent orotein 


152 


88 


11065 


G03107 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7188 


104 


44 

♦ 


1 1066 


G00397 


Homo ^aniens 


Human secreted orotein SEO ID NO* 
4478. 


112 

X x 


64 


11067 


AP000616 


Oryza saliva 


similar to RING-H2 finger protein 
RHAla (AF078683} 

■A. VJ. lift AW »X V 9 WW*^ / 


113 


71 


11068 


AE003499 


Drosophila 
melanoff aster 


CG 12706 gene product 


145 

r 


43 


11069 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEOIDNO:163. 


160 


65 


11070 


D38112 


Homo saoiens 


cytochrome c oxidase subunit 1 


416 


67 


11071 


D38112 


Homo saoiens 


cytochrome c oxidase subunit 3 


220 


86 


11072 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


144 


63 


11073 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


115 


60 

• 


11074 


G03787 


Homo saoiens 

A 4VAMV 


Human secreted orotein. SEO ID NO: 
7868. 


117 


65 


11075 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


589 


87 


11076 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


588 


83 


11077 


S58722 


Homo sapiens 


X-linked retinopathy orotein IC- 
terminal, clone XEH.8c} 


106 


86 


11078 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNOU63. 


90 

■ - 


50 


11079 


AF 132200 


Homo sapiens 


PR01751 


97 


89 


11080 


AF1 16661 


Homo sapiens 


PR01438 


109 


51 


11081 


AK024455 


Homo sapiens 

r 


FU00047 protein 


93 


48 


11082 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


84 


73 


11083 


J02963 


Homo sapiens 


platelet glycoprotein Ob precursor 


108 


80 


11084 


AF068294 


Homo sapiens 


HDCMB45P 


229 


59 


11085 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


137 


73 


11086 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6 8, 
SEQ ID NO: 163. 


105 


49 


11087 


L26953 


Homo sapiens 


chromosomal protein 


122 


75 


11088 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


103 


46 


11089 


S58722 


Homo sapiens 


X-linked retinopathy protein {O 
terminal, clone XEH.8c} 


112 


56 


11090 


AF130089 


Homo sapiens 


PRO2550 


172 


60 


[no9i 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


107 


35 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
y 


11092 


AF090930 


Homo sapiens 


PRO0478 • 


146 


75 


11093 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


163 


82 


11094 


V00662 


Homo sapiens 


ATPase 6 


130 


87 


11095 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


122 


82 ™ 


11096 


D38112 


Homo sapiens 


ATPase subunit 6 


152 


62 


11097 


U18339 


Variola virus 


D4L 


96 


60 


11098 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


113 


60 


11099 


AF 130089 


Homo sapiens 


PRO2550 


128 


75 


11100 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


218 


62 


11101 


U49973 


Homo sapiens 


ORF 1 ; MER37; .putative transposase 
similar to pogo element 


232 


79 


11102 


AF090930 


Homo sapiens 


PRO0478 


154 


73 


11103 - 


G03807 


Homo sapiens 


Human secreted protein, SEO ID NO: 
7888. 


133 


65 


11104 


K03036 


Mus musculus 


alpha- 1 type I procollagen , 


105 


43 


11105 


G04072 


Homo sapiens 


Human secreted protein, SEO ID NO: 
8153. 


142 


5* 


11106 


L13129 , 


Mus musculus 


synexin 


119 


35 


11107 


AF280111 


Homo sapiens 


cytochrome P450 subfamily IIIA 
polypeptide 43 


122 


52 


11108 


J00314 


Homo sapiens 


beta-tubulin 


606 


94 


11109 


X05196 


Homo sapiens 


aldolase C 


412 


89 


11110 


D00570 


Mus musculus 


open reading frame (196 AA) 

■ m «WWM4^ am wilt \ * 4 M ^ # 


127 


85 


11111 


D50532 


Homo sapiens 


macrophage lectin 2 


165 


65 


11112 


Y01400 

t 


Homo saoiens 


Secreted orotein encoded bv eene 1 8 
clone HNHF029. 


125 


74 


11113 


M15530 


Homo saoiens 


B-cell crowth factor 


163 


82 


11114 


V00662 


Homo sapiens 


ATPase 6 


217 


91 


11115 - 

* 


Y86248 


Homo saoiens 


Human secreted nrotein HCHPF68 
SEQIDNO:163. 


165 


100 


11116 


K02576 


Homo saoiens 


salivarv nroline-rich nrotein 1 


119 


36 


11117 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478 


132 


59 


11118 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


103 


67 


11119 


G00379 


Homo saoiens 


Human secreted orotein. SEO ED NO: 
4460. 


94 


42 


11120 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


127 


75 


11121 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


104 


64 


11122 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


90 


59 


11123 


AF161356 


Homo sapiens 


HSPC093 


119 


64 


11124 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


102 


70 


11125 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


149 


60 


11126 


AF068294 


Homo sapiens 


HDCMB45P 


272 


59 


11127 


Y27854 


Homo sapiens 
* 


Human secreted protein encoded by 
gene No. 101. 


120 


79 


11128 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


101 


78 


11129 


AF130089 


Homo sapiens 


PRO2550 


124 


80 


11130 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


109 


51 


11131 


U40265 


Trypanosoma 
cruzi 


ATPase subunit 6 


102 


31 
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SEQID 
NO: 


* Accession No. 


Species 

i 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


11132 


X01677 


Homo sapiens 


glyccraldehyde-3-phosphate 
dehydrogenase 


555 


88 


.11133 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


108 


37 


11134 


M81757 


Homo sapiens 


S19 ribosomal protein 


232 


76 


11135 


U93565 


Homo sapiens 


putative pi 50 


173 


51 


11136 


AF061944 


Homo sapiens 


kinase deficient protein KDP 


369 


100 


11137 


G00397 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4478. 


145 


61 


11138 


AB030255 


Homo sapiens 


serine/threonine phosphatase 1 gamma 


404 


83 


31139 

- 


AL049610 


Homo sapiens 


dJ1055C14.2 (KIAA0026 
(transcription factor-like protein 
MRGX)) 


248 


77 


11140 


Y36421 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 8. 


81 


53 


11141 


M13692 


Homo sapiens 


alpha- 1 acid glycoprotein precursor 


112 


41 


11142 


M16961 


Homo sapiens 


alpha-2-HS-glycoprotein 


543 


89 


11143 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


605 


88 


11144 


AF090944 


Homo sapiens 


PRO0663 


137 


67 


11145 


X92485 


Plasmodium 
vivax 


pval 

■ 


102 


57 


11146 


AF130089 


Homo sapiens 


PRO2550 


113 


58 


11147 


U63542 


Homo sapiens 


FAP protein 


154 


57 


11148 


W03988 


Homo sapiens 


SH2 domain from human SH-PTP2 
(amino acids 1-106). 


162 


73 


11149 


AF 11 6695 


Homo sapiens 


PR02221 


173 j 


44 


11150 


Ml 5530 


Homo sapiens 


B-cell growth factor 


105 


45 


11151 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


160 


64 


11152 


Ml 5530 


Homo sapiens 


B-cell growth factor 


124 


75 


11153 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


117 


67 


11154 


AF118082 


Homo sapiens 


PRO1902 


79 


52 


11155 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414^ 


123 


72 


11156 


Y27893 


Homo sapiens 


Human secreted protein encoded by 
gene No. 116. 


266 


96 


11157 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


148 


84 


11158 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


154 


72 


1115? 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


105 


58 


11160 


X60376 


Brassica napus 


proline-rich protein 


111 


39 


11161 


S79410 


Mus museums 


nuclear localization signal binding 
protein 


111 


37 


11162 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


53 


11163 


AF090930 


Homo sapiens 


PRO0478 


146 


73 


J1164 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


90 


60 


11165 


AF242772 


Homo sapiens 


mesenchymal stem cell protein 
DSCD28 


141 


51 


UU6 


X82629 


Homo sapiens 


Mox-2 


307 


83 


11167 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


168 


91 


11168 


X55684 


Lycopersicon 
esculentum 


cxtensin (class I) 


71 


66 


11169 


AC079O41 


Arabidopsis 
thaliana 


hypothetical protein 


98 


45 


11170 


AF09093O 


Homo sapiens 


PRO0478 


156 


50 
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SEQID 
NO: 


Accession No. 
• 


Species 


Description 

• 


Smith- 
Waterman 
Score 


% | 

Identit 

y 


11171 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


99 


70 


11172 


Y02749 

- * 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


124 


54 


11173 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


587 


82 


11174 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: ] 
7795. 


127 


66 


11175 

* 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI8. 


157 


68 

- 


11176 


AF090931 


Homo sapiens 


PRO0483 


146 


87 


11177 


M15530 


Homo sapiens 


B-cell growth factor 


153 


75 


11178 


AF090930 


Homo sapiens 


PRO0478 


129 


52 


11179 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


98 


77 


11180 


AF1 94537 


Homo sapiens 


NAG13 


165 


48 


11181 


AF068294 


Homo sapiens 


HDCMB45P 


146 


53 


11182 


AF090931 


Homo sapiens 


PRO0483 


153 


78 


11183 


W48351 


Homo sapiens 

• 


Human breast cancer related protein 
BCRB2. 


141 


65 


11184 


Y13396 


Homo sapiens 


Amino acid sequence of protein 
PR0332. 


452 

■ 


92 


11185 


AF090930 


Homo sapiens 


PRO0478 


132 


51 . 


11186 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


178 


79 


11187 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


237 


70 


11188 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


1.17 


83 


11189 


AL080206 


Homo sapiens 


hypothetical protein 


103 


71 


11190 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


77 


11191 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


112 


65 


11192 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


127 


54 


11193 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


150 


65 


11194 


M92046 


Trypanosoma 
cruzi 


surface antigen 


160 


26 


11195 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


126 


72 


1U96 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7305. 


98 


75 


11197 


G03133 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
7214. 


117 


50 , 


11198 


D88548 


Homo sapiens 


24-kDa subunit of complex I 


160 


47 


11199 


AF090852 


Antilocapra 
americana 


prion protein 


115 


38 


11200 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


136 


62 


11201 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


139 


75 


11202 


U93563 


Homo sapiens 


putative pl50 


239 


43 


11203 


U93570 


Homo sapiens 


putative p!50 


173 


45 


11204 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


166 


68 


11205 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


68 


11206 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


87 


56 
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SEOID 
NO: 


*AcC£S*inn Nil 


Sneeies 


DescriDtioo 


SmitH~ 

Waterman 

Score 


79 

Idcntit 

y 


11207 


W48808 


Homo sapiens 


Homo sapiens clone CG109_1 protein. 


123 


89 


11208 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


619 


87 


11209 


W48808 


Homo sapiens 


Homo sapiens clone GG 109_1 protein. 


125 


89 


11210 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


521 


83 


11211 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


90 

t * 


76 


11212 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


547 


81 


11213 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
.7878. 


92 


74 


11214 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO:' 
7253. 


126 


63 


11215 


AF162149 


Mycoplasma 
bovis 


variable surface lipoprotein 


118 

■ 


34 


11216 


AF130089 


Homo sapiens 


PRO2550 


147 


96 


11217, 


AFO90931 


Homo sapiens 


PRO0483 


112 


77 


11218 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


163 


70 


11219 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


111 


55 


11220 


AF13005.1 


Homo sapiens 


PRO0898 


122 


63 


11221 


AF130079 


Homo sapiens 


PR02852 


171 


64 


11222 


G00333 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


146 


55 


11223 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


130 


71 


11224 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


128 


69 


11225 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


141 


85 


11226 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6441. 


103 


90 


11227 


M15530 


Homo sapiens 


B-cell growth factor 


122 


55 


11228 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


121 


67 


11229 


M15317 


Plasmodium 
lophurae 


histidine-rich protein 


116 


43 


11230 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


131 


78 


11231 


W19932 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCS55. 


74 


76 


11232 


Y86248 

• 


Homo sapiens 

■ . 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


134 


67 

■ 


11233 


AK023045 


Homo sapiens 


unnamed protein product 


100 


69 


11234 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


127 


82 


11235 


AP000616 


Qryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


125 


65 


11236 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


109 


53 


11237 


AF029403 


Homo sapiens 


oxysterol 7alpha-hydroxylase 


144 


72 


11238 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


112 


44 


11239 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ED NO: 163. 


161 


96 


11240 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


127 


43 


11241 


L36529 


Homo sapiens 


protein p84 


343 


90 


11242 


M15530 


Homo sapiens 


B-cell growth factor 


122 


55 


11243 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


130 


65 
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% 
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clone HCACJ81. 






11244 


U92698 


Rattus 
xiorvegicus 


ribosomal protein S2 


229 m 


74 


11245 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


116 


41 


11246 


W75859 


Homo sapiens 


Human secretory protein of clone 
DM340-1. 


123 


37 


11247 


G038O0 


Homo sapiens 

a 


Human secreted protein, SEQ ID NO: 
7881. 


132 


73 

• 


11248 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


93 


56 


11249 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


132 


56 


11250 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


128 


72 


11251 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


115 


46 


11252 


AK021455 


Homo sapiens 


unnamed protein product 


146 


81 


11253 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


88. 

• 


56 


11254 


S80916 


Homo sapiens 


parotid "o" protein, Po=salivary 
proline-rich protein {exon 3} 


113 


43 


11255 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


136 


45 


11256 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


107 

• 


67 


11257 


AF194537 


Homo sapiens 


NAG 13 


153 


76 


11258 . 


Ml 5530 


Homo sapiens 


B-ccll growth factor 


130 


70 


1 1259 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


131 


72 


11260 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. . 


104 


59 


11261 


G02922 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7003. 


316 


92 


11262 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
44\3. 


111 


81 


11263 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


142 


71 


11264 


AF1 30089 


Homo sapiens 


PRO2550 


.111 


66 


11265 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


118 


71 


11266 


Ml 5530 


Homo sapiens 


B-cell growth factor 


158 


76 


11267 


AF130079 


Homo sapiens 


PR02852 


124 


68 


11268 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


104 


50 


11269 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


57 


11270 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


107 


63 


11271 


AF1 18082 


Homo sapiens 


PRO1902 


87 


47 


11272 


M10546 


Homo sapiens 


cytochrome oxidase I 


305 


74 


11273 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


108 


53 


11274 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


235 


81 


11275 


U93570 


Homo sapiens 


putative pl50 


182 


54 


11276 


X03I45 


Homo sapiens 


potORFV 


144 


60 


11277 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


103 


57 


11278 


AJ.035681 


Homo sapiens 


dJ756G23.1 (novel Leucine Rich 


269 


100 
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SEQ ID 
NO: 


* Accession No. 


Species 

* 


Description 


Smith- 
Waterman 
Score 


Idea tit 

y 






* 


Protein) 






11279 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


103 


61 


11280 

> 


G03356 


Homo sapiens 


Human secreted protein i SEQ ID NO: 
7437. 


115 

■ 


70 


11281 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


87 


71 


11282 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


162 


81 


11283 


AF116715 


Homo sapiens 


PR02829 


141 


71 


11284 


AL359782 


1 Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


111 


76 


11285 


U52077 


Homo sapiens 


mariner transposase 


198 


78 


11286 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


120 


62 


11287 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

im. 


103 


45 


11288 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 




60 


11289 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


123 


72 


11290 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


120 


70 


11291 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


106 


46 


11292 


AF1 16715 


Homo sapiens 


PR02829 


130 


63 


11293 


AF090930 


Homo sapiens 


PRO0478 


134 


60 


11294 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


119 


65 


11295 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


102 


50 


11296 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


141 


49 


11297 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


154 


72 


11298 


B08918 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:75. 


126 


54 


11299 


AK025270 


Homo sapiens 


unnamed protein product 


150 


54 


11300 


AF 11 8078 


Homo sapiens 


PR01848 


121 


63 


11301 


AF090942 


Homo sapiens 


PRO0657 


81 


43 


11302 


AF1 16715 


Homo sapiens _j 


PR02829 


131 


60 


11303 


AF090931 


Homo sapiens 


PRO0483 


121 


71 


11304 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


108 


82 


11305 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


122 


59 


11306 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


93 


89 


11307 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


108 


65 


11308 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


114 


51 


11309 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


67 


55 


11310 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


105 


71 


11311 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


104 


73 


11312 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


88 


73 

4 
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NO: 


Accession no* \ 


opccics 


tj CoVJi pi IvU 

• 


Smith. 

Waterman 
Score 


% 

Idcntit 

y 


11313 


U40265 


Trypanosoma 
cmzi 


ATPase subunit 6 


94 


31 


11314 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


157 


53 


11315 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


113 


54 


11316 


G02493 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6574. * 


165 

* 


90 


11317 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


149 


74 


11318 

• 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


144 


55 


11319 


AF161356 


Homo sapiens 


HSPC093 


112 


56 


11320 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


102 


53 


11321 


AF1 16661 


Homo sapiens 


PR01438 


131 


74 


11322 


AK000496 


Homo sapiens 


unnamed protein product 


137 


59 


11323 


G03703 


Homo sapiens 

i 


Human secreted protein, SEQ ID NO: 
7784. 


74 


57 


11324 


U63542 


Homo sapiens 


FAP protein 


151 


79 


11325 


M15530 


Homo sapiens 


B-cell growth factor 


192 


45 


11326 


AF116715 


Homo sapiens 


PR02829 


164 


78 


11327 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


127 


62 


11328 


AF090895 


Homo sapiens 


PRO0117 


116 


69 


11329 


AF090942 


Homo sapiens 


PRO0657 


132 


62 


11330 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


190 


72 


11331 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


112 


49 


11332 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


119 


38 


11333 

■ 


Y86248 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


131 

■ 


52 

• 


11334 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. . 


145 


53 


11335 


X92485 


Plasmodium 
vivax 


pval 


105 


38 


11336 


Ml 5530 


Homo sapiens 


B-cell growth factor 


148 


73 


11337 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


126 


100 


11338 


AF 11 8086 


Homo sapiens 


PR01992 


123 


80 


11339 


G02532 . 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


79 


66 

*• 

* 


11340 


G02532 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6613. 


162 


73 


11341 


AF130079 


Homo sapiens 


PR02852 


145 


80 


11342 


M19419 


Mus musculus 


proline-rich salivary protein 


104 


44 


11343 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 
sequence. 


123 


62 


11344 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


120 


91 


11345 


U94832 


Homo sapiens 


KSRP 


122 


51 


11346 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


139 


56 


11347 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


188 


87 


11348 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


173 


59 


11349 


W48351 


Homo sapiens 


Human breast cancer related protein 


111 


68 
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NO: 






D esc fiction 


Waterman 
Score 


Idcntit 

y 








BCRB2. 






1J350 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7492. 


149 


62 


11351 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


267 


68 


11352 


AF161356 


Homo sapiens 


HSPC093 


134 


51 


11353 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


139 


47 


11354 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


115 . 


64 


11355 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


126 


67 


11356 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


123 


59 


11357 


AF130089 


Homo sapiens 


PRO2550 


114 


62 


11358 


AK000496 


Homo sapiens 


unnamed protein product 


126 


41 


11359 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


173 


81 


11360 


G03714 


Homo sapiens 

» 


Human secreted protein, SEQ ID NO: 
7795. 


128 

• 


64 


11361 


AF130056 


Homo sapiens 


PR01367 J 


74 


57 


11362 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


116 


63 


11363 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


104 


91 


11364 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


137 


53 


11365 


Z30643 


Homo sapiens 


chloride channel (putative) 


4p9 


98 


11366 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


107 


68 


11367 


AF068294 


Homo sapiens 


HDCMB45P 


281 


57 


11368 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


110 


71 


11369 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


136 


89 


11370 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. , 


129 


79 


11371 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


137 


57 


11372 


U18339 


Variola virus 


D4L 


95 


61 


11373 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


100 


52 


11374 


Ml 5530 


Homo sapiens 


B-cell growth factor 


135 


61 


11375 


X92485 


Plasmodium 
vivax 


pval 


111 


50 


11376 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


177 


71 


11377 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


111 


68 


11378 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


96 


53 


11379 


G02567 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6648. 


83 


54 


11380 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


110 


85 


11381 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


156 


64 


11382 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


121 


79 


11383 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M | 
oncoprotein. 


104 


41 
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SEQID 
NO: 


1 Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Ideotit 

y 


11384 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


86 


53 


11385 


AP000616 


OryzasanVa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


40 


11386 


AF068294 


Homo sapiens 


HDCMB45P 


158 


48 


11387 


B08976 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:133. 


136 


46 


11388 


AF090931 


Homo sapiens 


PRO0483 


123 


74 


11389 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


156 


75 


11390 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


102 


48 


11391 


G03411 

- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


117 


54 


11392 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


134 


43 


11393 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


111 


46 


11394 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


49 


11395 


AF090930 


Homo sapiens 


PRO0478 


148 


70 


11396 


AF210651 


Homo sapiens 


NAG18 


100 


62 


11397 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


121 


56 


11398 


AF063243 


Bos taurus 


ribosomal protein L30 


165 


77 


11399 


AF162149 


Mycoplasma 
bovis 


variable surface lipoprotein 


104 


37 


11400 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


175 


71 


11401 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 4 


341 


92 


11402 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


137 


90 


11403 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


115 


70 


11404 


M10126 


Leishmania 
tarentolae 


NH2 terminus uncertain 


101 


41 


11405 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


126 


69 


1 1406 


X82385 


Homo sapiens 


RNA polymerase II subunit 


296 


100 


11407 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


68 


11408 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


141 


44 


11409 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


118 


77 


11410 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


169 


77 


11411 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


169 


82 


11412 


AK025116 


Homo sapiens 


unnamed protein product 


133 


44 


11413 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


113 


63 


11414 


AK024455 


Homo sapiens 


FLJ00047 protein 


127 


65 


11415 


AK024455 


Homo sapiens 


FU00047 protein 


87 


63 


11416 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


94 


57 


11417 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


123 


58 


11418 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


69 
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OZAJ 1U 

NO: 


Accession no. 


Species 


ucscnpmon 


(Smith. 

Waterman 
Score 


/ft 

Ideotit 

y 


11419 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


113 


53 


11420 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


166 


68 


1 1421 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


124 


51 


11422 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


113 


70 


1 1423 


G03787 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7868. 


88 


62 


11424 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


136 


72 


11425 


AK002154 


Homo sapiens 


unnamed protein product 


292 


75 


11426 


Y02887 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


98 


75 


11427 . 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


143 


64 


11428 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


131 


81 


11429 


S58722 


Homo sapiens 


X-l inked retinopathy protein {C- 
terminal, clone XEH.8c} 


130 


81 


11430 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


172 


61 


11431 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


112 


74 


11432 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


52 


11433 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


128 

■ 


58 


1 1434 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


147 


63 


11435 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


139 


83 


11436 


X63220 


Bos taurus 


NADH dehydrogenase 


154 


57 


11437 


AF090895 


Homo sapiens 


PRO0U7 


122 


59 


11438 


J04655 


,Ascarissuum 


collagen 


90 


46 


11439 


AF090942 


Homo sapiens 


PRO0657 


166 


52 


11440 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


55 


11441 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


116 


49 


11442 


M15530 


Homo sapiens 


B-cell growth factor 


105 


75 


11443 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


133 


64 


11444 


Y19730 


Homo sapiens 


SEQ ID NO 448 from W09922243. 


115 


67 


11445 


AF090895 


Homo sapiens 


PRO0117 


79 


61 


11446 


AF1 18086 


Homo sapiens 


PR01992 


109 


63 


11447 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


160 


80 


11448 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


97 


75 


11450 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


136 


72 


11451 


Y02999 


Homo sapiens 
■ 


Fragment of human secreted protein 
encoded by gene 121. 


89 


55 


11452 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


144 


71 


11453 


AF090931 


Homo sapiens 


PRO0483 


140 


78 


11454 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


129 


65 
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• 
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Smith- 
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Score 


% 

fftonflt 
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y 


11455 


G00383 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4464. 


104 


63 


11456 


AF1 18086 


Homo sapiens 


PRO 1992 


151 


71 


1 1457 

'i 


U73168 


Homo sapiens 


partial CDS, human putative tumor 
suppressor (U23946) 


121 


95 


11458 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


96 


73 


13459 


AK000496 


Homo sapiens 


unnamed protein product 


233 


63 


11460 


AK024455 


Homo sapiens 


. FLJ00047 protein 


90 


73 


11461 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO:' 
7795. 


132 


64 


11462 


G00376 


Homo sapiens 

* 


Human secreted protein, SEQ JD NO: 
4457. 


103 


86 


11463 


K03207 


Homo sapiens 


salivary proline-rich protein precursor 


135 


36 


11464 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


127 


88 


11465 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


104 


77 


11466 


AF1 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


141 


48 


11467 


AF090942 


Homo sapiens 


PRO0657 


89 


58 


11468 


AC005698 


Arabidopsis 
thaliana 


T3P18.6 


128 


47 


11469 


G03786 


Homo sapiens 


Human secreted protein, SEQID NO: 
7867. 


101 


48 


1 1470 


G03786 


Homo sapiens 


' Human secreted protein, SEQ ID NO: 
7867. 


136 


81 


11471 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


91 


45 


1 1472 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


102 


77 


1 1473 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


145 


55 


1 1474 


AF210651 


Homo sapiens 


NAG 18 


117 


57 


11475 


Y14482 


Homo sapiens . 


Fragment of human secreted protein . 
encoded by gene 17. 


117 


62 


11476 


AF1 16715 


Homo sapiens 


PR02829 


124 


77 


11477 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


136 


45 


11478 


G02828 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6909. 


85 


78 


11479 


X91940 


Homo sapiens 


WNT-8B protein 

m .... 


204 


97 


1 1480 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


131 


71 


11481 


Y02671 


Homo sapiens 

■ 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


117. 

« 


63 


11482 


X92485 


Plasmodium 
vivax 


pval 


98 


52 


11483 


AF217518 


Homo sapiens 


uncharacterized bone marrow protein 
BM042 


895 


100 


11484 


U79260 


Homo sapiens 


unknown 


104 


80 


11485 


L27428 


Homo sapiens 


reverse transcriptase 


169 


52 


. 1 1486 


M15530 


Homo sapiens 


B-cell growth factor 


122 


67 


1 1487 


U49974 


Homo sapiens 


mariner transposase 


156 


62 


11488 


AF090942 


Homo sapiens 


PRO0657 


110 


44 


11489 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


156 


65 


11490 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


115 


52 


11491 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 S 51 
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SEQID 
NO* 


Accession No. 


Species 


Description 


Smith- 
W&termfln ' 

»▼ ■ U U MM 

Score 


I den tit 

IUCU %»% 

y 








7214. 






11492 


AF090930 


Homo sapiens 


PRO0478 


145 


58 


11493 


X63726 


Phoca vitulina 


cytochrome c oxidase subunit I 


556 


87 


-H494 


AL021897 


Mycobacterium 
tuberculosis 


pra 


119 


56 


11495 


X98485 

> 


Plasmodium 
vivax 


putative 


92 


35 


1 1496 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


156 


73 


11497 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


129 


72 


11498 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


115 


66 


11499 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


58 


11500 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


113 


36 


11501 


AE003568 


Drosophila 
melanogaster 


CG12566 gene product 


105 


32 


11502 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


141 


78 


11503 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


102 


51 


11504 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


138 


60 


11505 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


109 


85 


1 1506 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


101 


59 


11507 


M81321 


Macaca 
fascicularis 


proline-rich protein 


128 


48 


11508 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


110 


78 


11509 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


98 


60 


11510 


X07882 


Homo sapiens 


Po protein 


118 


41 


11511 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


133 


79 


11512 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


68 


11513 


AF000381 


Homo sapiens 


non-functional folate binding protein 


203 


93 


11514 


X92485 


Plasmodium 
vivax 


pval 


115 


44 


11515 


S58722 


Homo sapiens 

* 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


109 


51 


11516 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


141 


78 


11517 


AF090894 


Homo, sapiens 


PRO0113 


108 


58 


11518 


M15530 


Homo sapiens 


B-cell growth factor 


93 


72 


11519 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7165. 


66 


73 


11520 


AF1 17065 


Homo sapiens 


male-specific lethal-3 homolog 1 


200 


61 


11521 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


335 . 


87 


11522 


M15530 


Homo sapiens 


B-cell growth factor 


104 


74 


11523 


AF1 16715 


Homo sapiens 


PR02829 


122 


64 


11524 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7871. 


145 


68 


11525 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


103 


61 



442 



WO 01/64835 PCT/US01/04927 



SEQU> 


'Accession No. 


Species 

■ 


Description 


Smith- 

▼» u ici mail 

Score 


% 

Irion rit 

y 






» 


gene 22 clone HMSJW18. 






11526 


X92485 


Plasmodium 
vivax 


pval 


111 


42 


11527 

'i 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


101 


80 


11528 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


134 . 


58 


11529 


AF090931 


Homo sapiens 


PRO0483 


163 


76 


11530 


AF 119900 


Homo sapiens 


PR02822 


139 


44 


11531 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


73 


11532 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


126 


50 


11533 


S79410 

♦ 


Mus musculus 


nuclear localization signal binding 
protein 


107 


46 


11534 


AF090930 


Homo sapiens 


PRO0478 


120 


81 


11535 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


105 


64 


11536 


X92485 


Plasmodium 
vivax 


pval 


123 


56 


11537 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


122 


72 


11538 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


108 


64 


11539 


AE003834 


VrosophUa 
melanogaster 


CG8054 gene product 


456 


84 


11540 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


135 


53 


11541 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


99 


70 


11542 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


77 


63 


11543 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


123 


73 


11544 


G03052 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
7133. 


78 


58 


11545 


U79260 


Homo sapiens 


unknown 


128 


43 


11546 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2. SEQ ID 
NO: 128. 


122 


56 


11547 


X92485 


Plasmodium 
vivax 


pval 


116 


46 


1 1 548 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


III 


55 


11549 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


95 

■ 


75 

• 


11550 


U67056 


Acanthamoeba 
castellanii 


myosin I heavy chain kinase 


111 


44 


11551 


Y14480 


Homo sapiens 


Fragment of human secreted protein 

* 

encoded by gene 14. 


159 


60 


11552 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


127 

i 


50 


11553 


Z34975 


Homo sapiens 


ldlCp 


470 


86 


11554 


AF210651 


Homo sapiens 


NAG18 


101 


65 


11555 


X78677 


Homo sapiens 


ketohexokinase 


241 


92 


11556 


G04039 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8120. 


254 


72 


11557 


AB032976 


Homo sapiens 


KIAA1 150 protein 


565 


75 


11558 


U93572 


Homo sapiens 


p40 


127 


37 
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1 CI? ft TT* 

NO: 


rlCCCSoivu i w. 


1 


I/wCnp(fUD 


oniiiir- 

Waterman 

Score 


5? 

79 

Identit 

y 


11559 


U27488 


Pseudorabies 
virus 


glycoprotein gX 


88 

• 


38 


11560 


R80095 


Homo sapiens 


Human superoxide-dismutase-4 
polypeptide. 


113 


63 


11561 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


83 


65 


11562 


AF1 30089 


Homo sapiens 


PRO2550 


111 


33 


11563 


G00328 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


146 


60 


11564 


AF090895 


Homo sapiens 


PRO0117 


172 


77 


11566 


V01512 


Homo sapiens 


c-fos 


416 


82 


11567 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


72 


33 


11568 


X16074 


Mussp. 


L-34 protein (AA 1-264) 


117 


35 


11569 


Y45381 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


131 


77 


11570 


AK025395 


Homo sapiens 


unnamed protein product 


683 


98 


11571 


D78255 


Mus musculus 


PAP-1 


134 


80 


11572 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


110 


70 


11573 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


131 


78 


11574 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


141 


59 


11575 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


58 


11576 


AC007842 


Homo sapiens 


BC331191 1 


409 


86 


11577 


Z22636 


Sinorhizobium 
meliloti 


Thi 


103 


36 

■ 


11578 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


83 


36 


11579 

• 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7784. 


115 


60 


11580 


AF162149 


Mycoplasma 
bovis 


variable surface lipoprotein 


167 


44 


11581 


AF 130079 


Homo sapiens 


PR02852 


117 


77 


11582 


AF229067 


Homo sapiens 


PADI-H protein 


136 


75 


11583 


R10755 


Homo sapiens 

- 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533 . 


132 

■ 


72 


11584 


Z70292 


Homo sapiens 


chemokineCC-l 


386 


93 


11585 

■ 


R10755 


Homo sapiens 

• 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


107 

• 


67 

- 


11586 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XER8c} 


134 


75 


11587 


X05006 


Homo sapiens 


S-protein 


237 


100 


11588 


Y21850 


Homo sapiens 


Human signal peptide-contianing 
protein (SIGP) (clone ID 1880830). 


365 


77 


11589 


U43959 


Homo sapiens 


beta 4 adducin 


1 14 


54 


11590 


AF030162 


Homo sapiens 


inner mitochondrial membrane 
translocase Tim23 


104 


95 


11591 


X03557 


Homo sapiens 


56-KDa protein (aa 1-478) 


491 


74 


11592 


AF151074 


Homo sapiens 


HSPC240 


151 


73 


11593 


AF1 16719 


Homo sapiens j 


PR02987 


558 


89 


11594 


AF090942 


Homo sapiens 


PRO0657 


120 


41 


11595 


X55039 


Homo sapiens 


centromere autoantigen B (CENP-B) 


160 


32 


11596 


X98475 


Mus musculus 


vasodilator-stimulated phosphoprotein 


101 


43 


1 1597 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


140 


73 
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SEQID 

|N LI. 


Accession No. 


Species 


Description 


Smith- 

vjr Hicnn&n 

Score 


% 

loeoui 

y 


11598 


U07151 


Homo sapiens 


ARL3 


113 


35 


11599 


S58722 


Homo sapiens 


X-l inked retinopathy protein {C- 
terminal, clone XEH.8c} 


123 


79 


11600 


AL050084 


Homo sapiens 


hypothetical protein 


119 


100 


11601 


AF090931 


Homo sapiens 


PRO0483 


124 


73 


11602 


Q02532 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. , 


132 


60 


11603 


G03807 


Homo sapiens 


Human secreted protein. SEO ID NO: 
7888. 


156 


82 


11604 


AF224494 


Mus musculus 


arsenite inducible RNA associated 
protein 


473 


79 


11605 


M60047 


Homo sapiens 


heparin binding protein 


597 


95 


11606 


G00933 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5014. 


721 


97 


11607 


W12091 


Homo sapiens 


C57S derivative of antigen non-specific 
human GIF. 


573 


98 


11608 


AF229439 


Mus musculus 


zinc finger protein 289 


535 


80 


11609 


M86637 


Brugia pahangi 


unknown 


104 


38 


11610 


G03789 


Homo sapiens • 


Human secreted protein, SEQ ID NO: 
7870 


93 


70 


11611 


G02224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6305 


508 


94 


11612 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613 


108 ' 


51 


11613 


AK024372 


Homo sapiens 


unnamed protein product 


105 


77 


1 1&14 

1 W l*T 




AX\JAAl\J OClL^lwllO 


Hum an Qecrpted nimtpin elr*n*» 

JJLUlilCUJ Q&UvlWU L/IUlCUl WAWlAW 

dd504_1 8 protein sequence SEQ ID 
NO-42 


no 

M 


62 


11615 


Y48547 


Homo sapiens 


Human breast tumour-associated 
nrotein 8 


641 


96 


11616 

I 1 V 1 \J 


Y27868 


Homo saoiens 


Human secreted nrotein encoded bv 
gene No. 107. 


108 


81 

U A 


11617 


L34081 


Homo saoiens 


• bile acid CoA* Amino acid N- 
acyltransf erase 


356 

W mf V 


82 


11618 


G01399 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5480. 


570 


98 


11619 

A A w A *r 


AF020038 


Homo saoiens 


NADP-deoendent isocftrafe 
dehydrogenase 


272 


83 


11620 


AB051901 


Homo sapiens 


VDUP1 


247 


98 


11621 


Y48547 


Homo saoiens 


Human breast tumour-associated 
protein 8. 


691 


92 

mW 4\m0 


11622 


U22961 

» 


Homo sapiens 

• 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


382 


71 


11623 


A06977 


Homo sapiens 


albumin 


562 


92 


11624 


X73460 


Homo sapiens 

* 


ribosomal protein L3 


502 


84 


11625 


AL035398 


Homo sapiens 


dJ796117.1 (PUTATIVE novel protein) 
_ — , . — ^„ r . i. 


373 


98 


11626 


D50310 


Homo sapiens 


cyclinl 


175 


100 


11627 


AE000950 


Archaeoglobus 
fulgidus 


thermosome, subunit alpha (thsA) 


131 


25 


11628 


D88315 


Mus musculus 


tetracycline transporter-like protein 


137 


81 


11629 


Y36204 


Homo sapiens 


Human secreted protein #76. 


406 


100 


11630 


AF130089 


Homo sapiens 


PRO2550 


110 


63 


11631 


J02982 


Homo sapiens 


glycophorin B precursor 


95 


100 


11632 


AF1 16719 


Homo sapiens 


PR02987 


558 


93 


11633 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


137 


92 


11634 


U09823 


Oiyctolagus 


elongation factor 1 alpha 


483 


85 
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SEQ ID 
NO: 


Accession No. 


Species 

* 

- 


Description 

* 


Smith- 
Waterman 
Score 


Yo 

1 den tit 

y 






cuniculus 








11635 


G01922 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6003. 


157 


73 

* 


11636 


AF116719 


Homo sapiens 


PR02987 


491 


90 


11637 


D14421 


Rattus 
norvegicus 


b isotype of B regulatory subunit of 
protein phosphatase 2A 


208 


86 


11638 


AF1 16719 


Homo sapiens 


PR02987 


544 


93 


11639 


AF130079 


Homo sapiens 


PR02852 


327 


81 


11640 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


150 


59 


11641 


G00361 


Homo sapiens 

i 


Human secreted protein, SEQ ID NO: 
4442. 


100 


66 


11642 


G03411 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


119 

. 


71 


11643 


Y86574 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:491. 


64 


100 


11644 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


82 


70 


11645 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6441. 


109 


74 


11646 


AF130089 


Homo sapiens 


PRO2550 


128 


52 


11647 


AK024455 


Homo sapiens 


FLJ00047 protein 


138 


61 


11648 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


73 


84 


11649 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


128 


60 


11650 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518, 


83 


78 


11651 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 

• 


64 


11652 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


103 


62 


11653 


AF 13 0089 


Homo sapiens 


PRO2550 


116 


69 


11654 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


176 


54 


11655 


G03133 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7214. 


129 


63 


11656 


AF130089 


Homo sapiens 


PRO2550 


135 


77 


11657 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


146 


93 


11658 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


123 


6 * 


11659 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
47J 8. 


117 


47 


11660 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


92 


84 


11661 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


70 


75 


11662 


U00029 


Saccharomyces 
cerevisiae 


Yhr217cp 


101 


50 


11663 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


107 


88 


11664 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


110 


90 


11665 


U63542 


Homo sapiens 


FAP protein 


119 


69 


11666 


AF130114 


Homo sapiens 


PR02459 


78 


38 


11667 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


106 


64 


11668 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


66 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 

• 








4478. 






11669 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


77 


63 


11670 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: ~" 
7492. 


116. 


68 


U671 


AK021618 


Homo sapiens 


unnamed protein product 


97 


43 . 


11672 


G02482 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6563. 


108 


55 


11673 


S75997 


Rattus sp. 


nucleoporin p62 homolog 


95 


40 


11674 


AF229163 


Homoskpiens 


natural resistance-associated 

r 

macrophage protein 1 


104 


54 


11675 


Ml 5530 


Homo sapiens 


B-cell growth factor 


89 


81 


11676 


M92357 


Homo sapiens 


B94 protein 


143 


100 


11677 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


132 


74 


11678 


G0341 1 


Homo sapiens 


Human secreted protein, . SEQ ID NO: 
7492. 


126 


64 


11679 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


66 


11680 


U16359 

. 1 » 


Rattus 
norveeicus 


nitric oxide synthase 


108 


85 


11681 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


92 


11682 


Ml 5530 


Homo sapiens 


B-cell growth factor 


127 


64 


11683 


AF090931 


Homo sapiens 

* hj »P»-fc* * *P"A*W 


PRO0483 


152 


60 


11684 


AF130087 


Homo sapiens 


PR0241I 


150 


68 


11685 


U18339 


Variola vims 


D4L 


94 


70 


11686 


S58722 


Homo sapiens 


X-linked retinopathv protein IC- 
terminal, clone XEH.8cl 


104 


80 


11687 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


134 


68 


11688 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


131 


70 


11689 


AF090944 


Homo sapiens 


PRO0663 ' 


129 


73 


11690 


G03790 


Homo sapiens 

A * VpVA-* ^p* •jpfJPPpP' # W ■ 1 W 


Human secreted protein. SEO ID NO: 
7871. 


111 


67 


11691 


U79260 


Homo sapiens 


unknown 


98 


44 


11692 


W78226 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1. 


361 


64 


11693 


G00962 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5043. 


269 


100 


1 1694 


AL359782 


Trypanosoma 
brucei 


possible (hny-6) ul 102, variant a dna, 
complete virion genome. 


132 


83 


11695 


Y94526 


Homo sapiens 


Human lysine-rich statherin protein. 


136 


82 


11696 


AF 116637 


Homo sapiens 


PR01489 


232 


95 


11697 


G00985 


Homo sapiens 

a ^^^^ ^p»^^^ 


Human secreted protein, SEQ ID NO: 
5066. 


108 


50 


11698 

m * ^ ^p* 


Y01158 


Homo sapiens 


Secreted protein encoded by eene 18 
clone HCAGJ81. 


120 


81 


11699 


G00416 

* 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4497. 


150 


80 


11700 


AK024455 


Homo sapiens 


FLJ00047 protein 


148 


68 


11701 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


131 


72 


11702 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


105 


45 


11703 


AC005545 


Homo sapiens 


delta-adaptin, partial CDS 


127 


38 


11704 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


143 


81 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Sraith- 
Waterraan 


% 

Identit 

y 


11705 . 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613 


109 


67 


11706 


G00637 


Homo sapiens 


Human secreted protein, SEO ID NO: 
4718. — 


153 


67 


11707 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


129 


88 


11708 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


89 


69 


11709 


AB047600 


Macaca 
fascicularis 


hvDothetical protein 

, 1 


95 


63 


11710 


R96800 


Homo saoiens 


Human histiocvte-secreted factor HSF. 


100 


76 


11711 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


112 


80 


11712 
* * i a*» 


G03787 


Homo saoiens 

X X\JIH\J JHU1V1U 


Human secreted orotein SEO ID NO* 

AAUAlAtUA OvvlvlwU L/lVwUlj k-/ H / Jiv> 

7868. 

■ 


117 

A A f 


58 


1 1713 


AF008196 


Womn saniens 


ha X pnsilnn 


140 


71 


11714 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 

oTMwrMwtei'n 

UiXWJ/J UlAyAAl. 


113 


56 


11715 


G00397 


Homo sapiens 


Human secreted protein; SEQ ID NO: 


125 


67 


11716 


G03133 , 


Homo sapiens 


i Human secreted protein, SEQ ID NO: 


121 


74 


1 1717 

1 A / 1 r 


X JvOii 


Haiti n Qnni^nc 


AAUiliali aCwIClCU pitrlvlU CllWUUvU UUUl 

frptiP 1 1 

g9Uv A^. 


****^. 


si 
j i 


1 1718 

1 I / 1 o 






TJtiTTinn cpwrpttvl nr/\tif»tn CRr^ i i i "NTl'"i* 
iAuiiiau bcv>icucu piutcm, OCy aL/ I^VA 

6561 


yo 




11719 


Y20717 


Homo sam'ens 


Human neurofilament-M wild tvoe 

A AMI li till llvUl VAilvllAlVAlb I»A »r J1U IJlA/ 

nrotein fragment 59. 


106 


85 


11720 


AJ243883 


Periplaneta 
americana 


putative transcription factor 


118 


57 


11721 


AF220264 


Homo saniens 


MOST-1 


121 


74 


11722 


AF130051 


Homo sapiens 


PRO0898 


169 


72 


11723 


K02401 


Homo sapiens 


chorionic somatomammotropin 


507 


95 


11724 


K02401 


Homo saoiens 


chorionic som atom am m otroo i n 


461 


85 


11725 

X * * ***** 


U93564 


Homo saoiens 


d40 


340 


90 


11726 


AF130089 


Homo sapiens 


PRO2550 


107 


73 


win 

1 A 1 I 


G00376 


Homo saniens 


Human secreted nrotein SRO TO NO* 

4457. 


128 


79 


1 1728 


J02963 


Homo saniens 

A 1UIUV vU|/tvlU 


nlatelef plvconrntein n h nrecur^nr 

L/ lukvlwl KIT Wvl Uiblll AAV UlvvlUovI 


109 


90 


11729 


Y51824 


Homo sapiens 


Human OSBH protein. 


182 


89 


11730 


G03800 


Homo saniens 


Human secreted nrotein SKO TT> NO* 

AAUAA1CUA JVWivlvU \J\ "k/lrWlAlj JiJV^ AAV l^W. 

7881. 


107 


80 


11731 • 


K02401 


Homo saniens 


chorionic snmatomammotronin 


459 


94 


11732 


W48351 


Homo saoiens 


Human breast cancer related orotein 

A AMI 111*11 vl wUO* vCUlwvl 1 wlUlVU Ulwiil 

BCRB2. 


93 


42 


11733 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


111 


67 


11734 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded bv eene 27. 


103 


43 


11735 


S79410 


Mus musculus 


nuclear localization signal binding 
nrotein 


98 


75 


11736 


AF09093I 


Homo sapiens 


PRO0483 


123 


88 


11737 


K02401 


Homo sapiens 


chorionic somatomammotropin 


340 


92 


11738 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


73 


11739 


G02624 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6705. 


131 


57 


11740 


X55695 


Lycopersicon 
esculentum 


glycine-rich protein 


164 


48 


11741 


AK024455 


Homo sapiens 


FLJ00047 protein 


89 


77 
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seo n> 

NO: 


Accession No. 

• m w » wW w*» W ■ » 4 *V* 


Soccics 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y ; 


11742 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


a m m* 

415 i 


90 


J 1743 


S58722 


Homo sapiens 


X- J inked retinopathy protem {C- 
terminal, clone XEH.8c} 


105 


94 


11744 


S58722 


Homo sapiens 


X-linked retmopathy protem {C- 
terminal, clone XEH.8c} 


% 1 m*" 

116 


87 


11745 


G00454 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
4535. 


131 


69 


11746" 


W13831 


Homo sapiens 


Human cyclin E delta 9 mutant 


mi M mm 

140 


52 


11747 


AB001431 


Mus musculus 


motor domain or KIF14 


179 


84 


11748 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


141 


72 


1 1 749 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


99 


63 


11750 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


88 


64 


11751 


AF09093 1 


Homo sapiens 


PRO0483 

- ,_ , _ .. . , • 


118 



68 


■11752 


AF150105 


Homo sapiens 


small zinc frager-like protein 


194 


68 


11753 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


127 


69 


11754 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


100 


80 


11755 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


109 


65 


11756 


K02401 


Homo sapiens 


chorionic somatomammotropin 


427 


87 


11757 


U52077 


Homo sapiens 


mariner transposase 


113 


82 


11758 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


133 


54 


11759 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


Ill 


80 


11760 


AF090944 


Homo sapiens 


PRO0663 


143 


76 


11761 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


529 


99 


11762 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


171 


58 


11763 


AF 132200 


Homo sapiens 


PR01751 


127 


66 


11764 


AK025116 


Homo sapiens 


unnamed protein product 


141 


80 


11765 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


75 


62 


11766 


AF090930 


Homo sapiens 


PRO0478 


106 


59 


11767 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


127 


80 


11768 


AF090931 


Homo sapiens 


PRO0483 


164 


60 


11769 


S58722 

- 


Homo sapiens 


X-hnked retinopathy protem {C- 

. • * m mm mm m\ % 

termmal, clone XEH.8c} 


102 


81 


11770 


AF161356 


Homo sapiens 


HSPC093 


131 


68 


11771 


AF130089 


Homo sapiens 


PRO2550 


160 


82 


11772 


Y14482 


Homo sapiens 

* 


Fragment of human secreted protein 

* 1 % m\ am* 

encoded by gene 17. 


140 


79 


11773 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

m _> M\ Mm, 

4518. 


136 


71 


11774 


AF090942 


Homo sapiens 


PRO0657 


153 


78 


11775 


AJ223410 


Homo sapiens 


EBIl-ligand chemokine 


1 AO 

108 


52 


11776 


K02401 


Homo sapiens 


chorionic somatomammotropin 


395 


91 


11777 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


107 


70 


11778 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


99 


62 


11779 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


84 


64 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identft 

y 


11780 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433, 


116 


68 


11781 


AF 194537 


Homo sapiens 


NAG13 


201 


65 


11782 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


139 


68 


11783 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


149 


59 


11784 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


112 


78 


11785 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


99 


68 


11786 


AF130089 


Homo sapiens 


PRO2550 


108 


83 


11787 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


124 


88 


11788 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


103 


100 


11789 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


146 


60 


11790 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


165 


73 


11791 


AL035494 


Homo sapiens 


dJ635G19J23 (novel protein 
(PUTATIVE PARTIAL isoform 3)) 


106 


56 


11792 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


66 


11793 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


125 


61 


1 1 794 


A20770 

* 


synthetic 
construct 


(Aspl-Thrl61)-EPI 


95 


76 


11795 

■ 


S74221 


Homo sapiens 


IK factor=cytokine down-regulating 
HLA class II 


108 


61 


11796 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


96 


77 


11797 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


111 


65 


11798 


Y75995 


Homo sapiens 


Human skin cell protein, SEQ ID 
NO:173. 


324 


87 


11799 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-5 


120 


75 


11800 


L00693 


Homo sapiens 


carcin oembryoni c antigen 


96 


85 


11801 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


80 


44 


11802 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


125 


64 


11803 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


142 


82 


11804 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


107 


85 


11805 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


152 


80 


11806 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


122 


61 


11807 


U91985 


Homo sapiens 


DNA fragmentation factor-45 


585 


92 


11808 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


60 


1 1 809 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


170 


89 


11810 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


191 


76 


11811 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


100 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

v 


11812 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


156 


55 


11813 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


121 


84 


.11814 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


107 


75 


11815 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


74 


78 


11816 


M15530 


Homo sapiens 


B-cell growth factor 


148 


68 


11817 


AF090942 


Homo sapiens 


PRO0657 


139 


60 


11818 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


117 


100 


11819 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla ( AF078683} 

A^A AA A A V* W U V " WW / 


133 


73 


11820 


Z72853 


Saccharomyces 
cerevisiae 


ORFYGR069W 


94 


42 . 


11821 

A A 4 


G03710 

■ 


Homo saoiens 


Human secreted orotein SEO ID NO* ■• 
7791. 


176 


65 


11822 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683} 


129 


60 


11823 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DN A from olasmid oGCS 1 1 03 7 


104 


68 


11824 


G00333 


AAw&AXV lHyi¥Uw 


Human secreted orotein SEO ID NO* 
4414. 


99 


51 


11825 


W90172 


Homo sapiens 


Human heart muscle sDecific orotein. 


256 


34 


11826 


AB017007 


Homo sapiens 


PMS2L16 


248 


89 


11827 

A A v« r 


AL1 60371 

J*AV 4 W«J / A 


Leishmania 

* 

major 


orobable fhhv-6^ ul 102 variant a 
DNA, complete virion genome 


127 

Lit f 


86 

t/V7 


11828 


G03411 


Homo saoiens 


Human secreted orotein SEO ID NO* 
7492. 


135 

• 


71 


11829 


AF1 19851 


Homo sapiens 


PRO 1722 


129 


60 


11830 


AF 132200 


Homo sapiens 


PR01751 


120 


77 


11831 


Y08061 


Homo saoiens 


Human c-mvb orotein fra&ment 

XlUUJIUi V ILIJW UlvMfUl 1 1 fl f~l 1 1 V 1 1 1 






11832 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066 


128 


64 


11833 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


152 


100 


11834 


Y86248 

A WA* * V 


Homo saoiens 


Human secreted Drotein HCHPF68 

AAImUUU tAfWlvlVU 1/1 VlVlil AAV111 A 

SEQIDNO:163. 


156 


73 


11835 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


88 


11836 


AP000616 

* * 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


134 


95 


11837 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEO ID NO:125. 


411 


91 


11838 


AK000496 


Homo sapiens 


unnamed Drotein product 


113 


78 


11839 


AF 119855 


Homo sapiens 


PR01847 


84 


68 


11840 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla (AF078683) 


128 


88 


11841 


S79410 


Mus muscuhis 


nuclear localization signal binding 
protein 


113 


69 


11842 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


151 


55 


11843 


AF090942 


Homo sapiens 


PRO0657 


139 


65 


11844 


AF220264 


Homo sapiens 


MOST-1 


88 


69 


11845 


AF184612 


Drosophila- 
rnelanogaster 


split ends 


95 


47 


11846 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


136 


82 
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SEQ ID 

NO: 


'Accession No. 


Species 

■ 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

Y 


11847 


AB017007 


Homo sapiens 


PMS2L16 


310 


100 


11848 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


100 


79 


11849 


W03642 


Homo sapiens 


Human carmabinoid GPR N-terminal 
sequence. 


HI 


61 


11850 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


HQ 


72 


11851 


AF090931 


Homo sapiens 


PRO0483 


164 


80 


11852 


AF1 16661 


Homo sapiens 


PR01438 


124 


67 


11853 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


119 


72 


11854 


G02872 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
6953. 


135 


59 


11855 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


100 


84 


11856 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


120 


54 


11857 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


68 


11858 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


69 


11859 

• 


W48351, 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


66 


11860 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


94 


66 


11861 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27 . 


108 


75 


11862 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


108 


86 


11863 


AF130050 


Homo sapiens 


PRO0872 


81 


58 


11864 


AF 119900 


Homo sapiens 


PR02822 


152 


90 


11865 


G02538. 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


78 


45 


11866 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


526 


93 


11867 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


123 


80 


11868 


G00591 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4672. 


119 


58 


11869 

- 


Y10830 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


246 


100 


11870 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


101 


59 


11871 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


120 


59 


11872 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


129 


84 


11873 


D28113 


Homo sapiens 


MOBP 


458 


80 


11874 


D28114 


Homo sapiens 


MOBP 


306 


68 


11875 

* * 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


101 


85 


11876 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


110 


64 


11877 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


189 


57 


11878 


M15530 


Homo sapiens 


B-cell growth factor 


98 


62 


11879 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


130 


100 


11880 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


99 


70 


11881 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


101 


53 



452 



PCTYUS01/04927 

» 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- . 

Waterman 

Score 


% 

Identit 

y 


11882 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


219 


75 


11883 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


131 


61 


11884 


U93563 


Homo sapiens 


putative pi 50 


139 


75 


11885 


R07057 

* 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct 


154 


52 


11886 


AF090944 


Homo sapiens 


PRO0663 


149 


80 


1 1887 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


107 


94 


11888 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2, 


138 


62 


11889 


L26953 


Homo sapiens 


chromosomal protein 


123 


75 


11890 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


105 


69 


1 1891 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


129 


83 


11892 


R10755 


Homo sapiens 

* 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


126 


78 


11893 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


82 


70 


11894 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


149 


90 


11895 


S58722 


Homo sapiens 


X-Iinked retinopathy protein ic- 
terminal, clone XEH.8c} 


118 


75 


11896 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


167 


86 


11897 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


136 


77 


11898 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


126 


58 


11899 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


158 


78 


11900 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


137 


71 


11901 


G00613. 


Homo sapiens 


Human secreted protein, SEQ ID NO: . 
4694. 


85 


62 


11902 


G02558 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6639. 


242 


73 


11903 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


127 

* 


59 


11904 


G03472 

< 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7553. 


182 


85 


11905 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


135 


75 


11906 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


128 


79 


11907 


U93574 


Homo sapiens 


putative pi 50 


169 


64 


11908 


AF090944 


Homo sapiens 


PRO0663 


128 


75 


11909 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


100 


84 


11910 


AF090930 


Homo sapiens 


PRO0478 


159 


76 


11911 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


496 


88 


11912 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


551 


92 


11913 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:103. 


168 


76 


11914 


K02401 


Homo sapiens 


chorionic somatomammotropin 


450 


85 


11915 


K02401 


Homo sapiens 


chorionic somatomammotropin 


478 


87 


11916 


K02401 


Homo sapiens 


chorionic somatomammotropin 


454 


82 



453 



WO 01/64835 



WO 01/64835 PCTYUS01/04927 

• 



ClJA tT\ 

bbQ ID 
NO; 


Accession No. 


Species 


vescnpnoD 


afultD- 

Waterman 

Score 


1 Q> 

Identft 

y 


11917 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


121 


73 


11918 


X65121 


Musmusculus 


alpha 1 (X) collagen 


112 


35 


11919 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


-115 


62 


11920 


AK024455 


Homo sapiens 


FLJ00047 protein 


102 


76 


11921 


AF1 50087 


Homo sapiens 


small zinc finger-like protein 


181 


52 


11922 


U55376 


Caenorhabditis 
elegans 


F16H11.2 gene product 


122 


78 


11923 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: ' 
4473. 


256 


80 


11924 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


127 

■ ■ 


74 


11925 


AF090944 


Homo sapiens 


PRO0663 


.144 


75 


11926 


AJ223475 


Escherichia coli 


InsA protein 


477 


100 


11927 


M15386 


Homo sapiens 


gamma-globin 


616 


92 


11928 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


105 


72 


1 1 929 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


122 


92 


11930 


AF090895 


Homo sapiens 


PRO0117 


80 


71 


11931 


AF 130089 


Homo sapiens 


PRQ2550 


129 


68 


11932 


AK025116 


Homo sapiens 


unnamed protein product 


113 


53 


11933 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


114 


70 


11934 


G00577 


Homo sapiens 

♦ 


Human secreted protein, SEQ ID NO: 
4658. 


69 


76 


11935 


AF1 16695 


Homo sapiens 


PR02221 


336 


70 


11936 


AF090931 


Homo sapiens 


PRO0483 


133 


77 


11937 


AFI30089 


Homo sapiens 


PRO2550 


135 


72 


11938 


L06237 


Homo sapiens 


raicrotubule-associated protein IB 


143 


36 


11939 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


124 


72 


11940 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4508. 


158 , 




11941 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


130 




11942 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


HI 


65 


11943 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


106 


59 


11944 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


117 


65 


11945 


D28113 


Homo sapiens 


MOBP 


186 


62 


11946 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 1 


353 


76 


11947 


Y20717 


Homo sapiens 

• 


Human neurofilament-M wild type 
protein fragment 59. 


113 


72 

' 


11948 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


139 


87 


11949 


AJ006591 


Homo sapiens 


cysteine-rich protein 


148 


76 


11950 

■ 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, done XEH.8c} 


113 


88 


11951 


AF 11 6695 


Homo sapiens 


PR02221 


205 


57. 


11952 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO: 128. 


158 


75 


11953 


AF1 13685 


Homo sapiens 


PRO0974 


107 


72 


11954 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


107 


80 


11955 


AK026107 


Homo sapiens 


unnamed protein product 


129 


84 


11956 


AF109907 


Homo sapiens 


SI 64 


140 


94 



454 



SEQID 

lMfV 

. 


Accession No. 


Species 


Description 


Smith- 

Wflttnnfln 

TT nl^l 1114111 

Score 


% 

lUCIUIl 

y 


11957 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIRlA41726) 


99 


100 


11958 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


124 


41 


11959 


K02401 


Homo sapiens 


chorionic somatomammotropin 


497 


90 


11960 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


155 


69 


11961 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


126 


85 


11962 


Ml 5530 


Homo sapiens 


B-cell growth factor 


114 


86 


11963 


K02401 


Homo sapiens 


chorionic somatomammotropin 

* 


472 


87 


11964 


AKQ24455 


Homo sapiens 


FLJ00047 protein 


95 


61 


11965 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


145 


78 


11966 


L27428 


Homo sapiens 


reverse transcriptase 


144 


50 




G02532 


Homo saoiens 


Human secreted protein. SEO ID NO: 
6613. 


187 


76 


11968 


W21581 


Homo saoiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


124 


81 


11969 


G00637 


Homo saoiens 

! 


Human secreted protein, SEO ID NO: 

m* m^wmimmM ^m^m-m- m^mm mwm, ~mr mrm^mm* mrmw A J m^mr a » m^ V 

4718. 


112 


64 


11970 

1 17(V 


AK024455 


Homo saoiens 


FU00047 protein 


120 


70 


11971 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


105 


61 


11972 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoorotein . 


106 


70 


11975 


G00577 


Homo saniens 


Human secreted orotein SEO ID NO: 
4658. 


70 

i 


75 


11974 


G00491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4572. 


86 


60 


11975 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


161 


72 


11976 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


105 


42 


11977 


B03628 


Homo saoiens 

A AV>4A V m**m+m^ m. ^ * ■ 


Human phospholipase 2 HPPL2. 


121 


58 


11978 


AK024372 


Homo sapiens 


unnamed protein product 


104 


80 


11979 


S58722 

w % * *** 


Homo saoiens 


X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 


137 


77 


11980 


S58722 

•J*/ V * mm mm 


Homo saoiens 


X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 


96 


90 


11981 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


100 


56 


11982 


R59842 


Homo sapiens 


ApoE4Ll protease. 


129 


62 


11983 


K01664 

■ 


Drosophila 
melanogaster 


Bkm-like protein 

* 


132 


53 


11984 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


105 


90 


11985 


M15530 


Homo sapiens 


B-cell growth factor 

^mf 


117 


43 


11986 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


99 


65 


11987 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


117 


84 • 


11988 


L26953 


Homo sapiens 


chromosomal protein 


124 


79 


11989 


U93564 


Homo sapiens 


p40 


523 


88 


11990 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


155 


82 


11991 


AF090894 


Homo sapiens 


PRO0113 


126 


56 


11992 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


129 


63 


11993 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


144 


67 


11994 


U71363 


Homo sapiens 


zinc finger protein zfp6 


213 


68 



455 
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SEQ Ul 
NO: 


'Accession No. 


Species 

* 


Description 


omitD- 

Waterinao 

Score 


— X} 

Identft 

y 


11995 


G00637 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
4718. 


82 


65 


11996 


AF1 30089 


Homo sapiens 


PRO2550 


145 


68 


11997 


R95913 


Homosapiens 


Neural thread protein. 


114 


88 


11998 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


122 


56 


11999 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


147 


81 


12000 


AF090931 


Homo sapiens 


PRO0483 


152 


81 


12001 


AF090931 


Homo sapiens 


PRO0483 


145 


81 


12002 

• 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


124 


64 


12003 


S58722 


Homo sapiens 


X-linked retinopathy protein {O 
terminal, clone XER8c} 


103 


78 


12004 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


133 


51 


12005 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


108 


63 


12006 


L19527 


Homo sapiens 


ribosomal protein L27 


547 


93 


12007 


R96800 , 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


127 


82 


12008 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


127 


82 


12009 


M55409 


Homo sapiens 


pancreatic tumor-related protein 


257 


98 


12010 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


127 


64 


12011 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


291 


89 


12012 


X03717 


Homo sapiens 


pot. unidentified reading frame 


126 


43 


12013 


S58722 


Homosapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


131 


80 


12014 


AB011148 


Homo sapiens 


KIAA0576 protein 


135 


84 


12015 


S58722 


Homo sapiens 


X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 


114 


87 


12016 


AK024455 


Homo sapiens 


FLJ00047 protein 


102 


70 


12017 


L27428 


Homo sapiens 


reverse transcriptase 


144 


87 


12018 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


135 


87 


12019 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


137 


93 


12020 


AK001363 


Homo sapiens 


unnamed protein product 

* * 


490 


100 


12021 


Y27868 


Homosapiens 


Human secreted protein encoded by 
gene No. 107. 


HI 


79 


12022 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 
product 


106 


87 


12023 


W80406 


Homo sapiens 

* 


A secreted protein encoded by clone 
dh40_3. 


157 


58 


12024 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


87 


66 


12025 


R95913 


Homo sapiens 


Neural thread protein. 


157 


45 


12026 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


138 


52 


12027 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


116 


60 


12028 


V00488 


Homo sapiens 


alpha globin 


191 


100 


12029 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


102 


87 


12030 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


114 


79 


12031 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


142 


86 


12032 


X01703 


Homo sapiens 


alpha-tubuiin 


534 


93 


12033 


U09823 


Oryctolagus 
cuniculus 


elongation factor 1 alpha 


557 


90 



456 
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SEO ID 
NO: 


Accession No 


Snecies 


Descriotion 


Sniith - 

Waterman 

Score 


/o 

Identit 

y 


12034 


X01703 


Homo sapiens 


alpha-tubulin 


619 


94 


12035 


AF255556 


Notothenia 
coriiceps 


alpha tubulin 


430 


85 


12036 


X53618 


Paracentrotus 
hvidus 


alpha-tubulin (AA 1-452) 


437 

• 


87 


12037 


M62810 


Homo sapiens 


mitochondrial transcription factor 1 


433 


95 


12038 


Y36156 


Homo sapiens 


Human secreted protein #28. 


157 


75 


12039 


X01703 


Homo sapiens 


alpha-tubulin 


604 


97 


12040 


X01703 


Homo sapiens 


alpha-tubulin 


517 


90 


12041 


S70154 


Homo sapiens 


cytosolic acetoacetyl-coenzyme A 
thiolase, CT {EC 2.3.1.9} 


533 

• 


94 


12042 


X01703 


Homo sapiens 


alpha-tubulin 


472 


85 


12043 


M23613 _j 


Homo sapiens 


nucleophosmin 


428 


80 


12044 


AL031174 


Schizosaccharom 
yces pombe 


hypothetical protein 


102 


52 


12045 


Y94653 


Homo sapiens 


Human netrin-like protein (NEL) 
amino acid sequence. 


152 


82 


12046 


AF0 16507 


Homo sapiens 


C -terminal binding protein 2 


172 


100 


12047 


X05196 


Homo sapiens 


aldolase C 


454 


93 


12048 


X03796 


Mus museums 


aldolase C (aa 1-227) 


351 


74 


12049 


X05196 


Homo sapiens 


aldolase C 


408 


88 


12050 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


147 


70 


12051 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


169 


55 


12052 


AF090944 


Homo sapiens 


PRO0663 


U9 


59 


12053 


V01 577 


Homo sapiens 


variable region 


223 


93 


12054 


G03411 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7492. 


144 


74 


12055 


AF090931 


Homo sapiens 


PRO0483 


121 


74 


12056 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6983. 


105 


76 


12057 


U18339 


Variola virus 


D4L 


100 


52 


12058 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


135 


62 


12059 


AB001684 


ChlorelJa 
vulgaris 


ORF41c 


81 


75 


12060 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


128 


74 


12061 


AK024455 


Homo sapiens 


FLJ00047 protein 


70 


46 


12062 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


130 


69 


12063 


AF090944 


r Homo sapiens 


PRO0663 


110 


66 


12064 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


109 


53 


12065 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


94 


79 


12066 


AF090931 


Homo sapiens 


PRO0483 


123 


70 


12067 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


111 


75 


12068 


G00577 

- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


144 


77 


12069 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


145 


76 


12070 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7720. 


114 


81 


12071 


AF1 16715 


Homo sapiens 


PR02829 


138 


63 


12072 


J02963 


Homo sapiens 


platelet glycoprotein Jib precursor 


127 


82 


12073 


U79260 


Homo sapiens 


unknown 


98 


73 


12074 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


132 


80 
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NO: 




Snecics 

■ 


Description 


Stnitb- 

Wflterman 

Score 


% 

Idea tit 

y 




• 


i 


clone HPMBQ32. 






12075 


J02459 


bacteriophage 
lambda 


T (tail component; 144) 


555 


98 


12076 


AK024455 


Homo sapiens 


FLJ00047 protein 


127 


-65 


12077 


X00911 


Rattus 
norvegicus 


pot. MSA-precursor 


190 


73 


12078 


AF164797 


Homo sapiens 


ribosomal protein LI 7 isolog 


538 


95 


12079 


A06977 


Homo sapiens 


albumin 


602 


87 


12080 


AF000198 


Caenorhabditis 
elegans 


Similar to articular collagen 


118 


38 


12081 


G00637 


Homo sapiens 

i 

* 


Human secreted protein, SEQ ID NO: 
4718. 


142 


86 


12082 


AL359782 

■ 


Trypanosoma 
brucei 


probable similar to ring-h2 ringer 
protein rhal a. 


303 

- 


50 


12083 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


142 


86 


12084 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4494. 


64 


54 


12085 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


132 


53 


12086 


A 14829 


Homo sapiens 


preproapoiipoprotein 


619 


82 


12087 


AF1 19900 


Homo sapiens 


PR02822 


107 


76 


12088 


U52077 


Homo sapiens 


mariner transposase 


277 


74 


12089 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


126 


100 


12090 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) . 


134 


92 


12091 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 
product CL6.5.40. 


115 


42 


12092 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


112 


81 


12093 


G03343 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7424. 


114 


64 


12094 


AJ237660 


Bacteriophage 
21 


Ren protein 


187 


94 


12095 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


135 


80 


12096 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 


49 


12097 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


122 


92 


12098 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


148 


59 


12099 


AF090942 


Homo sapiens 


PRO0657 


315 


70 


12100 


AF090931 


Homo sapiens 


PRO0483 


112 


74 


12101 


AF090944 


Homo sapiens 


PRO0663 


103 


37 


12102 


Y27854 

• 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


76 


58 


12103 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


130 


80 


12104 


AL160493 

• 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


106 


84 


12105 


R95913 


Homo sapiens 


Neural thread protein. 


105 


75 


12106 


G0343S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


149 


75 


12107 


Y09561 


Homo sapiens 


ATP receptor 


188 


85 


12108 


AF090895 


Homo sapiens 


PRO0117 


107 


83 


12109 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


117 


65 


12110 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


65 
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y 








6292. 






12111 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


117 


69 


12112 


Y 12478 


Homo sapiens 


congenital heart disease 5 protein 


291 


62 


12113 


AF220264 


Homo sapiens 


MOST-1 


142 


76 


12114 


Y 12077 


Homo sapiens 


Human 5' EST secreted protein SEQ ID 
NO: 390. 


109 


95 


12115 


G03411 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7492. 


171 


63 


12116 


AF064869 


Rattus 
norvegicus 


brain-enriched guanylate kinase- 
associated protein 2; BEGA2 


294 


98 


12117 

• 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


128 


36 


12119 


G01984 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6065. 


102 


73 

- 


12120 


U79260 


Homo sapiens 


unknown 


98 


65 


12121 


AF161356 


Homo sapiens 


HSPC093 


118 


67 


12122 


AF130087 


Homo sapiens 


PR02411 


115 


80 


12123 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


280 


75 


12124 


S48406 


Mus sp*. 


alpha 1 (XII) collagen {triple-helical 
domain COL2} 


59 


41 


12125 


AB030816 


Homo sapiens 


H-REV 1 07 protein-related protein 


420 


79 


12126 


R96418 


Homo sapiens 


Partial human transforming growth 
factor beta receptor type II. 


169 


100 


12127 


G03801 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


9,1 


46 


12128 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


109 


83 


12129 


G03787 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7868. 


127 


57 


12130 


AF1 19900 


Homo sapiens 


PR02822 


108 


79 


12131 

. 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


127 


70 


12132 


AF303828 


Mus musculus 


ubc-like protein MMS2 


131 


43 


12133 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


127 


58 


12134 


U49974 


Homo sapiens 


mariner transposase 


124 


80 


12135 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


106 


94 


12136 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


151 


69 


12137 


AF192913 


Homo sapiens 


zinc finger protein ZNF180 


581 


94 


12138 


X58907 


Homo sapiens 


steroid 2 1 -monooxy genase 


128 


58 


12139 


AF090931 


Homo sapiens 


PRO0483 


121 


70 


12140 


AK024455 


Homo sapiens 


FU00047 protein 


84 


60 


12141 


AF090931 


Homo sapiens 


PRO0483 


122 


71 


12142 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


153 


60 


12143 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


12 J 


100 


12144 


G02922 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7003. 


191 


62 


12145 


U57092 


Homo sapiens 


Rab30 


257 


87 • 


■ 12146 


AF130089 


Homo sapiens 


PRO2550 


129 


74 


12147 


AF090931 


Homo sapiens 


PRO0483 


138 


80 


12148 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


135 


60 


12149 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


138 


60 
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Score 
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/© 

Ideotit 

y 


12150 


U51723 


Plasmodium ' 
vivax 


V-SERA 1 

■ 


107 

■ 


37 


12151 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


82 


68 


12152 


Y23884 


Homo sapiens 


Amino acid sequence of FK506 
binding protein 65 (FKBP65). 


124 


69 


12153 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


93 


70 


12154 


AF229067 


Homo sapiens 


PADI-H protein 


162 


68 


12155 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


125 


86 


12157 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


113 


39 


12158 


U89439 


Bos taurus 


ubiquitin-like protein 


146 


76 


12159 


G03725 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7806. 


677 


98 


12160 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


124 


90 


12161 


G02363 


Homo sapiens 

i 


Human secreted protein, SEQ ID NO: 
6444. 


109 


80 


12162 


AF090931 


Homo sapiens 


PRO0483 


125 


76 


12163 


U49973 


Homo sapiens 

• 


ORF1; MER37; putative transposase 
similar to pogo element 


191 


64 

• 


12164 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


96 

■ 


61 


12165 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


115 


68 


12166 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


2J9 


93 


12167 


AF1 18086 


Homo sapiens 


PR01992 


155 


78 


12168 


X12789 


Mus musculus 


cytokeratin 8 (AA 1 - 489) 


174 


63 


12169 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


94 


48 


12170 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


150 


57 


12171 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


141 


78 

- 


12172 


AF068294 


Homo sapiens 


HDCMB45P 


184 


56 


12173 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


116 


50 


12174 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


107 


59 


12175 


X92485 

a 


Plasmodium 
vivax 


pval 


108 


44 


12176 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


102 


84 


12177 


X83703 


Homo sapiens 


nuclear protein 


330 


68 


12178 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


147 


50 


12179 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


103 


73 


12180 


AF132200 


Homo sapiens 


PR01751 


131 


64 


12181 


AC005514 


Homo sapiens 


CTF5 


261 


90 


12182 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


156 


75 


12183 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


137 


75 


12184 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


101 


74 


12185 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


156 


56 
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SEQ ID 
NO 
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Species 

* 
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Score 


1 * 

i lucnui 

y 


12186 


G02213 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6294. 


626 


97 


12187 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


96 


51 


12188 


AF1 18067 


Homo sapiens 


PRO 1578 


89 


73 


12189 


M63154 


Homo sapiens 


intrinsic factor 


130 


96 


12190 


AF090942 


Homo sapiens 


PRO0657 


144 


72 


12191 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


76 


12192 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


134 


63 


12193 


S58722 


Homo sapiens 

♦ 


X-linked retinopathy protein {C- 
terminal, clone XEH.8cV 


159 


76 


12194 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


117 


76 


12195 


AF090931 


Homo sapiens 


PRO0483 


134 


62 


12196 


AF 130089 


Homo sapiens 


PRO2550 


117 


74 


12197 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


113 


57 


12198 


G02532 , 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


126 


70 


12199. 


J02459 

* 


bacteriophage 
lambda 


Fi (DNA packaging; 1 17) 


203 


95 


12200 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


135 


58 


12201 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


86 


12202 


AF1 19900 


Homo sapiens 


PR02822 


148 


68 


12203 


Y86248 

■ 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:I63. 


128 


59 


12204 


R59842 


Homo sapiens 


ApoE4L 1 protease. 


118 


91 


12205 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


121 


72 


12206 


S75997 


Rattus sp. 


nucleoporin p62 homolog 


131 


65 


12207 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


145 


65 


12208/ 


S58722 


Homo sapiens 


X-linked retinopathy protein (C- 
terminal, clone XEH.8c} 


145 


86 


12209 


AJ242956 


Homo sapiens 


El fusion protein 


96 


88 


12210 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


332 


78 


12211 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


81 


12212 


K03179 


Homo sapiens 


pro-alpha- 1 type- 1 collagen 


96 


37 


12213 


GO 1895 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
5976. 


310 


57 


12214 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


493 


82 


12215 


R13144 


Homo sapiens 


Deleted in Colorectal Carcinomas. 


700 


98 


12216 


AF 132200 


Homo sapiens 


PR01751 


109 


61 


12217 


AB013897 


Homo sapiens 


HKR1 


324 


64 


12218 


AF071172 


Homo sapiens 


HERC2 


123 


86 


12219 


L38593 


Homo sapiens 


integral membrane protein 


78 


62 


12220 


AF036233 


Homo sapiens 


cdc25B phosphatase 


204 


51 


12221 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


107 


60 


12222 


AL049795 


Homo sapiens 


dJ622L5.9 (eukaryotic translation 
initiation factor 3, subunit 2 (beta, 
36kD) (TRIP-1, TGF-beta receptor 
interacting protein 1)) 


163 


86 


12223 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


76 
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SEQID 
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Species 

** 


Description 


Smith- 

Waterman 

Score 


V. 

Identit 

y 


12224 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


102 


64 


12225 


AF130079 


Homo sapiens 


PR02852 


103 


80 


12226 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


163 


86 


12227 


AF1 19855 


Homo sapiens 


PRO 1847 


96 


78 


12228 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


195 


87 


12229 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


103 


90 


12230 


U49957 


Homo sapiens 


LIM protein 


212 


100 


12231 


AK022759 


Homo sapiens 


unnamed protein product 


646 


100 


12232 


Y01158 


Homo saDiens 


Secreted Drotein encoded bv cene 18 
clone HCACJ81. 


110 

A A V 


70 


12233 


Y27868 


Homo saniens 


Human secreted Drotein encoded bv 
eeneNo 107. 


109 


74 


12234 


W03642 

i 


Homo sapiens 


Hum an cannabinoid GPR N-terminal 
seauence 

iJV^J MVI IWV * 


123 


76 


12235 


AF220264 


Homo sapiens 


MOST-1 


104 


85 


12236 


S58722 

* 


Horn cs oani&nQ 


Y-linW ed retfnonatliv nrotein (CI- 

terminal, clone XEH.8c} 


113 


87 


12237 


G03800 


Homo caniAnC 

IA\JLL1\J ddJ/ICild 


Human oArrpfpH nrnf f» in ST^O TO WO* 

7881. 


143 


84 




AF090947 






101 


DO 


12239 




* 


Hum an c^rrptr-H nrntpin SFO ID NO* 

4435. 


170 

1 A7 

i 


77 


12240 


Y36495 


Hnmn canj pnQ 
nuiiivj oa|7iciia 


Frsiofn^Tit nfViumfln ^ftcrpfrpH rwrttpiTi 
jt lagiuwut u> jiuiiiaii ovviwibu ^Jivvt>jii 

encoded bv cene 27 


135 


75 


12241 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla ( AF078683) 


136 


95 


12242 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal clone XEH 8c\ 


148 


72 


12243 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


118 


39 


12244 


AL3901 14 


T -pichmstfiiji 

maior 


rvmhnfilp /T"ihv-*5?\ ill 1fl7 variant a 

DNA. comDlete virion genome 


78 

/ o 




12245 


AC005200 


Homo sapiens 


plasmalemmal porin 


131 


79 


12246 


G02490 


Homo ^aniens 


Human secreted nrotein SEO ID NO* 
657 L 


150 


65 


12247 


W48351 


Homo saoiens 


Human breast cancer related nrotein 
BCRB2. ! 


90 


55 


12248 


Y02999 


Homo saoiens 


Fragment of human secreted nrotein 
encoded bv zene 121. 


77 


64 


12249 


U63542 


Homo saoiens 


FAP nrotein 


108 


56 


12250 


S58722 


Homo saoiens 


X- linked retinonathv Drotein {C- 
terminal, clone XEH. 8c} 


101 


75 


12251 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


139 


63 


12252 


G00985 


Homo sapiens 


Human secreted nrotein, SEO ID NO: 
5066. 


122 


71 


12253 


AF1 18086 


Homo sapiens 


PR01992 


73 


70 


12254 


AF083929 


Mus musculus 


ES18 


108 


44 


12255 


AF130089 


Homo sapiens 


PRO2550 


124 


88 


12256 


AF090931 


Homo sapiens 


PRO0483 


132 


80 


12257 


X95190 


Homo sapiens 


branched chain acyl-CoA oxidase 


166 


86 


12258 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


131 


^ 3 


12259 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


124 


88 


12260 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


85 
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SEQW 
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Accession No. 


Species 


Descnpfion 


Smith- 

Waterman 

Score 


Identit 
y 








4409. 






12261 

. ■ 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


133 


72 


12262 


GO 1828 


Homo sapiens 


Human secretedprotein, SEQ ID NO: 
5909. 


110 


70 


12263 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


125 


85 


12264 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


126 


65 


12265 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


95 


12266 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


63 


12267 


G03798 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7879. 


166 


73 


12268 


AF090931 


Homo sapiens 


PRO0483 


108 


82 


12269 


S58722 


Homo sapiens 

* 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


120 


68 


12270 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


120 


66 


12271 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


74 


12272 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


129 


60 


12273 


AF130089 


Homo sapiens 


PRO2550 


134 


79 


12274 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


118 


80 


12275 


AF194537 


Homo sapiens 


NAG 13 


151 


52 


12276 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


75 


68 


12277 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


113 


60 


12278 


W88627 


Homo sapiens 
* 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


111 


70 


12279 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


119 


92 


12280 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


90 


12281 


U66464 


Homo sapiens 


hematopoietic progenitor kinase 


102 


83 


12282 


G03438 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7519. 


118 


65 


12283 


W00838 


Homo sapiens 

* 


Tumour necrosis factor-related gene 
product 


162 


82 


12284 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


130 


77 


12285 


AF1 16661 


Homo sapiens 


PR01438 


104 


74 


12286 


G00379 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4460. 


98 


66 


12287 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJM 
oncoprotein. 


105 


74 


12288 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


129 


78 


12289 


X53581 


Rattus 
norvegicus 


ORF7 


111 


47 


12290 


AF090931 


Homo sapiens 


PRO0483 


147 


87 


12291 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


106 


69 


12292 


AP000616 


Oiyzasativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


137 


53 


12293 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163: 


142 


56 
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SEQ ID 
NO; 


Accession No. 


Species . 


Description 


Smith- 
Waterman 


% 

Identit 


12294 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO 163 


117 


60 


12295 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMB032 


112 


66 


12296 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478 


140 


62 


12297 


G03798 


Homo sapiens 


Human, secreted protein, SEQ ID NO: 
7879 


144 


70 


12298 


G02455 


Homo sapiens • 


Human secreted protein, SEQ ID NO: 
6536 


95 


51 


12299 


S79410 


Mus fnuscutufi 


nuclear 1oriali7^tion qional hindim* 

XI Uvluul llFwallMlUUU DlgXlCLL UXllUXXlg 

nrotein 


108 


70 


12300 • 


Y02671 


Homo saniens 


Human secreted orotein encoded ov 
cene 22 clone HMSJW18 


122 


85 


12301 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888 


1 17 


73 


12302 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492 


115 


62 


12303 


AF130051 


Homo sapiens 


PRO0898 


116 


65 






xlUIUU dapiCxxd 


piaicjci giyuupruiciii iiu prcwir&ur 


110 


Dv 


12305 


V00662 


Homo sapiens 


ATPase 6 


118 


100 




/ir II// 'H/o 


xiomo sapiens 


UW1 Z5 


oc 


00 


12307 


A05308 


synthetic 
construct 


glioblastoma-derived T-cell suppressor 
iacxor 


376 


93 


12308 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 

Clone nx IVlJDv^JZ. 


173 

i 


71 


12309 


S58722 


Homo sapiens 


X-linked retinopathy protein {O 
terminal, clone XEH.8c} 


140 


74 


12310 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


98 


47 


12311 


Y2757I 


Homo sapiens 


Human secreted protein encoded by 

oAne lSfn ^ 


130 


42 - 


12312 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 


134 


76 


12313 
12314 


S58722 
U79260 


Homo sapiens 
Homo sapiens 


X-linked retinopathy protein {C- 
terminal clone XEH 8c V 

unknown 


98 
97 


82 
64 


12315 
12316 


L 27428 
D00570 


llnmn esinipn<3 

A A L/I XI KJ OuL/lUliJ 

Mus musculus 


ICrW jw U OlilSwl ll/UUC 


115 


83 


12317 


rvr U7V/7J i 




open reading frame (196 AA) 

PR OflASn 




74 


12318 


G02485 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 

UJUU. 


123 


84 


12319 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5000 

J7U7. 


107 


70 


12320 


G00412 


xxuixivi otx^fkviii> 


Wiimnn QwrpfpH nrr\tf*in ^F!fl TTj "KIO* 
xxuiiiuu owvicicu |/iuicxxx, oivy JLi-» nvi 

4493. 


07 ! 




12321 




rmicptiliic 

IVIUO XlJ UdW»UllU> 


mif*l**flir 1f%i*Ali7Atirtn cionnl HinHino 

llUl/lCOX lUWHlltiBllUU SMglicU l/XXlUlXXg 

protein 




SR 

JO 


12322 


G00362 


XX.UXXXU OdjpiVXlO 


Human c*w*rptpH urntflin 5JEO 1 1 TWO* 
XJLUXJlall acuicicu piuiviu, ody xxy 

4443. 






12323 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492, 


115 


75 


12324 


AF286472 


Homo sapiens 


retinitis pigmentosa GTPase regulator 


121 


53 


12325 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


162 


75 


12326 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


129 


75 


12327 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


125 


88 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


12328 


AF090944 


Homo sapiens 


PRO0663 


122 


1 76 


12329 


AF090931 


Homo sapiens 


PRO0483 


156 


81 


12330 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 
product 


107 


80 


12331 


AF116715 


Homo sapiens 


PR02829 


127 


72 


12332 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


145 


49 


12333 

■ 


S80862 

• 


Western equine 
encephalomyeliti 
s virus Western 
equine 

encephalomyeliti 
s virus, Peptide 
Partial, 259 aa 


nucleocapsid gene C 

■ 

- 


108 . 

• 


35 


12334 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


141 


83 


12335 


AK024455 


Homo sapiens 


FLJ00047 protein 


133 


71 


12336 


AF090944 


Homo sapiens 


PRO0663 


139 


84 


12337 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


124 


71 


12338 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


102 


69 


12339 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


121 


81 


12340 


G03807 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7888. 


149 


54 


12341 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


134 


66 


12342 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


102 


60 


12343 


AF 118086 


Homo sapiens 


PRO 1992 


80 


63 


12344 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


104 


80 


12345 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


145 


72 


12346 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


143 


75 


12347 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


116 


37 


12348 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


176 


76 


12349 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


103 


48 


12350 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


81 


12351 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


108 


70 


12352 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


101 


84 


12353 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


105 


42 


12354 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


147 


66 


12355 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


125 


65 


12356 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


96 


68 


12357 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


122 


43 


12358 


AL1 60493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


132 


38 


12359 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


90 
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SEQ ID 
NO: 


Accession No. 

i 


Species 


Description 


Smttb- 
Wa term an 
Score 


% 

Identit 

y 




1 


* 


4478. 






12360 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


56 


12361 


AB001684 


Chlorella 
vulgaris 


ORF49b 


92 


44 


12362 


U93564 


Homo sapiens 


p40 


257 


84 


12363 


U60269 


Homo sapiens 


putative envelope protein; orf similar to 
env of Type A and Type B retroviruses 
and to class I! HERVs 


427 


84 


12364 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: ' 
7145. 


102 


73 


12365 


S58722 


Homo sapiens 

i 


X-Knked retinopathy protein {C- 
terminal, clone XEH.8c} 


102 


68 


12366 


AF090944 


Homo sapiens 


PRO0663 


94 


45 


12367 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


150 


55 


12368 

■ 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


105 


85 


12369 


AF130089 


Homo sapiens 


PRO2550 


112 


78 


12370 


AF09089£ 


Homo sapiens 


PRO01 17 


125 


64 


12371 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


121 


95 


12372 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


145 


72 


12373 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


72 


12374 


S58722 

* 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH. 8c} 


133 


79 


12375 


AP000001 


Pyrococcus 
horikoshii 


58aa long hypothetical protein 


96 


51 


12376 


U16359 


Ratals 
norvegicus 


nitric oxide synthase 


105 


76 


12377 


S58722 


Homo sapiens 
* 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


123 


78 


12378 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


126 

, • 


83 


12379 


Y53643 


Homo sapiens 

• 


A bone marrow secreted protein 
designated BMS6. 


329 


98 


12380 


U64094 


Homo sapiens 


soluble type II interleukin-1 receptor 


114 


100 


12381 


J02459 


bacteriophage 
lambda 


D (head-DNA stabilization; 110) 


467 


100 


12382 


L10908 


Mus museums 


Gcapl gene product 


100 


89 


12383 


AF220264 


Homo sapiens 


MOST-1 


113 


68 


12384 


U79260 


Homo sapiens 


unknown 


94 


60 


12385 


B01372 


Homo sapiens 


Neuron-associated protein. 


148 


52 


12386 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


116 


63 


12387 


G00952 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5033. 


125 


92 


12388 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


130 


65 


12389 


S79410 


Mus museums 


nuclear localization signal binding 
protein 


150 


62 


12390 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


102 


38 


12391 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


113 


56 


12392 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


73 


92 
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SEQ1D 
NO: 


Accession No. 


Species 

it* 


Description 


Smitb- 

Waterraan 

Score 


Identit 

y 


12393 


M69297 


Homo sapiens 


ORF3 


167 


75 


12394 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


142 


43 


12395 


AF220264 


Homo sapiens 


MOST-1 


131 


57 


12396 


G02532 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6613. 


139 

■ 


62 


12397 


AF1 30089 


Homo sapiens 


PRO2550 


120 


62 


12398 


AK023117 


Homo sapiens 


unnamed protein product 


529 


97 


12399 


AF123652 


Homo sapiens 


FEZ1 


157 


96 


12400 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


139 


87 


12401 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


306 


60 


12402 


AC00503 1 


Homo sapiens 


neuronal apoptosis inhibitory protein 


143 


96 


12403 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


585 


100 


12405 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


125 


100 


12406 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct 


150 


63 


12407 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


157 


85 


12408 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


120 


75 


12409 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


141 


50 


12410 


G03465 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7546. 


103 


51 


12411 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


147 

i» 


71 


12412 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


159 


70 


12413 


S58722 


Homo sapiens 

• 


X-linked retinopathy protein {C- 
terminal, clone XEH. 8c} 


129 


79 


12414 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


156 


72 


12415 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


74 


72 


12416 


AL390114 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


102 


50 


12417 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


171 


75 


12418 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


137 


55 


12419 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


HI 


52 


12420 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


129 


53 


12421 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


110 


55 


12422 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


126 


87 


12423 


U93564 


Homo sapiens 


p40 


350 


93 


12424 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (P0) 


484 


88 


12425 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


130 


69 


12426 


X60376 


Brassica napus 


proline-rich protein 


89 


47 


12427 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


83 


88 


12428 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


117 


75 


12429 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


107 


67 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Idcntit 

y 


12430 


AK027208 


Homo saoiens 


unnamed orotein product 


108 


64 


12431 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
cene 44 clone HTDAD22 


154 


68 


12432 


AF1 30089' 


Homo sapiens 


PRO2550 


118 


88 


12433 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983 


90 


55 


12434 


AK024455 


Homo sapiens 


FLJ00047 protein 


113 # 


75 


12435 


AP000616 

ill UVUU 1U 




similar to RTNf*-H7 fino*er nrotpin 
RHAla (AF078683 > > 


107 


80 


12436 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888 


114 


75 


12437 


AC005498 


Homo saniens 


R31665 2 


163 


75 


12438 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAlaf'AF0786831 


132 


100 


12439 


G031 16 




Human opcreteri nrntein SFO TO NO* 

-1 iUlillQ.lI ObblvlvU LJJUL&UJL, JUy 1 JLV 

7197. 


145 


84 


12440 


AJC024455 


Hrvmn QfinipnQ 


FLJ00047 Dmtein 




1% 
§ j 


17441 






Vfitmnn cprrptprl nrrvt^tn QFf^V TD MO* 
XlUJUOll dCUlvLCU LUU1CI11, OCy XJ^r llv. 

4478. 


177 


07 


1944? 




nuiuvj oapiCllo 


jm 00/5^7 
rivuvoj / 


17A 


79 


12443 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


53 


19444 


r\.r i JUU07 






1 1 n 


/J 


12445 


AL021395 


Homo sapiens 


dJ269M15.1 (similar to peptidylprolyl 


225 


64 


19446 


AFOQOO'JI 
/\.r \i ™ \i ✓ j i 


nutilU oaDICIla 


PR O04 




70 


17447 


f*07Sl S 

VJ V/i. J 1 J 


14nmn cnnipnc 


nuiiLau. stcuicicu pruLcin, ocy xu i^u. 

6596. 




01 


1744ft 




Unmn cpnipnc ■ 

nuiuu bajjiciib 


WTuman c*»r*r<3t^>H nrrttoin Cpn 1 P> ATO* 
XlUilloJU SCU CU/U pi U LCI 11, OC^ tU IN L/. 

4473. 


17? 


74 


17449 


AF1 19855 


I4nm r\ cani pn c 
11U1UU ociJJlvlO 


PR01JU7 

ji\\/i(rt/ 


Oo 


7? 


124 SO 


G0032R 


UnmA QAntPflC 


Human cftfrftftfl t\rntf>in QFO TT^ Wfi* 

4409. 


147 
it/ 


€>*T 


174S1 


AIC024455 


ilwlUU OCUSiwlld 


FT T0n047 nrntAin 


7R 


oo 


17452 


AF116712 

AT 1 IV// IX. 




PR077^R 


1 1X 
1 1 o 


6? 


12453 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868 

/ DUO. 


109 


67 


12454 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 


144 


59 


12455 


AK024455 


Homo sapiens 


FLJ00047 protein 


146 


64 








Hiimflii cprrrtpH nrnfpin &PO TTi KJO* 
jaujuoii scuiclcu pii/iwiu, Ouy i i y ny, 

6613. 




77 


12457 


AF090944 






17Q 


7Q 
fy 


12458 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582 


119 


70 


12459 


R22278 


Homo sapiens 


Human gp .-specific component 

UllwUvlyl/v \Jvl» 


250 


65 


12460 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561 


143 


58 


174A1 


YR71 5fi 
I Or I JO 




T— It im an cpnrf>ti=>H nrntoi'n cAniiPn/>o ^T70 

num<m bcurcica protein i>cc|u.cncc ocv^ 
ED NO: 195. 


in 
i j l 


rtl 


12462 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


147 


84 


12463 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


94 


38 


12464 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


134 


74 


12465 


AF200715 


Homo sapiens 


PTB domain adaptor protein CE1>6 


332 


95 


12466 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


64 
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CPA ■ TV 

NO: 


Accession no. 


ope tics 


iscscripuun 


Waterman 
Score 


0/ 

Identit 

y 








7867. 






12467 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


117 


55 


12468 


AF1 99023 


Homo sapiens 


phospholipid scramblase 4 


476 


98 


12469 


AF265575 


Homo sapiens 

• 


ubiquitous TPR-motif protein Y 
isoform 


104 


89 


12470 


AF108841 


Homo sapiens 


pol protein 


306 


87 


12471 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


111 


79 


12472 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


124 


78 


12473 


U18339 


Variola virus 


D4L 


117 


60 


12474 

■ 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


124 


63 


12475 


AF 13 0089 


Homo sapiens 


PRO2550 


105 


52 


12476 . 


G04078 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
8159. 


80 


62 


12477 


M91242 


Rattus 
norvegicus 


calcium channel alpha-1 summit 

• 


156 


68 


12478 


AF 1 30089 


Homo sapiens 


PRO2550 


154 


79 


12479 


M15530 


Homo sapiens 


B-cell growth factor 


154 


70 


12480 


U39904 


Mus museums 


citron 


718 


97 


12481 


R99364 


Homo sapiens 


Human REST protein DNA binding 
domain. 


229 


37 


32482 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


97 

■ 


54 


12483 


AF161356 


Homo sapiens 


HSPC093 


94 


55 


12484 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase protein. 


97 


75 


.12485 


AF090942 


Homo sapiens 


PRO0657 


103 


55 


12486 


U79260 


Homo sapiens 


unknown 


98 


74 


12487 


AL160493 * 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


96 


53 


12488 


U37100 


Homo sapiens 


aldose reductase-like peptide 


400 


97 


12489 


U93569 


Homo sapiens 


putative pi 50 


141 


56 


12490 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


104 


66 


12491 


R59842 


Homo sapiens 


ApoE4Ll protease. 


86 


89 


12492 


Y00876 


Homo sapiens 


Human LAPH-1 protein sequence. 


219 


45 


12493 


AK024455 


Homo sapiens 


FLJ00047 protein 


85 


53 


12494 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


95 

* * 


63 

■ 


12495 


Y36722 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 98. 


216 


60 


12496 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


102 


57 


12497 


Y77551 


Homo sapiens 


C-terminal domain of betalc integrin. 


132 


67 


12498 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


122 


69 


12499 


AF161356 


Homo sapiens 


HSPC093 


98 


51 


12500 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


150 


81 


12501 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


97 . 


52 


12502 


AF1 32200 


Homo sapiens 


PR01751 


94 


69 


12503 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


76 


65 


12504 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


99 


75 



469 
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SEQ ID 


Accession No. 


Species 


Description 


Smitb- 




NO: 






Watennan 


Identit 








* 


Score 


y 


12505 


* At 4 A*m — — . — 

M19651 


Rattus 


fos-related antigen 


132 


92 






• 

norvegicus 








12506 


G00354 


Homo salens 


Human secreted protein, SEQ ID NO: 


113 


48 








4435. 






m j**. mm mm 

12507 


U94832 


Homo sapiens 


KSRP 


« m\ mm 

117 


48 


12508 


W47029 


Homo sapiens 


Human N-proteinase (70 kDa short 


121 


62 








form). 






12509 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 


124 


66 


■ 






RHAla(AF078683) 






12510 


G03807 


• 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


80 








7888. 






12511 


AF042384 


Homo sapiens 


BC-2 protem 


516 


98 


12512 


Y12950 


Homo sapiens 


Amino acid sequence of a human 


96 


50 








■a ■ ■ ■ ■ 

secreted peptide. 






12513 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


45 








***** m* 9 ** 

7563. 






12514 


AL 162044 


Homo sapiens 


hypothetical protem 


1 M A 

174 


mm A 

48 


12515 


AF130079 


Homo sapiens 


PR02852 


96 


73 


12516 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


113 


77 


12517 


D32202 


Homo sapiens 


alpha i C adrenergic receptor isoform 2 


164 


86 


12518 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


85 






- - 


7001. 





• . 


12519 


U00694 


Gallus gallus 


vitamin D3 hydroxylase associated 


187 


45 






. ... 


protein 






12520 


AB030829 


Rattus 


carbonic anhydrase III 


503 


63 






* 

norvegicus 








12521 


J04204 


Bostaurus 


32 kd accessory protein 


618 


95 


12522 


M36341 


Homo sapiens 


ADP-ribosylation factor 4 


374 


93 


12523 


AF090931 


Homo sapiens 


PRO0483 


163 


82 


12524 


AB007925 


Homo sapiens 


KIAA0456 protein 


130 


62 


12525 


G02314 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


79 




• 


■ 


6395. 






12526 


AF178948 


Homo sapiens 


TALE homeobox protein Meis2a 


321 


93 


12527 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


156 


85 




. _ _ _ 


brucei 


complete virion genome; 






12528 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


57 








7492. 






12529 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


68 








6953. 


• 




12530 


AK024372 


Homo sapiens 


unnamed protein product 


1 12 


56 | 


12531 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


76 


65 








ft 1 4 m M 

encoded by gene 17. 




• 


12532 


G00328 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


132 


66 




i 




a> a t\ a 

4409. 




i 


12533 


A 9 ** AAA AM 

G03807 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


t A mf 

146 


■ m> mm 

53 








*W A A A 

7888. 


■ - 




12534 


AF090942 


Homo sapiens 


PRO0657 


83 


60 


* A*m\ mm>mmk. MM 

12535 


W48351 


■ nmmr ft 

Homo sapiens 


WW ^^^^ u m _1 • 

Human breast cancer related protem 


136 


'A 9 *** 

65 








BCRB2. 






12536 


R95913 


Homo sapiens 


Neural thread protein. 


- mm mfi* 5 

106 


52 


12537 


AF130089 


Homo sapiens 


PRO2550 


148 


84 


12538 


AF102826 


Homo sapiens 


RD1 14/simian type D retrovirus 


605 


A 4 

91 








receptor 






12539 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


143 


52 








SEQIDNO:163. 






12540 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


65 








4435. 






12541 


G03600 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


209 


92 








7681. 







470 



WO 01/64835 



SEQ ID 
NO: 


Accession No. 


Species 

•»' 


Description 


Smith- 
Waterman 
Score 


Edcotit 

y 


12542 


M31964 


Saimiriine 
Herpesvirus z ■ 


collagen-like protein 


92 


52 


12543 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


76 


12544 


Yoo24o 


riomo sapiens 


Human secretea protein huhttoo, 
SEQIDN0:163. 


l 


iZ 


1 O C A C 

12545 


AC004912 


Homo sapiens 


similar to OKlo, bH3 domain oinaing 
pro Kiln, similar to zzu jjhua 

JJL/.g 1 JO l\if\I) 


■ 


OQ 


12546 


V00662 


Homo sapiens 


URF A6L (NADH dehydrogenase 
suoumi/ 


175 


91 


12547 


AF 130089 


Homo sapiens 


PRO2550 


112 


40 


12545 




Homo sapiens 


1 T nfwwAtarl nvA^ain OCA TT"\ "NT/"^\* 

Human secretea protein, o.bv<J rsu. 

0013. 


i no 

1 \)y 


*fo 


io</io 


ADD 1 IU 


riomo sapiens 


neurite ouigrowm-prornoinig protem 


A70 


79 

. 'Am 


1Z3jU 


AyoZyo 


riomo sapiens 


uoiquitm nyaroiase 


3UU 


OO 


1 Of CI 

12551 


K94317 

■ 


Homo sapiens 


T^nMJt^fi m i4a ■ ■ Jm f jinn ^~ m n mi 'niiltn^AMAn Tj I) 

Hepatocyte proliferation suostance nr- 

U4I v. 




09 1 


12552 


G01623 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

J /U4. 


142 


90 

• 


12553 


D86966 


Homo sapiens 


similarto human ZFY protein. 


161 


56 


1 O CCA 

12554 


DU057U 


Mus museums 


open reading trame UVo AA) 


too 1 


O/ 


12555 


Ar 130089 


Homo sapiens 


rKO2550 


OA 

y4 


QQ 
OO 


12556 


A rAii'l 1 OA 

Ar 043 184 


Homo sapiens 


1 cell receptor beta cnain 


007 


o4 


12557 


U93564 


Homo sapiens 


p40 


154 


100 


12558 


U42026 


Homo sapiens 


pJasma membrane Ca2+-ATPase 
variant 4a FMCA4a 


200 


fit 
97 


12559 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


103 


50 


12560 


ITOACO 

U90552 


Q T AB aak A m^ mmm mt mm mmm mm 

Homo sapiens 


m. — ^^mt k^^but& « f *M 

butyropnum 


J JO 


OS 

yo 


12561 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


156 


37 


12562 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


I3o 


41 


12563 


VI ^ AC A 

XI 6454 


it 7 „" " 

Homo sapiens 


carcinoembryonic antigen subdomain B 


353 


no 

98 


12564 


AL031186 


Homo sapiens 


DK984G 1 .4 (Ewmg sarcoma breakpoint 
region 1 protein) 


119 


100 


12565 


M68941 


Homo sapiens 


protein-tyrosine phophatase 


223 


93 


12566 


Y14487 


Homo sapiens 


cytosolic serine 
nydroxymetnyltransierase 


145 


84 


12567 


PAID?') 

(502832 


Homo sapiens 


TTii.ni n .1 finnu4iul pen TT^ XfA\, 

Human secreted protem, bbxi iu isu. 
6913. 


14y 


01 


125 oS 


A D/lACA/11 

ABU UD 047 


T V mf^mmmmmm AMm At» A^m-m. m #% 

Homo sapiens 


oH3 Dindmg protem 


1*4 


jZ 


12569 


PAO/« 1 1 

O0341 1 


T T ill ■ >■ il_ ^"fc . . m ^m mm- mm 

Homo sapiens 


Human secretea protem, bey iu invj. 
7492. 


lUz 


<1 
jj 


12570 


vio i >n 
A68142 


Uryctolagus 
cuniculus 


elongation tactor l gamma 


O/o 


yj 


12571 


L20315 


Mus musculus 


Mrb i protem 




1 1 


12572 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) u 1 1 02, variant a dna, 

mmm m^m* mmmimm mmma ■ M mm mm. m m m, mmmmx A^m mmm m%m m^m. mmm mm^mmmmlm. mWm 

complete virion genome. 


114 


50 


12573 


A T? 1 ?AAOA 

Arl3O089 


Homo sapiens 


FRU2550 


1 Oil 

124 


•71 

fx 


12574 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


51 


12575 


AF005902 


Monodclphis 
domestica 


kinesin homolog 


534 


82 


12576 


Y56021 


Homo sapiens 


Human CD40 receptor interacting 
protein 4C4. 


711 


96 


12577 


Y99662 


Homo sapiens 


Human GTPase associated protein- 13. 


634 


100 


12578 


Z48008 


Saccharomyces 
cerevisiae 


Soklp 


225 


100 



471 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


y. 

Identit 

y 


12579 


AF1 12207 


Homo sapiens 


translation initiation factor elF-2b delta 
subunft 


195 


97 


12580 


M22538 


Homo sapiens 


NADH-ubiquinone reductase 


224 


82 


12581 


AB034730 

* 


Mus musculus 

* 


This gene. is isolated by means of 
differential display method using ttw, 
an excellent mouse model for ectopic 
ossification.; similar to megakaryocyte 
stimulating factor precursor and 
cartilage superficial zone protein 


112 


44 

• 


12582 


U93567 


Homo sapiens 


p40 


364 


93 


12583 


AY007233 


Homo sapiens 


phosphomositol 3-phosphate binding 
protein-1 


475 


100 


12584 


L07592 


Homo sapiens 


peroxisome prohferator activated 
receptor 


m\ ■m /X 

113 


81 


12585 


AE003682 


Drosophila 

• 

melanogaster 


CG8135 gene product 


.191 


38 


12586 


AF1 19851 


Homo sapiens 


PR01722 


88 


64 


12587 


AF090895 


Homo sapiens 


PRO0U7 


144 


59 


12588 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


144 


57 


12589 


U73199 ! 


Mus musculus 


Rho-guanine nucleotide exchange 
factor 


111 


32 


12590 


J04615 


Homo sapiens 


small nuclear ribonucleoprotein 


383 


•97 


12591 


L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


60 


33 


12592 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


118 

* 


78 


12593 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


123 


83 


12594 


AF090944 


Homo sapiens 


PRO0663 


% A 9 ?** 

152 


82 


12595 


Y15243 


Homo sapiens 


ULBP-3 ammo acid sequence. 


852 


100 


12596 


X98235 


Drosophila 
melanogaster 


type I 


159 


42 


12597 


AF191309 


Mus musculus 


zinc finger protein 


407 


66 


12598 


W82841 


Homo sapiens 


Human cerebral protein- 1. 


107 


91 


12599 


AF070664 


Homo sapiens 


HSPC008 


128 


89 


12600 


U03688 


Homo sapiens 


cytochrome P450 


257 


96 


12601 


AB015798 


Homo sapiens 


DnaJ homolog 


226 


73 


12602 


AC003O58 


Arabidopsis 
thatiana 


unknown protein 


88 


40 


12603 


G03267 


Homo sapiens 


W ■ 1 • m^m^\4^± M f ^ " ^ 

Human secreted protein, SEQ ID NO: 
7348. 


80 


35 


12604 


AF200187 


cercopithicme 
herpesvirus 15 


EBNA2-iike protein 


109 


28 


12605 


AF161532 


Homo sapiens 


HSPC047 


720 


100 


12606 


M21302 


Homo sapiens 


small proline rich protein 


60 


59 


12607 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


107 


47 


12608 


AJ302650 


Rattus 
norvegicus 


RP59 protein 


599 


54 


12609 


J05401 


Homo sapiens 


sarcomeric mitochondrial creatine 
kinase precursor (EC 2.7.3.2) 


763 


96 


12610 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


216 


87 


12611 


W78175 


Homo sapiens 


Human secreted protein encoded by 
gene 50 clone HSTAG52. 


337 


100 


12612 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


145 


66 


12613 


AF119855 


Homo sapiens 


PR01847 


157 


61 



472 
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SEQIO 


Accession No. 


Species 


Description 


Smith- 




NO: 






■ 


Waterman 


Identit 






- 




ocurc 


y 


12614 


U82301 


WnmA Qfintpnc 


11TtlfTlAVt/T1 
UiirviHJ W 1 1 


107 


69 
oz 


1961 5 


f?nA6i7 


nOTJiU dCtpiCILD • 


Unman cpt r^f«»fl rirntAin jr> TsIO* 

xiuman sccrcica pj oicin, jcy il/ in vj. 


146 
i to 


40 < 

4y 








471 R 






19616 




Jtionio SapiCUS 


JrJ\.^JUo7o 


9S4 


^6 
JO 


19617 




IvallUS 


"MP^t79_rAlat'AH ftif*tAT 9 
lNF**X^Z**rcialCU IaUlUi Z 


X07 

Ov f '. 


jy 






w 

. Jiorvcgibus 








1961 R 
IZOi o 


Vfl9 1 6R 
I UZ J Oo 


rlQmo Sapiens 


A iaciiJiauvc giucose rransporiei 


4S9 


1 AA 








piULCUl \JLt\J io. 






1961Q 

1 ZO 1 7 


HOA107 

\JUUJ7/ 


riuiiiu sapiens 


n Lull dii scwrcieu proicuij ocy iu in 


1 SR 


R4 

0*4 








4478 






1969A 

1 ZOZU 


HA1R19 


numu Sapiens 


Unman c&f^TAtorl r\r/\tAin QCA TTt "Mf^" 

xi urn an sccrcieu prousm, ocy jul» inw/. 


946 


6n 
on 








78Q1 






J Z UZ 1 


YSS6R1 

AJJUOj 




AYtf^ncin ( flncc T\ 
calchjiu yi<iass iy 


7 A 


41 






C !>v U 1CI1 I U 111 








19699 

1 ZOZZ 


API flQQ(i7 


xiuinu sapiens 


<5t64 

lj 1 UH 


Q1 


AA 


19691 


V161 56 

I JOlJy 


nuoiu oapiciiD 


Unman ca/ k i b a1'a/4 nvni^in ^f9fi 

n mud ii sci/iCLCu proicui rrZO. 


1 14 


61 


1 9694 
JZ0z4 


A 171 16719 
/vTlIO/lz 


xiomo sapiens 


PT?n9715l 


1 A4 


4R 
4o 


1969^ 
JzOzO 




jiomo Sapiens 


Unman pancafo^ MfntAim C"C/^ TT\ XJ/^. 

xiuman secrereu proiein, ocxi uj vs\j. 


ftl 
o J 


6* 1 








! /U/j. 






IzOZO 


Ar 1 13oo3 


xiomo sapiens 


nn rAAQT/i 

rKUUV /4 


1 f\A 

1 v4 


43 


12627 


AL1 60371 


Leishmania 


probable (nnv-o) ul 1 02, vanant a 


116 


45 






major 


JL>NA, complete virion genome 






IzOzo 


A 1?1 AT/! A/Z 

At 1 U /4U0 


xiomo sapiens 


uWlzo 


133 


35 


12629 


G03415 


Homo sapiens 


Human secreted protem, SEQ ID MO: 


100 


57 








T40iC 






lzo30 


Ax504 /oUU 


Macaca 


nypotnetical protein 


1 TO 

1 /z 


oo 






iaSClCuialTS 




| 




Jzoil 


IjU233o 


Homo sapiens 


xiuman secreted protein, ohKl lu inu. 


1 JO 


03 








661 o 
ooiy. 






12632 


Ar 1 1 5u5z 


Homo sapiens 


nn /"\ "1 A A*> 


135 


45 


1Z033 


fJAA107 


xiomo sapiens 


xiuman secretea protem, ii/ inu. 


1^1 
131 


A9 

Oz 








44/5. 






1Z034 


A3Z-104 


mus muscuius 


i^3 uu protem ^aa i-//) 




35 


1961^ 
1z03j 


A I?A6/I^O'7 


xiomo sapiens 


T TKTX7 1 1iL-o Mm^tAin 

ijiiNts-i iuce protein 


1 AC 


46 
40 


19616 
1/030 


VA96Q1 

i UZ0y3 


Homo sapiens 


xiuman secretea protem encoceu oy 


IK 
1 JO 


40 

4y 








gene 44 clone xIIL/auzz. 






JZ03/ 


W4533 i 


xiomo sapiens 


xiuman oreast cancer reiatea protem 


1 1 A 
1 AU 


<9 

JZ 






X3V/X\X>Z. 






1961R 


V9A71 7 


xiomo sapiens 


xiuman neurom am en t-jvi wuu type 


70 


7A 








pruicm iragment jy. 






19610 


1M9Q699 

ivizyozz 


ivjLus muscuius 


open rcaauig name z 


R1 


6A 
OH 


19/C4A 


Mz04OU 


xiomo sapiens 


retmoDiastoma i 


1 AA 
1UU 


43 


19£41 
1Z 041 


ojo /ZZ 


xiomo sapiens 


A.-unKea retmopatny protein \v^- 


1 1 A 
1 1U 


61 
03 








tArmm»1 rlAnA YT7H Slf\ 






19640 
lz04z 


AxJU4 /sou 


ivLacaca 


nypometicai protein 


1 AA 
XUU 


^9 
3Z 






IaSClCUlarlS 








19647 
IZ043 


fSA'S 1 11 


xiomo sapiens 


xiuman secretea protem, ojbv^ iu nl>. 


140 


41 
41 








7914 






JZ044 


V144R9 
I l****5z 


nomo Sapiens 


riagmem or n urn an secrciea protem 


141 


4R 
*fo 








encoueo oy gene i /. 






1964^ 
1Z043 


A T? A0A049 


xiomo sapiens 


rxiUUO j / 


13/ 


34 


12646 


G02872 


W nnnn innipn<: 


Hum nn secreted nrntem SPO TT> "NO* 


130 


56 








6953. 






12647 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


267 


65 








6953. 






12648 


AF130079 


Homo sapiens 


PR02852 


109 


57 


12649 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


61 








7868. 






12650 


X55686 


Lycopersicon 


extensin (class H) { 


65 


35 



473 
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* * 



SEQID 


Accession No. 


Spedts 


Description 


Smith- 


% 


NO: 






• 


Waterman 


Idcnttt 












y 






WvUiwlilUUI 








17651 


G01714 


Unmn canlpna 


Tinman cwrf*t#»r1 nrntpin TT^ NO- 


87 


ft4 








7795 






1 9659 


API 10851 


nvillU Dap I CJ la 




1 50 


70 
/u 


~ 17fi5l 


Y09671 


riUiilU sapiens 


Human ctwPtpH nrAtPin pn/*AHpH Hv/ 
xiuuiaii acucicu piuiciii ciivuucu uy 




50 








cene 22 clone HMSJ W 1 8 






19654 


Y45187 


ITUIJIU sapiens 


Human cpcrptprl nrAfPin frawm pnt 
. nmutui scvoicicu piuiciu uaKiiJcjiL 


146 


70 
/u 








pticoHpH from opnp 9ft 






19655 


nftni54 


nuiiio sapiens 


Unman cpprPtP/4 nirAtPin CP ^ TT> MO' 
nuiiioii dcucicu proLciuy dcy ul/ in 


n6 

1 JO 


61 








4435 










Haiti a canipnc 
n\Jiwj sapiens 


Human cpfTPtpH nrntein SPO IT) NO* 




57 






* * 


7870 

tot \f» 






iz>u.s / 


TT01561 


JTUlilU sapiens 


mitativp a! 5 A 
puiauve piJU 


1046 


50 


1965ft 


nAnRQ 

UUj / 07 * 


Jl 01710 Sapiens 


Utimon ea^AtoH ntrktoin QC/^i TT™*4 XJO • 

nunian sccreicu proicin, oivv^ lls iskj. 


146 
J*rO 


70 

/v 








7870 

fO/\J. 






17650 


V17Q50 


nuxiiu sapiens 


Aminn ar*i/1 cAmipnr*p /if a tinman 
rVUllHU OUlii aCLjuCLiUC UX a IlUlIldll 


104 


70 








QPf k n*tpH APTtrirfp 


• 




IxOOU 


A 171 10R51 
AT 1 170J1 


fiumo Sapiens 


Pl?r\1799 


Oft 


56 

JO 


1X001 


Q 5 51799 


xiomo sapiens 


A-iinKeu reunopauiy proiein \v^- 


do 


65 
OJ 








tprminal rlAnP YPH Sr>\ 

iciuixiiax, i/iunc yvr.ri.oc/ 






1 9££9 
1ZOOZ 


riAi9£i 


rlOniO Sapiens 


Human cM«r»Wt nmtAtn QCH T T~\ XT . 

jtiuman secreiea protein, jc^ iu r»u. 


50 


64 














1/00 J 


A 171 1 ©AQ9 

Ari louoZ 


Homo sapiens 


DD/^I] OA9 




00 


1ZOO-* 




ivius tnuscuius 


ocap i gene proauci 


1 1j 


50 I 


1Z 00 J 


tJii saoi 


JrilaSeOJUS 


nyaroxyproune-ncn giycoproiem 


194 


17 






vulgaris 


* 






1ZOOO 


AI?AOAftQ5 


riomo sapiens 


PPAA1 17 
rJ\SJ\) 11/ 


76 
/O 


57 


lZOO/ 


+ 


Homo sapiens 


Human secreted protein, anv^ iu inu. 


1 A^ 


/II 








7 87 a 






lzooo 




Homo sapiens 


Human oreast cancer related protein 




01 






J3wi\JDZ. 










homo sapiens 


r>-ceu growtn tactor 


OA 


SA 
5U 


1x0 /U 


AT71 QQAAH 


Aspergillus 


ooo riDospmai protein jlj 


91 *7 
Zl / 


<o 
oy 






niQUlauS 






19671 
1XO f 1 


nAi7QA 


xiuino Sapiens 


nunian scvieteu proiein } ocv^ lu iskj. 


117 
1 j f 


51 
Jl 








7R71 






1 9679 
izo/x 


Y90717 
I ZU / 1 / 


nomo sapiens 


xiuman ncurouiaineni-ivi wim iypc 


77 
/ f 


6ft 








piuieui lTagnxeui ^>7. 






19671 

1XO / J 


A FA 550ft 5 


vynunuuerea 


pyiTUiiuuxie a *riui aiiiigcii 


67 


40 






VUlYUlUS 








1 9674 

JXU /t 


y 704 17 




4nC riHACAmal YtrAtPin Q 1 9 


177 


75 


19675 
JZO fD 


AT71 074O6 

Ar it/ /-wo 


xioroo sapiens 


9ft 


74 


CI 

ji 


19676 
iXO ivj 


V71066 
I / jyOO 


noroo sapiens 


Unman nrncfatA ti im r\r PCT -fVo tvi *»«+ 

riuiiidii prubiaLc ujjnor co l jxagnxcni. 


OQ 


67 








HpHvpH ntntpin iil 5^ 
uexxvcu jpxutejxi truj. 






i^u / r 


G07877 


xiuii&u sapiens 


Human cpprptprl nrAtpin CT70 TTi NO* 
xjiuxixau acwicicu piuiciii^ ox^v^ wj nv/. 


177 


65 








6953 






12678 


G0059G 

VVVV/W 


Hnrnn qanipnc 

A X\JkH\r OtlUlviw 


Human qecreted nrotein SFO TT> KfO* 


99 


62 








4671. 






12679 


GO 1984 i 


Homo ^aniens 


Human secreted nrotein SFO TDNO* 

llUIUttU SvWJlwlvU 1/ivlwUi} OijV^ 1 ' # 1.1 VpXv 


128 


85 








6065. 






12680 


AF 194537 

ill * S~~J *J f 


Homo ^aniens 


NAG 13 


142 

JL i Z* 


59 


12681 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


155 


77 


12682 


AF161356 


Homo sapiens 


HSPC093 


104 


50 


12683 


AFI07406 


Homo sapiens 


GW128 


84 


54 


12684 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


90 


39 


12685 


V40883_cdl 


Homo sapiens 


03-DEC-1997 Coding sequence of 


728 


100 








clone BG366 2. 






12686 


M24732 


Homo sapiens 


lamitvlike protein 


94 


54 


12687 


X00824 


Gallus gallus 


collagen 


66 


42 
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SEQ ID 

NO: 


Accession No. 


Species 

if* 


Description 


Smith* 

Waterman 

Score 


% 

Identit 

y 


12688 


G03790 


Homo sapiens 

# 


Human secreted protein, SEQ ID NO: 
7871. 


184 

- 


64 


12689 


AF1 18086 


Homo sapiens 


PR01992 


140 


61 


12690 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


131 


54" 


12691 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


180 

* 


72 


12692 

• 


G03790 

- ■ 


Homo sapiens 

m 


Human secreted protein, SEQ ID NO: 
7871. 


104 

• 


72 

• 


12693 


AF130089 


Homo sapiens 


PRO2550 


204 


51 


12694 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


92 


52 


12695 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


110 


52 


12696 


L39103 


Homo sapiens 


glycoprotein lb alpha 


61 


45 


12697 


M15530 


Homo sapiens 


B-cell growth factor 


121 


64 


12698 


M36913 


Zea mays 


cell wall protein (put.); putative 


75 V 


40 


12699 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


68 


12700 


M24732 


Homo sapiens 


lamin-like protein 


95 


35 


12701 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


67 


50 


12702 


AF298594 


Nicotiana alata 


arabinogalactan protein 


105 


30 


12703 


X92485 


Plasmodium 
vivax 


pval 


97 


38 


12704 


AF210651 


Homo sapiens 


NAG18 


89 


64 


12705 


G03790 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7871. 


190 

■ 


53 


12706 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


93 


78 


12707 


U52077 


Homo sapiens 


mariner transposase 


451 


51 


12708 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


244 


77 


12709 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


138 


65 


12710 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


99 


82 


12711 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


104 


34 


12712 


AF242772 


Homo sapiens 


mesenchymal stem cell protein 
DSCD28 


106 


45 


12713 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


116 


36 


12714 


AF1 16715 


Homo sapiens 


PR02829 


139 


68 


12715 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


89 


62 


12716 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


86 


54 


12717 


Z93891 


Hegeter politus 


cytochrome oxidase 


72 


48 


12718 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


105 


46 


12719 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


74 


46 


12720 


U30221 


Crithidia 
fasciculata 


NADH dehydrogenase subunit 5 


92 


39 


12721 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


131 


82 


12722 


AF1 07406 


Homo sapiens 


GW128 


72 


54 


12723 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


55 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
. Waterman 
Score 


% 

Identlt 

y 








6720. 






12724 


AF1 16661 


Homo sapiens 


PR01438 


145 


59 


12725 


Y53871 


Homo sapiens 


A human brain-derived signalling 
factor polypeptide. ~ 


584 


98 


1272$ 


L10908 


Mus musculus 


Gcapl gene product 


95 


37 


12727 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


124 


37 


12728 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


66 


36 


12729 


G01984 

• 


Homo sapiens 

. 


Human secreted protein, SEQ ID NO: 
6065. 


110 


55 


12730 


L23852 


Homo sapiens 


Putative 3* end of coding region; 
putative 


83 


37 


12731 


Y13141 

* 


Bromheadia 
finlaysoniana 


extensin 


59 

T 


48 


12732 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


88 


69 


12733 


AF090895 


Homo sapiens 


PRO0117 


130 


55 


12734 


R95913 


Homo sapiens 


Neural thread protein. 


162 


45 


12735 


U72355 , 


Homo sapiens 


Hsp27 ERE-TATA-binding protein 


972 


57 


12736 


AF247039 


porcine 
adenovirus 3 


163R* 


108 


35 


12737 


M22332 


Homo sapiens 


unknown protein 


148 


45 


12738 


AF090895 


Homo sapiens 


PRO0117 


89 


72 


12739 


B08525 


Homo sapiens 


Protein encoded by a novel gene 
associated with colon disease. 


296 


86 


12740 


X90872 


Homo sapiens 


associated to Golgi apparatus 


116 


54 


12741 


M26460 


Homo sapiens 


retinoblastoma 1 


75 


37 


12742 


S52010 


Mus sp. 


orfl 5' of EpoR 


104 


31 


12743 


G02639 


Homo sapiens 

■ 


Human secreted protein, SEQ ID NO: 
6720. 


88 


50 


12744 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


111 


47 


12745 


AF090944 


Homo sapiens 


PRO0663 


152 


65 


12746 


Y21106 


Homo sapiens 


Human bc!2 proto-oncogene wild type 
protein fragment 3. 


62 

* 


64 


12747 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


97 


76 


12748 


X66861 


Mus musculus 


Hox-1.4 


92 


34 


12749 


AF166125 


Homo sapiens 


selenoprotein N 


978 


98 


12750 


W87504 


Homo sapiens 


Human N-methyl-D-aspartate receptor 
sub-unit encoded by clone NMDA24. 


98 


29 


12751 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


91 


78 


12752 


X72963 


Nicotiana 
tabacum 


pAP8 product 


80 


45 


12753 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


123 


61 


12754 


AF146191 


Homo sapiens 


FRG1 


208 


85 


12755 

■ 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


154 


75 


12756 


GO3790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


72 


48 


12757 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


111 


51 


12758 


U83280 


Leishmania 
donovani 


39 kDa antigen 


106 


60 


12759 


AF1 16661 


Homo sapiens 


PR01438 


73 


48 


12760 


AF164615 


Homo sapiens 


envelope protein, 


257 


83 
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. * 



SEQ ID 
NO: 


Accession No. 


Species 


Description 

• 


Smith- 
Waterman ' 
Score 


% 

Ideotit 

y 


12761 


Y02671 

• 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


135 


70 


12762 


AF073519 


Homo sapiens 


small EDRK-rich factor 1, long isoform 


111 


50 


12763 


M29622 


Mus musculus - - 


open reading frame 2 


73 


68 


12764 


L27428 


Homo sapiens 


reverse transcriptase 


111 


60 


12765 


S79410 

* 


Mus musculus 


nuclear localization signal binding 
protein 


106 


45 . 


12766 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


66 


12767 


M36914 


Zea mays 


cell wall protein (put.); putative 


78 


36 


12768 


L27428 


Homo sapiens 


reverse transcriptase 


159 


59 


12769 


AJ005567 


Mus musculus 


SPR2I protein 


55 


39 


12770 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


115 ' 


82 


12771 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


76 


64 


12772 


AF2 17374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


126 


85 


12773 


AF130114 


Homo sapiens 


PR02459 


129 


50 


12774 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


220 


71 


12775 


AF287482 

. . » 


Chlorobium 
tepidum 


OrfJ22 


166 


68 


12776 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


84 


50 


12777 


AF130079 


Homo sapiens 


PR02852 


258 


58 


12778 


AF090931 


Homo sapiens 


PRO0483 


65 


76 


12779 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


152 


66 


12780 


AB047948 


Macaca 
fascicularis 


hypothetical protein 

* * * 


59 


37 


12781 


AF1 19900 


Homo sapiens 


PR02822 


132 


59 


12782 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


97 


51 


12783 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


77 


77 


12784 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


93 


84 


12785 


L27428 


Homo sapiens 


reverse transcriptase 


183 


39 


12786 


AF1 19851 


Homo sapiens 


PR01722 


128 


61 


12787 


AFl 16715 


Homo sapiens 


PR02829 


111 


72 


12788 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


77 


12789 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


84 


66 


12790 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 


71 


12791 


AF124729 


Mus musculus 


acinusS' 


122 


50 


12792 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


86 


52 


12793 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


108 


74 


12794 


M62415 


Pseudopleuronec 
tes americanus 


HPLC6 


82 


35 


12795 


Y36156 


Homo sapiens 


Human secreted protein #28. 


133 


48 


12796 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


74 


12797 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


170 


32 


12798 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


114 


43 


12799 


Y76184 


Homo sapiens 


Human secreted protein encoded by 


90 


60 
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• ■ 



cva in 

SEQ ID 


Accession Mo. 


apCCICS 


Description 




~5T 

SO 


NO: 








Waterraan 


Identit 






• 




Score 


y 








gene 61. 






12800 


X62677 


Oryctolagus 


retrovirus related reverse transcriptase 


102 


38 






cuniculus 


* 






12801 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


69 








6613. 






12802 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


177- 


69 




* 


■ 


6613. 






12803 


AF130089 


Homo sapiens 


PRO2550 


94 


74 


12804 


S77772 


Homo sapiens 


aspartylglucosaminidase, AGA {C- 


83 


68 


• 


• 




tenninal, alternatively spliced} {EC 








• 




3.5.126} 




• 


12805 


AF063866 


Melanopius 


ORF MSV233 hypothetical protein 


82 


28 






sanguinipes 


• 








■ 


entomopoxvims 


• 


• 




12806 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


80 


60 


12807 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


47 








4414. 






12808 


X03145 


Homo sapiens 


pot. ORF V 


150 


34 


12809 


LI 0908 


Mus musculus 


Gcapl gene product 


111 


34 


12810 


Y95028 , 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ED 


170 


55 


■ 




- 


NO: 128. 




■ 


12811 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


62 






• 


6566. 






12812 


AJ233591 


Mus musculus 


reverse transcriptase 


287 


67 


12813 


U82303 


Homo sapiens 


unknown 


149 


52 


12814 


AF107406 


Homo sapiens 


GW128 


103 


58 


12815 


U82303 


Homo sapiens 


unknown 


93 


83 


12816 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


54 








6613. 






12817 


BO 1390 


Homo sapiens 


Neuron-associated protein. 


81 


38 


12818 


AF287482 


Chlorobium 


Orfl22 


174 


69 






tepidum 








12819 


AE003499 


Drosophila 


CG12706 gene product 


166 


28 






melanogaster 


• 






12820 


M15530 


Homo sapiens 


B-cell growth factor 


124 


71 


12821 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


161 


53 


12822 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


77 








6574. 






12823 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


66 






4578. 






12824 


AF064597 


Homo sapiens 


LINE-1 like protein 


64 


45 


12825 


X71442 


Rattus 


ORF 1; putative 


113 


45 




• 


norvegicus 


• 






12826 


U62039 


Elephantulus 


reverse transcriptase 


74 


46 






edwardii 


• 






12827 


M15530 


Homo sapiens 


B-cell growth factor 


127 


54 


12828 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


84 






• 


6595. 






12829 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


59 


57 


12830 


AF229067 


Homo sapiens 


PADI-H protein 


184 


61 


12831 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


69 


34 


12832 


AF144054 


Homo sapiens 


apoptosis related protein APR-4 


83 


46 


12833 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


61 








6613. 






12834 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


40 








7214. 






12835 


U19098 


Ly copers icon 


proline-rich protein 


81 


38 






chilense 








12836 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


229 


68 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smitn- 

Waterraan 

Score 


% . 
Identit 

y 


12837 


Y13141 


Bromheadia 
finlaysoniana 


extensin 

• 


81 


60 


12838 

• 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 




71 


12839 


X72963 


Nicotiana 
tabacum 


pAP8 product 


94 


40 


12840 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


315 


56 


12841 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


94 


55 


12842 


AK025947 


Homo sapiens 


unnamed protein product 


124 


57 


12843 


AF159055 


Homo sapiens 


leucine zipper-like protein 


69 


55 


12844 


AF040257 


Homo sapiens 


TNF receptor homolog 


98 


50 


12845 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


94 


76 


12846 


X88799 


Oryza sativa 


DNA binding protein 


94 


37 


12847 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 

■ 


98 


48 


12848 


G01984 


Homo sapiens 

1 


Human secreted protein, SEQ ID NO: 
6065. 


164 


56 


12849 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435; 


75 

• 


59 


12850 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


122 


54 


12851 


M31964 


Saimiriine 
herpesvirus 2 


collageri-like protein 


79 


50 


12852 


AF 11 8082 


Homo sapiens 


PRO1902 


94 


77 


12853 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


139 


54 


12854 


R59842 


Homo sapiens 


ApoE4Ll protease. 


108 


82 


12855 


G03295 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
7376. 


99 


66 


12856 


GQ3789 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7870. 


89 


56 


12857 


D29833 


Homo sapiens 


proline rich peptide P-B 


64 


52 


12858 


G03714 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7795. 


155 


80 


12859 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


88 


61 


12860 


G02639 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


66 


33 


12861 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


75 


47 


12862 


M29622 


Mus museums 


open reading frame 2 


74 


62 


12863 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


141 


70 


12864 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


117 


54 


12865 


AF090894 


Homo sapiens 


PRO0113 


108 


63 


12866 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


121 


54 


12867 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


103 


44 


12868 


S68106 


Ascaris suum, 
Peptide Partial, 
100 aa 


type IV collagen alpha 2 chain, alpha 2 
(IV) {alternatively spliced} 


75 


35 


12869 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


97 


56 


12870 


G02872 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6953. 


146 


62 



t 
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NO: 


ACCCSSIOD INO. 


o pec ics 


Description 


omiCu- 

Watcrman 

Score 


% 

Identit 

y 


12871 

i 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


78 


76 


12872 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


105 


56 

• 


12873 


AB030234 


Canis farniliaris 


D4 dopamine receptor 


61 


58 


12874 


M22332 


Homo sapiens 


unknown protein 


128 


41 


12875 


U05313 


Trypanosoma 
brucei 


CR3 


67 


48 


12876 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


65 


12877 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


66 


54 


12878 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


78 


34 


12879 


G03790 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
7871. 


122 


51 


12880 


AF130079 


Homo sapiens 


PR02852 


125 


75 


12881 


AF090895 


Homo sapiens 


PRO0117 


142 


59 


12882 


AF181251 

i 


Rattus 
norvegicus 


lung Kruppel-Iike factor 

1 


93 


39 


12883 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


167 


74 


12884 


M26460 


Homo sapiens 


retinoblastoma 1 


136 


41 


12885 


AJ277557 


Homo sapiens 


mitochondrial 5'(3>. 
deoxyribonucleotidase (dNT-2) 


280 


100 


12886 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


119 


62 


12887 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


43 


12888 


AF119851 


Homo sapiens 


PR01722 


94 


50 


12889 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


121 


39 


12890 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 


68 


12891 


AK024455 


Homo sapiens 


FLJ00047 protein 


77 . 


51 


12892 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


115 


83 


12893 


L27428 


Homo sapiens 


reverse transcriptase 


89 


29 


12894 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 
WO9901020. 


112 


72 


12895 


G02597 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
6678. 


72 


62 


12896 


AK023563 


Homo sapiens 


unnamed protein product 


260 


64 


12897 


G02832 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
6913. 


76 


66 


12898 


AC002304 


Arabidopsis 
thaliana 


F14J16.29 


111 


36 


12899 


Y86445 


Homo sapiens 


Human gene 42-encoded protein 
fragment, SEQ ID NO:360. 


83 


53 


12900 


AF116638 


Homo sapiens 


PR01546 


123 


57 


12901 


AB010361 


Mus musculus 


mszf47 


64 


43 


12902 


R59842 


Homo sapiens 


ApoE4Ll protease. 


130 


86 


12903 


L06498 


Homo sapiens 


ribosomal protein S20 


249 


56 


12904 


LI 3635 

* 


Rattus 

* 

norvegicus 


growth response protein 


176 


69 


12905 


AF156961 


Homo sapiens 


gag 


183 


44 


12906 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


207 


67 


12907 


AF107406 


Homo sapiens 


GW128 


99 


50 


.12908 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


93 


75 
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* 



SEQID 
NO: 


Accession No. 


Species 1 


Description 


Smith- 

Watermflfl 

Score 


Identit 

y 


12909 


AF263540 


Homo sapiens 


MIB006 


1809 


100 


12910 


Y48292 


Homo sapiens 


Human prostate cancer-associated 
protein 78. 


60 


50 


12911 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


179 


87 


12912 


Y02999 

■ 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


108 


72 


12913 


272499 


Homo sapiens 


herpesvirus associated ubiquitin- 
specific protease (HAUSP) 


5242 


99 


12914 


AF092136 


Homo sapiens 


PTD015 


297 


100 


12915 


AF161356 


Homo sapiens 


HSPC093 


123 


42 


12916 


AF044670 


Homo sapiens 


33 kDa Vamp-associated protein; VAP- 
33 


506 


90 


12917 


W27087 


Homo sapiens 


Human transforming growth factor 
alpha HIII. 


1243 


100 


12918 


Y59807 


Homo sapiens 


Human normal ovarian tissue derived 
protein 84. 


111 


43 


12919 


AJ388518 


Canis familiaris 


non-histone chromosomal protein 
HMG-17 


108 


84 


12920 


AK023392 


Homo sapiens 


unnamed protein product 


119 


60 


12921 


Y0267I 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


104 


87 


12922 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


124 


81 


12923 


X78444 cdl 


Homo sapiens 


21-MAY-1997 Human UCSP-2 cDNA. 


1090 


100 


12924 


AF 116661 


Homo sapiens 


PR01438 


89 


52 


12925 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488.. 


82 


80 


12926 


AF1 19855 


Homo sapiens 


PRO 1847 


120 


63 


12927 


AF090931 


Homo sapiens 


PRO0483 


92 


90 


12928 


G03800 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7881. 


119 


88 


12929 


Y91429 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 21 SEQ ID NO:150. 


108 


38 


12930 


Y12661 


Homo sapiens 


neuro-endocrine specific protein VGF 


2506 


99 


12931 


Y84546 


Homo sapiens 


Amino acid sequence of a fragment of 
human collagen type 1 protein. 


61 


50 


12932 


G02485 


Homo sapiens 

* 

• 


Human secreted protein, SEQ ID NO: 
6566. 


. 121 


53 


12933 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


104 

■ 


52 


12934 


G04072 

■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. . 


104 

■ 


60 


12935 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


331 


77 


12936 


M15530 


Homo sapiens 


B-cell growth factor 


83 


71 


12937 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


146 


64 


12938 


R59842 


Homo sapiens 


ApoE4Ll protease. 


112 


61 


12939 


AF090895 


Homo sapiens 


PRO0117 


94 


80 


12940 


X73459 


Homo sapiens 


signal recognition particle subunit 14 


549 


98 


12941 


L23545 


Homo sapiens 


putative 


141 


45 


12942 


AF054178 


Homo sapiens 


CI-B14.5ahomolog 


238 


84 


12943 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:I63. 


110 


55 


12944 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


143 


55 


12945 


D00570 


Mus musculus 


open reading frame (251 AA) 


166 


34 


12946 


U62039 


Elephantulus 


reverse transcriptase 


63 


50 
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SEQID 
NO: 


' Accession No. 


Species 

* 


Description 


Stnitb- 
Wa terra an 
Score 


[ % 
Idcotit ; 

y 






cdwardii 








12947 


AE003499 


Drosophila 
melanogaster 


CG 1 2706 gene product 


104 


31 


12948 

• 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


65 


100 


12949 


X70343 


Nicotiana 
sylvestris 


extensin 


102 


38 


12950 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


103 


45 


12951 


G03789 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7870. 


85 


56 


12952 


AF090895 


Homo sapiens 


PRO0117 


139 


68 


12953 


AF090896 


Homo sapiens 


PRO0131 


93 


85 


12954 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


101 


75 


12955 


M96982 


Homo sapiens 


U2 snRNP auxiliary factor small * 
subunit 


292 


68 


12956 


X71087 


Homo sapiens 


monocyte chemoattractant protein 
(MCP-3) 


408 


96 


12957 


S80864 , 


Homo sapiens 


cytochrome c-like polypeptide 


591 


68 


12958 


AF109907 


Homo sapiens 


S164 


188 


63 


12959 


W48351 


Homo sapiens 

* 


Human breast cancer related protein 
BCRB2. 


173 


50 


12960 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


168 


62 


12961 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


98 


56 


12962 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


106 


84 


12963 


G02532 

• 


Homo sapiens 

* 


Human secreted protein, SEQ ID N6: 
6613. 


125 

■ 


44 


12964 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


107 


57 


12965 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BGGF). 


71 


65 


12966 


AC009853 


Arabidopsis 
thaliana 


hypothetical protein 


102 


44 


12967 


AF090942 


Homo sapiens 


PRO0657 


99 


62 


12968 


MI 5530 


Homo sapiens 


B-cell growth factor 


90 


65 


12969 


AF090895 


Homo sapiens 


PRO0117 


83 


54 


12970 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


119 


54 


12971 


G03438 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
7519. 


123 


57 


12972 


L32558 


Homo sapiens 


sequence is expressed in human Tera-2 
clone 13 (embryonal carcinoma) cells. 
The sequence may contain mismatches 
(one strand sequenced only once). 97% 
identical in 320 bp overlap with human 
54 kDA prot; ORF 


426 

■ 

• 


85 


12973 


AF008196 


Homo sapiens 


baxepsilon 


78 


87 


12974 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


96 


80 


12975 


AL031848 


Homo sapiens 


(U202O8.2.2 (novel rodent HES2 (hairy 
and Enhancer of Split 2) LIKE protein 
(isoform 2)) 


176 


100 


12976 


AJB45905 


Chlorocebus 
aethiops 


HSBP Mike protein 


94 


100 


12977 


AF067519 


Homo sapiens 


PITSLRE protein kinase beta SV1 
isoform 


1923 


93 
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SEQ ID 


Accession No. 


Species 


Description 


Sraith- 


% 


NO* 








Wfttennfl n 












Score 


y 


12978 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


73 






■ 


7888. 


- 




12979 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


105 


39 


12980 


Y13141 


Bromheadia 


extensin 


53 


36 






finlaysoniana 








12981 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


77 


• 






7795. , 






12982 


G02211 


Homo sapiens 

ft 


Human secreted protein, SEQ ID NO: 


83 


89 








6292. 


* 




12983 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: . 


92 


4* 








4478. 






12984 


AF1 13685 


Homo sapiens 


PRO0974 


111 


67 


12985 


U31086 


Gallus gallus 


neuron-glia adhesion molecule 


52 


50 


12986 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

ft 9 ^ 


132 


68 








7870. 






12987 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


71 








4433. 






12988 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


135 


81 


* 


• 




gene No. 101. 






12989 


AB029042 


Homo sapiens 


ATPase inhibitor precursor 


364 


100 


12990 


G00442 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


174 


72 








4523. 






12991 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


174 


57 








4718. 






12992 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


57 








4473. 






12993 


AFi 18082 


Homo sapiens 


PRO1902 


11 


78 


12994 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


76 . 


60 








4718. 






12995 


M15530 


Homo sapiens 


B-cell growth factor 


105 


76 


12996 


AJ005562 


Mus musculus 


SPR2D protein 


104 


36 


12997 


Y48346 


Homo sapiens 


Human prostate cancer-associated 


184 


77 








protein 43. 






12998 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


82 


53 








gene 22 clone HMSJW18. 






12999 


LI 1645 


Homo sapiens 


alpha-tubulin 


196 


68 


13000 


AFI 13685 


Homo sapiens 


PRO0974 


123 


52 


13001 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


78 


53 








terminal, clone XEH.8c} 






13002 


AJ005564 


Mus musculus 


SPR2F protein 


76 


44 


13003 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


76 








4478. 






13004 


AL159178 


Streptomyces 


putative secreted protein 

* * 


89 


37 




■ 


coelicolor A3 (2) 








13005 


L77967 


Ovis aries 


small proline-rich protein with paired 


64 


36 




• 




repeat 






13006 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


85 


70 






i 


7324. 






13007 


AF068294 


Homo sapiens 


HDCMB45P 


237 


52 


13008 


G00382 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


128 


50 






* 


4463. 


• 




13009 


AFI 44054 


Homo sapiens 


apoptosis related protein APR-4 


100 


75 


13010 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


67 








7534. 






13011 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


152 


66 








clone HPMBQ32. 






13012 


B01390 


Homo sapiens 


Neuron-associated protein. 


99 


32 


13013 


A27282 


Homo sapiens 


TGR-CL3C 


69 


42 


13014 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


46 
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» ■ - 



NO: 






TV*e<*ri r>H ft n 

VwClluUUU 


Smith* 

Waterman 
Score 


Identit 

7 




1 


1 


4770. 






13015 


AF130089 


Homo sapiens 


PRO2550 


88 


60 


13016 


AF266223 


Gillichthys 
mirabilis 


ribosomal protein L27 


122 


50 


13017 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


116 


40 


13018 


U82303 


Homo sapiens 


unknown 


64 


60 


13019 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


70 


57 


13020 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


169 


91 

■ 


13021 


V00488 


Homo sapiens 


alpha globin 


213 


100 


13022 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


59 


13023 


G00381 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
4462. 


109 


86 


13024 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7553. 


151 


84 


13025 


U82303 


Homo sapiens 


unknown 


86 


52 


13026 


Y02999 , 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


124 


61 


13027 

* 


G00689 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4770. 


83 


50 


13028 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


74 


63 


13029 


Y84546 


Homo sapiens 


Amino acid sequence of a fragment of 
human collagen type 1 protein. 


59 


46 


13030 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


96 


68 


13031 


S71494 


Mus sp. 


SmD homolog {Gly-Arg repeat} 


93 


55 


13032 


AF 113685 


Homo sapiens 


PRO0974 


100 


48 


13033 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


242 


85 


13034 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


178 


41 


13035 


G01129 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5210. 


228 


81 


13036 


X662B5 


Mus musculus 


HCIORF 


103 


44 


13037 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


90 


55 


13038 


U47924 


Homo sapiens 


RPL13-2 


424 


63 


13039 


AF090942 


Homo sapiens 


PRO0657 


111 


44 


13040 


AK0251 16 


Homo sapiens 


unnamed protein product 


131 


52 


13041 


G00403 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4484. 


92 


72 


13042 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


100 


69 


13043 


AF130089 


Homo sapiens 


PRO2550 


109 


45 


13044 


G02879 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6960. 


116 


52 


13045 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


162 


70 


13046 


L10908 


Mus musculus 


Gcapl gene product 


162 


40 


13047 


L02321 


Homo sapiens 


glutathione S-transferase GSTM5-5 


1072 


93 


13048 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


89 


60 


13049 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


153 


60 


13050 


U33547 


Homo sapiens 


MHC class H antigen 


69 


85 


13051 


AC006014 


Homo sapiens 


similar to mismatch repair proteins; 
similar to PID:gl304125 


652 


91 
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t 

WO 01/64835 PCT7US01/04927 

* 



SEQ ID 

NO: 


Accession No. 


Species 

*** 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 
y 


13052 


A27282 


Homo sapiens 


TGR-CL3C 


65 


51 


13053 


Y69166 


Homo sapiens 

A 

■ 


A mature human N-acetyiglycosaminyl 
transferase protein. 


106 


60 


13054 


M20030 


Homo sapiens 

* 


small proline rich protein 


57 


33 


13055 


AF022U7 


Balaena 
mysticetus 


metallothionein 


119 

■ 


62 


13056 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


78 


43 


13057 

* 


R20305 


Homo sapiens 


Val(l) to Met, His(2) deleted, Ala(76) 
to Lys beta-globin mutant 


274 


98 


13058 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


56 


39 


13059 


U10696 


Zea mays 


Ec metallothionein class II protein 


66 


26 


13060 


Y40417 


Homo sapiens 


A human N-acetylneuraminate lyase 
(hNANL) protein. 


1183 


80 


13061 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


125 


41 


13062 


AF002210 


Homo sapiens 


copper chaperone for superoxide 
dismutase 


702 


61 


13063 


AF1 18086 


Homo sapiens 


PR01992 


69 


84 


13064 


M24097 


Homo sapiens 


MHC HLA-C-alpha-2 chain 


1550 


96 


13065 


AC003113 


Arabidoosis 
thaliana 


F2401.6 


51 


40 


13066 


AF161531 


Homo sapiens 


HSPC046 


488 


97 


13067 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


141 

i 


50 


13068 


AL359782 


TrYDanosoma 
brucei 


nossible fhhv-6\ ul 102. variant a dna. 
complete virion genome. 


116 

m A w 


46 


13069 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


145 


69 


13070 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


224 


71 


13071 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


153 


52 


13072 


AF2 16965 


Homo sapiens 

Jr 


ancient conserved domain protein 3 


876 


100 


13073 


Y15917 


Homo sapiens 


COL1A1 and PDGFB fusion transcript 


108 


35 


13074 


X77664 


Homo sapiens 


retinoic acid receptor beta isoform 1 


53 


71 


13075 


AF156961 


Homo sapiens 


gag 


136 


37 


13076 


R59842 


Homo sapiens 


ApoE4Ll protease. 


117 


75 


13077 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


121 


65 


13078 


U23 1 83 


Caenorhabditis 
elegans 


gene lies in inverted repeat and exon 1 
overlaps tRNA; may be pseudogene 


100 


79 


13079 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


56 


13080 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


138 


65 


13081 


Y12473 


Homo sapiens 


centrin 


359 


79 


13082 


G01518 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5599. 


375 


98 


13083 


Y57891 


Homo sapiens 


Human transmembrane protein 
HTMPN-15. 


1030 


99 


13084 


AB046765 


Homo sapiens 


KIAA1545 protein 


161 


93 


13085 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


66 


40 


13086 


AF130079 


Homo sapiens 


PR02852 


137 


45 


13087 


K01664 


Drosophila \ 
melanogaster 


Bkm-like protein 


86 


33 


13088 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


44 



485 



WO 01/64835 PCT/US01/04927 



SEQ1D 

NO: 


Accession No. 


Species 

• 


Description 


Smith- 
Waterman 
Score 


Identit 
v 








7893. 






13089 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


114 


63 


13090 


AL121905 


Homo sapiens 


dJ534B8.3 (novel protein similar to an 
asD attic protease) 


963 


100 


13091 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867 


95 


47 


13092 


AB030236 


Canis familiaris 


D4 dopamine receptor 


65 


38 


1 JU7J 


AF161316 


Homo QanipnQ 


HSPC093 


119 


48 


1 30Q4 


714014 


Tslipr»tijmn 

rM lt«V Holla 

. tabacum 


Pistil evtensin like orotesin. nartial CDS 

only 


67 


50 


iJv/J 


Gft1 740 


Jl villi \J caj/IvJJO 


Human ^ecr&f&d nrotein SEO ID NO* 

XjUlllCUl OvvlwlVU piUlbill) tJJU<V^ XX^r llV/t 

5330. 


124 


56 




Y77R6R 


n.uiiivf sapiens 


T-Tumnn qprretpH nrotpin en prided hv 

gene No. 107. 


159 


73 




nn^7i4 


Untnn canipnc 
xlUilll/ oapivilS 


Human <spprptpH nrAtpin RPO TTj "WO* 

7795. 


139 


62 


1 10QQ 


a piinn^i 
Ar i juuj i 


JaOIIlvJ Sapiens 


PROftRQR 


194 


61 
\j i 


13100 


Y86445 

* 


Homo sapiens . 


Human gene 42-encoded protein 
fragment, SEQ ID NO:360. 


80 


70 


111 A1 

13101 


O025jo 


Homo sapiens 


nmnan secreted protein, otxi id inu. 
6619. 


in 




111 no 


1 14 


Homo sapiens 


tiuman secreieG protein, onv^ u-' jnu. . 
7795. 


170 


71 
/ 1 


111 Al 
13 lUi 




riomo sapiens 


riuman secretea proiein, aty il* xnvj. 
4414. 


1 \(\ 
1 ID 


OJ 


111 flA 

13104 


TTOOQT1 


caenornaDuius 
elegans 


s miliar to kjj uuiaem repeal region 01 
RD protein (nuclear rna-binding 

pruLciii y 


OA 

yo 

- ■ 


OH 


13105 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


56 


.1 J I UO 




v^aeaurnaDams 

ClGgtlllo 


a miliar iu jl/. mciaiiv/gadivi iiuuiculiw 
nrotein BarH2 rPIR*A41726^ 


101 


7S 


13107 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla ^AF078683^ 


100 


65 


13108 


Y02886 


Homo sapiens 


Fragment of human secreted protein 

pnmdpd hv ppnp 00 


137 


77 


13109 

* 


AL390114 


Leishmania 

Ulojui 


probable (hhv-6) ul 102, variant a 

0*MA cnrnnlete virinn oeinrime 

1^1 1 i») w\Jlll£/lwlw T111V/11 gvUwlllw 


94 


88 


13110 


AF1 19855 


Homo sapiens 


PR01847 


103 


49 


inn 


AF1 IQR^I 


XJotti r\ ennicne 
nUillVJ duJJlvllS 


PP0 1772 


142 


80 


13112 


Y00311 


Homo sapiens 


Human secreted protein encoded by 

<rpnp ^4 


107 


51 






HVim a canipnc 

XllUJf IS ooplClJD 


^prrptpd ntYitpin encx\(\&A hv pene 04 

UvVlblvU L/Awlvlll uUwUwU Vj H u ^ • 

clone HPMB032. 


104 


54 


13114 

X *f X X T 


G02532 


Homo saoiens 


Human secreted orotein. SEO ID NO: 
6613. 


144 


71 

• * 


13115 


G03356 


Homo sanien*; 


Human secreted nrotein SEO ID NO: 
7437. 


138 


50 


13116 


G00397 

UUUJ7 / 


Unnin oanienc 

XXXJXXXKJ SOU/X\Hg9 


Human secreted orotein SEO ID NO* 
4478. 


110 

AAV 


56 


13117 


AK025653 


Homo sapiens 


unnamed protein product 


188 


100 


13118 


AF130089 


Homo sapiens 


PRO2550 


129 


68 


13119 


AF161356 


Homo sapiens 


HSPC093 


131 


59 


13120 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


121 


45 


13121 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


82 


55 


13122 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


45 



486 
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> 



SEQIO 


Accession No. 


Species 


Description 


Smith- 

*r iuu umu 

Score 


% 

Irfr>nrit 

lUCUUl 

y 




- 




4718. 






13123 


Y38394 


Homo. sapiens 


Human secreted protein encoded by 
gene No. 9. 


213 


100 


13124 


G04Q72- 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


108 


55 


13125 


AF1 19855 


Homo sapiens 


PR01847 


109 


45 


13126 


R95913 


Homo sapiens 


Neural thread protein. 


137 


47 


13127 


BO 1390 


Homo sapiens 

* 


Neuron-associated protein. 


101 


47 


13128 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


122 


67 


13129 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


105 


52 


13130 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


170 


72 


13131 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


500 


90 


13132 


AF138957 


Bos taurus 


type II collogen cyanogen bromide 
fragment CB8 


99 


38 


13133 


AF084256 


Homo sapiens 


beta glucuronidase isoform d 


157 


69 


13134 


U92817 


Homo sapiens 


unnamed HERV-H protein 


113 


48 


13135 


Y19743 


Homo sapiens 


SEQ ID NO 461 from W09922243. 


967 


99 


13136 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492; 


155 


52 


13137 


AF090928 


Homo sapiens 


PRO0470 


133 


69 


13138 


AF090931 


Homo sapiens 


PRO0483 


150 


60 


13139 


AF1 16661 


Homo sapiens 


PR01438 • 


146 


57 


13140 


AL022318 


Homo sapiens 


bK150C2 2 fPhorbolin3) 


443 


56 


13141 


S79410 


Mus musculus 


nuclear localization signal binding 
Drotein 


114 

n. 


80 


13142 


AF055985 


Onchocerca 
volvulus 


pvrrolidone-rich anti&en 


86 


43 


13143 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


62 


13144 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


146 


66 


13145 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


123 


71 


13146 


AF 107406 


Homo sapiens 


GW128 


120 


50 


13147 


L10908 


Mus musculus 


Gcapl eene product 


91 


42 


13148 


AP000616 


Oryza sativa 

• 


similar to RING-H2 finger protein 
RHAla(AF078683) 


113 


66 


13149 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.8c) 

^m r ^k am a m m ■ • ^b»^ ^b^*^ ^b^ ^b ^b^bi^^b. ^b> ■ ^b* ■ 


119 


67 


13150 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


168 


86 


13151 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4499. 


74 


58 


13152 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


108 


61 


13153 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


131 


67 


13154 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

^b> ^b> m^w<^ ^^^^m* m^^^ ^*^m »^b» ^bv b^ m » ^ ™ — ■ ^b^ ^^b^ ^Bf ^b*^b^^ ^ ^ ^b^ • 

6953. 


113 


54 


13155 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


127 


84 


13156 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


167 


80 


13157 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


125 


68 


13158 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


514 


71 


13159 


Y86248 


Homo sapiens 


Human secretedjprotein HCHPF68, 


130 


66 



487 



WO 01/64835 PCTAJS01/04927 

* * ■ 



SEQID 


Accession No. 


Species 


Description 


Smith- 

W9icrai&n 

Score 


1 J,^ A ■ A 

Idennt 
v 




i 


* 


SEQ1DN0:163. 






13160 


AF1 16661 


Homo sapiens 


PR01438 


152 


63 


13161 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


105 


53 


13162 


AF1 07406 


Homo sapiens 


GW128 


159 


62 


13163 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121 . 


132 


59 


13164 


Ml 5530 


Homo sapiens 


B-cell growth factor 


97 


80 


13165 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


67 


36 


13166 


Y02671 


Homo saoiens 


Human secreted protein encoded bv 
gene 22 clone HMSJW18. 


126 


57 


13167 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


95 


53 


13168 


G00407 

• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


96 


57 


13169 


AL049795 


Homo sapiens 


(1J622L5.7.2 (novel protein Cisoform 2Y) 


250 


95 


13170 


G00366 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4447 


116 


56 


13171 

A & W A> 


L10908 

A*^ A V ^ 


Mus m us cuius 

A'AVAm AAA *A W 


Gcaol ffene oroduct 


75 


53 


13172 


S73853 . 


Homo saoiens 


NF2=Tieurofibromatosis tvne 2 
(altemativelv spliced, form A4^ 


160 

X w 


73 


13173 


AP000616 


Orvza sativa 


similar to RING-H2 fineer nrotein 

wlilAUUi A M t ^ A AAV A.UAWV1 A^ A y VVI A 1 

RHAla(AF078683) 


134 


76 


13174 


A52568 


Homo sapiens 


HMGI-C 


84 


44 


13175 


G03807 


Homo saoiens 


Human secreted nrotein SEO ID NO* 

A AWAAAtAAA vVviVIVU t/J V WHIj WWV IV A « 

7888. 


91 

✓ X 


49 


13176 


AK025116 


Homo saoiens 


unnamed orotein oroouct ' 


130 


44 


13177 


AF090901 


Homo saoiens 


PRO0195 


103 


84 


13178 


AF090894 


Homo sapiens 


PRO0113 


92 


62 


13179 


AF1 30079 


Homo sapiens 

■<* » MA ■ JdrJ V4AA-V 


PR02852 


118 

A A fc** 


38 


13180 


AK025047 


Homo saoiens 

A Aw AAA W tJUU 1 WAAtf 


unnamed protein product 


141 

X ~ X 


67 


13181 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded bv eene 121 


87 


59 


13182 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


220 


74 


13183 


S79980 


Bos taurus 


ribosomal protein L37 


186 


94 


13184 


AP000616 

A AvA WW A\/ 


Orvza sativa 

X^A T IM JUVI T LA 


similar to RING-H2 fineer orotein 
RHAla(AF078683) 


113 


90 


13185 


AF155232 


Pisum sativum 


extensin 


95 


36 


13187 


U16359 


Rattus 
norve&icus 


nitric oxide synthase . 


90 


78 


13188 


G03787 


Homo saoiens 


Human secreted orotein SEO ID NO* 
7868. 


113 


79 


13189 


G00377 


Homo saoiens 


Human secreted orotein SEO ID NO* 
4458. 


158 

A a/ W 


53 


13190 


Y02671 


Homo saoiens 


Human secreted orotein encoded bv 
gene 22 clone HMSJW18. 


130 


59 


13191 


AP000616 


Orvza sativa 


similar to RING-H2 fineer orotein 

W AAaA AAfcAA WV/ A XA1 l V-A* A AAm* AH IfiWI L/A W IV AAA 

RHAla (AF078683) 


131 


66 


13192 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


439 


86 


13193 


Z66499 


Caenorhabdrtis 
elegans 


T01B7.8 


127 


35 


13194 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


126 


61 


13195 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


58 


43 


13196 
13197 


G03172 
Y19767 


Homo sapiens 
Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 

SEQ ED NO 485 from W09922243. 


95 
116 


76 
60 
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WO 01/64835 PCTAJS01/04927 

t 



SEQ1D 

FUJI 

( 


Accession No. 


Species 


Description 


Smith- 

it merino u 

Score 


% . 

Monti* ' 

menu i 

y 


13198 


X58236 


Homo sapiens 


36/8-8 fusion protein with epitope for 
anti-lectin antibody 


98 


41 


13199 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


102 


58 


13200 


AF220264 


Homo sapiens 


MOST-1 


141 


45 


13201 


G02872 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6953. 


112 


58 


13202 


Y17221 


Homo sapiens 


Human secreted protein (clone fk3 1 7- 
3). 


115 


57 


13203 


W50192 


Homo sapiens 


Amino acid sequence of salivary 

protein CON-1. 
* 


91 


39 


13204 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


65 


13205 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


98 


68 


13206 


Ml 5530 


Homo sapiens 


B-cell growth factor 


105 


76 


13207 

A W Am V » 


W48927 

Tf T\Jm T mm W 


Homo sapiens 


Schwannomin-bindins protein C- 
terminal fragment. 


84 

Xm * 


45 


13208 


L27428 


Homo sapiens 


reverse transcriptase 


205 


49 


13209 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


77 


57 


•13210 


M81321 


Macaca 
fascicularis 


proline-rich protein 

^+ A *mA^m m, m^*mm f <* ^m mmm* 


104 


48 


13211 

• «^ mm A A 


X55685 


Lvcopersicon 
esculentum 


extensin (class I) 


122 


33 


13212 


Ml 5530 


Homo sapiens 


B-cell growth factor 


159 


61 


13213 


AF 130089 


Homo sapiens 


PRO2550 

m* ^m mw ^m 


126 


56 


13214 


AF210651 


Homo sapiens 


NAG18 


125 


68 


13215 


G03714 


Homo sapiens 


Human secreted protein. SEO ID NOr 
7795. 


114 


76 


13216 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


157 


70 


13217 


R59842 


Homo sapiens 


ApoE4Ll protease. 


102 


72 


13218 

m> *J Am A V 


K01664 


Drosophila 

mmrA VyVVUivM 

melanogaster 


Bkm-like protein 


123 

m mmmr 


37 


13219 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 5 1 clone HUKEX85. 


101 


55 


13220 


D14167 


Gallus gallus 


ribosomal protein L37a 


126 


40 


13221 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


178 


82 


13222 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


141 


55 


13223 


L10908 


Mus musculus 


Gcapl gene product 


114 


53 


13224 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


216 


73 


13225 


G03259 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7340. 


104 


76 


13226 


AF130079 


Homo sapiens 


PR02852 


158 


66 


13227 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


60 


13228 


S77772 


Homo sapiens 


asparty Iglucosaminidase, AGA {C- 
terminal, alternatively spliced} {EC 
3.5.1.26} 


100 


46 


13229 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


72 


13230 


M15530 


Homo sapiens 


B-cell growth factor 


162 


72 


13231 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


161 


66 


13232 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


80 


52 
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i ■ ♦ 



• 



SEQ ID 


\ A — - — * m 

Accession No. 


Species 


Description 


o —.1*4, 


% 


NO* 








Waterman 


1UVU til 










Score 


y 


13233 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


51 






• 


7871. 






13234 


AF126163 


Homo sapiens 


HHLA3 protein 


131 


78 


13235 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


125 


85 








encoded by gene 90. 






13236 


AB001684 


Chlorelfa 


ORF49b 


55 


45 




• 


vulgaris 








13237 


M15530 


Homo sapiens 


B-celi growth factor 


114 


66 


13238 


AL355929 


Neurospora 


conserved hypothetical protein 


99 


46 




- 


crassa 




- 




13239 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 


114 


63 








RHAla (AF078683) 


• 




13240 


S79410 


Mus musculus 


nuclear localization signal binding 


110 


40 








protein 






13241 


AF1 19851 


Homo sapiens 


PR01722 


167 


63 


13242 


S79410 


Mus musculus 


nuclear localization signal binding 


147 


68 








protein 






13243 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 


95 


65 






elegans 


RD protein (nuclear ma-binding 












protein) 






13244 


W48351 


Homo sapiens 

* 


Human breast cancer related protein 


99 


84 








BCRB2. 






13245 


Y14437 


Homo sapiens 


Human secreted protein encoded by 


104 


62 








gene 27 clone HSAWA27. 






13246 


AF161356 


Homo sapiens 

■ A 


HSPC093 


116 


67 


13247 


X61046 


Hydra sp. 


mini-collagen 


108 


49 


13248 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


458 


89 


13249 


Ml 1900 


Mus musculus 


1 5-kDa proline-rich salivary protein 

- A - + A 


92 


37 


13250 


AB017362 


Bombyx mori 

✓ 


fibroin H-chain 


70 


43 


13251 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


58 








6613. 






13252 


G00360 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 


122 


58 








4441. 


■ 




13253 


AF210651 


Homo sapiens 


NAG18 


151 


73 


13254 


AF224494 


Mus musculus 


arsenite inducible RNA associated 


299 


50 








protein 

A ..... 






13255 


AK026107 


Homo sapiens 


unnamed protein product 


102 


60 


13256 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


113 


61 








gene 22 clone HMSJW18. 






13257 


AK024455 


Homo sapiens 


FLJ00047 protein 


115 


59 


13258 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


155 


51 








gene No. 107. 




• 


13259 


AJ251579 


Arabidopsis 


cef protein 


115 


39 






thaliana 








13260 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


360 


82 


13261 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


61 








6953. 






13262 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


413 


87 


13263 


D23661 


Homo sapiens 


ribosomal protein L37 


487 


94 


13264 


AF1 30079 


Homo sapiens 


PR02852 


153 


43 


13265 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


109 


61 








encoded by gene 90. 






13266 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 


133 


79 








WO9901020. 






13267 


L10908 


Mus musculus 


Gcapl gene product 


103 


43 


13268 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


70 








6613. 






13269 


. G04064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


51 








8145. 
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Cf?/\ lff\ 

SEQ ID 

NO: 


Accession No. 


Species 


Description 


Waterman ' 
Score 


0/ 
% 

Identit 

y 


13270 


AF1 30089 


Homo sapiens 


PRO2550 


95 


47 


13271 


AF1 30075 


Homo sapiens 


PR02532 


90 


60 


13272 


G00577 

- 


Homo sapiens 

— 


Human secreted protein, SEQ ID NO: 
4658. 


103 


54 


13273 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


77 


91 


13274 


AF130089 


Homo sapiens 


PRO2550 


104 


57 


13275 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


94 


45 


13276 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


122 


52 


13277 


AF1 13685 


1 Homo sapiens 


PRO0974 


98 


36 


13278 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


85 


68 


13279 


G02515 

* 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


139 


89 


13280 


GO 1129 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5210. 


243 


79 


13281 


AF1 16715 


Homo sapiens 


PR02829 


108 


56 


13282 


G03789 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
7870. 


105 


60 


13283 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


194 

• 


78 


13284 


AL359782 


Trypanosoma 
brucei 


probable granule cell antiserum 
positive 8 


104 


47 


13285 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


104 


72 


13286 


AF130089 


Homo sapiens 


PRO2550 


147 


58 


13287 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


133 


58 


13288 


AF1 19855 


Homo sapiens 


PRO 1847 


115 


45 


13289 


AF161356 


Homo sapiens 


HSPC093 


131 


50 


13290 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


104 


43 


13291 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


112 


49 


13292 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


150 


69 


13293 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


284 


76 


13294 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


444 


85 


13295 


D38U2 


Homo sapiens 


cytochrome c oxidase subunit 3 


633 


93 


13296 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


164 

■ 


76 


13297 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


147 


80 


13298 


Y13I41 


Bromheadia 
finlaysoniana 


extensin 

• 


84 


38 

■ 


13299 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


139 


49 


13300 


AF130079 


Homo sapiens 


PR02852 


98 


60 


13301 


AF090894 


Homo sapiens 


PRO0113 


114 


52 


13302 


AF130079 


Homo sapiens 


PR02852 


161 


83 


13303 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


154 


76 


13304 


AF 11 8086 


Homo sapiens 


PRO 1992 


120 


75 


13305 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


100 


73 


13306 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


108 


56 
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NO: 


Accession Fio* 




mj ca en puu 11 


O HII LI}** 

Waterman 
Score 


7b 

Idcntit 

y 


13307 


Y74110 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #297. 


448 


100 


13308 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


85 


65 

. 


13309 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


104 


47 


13310 


AF1 16715 


Homo sapiens 


PR02829 


124 


76 


13311 


G0I246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
.5327. 


142 


63 


13312 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO:" 
6953. 


155 


61 


13313 


AF1 16661 


Homo sapiens 


PR01438 


142 


67 


13314 


AF1 61356 


Homo sapiens 


HSPC093 


121 


46 


13315 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


169 

■ 


52 


13316 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


98 


90 


13317 


AF130079 


Homo sapiens 


PR02852 


109 


58 


13318 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


113 


80 


13319 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


81 


61 


13320 


AF1 30089 


Homo sapiens 


PRO2550 


106 


77 


13321 


AF090931 


Homo sapiens 


PRO0483 


143 


63 


13322 


D29833 


Homo sapiens 


proline rich peptide P-B 


59 


33 


13323 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


417 


78 


13324 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


119 


82 


13325 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


168 


59 


13326 


A3 1038 


Nicotiana alata 


PRP3 


88 


41 


13327 


AF090895 


Homo sapiens 


PRO0117 


87 


55 


13328 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


88 


51 


13329 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


103 


66 


23330 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 
DNA, complete virion genome 


101 


67 


13331 


AF090930 


Homo sapiens 


PRO0478 


93 


36 


13332 


AF 159055 


Homo sapiens 


leucine zipper-like protein 


118 


71 


13333 


Y02671 


Homo sapiens 

■ 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


89 


57 


13334 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


121 


80 


13335 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


160 


73 


13336 


AJ005562 


Mus musculus 


SPR2D protein 


86 


44 


13337 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


169 


66 


13338 

• 


G00528 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
4609. 


78 


57 


13339 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


102 


91 


13340 


AF090895 


Homo sapiens 


PRO0117 


137 


47 


13341 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


115 


46 


13342 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


100 


76 


13343 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


65 
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SEQID 
NO* 


Accession No. 


Species 


Description 


Smitb- 
W a terra an 
Score 


% 

Identit ' 

y 




* 




7795. 






13344 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rbala. 


76 


76 


13345 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


111 

■ 


72 


13346 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4633. 


143 


69 


13347 


AF119900 


Homo sapiens 


PR02822 


93 


58 


13348 


G03714 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7795. 


169 


75 


13349 


W48353 

■ 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


83 


68 


13350 


AF1 16715 


Homo sapiens 


PR02829 


134 


64 


13351 


Y02999 


Homo sapiens 

* 


Fragment of human secreted protein 
encoded by gene 121. 


115 


65 


13352 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


118 


59 


13353 


Y02887 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


139 


77 • 


13354 


AF1 16661 


Homo sapiens 


PR01438 


112 


75 


13355 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


170 


73 


13356 


AF126163 


Homo sapiens 


HHLA3 protein 


116 


55 


13357 


U18339 


Variola virus 


D4L 


101 


58 


13358 


AF 130089 


Homo sapiens 


PRO2550 


127 


52 


13359 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


184 

i» 


82 


13360 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


78 


50 


13361 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


126 


71 


13362 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


174 


76 


13363 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


176 


51 


13364 


U12390 


Cloning vector 
pSportl 


beta-gaiactosidase alpha peptide 


99 


41 


13365 


G02532 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6613. 


118 


56 


13366 


U 12390 


Cloning vector 
pSportl 


beta-gaiactosidase alpha peptide 


158 


50 


13367 


Y19467 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


65 


64 


13368 


M15530 


Homo sapiens 


B-cell growth factor 


132 


77 


13369 . 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


63 


13370 


AF130079 


Homo sapiens 


PR02852 


108 


61 


13371 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


150 


76 


13372 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


117 


65 


13373 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


118 


57 


13374 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


121 


63 


13375 


AF1 18086 


Homo sapiens 


PR01992 


147 


74 


13376 


AF130089 


Homo sapiens 


PRO2550 


149 


48 


13377 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


123 


55 
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• 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterraan 

Score 


% 

Jdeotit 

y 


13378 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


120 


53 


13379 


AF090894 


Homo sapiens 


PRO0113 


101 


52 


13380 


AF107406 


Homo sapiens 


GW128 


126 


51 


13381 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


156 


84 


13382 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


119 


80 


13383 


Y14437 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27. 

*—* 


84 


70 


13384 


S79410 


Mus musculus 

• 


nuclear localization signal binding 
protein 


107 


53 


13385 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


107 


53 


13386 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


141 


65 


13387 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


113 


S3 


13388 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


155 

■ 


77 


13389 


AF130089 


Homo sapiens 

* 


PRO2550 


159 


62 


13390 


AF161356 


1 Homo sapiens 


HSPC093 


154 


74 


13391 


AC006135 


Arabidopsis 
thaliana 


putative vicilin storage protein 
(globulin-like) 


171 


28 


13392 


AF113685 


Homo sapiens 


PRO0974 


226 


57 


13393 


AF118082 


Homo sapiens 


PRO 1902 


69 


68 


13394 


D23661 


Homo sapiens 


ribosomal protein L37 


498 


96 


13395 


AF1 30079 


Homo sapiens 


PR02852 


109 


54 


13396 


AF1 19851 


Homo sapiens 

* 


PR01722 


107 


57 


13397 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


165 


68 


13398 


AF130089 


Homo sapiens 

* 


PRO2550 


145 


68 


13399 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


134 


54 


13400 


J03798 


Homo sapiens 


small nuclear riboproteinSm-D 


451 . 


93 


13401 


D29833 


Homo sapiens 


proline rich peptide P-B 


64 


39 


13402 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


99 


76 


13403 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


159 


76 


13404 


AF090894 


Homo sapiens' 


PRO0113 


152 


70 


13405 


S58722 


Homo sapiens 


X-linked retinopathy protein {O 
terminal, clone XEH.8c} 


101 


45 


13406 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


172 


72 


13407 


G00407 • 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


135 


84 


13408 


G03714 


Homo sapiens 

• 


Human secreted protein, SEQ ID NO: 
7795. 


125 


63 


13409 


G04072 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
8153. 


138 


58 


13410 


AF130089 


Homo sapiens 


PRO2550 


101 


70 


13411 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


73 


13412 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


48 


13413 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


95 


50 


13414 


AF090942 


Homo sapiens 


PRO0657 


171 


72 


13415 

■ 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


120 


64 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smitb- 
Wa terra an 
Score 


Identlt 
y 


13416 


Ml 5530 


Homo sapiens 


B-cell growth factor 


121 


45 


13417 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


115 


58 


13418 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


146. 


70 


13419 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


85 


64 


13420 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


77 


73 


13421 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


149 


62 


13422 


AF130089 


Homo sapiens 


PRO2550 


111 


39 


13423 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


106 


42 


13424 


AF1 19882 


Homo sapiens 


PR02492 


110 


70 


13425 


Y14482 


Homo saoiens 

A 4VAUV Ml/ ft Vi » 


Fragment of human secreted protein 
encoded by gene 17. 


104 


59 


13426 


AF1 18082 


Homo saoiens 


PRO1902 


129 


48 


13427 


X70771 


Chironomus 
tentans 1 

vw* % *j**a »*j 


Sp17 


95 


56 


13428 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613 


164 


66 


13429 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
eeneNo. 107. 


149 


80 


13430 


Y86248 

A XJ XX r \J 


Homo saoiens 


Human secreted protein HCHPF68. 

t*V*l***AU **WX*J WWV%* f*r * »>• W ill * *• ** A WW} 

SEQ ID NO: 163. 


103 


63 


13431 


AF1 16661 


Homo saoiens 


PR01438 


128 


54 


13432 


AK024455 


Homo saoiens 


FLT00047 protein 

Ml W Xr X# X# > r r 


117 


69 


13433 


Y13141 


Bromheadia 
finlavsoniana 

******* ¥ 


extensin 


91 


40 


13434 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


118 


64 


13435 


AF090944 


Homo saoiens 


PRO0663 


115 


50 


13436 


X55687 


Lycopersicon 
esculentum 


extensin (class II) 


73 


32 


13437 


AF090942 


Homo sapiens 


PRO0657 


77 


50 


13438 


AP000061 


AeroDvrum 

* *v» ^*rJ 

pemix 


104aa Ion 2 hvpothetical protein 


82 


41 


13439 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


94 


51 


13440 


AF161356 


Homo sapiens 


HSPC093 


115 


53 


13441 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


125 


44 


13442 


Y59784 


Homo sapiens 


Human normal ovarian tissue derived 
protein 61. 


1096 


100 


13443 


U33547 


Homo sapiens 


MHC class II antigen 


120 


73 


13444 


Y86248 


Homo sapiens 

Mi \^^MM ^» Ml/ *^^***^ 


Human secreted protein HCHPF68. 
SEQIDNO:163. 


106 


71 


13445 


W54966 . 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


115 


34 


13446 


G02256 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6337. 


486 


100 


13447 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


155 


60 


13448 


AF130079 


Homo sapiens 


PR02852 


167 


56 


13449 


AB047890 


Macaca 
fascicularis 


hypothetical protein 


171 


62 


13450 


AK025326 


Homo sapiens 


unnamed protein product 


111 


67 


13451 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


93 


76 
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SEQ ID 


Accession No. 


Species 


Description 


Smith* 

Waterman 

Score 


% 

Trip nfit 

IULIIUI 

y 


13452 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


109 


91 


13453 


U65650 


Arabidopsis 
thaliana 


blue-copper binging protein 1U 


103 




13454 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


123 


69 


13455 


AF090894 


Homo sapiens 


PRO0113 


122 


56 


13456 


D38585 


Homo sapiens 


TSC-22 


710 


99 


13457 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121 . 


113 


45 


13458 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


64 


51 


13459 


AFI 07406 


Homo sapiens 

JT 


GW128 


162 


54 


13460 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


145 


65 


13461 


X6I045 


Hydra sp. 


mini-collagen 


107 


34 


13462 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


130 


46 


13463 


Y35970 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 219. 


189 


80 


13464 


L47668 


Homo sapiens 


alpha-2 collagen type I 


64 


39 


13465 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


155 


65 


13466 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


76 


13467 


Y14437 


Homo sapiens 


Human secreted protein encoded by 
gene 27 clone HSAWA27. 


109 


70 


13468 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


144 


69 


13469 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


121 


39 


13470 


M15530 


Homo sapiens . 


B-cell growth factor 


144 


57 


13471 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


116 

- 


85 

■ 


13472 


V00488 


Homo sapiens 


alpha globin 


704 


96 


13473 


AJ012166 


Canis familiaris 


brain-specific synapse associated 
protein, Bassoon 


56 


45 


13474 


AFI 16661 


Homo sapiens 


PR01438 


128 


45 


13475 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


99 


78 


13476 


V00488 


Homo sapiens 


alpha globin 


278 


91 


13477 


W65762 


Homo sapiens 


Human HDCAP protein. 


490 


98 


13478 


AB033767 


Homo sapiens 


brain-selective and closely mapped on 

* 

the counter allele of CMAP in cy statin 
cluster 


2184 

• 


97 


13479 


AB033767 


Homo sapiens 


1 brain-selective and closely mapped on 
the counter allele of CMAP in cystatin 
cluster 


2184 


97 


13480 


Y42382 


Homo sapiens 


Amino acid sequence of ix3 1 7 11. 


613 


95 


13481 


D29833 


Homo sapiens 


proline rich peptide P-B 


61 


58 


13482 


AK023277 


Homo sapiens 


unnamed protein product 


488 


98 


13483 


X67703 


Drosophiia 
melanogaster 


Mst84Da 


53 


60 


13484 


Y27907 

T 1 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119, 


135 


54 


13485 


AF210350 


Sceloporus 
siniferus 


NADH dehydrogenase subunit 4 


149 


79 


13486 


Y 14544 


Danio rerio 


Hoxc8 protein 


98 


36 


13487 


AP002032 


Arabidopsis 




111 


27 
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Accession No. 


Species 

■ 


Description 
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Wo h^rm ■ n 
rTaiuriuao 

Score 


% 

JOCuUl 

y 




i 


thaliana 








13488 


AB047846 


Homo sapiens 


gammal-COP 


4463 


100 


13489 


M37194 


Rattus 
norvegicus 


clathrin-associated protein 17 

• 


739 


100 


13490 


Y59743 


Homo sapiens 

■ 


Human normal ovarian tissue derived 
protein 20. 


138 

■ 


96 


13491 


U01317 


Homo sapiens 


G-gamma globin 


198 


90 


13492 


W04180 


Homo sapiens 


Cellubrevin-1. 


459 


98 


13493 


AL121585 


Homo sapiens 


6A504H3.1 (SKX5 (sorting nexin 5)) 


2058 


100 


13494 


X53777 


Homo sapiens 


putative ribosomal protein (AA 1-184) 


970 


100 


13495 


AJ271881 


Homo sapiens 


bromodomain containing protein 


3378 


99 


13496 


AF132970 


Homo sapiens 


CGI-36 protein 


878 


100 


13497 


V73498 cdl 


Homo sapiens 


21-AUG-1997 Human S100P1 DNA> 


273 


100 


13498 


AL 12 1585 


Homo sapiens 


bA504H3.1 (SNX5 (sorting nexin 5)) 


2058 


100 


13499 


X82456 


Homo sapiens 


L1M and SH3 domain protein 


1420 


100 


13500 


AJ277932 


Homo sapiens 


RPB1 la protein 


588 


98 


13501 


Y53569 


Homo sapiens 


Consensus sequence for human RAD1 
binding proteins. 


633 


84 


13502 


W73434 


Homo sapiens 


Human secreted protein encoded by 
Gene No. 2. 


1903 


100 


13503 


AJ005981 


Sus scrofa 


cAMP-reguJated phosphoprotein 


577 


100 


13504 


AF1 16715 


Homo sapiens 


PR02829 


143 


65 


13505 


U97553 


murid 

herpesvirus 4 


unknown 


84 


35 


13506 


X06323 


Homo sapiens 


put. ribosomal protein L3 (AA I - 348) 


1903 


100 


13507 


W82397 


Homo sapiens 


Human UBP protein #3. 


4264 


99 


13508 


M86246 


Homo sapiens 


EHS-2 


110 


71 


13509 


Y02671 


Homo sapiens 

• 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


121 


50 


13510 


AF073519 


Homo sapiens 


small EDRK-rich factor 1 , Ions isoform 


91 


44 


13511 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7893. 


591 


100 


13512 


Y04369 


Homo sapiens 


Human HUCEP-1 4 protein. 


1389 


100 


13513 


G03714 

"F mm> W 4 W 


Homo sabiens 


Human secreted Drotein. SEO ID NO: 
7795. 


136 

* w *f* 


60 


13514 


AE000232 


Escherichia coli 
K12 


orf, hypothetical protein 


1480 


100 


13515 


X76717 


Homo sapiens 


MT- 11 protein 


382 


100 


13516 


Y00919 


Homo sapiens 


Human Rab protein, RABP-2. protein 
sequence. 


1039 


100 


13517 


G03714 


Homo sapiens 


Human secreted protein. SEO ID NO: 
7795. 


136 


67 


13518 


AF090895 


Homo sapiens i 


PRO0117 


109 


56 


13519 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


109 


65 


13520 


G02538 


Homo sapiens 


Human secreted protein, SEO ID NO: 
6619. 


145 


71 


13521 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


93 


60 


13522 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


140 


72 


13523 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


130 


73 


13524 


M15530 


Homo sapiens 


B-cell growth factor 


112 


67 


13525 


AF1 19851 


Homo sapiens 


PR01722 


135 


57 


13526 


G00541 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4622. 


98 


82 


13527 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


117 


41 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
y 


13528 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0:163. 


105 


100 


13529 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. ~~ 


79 


78 


13530 


AJ000536 


Arabidopsis 
thaliana 


COP1 protein 

• 


98 


32 


13531 


U50403 


Homo sapiens 


breast cancer suppressor element 
Ishmael Upper RP2 


101 


62 


13532 ' 


G00352 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
4433. 


156 


56 


13533 


Z80779 


Homo sapiens 

* 


histone H2B 


628 


100 


13534 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


62 


13535 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


82 


63 


13536 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


93 


56 


13537 


AP000616 


Oryza sativa 

i 


similar to RING-H2 finger protein 
RHAla(AF078683) 


139 


81 


13538 


AF1 30051 


Homo sapiens 


PRO0898 


90 


53 


13539 


XI 6832 

m mm> mm \gw tr 4m^F 


Homo saoiens 


preprocathepsin H (AA -22 to 3 14) 


1835 


99 


13540 


Y02671 


Homo saoiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


100 


80 


13541 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


131 


82 


13542 


G00452 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4533. 


94 

ti 


65 


13543 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


141 


86 


13544 


AF130089 


Homo sapiens 


PRO2550 


107 


72 


13545 


M36647 


Homo sapiens 

r 


mitochondrial hinge protein precursor 

L AY 


435 


89 


13546 


U63810 


Homo sapiens . 


WD40 protein Ciao 1 


505 


100 


13547 


AF090931 


Homo sapiens 


PRO0483 


126 


73 


13548 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


143 


65 


13549 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


82 


62 


13550 


AF1 13685 


Homo sapiens 


PRO0974 


126 


48 


13551 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


89 


53 


13552 


AF1 16661 


Homo sapiens 


PRO 1438 


109 


69 


13553 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


101 


85 


13554 


X 16832 


Homo sapiens 


preprocathepsin H (AA -22 to 314) 

A A AT X * A 


1835 


99 


13555 


Y73435 


Homo sapiens 


Human secreted protein clone yd73_l 
protein sequence SEQ ED N0:92. 

* A ^ 


528 


84 


13556 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


13557 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


67 


13558 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7174. 


189 


75 


13559 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


131 


53 


13560 


AF1 18086 


Homo sapiens 


PR01992 


157 


71 


13561 


Y00320 


Homo sapiens 


Human secreted protein encoded by 
gene 64. 


101 


64 


13562 


AF 159055 


Homo sapiens 


leucine zipper-like protein 


119 


74 
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SEQ ID 


Accession No. 


Species 


Description 

• 


Smith- 

Waterman 

Score 


% 

Identit 

y 


13563 


AFU8082 


Homo sapiens 


PRO1902 


83 


86 


13564 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


234 . 


97 


13565 


Y59778 


Homo sapiens 


Human normal ovarian tissue derived 
protein 55. 


66 


100 


13566 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


103 


60 


13567 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


152 


96 


13568 


X83703 


Homo sapiens 


nuclear protein 


1633 


99 


13569 


G02473 


Homo sapiens 
* 


Human secreted protein, SEQ ID NO: 
6554. 


242 


45 


13570 


G00521 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4602. 


105 

• 


90 


13571 . 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


110 


80 


13572 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


46 


13573 


X92744 


Homo sapiens 

- a 


hBD-1 


290 


83 


13574 


P70494 , 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


107 


68 

• 


13575 


API 07406 


Homo sapiens 


GW128 


356 


100 


13576 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 
sequence. 


101 


48 


13577 


AF220264 


Homo sapiens 


MOST-1 


95 


73 


13578 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


129 


68 


13579 


AKQ24455 


Homo sapiens 


FLJ00047 protein 


81 


76 


13580 


AF130051 


Homo sapiens 


PRO0898 


128 


64 


13581 


AF084256 


Homo sapiens 


beta glucuronidase isoform d 


142 


58 


13582 


AF1 18082 


Homo sapiens 

* 


PRO1902 


85 


51 


13583 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


114 


52 


13584 


G02455 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6536. 


66 : 


54 


13585 


X52164 


Mus museums 


Q300 protein (AA 1-77) , 


109 


64 


13586 


AL359782 


Trypanosoma 
* * 

brucei 


possible (hhv-6) u 1 1 02, variant a dna, 
complete virion genome. 


109 


77 


13587 


AF090901 


Homo sapiens 


PRO0195 


114 


53 


13588 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


.95 


66 


13589 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


117 


58 


13590 


AK000521 


Homo sapiens 


unnamed protein product 


1313 


100 


13591 


AL357374 


Homo sapiens 


0A353C 18.2 (novel protein) 


421 . 


100 


13592 


AL357374 


Homo sapiens 


DA353C18.2 (novel protein) 


421 


100 


13593 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


139 

■ 


62 


13594 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


103 


56 


13595 

4 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


109 


76 


13596 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


127 


88 


13597 


G00325 


Homo sapiens 


Human secreted protein^ SEQ ID NO: 
4406. 


275 


100 


13598 


W03642 


Homo sapiens 


Human cannabinoid GPR N-tenninal 
sequence. 


108 


63 


13599 


AF1 16661 


Homo sapiens 


PR01438 


128 


64 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
y 


13600 


X55682 


Lycopersicon 
esculentum 


extensin (class I) 


58 


43 


13601 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


106 


45 


13602- 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


122 


92 


13603 


AF149419 


Oryctolagus 
cuniculus 


eye sodium bicarbonate cotransport 
protein NBC2 


126 


75 


13604 


AJ005567 


Mus musculus 


SPR2I protein 


63 


40 


13605 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


150 


96 


13606 


G03263 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
7344. 


121 


60 

• 


13607 


G02867 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6948. 


85 


60 


13608 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


57 


35 


13609 


AF 107406 


Homo sapiens 


GW128 


108 


42 


1 13610 


D82345 


Homo sapiens 


NB thymosin beta 


167 


100 


13611 


G03714 

. _ • 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


59 


13612 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


125 


50 


13613 


Y32193 


Homo sapiens 


Human receptor molecule (REC) 
encoded by Incyte clone 044150. 


490 


80 


13614 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


57 


13615 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


58 


13616 


AF1 19900 


Homo sapiens 


PR02822 


95 


53 


13617 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


58 


13618 


S79978 


Homo sapiens 


prion protein, PrP {octapeptide 
repeats} 


88 


42 


13619 


AF220264 


Homo sapiens 


MOST-1 


130 


81 


13620 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


95 


65 


13621 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


168 


85 


13622 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


118 


65 


13623 


M86246 


Homo sapiens 


EHS-2 


100 


63 


13624 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


104 


46 


13625 


AF067205 


Homo sapiens 


vesicle transport related protein 


198 


69 


13626 


U39529 


Echinometra 
mathaei 


bindin 


72 


47 


13627 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


52 


13628 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


123 


54 


13629 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


113 


46 


13630 


AF161536 


Homo sapiens 


HSPC051 


582 


100 


13631 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


158 


87 


13632 


AF130089 


Homo sapiens 


PRO2550 


105 


66 


13633 


AF1 16661 


Homo sapiens 


PR01438 


116 


53 


13634 


AF116715 


Homo sapiens 


PR02829 


101 


60 
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SEQIO 
NO: 


Accession No. 


Species 

* 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


13635 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. . 


139 


63 


13636 


U33547 


Homo sapiens 


MHC class II antigen 


154 


79 


13637 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


107 


75 


13638 


AF090895 


Homo sapiens 


PRO0117 


87 


62 


13639 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


131 


69 


13640 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


128 


60 


13641 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


128 


81 


13642 


AC003058 


Arabidopsis 

* ^**P- ^^^^^ A ^^^^ 

thaliana 


unknown protein 


210 


67 


13643 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


287. 


76 


13644 


Y73483 


Homo sapiens 


Human secreted protein clone yl!8_l 
protein sequence SEQ ID NO: 188. 


376 


88 


13645 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68. 
SEQIDNQ:163. 


130 


85 


13646 


Y87212 

1 


Homo sapiens 


Human secreted protein secmehce SEO 
IDNO:251. 


423 


97 


13647 


U50403 


Homo s aniens 


breast cancer siiDnressor element 
Ishmael Upper RP2 


103 


86 


13648 


G03800 


Homo sapiens 


Human secreted protein. SEO ID NO: 
7881. 


128 


55 


13649 


X58521 


Homo sapiens 


nucleoporin o62 

UU Vl VVL/ vl 111 LSV/AV 


2610 


98 


13650 


G03714 


Homo ^aniens 


Human secreted protein SEO ID NO* 
7795. 


141 


69 


13651 


G03172 


Homo sapiens 


Human secreted protein SEO ID NO: 

AA-VAJJAIAIA JVWI VbVM h/1 v IVlAij w&JV 11/ A 1 * 

7253. 


111 

AAA 


81 


13652 


G02538 


Homo saniens 


Human secreted Drotein SEO ID NO' 
6619. 


140 

A » V 


60 


13653 


G03263 

VJ \J +J X+\J+J 


Homo sat) i ens 


Human secreted protein SEO ID NO: 
7344. 


121 

A AV A 


46 


13654 


U13066 


Nicotian a a lata 


arabinoealactan-Drotein precursor 


92 


33 


13655 


S79410 


Mus musculus 


nuclear localization sienal binding 
protein 


134 


50 


13656 


Y07766 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 23. 


156 


100 


13657 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
eene 22 clone HMSJW18. 


124 


48 


13658 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


171 


80 


13659 


AK000017 


Homo sapiens 


unnamed protein product 


611 


100 


13660 


U90446 


Mus musculus 


RNAse L inhibitor 


3100 


99 


13661 


AE000882 


Methauothennob 
acter 

thermoautotrophi 
cus 


phosphoenolpyruvate synthase 

■ 


88 


38 

■ 


13662 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


128 


65 


13663 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


97 


34 


13664 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


77 


41 


13665 


Y94890 


Homo sapiens 


Human protein clone HP02798. 


325 


98 


13666 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


65 


13667 


AF090930 


Homo sapiens 


PRO0478 


119 


88 


13668 


AF1 30089 


Homo sapiens 


PRO2550 


120 


77 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


13669 


AC006693 


Caenorhabditis 
elegans 


Hypothetical protein W02H5.e 


165 


85 


13670 


W80293 


Homo sapiens 


Human translocation associated protein 
designated Gp25L-H. ' J 


1003 


95 


13671 


U23455 


Caenorhabditis 
elegans 


similar to D. melanoeaster homeotic 
protein BarH2 (PIR:A41726) 


113 


91 

^ A 


13672 


G03714 


Homo sapiens 


Human secreted protein, SEO ID NO: 
7795. 


160 


60 


13673 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


178 


75 


13674 


AF1 18086 


Homo saoiens 


PRO 1992 


85 


75 


13675 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
eene 22 clone HMSJW18. 


132 


55 


13676 


AF220264 


Homo saoiens 


MOST-1 


133 


85 


13677 


Y19730 


Homo sapiens 


SEQ ID NO 448 from W09922243. 


122 


51 


13678 


Y65416 


Homo saoiens 


Human 5' EST related oolvoeDtide SEO 
ID NO 1577 


467 


98 


13679 


Y16589 


Homo sapiens 


A protein that interacts with 
presenilis. 


2286 


99 


X JUOU 


AT 160171 


T picHmflnifl 
mnior 


nrnhahlp Hihv-fi^ 11 1 1 09 variant a 
TiNA comolete virion genome 


104 




136S1 


AF090931 


Homo sapiens 


PRO0483 


117 


67 


13682 




T vooTiWQifivn 

esculentum 




60 


56 


13683 


AF090944 

/xl U7U7TT 




PRO0663 


93 


90 


13684 


AL096770 


Homo sapiens 


bA150A6^ (novel 7 transmembrane 

lo^cuik/i f i iiKwxjL/aiii laiiinyj t yiioviv) t 

receptor like) protein (hs6Ml-21)) 


178 


52 


13685 


AF1 16661 


T-Trimo osmipni 




115 


60 


13686 


U62039 


Elephantulus 

edwardii 

vUVYCU vill 


reverse transcriptase 


86 


53 


13687 


Y13141 

+ 


Bromheadia 

fl nlji VQOn i an a 

tlUlflj JV/JlllClliCT 


extensin 


77 


36 


13688 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867 


114 


60 


13689 


Y36366 


Homo sapiens 


Fragm ent of human secreted protein 

encoded bv Pens 3 

WlivUUvU U J gVUW *J . 


103 


53 


13690 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 


107 


51 


13691 


AC003058 


Arabidoosis 
thaliana 


unknown orotein 

UIUUJV ITJJ LSI V^VAJU 


178 


85 


13692 


Y86248 


Homo saoiens 


Human secreted orotein HCHPF68 
SEQIDNO:163. 


94 


35 


13693 


G03807 


Homo saoiens 


Human secreted orotein. SEO ID NO: 
7888. 


143 


78 


13694 


G00416 


Homo saoiens 


Human secreted orotein. SEO ID NO: 
4497. 


98 

i 


50 


13695 


D63163 


Rattus so 


cvclin C 


111 

AAA 


86 


13696 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
cene 22 clone HMSJW1 8 


118 




13697 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


124 


80 


13698 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


129 


70 


13699 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
7868. 


93 


58 


13700 


W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


100 


73 


13701 


M76744 


Homo sapiens 


BGP 


112 


52 



502 



WO 01/64835 PCT/US01/04927 

* * 



SEQ ID 

NO: 


Accession No. 


Species 

* 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


13702 


Y15155 


Homo sapiens 


phosphorylase kinase beta-subunit 


206 


100 


13703 


AF090901 . 


Homo sapiens 


PRO0195 


100 


70 


13704 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


131 


79 


13705 


R95913 


Homo sapiens 


Neural thread protein. 


106 


36 


13706 


M15073 


Homo sapiens 


MHC HLA-DR-beta-1 chain 


74/ 


77 


13707 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


110 


35 


13708 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


150 


56 


13709 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


89 


77 


13710 


X70775 


Chironomus 
cingulatus 


So 12 gene homoloeue 


85 


38 


13711 


X80265 


Hordeum 
vulgare 


structural protein 


96 


41 


13712 


AB007922 


Homo sapiens 


KIAA0453 protein 


147 


64 


13713 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


124 


60 


13714 


Y02749 , 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


108 


65 


13715 


AF116661 


Homo sapiens 


PR01438 


120 


77 


13716 


AF118086 


Homo sapiens 


PR01992 


127 


64 


13717 


AK024455 


Homo sapiens 


FLJ00047 protein 


93 


56 


13718 


AF1 16661 


Homo saoiens 


PR01438 


137 


55 


13719 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLT1. 

A«v^^A AW A A • 


93 


59 


13720 


AF1 19851 


Homo saoiens 


PR01722 


94 


58 


13721 


G02538 

WA*»^«# V 


Homo saoiens 


Human secreted protein* SEO ID NO: 
6619. 


124 


45 


13722 


Y27868 


Homo saoiens 


Human secreted protein encoded by 
gene No. 107. 


117 


58 


13723 


AF161361 


Homo sapiens 


HSPC098 


115 


50 


13724 


Y95829 


Homo sapiens 


Native human Tie receptor signal 
peptide. 


108 


100 


13725 


AF1 18086 


Homo sapiens 


PR01992 


166 


75 


13726 


AF 116636 


Homo sapiens 


PR01488 


95 


70 


mn 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


70 


13728 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6613. 


149 


60 


13730 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4754. 


118 


63 


13731 


U33547 


Homo sapiens 


MHC class II antigen 


123 


69 


13732 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


135 


88 


13733 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


138 


75 


13734 


AF026204 


Caenorhabditis 
elegans 


C30E1.1 gene product 


102 


51 


13735 


Y12950 


Homo saoiens 

A AW AAA IIUwlVllv 


Amino acid seauence of a human 
secreted peptide. 


117 


63 


13736 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


112 


87 

• 


13737 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


115 




13738 


AF130089 


Homo sapiens 


PRO2550 


122 


70 


13739 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


124 


74 


13740 


AF130051 


Homo sapiens 


PRO0898 


85 


70 



503 



WO 01/64835 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% . 

Identit 

v 


13741 


AF093748 


Homo sapiens 


KH type splicing regulatory protein; 
KSRP 


91 


51 


13742 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


150 


77 


13743 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


124 


82 


13744 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


91 


48 


13745 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


116 


64 


13746 


AB001684 


Chlorella 
vulgaris 


ORF54d 


70 


66 


13747 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


140 


60 


13748 


AF090901 


Homo sapiens 


PRO0195 


92 


39 


13749 


AF090895 


Homo sapiens 


PRO0117 


148 


67 


13750 


AF1 19882 


Homo sapiens 


PR02492 


91 


45 


13751 


G02532 


Homo saniens 


Human secreted protein SEO ID NO: 
6613. 


109 

A 


60 


13752 

A ** 9 +f 


G00397 

. , i 


Homo sapiens 


Human secreted protein. SEO ID NO: 
4478. 


129 

A mm ^ 


65 


13753 

A. ^0 9 *r ^ 


AF161356 


Homo sapiens 


HSPC093 


99 


48 


13754 


Y02671 


Homo saoiens 


Human secreted protein encoded bv 
gene 22 clone HMSJW1 8. 


124 


70 

9 V 


13755 

A\ W 9 v v 


G00397 


Homo sapiens 


Human secreted protein. SEO ID NO: 

A- A MIAMI 1 ■ WW* \^ IVU Ul VhVUli KJA*J^£ AM-^ ^ V » 

4478. 


120 

i 


51 

A- 


13756 


AF130089 


Homo sapiens 


PRO2550 


142 


96 


13757 


AF130089 


Homo sapiens 


PRO2550 


132 


86 


13758 


G02532 


Homo saoiens 


Human secreted protein SEO ID NO: 

A *%JLaaa\AAA WIW Ui VlVUl* KJA^^^F A Am* 11 V» 

6613. 


163 


50 


13759 


Y 1 9609 


Homo sapiens 


SEO ID NO 327 from W09922243. 


92 


60 


13760 


G02832 


Homo sapiens 


Human secreted protein. SEO ID NO: 
6913. 


154 


52 


13761 


U62040 


Elephantulus 
edwardii 


reverse transcriptase 


134 


51 


13762 


AB001684 


Chlorella 
vulgaris 


ORF49b 


100 


45 


13763 


AF1 18082 


Homo sapiens 


PRO1902 


129 


50 


13764 


AF220264 


Homo sapiens 


MOST-1 


129 


66 


13765 


R59842 


Homo sapiens 


ApoE4Ll protease. 


135 


50 


13766 


AF220264 


Homo sapiens 


MOST-1 


115 


71 


13767 


G00397 


Homo sapiens 


Human secreted protein, SEO ID NO: 
4478. 


139 


73 


13768 


AF130089 


Homo sapiens 


PRO2550 


97 


34 


13769 


U80739 


Homo sapiens 


CAGH26 


564 


100 


13770 


G03714 


Homo sapiens 


Human secreted protein. SEO ID NO: 
7795. 


128 


55 


13771 


R10755 


Homo sapiens 


Non-A non*B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC253 3. 


102 

• . 


68 

• 


13772 


AE004507 


Pseudomonas 
aeruginosa 


hypothetical protein of bacteriophage 
Pfl 


93 


44 


13773 


AB044885 


Canis familiaris 


dopamine receptor D4 


78 


52 


13774 


AF026246 


Homo sapiens 


HERV-E envelope glycoprotein 


108 


53 


13775 


S71805 


Homo sapiens 


RNA-binding protein=TLS/FUS-ERG 


126 


100 


13776 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


156 


82 


13777 


AB015727 


Mus musculus 


truncated granzyme M 


86 


37 


13778 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


74 


52 



504 
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SEQID 


Accession No. 


Species 


Description 


Smith- 
Waterman 

▼ V Hill IMfl "J 

Score 


% 

Identit 

y 








6913. 






13779 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


137 


61 


13780 


M14123 


Homo sapiens 


neutral protease large subunit 


246 


55 


13781 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


99 


67 


13782 


AF132972 


Homo sapiens 


CGI-38 protein 


902 


99 


13783 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6550. 


147 


63 


13784 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


68 


13785 


AF1 30051 


Homo sapiens 


PRO0898 


138 


72 


13786 


AP000616 


Oryza sativa 
* 


similar to RING-H2 finger protein 
RHAla (AF078683) 


123 


95 


13787 


Y64890 


Homo sapiens 


Human 5* EST related polypeptide SEQ 
IDNO:1051. 


92 


53 


13788 


U 12206 


Homo sapiens 


unknown 

™»A*AAAA^^ ▼ V A V | 


84 


42 


13789 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


118 


58 


13790 


S79410 , 


Mus musculus 


nuclear localization signal binding 
protein i 


102 


50 

* 


13791 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


137 


76 


13792 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


104 


72 


13793 


AB006006 


Bostaurus 


neurocalcin alpha 


1000 


100 


13794 


U63332 


Homo sapiens 


super cysteine rich protein: SCRP 

mm *A pr mm m ^m J mm Ml mw A a a a mm a mm m^m a a A » m mm ^m ^» A-A 


183 


95 


13795 


W34499 


Homo sapiens 

A Ar^^AAA ^m ^rVV-B^ A^^AA^^ 


Obesity receptor C protein. 

^mm mm ^m'mm A w A ^m* ^m^^m* Br A mm A mm aa A • 


120 


85 


13796 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


127 


67 


13797 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


182 


83 


13798 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1$. 


112 . 


71 


13799 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


91 


84 


13800 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


105 


67 


13801 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


154 


72 


13802 


API 19900 


Homo sapiens 


PR02822 


131 


50 


13803 


AK024435 


Homo sapiens 


FLJ00025 protein 


117 


95 


13804 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


142 


53 


13805 


G02532 


Homo sapiens 
* 


Human secreted protein^ SEQ ID NO: 
6613. 


109 


48 


13806 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: * 
4478. 


98 


66 


13807 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


98 


66 


13808 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 1 02, variant a dna, 
complete virion genome. 


150 


54 


13809 


M37679 


Mus musculus 


Ig heavy chain precursor 


70 


100 


13810 


AL451015 


Neurospora 
crassa 


putative protein 


96 


55 


13811 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


88 


77 


13812 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


128 


71 



505 



WO 01/64835 PCT/US01/04927 

» 



&0n> 

NfV 


Accession No. 


Species 


Description 


Smith- 

Wf A (arm n t% 

Scone 


Vo 

laennt 

y 


13813 


AF 130089 


Homo sapiens 


PRO2550 


93 


75 


13814 


KOI 664 


Drosophila 

* 

melanogaster 


Bkm-Iike protein 


119 


60 


13815 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130~ 


64 


13816 


AJO 11435 


Blackstonia 
imperfoliata 


maturase 


93 


48 


13817 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


130 


76 


13818 


AF1 19851 


Homo sapiens 


PRO 1722 


141 


50 


13819 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


74 


60 


13820 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


76 


72 


13821 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


135 


50 


13822 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


95 


52 


13823 


L10908 


Mus musculus 


Gcapl gene product 


103 


50 


13824 


G02538 


Homo sapiens 


Human secreted protein. SEO ID NO: 
6619. 


146 


62 


13825 


G00376 


Homo sapiens 


Human secreted protein. SEO ID NO: 
4457. 


125 


83 


13826 


LI 0908 


Mus musculus 


Gcapl cene product 


96 


43 


13827 


G02971 


Homo sapiens 


Human secreted protein. SEO ID NO* 
7052. 


102 

A WW 


64 


13828 


G02532 


Homo sapiens 


Human secreted protein, SEO ID NO: 
6613. 


135 


53 


13829 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


142 


70 


13831 


AP000060 


Aeropyrum 
pemix 


1 0 1 aa long hypothetical protein 


79 


50 


13832 


AFl 19900 


Homo sapiens 


PR02822 


94 


46. 


13833 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


145 


40 


13834 


AF2 18028 


Homo sapiens 


unknown 


117 


57 


13835 


LI 0908 


Mus musculus 


Gcapl gene product 


79 


45 


13836 


Z26876 


Homo sapiens 


liposomal protein 


129 


93 


13837 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


124 


76 


13838 


AF044311 


Homo sapiens 


gamma-synuclein 


603 


99 


13839 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


100 


82 


13840 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


154 


46 


13841 


G03714 


Homo sapiens 


Human secreted protein. SEO ID NO: 
7795. 


122 1 

At 


64 


13842 


G00354 


Homo sapiens 


Human secreted protein. SEO ID NO: 
4435. 


92 


62 


13843 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 

***A*A» A*" * ^ *^A^ A**A ■ ■ ^^W^ ^^»A A A A#£kA^^AA A 

RD protein (nuclear rna-binding 
protein) | 


93 


60 


13844 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


63 


13845 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


145 


79 


13846 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


102 


50 


13847 



G03469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7550. 


101 


68 



506 
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SEQ1D 


Accession No. 


Species 

■ 


Description 


Smith- 

waternitn 

Score 


% 

identic 

y 


13848 


G03240 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7321. 


107 


43 


13849 


U05313 


Trypanosoma 
brucei 


CR3 


98 


40 


13850 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


100 


65 


13851 


AF159055 


Homo sapiens 


leucine zipper-like protein 

MM A . 


103 


55 


13852 


R59842 


Homo sapiens 


ApoE4Ll protease. 


99 


71 


13853 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


142 


65 


13854 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


79 


13855 


R59842 


Homo sapiens 


ApoE4Ll protease. 


93 


69 


13856. 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


114 


76 


13857 


W48353 


Homo sapiens 

• 


Human breast cancer related protein 
BCFLT2. 


94 


57 


13858 


U63332 


Homo sapiens 


super cysteine rich protein: SCRP 

mm Wmm mm> mwm+ mw J ir mjmw mm*** mw mt my mw » m WW • m^mt m ^ 


105 


58 


13859 


AF289022 


Homo sapiens 


fonnimmotransferasecyclodeaminase 
form C 


467 


100 


13860 


AF078844 


Homo sapiens 


hqp0376 protein , 


488 


100 


13861 


AB032436 


Homo saoiens 


brain-SDecific Na-denendent inorganic 
phosphate cotransporter 


2968 


100 


13862 


G02514 


Homo sapiens 


Human secreted nrotein. SEO ID NO: 
6595. 


91 

mW m, 


72 

/ mm* 


13863 


AF130079 


Homo sapiens 


PR02852 


131 


63 


13864 


Z29701 cdl 


Homo saoiens 


29-MAY-1998 Wild-type human c-Src 
tyrosine kinase cDNA. 


2380 


100 

* mm* mr 


13865 


L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


80 


33 


13866 


AL050318 


Homo sapiens 


dJ977B1.5 (myosin regulatory light 
chain 2, smooth muscle isoform) 


904 


100 


13867 


U37690 


Homo sapiens 


RNA polymerase II subunit 


358 


100 


13868 


X13923 


Homo sapiens 


cytochrome c oxidase subunit VIb (AA 
1-86) 


491 


100 


13869 


LI 3848 


Homo sapiens 


RNA helicase A 


6669 


99 


13870 


G03172 


Homo sapiens 


Human secreted protein, SEO ID NO: 
7253. 


133 


46 


13871 ' 


K02064 


Bos taurus 


cytochrome c oxidase subunit IV 
precursor EC 1.9.3.1 


96 


72 


13872 


D3I763 


Homo sapiens 


ha0946 protein is Kruppel-related. 


4606 


96 


13873 


Z14014 


Nicotiana 
tabacum 


Pistil extensm like protein, partial CDS 
only 


83 


43 


13874 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


117 


70 


13875 


Z52203_cdl 


Homo sapiens 


17-SEP-1998 Human PR0217 protein 
encoding cDNA, UNQ191. 


2135 


99 


13876 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


106 


64 


13877 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


143 


50 

• 


13878 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


104 


60 


13879 


L25404 


Brassica napus 


cyclin 


124 


42 


13880 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


139 


59 


13881 


AC003113 


Arabidopsis 
thaliana 


F2401.6 


70 


64 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 

VTaternian 

Score 


Idennr 

y 


13882 


AB000098 


Rattus 
norvegicus 


MIPP65 


905 


50 


13883 

• 


Z95114 


Homo sapiens 


DK212A2.1 (TNF-inducible protein 
CG12-1 (similar to apolipoprotein L)) 

v * Mr a jj 


1639 


100 


13884 


AF132984 


Homo sapiens 


Nuclear pore complex interacting 
protein NPIP 


551 


82 


13885 


AF121862 


Homo sapiens 


Sorting nexin 13 


1453 


99 


13886 


G02493 


Homo sapiens 


Human secreted protein. SEQ ID NO: 
6574. 


83 


69 


13887 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


109 


77 


13888 


AF217197 


Homo sapiens 


FBP interacting repressor 


2725 


99 


13889 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


187 


94 


13890 


AF 118086 


Homo sapiens 


PRO 1992 


85 


62 


13891 


M58664 


Homo sapiens 


Signal transducer CD24 


371 


97 


13892 


AK023443 . 


Homo sapiens 


Unnamed protein product 


125 


45 


13893 


AB040972 


Homo sapiens 


KIAA 1539 protein 


2271 


99 


13894 


U03750 


Escherichia coli 


DeaD 


95 


48 


13895 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


73 


53 


13896 


AF078851 


Homo sapiens 


Secretogranin III 


2384 


99 


13897 


U62039 


Elephantulus 
edwardii 


Reverse transcriptase 


109 


48 


13898 


X07816 


Human 
herpesvirus 4 


Epitope C13 (57 AA) 


55 


53 


13899 


AF1 16661 


Homo sapiens 


PR01438 


146 


48 


13900 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


77 


60 


13901 


Y28643 


Homo sapiens 


Human serine protease inhibitor from 
cDNA clone HETDK50. 


2191 


100 
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TABLE 3 



SEQXD 
NO: of 
nucleotide 
sequence 

** 


, SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

» 


CPA WT\ 

SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartie Acid, 
E»Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metnionlne> 
N=Asparagine, P=ProIine, 
Q=Clutamine, R=Arginine, S^Serine, 
T«Threonine, V=Valine, 
W=*Tryptophan, Y=TyrosSne, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1 

i 


13902 


A 

> 


1 


114 


434 


At FKCVEGMFRlAMVNVCFt/SSflSLLi! * 
PLTY/ GVYDEWTHFAYMTIDLLEIPITO 
SHPWLNALFCLEAP \ WISFNTGSFAYP 
VYPKSLIAHDFAVEATMPYIRLSSr 


2 


13903 


A 


2 


124 


466 


KSNIFNLGDCGWESLFNR\QSWRSSLAV 
XNDTYSSKKSNABTFTFHADLCTLSDKD 
RPITIQTALAELVKHKPKATYEQLIAVL 
DEP/ANPLKKWWKAYDKENLFCEEG*KL 
CAASN 


3 


13904 

* 


A 


4 

■ * 


1 


427 

■ 


EGFLELLRTRNHSNSQLQLTTGIGLFLN 
EGLKLVDKFIjEDV* K* YHSBTPTVNFSD 
TE * AMKH IND YVEKGTQGKTVDLVKELD 
RDTVFDLANYIFFKGKWDRPFBVHDTEB 
EDFHVDQVSTVNBPIMKLLS \MLNIHPC 
FKL 


4 

■ 


13905 


A 


5 


1 


464 


KI KS F YAS KDT I KRMR VTDWKKI FAY * I 
SDKELIFTIiRTLKT* * K*GKQPNLKNGQ 
EI*VPISPQDIQIAHK\HLEGWSTSLVS 
ELAPCEAP V/RHP LTGIiTI AGLQG FGEA 
GRLVRGRWGC*W/VHPF*KYI /WQFLSK 
LHISLPYDPTTPLLGTCSR 


5 


13906 


A 


6 


308 


3 


HFVTHS KHDIiAIAHI/G I Y/ PREMKT* VH 
TKTCT* I FTVALS VIARAWNQPGRPLCS 
E WL\KYMVHTME * HSAI KRIiNYRYKNNC 
VNLFLGI TLS EKSQTQNVI | 


6 


13907 


A 


7 


587 


2 


FLTRETGDPTGRSSSHANTQSRFFPDDP . 
PG\ PLNNLGNTHGCGRRAGRCPGTGPDG 
P\AGOGGPRCWPSGHLAATC3)*GPSCX3R 
LGANRGEAG PAGFTACS PLSGCRTP YTH 
HFPASRMS CHLNCAS PRTYRSQGNRGCB 
RVAQGSQGAGGERGAKSQVPVPAPARNK 
DPAKCRKPRNRRPGNSGPWRAYRRQR 


7 


13908 


A 


8 


1 


474 


R I L.NEEHGKYEGLHE * EVKWHLY I KS PA 
FTDLHLCYQKDMNG I STSASS PAVGTVG 
MDMDEDDDPSKWNFYYSPHSYPDK*LTI 
FKTESRVRESDEVTQ I KVNWDEEVI SGL 
LTSLKDNVLKATGVLYDYAYK\YIiCEHT 
RSTTiKEESLKLERNLQNH 


8 


13909 

• 


A 

• 


9 


3 . 

• 


539 

■ 


SQCSPF I S PACSLTALEEETEAIiRVHPR 
LCLSPNLAPSSGPPRPPELAPCPPSSQA 
GLRTCHSWVKGLHQPLPVASGMKSTFCN 
KTYTCPYPPP/PPLCPNHSPNALTLPDS 
VTHAVPPB/L*SPSAPPSSTA*ILGSPS 
\CGASPCNHPHSHPGICPTPPGLWPVCP 
CAP RAWQRDGTRQ T 


9 


13910 


A 


10 


2 


453 

• 


RL*LGLEYALLVWGTPKV*H*GGFPIYY 
Y I VLLLS YALHQVTE YSMYVS I MAFNAK 
VSDPLIVGTYMTLIjNTVSNLIGNWPSTV 
SLWLVNPLTVKECV* TSYQNCCTPDAAE 
LCKJCLGGSCVTALDGYYVESI I CVS IAF 
V/W*VFLVHKFK 


10 


13911 


A 


li 


20 


475 


KMG VPP LLMS D PNR FLFP KNFIiRE KT I S 
PPKTF*PLKIWVKGQWVLNFLGFPGFKI 
FFPVFKFFFFFFFF / RDRVSLYHPGWS A 
VSQSEftTAALTSPGSGDQVILPSQPPK* 
ENHLNIiGGRGCSEPRIiPRAEFLDLRS FS 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

• 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 


1 SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide. 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G%=Glycine, H=Histidine, Wsoleucine, 
KHLysine, L»Leucine, M«Metnionine, 
N^Asparagine, PHhroline, 
Q=G)utamme, R-Arginine, S=Serine, - - 
^Threonine, V=Valine, 
W=Tryptophan, Y=*Tyrosiiie, 
X=Unknown, *=Stop codon, /-possible 
n u cteotide deletion, ^possible 
nucleotide insertion 














IIVIFP 


11 


J3912 


A 


12 


392 

■ 


63 


H IRADPGLEPR PSARTGLGPELG CCTMN 
KL6DAGSATQSL6GSQLSWQLSRREQBL 
EQARWEAQ*QVBTLGRVAREKEALAKBH 
AGLAVQIAA\AEHEGRTVSEEATHLQ 


12 


13913 


A 


13 


18 


338 


APHPQYIiQMPPMLLPPRTGPGQFSLPS S 
RHGGHLE/GKEHETSVTLCGGEPPPQTP 
PRDPDPGP*ARRAPCPRRPT*AHPRALS 
RAAPQE PRALAGPRARHP I CPGSL 


13 


13914 


A 


14 

i 


2 


371 


TTKQ*KDNPIQTGAKDLKRHFCKEDMQM 
ASNHT\KRSLTSLVSREMHIKTTRMAGI 
KKSDNNKHW*GRGE IRT/LLRCWWDCKV 
VQLLWKTVWQFL* GLYQYI PHDPVTSRL 
GISLILYIALRTF 


14 


13915 


A 


15 


443 

- 


1 


SRTFLDTDMKSMRGFKAS *NRQTLLLGA 
NAAGD\ KLKAMLTNHSENRKILQNYVKC 
TLPMLYKWNNKAWMTAYLFTTQCTEYFK 
PTVETHCSEKKISF\KILLTIYNAPGHL 
RALMQIYKEIHWFVPVNIPSILQPIVQ 
NVISTFKS 


15 


13916 


A 


16 


375 


38 


HQQNGFLKKTDPTLLCLQBTHFR / CKDT 
KRLKVRRY/QPNSNQtoAR/VPILIEDK 
IDFKTKKI FMMI KDLTIINI * ASNTRVP 
KSMKQKLAGLKEEMDNSVIMVGGFSYPV 
SIRK 


16 


13917 


A 


17 

* * 


448 


189 


NRDRVSIiCCPGWS * TPGLKRSFHLGLPK 
CWDYRRDIY*LL\FARHTQMISTHS*QT 
TNTCS YPAF* KSR P * E PGYNTTHTPHS S 
DML 




13918 


A 


18 


1 


426 


GMSHHARSLI INQLFKKCSTLFVL/REM 
QIKSLLASSSSSSSRNSVSGQGFETVGT 
HCGQKCKLVQPF\ WJCAVW * YLLKLNVFI 
LFLIYNRNAHLEDTCENVDRALFVIVKN 
WKLSKYPPGVKWRIKLWYSHSVESSTAV 
LPKV 


18 

* 


13919 


A 


19 


2 

• 


423 


YFETFQPLLRRLGYQGTFFFP \KPWSS C 
LGIEYNSGPDSCA* FFLQNQ I RL VNS AN 
IRLMAMTLKTNQVAIAQFLECKESDQQF 
CIGVTHVKARTGWA* F * SAQGCDLLQNL 
QNVTQ/GAKIPLW*GDFNABPTQEA*K 
HF 


19 


13920 


A 


20 


10 


443 


LKVDSGDSEVRYVFILQHITLLMCSAYM 
NQLLN I FVRPSLLAVAIiHMTPGFTKEDV 
YSCFRFLRDVFADEFIFLPGNTL*DFEE 
SCYLLCKSEAIQVTTKDILFTEKGNTVL 
* FLVGLFKP F/VBS YH 1 1 CKSLliDEK*A 
PFIEEP 


20 


13921 


A 


21 


11 


426 


VLVETNNLRMGQVTM * PELPDM/ S PDAW 
TLSDSPSQKIGHAQQ\KYSIIKWKWYTE 
DWAQACLEDTSKYEQVTQIPMAPNDATL 
/ PSS/ AHLAFTAS * GAP3DS * LRNTIiGL 
ICDGST\KPSSTN* KQTVIALQSHAGLN 
MKEG 


21 


13922 


A 


22 


1147 

» 


1768 


QLGMSHGY S S KSMP QKLMCFLFNHHLQK. 
GHECLPKVLNSNP PPII KYLALQDLMJiL 
SQYSPSRRQEVFSLSQPGGHPHNWTAIS 
RE CLNLLNGMTQKL I L YQEAAATNGRVS 
SSYPVBPKKLNSPEETAFQTPKSSQMPR 
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SEQID ' 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in . 
USSN . 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nude* 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine . 
OCysteine, D=Aspartic Add, 
£=Glutamie Add, ^Phenylalanine, 
G=Glycine, ENflistidine, I=Isoteucine, 
KHLysine, L=Leucine, MMMetfaionine, 
N=Asparagine, P^ProIine, 
0=Glut8mine> R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptopban, Y=Tyrosine, 
X=XJn known, *=Stop codon, /^possible 
nucleotide deletion, \=possibJe 
nucleotide insertion 




t 

* 


# 


< 

• 


» 

• 


•i 


PSVPPLVKTSLFSSKL3TPDWSPFGTP 
FGSSVMNRMAG I FDVNTCYGS * AT * HS E 
W\LLNLSLLYHVWLOGVELLTTWYVSWI 
LFKIYATKAHVFPVQPPFAEGS * VPSKS 
VK*QSSP\LIKYLALQDLMLLSQYSPSR 
RQEVFSLSQPGGHPHNWTAISRECLNLL 
NG^KLILYQEAAATtfGRVSSSYPVEP 
KKLNSPEETAFQTPKSSQMPRPSVPPLV 
KTSLFSSKLSTPDWS PFGTPFGSSVMN 
RMAGI FDVNTCYGS PQSPQLIRRGPRLW 
TSAS 


22 


13923 

> 


A 

4 


23 


421 


3 


KI IFRAFKG* KSPSMPGFKA* KDRLILL ! 

LGTNADNDFKRKPM / LLYHS KNPRALKN 

YIKSIQPLYE*KKA*MTAHLLKVWVTEY 

FKPrVGTYC*KI/SFKILLLNDDAPG\H 

PKALIEMYRSINIFMSANTLFILQPMEK 

GVJ 


23 


13924 


A 


24 


226 


2 


THERTHSKIIHVIIIKITHPLNPSILRP 
QTTA* IKWRDLGSLQPUP/PG/LKRFSY 
LILPSSWDYRCPPLRPANFCIF 


24 


13925 

* » ' 


A 


25 

♦ 


2 


611 


FFFFLLIA3LLHQ I PD V5PTGKYTTLLPL 
MI ILMISGIKEVI Y/DHK*HGRQNS/ VR 
NTKLL * QDS WDTFKWKEVNVGDTVKASN 
GELLPADTVPMC\ Y IATSNPDRETN/VK 
TRQALPETASV 


25 


13926 


A 


26 


1 


443 

* 


ATQWRPSLVPASAENVNKARS FAAGIHA 
LGGTN INDAMLMAVQLLDS INQDELVT * 
GS VLL I I LLTDGE P T AWETNLMS IQNNV 
REAVRGRYSLFCLGLGFDVCYAFLBKLA 
LDNGGLARRI HEDSDS ALHMQDFY\QEV 
AHPLVTAV 


26 


13927 


A 


27 


2 


359 


KAQSKQWLPRGGSGPVK/ARVDEESAKV 
IVRVYWEAQGILLIDFLKGQRRTIYAYS 
ENILRKPEL* * RNAWGS FSRVLHHDRAP 
AHFSHQ/ TRAIG/REF* * KTIRHPHY/S 
PNLNPLDAFCF 


27 

. 


13928 

• 


A 


28 

• 


2 . 


427 


WRKTVKGHFTDQCRKHKAM/LGN* IHFL 
E YHIHKHS I QYSGI QATQEKFGGLKTS Y 
CSKD*YITVLDLTLKISNKDVSN+HENP 
ISMYKN/ IPTVIYGSKVIKDRKLKA* I* 
CP/HRNKKF* YFDYFYVLLNTMTFLKLW 
RIITEI 


28 


13929 


A 


29 


47 


412 


HCDVLLASSRYTC I LP YSDRDDGPQDQL 
KMS VDFRSRRTGS / WFPQNSWGHMGVCG 
WGGAGRTLDL I HLR I PMRGLRSGGFLCR 
RKLVSESYG*EPSPL*K*KGGWGSEPS/ 
LTTVPSQL 


29 


13930 


A 


30 


336 


419 


RLECSGMISAYCNLSLPGLSDPSNSASR 


30 


13931 


A 


32 


1 


408 


KIALKLRSNYSKISGY/MANIQKXiIIFL 
YTSNEQIEFEIKNTIPFTLAPPKIK*LG 
TNLTKC \ VQE LYE K I YKS VMKN I KELNK 
WRSSYGKGKSKIKIIKINK*MDIACSWL i 
RRIiKIVKISVLHNLIYRFNIVPIEIP 


31 


13932 


A 


33 

» 


412 


2 

* 


QGHLSLQKF L \ LP FVQ LC P APRGGVYRG 
RQDSLSCGGLHPF*ASGLLCLPTQASAM 
VGTPPPASLLLCSS ILDCCASNERGSVG 
VE PPEPGTGHNLLVCRLLRPLE KRS I RW 
KSAVGVARFSRCRPSWLPLARKKNSP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino add sequence (A=Aian ine 
C=Cystelne, D=Aspartic Acid, 
E=GJutamic Acid. ^Phenylalanine, 
G=GIycine, H=Histtdine, Wsoleucine, 
KNLysine, L=Leucine, M==Methionine, 
N*=Asparagine, PHProline, 
Q=GIutaroine, R=Arginine, S=Serlne, 
T=Threonine, V=Vaiine, 
W^ryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


32 

w *mW 


13933 


A 


35 


304 


33 


KVWGEKVWYWQKMTQIVQWDRTESPQID 
N*SLTKEIQWRKDSLFNKW*GNNWTAPF . 
SS/RSLNhNKDLTAVTKIKSKWVTDLNV 
KHKTIKLL 


33 


13934 


A 


36 


2 


424 


SKTYS IGHFTYEGKGTITSLWG KVNVED 
AVGETLGRLLVEYPWTQRFFDSPGKLSS 
ASAIMGNPFVKAHGIiFOSLTSLGDALQHL 
DDLKG\TFAQLTELHCDKLHVDPBNFFL 
LG*VLVTVLAIiHF/SRRFTP/ELQAS/WT 
PKM 


34 


13935 

i 


A 


37 


2 


433 


NKRL PGPG P* KRPNPPGEGGPAR YFNPL 
GGPNRBIPLGPEV*THFGPGMRPPVFLK 
I LKISGAFWGP PVGPPTWGG * EGLNPRG 
PGYN*P/R/PPAPPPWG*SQAPFPKPAP 
PP/TGIKPNF 


35 

• 


13936 


A 


38 


3 


425 

• 


GAAQLLP VSL P S AQRAIDLQ I LTTP WEN 
WTS IASLQHKTTIGRSWLITGPISATSS 
TPTSCTVASP\TRGHV*RSASWKYDVLG 
HSSHMWWHTAVYLGEAFHQVHATGS CH 
HRVLSCCPITGRSGRDAL*QLLPL/HAL 
TEA 


36 


13937 


A 


39 


2 


463 


LNEVRDI KLSSDHWPSKraS FLHS PGFL 
SRFBPQPASVA/PRP*SQQQSLPGKAST 
SLWPPNPVPFVTSSIiSLSALPGLFLWLP 
SIPPLPGSPFFSPB*PEVPLFPGPTYHN 
PSPPD/PTVLEAHPDQAPIiP\PGVPTAE 
QRPTPAP*AHRPS /LPIjPP 


37 


13938 


A 


40 


223 


408 


RNTV*NIGTDKDFMT*TPOAIATKAQID 
KWNL I EPMS FC/ T YRETI IGVTDR YTRD 
KIFESGR 


38 


13939 


A 


41 


3 


427 


NLKMKSRTSKENIGEHIYDFDIGRFVNT 
*SKPERKRLSFKTSVQEKTP*L*GKHK* 
MRLEP \LPYPKINSKWIKDNVRSKAIKL 
LEENRSESSIWQRML/SIMPKAQAIKEK 
VDNLDFFKI KNCVS KDTIKKAIiDRPSER 
EKI 


39 

■ 


13940 


A 

* 


42 


30 


448 


FEMRB 1 1 VAFHS ILI R YHGL IWLRKFQ * 
MDRRYS KEVQDLLETMKLFARI VGPL*H 
DKFIESHALEFELLREIKRLQEYRTAGI 
TNFCS ARTYDHLKKTR * EERLRRTMLSE 
VLQYIQDSNACQQWLRRETY\ IESGPNP 


40 


13941 


A 


43 


1 


928 


liRQRMPPQGLQPCPWPPRYP/ PDSPLGC 
PGGLA\PPNSPSWPKLASLAGSRGSSAP 
PGVWGLVATGA*PLQRPSAPPPCPR/LC 
PGREPCMCP 


41 


13942 


A 


44 


2 

■ 


409 


SKSPDPTQQPLRGGSLTHSAPGPSLSQP 
LAQLTPPAFGPSTSSLFNLQKSSLSARH 
PQRKRRGGPSEPTPGSRPQDATVHPAOQ 
IFPHYTP\VWHILGPQRHTP*SVB/HPG 
LDKRLLPETPGPCYSNSQPVWLCLTP 


42 


13943 


A 


45 


3 


136 


LPMTLFTELE*ATLRFIWNHITIQIAKA 
ILTNQK\NKARGITNIC 


43 


13944 


A 


46 


365 


2 


AWGSLRAPRS AVPE PGTPPVMGNALAHR 
QFSPCLDGLSCPSFLRG*NSPPHPGSPG 
LPKHA*APAAVSPWVDPDRTVQPPPPAP 
PST\D* PHCSPPCTPWCPRRLGSAPVMP 
GCPTASYPRR 


44 


13945 


A 


47 


1 


456 • 


AELSELYEESSDLQMDVMPGEGDLPKW\ 
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SEQU) ' 
NO: of 
nucleotide 
sequence 1 

■ 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 

• 


SEQID 
NO: in 
USSN . 
09/515,1 
26 

• 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine . 
OCystefne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GJycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, M=Methtonine, 
N^Asparagine, P^JProIihe, 
Q=Glutainine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possib!e 
n ucleotide deletion, V=possible 
nucleotide insertion 














EEATGTRAAI PWVPPATOAQQIiEBEGPM 
EE\BEAQPMAAPEGKRSLANGPNAGEQP 
GQPS PGRR TSRAEDEA \BBFDDWBDD YD 
F\PREEPVKGARLRFLPPS * KTPPS FWE 
NRNTPLWGGLKI FY 


45 


13946 


A 


48 


116 


442 


PQNMGMERKPGPPKGLSWASGDLEDNLG 
VMSVGFLLSSPDDAVIWRGPRKTGLIKQ 
FLRNGTGEKVDYFMGDTQLGTSDEHLSS 
VATWAKAHLGEPGAFTFPQRVSLRE 


46 


13947 


A 


49 


434 


3 


FTVPGPLLTVIPIPISKVPFKPSLPQIF 
NFTPFGFPFQIRV*TIIPISLRFFRKKP 
LFFFF+NKVPLCSPYWNS IARSNVPAAS 

vfqvXkesfylnllsvppqvplnvfltf 
ffffflgrdr/slpllprmvwnswaoai 

LPLWL 


47 


13948 


A 


50 


1 


482 


EKPYQCSDTOKAFNMKTQ \LVVHQGVHT 
GNNPYQCRECGKAFGRKEQLTAHLIAHA 
R *KP YGCS ECGKAFS S KS YLVIHRR1KT 
GERP YECSSCDRAFCGKSQLI XHQITES 
TENHYECNECENTYPRKASLKIHQKiDL 
GKKPF/ECNEWGKALLK 


48 


13949 . 


A 


51 


1 


470 


REFLWQEGHRAFATVDEAGEEVLQILDL 
YADVYEELLAI PVANGTKTDKDKLAGGD 
YTAT I EAF I CASGRAI QGGTSHLLGQNF 
SAMSE I VLEDPE I PGENQFAYANS WGLT 
TRTTWVMT IGHGDNMGL/LLTPRVA* VQ 
/MVI I PGGIPfCAFLKRQKT 


49 


13950 


A 


52 


26 


448 


SPGTEREYRIGQQSVTGVTSVDDSNSYW 
R IRGKS ATVCERGTPI K*GQP I RLTHVN 
TGRNLHSHHFTSPLYGN/QVAL*GDIVI 
IL*RRKQRLKGFTEEGIKLRFKEVSAFG 
DEGEGDYLDDWTVLCNGPYWVRDGEVRF 
NT 


50 


13951 


A 


53 


3 


495 


AMEVKAFAETHIRGFTLNDAANSRLIIT 
QVRRHYLKEAATTLKTVLDHQHTPSRIiA 
VTRVIQAIiAMKGDV\DNI*VFHKMIjNGL 
EDS I GLS KMDF I NN I ALAQ I * NSNLDAS 
V*HIENMLTS*NNVIEPQYFGSAYLFRQ 
VLEEQLEPTVEKISIMAERLAKPLQ 


51 


13952 


A 


55 


1 


428 


QERGTKKEMEDRMTIiEETK \ BQ X LKLEE 
KL*ALQEEKHQLFLQLKKVLHEEERRRR 
KEQSDLTTLTSAAYQQSIiTVHTGTHLLS 
VQGS PGGHNRPGTLMAADRAKQMYGPQV 
LTTRRYVGIAAAFAGTPEHGQFQGRPGG 

vyg 


52 


13953 


A 


56 


2 


453 


EDGDLDAFSSDBGLTMAMSYLKDDIFRI 
YITETQECRRYHRPPCftQEAPCNMVHPI 
VICYGCYGPEVGTRYTCSVCPDYDIiSSS 
CKGKGIiHRGHTKLAFPSPFGHIiSEGFSH 
SRWLRNVKHGLLRWS * WEMGPPGNWSPR 
TSCA/GEARIjGP 


53 


13954 


A 


57 


3 


435 

■ 


ELNTS I FRSRP I EGLGLNTVLTTDNSNL 
VI \NRIGIV\PSVTEKEYTDPSSDGTYA . 
WKIFSHETITKAQIIiKLFLSYDYAVNNP 
WLAYPHYKSPEKCPS I ILHDRLYYLNGL 
* CAASAMVMIAI VTYNVALLAYHRWNGIi 
TYMID 


54 


13955 


A 


59 | 409 


1 


LCCEHRGKTVCVPRGKFTTLNAS INKER 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
- peptide 
sequence 

• 


M 

eth 

od 


SEQID 
NO: la 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=*Tryptophan, Y^Tyroslne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 




1 






■ 




SKISVLSFRLMKLKKEYVK*K* \EKEKI 
NNRIRIEKINKAKS/ WPL*HNKIE*TIR 
M INHQKREN I QITD FRNERTELPPDSTD 
I KKAIRKY * X* FYPNI FYNPEEMNKF 


55 


13956 

* ■ 


A 


60 


1 


393 


GNVSSGGDHPCEG*LCP\PDKVMS *GI * 
VPEEACTQCIGEDGVHHQFLESWVPDHQ 
PCHICTSLNGRKDNCTTQTCPTGKASTC 
GLCEIiARLRQNADQCCPENECVCDPESC 
DLPPVPN* ERGLQSTLTNP 


56 


13957 


A 


61 


250 


1 


REDCKRVLYKDKVSLCCAGPCSVA*SCL 
KAASTFQ\VXHFSLQSSWVYRHAPPGLA 
NFFHFLQRQVLTTPPGLVLNSWVHAI 


57 


13958 

i 


A 


62 


365 


3 


IKKKLIWRLFTFPKPSWGKKTKKGIPFP 
+GPKKIPGIKFPQGIKKAFKGNR*TLGK 
GN* KKKKKKKECGPGGDIPCS * IG/RNI 
VKILIIPKAiySFNTIHIKIPKTFFTEI 
ETTILKFL*NH . 


58 


13959 


A 


63 


245 


558 


FLPTQVIS YVKRALAEGAQI * CGEGVDK 
LSLPARNQAG YFMLPT2ITD I KDESCCM 
TEEIFGPMTCDVPFDSEEDVIERANNVK 
YGLAATVW / S SNVARV 


59 


13960 


A 


64 


106 


532 


ERACQSGTSGQGGVPRRAIGAPR/E/DA 
FTGAVY I YHGDAGGI VPQ YSMHP FA*S L 
YPSGQ/SVAARGNLSGDSFNCHN+GIjQG 
L1*I*ASSV*RPVMLLNDPIIYTTKN/ISG 
PKCHE 


60 


13961 


A 


65 


2 


548 


VQKYPQQNRS CVPP VAE WAVPQS SRLKY 
RQLFNSHDKTMSGHLTGPQART I LMQS S 
LPQAQLAS IWNLSDIDQDGKLTAEEFIL 
AMHLIDVAMSGQPLPPVLPPEYIPPSF/ 
RRVRSCSGISVISSTSVDQRLPEEPVFR 
DEQQQLEKKLPGTFEDKKRENS * RGNLE 
LEKRRQALLEQQRQG 


61 

• 

i 


13962 


A 


66 


72 

■ 


952 

■ 


SSRT YTTS LiPLLKKEVTRKIRKY F X LNN 
N *NTTYHNIiNDljTKWFSGKLIiF * RSR * 
E YRKDKI S DLCFYIQMLE/QRQVI KPKV 
STRKEINKSRTQ*SGKETMERPKLEDKF 
LVNINNINNP**LNL*RKKEGHHCIFYR 
H* KDNKG YI YANNFNILDQMDKFFERYV 
TKMD\QEQ I VSLNIP IADKSNA* LNNFP 
SSSSSSSSSSSS 


62 


13963 


A 


67 


139 


422 


VNGNEFEPL* KGISRHEHRRQPHNGFRP 
KNKGGAKNQ*ASLGMKTPEAPAHSDKPR 
RRQHAAYSS*AMPFLGICISFSQCNL/C 
PPKLNV 


63 


13964 


A 


68 


274 


1 


NLKNKAVI TKTA*H * QKNTDQWNR I KNP 
EIN^HPFYGQLIFDKDAKNTW/WKR/DS 
NFNK* CWEN* ILTCKKMQLHP/SLTTNT 
KLNSQWIKDL 


64 


13965 


A 


69 


22 


419 


KI IGEDGDT/ PLSEMAKSQRQ I FSKDLV 
KLDSTINQVDI IDIYKLHHP\KWTVYTF 
FSSH*GTFSKIDHILLDKTYPNNL*RTG 
IMQHMLSDIKVFKLBISTRKVTEKS *HT 
MSLCNKTLEHPSGNEDASAYLK • 


65 


13966 


A 


70 


57 


423 


KDKNSQVTS EEE EQEGKI KENLNS WRDT 
R/CH/WIGRXWIVRLSFIiPKLIFIFOTV 
PIKIRTQFFMKLNKPVLKFIY*TKRTRI 
AKSLLKKK/ RQEGKIS PTNRRTYYEAYS 
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SJEQID ' 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to - 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A B Alanine 
0=Cysteine, D=Aspartic Add, 
E=GIutamic Acid, F=Pheny (alanine, 
(^Glycine, H=Histidioe, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionioe, 
I^Asparagine, PHProline, 
QsGlutamine, R^Arginine, S-Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TLBS I GASMPS LDLG 


66 


13967 


A 


71 

i 


394 


1 


KHFRCMLQEDCRILIiLWFFSBTGSPCAV 
SPRIBCSSKirVHSIiKLLGSSDPRASLS 
*VAGTIGVHHHTQP*GI/CNI*GSGRRR 
SEKQWIKEKNVSSEAKRRRTF*NE*SV* 
KGDYmDQGLRLGTVAHAY^ 


67 

■ 


13968 

* 

* 

i 


A 

■ - 


72 


485 


1707 

• 


SALGLSQPRPPGASDAAPDSTPAGV*TA 
SSSAPAGEKDANQMEWARRDPGPLHQAA 
PAPH* SHLVRDKRAQERLRKTSRGPPHS 
HRSGPVDLS PRS SHSRDSTPVHGPYFRS 
APD PRG S AP AFRG P I ATRGR / RPRAPRA 
TASHVPLSS S SWKE PEEATAPRF * ASAC 
MKWRLEAGCPGHRGVWPGPPSRRLFMWA 
TATGE KS TP S P FTG VWRFGVEACT* AGS 
PTAHIQHSQRTWQPVPPCTNGSRSPWTS 
S YPKNQ S KVGTVP KRGWAOQ P I PSTAVR 
QAT I SQ * ARKGAARPHHQARQVEHSTQ/ 
QRPSGDSGKFDQRSLCEASTLLQPERPE 
VQGKICKFRV/GERRRTASPNSAVPEKR 
TRTQTMVGTAPIGCSGKWRRVYSVRR/R 
SSPRIPPPGVGSS\PLKTPPPP 


68 


13969 


A 


73 


1 


462 


QDHRSKSSHSNKRPSLASSLSENFFQRA 
AKPLQVNKWKKLYSTPLLAI PTCMGFGV 
HQDKYRFLVLPSLGRSLQSALDVSPKHV 
LTKRS VLKVACRLLNALKS SLKMS WVYG 
RVT * KYLL* F/YRTTNRGLL * SMGFAFR 
YLPHGDNPAHLBRGN 


69 

r 


13970 


A 


74 

• 


307 


436 


LP*VGCITEGIPEQDMVRGKHKLLRQEK 
TRLIGPNCPGVINPGECKIGIMPGHIHK 
KGRIGIASRSGTLTYEAVHQTTQVGLGQ 
S LCVGIGGDPFNGTEFIDCLE I FVNDSA 
TEGIILIGENGA/NAEDNAAECL 


70 


13971 


A 

* 


75 


1 


435 


E I SDS KAQLAAMAL I 1 DTWBRMNCFSE * 
NHEPLRTHCALAASKLLKKPD\QAE\RE 
HLCTSlA WSGTNTDKNGEELHGGKRVME 
RLKKALI IAHQGMDPSLQVRVFIEILNR 
YIYFYEKENDAVTIQVLNQLIQKILEDIi 
PNLESWK 


71 


13972 


A 


76 


58 


366 


EFPDLVKDMNLHI QEAQCI PNKINLNKL 
MNRHRI X / RLLNTTTKKR I LKAAKQK*N 
I/T/IRGSSICMMMDFSSEITEDRRKWH 
S I *KILK/ EKTQNCLPRVFCPVKI 


72 


13973 


A 

r 


77 


1 


445 


YHETGC FLMGAIVD I TLTFNTYVHFQGK 
MKGFSLLADPQEFWVDNSTSMSAPMLSG ' 
MGT FQHWS D IH \DNLS VTHVP FTDSACL 
LLIQPHYAFDLDKVEGLTFHQNSLNWMK 
KLSSRTIHLTMPQIiAIjQGSYDIj*DLIiDQ 
A*LSDIIiTP 


73 


13974 


A 


78 


53 


444 


ERGGYGAGPVAWQFLVPSTAPMLQSPPL 
GFAIDHTPPVPAPAKT*APCPLPYAA*RT 
TGPHHIAHIiDTTFGTGDTPHPSSPASPP 
STPWLPPAFPCPLPTWD/RPPCLQPLLS 
SLPTPRIiPFLCFLLLAPYAP 


74 

4 


13975 


A 


79 


1 


353 


HIRTELDYYGIiTWI I YSDT * EAYNYI Y 
IMVT *NVYKPQLWNI FDRATMHSQDVRH 
HLLCIRLMLKNPKHHAV* /VLNGHYAFV 
SR/SFKHALVQYVQAFRTHPDEPLYSFG 
IGLTFIHM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

.. - ... 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to — 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D^Aspartic Acid, 
&=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N*=Asparagine, P^Proline, 
Q=Glutaroine, RpArgmine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


75 


13976 


A 


80 


1 


350 


LYFYALIjPLSSG/CVAYVATRDNCCILD 
ERFGS YCPTTCGIAYFLST YHI 2CVDKDL 
QTLBDILHQVEDKTS * FKQL I KAIQLTY 
NPDDSUCPNMIDAATLKSMKML* *IMTY 
EASILT 


76 


13977 


A 


81 


76 


386 


PA YPQVRGPAS TPASCIRP TNAR VLS TT 
PRGKSVAEAHSVSPSAHRGVTSVIKLWS 
AKRLK* YGA\KVRPNS \GCTP 


77 

* 


13978 


A 


82 

i 


1 


360 


ES I LQED ITVLNVGAPNNRAS KYMRQ / M 
LVELQRQIDES I IXCEDFNTLISEMDST 
RPKISKDIVGIiNGTIN*LDI IDI YRLL* 
PTTAEKTPFSSSRGAFT*MDHILCYETH 
IYKFKLHE 


78 


13979 


A 


83 


39 


344 


WGILEGGEYIWHVGGGKSLRASGGLWSQ 
PPRQPDDSRPLVLVPVCHLLPHWIGPTD 
LGHKRQ W / MG PG A VAHVCNP S TLGG * GE 
WIT*GREFETSLANMVKPC 


79 


13980 


A 


84 


1 


427 


QQAHLAGHHRIHTGEKP * KCEACDKVYR 
GKSSIi/MK/HRRIHTGEKAYKCEECHKV 
YSR/N/ SQTVKDRRIHTGEKP * KCKS \C 
DKAFGHD SHLAQHAR I HTGB RP YQCNGC 
GKAFSRQSTLVYRQAIHGVGKLY*GNDC 
HKVFSNAT 


80 


13981 


A 


85 


88 


307 


TWTQRRAKLVRRIGWALLPFPRPSGSKP 
PP*TPPALPPCVP/PQSSPWTPPQPPAP 
DSREVS KDWTQMR S FKEN 


81 


13982 


A 


86 


5 


375 


NSLLNTHSRGPGASHCTFWLHERASSRD 
LTGAESYGICRLRRLISLSIVFSKSIEG 
IiACI S I LFLFUFLFIjFFVKTGS wlccpg 
WPQAPELRQSSLLSIiLSSWNYRR* PPHQ 
LIF\DFFCRN 


82 


13983 


A 


87 


2 


342 


VIKNEDHYIMIKCLI*QEDQTIIjNIjHSF 
NKTNINT YKPHMTNLQKAVDKI TTTV* T 
CGTSLS I I AVCRLKLVKL/VEDLSNI IN 
KLDLM/DNI YKTLHLNIRDYTFLKHTWN 
IYKN 


83 


13984 


A 


88 


I 


338 


cne prs hhctpawrqs ktps kkkkq kkk 
ltts/cikasmksrvqg+lr/cwaqvmg 

**GGVLWIjFVK/REFFTLSINLSKEGHS 
RRVPCLGCXiKKKKKKKKKKKKKKKKKKK 
KKFF 


84 


13985 


A 


89 


931 

• 

* 


2 


VASPNALPE*LLPVSGHTHDLERVGVAR 
LGQPHHTELTACRARMESRDTCPGVTLH 
P*PPSFPSSSSPSPGGPRTRLTHQGAGL 
EGSQGPLQSQNPSAA/PLOACRGGWEWP 
QGPGSGS *GG*LMLRELRE/ WQEAAVQL 
PTPG*PS*ESGRPLPSSASGPTPPGSP/ 
SP*PGTQGLCGCHPSGLCATAAGPQDGS 
GPTPTPH\PVQPSCRDSGPGQRLSPTPS 
LTSWPNTRPSPPTGPGPGPGGRATW/PG 
S /PSPQRESPSPQLPCCTPGS FAGHPCP 
AP\AAPSSVACPLP/PDGFPRAPASSGI 
TTAPSPGPDP 


85 


13986 


A 


90 


66 


464 


IiWVYS S FPRLPLHSGAE S PEGD I L WPGL 
TLFLS P S PS PLS \CHSHLSPTCRRPRES 
PRFSLCWRPHPPSKPPPG*VKR0CCPSP 
DPVRLSVSENPSAGPR/VFLRPP 


86 


13987 


A 


91 


197 


2 


NAEI I PAPQKVGPFIKIWVIKFFFPFFF 
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SEQU) 1 
NO: of 
nucleotide 
sequence ' 

< 

■ 


SEQ n> 
NO: of 
peptide 
sequence 

« 


M 

etK 

od 

; 


SEQID 
NO: in 
USSN . 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
iog to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspoo 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine . 
OCysteine, D=Asparttc Acid, 
E>=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I»Isoleudoe, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Pro)ine, 
Q=GIutaraine, R=Arginioe, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unlcnown, *=Stop codon, /= possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














FFF* DRVLLCHPOWSAWQS *LTVASNF 
*F\K*SSCt*T' 


87 


13988 


A 


92 

■ 


317 


2 


SRTPDFR*STRLGLPKCWDYRCBPPCPA 
TSSF* *HLSLC+QGNL/WGPPDSIRMG 
AG\TRKTKCWEGCHPLPFI*RRSL/NSYA 
QAGVQWRDLCPLQSLTPRFKRLS 


88 


, 13989 


A 


93 


1163 


0 


FAFGFEM*YCS VAQAS \ VQWYDLTLIjQP 
PliPGVQSDSPASSLPNKLGLQGTSHHAW 
LNFL 




13QQO 
\jyy\i 

* 


A 




j 




VLWPCRPAGPS LGLNFPLYSWLRLQTFA 
AIRPGSTGRRLCLPNWVFTRNSLPFH+L 
VC/CSSRHNTYLQECTGHREPTYQLNIH 
DIKLLFLRPAMEQSFSADTGGGGRBSNI 
HLIPYIIHT 


on 




A 


yj 


17 




P E PR YVGF I SGGKVDI KKTWKTOGRLER 
WYYTGMYFINCH\VLKNTDSSWGTSP* 
* IQQHAGKRDNNYQLL\KLQG\QFSEAY 
TKCYSNPDSTHDVRKVYQDCPLLAPLND 
TS 


Q1 
y i 


13QQ9 


A 


y\j 


"S 

J 




ANYPRDLCKVTDEGGYTKOQ I FNADGTA 
LYWKKMP \SRTLME REEKS VSGVQAS KD 
RLTLLLGGS VAGDVK/ LKPML I YHSENP 
RIC* SYSLPVLCKWN/NKTWMTAHLFTT 
WFTECLNPI* 


92 


13993 


A 


97 


367 


2 


SLHRARRGKGVGVRMGERLPSFQSWRLR 
L/RRRKLCRRWGQGPRGN*Q*QKPDGGA 
RASQTDGGGERRRGVQTGSQAERDTSRE 
TYSPRGIQREIVRVSKGGKTGRQWQSQR 
QIQREKRVGR 


93 


13994 


A 


98 


115 


359 


LNFFFYLLNFMLLFHRYYALKVSYFKSS 
LDRKLLEIXWNKYWVNTLSSSSLLTVS/ 
DYTCKECI* ILSKLHSRLISETLFHRK 


94 


13995 


A 


99 


1 


490 


CVDPRVRTIFKKDKVGGLPLLNLCIYYK 
ATVT * TVWYWHKD / RTDQWDRI KSLE IN 
PCI YGQL I FFNYY F FS Q* CQDS SMEGNS 
LFNKMC/ CRPLTLYTRIGS KWI KYLM * E 
LKL YRLS LHDPEFGNGFLE I TS EVEZ TK 
EEIGKLD\IiAKLKTFC 


95 


13996 


A 


100 


195 


3 


SRCCKQRHLVQWNRTEKPE / YGRLI FNK 
GVKTN* WGKNNLFNKWYWDN * ISTCKRM 
NLVPYLSPYT 


96 


13997 


A 


101 


1 


371 


FSL I KI S MMLLMKME K*NLQFI WN/ HRR 
LQIAKARLNKKNKTEGITLPDFKIYYKA 
WWYWHKKRHIDHWNRLENSNINRHICS 
QLILTKVPGANTKDHPFNEWSWEN\VCT 
KMKPDP/YIiSSYTK 


97 


13998 


A 


102 


6 

* 


370 


KKGTI PNYF*RE* TDRSKPN*NYATKEN 
YI/PIS*NKKIL*KLANKIQQHIKKKPD 
NSLFYKIQFGSILETFITINQISKPKEK 
NHIIISTDAENA\FDKIQH 


98 


13999 


A 


103 


298 


24 


KCWTS I YLLLFFRDEVSLCCPGCAQTPR 
LK*SFCLSFPKCm>YRREPLVPGLDQ/L 
FLESNRS VSVLKKNVP* WFSNSSGS ECSG 
NHDAFCGIS 


99 


14000 




104 


2 


352 


GTIADPTQCCQAADKAACLLPKLDELP\ 
QNGRLRS AKHRLKCASLQKSG KRTFRAW 
T*AGLIHRFPIAEFAEVSNLPTELTTVH 
TECCHGDLLECADHRADLAKS ICENHDS 
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SEQID 
NO: of 
nucieouuc 

* 


SEQID 
NO: of 
pepuae 
sequence 

- 


M 
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r 
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Predicted 
beginning 
nucieouue 

correspond 
ing to first 

atninA qpiH 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucie- 

OlIUC 

location 
correspon 

Hint* 4t\ 
Ulug W 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 

C=Cysteine, D=Aspartic Acid, 

C/ \jiuuirii ii. AciUy jt^a uciJjimaiiiiiC) 

K-Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 

n=T?lntamin» R=Arpimn&. 5l=S^rinp 

•^Threonine, V»Valine, 
W«Tryptophan, Y«Tyrosine, 
X=Unknown t *=Stop codon,/=possible 
nucleotide deletion, ^possible 

nucicuiiuc inscruon 














ISSKL 


100 


14001 


A 


105 


139 


361 


KHTEAF*MPCAKYY\LFGGLNQICALPE 
DETYVPKELNAETFTFHAK 


101 


14002 


A 


106 


3 . 


350 

« 


ELEMIMLSEEVMLKAKIRQKLGLL/QPV 
S* WNAKEKFLKEIKSATPVNTQMI \KN 
SLIAD/MEVWIDDQTSHNVPLSQSLIQ 

RGWFTRFKER 


102 


14003 


A 


107 


225 


1 


QG I MMDTVEYVGKGS PVRCW WKCKLVAL 
L*KTTWKLIiRKLKI * \YDPAVPLLDIHP 
KB Jl Ko 1 XQKDS U JLoMlrXA 


103 


14004 

i 


A 


108 

■ 


390 

- 


2 


I^YSGVRAPLEEAVCRFSDLKIiRAGRTT 

RGGVYRGRQASLSCSGLHPVGASWPRCL 
PTQASAMAGAPAAASLPPCSLISDCCAS 
MIS * W> Vb Vo.roJs.trvj jLVaHM 


104 


14005 


A 


109 


304 


33 


kvwge kwywqkmtqi vqwdrtes pq id 
w*sltkeiqwrkdslfnkw*gnn#tapf 
ss/rslnlnkdltavtkikskwvtdlnv 
khktikll 


105 


14006 


A 


110 


135 


1 


KQAILWPGAVAHA/ WSCWPSTIiGGPGGQ 
IT*GQEFEKSLTNMVKP 


106 


14007 


A 


111 


140 


338 


IMSIILYYGSICYCYYWLID*LILRWTL 
ALVAQAGVQWRELGSLQPLP/PCLSLSS 


107 


14008 


A 


112 


26 


361 


RSFPWVFFFEKKSLLiGIPGGKKGPPLN* 
IiBPLAPGPKQFSOLNPLENGNWGPGPPC 
RGDFGFF*KKTPFPF\GPGGVKTPNPGE 
TF/EPKPPKGVGFPGETPGPGPGENLYQ 

Mir 
nK 


108 


14009 


A 


113 


11 


330 


AS I WLHI FLFLS YFIiE KG PR YVAQ AGLN 
LLGSTGPLSASRAAGTTGALHCTQL*TY 
LLLPKHGKAVFF\QETliLIiRSTHPFPLK 
HVYTPTYELLLVWDRDSLSRPR 


109 


14010 


A 


« * ^ 

114 


350 


2 

• 


GCC P CLIiFPGS PTV FGSJjYS I RJuVAFV R 
AVP PVWKAVPG * PPVKM * FFHLF I YLFL 
KQ/VSLWHPGWS *IAQSQVSRFFPRSLP 
SNWDYRGAPPRLTLFFFFFKQSQGIEAM 
VLAL 


no 


14011 


A 


115 


3 


341 


RPVYS FLLLSNVPVMDVPQDI YPFLVDG 
YIiSLPLVCCT*V*VLLCYPTWPPGLKRS 

G 


in 


14012 


A 


116 


1 


384 


PS YPGDLTEIMDEGG YPNQQI FS IDDTT 
FYWKKMPSRTFLTTEKS IS/GF* ASKNK 
li 1 c jj L>K AM VnuUr rJuJvfvr i. j JlAuEvnr'rC 
TLKKYATSTLPVLiCKWNNKA* VT/AHVF 
TTWFTEYFKLSVETCCSE 


112 


14013 1 


A 


117 


43 


347 


CAAGFGLL * TP PRT * TRKPRRNGTYKP I 
SLMNINAXILKK/ILANRIQQHIKTK\L 
LQSDQVGFI 


113 


14014 


A 


118 


351 


23 


FNK* CWHNW I FTCKGMNLEP YFTPNTKI 
rmiKAIQLLEENKEVNL^LGLGNGF/S 
RLPKAQMAKETXDKLDF I KLKALCS K\N 
TIKKVKKSEELEKIFANHIHGTSIQNI 


114 


14015 A 


119 


123 


340 


AADSSTHSIiTKPAYLEKQNFFFFQMEYC 
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OCysteine, D»Aspartic Add, 

E^GIutamic Acid. F^Phenvlfllnnine 

G=Glycine, H-Histidine, Msotaidne, 
K— Lvsine. L#= Leucine IVI= Methionine- . 

N~Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=$erine, ~ 
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W«Tryptopban, Y=Tyrosine» 

X=Un known *=Ston rnrion. ^xnosslhle 
nucleotide deletion, \=possibte 
nucleotide insertion 




• ■ 1 










. LIPLAEGPWPNIA*MQPPbWW&£CL 
S LP S TWD * R /HWS PRLAN 


115 


14016 


A 


120 

• 


1 


371 


PS YP * DLTE I MDEGGYPNQQI PS IHDTT 
FYWKKMPSrrPLTTkKSIS/GF*ASKNK 
LTFLL * ANVAGDPKLKPVPTYHALRNPR 
TLTKYATSTLPVLCKWNNKA*VT/AHVF 
TTWLTEYFKLSVET 


116 


14017 


A 


121 


82 


367 


YSKRSOTIVAGD/FTPLSALDSSSRHRI 

TF^S£ANGPFSKTDHLLGYKTSIiKTF*T 
KLK*HQ/CIFSDHN 


137 


14018 . 


A 


122 


27 


253 


MKTENILGENIGEVFGVGKDSLDMSPKA 
* TIKEKNDKLFF I KVKNFS YSKHTI *K1 


118 


14019 


A 


123 


3 


305 


GTRQGCPIiSPL* FNTVLEILVR* ISQNK 
*INKT/SLTADP/MVIiHIKNPKGSIK*V 
LELINEFSQVAGYKINM/QKTVAFLYTN 

M*T .Q Win? T lTtf T T OP T T & G TTDT 


119 


14020 

• * 


A 


124 


1 


327 


QVIFCLGLPKCWDYRHEP/RMPGLRGAD 
FFSLI * ILWEHMF * VMCILPHPCGDFWA 
MLNF* EREGMFF/ CLKRWKSHNVSQA/G 

DrYTDUPlTP PGPT .TT.T.QQMDHBHUO'DWr.a 
r\£L tr IVJxC OlvD 1 JJiJOdVVlJfVtUlnxrirflLin 

K 


120 


14021 


A 


125 


74 


334 


IFEFGVLKVITCLSVSSHEVGKIiCPFFF 
FFKNGV* FCCPGLQGYNGS * L\TLELLK 

SIH 


121 


14022 


A 


126 


381 


60 


TVSQVIiAHTCSPSYLKAKAGGSLEPGSC 

YPGCSEZ**SCHCTPAWIT\SQTPSL*KI 

H*KKI IVKRAIVECVCVYVYTYIWLLIL 
k"p<? n<; t p pan two pwe rpp y wwnf* 


122 


14023 


A 


127 


323 


2 


RGFLGLSDEVSGQQLFS ILS E I ENQFKN 
NLEGCGGSCL/ SIPKCWDYRCQPSRLAY 

A* LIFVFLVE \TKNTGFRHVGQAGLELL 
TQGnTtPX AT.A^OMAWTiTTJT^HTTARPV I 


123 


14024 


A 


128 


2 


354 


ENCQ INNLRFHLRKP KKEEYTKPTASRR 
K*IVNIRIERNGIEQK\TIEKINELKSW 
LFENINKIDSHSGKWMVGEE * CVFVCVC 
VREROSDRERETRLIKI *NERGDITADli 
TEIK*LLILQKLNNY 


124 


14025 


A 


129 


74 


327 


GELAMLPRLILNS* \PCNPPA\SASIiVT 
GTTDTPLYPAHYYY\ I I I I I I I IEMEFH 
S\VAQAAMQWCDLGSLEPIj\PPGFRRFS 
CLSL 


125 


14026 


"A " 


130 


83 


382 


YEFHASDGGSRLSSQHFGRPRWKDYLSL 
EVEGQPGQHSETPSLSLSLSF/CFIiEME 
SCSIAQAGWQWPYLGSIiQPLPPRFRRF 
* CLSLPKCWNYGHEPQL 






A 


131 


740 

• 


7 

j 


ROSLALSPRLECSSTISAHHNFCLOGSS 
DS PASAS RVAGITGVHCHAKLI FVFLVE 
TRF\THVGQ\AGLBLLTSSDLPALASQS 
SGITGMSHHVQARCVIF/CX5PTFKNQDM 
KY*NKKEKNQSTFLEHLQNQQENRHAAH 
TLRHHTAS/LKSSEFLFAIRTSFLIS*K 
RSGTTGMF * YN/WT IGNGVEDRFVLGPP 
FGLGVQWYHHSLLQPQSPGLKQSSXiFSL 
PSS*DYRHVPTPSFLKFLRRWGLAILLR 
L 
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Amino acid sequence (A^AIanine 
OCysteine, D=*Aspartic Acid, 
E=Glutamic Acid, F=Phenvlalanine, 
G=Glyc.ne, H=Histidine, I=Iso!eudne, 
K=Lysine, L^Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, $=5erine, 
T=Tbreonine. V=»Valine, 
W=Tryptophan, Y=Tyrosine, . 
X=Unknown, *=Stopcodon r /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


127. 


14028 


A 


132 


371 


1 


KKNLNLNLTL*VKINTNWIMDLTIKYKI 
IT * KKQEKVGNLEQGKEFSDLTPKV* S I 
KGNTDK* DV* KPFS L* KPM*GNYRQATD 
* KKMSAKHVSNKYP \LSRI YKFNDKAKN 
KPIRKWAKOMN 


128 . 


14029 


A 


133 


2 


318 


NHKIILKDAKKAFDKM*HPPTMKILNK* ■ 
GIEQYAT/DI IKAMCDRPTADI ILNGEK 
LKAFPLRSGRRPLL\FH* I W*VLARTVR 
PRKRNKGSQEWWLMALIPSTLGG 


129 


J4U3U 

» 




J 


i 




WQFTi T £2 <Z S DP PAT. AKO T AG T IDMTCHYAO 

PGTDF/STSVYFTIiPTQIRL*AI 1 KRKFL 
L/EFK*IFASI*QFAKCSCMSRNFLPAQ 
PYHHWHFPNEETEAQGNFSLLWAC/LRE 


130 


14031 


A 


135 


12 


398 


KCSTS IVIRE IQVKTRE I YHYTPTKMAK 
I KKPDNAKC/WSGCGATKTL IHCW *KN\ 
SLEIWLFLMKFNICLPCDPEIVPLDLYL 

P T7MTCT Y T HRNTYTKV^ T APIiFItV OPTC*; P 

STGESINCSIIIPRMECII 


131 


14032 


A 


136 


309 . 


1 


QSEAPSQKNTKKQKTKVGGLTS * FQTYH 
NATVIKTVWH/W/YIGIDIDQWNSIQSP 
EINFHIHGK/LI SNKAAKTNEWGKNSIiF 


132 


14033 


A 


137 


2 


251 


QCXSKAFRAASVIiRMHGRTHPEDKPYECK 
Q *GKAFRSASHL * MHGRTHNQEK/ P /HE 
CKE CGKP FRSAQNLRI Q * RTQAHIRMHS 
G 


1 j j 






1 1D 


1 


i 


T.RT.T.Tcf* QAPTiOTiPK'CWDYRRRPPHPAIC 
MIL*SKSS\LPFIiCSECCLNTIWArjLLI J I # 
CSLVPLLTLPLKEFYAAH*SPDSLKYPV 
TjOKTj55KI«EMT_.VTjHETVORKKOLiTYSLYK 
PIFGFLVGYSFPLFCSYKTS 


134 


14035 


A 


139 


3 


384 


LiDFI KMKNCCSSKELQGKYEKP*T/MEK 
MFAKHISNK*FIFRI*K*LSKLTKKK*A 
KDLNRYSLKEDIQMANKGM/KKCSI*IiV 
IREKOTKNTMRYY* NGLS *RS /GOTK* * 
QGWGGTGPLIHCWWEYPMI 


135 


14036 


A 


140 


63 


388 


FiViYLFIY*I^/CXTlAQTKVQWCDHSS 
LQPQTPGLKH\ P PAS ASGVATTT I MPG* 
FLFFF* KRGPCWVAHARTT/WLNLGSF 


J JO 




A 

** 


14.1 




*» 


GPGHFPALFFKGPGLGPWVPP I I PALLE 
G*GGGFPGP/RGSGPPGQPCFFLKFQNF 
PGFV\ GAP WPGFPGNLGRGTALNPEAE 
• GS INLKGPPGLQ PGGKTKLF FQKKKKKY 
OHFOKTGLI LGF 


137 


14038 


A 


142 


363 


2 


REKATEENFATS +DWLMMFKEGSHFHS I 
KVU3ETPSADVETVVSCPEDLAKIMDED 
GYTKQ I FNVGET \ PSRTL ITRKE KS VPG 
FKASKLQRIG*IiLTjGVNAAGDFKLQSVL 
MDYSENHAE 


138 


14039 


A 


143 


99 


375 


KNKNQN I S PPLVRYTLLSPLIjTjP FI INS 
FFFFFFSF/CFF*KKSRFEPKVAFQGON 
LS*LNPRPKGSKBFSCLSIiQSTWITGaP 
PPHWADFDF 


139 


14040 


A 


144 


39 

* ■ 

• 


399 

• 


LQIIYNYIWLYIWLYIFWIjVSKIYKELL 
Q INNKETNKNGQKT* IDVSSNNMYMNG* 
* LHEKIHDIVIR*MRIKTTVTYHVMSTR 
MVI I KETENKYW * GYGE \KIL I YCWWEN 
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K=Lysine, L=Leucine, M=Methfonine, 
N=Asparagine, P=Prollne, 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=»Tyrosine, 
X=Un known, *»Stop codon,/ 3 possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FEK/SVWPFL 


140 


14041 


A 


145 


358 


3 


REHTAEDKYEASRSWFPRFKEK\SPLRN . 
1 1 VQGEAATINGBVAVS Y/PEDLAKNID 
EGGYTQQIF\NADETF*YWKMLPRTFIA 
REKS \RLPSRDRLTLLLGANAAGDFKLK 
PMFIYHSDSC 


141 


14042 


A 


146 

* 


3 


376 


AVMGMYK T NWP VPTNTS I LQ P I D * G VI 
LTFRSY*LINT\FGNTIGAIGSDSSDGS 
GQSKLKTFWKGFTIPKS \IKNTCDLWBB 
VKISTLMGVWkNVIPALIDLFERLKTLL 
EEVTAYMVNIGRRL 


142 


14043 


A 


147 


374 


161 


EFHRVSQDGLDLLTS * STCLSLPKCWD Y 
SKWQRRI*TQ/DLCDSED*SSIKPIQLC 
QELMS *MAEMAHGQP 


143 


14044 


A 


148 


373 


2 


FFFFFFFCOLLLFPKQPRSHASFSLPSV 
VPFPEMPSSSLTLFYLFVYNLLR*YL/N 
SVTQAGVQWHDKGSLQPRP\SLLSSWDH 
WCTTPCPAN* IFFL* RVGLAMLPRLLLN 
SWAQAILPLWSPK 


144 


14045 


A 


149 


3 


282 


ILKRIIHPDQVFFIPAMQGWFNI*K*IN 
TV\NHIIRIKGK\NRMIISLDTVKAFGK 
I QQPFM I \ * KLGI KGNFLNL I KGTYENP 
QLAGHSGLCL 


145 


14046 


A 


150 


3 


360 


DI I * ES KFQGHHTTG VQKGLQ YG I ILFI 
I * EVFFFAGFFWAFYH3SLAPMP2LGGH 
\ YPPTG I FPLKRLEVP / LLNTSVLFASG 
VSVTSAHHSLIEGNRKQI ILALS ITITL 
GIYFTLLQI 


146 


14047 


A 


151 


349 

i 


3 


KKSSGSHIiETKSNRIjPGPSOGPWCKEIR 
KGSLAWQEKS WP FYKPFLLEVGGGRVIR 
GSRMTRHGHLIRSS2CTIjIKHHVPK*TVQ 
FNFCLF I FLEME SRS \VAQAGVQWCNIG 
SLQP 


147 


14048 


A 


152 


500 


0 


MPSRLBCSGA/ ISDAQCNLQNSGAPSEP 
SSALSNPCG * D YRDTLPIiPGYFIfKFFVK 
TRSRYVTPAGL*LIASS\IPPTSA/FSK 
HWDYRHE 


148 

► 


14049 


A 


153 

• 


373 


2 


QAGFTLLTS S DPPASAS \QSAG I TGMNH 
RTWPGNF* PQKS CDS FVTKLMCTCHKNH 
I YAQSLVT* LGHS YI FF* DSLQPS PPRF 
KQFSCHSLPSSWDHRHTPPCPARTQLHF 
GYEYFSRI1FLRRR 


149 


14050 


A 


154 


332 


3 


THKFQRLTSKSKKLKRKLIFHQGAQKSN 
S LNPD I KKNLKRS QRETME* EKI FTDHLS 
SRIYRELLKLKAK/INLIKMGKGYFYEG 
• DT*MADKHIKRCSISPIIREMQIKTTIR 


150 


14051 


A 


155 


337 


21 


LGLLPFFQRCPAKRGGI * RGSLAAVALL 
NCGELHPV*ASWWLCL\PVRGKLPTEAS 
VMGDAP P P TKLECPRS TSDCCDGS KNFE 
SVDLSLLGSVGVGSVDLDPVAP 


151 


14052 


A 


156 


21 


491 


HPGSRGCSKPRSGHCTPAWGTKVKTPAS 
T KTKQNKKKK* RTVCLT P WHTLL / RVWC 
DGGDYSSShL 


152 


14053 


A 


157 


3 


394 


REDLLS PG FQGCCEL* LCHC I PAWVT/ S 
ETLSQKKKK 


153 


14054 


A 


158 


36 


354 


lsaAftsqli*grlrqengvn\cseprlh 
hctpawate*dsvskkkkgkpfwipnpl 
sslspafi sgqgp weshlknpgralqgg 
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Amino acid sequence (A-Alanine 
C=Cysteine, D^Aspartic Acid, 
E^OIutamic Acid. F=Phenvlalflnine. 
G^GIvcine. H=Histidine. l=IsoIeucine. 
K=Lysine, L=Leuclne, M^MetbJooJne, 
N=Asparagine, P=Proline, 
Q=Glu famine, R=Arginine, S=$erine, 
T=Threonine, V-Valine. 
W°Tryptophan, Y«Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GFQPL I CPRPGRTHMSLKS PWRP 


154 


14055 


A 


159 


78 


340 


WFHKAQRGILLABLYPSOSAQHSIEPK 
TITLFFLGNKVLLCCPGWSAVA*S*STV 
ASN\FGPK*IiLCLSPPSSSDSRYARPHM 
ANPK 


155 


14056 


A 


160. 


2 


362 


HLS PLS IPQNRHCHHGPFSVSCWAHLPD 
GWAGQRGSSLP/M*GRPGRGAPSPPRR 
GGWPGGGLTPHLPSGRGGWPGRGTP \ PS 
Q*GRPGRGAPSPPGRGGWPGGGI*TPPPP 
SQTGRLAGRR 


1 JO 


14Uj7 


xi 


tat 


Z93 


2 


DUT.71 T Dnn'l'l'T m VC A T VMVITMDCDTV I/' If 

RQKSMPGFKSSKDRLTLFLGAK\AAANP 
KLKPMLID\HSKNTKALKNYAKSILPVL 


157 


14058 


A 


162 


2 


136 


LIVPILIAMAFLRLTERKILGYIQLRTG 
PNVWTPPTGATYRLLS/P*QP**LFTKE 

P\ lepvtctitlyitdptlalt ialvl * 

TPLP I PNP VGNIjISTLGLLFI LATCSLAVY 
QRLDAAYWCYVQTP I A 


158 


14059 


A 


163 


i 


464 


RQGI»/ DS VTQAGVQGRNLSS \ LQPL/ PP 
GLK*SSRFSVSNSWDYRCAPPHLANFFN 
LFFVQRG\ FTMLPKL* TRS \GPGD 




14UOU 




164 


i 


353 


ENPGTPTFGGAKGGGNPGIKILKTTGPK 
GGNPCFKKKPK* PHRGGRP / HGNPVYRG 
GPGKKNFW PPGGGGS KG* KS P I KPPPGG 


160 


14061 


A 


165 




389 


FFVCTFCRERSLLCCRSWSGTPGLK* PS 
CLSLPKFSDYRHKWPCSARNFIiPACLPA 
CLPPSLPSFFPPS/SPSSSPRQGVTVIQ 

LGPQ 


161 


14062 


A 


166 


330 


2 


WDYRCAPPHPVLYLKM/ STFNIFFL*RG 
VD4LSRLVSSDPPI SASQVARI IDVSHW 
AQL* RSV/ CYVFETGSGSLTQAGVQRYN 
HGS VQPQHSRLS *SSHPSLWrTYR YPPPC 




J4UCO 


A 


10/ 


z 




I WMFITALFLA\ES * CSSVGE* IGKTWY 

I PTMKYYS VMKRNAI SSHEKTWTRLKCI 
LPSER <3 RCDRIjOTV 


163 


14064 


A 


168 


486 

• 

■ 


2 


LIFGKIYKINNSSKTDNKRGKTQIOILG 
MidGDIMTY\QQTSKEYYAHEFNNLEEM 
DQLFKKHKRP/RIHQYEKHHLTGPMTIN 
E FEF INVKTPKNOS PGWDDPTGICFYOMP 
EE FLS ENKTGEL I L * S * YYS LTNS FYEA 
SITY* PKPDRO * KKOSOGP I S FM 


164 


14065 


A 


169 


200 


3 


GRVDLPPTQESRPPGPPPSDPGVQAPRP 
S IVRNCGILTRGSPGPD\PSPLSS *AQL 
CGSPPPPSFS 


165 


14066 


A 


170 


2 


327 


PGG1GCGELR *HHCTPTCATERDS V/ WN 
KI IHYI IMLASPNELILPLLNICLEIL* 
AGPLTAICISMFMATL\ FTIAQMGKQPK 
CPSTNEWIRKMWYIHIMEYYLHFKMK 


166 


14067 


A 


171 


2 


343 


PGWSQTPDFR*STCLGLPKCWDYRR/AA 
TVPGLFLFLTALC 


167 


14068 


A 


172 


367 


3 


FIRDVQFISALRYLLTPERMA/MIKKSK 
NNRYRCGCGK/RG/ TLLHCW* KGKLVQP 
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Amino acid sequence (A=Alanine j 
C=Cyste'we, D=Aspartic Acid, 
E* = Glutamic Acid. F^Phenvlalanine. 
G=»Glycine, H^Histidine, Hsoleucine, 
K-Lysine, L=Leucine, M-Metbionine, 
N=As da rapine. Ps=Pro fine. 
Q=Glutamine, R=Arginine,~S=Serine, 
^Threonine, V=Valine, 
W^Trvntoohan. Y ts ^Tvrosine. 
X=Un known, *=£>top codon, /^possible 
nucleotide deletion, ^possible . 
nucleotide insertion 














L**SVWRPLKELPLNSVIPLLGIYPK/D 
KEVIYEKDSCILVFIAVOPAIA/RNIHP 
TYCPINE*IKKMWYIY 


168 


14069 


A 


173 


102 


345 


YQLQN I PRWVYSKLQNLS FAFVLL* RDR 
VFLCHIiGWSSLL/QLKRSFHLYLASSWD 
YRQVTPRPAN* ILFRD/RGLALLPR 


169 

< 


14070 


A 


174 




4/11 


IPFLSQLKVQVHHRRYFHLFPEI»SNLPL 
LS KNVS * TY* KS FLS / WPSLR VL PCFVL 
FCFVTGFHSVQARVQWONHGSLQPQPPS 

J»U\ 1 n 1 r~l Irif'^ 1 


170 


14071 

* 


A 


175 


2 


388 


SDLQLRAGRTTALFKAVRQGHLTLQRFL 
LS F \ V * LCPAPRGGAYRGRQAS LSCGGL 
HPGRAYRLSCLPKQAWAMVGAPTPASLS 
SCSLISDCCASNHRDSVGVGPFEPCAGC 


171 


14072 


A 


176 


334 


1 


KLLELIN* FNKVAVYKINTQNQLHCYTL 
IRTTQKGSLKTILLTIAPKRIKYLGI /Q 
/ LTKEVKDLHSES YTLLKE I K* D INK* K 


172 


14073 


A 


177 


339 


2 


KNSPYNFPKGKNPGNL*SLKSRIF/CWG 
STFAHIGEIjFFFKMGKKFPAI / RPFFFF 
EDRVLLCRPGCSGWRSWLIAAST/CLG 
WDYRCTPPCLANFL *RQGLTMLPRLLSN 


173 


14074 


A 


178 


3 


303 


DLRRAACL I LPKC WD YRHVPPRPADNS G 
FLHLI 1 1 *VCLPLLCHAQLVFIFFIETG 
FCMVAWAGLKFLGSSSPLTLA/FSKCWD 


174 


14075 


A 


179 


342 


1 


INRLNIFTMAIFSTIiIYRFNTI PIKIPA 
GFVEMDKLILKLVWKFKAHGIGKTTLKK 
NRVPNS KAYYNATVS KT\ YWNEDRNINQ 

R 


175 


14076 


A 


180 


2 


323 


STS YPI TGSHAFL * PQNWDAETNS *HI 
NNVNLRLKI IKLLBENTBKNCHDXjGLAT 
DYY /SVTPKA*ATTTJCIDKIiEI»T?fTKWP 

CTSKDIT* KVKRQLIGENSCKSFM 


176 

* 


14077 


A 


181 


326 


3 


RKRKKKRREREPKKDEBRKRL YGKDI I K 
KRRKDTM/DWRKIIiQKGVRQRMCI*KI * 
RGVTQELKANP I RKGGNNLNKVHQ RI * M 
ANKHMKRCPKS*VIREI* IPTIMRYH 


177 


14078 


A 


182 


5 


326 


TKTIEKAAN* IRRSRRKKIKFKPEIKNR 
KTTTKINES KSCFYKXTNKTDKLLVKLI 
RKKKTKKI ITKDBKNHTI \*DCTEVKGM 
TECYEQVYANKFDNS \EKMDNFLER 


I/O 




A 

* * 






37ft 


TVLCS F T VWQIjDFFO* THTT V * VFPYV 
WNCVHNFLFRFFSAYI * YLLI FVY* S * I 
MHLCLICY* LLL/CSCI FFWIFYMIiYFS 
FWL/CFLYFYFYILCPFI FF/CI FICLL 
FFVYLFLLFI FFF I FLF 


179 


14080 


A 


184 


365 


3 


MQLKCIMKS IS VQAGVQWHNFGSMQGPP 
PW\LHHFPASASLVTGTTWARHQGRVIF 
/ IFFFFLMRRSL/DS VDQAGAQWHDPGS 
LQPLLPGFK*FA/CPCWDDRRAPQCPAN 
FVFLA\RHGFTHiAR [ 


loO 


14081 




185 


51 


293 


P YVNQEAS VLTNS FYQAS / I * KKSNYRP 
ISLMNIDAKILNKTQQIKLNKVKDSRQT 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

t 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 
beginning 
nucleotide 

mm mm w ■ 

location 
correspond 
ing to first 
amino acid 
residue of 
. peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon " 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E c= Glutamic Acid. F^Phenvtalanine. 
Gr^Glvcine* H = Histidine. I=Iso leucine. 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
- Q=Gtutamine> R=Arginine, S^Serioe, 
T=Threonine, V»Valine. 

m> mi mm m w ■ • • mm mr m ▼ « VI 

W«Tryptophan, Y=»Tyrosine, 
X=Un known, *=*Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














GEDQI PKYCTVSDGRMMRbY^pL]P66 


181 


14082 


A 


186 


326 


3 


TKICQWILNLGKKREFGFKKKKKIKCPN 
KRHRLANGIKK*DPTTGCLQDICFRSKD 
AORLOXKGWKXI FOKTGI * R /GVAVLTS 
DNIBFKIKTVTRDRVANY I IKGSIHQ 


182 


14083 

• 


A 


187 


17 


330 


KDNEM* IRRTMRCYYTSFTMAKINLScST" 




NIKC*OG*GTTGT /L/liYC?*WKHKTVOP 
FSKTV*QFLMRSNKYLRYNPAILLLGIY 
SN* VKI / C*KT YMRMFIVFLITKDWKQP 
NYSS 


i QJ 




A 

** 


loo 




3 J f 


KKKKGGNLGARG / VFKKRGRPQKEKPGK 
KGEKGGP *GAGI * GGGGNGVKXK 


184 


14085 * 


A 


189 


333 


124 


RLRQENRLNP /GSRG CRE PRS RHCTPGW , 
ATE * DS VS KTNKKMYKS 1 1 ARDCHABNS 


185 


14086 


A 


190 


1 


327 


KSWLIGDNIPSSQSLIQKKGVNPSFKSM 
KADRGKEAAEEKSEASRSWFMRKKERSH 

LHN/ I K* VQGKAAS YPEDLAS I IDEGGY 
Turn* TirKn?nv /iumirTfM*i?i3fii?T.'Poi?irT 


186 


14087 


A 


191 


53 


400 


VNILGPFHLFFTFVQNALSPNIYMTHML 
TSLQPLHMSSLQTGFPKPSYLKFQPSPR 
ovr vvjr / irurtyK f oinAovnvxlnv j. 
RMND X SHMX IS KDT * KAFDKIQHIGSLF 
PALSL 


187 


14088 


A 


192 


65 


321 


RARTE IYLIHTLLGVYAGETKTGPYTDC 
M*M PTTAPTT'LVKTW T * PJT«3T»<3 TOFTR T M 

KL/WILICTYMEYYSAIKSVLTHLTTWV 
NPH 


188 


14089 


A 


193 


127 


241 


I IKNDSRRICNIiTEKLPZiRKINS/ WPGA 
VAHACNPSTLGG +GGESRITYHESRNKG 


189 


14090 


A 


194 


1 


337 


DLPASASQSAG I IG*ES IRN+NKLVKKT 
LKCSQKDPKEDLNK/WKDIVCSWU3RLS 
LTKVS ILPKLX YKFAKIPTKLYYD*KKN 


190 


14091 


A 


195 


2 


333 

* 


NFNSLFFFVEXEKLILKFtl^CK^VARR 
ILQRKNKVGRLTLPNFNTDYKATVIKTA 
WH*HKDTHMDORNRTESTKINTYKGAKT 
I*WRKNSLFNK*CWDK/WITTCKRKKVD 
P 


191 


14092 


A 

n 








PHPORIiWOPSPPS X PRTFS P ^ <5 T POTO 

SWFNIQKSINVFYHIVILNTQHRSSSSS 
SSSSSSSSSSY*I^IKF/LNKLGIEGNF 
LI LIKAI HGKPTVNH 


192 


14093 


A 


197 


2 


199 


LALSPKLECTGAITAYCSLELLGSNDPP 
TlASKVF/GITGMSH*AQP<?VVFLIiIiYY 
FIFFFSSSSVL , | 


193 


14094 


A 


198 


226 


2 

mm 


KKFFFPNPNIiKNFPLGRVFF/ CS PG*KF 
FAPLGFFLVS FLRQKVPHIjGS PGKTFFF 
FFFFGDRVWTCHPCWSAVA 


194 


14095 


A 


199 


338 


93 


PQHNGSCL* SQLFERLRAEDPLNLRVQG 
CNEL*LYLCTPSWVNPVSEKKS IQEKNA 
NIiDFREY/RDKRRENWTCV*NS I ERSSE 


195 


14096 


A 


200 


1 


361 


PRSHLSQLRVRLSQ/IKKNQKTIGKGFR . 
AIG/MGKKF *GKTLKVPFTRGKVNQWKF 
LKLKS FLPRKKKTE I TL* TLGKHFRKWE 
KFFATYPS TRE * ISKXCRDLNHFTGKKI 
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NO: of 
nucleotide 
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SEQ ID 
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peptide 
sequence 
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M 

eth 

od 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

tag to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E^Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Htstidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagfne, P=Proline, 
Q=Glutamine, R~Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y^Tyrosine, . 
X«Un known, *=Stop codon, /^possible 
nucleotide deletion, \=posslb!e 
nucleotide insertion 














IIPIFQRAKGI* 


196 


14097 


A 


201 


2 


358 


ENKTTMRYH* IP IRMAEN* VLAVGTL/L 
QPWWEYKRV* PLWKTNSLAVTLLNINLP 
GTPT ILLVGI YPRERKTYVYTVACTQMF 
LTVLFTVAPNWKQSQGPSTGE* KNK* WH 
IHIMGHYS 


197 


14098 


A 


202 


110 


357 


IWVFIWCHFLLPEKTSPFPGPPQCliDIS 
* KTLFVHS VLFFFKDRVLLCHPGWS AVA 
Q* *FTAASD/ FQESSCFSLPGSWDYKRM 


198 


14099 


A 


203 


9 


374 
j ft 


IIiKLHAFFMAETPGACKTPCDIiROVFI F 
IFIYLF/MRLSLTLVAQAGVQWRNLGSI* 
QPPPPGFKRF*GRCLFYPKDGLVCPGIiA 
GS*TGQIGKGIRTLSPDLCLSDGFCGSK 
PKS /ASASQGHVG 


100 

177 




A 

Mm 


904. 






W\PLAI I * FISPLAETNRTPFDHTEGBS 
ELVSGFNIEYAAGPFAILFIAE YTNI I I 
INTLTTTI FLGTTYDALSHEL YTTYFVT 
KTIPLTSPALMNS 












t 1 


QHPT.flT.PVPUnVPP /HQTVPfiTA QT.MT.T. 

O/UUJVJiJ Jr Iv^rVJJ X IvXv/ J-iO X V CuU y^i iitiiiii 

LKS YVS KMKKCB * LVKPLKAKL * DS CKE 
L*VGNNLIMPSTYD(i'ENDRVDSSLTWIiC 
VT.T. t r,TJJ,T,Tir.T.T,FWRQSFALVARAGMRW 
RDLSSL* PLPPGFROFSCI 


201 


14102 


A 


206 


2 


377 


FRAVLQGRWSLQKFLLPFVQLCPAPKGG 
VYM*RQRS/SLSCGGLRPVRASWX 1 LCLP 
TQASAMVNAPPPARLLPPRSISDCYTSS 
EQGSVSMGPAEPGVGYDLLVCCLLRPLE 
KHSIWVRVSCFSRY 


202 


14103 


A 


207 


327 


1 


ILTGNFKQIRMLIYHSVNPRALKNYAKS 
TLPVNYKWNNKA* MTAHLFIAWPTGCFK 
PTVEP YCS KKKK \ I PFKVLLLIDNAPGH 
PRALMKMPEEIS IVFMPANPASIL 


203 


14104 


A 


208 


310 


2 


FCWWAPGMGFPLGE WGLILGPGS PFFLK 
EGFPGVGQPGPPKRWGLKG*PPGAHWKQ 
F/lFFFADRVSLCHPCWSAVAOSOLTAN 
PASQVQAIIiLPQPPDYRIR 


204 


14105 

■ 


A 


209 


374 

■ 


3 


GVRDP /LEE WCPF*DLKLVARRTYSLF 
FKAVRQGHLS LQRFLLPFVWL CP APRGG 
VCRGTQAS L S CSGLHP VRASWPLCLPTQ 
ASAMVGAPSPASLPPCSSISDCCASNER 
GSVGMGHS ETDVGY 


205 


14106 


A 


210 


3 


196 


LENLKF/IiDKFLETYNLTRLNQEETEIL 
NRPMTSSKI *SVIKNLARAMAHAYNPST 
LGGKGRRIA 


206 


14107 


A 


211 


34 


339 


I IKSKRINYMSCELYPKINIDNFFFFET 
VFRSVI \ KLKCYGAFMVHCTLNLPGS TN 
LSTSASQVSGAINRGYPGQF/ IGIiEK* F 
LVKTGFCHVA*AGLEPLGSG 


207 


14108 


A 


212 


200 


312 


HYGQFEVLFCSLF*MESHSVT/QAG/VQ 
WP/SLALQTPPPGFKQFSCFCLVSSWDY 


208 


14109 


A 


213 


300 


124 


NHI FFYFQIHRVCVCVCVCVCVCIiCVCV 
YRYIYCVSPW/RN* * SL* KECLSLVLLT 
PTC 


209 


14110 


A 


214 


1 


357 


QVDHTSDRKANLNMF* KI KVIQTWFSDY 
NGVKSENNRRKTCKLTNIQKLNST/L* I 
SNETNEKPQE IRKYFEVNINENTTYQDL 
WDE VS XVAQAHNPS TLEAEAGRLLE PRS 
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M 
eth 

w 
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26 


Predicted 
beginning 
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sequence 


Predict- 
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peptide 

sequence 


Amino acid sequence (A-Alanine 
0=Cysteine, B=Aspartic Acid, 
E=Gliitflmic Acid. F^Phenvlalanine 
G^GWcine. H=Htstidine. I— Isoleucine. 
K=Lysine, L= Leucine, M^Metnlonine, 
N=Asparagine, P=Proline, 
Q^GIutaroine, R==Arginine, S=Serine, 
^Threonine, V«Valine, 
W*»Trvo to o h a n . Y*=Tvros ine* 

FT M. B J W • V \f mm mm mm m M m\ j m VJ 1 ■ 1 m^Jx 

X<=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=posslble 
n ucteotide insertion 














LRSAWTTE 


210 


1/1111 

141 11 






iOU 


i 
i 


RALMEM* KEMNVFMPAKTTS IML PL I \ S 
SFNSYYLRTRIHRTFWVAQSWLQCPTT 
AISDIQIGPVAMGLMiIPSAFVIiCWLSI 


211 


, 14112 


A 


216 


3 


347 


SSRSRADGLFFYQCK*QTLYW*HLWYKT 
QKEKKKTQMKEKKQRINKAR\GDKLPEC 
BAVCOKPKNPANPVQRILVOHLDAKGSF 
PWQAKMGSHHNLTTGATLINEQWLLTTG 


212 


14113 . 

i 


A 

* * 


217 


75 


347 


PPIF KRTARGKVT PGWEYPAKG / CGGTF 
LGNRGLPPNPGPSGPPPKKQVWEGALCA 
PG*KVCTLKF*FI\SLAIPGNGSKNFFT 
LFEGPSPIL 


213 


14114 


TV 

A 


218 


3 


397 


uuV> * ovi Ivrtlnuvi X WrlKPLon, w xC w f JjrlrJv 
KKKKKKKKKPCKKKKKKKKKGGPPK3CKPR 

I KKDGREKHTRGELWKKTPI W/ EGEK I G 
EKPPKKI*DHEGKKKVLRGKG 


214 


14115 

> 


A 


219 


3 


401 


DS WATLHGNIMK*SAAVI»*ALGLV\ FGF 
TVSGPTGIVLSKS *LDIEIiHDTYYVVAH 
fill vijgiuAvr/ixiVTVjijxn ^rjurovji J. 
LDQTYAKIHFTI I FIGVNLTFFPQHFLD 
LSGMPRRYSDYPDAYTT*NIL 


215 


14116 


A 


220 


162 


3 


KPAQRNL* SNPEKEDINILKRNQS/WPG 

TX72V PUP Q TT/SfifXTtf OT TR rr\r * pnp 
X VnnnLlvroiiAJwyiilvUXXnuUV fiu« 


216 


14117 


A 


221 


374 


2 


WCRDRERERERATERERERERERQTDRQ 
TDRRREEA/ EG WAAWAIN* GRAPGTSLE 
AALECS CPRRP PQPAPPEQGP FPRTTAR 
GQPRPPKLLQPEAPSQTRPHGYPWPLRV 
t ,pn Qfjp EVP ppr 

XJlr^OVJcCa v Arms 


217 


14118 


A 


222 


3 


264 


DHMRPKVRGCSEL* SCHCTPAWAT\SES 
LSQKKKKKKKKRKKIGBKI*GAFFKVAP 
PFFFFKKKMNLKS P VGIAGVAKNTRWI L 
KHRG 


218 


lilt |Q 

14119 


a 

n 


us 


2. 


a in 


JlXOf XXiaia^XX Xlx^JXrvUXX't A UiJri C iAJXJ 

HALFVALTTTLTNTN I * HAEETQT * * T I 
LPAI ILVL I RLPSIiR\ ILYI SDEVNDPS 
ITIESMGHQWY*TYEY\TDYSGLILNCY 
I L P PL FLE PGDLRLLDGDDO VAL ? I 


219 


14120 


A 


224 


399 


3 


GVGKPGFPWAKKWALPPYSKGYP K* VQN 
LILKVl^I^nKKOTG^SLEATOi^raLP 
PKARAPKLKPNW/DSPKRKNPGAFKKKK 
KRQPTBWEKIFANHASDKRLTSRIYKEL 
00*KSKOPIFLIP I YHLNHFKG 


220 


14121 


A 


225 


360 


71 


NRTTWVPFKTPPLIiYHINTKKK*HTGNK 
KHKTKRAO^KKTKNAKK/EKEKNPHPQ 
RPQPO?KK/EEQKPKRGKPCKRGKKKKPFK 
E YS YL * K\ Y I HTNLQ 


221 


14122 

* 




226 


3 


386 


PSTHVS INLAMGIPL*AnAVI IGPHSKI 
KDALAHFLPQGTPTPLI PI LAI I ETISL 
LIQPIALSVRLTANITAGHLLMHLIGST 
TLTVSTINLSSTLIIFTILILLTIL* IA 
VALIQAY/VCPLLVGAY 


222 


14123 


A 


227 


3 


372 


YSLDSPSLTRFFTFHFMIiPF I SAAIAAL 
HLLFLHETGSNNPLGITSHSDKSTFHPY 
YTIRDALGLLLFLLSLMTLTLFSPDLliG 
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M 
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Ou 
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09/515,1 
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beginning 

nucteocioe 

location 

correspond 

ing to first 

amino acid 
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peptide 

sequence 
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Predict- 
ed end 
nucle- 
otide 
iocs lion 
correspon 
ding to 
last amino 
acio 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
n — vviuiHTDic aciu, x Denyiaianme, 
G=Glycine, H=Histidine, I-IsoJeudne, 
iv— i^ysine, L^jueucine, itj— ivjemionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
vv— x rypcopnan, x— i yrosine, . 
X=Un known, *=S>top codon, /"possible 
nucleotide deletion, Impossible 
nucleotide insertion 














UVUN t i\jaKTiftF V IsSJtrKeril \ JS>k f h J w i J? L» 

FA/YTILDPPYKL 


223 


14124 


A 


228 


2 

• 


263 


PRVRSRWEDCLRPGA*DQ/ LCAP*RDCF 














KKKKKKKKKKKXKKKKKKKGGPFKKKPR 
GGQNLKGGKKKKFFPKRGE 


224 


14125 


A 


229 


314 


1 


LIKTIFLYQVSLCCPG*SQTPGLKQSSC 
LSLPKCWDHRHKP PCPAQS FYFTFTERA 
LRVI Y* FI *RQS /LRSVAQAGVRWCGPG 
SLHPPPPRFKQLSCLSLLSSW 


225 


14126 


A 


230 

i 


86 


405 


VSYMS IQDHBNGHVSGYFKLVGQE 
GIVGG S TFES SNSHS YI YFFS FYI 
EMGLSVTQAGVQWHKHGLLQPQPI 
*SSHLSFPRSWDHRPVSPWPA 


?CLRV 
»FIFF 
»G\LT 


226 


14127 


A 


231 


386 


98 


FFPQRR*FPFGNFPPFFPPPKIiG\PSQK 
SPIGDFPPPPFFNPGPGPPFFFPPPFGK 
GFSFPPPL*FGPPQGFF*RAPPFFFFFF 
FFFFFFFFIRLLV 


227 


14128 

i 


A 


232 


2 


362 


TYVLFVI PYVETDLVQ * AG * S YS MDYSN 
iiTRLQTLNIjNVP FN I AALATLRLLYLHE 
TGSNNP IRITCHSDEVTFHPYYTIKDAL 
G * L\ LFIiLSLMTIiTLFS PDLLRDPDNYT 
LANPLNTPG 


228 


14129 


A 


233 


5 


365 


KNVTPPVNSSQHDLTDIYRNLYLMATEY 
TFFSN/AHGTYTKIDHNLSHKRNLNKF* 
RLKF * RLKLNS KGVL WPKWNQTTNQ * QK 
DNRKLSKHLKTKNTLLNNP* I KEEVS WE 
ILKMYLREG 


229 


14130 


A 


234 


3 


367 


NRIVPNHDIPYFSLCVCVIiFFFDRVS\T 
LSI** GWS AGAGSLLTAAPTS CVQAILCL 

SIjLS twnykrnplrpayl/s yfkssks f 

TL/LPKGIL 


230 

* 


14131 


A 


235 


107 


2 


QEAYE+DKERSSRIYMELK/SIQPKPPN 
NRVKKWTKEINR/HFSKEDTQTDKRHMK 
KC*SI,ICISLVISD*QRNAN*TPNEIRL 

rp 


231 


14132 


A 


236 


349 


0 


NNL * I FL I FLFTS VFLSL I S YLF YYI F I 
RSLP I OGLPTYFFS FS YI FLKYLMTLAi 
YYSF* \FFSI*IQVILEALIFI 


232 


14133 


A 


237 


3 


317 


TRMHISDKKTYNTNIKRIFRIHLM 
QLKIGKRYEQTLYKSIFEWTINTY 
QLNS KRKPTNTVI KK* SKDLYKHF 
IQITTRY\liKKC*ASLIIKEM 


RQTT 
KEVK 
SEED 


233 


14134 


A 


238 


346 


1 


AAAQMPAYQELVEEAIAYGRKLGGSQED 
QTMNANDQRFVMSAADILS * IPDWSTE 
VHARLSFDKYAMSARA\RRLIEI,SKEYG 
LSKGRI VI \KLSSTWEGLQAAKELDEQR 
GIGC 




141 j D 


a 








IiLPPLPSLIjLPPSPSSGLPPSPSS/SP 
PSP/S/PPPSLLHPPPPSLLLPHLPSLL 
DPPPTSHLLPSIP*LDIPNAPT\KSSNP 
3 CPCRPRS TEYFK 


235 


14136 


A 


240 


98 


2 


RPRRPRLVLNS* \LKLSAHISLPKCWDY 
RREPP 


236 


14137 


A 


241 


342 


1 


"DTASLLQPMDQGAISTFNVYYL/RNIFH 
EA1AAIDS DS PDGTGHSKLKTF * KGFT I 
LDAIKNICDSW/EEGVKISTLTGVWKKL 
ILTLMDDFEGFKTIVGGVTADWETARE 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

fTs/^liiiamSf'* A IT=PK Anvlaf an (no 

kj — vjiuiaraic aciQ) r- rneojiaianine, 
fi=filvrinp HsHictiriine Tslentonrin* 

K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glataminef RpArginine, S™Serine, 
^Threonine, V»Valine, 

X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 

nucicoiiQC insertion 










• 




LEL 


237 


14138 


A 


242 


3 


339 


RKDSRKRERKNKEKRKQRKEQRKKPIRM 

TVHF S S KTMVAK / ARRQ * NP VFRMLBE * 

KN\FPPRILYSEIIFFGSESQIKKFSGD 

GNPKGSFPELFFLRIPMEVFNLKEAYLR 
vc 


238 


, 14139 


A 


243 


216 


3 


NSTGTQTQSQKPMLSLVKKLISFIYL*D 
GVLLFAQDGVKWRDIjGSIiQPPPP/GPTL 


239 


14140 

1 

1 


A 


244 


408 


2 


VLTS * SNIAST \ W YGL YQ/ TQLR/KILA 
YSSI/THIG/WNNPVLQYNHKITILNLT 
I YI ILTLNA/FLVLNFNSS IGTILLSRT 
*NBIT * LPSLI PSTLLSLGGLPPLTGFL 
PKWAIIEEFTKXNSIiIIPTIIATITLIjN 
LY 


240 


14141 


A 


245 


3 


243 


I IMIiKAGQMTVFPNEDVWMAYKHTDRYS 
TSLVMNEM\Q I KTTMGYHYAP I TVAKLL 
N/SNTRYWP*RG/CSRVIHCWRKCR 


241 


I 14142 


A 


246 


2 


337 


FLIMDLQKYINPKIKEPQGETNHKTVSF 
MNINAKI LNKI LVI QI * * FPKKNNVS KF 
G/ FI P * /SQGCFN INQCYSLDERRKYM I 
ISKNIEKAFNKIQHSSFMIKTKNRKKLP 
EL 


242 


14143 


A 


247 


347 


1 


HYTP I RMTKI QNTDNI KCW *GCGNP /GT 
LLHCW*ECKMVQAL/WL*TVWQLFTKLN 
LLLPYNPITFFGIYPNEGRTHVHTKTCT 
LXr I XAAXjFI lAKTQKQ/ PRCSSVDEW I 
NKLWYIQ 


243 


14144 


A 


248 


319 


2 


KKI ILFI ICTERVFLCCPGWSQTPRLKQ 
• SSSLGLPKC*DYRRESS\PGLNVIL*CL 

/DSVTRAGVQWYNLSSLQLPPPGFK 


244 


14145 


A 


249 


329 


3 


KFPF FHTG EAKNYAV F V INKRI KKKQK I 

MMQ/SL*KTAW*FLKK*TMBLPYDPVIL 
LLRIYSKELKAGTRTDVCTSMCSFLFL 




1/11 /!£ 

14140 


n 




ilo 


3 


AIT I KLL\ KNNRVS LHDLR IGHGLLDMT 
PRAQAAKXKIKR* IDEFIETKNFCASKD 
NIKRV/NRQHKE*DTMFANHIS 


246 


14147 


A 


251 


152 


364 


QLTLSNHINNQI KYK* TRHS \NHHHHHH 
GMDITGPKNLRTI PL 


247 


14148 

• 


A 


252 


230 


1 


PCWICEFIVSSNFCWFVLETGSCYVAHA 
WD*RH/VPIANF*TLPRGGVL 


248 


14149 


A 


253 


3 


345 


DVGLAGLELLTSGDPPALASQIAGIIGV 
SHHSWPLLFFFFFFF/ CFFBGGFFFFLP 
V* SOGGDLGSREPWPYGFKGISRLRFPN 
BGKIGPQPLAPDMFCFFDKTWLSTWPG 
WFH 


249 


14150 


A 


254 


194 


1 


GRVDTKWANTHERFSKCS/TTVD*WIFK 
SWYLCTMEYYSTINKKEILSFVTTWMHL 
EDIMLSEVS 


250 


14151 


A 


255 


327 


3 


VKTAB F VNKWQKNS TKLWNS QAQ I DKKiT" 
IVNQINDLRQTEIWMGDRIMNLESRIQM 
QCDWNTSDFCVTPQ\YNETEH*WKKVKR 
HLEGREENLTL* IVKLKEQDFEASQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 

♦ 


SEQ ID 
NO: of 
oeotide 
sequence 


M 
eth 

fw* 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 

location 
correspond 
ing to first 
amino acid—* 
residue of 
peptide 
sequence 

i 


Predict- 
ed end 

n tides* * 

mm Uvlv 

otide 
location 
correspon 
- din? to 

^4SHE^ %AF 

last amino 
acid 

residue of 

npntiHp 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=»Aspartic Acid, 

Kj=filiitnmir Arid. F*— Phpnvlalanlnp 

G=Glycine, H=Histidine, I=Isoleucine, 
K= Lysine, L^ Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 

OssGlu tannine. R= = Artnnine R=£prinA 

T^Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=l/nknown, *=Stop codon, /=possib!e 

niirlpotirfo rfolptiAn \zsnnceiKlo 

IIUUCUUUC UCICllvll, V^OUsSIvIC . 

nucleotide insertion 


251 


14152 


A 


256 


314 




KNF PSLIN FLRO S FAPNGOGGVORVDIjG 

SPQPLDSRVSDRIRLCPKTQKKKRFSP* . 

QTILKLSKV/E/MTERVLKTAREKHLVT 

YK/GKPIRLTMDFSAETLQARRE 


757 

J. J A 


J*rl Jj 


" a 
*» 


7-S7 






XvrTrYWrT \I3uDuuo ¥ L/UUUu V C C C C V.I? v f 

FFFCFYVFFFYLFFFCFFFFIFFYFLFF 
LFLF I FFFFFFVFFI F FFFFDCFFFFFF 
FFFFFFFFFFFFFFFFVFF 


253 


14154 . 


A 


258 


52 


186 


CFG KLGWDDCLRSGVQGGSET * ACHCT P 


254 


14155 

* 


A 


259 

! 


1 


269 


LKRS S H LSL PRS WNYRCMP P CPANF * TF 
FVETGSHHIAQAHLKL\PPASA/FPKCW 
DYRHE S LCP ALDS P FTN I N S ATI I T I LQ 

\aJ\,v»rf fll L. 


255 


14156 


A 


260 


387 


3 


PLLQVAKINPKRILDIiNVKPKTIKCLQE 
NTGENCWDFGSGKDFLDMTPKMQSTK*Q 

DIVKRIKIQSHKLEKRFVNHIPDKRLIS 
RIYQELFRTQHEKPHTIK 


256 


14157 


A 


261 


62 


72 


LRI I KFS EE S MLKAEMG* KLGLFHQAVN 

irr , McjT. , r r irKn7\ArKrawTrDT?T wnAVG^fiyw 
J\l»lXDij l v JNJct V ViMArJxKr JjisJSJnivoA 1 lr V it 

THMIRJC\DMEKVLB\)WIDQISHNISLSQ 

SLIQRKVLILFSSMKPARGEEAV* *KNY 


257 


14158 


A 


262 


2 


193 


GGRGCSELRSCHCI PAWVTRAKLS LKKK 
YKKRSPLIF 


258 


14159 


A 


263 


3 


330 


QLKKKY YEELYANKLGNPDEMDKFIxE I C 
ILQKLTQHKN* KFFKDSRRKS KMNRP IT 

KTD T. T f\ it D IfP T . D TTD TTTT .1? DTV5 . Vtf If 

HFFFKLTKPLHKFF/AKIKEEEPFPNS 


259 


14160 


A 


264 


326 

+ 


43 


TQETEAGESL\DPGGRGCN+LGSCPCTP 

AWATGMKLPFKKKKKKITSLPELRFLSY 

TCSS S QNFLYLNDDT I IHPVAQAKNLGD 
r»n<5 Q PTDP TO 


260 


14161 


A 


265 


311 


1 


FPPFP/SFPLERKEKFKGRKKRGGTKRG 
PFPPFFPFSSLPLGFP/ SPQRA*FLLFR 
FREFLKFFFPFYLPPKPGLGKFFFFFFF 


261 


J4162 


A 


266 


5 


323 


DSTKAQKNTWS IEPGEVGTLIHC/ WVQ 
PL*KTVWWFLK/DVK/LELRYDPAIAHL 
GI/YPKKNKT*TCTQVNITALFIMTKM* 

K* PKTJ / PTNNRWVT / JCIiWYMHTMRV* 
IN 


262 


14163 


A 


267 


3 


293 


GSLQ P LAPRFK* FS CLSLL/ S AGEWHEP 
WRRSLQRS CHCTPS S LG YRAGLHLKKNN 
NNNNKSE VSRI CTI GI FLLFI Y/ CI YBT 

ovUij v x wvnniii/r 


263 


14164 


A 


268 


3 


332 


HLS I INLNVNQLNS PLKAYTLPVWI * KK 
KWPN Y \ CCLQE I H FAS KNTYKLKVKELK 
KKFQANEKEKHADKTCFKSKTVK\KKNG 
HYIMIKRSVQKENIYiyMFLIADPDIC 


264 

* 


14165 


A 


269 


3 


326 


KVLERHDVLKLTQ/DKICNLNM/PYTKE 
MESIVNNIiPKKKTPGLHSFTGEFYQTFK 
NEMILTSYNFSQKFEABEILNSFYEASI 
IiL\ I PKLDKD 1 1 RKEN* R/ PNAKI I2JKI 
L 


265 


14166 


A 


270 


1 


318 


PYPAKLSSLSKGBIGTF\LDKQMLKEFV 
TTRPALQE ILKGALNVL I * /LERKDHYQ 
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SEQ ID ' 
NO: of 
nucleotide 
sequence 1 

• 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etK 

od 


SEQ ID 
NO: in 
VSSN . 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ins to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine . 
C=Cysteine, D=Aspartic Acid, 
E^G/utamic Acid. ^Phenylalanine. 
G^Glycine, H=Histidine, Msoleudne. 
K=Lysine, I>Leucine, M^Metbtonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y^yrosine, 
X=Un known, *~Stop codon, /^possible 
n ucleotide d eJetion, \=possibJe 
nucleotide insertion 














QMQKH 


266 


14167 


A 


271 


i 

ft 


318 


LKLLTSGNLPASASQTAG ITGINHRAQP 
HLSFSL*LWNET/CVPQ 


267 


14168 


A 


272 


108 


322 


DSHVIFLLCGNLTVFVCLFLRWGL/DSI 
THPGVQWHHHNSL* P*TPGLK* SYHLSL 
SS I WNYRRTP LRIiAF FL 


268 


, 14169 




273 


341 


3 


GFFFPFPFFRQGFPLIiAQVK/VPWGNFR 
SLQPLPPGVKQFSCLNLPKN/WGFSLKG 
FFKKGCHFPF* Y * I FFNGVKKPDI KFS I 
FTNPFFFFFETESCS WOAGVO *HDLGS 
TRP 




l"tl /V/ 

1 


A 

• 


11 A 

*>f*T 


41 




FWLLTPGOETERERESETERESERERRA 
GEAERGRETDGIDSLIYSKLYSKTIHLK 
ILEYI \ ATYFKRLFYNWEL* T* AK/TRI 
IEKPSVRHQCQQRS 




141*71 




Z/ J 


"i 
J 




E AORTiL \ D Pfi^CSGC «5 ET iRSHHCT Q A WAT 

KAKLRFKKKKKKKCPGRYLGNIVLQ* FN 
I IAKCDKYHI VKLI F/CFSSAYTAS IQK 
LIOVSRTVTHRKNKKTROYVYILD 


271 


14172 


A 


276 


175 


313 


STLI SYFRDRVLLCYPGWSAVGQSQFTA 
A*NFW\VK*SSSLSS\PSSW 


272 


34173 


A 


277 


254 


3 


RPRRQFGIEGSFLNQIKNIYKKSTANVI 

VL* LP WI IRQ / EKVI *GMQ I VKEELNL 
SL 


273 


14174 


A 


278 


186 


1 


P ETMQARRQ * SB I FKVLKEKN/ LQHR IL 
YPEKLS FOSERE I KTF\ ORIiKKFI TSRP 
ALQRMLKE 


274 


14175 


A 


279 


288 


3 


GRLRGKKGFNPEGENSKEFFKPLPSGLG 

ajktopvftckkkkkkktrnptkk/wakdp 

NRYLTKEAI *MASRHMKRCSTS CVIRBL 
* IKTTMRYDYVLI 


275 


14176 


A 


280 


3 


329 

• 


LKESSRLSLLCSWDHRFALPPMLSGLVW 
NSCPO /CD PSAS AS LS IG ITGMSHHTWL 
*WLF/C*ETGSHPVTQAGMQWYDPSPLH 
PTKKGVYLT/RRFWRLGSPSAW 


276 


14177 


A 


281 


233 


3 


r XTKDKEEHFXMIKRS IHQQGITITNIC 
VPYNRGSK*MK*KLTKLMGENR* /HSII 
IAGDFNTTFSIMDRAIRKSAKNG 


277 


14178 


A 


282 


32 


309 


LPD ITPRDHLS PGTMDFIOBTGCSKCWR 
GCGTPVCC* WECKL/VTVQSLSRTVWRF 
LKKLKIGQVRWLMPVIS ALWDVEVGGS P 
EARSSRPAWP 


278 


14179 


A . 


283 


1 


302 


CKRNNKAWMTVHLFTAWLSE YFTPTVET 
YCS*KTIPFTIL/LLDDNAPGQPGVLVE 
MHKDMNVAFRPANTASILQPMNQGGIST 
FNSYYLRNTFPKAIVAIE 


279 


14180 


A 


284 


155 


1 


PRRFFFFFF*DRVLLCHPG*STAT*SCL 
TTKSASQV\RQFSCLEPPSSWDY 


280 


14181 


A 


285 


131 


15 


DRSNPGRFL WTSNS S L YERPRE I RPSSQ 
APPPVNDPI*T 


281 


14182 


A 


286 


1 


359 


FFFFFFSWFWCFSFFFFKKKGGAPGGG 
GPPPPKKTPLFSPQKKNFFLQKKPPGKG 
FKGPGLP PPNFGPG/ RNGGPLKPGDPHP 
PDFTPPRWLFKPPF * KRGEPFFPPPGFP 
KNLFLKKVP 


282 


14183 


A 


287 


52 


419 


LEERAGTS W PKiCBNKKQHFCMKKKKKKK 
KKXKKKKKKKKKKFFKKGAQI FSF*GGV 



530 
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SEQID 
NO: of 

nucleotide 

sequence 


SEQID 
NO: of 

sequence 

■ 

■ 


M 

ejth 

vU 

* 


SEQ ID 
NO: in 
USSN 

09/5154 
26 


Predicted 
beginning 

n u cleo tide 

location 
correspond 
inp to first 

IUK W 111 

amino acid 
residue of 

peptide 
sequence 

■ 


Predict- 
ed end 

nucJe- 

otide 
location 

correction 

ding to 
last Amino 
acid 

residue of 

oentide 

UvilUUQ 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
E^GIutamic Acid. F^Phenvlalanine. 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asd a ra&ine. P=Proline. 
Q=GIutamine, R=Arguiine, S=Serine, 
T^Threonine. V^Valine. 
W=Tryptophan, Y==Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion \=nossible 

nucleotide insertion 














NL/H/ SGPGPVLKTGGGGRHSPPPPPSP 
PSSLFFPSSSLPSLPFLSLPLPSFLPSS 
PPLIiSS/CPPPLFSS 


283 


14J84 . 


A 


288 


236 


325 


LNIRTPGSWLXAYAHACNPSTLGGRGRR 
IT 






TV 




018 


*K/y 


LEKS FCLBTEPHSCLPQLVHWPDLGLLQ 
PMPPGIARF 


285 


14186 


A 


290 


3 


426 


HEETGSPRLECSGMITAHCSLDLPKLK* 
cqtTT^CTr^q /pifVPPrT APnrr.Y 


286 


14187 

\ 


A 


291 

! 


382 


1 


ANF YVTLVQ \QGFTMLRS WRP A* PCDPP 
TWASQSAGITGVSHHAWPKMSTITLGVY 
SFGESEVFSIFKFYFLETGSSFLPQAGA 
**CNHSSLyP*TPGLKQSFHLSLLSTSV 
YRYLPPCPLCPVLV 


287 


14188 


A 


292 . 


3 


309 


JIB VP X s TXtN UUu«n JL KuS KLKjJj V AAKKIj 
\ ISQI VNAKEKFLKE I KSVI P VNTRMI K 
QNSLIA\ETQEVSLVWIEDQTSHNIPLS 


288 

* 


14189 


A 


293 


2 


415 


ARDQYKKSTKK* AKDLifRHFKNEAI /HM 
ASNYK/KRCSIS*VIRKIEIKSSMTSCY 
THLQN/ALKLKSDNFKC*QKCTAMGIIL 

SAI/PTSGLYPTEMYRNIEQASLRMFWA 
WRGG 


289 


14190 


A 


294 


335 


108 


IPTPK*PLSSPPIHTPYTTIPKP/RPPP 
P*HPPHIHPPPPPLTP\PLPTHTLPTPT 

PTMT .PWT PPT iY9 T PP 9 QPTf T <? 


290 


14191 


A 


295 


266 


2 


GFPKKVFS FP IGPP KLGFPNS VFS / LFT 
FFNPGLCLKFFQFFP FSAFLPLGFS FPF 
FFFFFFFF * DRAVLCHPGWSAWRS WLT 

A1UV 


291 


14192 


A 


296 


347 


1 


DSLQP*TRGLN*FFHLSFLTRWEYRCAP 
QCPANFC/VFL*RWGFAMLPKLVSSDLP 
ALASQSAGITGVSHCAWLVFLLFSLYF I 
HFSKASETPWAQNQSQTPSEYPATYNFK 

TYSC 

•ft. 1 O w 


292 


14193 


A 


297 


284 


2 


CRQGFVLCRLVSNSCDQVLGVRLCV*VL 
GLOACAT* LGAEGVS IEFSCFHKGRGCL 
CRHRSS /HSMFSAWC*PHTHTHTHTHTH 
THTPSPMQIiV 




1/11 OA 


n 




i 
i 




GTRLCIiGAI TTLFAAVCALTOKDLTKI V 
DFST * SQLGLILVTIG INQPHLS FLHIC 
THALFKAILSMCYITIIHILNNDQDSQT 
IPJILLNTMHINSTS \LTIDSLS*L/GKP 
FLN/GYY*OKDLTKIVDFST 


294 


14195 


A 


299 


3 


134 


HEGRD * PGNHGDTLSLLKIQKLAGSGGV 
CL\KLRKLMCENHLNP 


295 


14194 " 


A 


300 

*j \j v 


2 


333 


GVGREGGSREGGEEESGWNGGIRERRSG 
KREEEGKEYRDEVDRKERGEEGGRERER 
GRGWEEEQKGAS * C/EEGRVK 


296 


14197 


A 


301 


354 


1 


TENELLSRIYKNFYQIN/RYKQKNSIKK 
*RK*QAI/TTETIQTVNKHTjKKCSISLI 
IRBIiQTKTR* I LYT * LVE KKLKTNG I YC 
WBCCRELGSC 


297 


14198 


A 


302 


1 


282 


GTRGMVAGAYNJjGLPSSWDYR/ *CLPPC 
IiANFFVSL VERGFTFJuTTLVLLS * L YDS 
PASRSETAGI PLIHTSCIVYMSLRNNAD j 
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SEQID 
NO: of 

nurlentiflfe 
umivuuuw 

sentience 

i 
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SEQID 
NO: of 


M 

etti 

nd 
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NO; in 
USSN 
09/515 1 
26 

■ 


1 Predicted 
beginning 

nucleotide* 
location 
correspond 
ing to first 
amino acid 
residue of 

nentirie 

sequence 


Predict- 
ed end 

mi elk 
otlde 

vuuc 

location 
correspon 
ding to 
last amino 

ftfMfl 

residue of 
peptide 

con ■■ An m 


Amino acid sequence (A«Alanine 
C=CystetDe, D=Aspartic Acid, 

Cr~VJlU Id 111 It nUU) I* JT llcll Jlalltlllu6y 
\j wijri.iiiCy n~*niaiiijiiiC) tr~ imwcuKtuv^ 

K=Lysme, L=Leucine, M=Methionine, 
N=Asparagine, ^Proline, 
Q=GIutamme, R»Arginlne, S-Serine, 
'T-Threonine, V^Valine, 

W— TVv ntnnh « n Y=sT v r n« i n p 

X=XJn known, *=Stop cod on, ^possible 
nucleotide deletion, \=possible 
nuuwiiut/ insertion 














IAILCKAQAY 


298 


14199 


A 


303 


361 


3 


LPKWWDYRHE PLCLATFFFF * AFPPVPL 
CMYVPPN/ RYVAFFFSLSS F * I CCTFTL 
Y I F PQLMDG* AVS S FLFLQPVGVKFKKK 

HPPWPRPRA 


299 


, 14200 


A 


304 


2 


352 


ARMVSVS* PCDPPA*ASQSAEITGMSHH 
AQPIIIS*TIAYVSFSLSSLGT*ISSVR 
o U 1 IvonoVcliLi x r x 1 V oJj/ Jrw 


300 


14201 

i 


A 


305 


334 


1 


\ KNSFFFFFKRSIiGIiljAQI* /VQWGDFRT 
LQPL P PGVKQ I S RLNLLKKWDY * RGPSG 
LGKFW/ I FL* KQGFPQLFQWLNFFFF/ 
C FFETESRS VAQAGVQ WHDLCS LQAPP P 
SSC 


301 


14202 


A 


306 


270 


2 


DWCAGGGGDGVP RRQ VTFV I LVETGFWR 
VGQAGFQJjIiAS S YIiPALA/ FPKCLDr RH 
j * P PHS ALKALFFF *DGVSLLLPRItECNG 
ASSPRA 


302 


14203 


A 


307 


306 


2 


HIr yCVCVXjVCVCCVVCVCVVW 

KLVISQT*IiSPLCSGP*A/CTFFSVCVC 

VCVCCWCV\CVCVCVCVCKVSDI,TNLS 


303 


14204 


A 


308 


117 


332 


S P PVLLRCS LSLS VQLNGKTIRFLK*IjK 
MEL * FHS AI PLLG I YPKGKKFLYQKDT/ 
CHSIFITALFIIAK 


304 


14205 


A 


309 


1 


351 


gtrktnn*kwakdlsrhfskbdirngq* 

V w / nnKKCoAJbij I J. Rl W* IS I TV \R YNIi 
TCIIM 


305 


14206 


A 


310 


1 


352 


PSPPNSPPSPLPSRPPQPPPYPPPPPRP 

ODCOT * VPTSnivriTiVTil /T3CTITT rpn P n fnc 

i'KbFij* X UrfAfi*! W/ PbPlLtl PSP/ PS 
DPPEPRSSRP 


306 


14207 


k 


311 


1 


405 


FKPSP * pltgallgllmtsglam* fhlh 

oil \1jJj1JjL.\JjlLj1J>I1 X i\J WKUVIK 

ertyqghhtppdqkglrygiilfitsev 
fffagyf * afyhs s laptpqlgghwppt 
gmtplnp levlllnts vt las + vs i 


307 


14208 


A 


312 


3 


176 


HEILGSNF/CGETQVSIHCPG*S*TREI 1 
is\2 o oxlufi J-j lr JlvwVvfUatfUV 1 1 V fkiJ-i 111 rn 
LGKPKHKRVR 


308 


14209 


A 


313 


298 


327 


SLTLSPRL*KIKK*RKK*NKCWQGCKEI 
GAFTHCWWGCKMVQPLWKS\W* LPKKVK 


309 


14210 


A 


314 


344 


3 


HSTSLVIREM* IKTRMTYPFTTTKMAI I 
*KNRS * \ WLGCGKTGSIiIHSW* KC*MVQ 
SLWNTFGQFFIKLI*ELTIPLLDLYLRE 
MKTYDHTKTCI *^3F I VALFI IAQNVKMS 

JjV 


310 


14211 


A 


315 

• 


18 


321 


WALFVYCLEKKEFGQP PRIiACFFFWDR V 
S LCLPGWRAVMGS WVT /VRPKL1AFN*S 
SHLSLPSSWDYRLIPPCSAKDSICEKNR 
VSTVDGADKKVRRGKEGD 


311 


14212 


A 


316 


289 


52 


ETSCDVPSKGILHFGKRNTHIMPEKWAK 
GMR*YFH\KEDK*MAHIHVKRC*TSLVI 
MKMQI KS STPYHFI PPRLTKPLKSANTK 
CW*TCIWAIYLSSSMKILSHALCPFFWH 
DVCISLTEVKNAFTGHITRRFS 


312 


14213 


A 


317 


57 


324 


YCVTFFFFGKQSFVLSPRLNCMGPFWVN 
CSLSLLSTWNYGLVPGTQPFF\EFLVQR 
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SEQ1D 
NO: of 

n iirlpntirlp 

U UVICV LIU W 

sequence 

■ 

• 


SEQID 
NO: of 

sequence 

■ 

• 


M 
eth 

nil 
uu 

■ 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

II UtlCUllUC 

location 

c nrre.cn find 

ing to first 
amino acid 
residue of 

|IC|f uuc 

sequence 


Predict- 
ed end 

IIUUP* 

otide 
location 

correspon 
ding to 
last amino 

residue of 
peptide 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
i^vxiuiaiiiit /vciu> r-rnciiyidianine, 
G=Glycine, H-Histidine, I=Isoteucine, 

*»• UJrOHIw, Vr*JUCUUUC| J.»l 1'JLClllIUnHltJj 

N=Asparagine, P*=Proline, 
Q=Glutamme, R«Arginine, S^Serine, 
Threonine, V^Valine, 
" iry pivpnaii, i e iyru>inc, . 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 

iiulicuiiuc insertion 














GFHYVAQVGLKLLTSGDLLP * GSKOAQP 
TGLTHC 


313 


14214 


A 


318 


65 


341 


QWLLKSRSCYFFFL*DRSWAVAQA*VQW 
R/NPPPPGFKQF/SCVSLLSSWDYRCLP 

KREIPNLPGLG 


314 


14215 


A 


319 


2 


182 


ARGDYRHMPPYT\H*LETGSCFVIQAGL 
ELLDSSHPPALTSQCWDCTCEPQSWSAP 
CVSI 


TIC 

315 

i 


14216 


A 


320 

\ 


345 


3 


MPPRQAHF \ VFFIEMGFHHVARAGLELI* 
GSINPPTSASQSSWDHR*SQSAWITOVS 
HHAQI/3T ITS YHI LLFLKKGRAHAC* SQ 
HDPTTCLHVNLCCSLLLAISRPPTTTNIi 
PRA 


316 


14217 


A 


321 


122 


466 


QSFSTYC* KNWVTI / CFLKS IDS YLVPY 
TKIDSKWI I DLNVKPKT I KLLGGNMGEN 
LCHLMLSKDSSDVTQKA*SIEKQINKFN 
SHGGTRLRSQPIiKRAEVGRVRLSPGGRS 
YSDL 


317 


14218 


A 


322 


3 


345 


HEDAVSAPCNIiHLPGSSNCPASASRVAE 
MTVPA\PCPANFFFFFF3jEREFGFCCPG 
/CK*K*VTOKKPKGkkKGRlEGKKC^ 
PKLKERERKKRKKERRKRGKERKEVRTB 
MNGG 


318 


14219 


A 


323 


206 


446 


GNLHQCPYKWTHLAKYMCEKQDS I C* IL 
KECYQKPLLEKSHL I AQVENDEKPAD W/ 

dt.t w/"t vmTbuKin 
PJjLtKlrE I V Ji WWJU 


319 


14220 


A 


324 


62 


327 


VERLIjRDLRENFCRNPDGSEAPWCFTTjR 
PGMRAAFCYQ I RRCTDD VRPQGEAQAWG 
LQ SRAGS LE PEGRGEVS A* W / SARTRHR 


320 


14221 


A 


325 


441 


1 


RKKK*ILQGIi/EFRLBHWASPHRDHRV 
W PRQEGKLFSEGKNKATRMTVLYPEE / S 
SKKLGSRS*GSE/CTSVFRITLSVGYVA 
YTGAPVS I SERPSAAI YHRMNKW * HQHR 
RHLLGFSEQEPVQGVEPLQGSSQGPKEP 
QPSPSNPAPRA 


321 


14222 


A 


326 


2 


358 

• 


ARARTLRIMVNLNYAKSTLPVLCKGKNK 
vTVVTAHiiFTASFAE X FKPAVETYCXjEKI 
FFKILQLIGDAPCHPRPLMDMHKEVNVF 
VPAKTSIQQPMD\ISPFKSYY*EITFQA 


322 


14223 


A 


327 


1 


430 


ARBNMPGHIjHRCIIEQDWRIHMLIPKU!? 

TQMIKIV*YWYKDRYVDTE/YNNIESTE 

VNLYICGELIFSRSAKIIQWWGKRIVLN 
inj/*^jr^n /mxtt. / Qr , T?OM#MnovT. , PT.r , T , V 

ISJriKJrlUlJ/ VivUjf 0UXwn.nI/0 XIj i JjU I \fv 

ITPWIINPTGRAKTMRLLP/EKTGVSFC 
DFG 


323 


14224 


A 


328 


146 


374 


KCLMLTKPYNIMRLT* PYEIR* / CGDTF 
P * S YYA* TGTAVRT WGLTPUT PAL WEAR 

AGGSRGQEMETILANTEKPRL 


324 


14225 


A 


329 


2 


322 


ARASRTF I VRKTQ CLAS KDKLTLVRG* C 
S CDFQLKSMLI DYSENPRALK/NMLYKR 
NNKAWMTLGLFTAWSTE\KPTVETYCSQ 
*\K1TLKIWL 


325 

- 


14226 


A 


330 


350 

• 


47 


EMGFLHAGQAGLELLTSGDPPASAFQSA 
GTTGVS HRARPAN * KKKF / CLKT / RVFI* 
, CCPGWS*TPGLK*SSCFGLSSHWNYRHE 
PPRPAQFIiLIDFYLRHYRES 
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SEQID 
NO: of 

n U LICUl f UC 

sequence 


SEQID 
NO: of 

npntirla 

j/cpiiue 

sequence 


M 
etb 

UO 

• 


SEQID 
NO: in 

09/515,1 
26 

■ 


Predicted 
beginning 

fJULIcVllUC 

location 
correspond 
ing to first 

*■ ttiuiiiu nuu 

residue of 

peptide 

sequence 


Predict- 
ed end 

mi a 

nucle- 
otide 
location 
correspon 

Jinn 4t\ 

uing ni 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=Aspartic Acid, 
Ci— ijiuiamic Acia, r— r oenyiaianine, 
G=Glycine, H=Histidine, ]=Isoleucine> 
K=Lysine, L= Leucine, M^Methiooine, 
N-Asparagine, P=Proline, 

0=/?lii4omina QsArainin* C-- I C A _l_ a 

V^HjiuiaiDlne, iv^vvrgjiiiuc, o ss 9eruie, 

^Threonine, V=Valine, 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, /^possible 

NIImIaA^I^Ia jIaIa^IAM V«ff%A.*AS|t|A 

nucieotioe a e let ion t v— possiDie 
nucleotide insertion 


326 


14227 


A 


331 


753 


458 


FFFVB*GVL\HVAQAGLKLLTSRDLPTS 
AS * VAETTSTHHHA* IiHLFNFEIOTCKE 
QILLCCPGWSRTPALKRYSHLCLPNSWD 
I Kri£>nUJN Ct f Ui a JLoJj J. 


327 


14228 


A 


.332 


256 


3 


TLVLMQSCLSLLSSWDYRHKPP/ *L"SnT| 
PNLPMGLSGHNPI INGGISVSLDRVWL 
FLFC/LCFETGSHSVAQAGVQWHDHGSL 


328 


14229 

■ 


A 


333 


25 


358 


TPDIiR*STCLSLPKCWDYKHVPPHPANI 
FI*QKLFFISDGQYANVLTWFEGGTFHT 
*A*RPNHHSYKLRKDQTIIFIYLFF/LR 
WSli/NS VAQAGVQ/ WHNLGSIiQSuPFBF 

K 


329 


14230 


A 


334 


3 


329 


HBDWS P FWPGWSQTPDLR* SACLGLPK 
CWDYRREPPHLALNAFLT*NIFNL*W/C 
hSGSNPIIRSICIRFU3KGG*LLFIFIF 
VWFLTQSLTHSWQAAMQWHDLGSLQ 


330 


14231 


A 


335 


26 


330 


SQIiLGRPRQENHLNTGGEGCSL/CKIIL 
VCFWLGHLLPVWFRC / ILYLF / CWSLG 
FNFCFSLVL/CWFLLCFFCAFMFCLFY* 
WFVI CHFIiFFFFLFYFFVMFLFF 


331 


14232 


A 


336 


3 


348 


DEVFKS/YH*HPMLKC*GDYSLLFHGES 
DLDTTQ ILTHPS TTAM YFVHYCQS P * I L 
YGTIDT* PPV IHRNPIHIRTPYPCDQAS 
TAINLQLSHMNCNSKATPHSLGYHQTYP 

T\Y fin r 


332 


14233 


A 

.■ -.. 


337 


2 


329 


ARAARAAR VTG I AWAYHL IG KGLE P FF F 
MALLFLR*GL/NSVTIiARVHWYNHGSM* 
A*PPGLRCMSLCPA/NFLYFLVE/IGFA 
TLPRLQAICl^SSDLPTSTSGSAEITGV 

s 


333 J 


1 4234 


A 


338 


60 


354 


NGKECKKLI YLS I YLS I YLS I YLS I YHL 
C* XL/ SCLS I *LSii *LAVYYLS VCLSVS 
VCL\VASCL*ISIYLSIHLSFYSIDRIL 
AHHCVPPTCSLPHATH 


334 


14233 




339 


J 


351 


RD AD V ML LQ I LVI FQAIiALG VQTEGMAG 
VTHHDRVLVNWVRFPQCL/HRV*SNGT\ 

CEPPCPVNF/CFFV 


335 


14236 


A 


340 


190 


358 


FGSWLVFFF/CFLETDSRFAP*AEMQGP 
«r U"UHrr rtrvjLiiwsr ruLtoJ-iirV? ivji/Xvjir 
V 


336 


14237 


A 


341 


306 


119 


LR WENHLS LGN*GCS EP * FRHCTPTWTT . 
LPETLFT 


337 


14238 


A 


342 


1 


122 


GTRGCGEPRSRHCTLA\W*QSKTPSKKK 


338 


14239 


A 


343 


92 


316 


VCWDIPHRJCSRTICILFYPLVIFLFETK 
I CSSFSF* KGCM/WLGAVAHACNPSTLG 
GQSGWIT*GHE/FKTSIA j 


339 


14240 

• 


A 


344 


83 


329 


ATAPAPCyFLILCICMCVCVCMHTSVCT 
/COTLCMCMGLCTIL*TCVA/CTCLGMR 
FLLPAHI I ICVKRFCLMCGSGNSGSLCL 


340 


14241 


A 


345 


127 


357 

r 


VGVL6RGRVLPCWPG*SRTSGF/SSPEA 
NSWPPCLANILYLVETGFCHVAQAGLEL . 
LDS SHPPTSASQSAE I TSMN/HRA 


341 


14242 


A 


346 


312 


1 


LWNLNRS VGALLTRGVKCHKAFRST I S L 
LIGALK* *QSKSD3^ISLSKVVKDIiNKD 
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SEQID 
NO: of 

n it r* 

nucicuiiuc 

sequence 

- 

• 


SEQID 
NO:of 
peptide 
sequence 


M 

eth 

■ 


SEQID 
NO: in 

09/515,1 


Predicted 
beginning 

11 UCICUlllIc 

location 

LUI I CO|l VUU 

ing to first 

ominn orirl 

n UIIUv uVIU 

residue of 
peptide 


Predict- 
ed end 

11 U Lie* 

otide 

Inratinn 

correspon 

dine in 

last amino 
acid 

rpcirtiif fff 

I CSIUUv Ul 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

Pi— ||| m J ft A alfj C»-Ptinn nlo Inntna 

c/~ vjiuunniw r\wio, r— r iiciiyiaianinc, 

G=Glycine, H=Histldine, I=lso leucine, 

Ksf,vcinp I si j>nrini>> IM— IVf f^HiiAnirio 

*V *Jjr5IIICj l/"MUUUCf IT* - 'ITXC ill H/N1DC, 

N^Asparagine, P=Proline, 

\^^wiu utiii mis, iv - v*r giiiiuc, ^j^tjCiIIIc, 

T=Threonine, V«Valine, 
W=Tryptophan, Y=Tyrosine, . 

YsTTnlrnnnm — C+rv »% pnHnn /snnooiKlA 

nucleotide deletion, V=possible 
nucleotide insertion 














IYRSr»LRQSEEDTS\KWKDIPCS*TGTP 
N'T VKMS TTiTK AT YP FNASP 


342 


14243 


A 


347 


350 


243 


FHHVGQDGLNIiLTL*ST/PCIjGIjPKCIWD 
YRCEPPRLAY 


343 


14244 


A 


348 


165 


338 


HTLDPASHEGPTFLFSL/ PVEQCLK*LT 
QSQ 


344 


14245 


A 


349 


328 


2 


SVCPHGS INLSPAETTGACH/RRIiA*NI 
DSRV*AKTFKGK*TNCSYNNNNKNNNNN 

FNGATPAGPPSCLRRSHHSSAVCSC 


O A C 

345 


14246 

i 


A 


350 


3 


338 


FLTLTTKLTNTN I LHAKE I DTVRT I L PA 
IILILIA\LPSIiRILYITDEQNDPSLTI 
JkoXwiWW 1 XE* i xJJXQfcXjxrxlS X xxjJtP 


346 


14247 


A 


351 


124 


382 


NTFKS *NSNKTKNAI RT * AKDTKQHFTG 
EDIQ I ANKHMKRCL I PLGKR \ S LNQEI S 

IT 


347 


14248 


A 


352 


3 


368 


RDRASLFCPGWS*HPELK*SSCLGLPKC 
WD YRR / AATAPGLLCL 


348 


14249 


A 


353 


390 


161 


FSRDRISPRWSGWSP/DtRLSP/CLSLP 
K.CWDYR/ R*AxTflPGTFrnFNLKNQDAKN 
QIITCPNLTNKYIYMTSSFILVHS 


349 


14250 


A 


354 


123 


345 ' 


LAPQHFAS LRLARP / PAGPPCDRPLAPP 
RPS*FSPAPLPPGADRSVPLSPSVPP*S 
LPPPAGAPPR/PPPPRDSPRR 


A* f\ 

350 


14251 


A 


355 . 


37 


393 


TKHFVSTCYVPE F I AGTNQR KRNR I S VL 
ANVKDKQI ILLRVRTAITEGYRNYHVLF 
LVFVCFFSPPKQGL/DSAFQAGVWWHNH 

MPT #T1*»¥ 1 TV"»T ITT O OUT T"\T /*»0 CUr>VT5D17TJ) 

/RMSSYF 


351 


14252 


A 


356 


2 


335 


ARETS KDRLTT iT JiG ANAGGD F \ KLKP VL 
LYHSENPRPLKNYAWST/LPVL*KWNKA 
*MT\HVFTEWLAEYFKPTVQTYCLEN\K 

T?CTTT,TWKTDT CUDD JVT -MTVDTJlfPT'TT'TTT.M 
r oXLLui-JNCIiriJOflxr KJiXmLUrCr XVJS 1 XL/ vXarl 

PA 


352 ' 

• 


14253 


A 


357 


199 


377 


VGSSCFIiKVCQ/C*AVCVCVCVCVCVCV 
CVCLRCV/ CVCL/ CV\ CVGFFFFLVLOG 
CGCVCVC 


353 


1 /Of 4 

14254 


A 


358 


1 


335 


r"PDTTPMVtrimnQClMTl?*TT7TT?OM'f*'l' OMf 

LPRLVANSPPQTILLLQPPTVLPNPERT 
SALY/YRPLLDLSPTSLSPRIPSSPLTS 

A 


354 


14255 


A 


359 


3 


369 


KPS P * PLTGAxjSALLMTSGLA\M * FGFH 
SITLLILGLLTNTLTIYQ*WRDVTRKST 
YQGHHTPPVQKGLRYGI I LFITSEVFFF 
AGFF* AFYHSSLAPTPOL *GHLPL I VI I 
LLNSL*FPTLY 


355 


14256 


A 


361 


337 


121 


LWSQLLGRJjRQEDCLS PRSQGQNKP * CH 
CTPAWVTE*DSVSKKGKVVYIHNKKSSIj 
KKK\NSCNMQQFGGT 


356 


14257 


A 


362 


25* 


375 


EKKTIVQYPHTYGINV* knlpakktpgp 
DGF IDKLYNTFRGE I TTS PHILFH * FKE 
VAVLIjNS FS K\ AAS I TLT PKPNKD I MRK ! 
ENYSPISSYIWNQCSFFLEQIFPIiSFKL 
NCPGLIIPH 
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SEQID * 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E*=Glutamic Acid, ^Phenylalanine, 
G=GIytine, H=Htstidine, Msofeudne, 
K-Lysine, L=Leud ne, M=MethlonIne, 
N=Asparagine, P=Proline, 
Q=Glutarmne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y»Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=posslbte 
nucleotide insertion 


357 


14258 


A 


363 


170 


1 


S QLLRRLRS QGACHHAWL I FVFS VDTGS 
HHIGQAGLKLLS \ * AIHPPCPPKILARA 


358 


14259 


A 


364 

* 


330 


3 


RIMSASASQSAVISDM3HDVRIiIDI*HE 
P*FYQALYSFRQ* ITSSPSYR*SYLLSY 
KWQDBARC /AFVTAWQAEAGGMLEPSRL 
VLDS*PRDLPTSASQSAGITDMSHHSC 


359 


, 14260 


A 


365 


1 


341 


SQYGLDFHDLSDLPTMASQSAGISGVSH 
RAWP AS G * F YVSREV I CS YRL * G YTP S L 
LVLESGSSSVTQAGAQWHNHS ILYPHTH 
GLK* SSCLS ITTR/ WDYRHESP 


360 


14261 


A 

• 


366 


2 


338 


ARMVS I S * P RDLPTMASQSAGI TGVSHR 
AWPATGKF YVSRGVI CPU * KDTHLFF / 
CFLETGSSSVTQAGAQWHNHSLL* PQTP 
GLK*SSCLSIPKC/WDiTRHESP 


361 


14262 


A 


367 


2 


337 


ARAGILKH*QRKCKSIR\PLWNNVGYYI 
LKLNIG ITHDPAI PFLVLYPS EMHTHAY 
*KTSMRMCITTLFIMAIN*KFKCS*SEE 
E / IGKFWYFT* * LLYSNENEHLQLHTTW 
IN 


362 


14263 


A 


368 


59 


348 


SAFGIQENEMPAGRGGSHL* SQHFGRPR 
RVDHLRSGVPD * LGDR/GETPSKKKKKK 


363 


14264 


A 


369 


350 


3 


GWSQI PDLRWSTHLSLPKODYRRGPHA 
QL*TLMYKYLFEHLIiSVLLGI CPEVEWC 
/DHMAILCLISLGTVILSSKALHHYTPS 
PVTCKCSNFSISLSTLVTPHCPLHACYF 
PLFSRA 


364 

• 


14265 

■ 


A 


370 

• 


908 

■ 


30 


ECSGTILAHCKLPLPGSCHSPASAS*VA 
GTTGVCHRARLI PTjYFLVHTG fhrgtjjtl 

LTS*ST/SASASQSAEITSVGHHAQIj/C 
LELELK* STCIiNLPKCWDYPA*ATVPSO 
FGSVFLRIF*PMFISNIGQLFSCSAFVW 
P*HQGNAGIiIE*VRKYFLLFKFLKEFA\ 
RTG INFR * MSGRTDW * NHPVRG I S LLKG 
F+LVI*CSY*L*VYPYFLLF*EVSLLSP 
RLECNGVHSESAG ITGGSHHAQLVYP YF 
FFVGDSVLVGFVFLE I CHFI WLFNLLVY 
LCLWYYLI ITLIAVKSQ 


365 


14266 


A 


371 


164 


1 


SCRQVBSYGIKRITCISLASSWNYRHAP 
PRPANF\ * FPAEMG \FLHVDQAS LDS C 


36*6 


14267 


A 


372 


3 


383 


MVSTPAEDAVNIVDMTTN/D * EYSLNLV 
DKAVAGLERIDSNFEISSTVGKMLPNSI 
ICYKElFHERKNQLMQQKSLIiPYFNKLP 
QSIQHSAITTLISRQLSTIWQS/PPPTK 
/RFQLTEGSDD 


367 

• 


14268 




373 


1 


349 

- 


TLLGNDHIYNVIVTAHAFWIFWIVIPI 
I IGGSGN* LI PL 1 1 GAP VMAFARINKLR 
LRLLPTS \ I LLL\LAYAI LEAGARTG *T 
V* PALTRNY*NPEAYGHLSMFSLHLTGV 
SSISR 


368 


14269 


A 


374 

< 


443 


2 

* 


SSGSRS ECRFLPLLFNWIiEIIiAGAIRQ 
/EKE/IKGIHIGKVKVSIKLTEVINKFI 
KVA* YKIN \ Q KS WCL YTNNE QLEKKFK 
IPFKIASKRIKYLBINH*GKDLYNENLK 
TUjKEMKENKWINIPCSWIRRLNNVNIS 
ILSKVTYKF 


369 


14270 


A 


375 


3 


201 


LWIKKLDIIPriPS*LFFNTELDKLILK j 
LIWKFRGPR/ ITQNSFEKPIVGGLXLSD 
FKTYYKATVLK 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 

_ 


M 

#h 

od 


SEQID 
NO: in 
USSN 
09/515,1 . 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nude- ' 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

nentide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid. F*=PhenvIaIanine. 
G=Glycine, H=Histldine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=GIutamine. R~Areinine. S=Serine 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion. \F?nossih1e 
nucleotide insertion 


370 


14271 


A 


376 


11 


324 


DKFLEHPPLLVLQAVAGKGSPLHCLSTL 
PTSLP I * DRVLLCSPAWSAVAQTWFTAT 
ASWAQ* S SNHSPSD* R/PHTTTGHFFCR 
DKVL /TMLPRLVSNS WTOAI 


371 


14272 


A 


377 


85 


364 


YQHNYS FCFISQ YVYRLFFFF * TF FKKK 
GFWF*RLNFQGGNMG/SMEP*PP*IRQF 
FCLNPLN/SPDYRGPPaGANF* IFLKKK 
GFQVFPSLSKIP 


372 


14273 . 


A 


378 


1 


382 


GTSGTSYSTIFAGTLITALSSH*FFT*V 
GLEIl^MIiAFIPVIiHQKYT\P*yTHAAIM 
HFLTDSGA/ SVLLIEHjUjYSNSLGE*S 

I HAL CLDRF QLLH I LL I 


373 


14274 


A 


379 


24 


323 


I PGLKRSSRLGLLKCWDYR*DNFQHMCL 
KNFNNWAAY YFVCRI CCAVDPHGCS IKV 
owe iur / ^.r c x* * iwA J. v cr\^/vCt v yvaVaUJLAs 
* LEPLP PGLMPFSGLSL 


374 


14275 


A 


380 


2 


317 


AGWFQTPDLR*SPCLSLPKC*DFRG*PP 

PnKTiT7T.f*P T ITT F * TP T.TC /PVT.MTT . * HQT, 

PLMHFKKNVIYFILYKAALFFFFLRRSL 
/HSVAQAGVQWHDLG/S /LQTPS PGFK 


375 


1427/5 
it* /i/ 


A 


JO I 


i 




VJfPO PTRWP KT Pi /T'T»VP H72 T . T TP T V 

KGLKQLEGKN/KSNNLILKWAKEDXQTA 
NRYMKRCSTSLI IREI \MQIKTTTRYHL 
TPVKMAFI KR* /GNNEC* *GYSEKRTL 1 


376 


14277 


A 


JO* 


o 

• 




TO PL I LRCLPPRS T YPPN<2 T P I It T fWNP 
F\ *EKEKSLLKF I QNLKGP * I TKTILRK 
KKVDGKTF PDFTMYYKATVI KTVSWYQ 


377 


14278 


A 


383 


178 


1 


HNPPLAI S FCFFPFFFFLRQGL/NSFAQ 
AIVKWYDHDLLQP* PLGLKRFSHLSLLS 
SWDH 


378 


14279 

• 


A 


384 


312 


26 


FLRGVFFFFFLRRSL/DSVAQAGVQWPS 
FGSLQAPPPGFMPFSCFSLL/SFFFFFL 
VE/MGFTMLARJCVSIS*PFGPPASASMP j 
VEITGMSHCLANMFF 


379 


14280 


A 


385 

■ 


1 

■ 


849 

■ 

• 


FFFFKQTKFI KLSKYKNI IKKS /SAFLY 

HLTFYSKEHYTN* VTHKWNNITHS *TGI 
FNS * I FVLHKMICRYNATSIKI PVTYFI 
DIF/EKAiTilCFIWYHiCTP*l^K^ICri^ 
GI / LPDFE IHreTCTVVTKTVWHIjNTCKPnT 

GQWSRRKREQKYISVFTAN*F* IQVTFF 
FKGNNS I FNK* CL ENFMSTCR\ KKK* DP 
HLTPYVKINSK* ISHLNVRPKTLKLL\H 

QKIE*KPHNIGLGSKFFDLT*ISQDTRG 
RTSOSDHF 


380 


14281 


A 


386 


3 


318 


LREMQNALESLSNRTEQVEETTSEIiKDK 
AFE *TQLNKDK\KKR I * KNE* RLQEVWD 
CVK^PDLRIIGVPEEEGKSKYLENMFEE 
IIEQNFPGLARDLDIQIQVAQR 


381 


14282 


A 


387 


1 


382 


FTPTRTAWKK/ SNDWCWRGCGS IGTLR 
HCSWECKMAQLLW*TV/KTFPRKIRQPS 
DVCCDMVIGLP YDPAVLLLGI CPPJEMKT 
YVH1WMSVITSVSLVII\ADSENNLNV 
PSADEWINEMWYXHTVDY 


382 


14283 


A 


388 


1 


341 


HKLENLENI VKFL * TH/ TLPRLNQEKI Q 
TliNRS ITS S KI KSL I KUTPTRKBKKK\N 
PGPKGFPREFFPRA\KKQGVPTLGNPFQ 
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M 
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nucleotide 

location 
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peptide 

sequence 
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Predict- 
ed end 
nude- 
otide 
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correspon 
-dine to 
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acid 

residue of 
oeDttde 
■ sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
E^=Gliit»rnic Acid. F^Phenvlalanlne. 

G=GIycine, H=Histidine, I=Isoleuclne, 
K=Lvsine. L= Leucine. M=IVIethfonine. 
N=Asparagine, Ps=Proline, 
0=Glutamine. R^Areinine, S = Serine. 
T=Threonine, V=Valine, 
W-Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion. V=DOSsible 
nucleotide insertion 




■ 










K FRE RGS LLNY FYKPGLFR I PKLGKNPR 
GKKK 


383 


34284 


A 


389 


168 


1 


KKFFFFSFFF*GDRVLLCHPGNNAVMQT 
RLTAAS\TPGLKQSSHFTLPSTAGYTG 


384 


14285 


A 


.390 


337 


I 


TGITPLNPLE V\ PL INTSVLtJ^QVS 1"T 
♦AHHSVTESIGNLI IGALLITILLGLYF 
TLVQASE*FESPFTISDGIYGST*LVAT 
. GFHGLRVIIGSTFIjTICSIRQLIFHFT 


385 


14286 


A 


391 


1 


235 


IiNFSYESSMYFALFTIVF\WVFLNF*KF 
VFCFFFFFFFFFFFFFFFCFFFS 


386 


14287 . 


A 


392 


1 


258 


SCDRI*FANHLSNKELVSRKYIYIF*KSQ 
DSTIRKQTDKK*AQELNRRFSTKDLQMG 
NKHMKRCSTPIAT /REMOTKTMTjRYHCT 

PIG 


JO / 




A 






1T7 
.31 / 


hhdsvf*pjvedinmheiqfisflxflnrd 
evslcctgws*tpglkrascldlpkcwn 
yrheplciafffngswfqcqi 


388 


14289 


A 


394 


91 


408 


LGAEFDVRAYLTSGRLTGPGVPFLTGFY 
SKDHI I \ETANIS YTNA*ALS ITLIATS 
LTSAYSTRI1LLTLTGQPRFPTLTNINE 
DNPTLLNPI KRLAAGSLFAGFL 


Joy 




A 
<r» 


J SO 


j 


iOJ 




LVQPLWKAVW*VLR* /LKTELPFDPAIP 


390 


14291 


A 


396 

■ 


661 

■ 


1 


LCPLS S F YRKCALLGFWFVCIiFVFQIRV 
SLCCPRWLQTPGLKGSSHLSLPISWDYR 

ISLSFE*QQRN*LHriiISIiL*MVDrCLT 
OFTKWIFL*PKNS* *LKROCKLPKITEli 

/PKNRIQIYPTD*MSSIPHSLSFYLVCL 
FVCLF*MBSCTVT/TMAAVQWHDLGS\L 
QPLPPGFKRFSCLSLLRSWDYRRPSPC ! 


J7 1 


14292 


A' 


197 


JjCU 


I 
i 


PDSKOOIFNVDETVFC/WKFMPSRTFIA 
REEK5MPGLKASKDRLILLVQANPAGGF 
KLKi^Tra/SI^PRAIKl^AlvS\TVIi* 
KWNSICVWMTGHL FTALNVLS PRRAL 


392 


14293 


A 


398 


1 


158 


C IGPMWENRLI LGGRGCSEL * S CHCTPA 
WAT\ S KTLSOKEKKK *NMRKVMIOC 


393 


14294 


A 


399 


264 


1 


LINEFSQVAGYKINKQKSLVFLYTKYXY 
SKLSEK* IKPCMPFTIPAKKKEIKYIjGI 
NLTKDVKDLYNEN YKILKKI EDT\KKWA 
DTPC 


394 




A 

m m 

i 


400 


j 


J*rJ 


HEOKROSKVFJCVREIjSOGLNMDRXRWSO 
DIxlSTCCRTVAIiSPYHHTPJjALNCPGRQW 
FVGVNFF*RKKIFFFFKFWDRVLLCHIiG 
WSAWPSWLTAALISW\VK*SFRLGI1LS 

sw 


395 


14296 


A 


401 


1 


345 


GTRKNTDNTKCR * VCEETGYFIHCWWGY 
IvMVYPWKLVWHFIjKKVSIIILPYGTSAL 
hSmXBKbTFT/arFKTCTQM/FUVVUF 
PIVIOWKQPKCXPVGl^N^VfYISIHT 
IYSAI 


396 


14297 


A 


402 


102 


355 


DkVIPJ^FCIFGRbRVSPCCPSWA^r - 
GLKRSTS LSLPKCWDHT * ATAPG / IjRAI 
LLYFWDYRCLSPDLVNFACEHFSVBLVA 
FF 


397 


14298 


A 


403 


156 


3 


N1STKNGFP* SMLFQPSNGIiVS I *NI \PUC 
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M 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D"Aspartic Acid, 

V^sOlutnmic Arid V^Phpnvlalaninp 

OGIycine, H=Histidine, I=Isoleucine, 

KsiLvsine I^l^ucirip M=M**th inning 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T«Threonine, V«=Valioe, 

X=Un known, *=$top codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














*QKWFSFFFEERVSLFHPGWSAW*SWL 
TACSLDIiPAOVI LPPIiSliPNS S C 


398 


14299 


A 


404 


I 


364 


GTRS E V I YKLL LQT P WVI * HMRTRLLLQ 
ACGAR*MWMIL\MI ILGLVTCRLTI YQ* 
WRDVTR B S TYQG HHTP P VQ KGLR YG I ML 
/FITS * WFLSCVI \ WVFF 








4U5 


i 




WPrVTQHWTDT.CnQDT*CtrTT MT.T.MCMTTA 
nil u * *> nn i. f jjo y o ir x w t» jv a. Jjti IjLitJ o haa 

ERGEEAAEEKLEAS RGWFMRFKKRSHVH 

S\KVQGEAANADVEATASYPEDLVRTID 

TAREEKQCLP 


400 


14301 

■ 


A 


406 


3 


356 


HERSDQLYAN/ KLDNLDKRSKFL\EGYK 
LQKLTQKEIQNTmPTTRI/EISKKKKK 
KKKKKKKNFSPGGFTGEIPPTFKGQFKK 

PKIKK/DPSKKK 


403 


14302 


A 


407 


116 


362 


YKYSLTPQKLYNHSYIKKNHNKQNNRNH 
STTTTTKQP PP PGFKRYSCLS F PLS * D Y 
RCTPS CP VNL F/ CVFLVETGFHHVG 


402 


14303 


7V 

A 


408 


3 


360 


HEVRiiWlJt Ar ERNEGGGENEEKVDWIjNY 
RMWFHLIF*AECYSICRIiHAYYSCILGP 
VLLLLFVLIILIiFCCL/SFFDYF/VFMF 
FIFFYSFFFLIiFSIiFFSLFL/CFIFLYF 
FIF/CFFLYFFFF 




14JU4 




409 


TOO 

388 


1 


■ft IjLiJvc Fr FrtVliivJLfc't'*labJfr tf*\?F* Luab 
/LIFPPYFGRETLFFFF*NKVPLCPPDW 
SSMAR5RVPAGSVFPVK\ESLSLSLLSV 
PPQVQVNGFLTFFFFFFS W* RQGLPLLP 
RMVWNS WAQAILPL WPHAS 


404 


14305 


A 


410 


1 


386 


VFNAEESAY/YWEIMPQRKFIR/EEKQA 

F w l*KU£.KXo w Xi-iU>IAVVra±Kx IrlXLKP 

ANPQALKKKKKKKHQLTVFWLYTKKS*T 
mtjtt.pt .rnjprviPPVDTrupwT.AQTinT dt 

KVLL ILDNAPGHLKPHI FN 


405 


14306 


A 


411 


2 


417 


AHH I FTARI DVDALAYFT YVT 1 I IDI PT 
GDBPFS * LATLR * INMK * S GAVL * ALRL 
I FLFT VRGLTG IALPDL* LD I VLHDTYY 
VGAHFHYGL SVGAVFAILGGF ID * WPLF 
\ SCYTLDRPYAKIHFTIIFIG /VDtAPP 

P 


406 


14307 


A 


412 


295 


397 


WQWPGTVAHACNPRTIjGS*GGRV/TLRS 
GVRDOP 


407 


14308 


A 


413 


437 


3 


PGFGSLIGNLIPASGNGRKSKSCVCVCV 

LQLMV/CPQNCVCWCPESGLFLDESVCV 
RLCVCAWMAVWMGGSGSG*VCG03C\MC 
I CVGPLLDSELCLCVC/LIXtQSKCDCGL 
RCRLPAWCV 


408 

tvO 




• * 


did 
***** 


o 

Jm 




HLOF I FFW I LKI FH YTj / FLWFW FYLTSM 
ALFYLQ*QKRNCMYEVFNRGLFF/ SOGE 
GVSGSPASSSSSPSCSSSTRGGGAVGGG 
GLGFVCFLLLFWGFWFVFLFCWFLVCVG 
WVWCVWV7VFFLLGV/CCWCFFF . 


409 


14310 


A 


415 


376 


1 


GFQASKDRIi/IjTJiG/ATAAGDLKLKPMIj 
I YNS KNP RVIiKN FAKCTLPVL YRLYR * K 
NYAWMTAHLLKSWFTEDVKSTIQ*KISF 
EMI^XjI INNVPGHPRTPMEMYKELNFF\M 
PANTS I P * PMDQG I VL 
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SEQ1D 
NO: of 
nucleotide 
sequence 

i 

i 


SEQ1D 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ1D 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

peptide 
sequence 


Predict- 
ed end 
nude- 

■J HVIV 

otide 
location 
correspon 
ding to 
last amino 

tuvi a 1MB* u V 

add 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D^Aspartic Acid, 
E^Glutamic Acid. F=Phenvlalfittine. 
G=Gly cine, HNtf istidine, I=Iso!eudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaroine, R=Arginine, S=Serine, 

T^Th reoninp V— Valine 

W«Tryptopban, Y^Tyrosine, 
X=Un known, *=Stopcodon, /-possible 
nudeotide deletion, \=possible 
nucleotide insertion 


410 


14311 


A 


416 


409 


2 


FFP IKKKKFAGALVFFFFFRKI FFFFPK 
GEGKRGVLVS TiNTJ iPT .P * TmVPQPP PPft 

BGVLWPPPPGPIFPFPFXiKKSFCIiPGQG 
GFYNPSLYLI PLSPFKI WG*TRGFT/LP 
PPALF F YFFFF * DRVS LCHPG WNA 


41] 


14312 


A 


417 


425 


3 


RELLAFWQNFKLRPQGP^t'SlpVaPGGbK: 

PfP + QTiPTfTDPlfrUTiDDnD /CDVT/7DT DO/IT* 

QLPPPRGG/SPQGPPPLPKGPQGWGPPF 
P*RSPPCPPHRIGVPQVSSSPTPGLVFP 
RGPLNPGNLGGTKKKKKI SS KAARDLEL 
VRTRG 




1 A"X 1 1 

i 




/tic 

4io 


i 
i 


1A £ 


* I FGHPEGYMLMLPGFGI I LHI VT YYSG 
KKBPFGYIGMV*AMISIGFLGFIV*AHH 
IFTV+ I D VYTRA YFTS AT 1 1 IAIPTGVK 
VFS 


4 J D 


14.314 




Aid 

4iy 


L 


382 


UC 0 HMHUJlb IbIijljt\jA*AGVIJ3AAIjS 
LL I RAE LGQ PGNLIiGNDH I YNVI GTAHA 
FVINFFIVIPIII\GGSGN*LLPLIIGA 
PDMAI ARI NN I S FRLLP PSLLXjLLAS AX 


414 


14315 . 


A 


420 


261 


378 


KKNFFFF*KLNFF*KFFLIFFPPKKKIF 
FKKKKKFFFYKIFF/I*KNIFFSPQKNI 
*PFLFFFI*YPFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFTUT 


415 


14316 


A 


421 


124 


428 


KPAATHACAT I FMCLDQEAI I SN* H* AT 
TQTTEVSL\S FKLDYFSI I FI PVALFVT 

I T I L I LVTANNLFQLFIG 


416 


14317 


A 


422 


8 


355 


PVFSYWHSTLLTFSLSLSFFFFFFAPGIC 
KGE KNGKPGP FRGGFFPKKI KKPFPP P / 
GGPKKPPGPGFPQKPHFPFKGGPWPSKT 

T.I^PfiGTePTTOr^Pr>KPPT,V*NPPQfY2P / 

HGPSGGG 


417 


14318 


A 


423 


356 


3 


KPIX3IDLTKKVKKLSKKNYKTLMKKIBD 
DSN*KDNSCL* IR/MTILPKAIYRFNTI 


418 


14319 


A 


424 


2 


338 


PSWLGGHWPRSGITPUJPVE/VGLLNT 

IRLIASGVSITGAHHRLIECWRHQriQA 

I i IiITILl*GLY\LTLLQAS*YFQAPFTIS 

DGI YG*TFFGSTGFQGLHVI IGSTFLTI 
CF 


419 


14320 

* 


A 


425 


55 


342 


GPFTPWSLC*GDLQR* P/RAVKFFLKKK 
KKKKKGNP IRN*RDISS* FLKNLETAVR 

NFWTFFSYFKSKKLIQKSDHPLKNVKKI 
LONDKRYIiKLDWG 


420 


14321 


A 


426 


2 


367 


DRRRFCT YHKD IGALYLLLGA* AGWCT 
AVSLLIRAELGQPGWVLIjGI^HVYNVTV' 
S AHAFVI MF F\MBYP I INGGFGN* LVPL 
ISGAPDMECSGINNISFWLLPPYVLLLL 
ASAIVEAGSRT 


421 


14322 


A 


427 


359 


74 


ICADYTRKPPYMGAPIEFSCATCVL*LT 
STQRECVCVGVCVCVCVCVSM*VAIiIPD 
SN/CIPIiSAYVCVCVCVCVCVCVCVSI* 
VAIiLLDSNSYH 


422 


14323 


A 


42fc 


52 


361 


NYPQLSEI***VS*LHY/I*FSFLFFYFF 
* LVSR FVARLECS GAWH \ I IAHCSLDLL 
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SEQ ID 
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Den tide 

sequence 
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NO: in 
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26 
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nucleotide 
location 
cor res no nd 
ing to first 
amino acid 
residue of 
nentide 
sequence 


Predict- 
ed end 
nude- ' 

MM %M W)*> 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F^Phenvlalaninc 
G^GIycine, H=Histidine, I=lsoleuclne, 
K = l» vxine. L=Leucine» M=Meth ion ine. 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=TrvDtODhan. Y^Tvrosine. 

TV ML m J k# W mM mm MM mm ■ Mk A* J m III w% 

X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














GSS / DSS PASASLVAGTTGMCHQNQ 1 1 F 
LFKKVFVGRVOWLTSVTPATWE 


423 


34324 


A 


429 


105 


361 


SRLFFFFFFFKTNFGFVPKVGVK/WAFF 
GLMEPFAFRVK* FSGLTHPRTWNYRNVP 
HC P VNLE F* VKTG FNLVDKAG P KLLT * K 
DFP 


A') A 

• 




A 


4iU 


i 

l 




* * RLHS I TLL I LCLLTNTL \S I YQ* WRD . 
VSRESTYQGHHTPPVQKGL*YWIVLFIT 
SEGVFFAGIF*AFYHSSLAPTPQIjGGHW 
PTTGITPLNALBGPL 


/IOC 

425 


14326 

• 


A 


431 


J 


39Q 


innrvvuvuT .t dm t itd t jvrra\TNM qttt xx wt 

i\JV V AAW AJNJjJ. I71 W 1JL l^KidYW VrliTlo VunAV X 

YKFRAI CLR/T FHFHRKKKNLTWA* KHK ' 
KALIPKTILKKNGKSGGITLSDYKM*YK 
TTI TTTI RFWWKF/D INK YNKI TTHE I Y 
FHIYGQMYFFKIAKTFQGIK 


426 


14327 


A 


432 


l 


392 


GKKRKKKKGVFFFYFF2jGF*FFLWVFFF 
FFFFLGFFFFFWFFFFFFFFFFFWGI FF 
FFFLLI I YNFFCKF* FFFFFFGFPNFFF 
FFFWFFFFUDFFFYFFFPRV 


427 


t A 

14328 


A 


433 


382 


1 


P KNI KGGGKK* TP PKKGGVS \ P PPPKKR 
KSPPHKK/IGNFFPPPRGKGGPPKNPKK 
PGPPFFFFKNPPPFFFFPFFFFLFFFFA 
HKDGLLAREQTQAEVKT 


428 

* 


14329 


A 


434 


375 


1 


KYSQLIFDKTAKAIK*SKDCLLKEWFWN 

VMTOTtlVCT.nT. /CDT?*PVTMCV* T*W*T WT 
tx/f 1 o IflAoj VLiLXf Oirf 1 ZVJUNoJS. X luiiiNlj 

KCRIITLLENNIEENLDRLGFGNDFLTT 
LPEAQSKTBLISY/DFIQTNFCSANATV 
K*MKRQATEWEKIF 


429 


14330 


A 


435 


1 


459 

■ 


PTRPPTRPSTRTLGFTMIAKLGSNARPY 

NGVHRPLLIW*EFLVEISFIFDFFLETN 
PGFITKGELQGHNLGSLQVSPPGLTLFS 
CLiSLOKSGFYG/HIjAKYLVKF /CFGIFS 
KRGF*GGYPGGAQSPPP 


430 


14331 


A 


436 


14 


382 


MAR3NTS QKDIRI DPNNKCLWLVKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKRKNYFHVI \LKNKAVNNGSTITS 
TNVMSPGLHIRLI I ILSLMI YKKSTTE* 
VEKQTCYYILM 


431 


34332 

i 


A 


437 


3. 


192 


T PGLK * S P CLGL P KCWD YRR / AAAVPGL 
♦DILAIFFHHILSESOiFLLVFIHCFNA 
NLFLRWSS 










i 
i 


304 


LADRDAI*NLWQIPIVAS*YIPLGF/WS 
KAMPCSVDIYSSFEKKKKTVFFDRGQGP 
TPEIQVLGKAEGVKPFEPRGSKPSWRIM 
AKPKFYKRFKL 


433 


14334 


A 


439 


336 1 


2 


GMIPEMKGNAGP / SG/PRKPGFFIiG* KG 
KVKAWLNFKPPFRAFWPKKKILLPLGFS 
GILGPGPKGNLSRPGPPPQAFPLGFOQV 
liGP/SGPGVGFPKPGQIGWSNNPCLTLQ 
KNK 


434 


34335 


A 


440 


1 


333 


HKLENLENIVKFL*TH/TLPRIiNQEKIQ 
TLNRS ITS S KI KS LVKNTPTRKE K / S KK 
PLVPKGFTIEFSQLCKKLRVPVFMKPFQ 
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SJEQ ID 
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sequence 


Predict- 
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otide 
Inrsrinn 
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ding to 
last amino 

Arid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

K^fHiitftmif Arid F=Ph**rivl»1*in!nA 

G=Glycine, H=Histidine, I=Iso leucine, 
K—ljVsine. I ,=1 .Piirint* IVfslVfetttlniifnp 

N=Asparagine, P*=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V*Valine, 
W=TrvDtr>nhnn Y=Tvrminf* 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KYGERGLFLNGFYEPGLTLI PKFGKNPG 
G 


435 


14336 


A 


441 


320 


1 


wginnwlsp*bkkgrpfll\shpkinsk 
wikdlnvkgki*kpLegqig*ylmifgv 
ekglfkggpkytlykrky*nrellfikr 
yfs lteks pae i chi kklakkk 


A1£ 

436 


, 14337 


A 


442 


o 
5 


391 


I/QHV/NQLPKITHTITSVSVGKARDKF 
NIYS*FKLGKTEKERNFLHWIKSICTSP 
AASILLEGKESPF\PFRSGTRKD/CLL* 
P/liGSGMVLEALGTAERRTRKEK 


437 


14338 , 


A 


443 


363 


2 


HHVGQACLELLTSGDPSNLASQSAEITG 
MS YRAQLS I VTFSAYFLW* KLSHARGL 
ML*QS I I I /HVLHFHQ VKKAFHVS / P EN 
S QPLHNVE PEDW I F * EHQRKTVPPIHTA 
RKLQDIiEPCLV 


438 


14339 

■ 

• 


A 

• 


444 

■ 


1 

• 


900 


DSS AG IT / GICHHAQI I /LFVFWETGF 
HHVGQAGIjELLTSGDPPASASHTGGDYR 
HEPPLLASLSFLNKELCTWPERRRKPI F 
SLPKLDPNKRKFSFPP*LF/IGS*TLIW 
SLFCFRNVCTQLADPTKSIAYQSSLMKP 
FGQKS ICFGSEKFPDKVYLFSSDRKAKE 
QVJjVVVVV VB I * i *iJ < M£«Cl^ w JUjt3J:L.I 
CLS I WN/ YELPEKFEVIACSLPSRNNDL 
I LS LKKKS QNS F FVCVFFF FKTKSNFVP 

GSC\QTGAVP PP PVNFGFLKKNRVS P 


439 


14340 




445 


3 

• 


346 


QIGKVKKLNK*VPHEPSKNKL\FLEASS 
LILCNNNEPFLSRIVTWDENWILYDNH* 
QPAQIiLD* EAPKPNLHQKKKKKKKGLAP 

KRG 


440 


14341 


A 


446 


2 


246 


FKCGKTRALMHCWWEGKMLRQLi\RKVWQ 
LLAMLNMELPREPAAPLLSISPRERK/S 
TTAKTCA* MCIAWCITVKKWKHRLLG 


441 


14342 


A 


447 


38 


393 


VILHRQGLSL/ VTQAEVQWYDYNSL* P * 

KWM\GFTMIiPGL I LNSWPQ / CNPPAVDA 
QIAGIRGFHS/VGQAGVQWHDLSSLQPL 
PPGFKQFSHLS 


442 


14343 


Pi. 


448 


KA 

D4 


D4U 


VSSDGQ\FRKFGISRLGNSGIYXAAFPL 
HDCKFRRQSEDPSCPNERYLLYREWAHP 
RS I Y * KQ PLDLI MKYYGEKI GI YFAWLG 
YYTQMIiLLAAVVGVACFLYGYIiNQDNCT 
WSIEVCHPDIGGMI IMSAQRDRL 


A A*> 

443 


1 A~i A A 

14344 


7\ 

A 


AAC\ 

449 


2 


310 


PPGFK* FS CLSLP S S WNYRHAPARPANF 
FLYF * * RQGVTMLTRMVLIS * PRDLPSS 
ASQSAG I TG VSHHAWAKISL 


444 


14345 


A 


450 


2 


466 


KQKIFSVDETAFYWKKMPSKTFHS *REG 
TASKLHTAG* ILLGPNAAGDLKLKPVFG 
DDSGNLRALKNYAES PLPVL YKWNNKAW 
MTAHL FT AW FTE YFKP S LRS \ FRKI S WK 
I *LFMDNGPHHPRALMEMCKE/KAVFMP 
ANTTSlIiQPMDKGVILI 


445 


14346 


A 


451 


670 


212 


SSSSP*GS*YQNRTNTIPSPKKKQTKK\ 
KNNHYKS I FFTDAEI LSWrLANAIQQS I 
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ino tA firct 
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ed end 
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r a Ac ti An 

ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=»Aspartic Acid, 

FVsfSI ii tfimir* A i*iH. "P=Ph< J n vlal-cininp 

tr~VjiHJ LUI1I1V. /VHU) I I UCU j laIaillUC| 

G=Glycine, H=Histidine, I^Isoleudne, 
K-Lysine, LFLeucine, M^Methionine, 

1" rtOJIill aVIIIC, x X,l fJllllCj 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V-^Valine, 
W«=TrvntAnhftn Y=Tvrn*jini» 

TV A 1 J |*IV|S 11 till , 1 1 JI Wlllv) 

X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KRF IRHEQLGFI LWLGWVNI QTPTNAT 
I * KI1FPI1RGAF 


446 


14347 


A 


452 


407 


219 


PL I S LRWENHLGPGVQGCSEP * LCLCT P 
AWMTE * YPISPSQNKTKKQ\HTRKQNQH 
KNKCVKKT 


A A T 

447 


1 iT in 

14348 


A 


453 


2 


395 


WPT UT7DT?DCTJTU / ID 1 7TVKTC? 74 TV O 7i FiX TO R 7Y 7\ 

vie uric KctcwnJjrt/ KV^^l&AiioiwVx&nA/i 
S YPEDLAKI I DAG \ AKQI VS VEETAPHW . 
KKVPSRTFITRBKTTSALKG/RC*LSLV 
DNAAGHF*VEA\MLIYHSDNPRTL/KN/ 

VTiDCTT TJITT WWATKr V A MM**' ir 1 ! WPC 

X Ado i. Jjit V Li X R.W JMW J\ANn AAV—ur lo 


448 


14349 

i 


A 


454 

! 


424 


38 


EETE?L/HRPISSAEVELVIKNLPT/KH 
KSPRPDGFIAEFYKMQKE * LVPI LLKLC 
QKIKEEGLLPNSFYEASISLIQ\NSGRD 
TV/RKENFRP I FPMNIFAKILNKILANC 
IQQH INKLIHHYQVLFSS IiHS R 


Ay A r\ 

449 


* A aW tfK 

14350 


■pt 

A 


At AW 

455 


2 


* **% /\/*\ 

309 


PRVRS QTPGHKR5 TC-LCiiPKCWD XRHE P 

lhpatasflvaavpgm/fadpp/ cnmhl 

NE * MNE * MNGDDAS E I LS FEMRSHSVTQ 


450 

* 


14351 

r 


A 


456 


3 

• 


441 


T**« "ft* Tf T ^**» V *T p ■ T**fc *f»*****\ *r% y» «t***i ***** *f*l *f***l T ■» *■**! **T*1 IT1T /1V*//il*tf m 

DAWGLVUDRERPFFIFFFFFFLGNGGYF 
RGPGGRPGGEGP PMI&TSVS PVKKILRGP 
GQGGGKARKPHPLGGPRGPNHKTGG*KR 
AYPTGENPVLTEKPKFTGPGGNGPETPV 

TGGKKGK 


451 


14352 


A 


457 


1 


234 


PTRPNHLGLGDRGCREPRSSHCTPAWAT 
ERDWVSKKKTKG/ EKF*KGRTK*MS *VE 
QGGSGGKGI FTLAISNSPLPFF 


452 


14353 


A 


458 


42 


470 


t*nTor>T wont tit vtvtt"j tv"< tttjt r/** v t mtt r> Ti 














PRERPKNFF*KKKTLKTPPKQKFLRKKS 
P FKKPHFIiL YS KAT KKQKRGKKKRAP P P 
KKKKKKENP KI VI FRTE I TVAS PVLSWA 

t/i/PTTffT PPPPPTHTPtfAA TfPPPP 


453 


14354 

■ 


A 


459 


2 


393 


DRPIEQWNKRERPRINLYIDGQMIFSKG 
TTAFE/WK*NLCKKYVWNAWIC\NRRKN 
PYXTSSRXINLR * I IDIKVK/PKAIKLP 
EQN IRCSL* VLGVGRDFLENMNYTRKK\ 


454 


14355 


A 


460 


338 


33 


GDEKIEKSFFFPISSLCFVP/AYL*QQI 
J .T PTP MQPTiT iP T Y / CT iFIiEMG9 HFVT 

QT3VQWCHHSTLQPRPLGLKQSSHLSLL 
SS WDYRHLP VVLKWTEACAV 


455 


14356 


A 


461 


77 


435 


AKEVBASLSTTARPISIQQQQQQQQNEK 
KNKN*PRIAKAI\LS*KGEITLP/ELQL 

ETNPHTYSEIiIFDKGPKSI\SLFNK*CW 
EYWIFICTR 


*tJU 


iHjj / 


A 


462 

^f\JAm 




v 


NGRVDLKIQKLARCGGACIiQSQL/RQEN 
HLNPGEKGCSES * LHHCTPD* VTKQ 


457 


14358 


A 


463 


363 


3 


PSVAQAGVQWCHHSSLQPQPPRLR\HPP 
ASA/SQ* LGLQGQATAIiYIHKYILFPCN 
VLISFIFPFFFLEMRSHSV/SQARVQWH 
DGSPQPPPPEIiKRFSWEAELAVSRDRAT 
ALQPRRQ 


458 


14359 


A 


464 


3 


396 


LKSIAEKVKKIIKFKKQLRL * MKKI INR 
YLKEN / LNQLLE I KGTLRELQNAVE S FN 
NRL*QIEEGI SKLEDKAF* WTESGKIRK 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E^Glutamic Acid. F^Phenvlalanine 
OGIycine, H=Histidine, I=Isoleucine, 
K—liVsine, L=Leucine. M= Methionine 
N=Asparagine, F=Proline, 
Q=G)utamine, R=Arginine, S=Serine, 
•^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine> 
X^Un known *— Ston codon /=nnt«;ihlp 
nucleotide deletion, \=possible 
nucleotide insertion 














INK* TK* TDYI KEPNIiRI IDVPGGEEKA 
KSLKNLFEVI IEENFPGLAR 


459 


14360 

* 


A 


465 

* 


399 


1 


PPGVLLNGPPFFFFFLILGPGWVFGQPL 
KTPLVFFPNFQYHI FPFKISRIF/LPTR 
FPFQVPPLCP*PFSSPWGLKVVFILLSF 
FFFL*DRVSLCHPGWNAVM* SQLTAASN 
TW\VK*SSHLSLLSS*DYRCVLS 


460 


14361 




WO 




i 
l 


TTN T FWP R P W3 QTH iOPT .NMKT? TfPIfPNV 

X ACvJi AAVX^JvTV VV70 i UHMrnnininp.^r.np.ni V 

KRRQATGWEKIVAKDTSDKGLLSKIYEE 
LIiKLKNEKTNK* / ILCLKWSKGFNRYFA 
KEYIQMANK 


461 


14362 , 

i 


A 


467 


2 


436 


RGELPQLDKYLKKTTAIILNGEEL/E/A 
CPLRSRTRQGCSLSPLL*KKFFPPKSHI 

GKEEMKMYLFTDGNHLCKNSERIRN/YS 
KVAGYKVNV* T * ITFLYTSREQVEFEIK 
KTLSMNRFF 


462 


14363 


A 


468 


2 


420 


RTTALF * AVRQGRLS LQRLLLS FC \CLC 

LLCLPKQAWAMAGAPPSASLLPCSLISD 
RCASS QPDS VGVGPS EAGVGYNLWRGL 
I*3RSEKR\NIRLGVTRFSRCV /LSPLSL 
TRR 


463 

* 


14364 


A 


469 


1 


416 


P S P * P liTGALS ALliKTCGLAM * FHFHS I 

GHHT\QAVQKG\ LRYGI ILFITSEGS FF 
AGLV*AFYDSRLPPTLQLRGHWPSTGIT 
PLN\HLQVAVLGE P VLLAS * VS IT * ADH 


464 


14365 


A 


470 


211 

* 


398 


I FFFPKMGGGLS 1 1 LCWRRKRHPQKLK* 
cTfT/sT.Pvwronrvpnrpp /vpcvMnrrawpD 

* KTKESPPIiVSGGASSASNIKL* IALPP 
FLEKKKFKKQGFKDPPFPLFFTQNPGQG 
NFFFGKI * YCPP I FF FFLRRS FTLVAQA 
G VQWHDLGSLQPP PTGFKRF 


405 


\A1CC 

i43oo 


TV 


All 

471 




AO A. 


RTLNHT*PCNPAIPLLGVSLREMITYVH 
KKPCP * MF I V7^LFMVTKNSKHLKCSLTC 
KW I NKI»WS LYTMK Y YIATKRKELLNHNR 
SCRNYSSDMS 




14367 


A 


472 


427 


20 


LGLRLPFVEQEENKPYYPLAPFSPPEPP 
MSL YKNS P I YPGKGFLFPSKNNKI PPLN 
FLKMSNLF* / FALLLGNKDFHLLLLGGP 
IGIMGDNMATTSQMFYSRVPIFF/CFFE 
TGSH55VAOAGVOWCNLSLLQSLPPRLK 


467 


14368 


A 


473 


424 


98 


NWYLYYYYYYHLLF* RQGL/DSAAQAGV 
QWHHHCSLQPRRHRLKQILPPQLIi\TSW 
NYRHKPPRLS PYVAQAGL* LLGSSDLPD 
SVSQSAGI I GNEPLHLAS L * FIiFAFP 


468 


14369 


A 


474 


415 


75 


NHFLKFFPQTQANGPGGKNKFFFLKTRF 
CFFPPGKRPWAYYKSLQPPNSGGQQI\S 
APTP*IKGAPKGGPPTRVKFFFFFLF/L 
RWSLRSVAQAGVQWHDLGSLQAPPPGFM 
PFS 


469 


14370 . 


A 


475 


3 


413 


PVQKGLRYGI \ILCITSEV\LFFAGFF* 
AFYHSSLAPTPQLGGHWPPTSITPLDPL 
EGPLLITSVLLASGVS IT* AHHSLIEN\ 
NRDQI IQALLITILLGLYLTLLQA*EYF 
ESPFTISDGMYGSS FFVATGFHGLHVI 
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sequence 
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NO: of 
peptide 
sequence 
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eth 
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NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspa rtic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I-Isoleucine, 
K=Lysine, D=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G1utamine, R-Argtnine, S=Serine, 
T=Threonine, V^Valine, 
W n Tryptophan, Y sd Tyrosine, 
X=Un known, *=^Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


470 


14371 


A 


476 


1 


440 


ITLTDREIiSYY*ANRLIj*LAYTITFIV* 
I PLYGLHL* L\PKAHVEAPI AGS IVLAA 
VLLKLGGFGVIRLTLILNPLTKHIAYPL 
LVLSL*GIIITSSICLRHKDLKSLIAYS 
SISHIALWTAILIQ'TP*SPTGAVII,I I 
AHGLTSS 






A 


477 


7 


397 


L PVYNHI DVGALYLL FGA*TGVLGTGLS 
LLIRAELGHPGNLLGNDHIYNVTVTAHA 
FVIILFIEIPIIIGGFG\N*LVPLIIGA 
TDMAWPRINNISL*LLPPYIiLLIjIASAI 
AEDGAGTGGTDY P PLTGNYS 

m% ammmmmmmmm ^mTmm awmmmi mm m mW ^m* mmm mmw mm mm9 mam ammm mm mmmm mm- w mm* wmw 


472 


14373 


A 


478 


442 


44 


SSSSPFTPPRGKFFFKKTPRKKFFS/SP 
GNKGFFSPLSP * KFFFFFNPFFFFGGFF 
PNFPPPKKNFFFKNSPGFFFFPPLKKKI 
FFFPPPLNFAPPKVFFKSPPPFFFFFFF 

mm P» mm mm mm mm mmmmP* mm- at, P pi mam, m m, w *m mm- tw mmmw mm mam mm> mm mm mm • mm mgm, 

FFFFFFFFFLRGHPWGREGI 


473 


14374 


A 


479 


1 


439 


PTRS PTRP IiVLDRERP PFFFF FFKRGP P 
S VPPAGGGGANLG* GNPPPL/GVKKFFG 
PKPPKIGE*RPWPPPPGQPECSFFLKKN 
GLPHGGQGR*N\ PPPRP 

\J *- * Am ilWW^VV ■*■ " » 4mm mm *ma mm mm mm 


474 


14375 


A 


480 


420 


0 


YSPPSPPK/YRAPGKKFF*KKPRKEKF* 
KKKILGFFFPLSPLKFFFFPKAFKFFGG 
VGPNCPPPKKRFFSKNSPGGFIKPPLKG 
KNFFFPAPVKFGPPRGFFKGPPS FFFFF 

mm mmm W mam mm mm m \ m mm W m vmmwm mmmf mm mm mm mymm* mm mm. mm m ^am* mm mm mmm am mm mm mm mm 


475 


14376 


A 


481 


371 


2 


NKI*NKFSKLNSKKSSSSSSSSSSSSSS 
SSSSSSSSS PSVNK*IKKCSTSLTISSS 
SSSSSSSSSPT* IAKIKNIDSPKCWQGC 

ammm wmW mmm r mmW mmmr ^mmr mmaw amm* ma^ar mm mm mm/mm mmmm mm mam mm mmm at mmm mmmr -mmmr mm mm ^m mm* m w mm^ammw ^mmm 

RETGLTV\LVQPLWWQFLLKLSMQII*D 
SANMILSMHARG 


476 


14377 


A 


482 


3 

i 


335 


HASGKDRHTDQRNRIKNPETDT* IYSTF 
*QKC/RLI *WRKDSLVNKWC*SNWA\SP 
MKKIKLDLSSSSSSSSSSSSSSSSV*NV 
KTjLGNNVGGNIiQ YRGLRVHTVD VKAQH I 


All 




A 


dRT 


i 
i 


418 


GVR*FSPLNPPSRWGPKHGPPNIAIjNFF 

*mmT m m\ m mm* mmm 9 mm mmmimi. m mm- mm ammtr mm ^ m m ama* 4m 4m mmm ma ammr mmm mm mm m mmmm \ mmmmm m ^m mam 

FFFFVFLVETRFHPVGQAGLELLGSRAP 
PASA/FPK 






A 


*TD*t 


i 
i 




FCANAVRSM I KTVLI DKATKP / RAWKGN 

m\a ^mma\ mmm mmm m w mm mm mmw m m mm map am m m^ mr mmmmm mmm ammT + mmmm mm mm am mm J m*^mm mmm a> mm mjmmmwmam t 

YKYHLPVFNCKTGRT\DSGNPLNWFYQC 
FVPBIRKYLARVGLP FNVFLILDNAPGH 
PEPHEFNTEGFRWFLTPDMPLIQPLEQ 
GVLRTLTA*YQCFVPEIRKYIiARVGLiPF 
NVFLILDNAPGHPEPHEFNTEGFRWFL 
TPDMPLIQPLE^GVLRTLTA 


479 


14380 


A 


485 


166 


406 


FIFWGSVSSSAEGGGWSWSSLPRVTVR 
PDETVDVT IHLLKE * CRPGAVAHACNPS - 
TLGGRGGRI /TLRSGVQDQPSQHG 


480 


14381 


A 


486 


107 


429 


FWVTQTFGFFWLNPPGGLELWPPPPCPG 
NFGGNFKKKGGFPLWPGGVQTPGPRGIT 
P PGPPRGGK* RGGPP/ SPGPGFIXJKPKG 
GGP 


481 


14382 


A 


487 


399 


2 


GS PAPGLPKGWGFRGGPPGPGKFVFFKL 
PKGGFPGAEI FLGEFFQRGKKGLG P I FP 
PVFLKTEEGGTLS /NSFFKGRVFLVFKP 
KAGPKKKKKKTNYKP I YLNNTDTKFFNN 
MLANRI * Q C I KEVMQ YDRTRG 


482 


14383 


A 


488 


419 


2 


PPPPPPPREKKGGLYFNPKIFLGKSFPP 
RNPKDPS * KPRVGIFFEGPKPPKKKKF* 
FGPWGKPPGDG* RKNPFK\ PFFFFFFFF 
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Dent id e 
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peptide 

seauence 
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nucle- 
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last amino 
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sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D s Aspartic Acid, 
E=Glutamic Acid. F=OPhenvlaIanfne. 
G=Glycine, H-Histidine, I=IsoIeucine, 
K=Lvsine. L= Leu cine, M=Methionine. 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T«Threonine, V^Valine, 
W=Tryptopban, Y=Tyrosine» 

X=Unknnwn. * s= SltoD codon. /=nossihle ' 

* 1L %mv mm l\||V WW III *J tU WUVIIi ( UvwJlVIv 

nucleotide deletion, ^possible 
nucleotide insertion 














* DGVLLCRPGWI AMAQSRPGIiQE WNS I S 
KKRSQ P CVXjQ ISS KEVPDA WADA WADA W 






A 






n 1 


SSSSR*SLLCKCFKN*GEKIRNPIEKWA i 
KVMHRLFTKRCINI * KDVRSTSLMVRET 
* ITTTLR YHFSP I RI*SKT*KLNRTLYG * 
DCX3BTGIFIHCWWECKMF*VL*RE/VWQ 
YLIKLL /LHIPVDTA 


484 


14385 


A 


490 


2 


351 


KNRPEMDPGM*GY*VNDEGDTTYQQGNE . 
EIiRS /WCWDDNRLAKVKLDLYFPPHITI 
EINSKWI *DLNI KNPPIHVLEENMN/I * 
F*HVGIGKHYLTI*IiKI*NP*K**IWLH 


485 


14386 


A 


491 


42 


507 


NLAK* IQ* * IQTIMHHDQVSFIPGIQGW 
LNILKS INI IYYTNRL* DKTHITISIDA 
EKAFDKI * YLFMKGK\KPIiSQLGIEGN/ 

X UlVil X AM A XX 


486 


14387 


A 


492 


18 


417 


REGKKS RVHHFN I KQGRVMSTKH* KTQL 
/SHSNAHSNKAQISKSQPHGLHDFFKKK 
KKKKKKXKKKKKRGE KKKKKKRG KKKKK 
KKKRGGGPP * KG VREALFCFFWRKKFFF 


487 


14388 


A 


493 


413 


82 


FFSSPPPPLPFFFPHLFPPPPNFFFFFS 
PPPFFFF/PPP*KTKFPPPPPLFFPPPP 
LFFFSPPPPPFFFFFSPPPPFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFWt* 

EEEEEEEBEEEEEEEEEEEEEEEEE rVXJ 


488 


14389 


A 


494 


1 


413 


PTRP PTRP PTRP PTRP VLDREHSPSNIaP 
KKKKKKJOGCKKKrOCKXKKGGGPLKKKPG 
GAKKKG /EKKKKKFF* KKGEKKKP PGKF 
GKKfCKI WGGEKRAKTPOKKKTP *GKKKI 
LKGEGGKKNP KPRGGKKFFSGEKKKKK 


489 


14390 


A 


495 


274 


1 


I YRIDCAYMKKVERSKIS ILS FH I RKLE 
N/QLNPK*/RRRREIKIGAEINEIENRK 
+ IEKINETKS YFVWKISKPIAKLIKEKT 
QITNTRNRAY 


490 


14391 


A 


496 


336 


1 


VFQYTYWKLVSMFYYCFFL*RWGLTVLL 
RIiVSNSG\ PSD\LPSSAS 'VAGTTDAHR 
HTQLCFTTLSDIiLEPYFHLGSFIFSLL* 
FFFLNTESHSWQVGV*WHNLGSLQPIiP 


AO) 












PRVLI FFFLTPRGP PPPP PPNKVFFPPP 
PPQNFFFPPPPPPSWGGFAPK/PPPPPK 
SFFPPKPPPVFFSPPPKEKKFSFPPPPH 
FAPPPVFF*PPPPQ 


492 

• 


14393 


A 


498 


430 


2 


SPPPKPTRM* \KVKFGR*KKGGGGGGGS 
P PIiS PPLWGS KPWP * VPfCGR / PLPG * A 
GKPPFFLKFQPFPRPGGGPRCPPLFGGL 
GQKNGFTPEWLLL*PKFSPFPSSLVDR 
RIYCnjLKMIxEYKSVhWILLFASCLNLYT 
FPIV 


493 


14394 


A 


499 


363 


3 


KKLVTPARVLGDIIPRNRFLQMPQQREN 
FLCK3VWMTKPPTTIFVKTKTGKWYLISL 
KRFCPAKKKT/ IKI LVNRVNRQPPEWEK 
I FANYASDKGLISSL YKDIiKQVYKRKNN 
PITK*VKGMY 


494 


14395 


A 


500 


145 


435 


VFMCINSSFLLFIPWYEYTTVWPVTCRR 
T*MFIAALFVIARNWN*SRCPSTGNW/L 
KLWYIHTME 


495 


14396 


A 


501 


162 


1 


FYNYTIIFIYIiFIFIFLRRNIi/DSWHT 
GV*WRHLGSLQPLPPKFKPFPRLSLP 
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peptide 

sequence 


Amino acid sequence (A=Alanln \ y 
C=Cysteine, D=Aspartic Acid, 

E=Glutamic Acid F==Phpnvtnlantnp 
G=Glycine, H=Histidine, Msoteucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 

Q=Gluta mine, R»=Arginine, S-Serine, 
Tt=Threr»ninp V<=V»Hnp 

A J 111 WVFIIIIIC, » T (tllLIC, 

W=Tryptophan, Y=Tyrostne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 






A 

** 








IKKPRWClPKPPT.lfFPPRV/ejPnPWTT /W 

XXvEVf XvWV?Jtr<Cvtr * AJJvCitr JEtaj v ^ CftrXJV C X. X / Vi 

FGNPGPVPGKDFKFPNFPKKKKWGGRAP 
P WPTTPGG * MGRFP * PPAFC3VPRGGTJG 

SPPPGAPKEGPLSKKKKKKNEKEKERKE 
KSARL 


407 

1 


143 OR 

• 


A 






4fK 


PKF* KVI FKTPPGPPPFPPGPPLNFFFP 
KTDAPIiFF*NHPPKSKIWAIAPPKKKIF 
LNPKKPPPFFFPPHFFKKNFPRAFFKNF 
\AFSPRGGASPSPPP 














PPG*FGPPQGFF*RPPPLFFFFFFFFFF 


499 


14400 

i 


A 


505 


26 


426 


GCTGLLHS *MYAKWCCTDHP I T* VLS P 
AFISRISYSPTHPQLILFF* IFFLFLVE 
TESCHIAQAGLK/LPASSSLPVKASQSA 


500 


14401 


A 


506 


337 


2 


IPNLKESA/CPNLPKGWGFKD*PPPPSQ 
I SIj I F Kx PKrLKFE X * KKKP PLKKGLFW 
WV*RKKAPVGF *G\ EMALGEKFFFPPFP 
FFFFFFF FFPRRS FAFVAQ AGVQWRDLG 
S 


501 


14402 


A 


507 

■ 


454 


2 


TSKTGQPGRRGSPFPI*WAAGQKRSSPP 

RPGRGAPLFPDGAAGQRRSSLPRRG/VP 
GRGAP \QSQMGCQLGRGAPHFPDGVAGQ 

D r*I?Dl? < D'P Wf2C rrtn? A T -D r rOr\Mr!/YDrM3r"FTi 
rtv^r fr r KVfooU ViuUir X QUNvjy r UKu J. xr 

HIPDGVAAGQRR 


502 


14403 


A 


508 


390 


13 


RIPPPKSRWKGKGPF*VS*NPRPQI*KN 
FLPPPP*K/YGDPRGPPPPPIKFLPLKK 
KGAPPICPGCFEIPAPRESPPLAPPKS* 

WoKijlirrrrrr r ivAJMM f nLtoJvl JSJ\Xi.t\. 

IFFFFFEAGSCYVA 






A 


juy 


i 


Zo4 


I YTKTYA * I LLAAL F I MAKTWKQGVLKK 
V\TDKVWHIHMTEY 










4UU 




PTSAS * VAGTIGAYHHAWI IFFF\ IFIY 
C3imVLLYWLGWFPKGLLKCWDYWCEP/ 
RMSGLFVFWSDLGFRSBVS* ILSSWDYR 
CVPPCLDNFFF 






A 


C| f 






V X IvCi£\X>V7XXf Xc\XXV Ulilyu) / J J, X il'l JLVJ V 

PNNRTLKCMKQKLTEIiKKB I 


506 


14407 


A 


512 


139 


374 


SLWGKRI FFFFGADPHFVPQAGGA\WGN 
HG*LQPTPVGLKKSSLLTFP ISWYYRLG 
TPPPANFKI FCKNGVKPCCPACF 












401 


KGKYVYMGFFI \LFFFLFLHKI * LLQEN 
FVYVCIiKKNKTKLKA* TKKQKKNETG *N 
I LVDLNVLS *KKWKCIiLWGFILFYF/NF 
LRQSL/NSVAQAGVQW 


508 


14409 


A 


514 


7 


386 


FFVFSTHITLLFLFLFFFFLFPRPPGFF 
LAGKKNTPPAPPBKPPPP\PPKKKRGPP 
FFFFGPPQKKPPNSPRGGRGPPPPRPPR 
KMEKGGPPPPREIPPPPFFF* PPKTPPP 
PPGGGGGGKKTPPKR 


509 


14410 


A 


515 


370 


2 


FVFFQWLPK*NS FLFS PF* KGLC* RDL . 
AFKJlFFPWPPNKKIiWPSLF*GFLKlTS*K 
FNFSLKSLEVPFFF\CFPKFFPW I * AFP 
RFFPFFPPPP*RG*IFFF\*DGVLLCHP 
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Amino acid sequence (A=Alanine 
OCysteine, D==Asparttc Aci(l t 
ci— Ciiutaroic acki> r— rnenyiaianine, 
tj = tjiycine, o— jiisuaine, J— jsoieucine, 
K=Lysine, L^Leucine, M-Metfaionine, 
rv^ASparagine, r^rroiine, 
Q^GIutamine, R=Arginine, S=*Serine, 
T=Threonine, V^Valine, 
w— j ryptopnan, i— lyrosine, • 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














<j Wfa AV vy rw Jj 1 


510 


14411 


A 


516 


1 


163 


NTREKLSGGGGTQIiS * /LLGRPRQKNHL 
NI/3GGGCSEPRS\HCIPAWMTE*DPVSK 


511 


14412 


A 


517 


1 


390 


RIWVYYKHLC/Y*GSVNPREKIDNFLD 
TFNI>S RLNQEE I ETLSTP I SSS I IBTVi 
KSPPT\KQKSPGPE 


512 


14413 


A 


518 


47 


312 


EYTGLILRLYIE/LFTTQH* KQKALNQA I 
K* LAKGLNKHF I KRGCMNDPKHMKKSSA 
SLAIRKM*IKTMMRCHYILT*MAKIENK 
TESTRP 


513 


14414 


A 


519 

i 


320 


1 


QNPINKQ*KKNNKKYRLV*HKKPPPFTT 
. QPHQSNQHQ\PN* * PQKHT+TKNHHTPP 
PPP*SPPPPSLSPSPPPPPSPPSSSHPP 
LLSPSPPPPSSPPSSPPPRTSP 


514 


14415 


A 


520 


1 


368 


j LK7TQQKANNLILKWAKDLNRYLIQK/VY 
RNVKHTKRCPTS\RELQIKTRYYYISIR 
♦RVKYKKRDNTKC* * GRETTGTLVIHYS 
*EYKMV*PLWETV*QFLTKL/E*SY/P* 
DPAITLLGIYSKELK 


515 


14416 


A 


521 


2 


264 


GKKIPMLYFAEIEKCILKFV\KRS*KAK 
* ILPKNNKAGGLKFPDFKTYYKAAAS KH 
QVGGLWKELRSTIiQAQQTWVPREELRST 
SPS 


516 


14417 


A 


522 

• 


1 


408 


LEKMSTSLAI/RVMQSKTTMKCHYIPIR 
KAKI*NNDNIKCWQ*CRETGPLIYCC*E 
CT/LVQPL*KTV*QFTjIKVNM\DPAFVL 
I FI P KK* KHMFTHKKNCKHTFRAALFVM 
ANTRI I PNI FQFVE WLNKLSYKHI VDY 


517 


14418 


A 


523 


320 


3 


TYIYIKTCPRMFT/AVMLTITKKQKQPK 
Y PP TDK *NV\ Y IHSNAYYS EMKRNEVLI 
HATTWMNPKNIMLNKASSRKRPHTI *LH 
I YEMSRI IQYMVLCNCLLLPSMF 


518 


14419 


A 


524 

■ 


426 


9 


AELPAS PTPGTCTPQPLGSGRDQVP\GA 
VGGTHPGGSGLAGS PP*GGLGMAGCKS * 
AIiPRGEVTEAWRQFKCGERRQASSTGGP 
GAPSAAAGPGAKPLTAWGQRHQPAAPSA 
G P AE P LP TQNWCWP ASNPG S RTRLS LHT 


519 


14420 


A 


525 


13 


461 


ICIWRKMNILLPYXKLNSRWIADLYVKG 
TTIMFLEVS VREHLHDSQKKIF* *DAKL 
MNQKB*LDILDSINIKSFVH*KAPLREN 
KGKLQSAIHIYEKGLVSRIYEEIiLQ\TY 
KETLKANKKKIDNP I EKWAKNLNRHFTR 
GCPS I YKHVK 


520 


14421 


A 


526 


288 


570 


ATGS LCCPGWS I VA IHGFNHS AL* LLTP 
GIjKHPPASASQVAGTAGMSDCTQLCKIN 
FFVFVFL/RRQSLAVTQAEVQ 


521 


14422 


A 


527 


3 


438 


AVSHDCTTGLQPG*QREILYQRKKKGTF 
SPLPPPSPGIGGKPALPFWGPKTWPPPG 

TTltllJT T? P Ti tj nticAp/^ripnTi T"VT tjtt apcd 
L We lib bi'ir fPfay i> WjGPRPIiKT. iaAtabr 

SQNPWNLRDGAPPPPGGGFFG/CPPFGP 

PGGLG/PPGAPPPPLKNFPKKKGGGGGF 

LFPPQAKG 


522 


14423 


A 


528 


2 


616 


FFFDTEWRSVAQSGVQWRDLGSLQAPPP 
GFTPFSCLSLQSTWDYRRPPPRPANFFIi 
YF* *RRGFTVrAR^fVSIS*PRHLPAtAS 
QSAG ITDVSHRALQVCFITTUi* LS K/H 
QFKKAGVTL/PHLQCLHEIGLDCVLHKH 
/W3VHHSPP/HKTNVC 
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sequence 
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SEQID 
NO; of 
nentiHp 
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M 
eth 
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Predicted 
beginning 

11 U V.ICUIIUC 

location 
correspond 
ing to first 
amino acid 
residue of 
pepuuc 
sequence 


Predict- 
ed end 

uu tic- 

otide 
location 
correspon 
ding to 
last amino 
a cm 

residue of 

peptide 

sequence 


Amino acid sequence (A= A la nine 
OCysteine, D=Aspartic Acid, 

SJ \J 1 U loll] It i\v!U, * UltUjEuluUvUKf 

G=Glycine, H=Histidine, I==lsoleutine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S-Serine, 
T^Threonine, V=Valine, 
w— iryptopnan, tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


523 


14424 


A 


529 


57 


485 


RHSSLGNKSETQSQMKKKKREKKKGKEI 

FITALFI IAKTQNQPKCPSTDE* INKRV 
\IYIYI YIYI YIMKHYSPIKKNEIM/ SF 
AATWMELKAI I I S ETMQKQKAKYCL FS C 
I 


• 524 


1 A Al K 

, 14425 


it 


530 


399 


3 


FF FF r FFFWENPPPPPK/ WRGGGKKKAP 
FFKFFKYKK*LFFF*GGGGGHQQKKKKK 
KSSSFFFFFFFFFFFFFFFFFFFFFFFF 
FF^FFFFFFFFFFFFFFFFFFFFFFFFF 
F*FKGFLLGTTHASAHASAHAS 


525 


14426 . 

i 


A 


531 


491 


4 


S KW I KDLNLRTK/ TTKFVEKT * VKI FMM 
LGLGNDFLSMTPKA/LKIDKSDFMKIKN 
/ FGS S KD/TI KEVKRQHTBWvKI FTS YI 
FDKGSTSRTCKEL/RKQQQNKPIQKCSK 
VLNRFFFREDIQLSNKLMKGCSTSLNH 


526 


14427 

4 


A 


532 


184 

- 


509 


PQWPAHSFLPALGSSGTGPY*WRQIFD 
SKDKESSQ/WSHETSDRPKPADHRRRSR 
PSLATSP\PRLEPHPSLPNHSGLPILSS 
L P WGAVALAPTHFS ALAWP * RPLPCNS Q 

GEKFFFWFGFF\ERESHSATQAGVLGHD 
LQSLKPPPWGSKGFFCPSYSGG/WNQKN 

TXT £»T> 

HLSP 


527 


14428 


A 


533 


401 


3 


VIREMQIKPLHIQ/PGWL*SKSVKKY*Q 
GCGQS/NVLIHY/WLKT* IMGPFWKIVW 
QFL I KLP YGTVTLLSG I YILWRMEKKP P 
KFCTQI FI AAFF HQ* PKSENNPNIHQP 
KHQ * D VHM Y I HTME x xTjAX RMN IH 


528 


14429 


A 


534 


419 


93 


SLAIRQM* IKIPPRYTRVAK\ IEKSHNT 
KXWKRHGATGTLIHCWWECKMA*S*KIW 
*FLIKSNINLPFWPVILL*GIYPREMKT 
CLYKD \ CM *MFMAVS FIEQNGKQPKCL 


529 


1 A J O A 

14430 


A 

i' 


535 


AAA 

414 


2 


NFLARGYINCGPQFFFFPPRKNLKWGvP 
LSNFPPKL*KGGIFGEGPKKVKKNFFFF 1 
FLFFFFF*KKPPCSPRLKAK/WKIPGPIj 
KPPPPGLKKFSFLTPKKNGDKRGGPPRR 
GNFFFFFFLVFLVKTEFHYVGQAGLK 


530 


1 A A~i 1 

1443 i 


A 


536 


32 


Ate 

435 


DRATALQLG * KSETS S YKKKKKKKKKKK 
DGGAPLKKNPGGAKKKPGEKKKNFSPKR 
GGKKNPPGNFEKKTNFGGGKNGAKPPQK 
KKT/AWGKKKNLKSGKRGKKNPKTIjGAKK 
fpfrg* KKKKKPPAARPGKASS 


531 


14432 


A 


537 


348 


1 


EYIKSTHQMGKOTPLKNTVSSGQVQWLS 
PVISAQrAjCAKJbWSQPKCPSljNfcWIKKL 
WCV/WCVCVCVCVCVCMMEYHSAIKRNE 
LMAFAVT * MRIiET 1 1 ISEVTQEWKTKHR 
MFSL 


532 


14433 


A 


538 


523 


66 


TDQTSHNIPLSQSLIQSEVPTLFNSVKA 

RNMS V\GEAAGS PEDPATV/ INGGGRTQ 
PQIFSVAEATLNWKKTPCRPFIGREKSM 
PGFRAS KDS LTLCRDFKTHLCLVFHYWN 
AKHVQVYITJjKVTAKI 


533 


14434 


A 


539 


413 


2 


RDKV*PCCRSWSQTPELK*STRLCLPKC 
*DYR/R*APAPGPFLL*VHRHVSIFKSG 
PLSCRCSNFQD/HDS IKPS FPTIQYAHG 
N KFQL * T PDTLI FFLKWS FTFVAQTGVQ 
WHNLSSLQLLPPGFKRSDAWADAWADAW 
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correspond 
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icjiuuv vi 
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sequence 


Predict- 
ed end 

UUHC^ 
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location 
correspon 
ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A»Alanine 
OCysteine, D^Aspartic Acid, 

1P~- 1 1 1 4* a m i #* A fiti fT^Photivlalanlria 
£i VJ 1 U lalll It. /itlUj JT 1 ucll j lalallinC) 

G=Glycine, H=Histidine, I=Isoleutfne, 
K=Lysine, L*=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S**Serine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


jo4 


144 JO 




54U 


12C. 




MT&QWTVV /r , 2\DnT,2iT'Dl?t/rJTV^i fV rVTVXIV 
a±\03 rji j. r^± / c/iKyii/iiKovyii\.i x.y irl v 

iptrmakvkkt/dktecwhqc*ntlincw 
wdckmmqllwkk/ svwhssrgkmyvqlp 
ctpaipllgiytelktcshkn/t/ct*t 


535 


14436 


A 


541 


44 


398 


RPPFFFFFKKRPLWKKKQGFPPPA*RGE 
IPSAKK\PPLGPVSGPGNPPGP*KP/PQ 
NPP PLGGS PFGKKP PUD IP FPGGP IKGK 

APPFPSQKF ' . 


536 


14437 

* 


A 


542 


370 


10 


FFLRTKVSLFPRVEG\KVQSQLIiAAPPS 
WGQVIPPLQSPE*LRQKDPFSPGG*GCS 
EPCSCPCLPAWMTEPNSVSKS*KKKRKK 
EEKFI w * rUsJvKJbr IN i TGKlRr RI In 


537 


14438 


A 


543 


1 


370 


FLLRHILLCHPGFCSVATAYCSRDPGSS 
D/FLPPQAPLPDQ* PRLQA/WHRLAPPH 
SANFF* *RQGFTVLARMVSISQPCDPPH 
WASQGATATKADDYQK 


538 


14439 


A 


4™ A AX 

544 


50 


395 


I PGLTRQWLLDP CAS PSTP P YT / P * VQP 
STPQNSSPSPKTHNQKGLPMPLSPTPKP 

STAWKKAILEHTSSSSSSSSSSSTCRNR 

ii 

NG x T YT v PJiHRPAKGril Ao Q *R KQVIjAA 
THKP 


539 


14440 


A 


545 


1 


370 


LCSVTQAGVQRRKLNSVQPPPPRFKQVS 
CLSLLFFFFF*KRVFTLFPRLEGRGPII 

QGQLINLF/ CLGATGPTYGAQGGFKSPG 
LRRWAPLGPPRA 


540 


14441 


A 


546 


178 


365 


Y KKTDATKT KMD K * DS IKLKS FSTVKET 
INKGNR\QPTEWEKI FASHTSDMGI ISQ 

LUKJlJ-iilU 


541 


14442 


A 


547 


11 


236 


KGTTKLVVKLSDFKS YFKAI VI KTVRN/ 
WHKYK\H I DQWNR I QNPE INPH I YGQMI 
FKKRAKNKQWKKDGLHKKR*WSNDFQET 
CQEQTMEKGRSSQEKMM 


542 

* 


1 A A A'y 

14443 


A 


548 


Off 

355 


3 


I KKKNLGRKRFC F FQEEKTW P * TP KKKP 
LENPGGVFPNPGFPPFFKAKI PEGPFPG 
V/SFF*RGGFKGYPPPLFFFFFFL*DGV 
SLCHPGCSAVAQSRLTASSASQVHAILL 
RHVDEGR 


543 


14444 


A 


549 


1 


373 


CFLICRHTSHMWRIWFQTTAINQIWQ*K 
Jsomxv V&^\ijyx w i^nuijxclVvj w HrV w K 
YTQ*LLLENADTK*AHAVGKMVLIDLLK 
AAI*SLIFNL*KNK*NLKNWPGMVAHACN 
PSTLGGQGGWLTS 


ZAA 

544 


1 A A AC 

14445 


A 


CCA 

550 


44o 

- 


2 


CWGHRRE P PRPAYPGI * ILFLTPP *NPL 

FAPPPKKFMLCF*PFFFFFF*DRVSIiCH 
PGWGAVERSWLTAAPISQ\ IR* SFHLSL 
LSSWDHRHA 


545 


14446 


. A 


551 


342 


26 


WAPPIFFFPPYKRSPQKPFPPPGENNSR 
GF* IFG/RGGFFKKEGLSQFFFFFLKKV 
FFFSPGGEPQGYFPPPKASFLKRIFFPP 
PPIKKGDPGRGSPPRGNNNPFF 


546 


14447 


A 


552 


3 


left 


QPHLQDCL*QQNPVAVG*AFSSHPRDSL 
NNPW \W PGAVAHACNPSTLGGRGGRI X 


547 


14448 


A 


553 


3 


413 


TLITALSSH* FFT+ VGLE INMLAFI PVL 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenvtolanine. i 
OGlydne, H=Histidine, I=Isoleucine, 
K=Lysine, l>Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
0=GIutnmine. R s Areinine. S^Serlne. 
T=Threonine, V»VaJine, 
W^ryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














TKK INPRS TEAA I KYFLTOATAS I ILLI 
AILFNNILSGQ*TITNTTNQYSSIiIIIM 
AIAIKLGIAPFHP*VPEVTQOTPLTSGL 
LLLT*QKL\APISFIYQISPSLKKK 


S48 


14449 

i 


A 


554 

* 


441 


1 


SSPPPQARGEIFF*KNPGEEISQHNKKN 
BVFPPPPP*KFFFSPKAFIF/RGGGGPK 

FPPPVKFGPPRIFFKSPPPLFFFFFFFF 
FFFFFFPPFFFFFFRMGCDRWHGKFWIL 
RDGF 


KAQ 

545» 


J«*45U 


#4 

• 

* 




•a 


AKS. 
*05 


WTN^tsMwrTWffTT.BnQT./nQVTnanu 

JN 1 liOcii I VL1I VP J. JJiC^OXJ/ JL/O V X UnU V 

qwrdlsslqplpp\gswdhrhlaqq*yl 
fnvlt,lrv\fvyergtimvpil*dcnvn 
crvpgtrslsvycvmgifyrc/hibsml 
cdrcf i ffd i gshsvtqagvhw cnlgl\ 


550 


14451 

■• 


A 


556 


1 


299 


RRMRQENRWNPGGTGCSEPRSHHCTIiAW 
ATEHDS VSKKKKKIRSSLGKKT* FYHNI 
LKAMG/CITGIHKGPEGAWQREGHFTPP 
QGPKKAPQRGQTYPDP 


33 J 


1 AACi 






i 
J 




QGETASASVEAGVSYPEDLAKITDEDGY 
IKQQIFNVDICTT/ YY* KKISSRTFI VRE 
KSMSGFKASEDRLM* FLGANAASNF*LK 

P T LI CHS IT T PMTFKS PARS 


552 


14453 


A 


558 


337 


1 


TRFPPFLGGFSTKFFYRPLVRFPLL/RE 
RKI FPLP P VFVGG FP P ALGP FWG FLF FN 
FSKKRPFGFFWGFFF/ CFPKFLPFKPFL 
GFFPSPRRGFFFFFETVSLCQPS * SAW 
0 


553 


14454 


A 


559 


117 


419 


IPPLLLGVGLFFFFFIRKKRGVFSPRWR 
GGGSNFGLLETPPLG IDPFSGLTPPKS W 
ELRAPPPPPIKF*KFFLKKNGF*WVSPG 
GLEISALLVFPA/SASO 


554 


14455 


A 


560 


347 


1 


I YKELKQItCRKKSNYL I KKWAKNPNRHF 
SEE IQMANRQMKRCS \TTPVI IREML I K 
TTLRYPASPVKMAFI *DR* * *MF/ WQRC 
GEKGT/H/CWWECKSVQPL*RAVWRFLK 
KLKILLP 


555 


14456 


A 


561 


2 


375 


I PPPLQNGEKIFMI I * TGAEKAFYKI P F 
PFPIK/TLNKVGIKGNFPHIIRAL*EKP 
PAY I FHGEAES FPVRSGTRLCPLLIiLIiF 
HPVLEVLVRAICPLK/E/IKGTQIGKEE 




\aaki 


A 


J ox 




i» 


ASGS KG KKKYP PFLRGS P PLPGN PLFPL 
EGGEGKFP* P/RNWGPPPKI FPQKGQGN 
PFFFFFLKGPKGGGFFPPQGEKGVGFPIi 
PGE KSQRP F * GEKGGVP P P * RKKPPP FF 
FFFFFSETVLLLPKLECS 


557 


14458 


A 


563 


384 

JOT 


3 


ISDSGVHPLGLPKRILLCQIiSYHVWP* P 
KCFC/PHLCLFMTLFQPP*GLPSVCAPS 
KQEEHGCFP I S VARQEC* PLFPKNLDT * 
HFVGNFLKF F F F FF * DGVS LGHPG WS A I 
IiTHCNLCLPGSSNSLPQ 


558 


14459 


A 


564 


522 


120 


SLFLPTLBCNGTXSAHYNLHLIiGSSDCP 
ASASIiVAGITGMCNLCLPSSIDPSTSAS 
♦VAIDLRK* * GARGNCPVS /VLEKGWPE 
AEMGRGESAS WWD PRLGRPGVAKRI CKR 
HCCPVQQPQVSMEPSGPASPN 
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^pyi it pn cp 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
E=C»lutarnic Acid. {^Phenylalanine. 

G=Glycine, H=Histidine, Msoteucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=ProIine, 
O^GIutamine. R=Annnine. S=Serine. 
TVThreonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=$topcodon,/=possible 
nucleotide deletion, \==possible 

mirlpfttfrip incpr+inn 

IIULICUUUC II1J>CI nun 


559 


14460 


A 


565 


73 


264 


KDRHIDQWWRIESPQIKSMTIHLWSPDF 
FLQGY* *KDRHIDQWNRIESPQI\NL*L 
YIYGHLIFFYKGTE/IQWRKKTLSSKWF 
WDNWIFKCKQ 




J*T*t01 


A. 


J DO 




■ 


*LSLVPVSSLPLQSLSGKNQ6P*ARRVA 
LCFGKSPGSQQIWF/RLP/EDIVTTVQA 
S YS KKRLFLS LLDF Q YVLQRREGAVNS A 
NLSLAPW 


561 


14462 


A 

• 


567 

i 


397 

■ 


1 

• 


FU5QDLTVAWARVQWCSHGSUCPRLPGL 
i\usoKSiii\7/ onoriuvrnr uonur wnv 
YIL*I*YKTMTGI*MEM*FTPSCPMYSW 
RDV* IFFSFETEACPVAQAGLQWRSLGL 


562 


14463 


A 


568 


437 


1 


KFSEDPAKLIGEDGYSKQQVLNAN/ETA 
LYWKNMLS STFIiARKBTSMPSKLQGTG* 
LLLGANAAGDFQIiKSMLTCHFKNPRALK 
NYATS P LP VLHKWNKKAWMTVHLMTAIiL 

ASAHAS 


563 


14464 


A 


569 


234 


1 


FFPPFPFKASSPPQGTSSSRGVF/PPFF 
PPPKKGFFPKIPPGS'SSPPPF*EKTYFR 
FPPFFIiAPPGVFFR 




J*»40j 


n 
n 








Fo At XnruLiAuxirUlJovjxJWoFAuXlvV 
KS LQVP WNT\ SGLLASGEE I T * ADHNV 
IQNNRNPJ^IQALLITVTiLGLYFTLLQAS 
ENFKIPFTISDGIYGSTFFECAGLHGLH 


565 


14466 


A 


571 


3 

i 


403 


HASGLPS SWDYRRP PPRPANFFVFLVEM 
GFHHLNKAI IKSFACNBIQPIiSAVSVAG 
LVGCV*VCKCLFPVL*Q*LFQF* /S/HS 

TQSRRDPCSLPGFLYQVLSLAKFR 


566 


14467 


A 


572 


100 


371 


YKSNDFYVYGILHFLTTI\FFFFFFKKK 
KFPPVFQVGGQGGNLS* LNPLFLGLKGF 
S CLKLPRS WNNRGAPPPPPNFGFFS KNG 
VSPCNPG 


567 


14468 


A 


573 


371 


1 


REGARES /TWSS SHTPVQKGLRYGI ILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
HWPPTGITPI^PLEVPIXNTSVLLASGV 
SIT*AHHSLIEN 


568 


14469 


A 


574 


353 


103 


SIiQLQPPGLN*SSHLSLI*SSWIYRHKFP 
CPANVR PFVFFOTTV^VTjTjVADAGtiRR /R 

ASQSAGITGVSHLAQLVNPLLFYFSRAG 
n 


569 


14470 

• 


A 


575 


1 


206 


FCIKKIIGFIPGK*GWFNISQPV*SLTN 
R I KEKIH I SMTAEP 1 1 S I ITEKVFDKIQ 
HPLILQKPFGQK*EYKFLNLIKGICEKN 
/PIPNI IYNGEILKQGCLLSPFLFNIVL 
EDIVIQSVKIKEGINIETQELR*SLTNR 
I KEKIH I SMTAEP I IS I ITEKVFDKIQH 
PLIDQKPFGQK 


570 


14471 


A 


576 


157 


255 


YISPTPPFPPDLPF*NYLPNPKLTSPAP 
NFQIY 


571 


14472 


A 


577 


365 


3 


PPHPSVFFF*KSPPPPPPLFFFPPPPFL 
FRGGPLYYS PP/S PPLFFSRRGKDTNPP 
PYSGLS VAGQTPPPPPRRTPPLVSP PPP 
PFFFF*RGGPKNKKAKKQTITFKSPPGT 
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Amino acid sequence (A=Alanfne 
OCysteine, D=Aspartic Acid, 

F^fllntftmio Arirf. WsPhenvlAlnninp 
i^^v* mia in •*» nwiuj a a mvhjt itHaiilUC, 

Cr=G 1 vc i n c H=Histidine. I^Tsn leucine 

\J \7IJLIJIV) AllJUUIIIVj * UvlvUWIUvf 

K=JLysine, L= Leucine, M=Methk>nine, . 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine 9 S=Serine, 
T=Threnninc V=Vnline^ 

W«*Tryptophan, Y-Tyrosine, 
X=Un known, *=Stop cod on, /= possible 
nucleotide deletion, \= possible 
n u cleotide insertion 
















572- 


14473 


A 


578 


2 

• 

• 


323 


VGSWEPDEKVSHRELLLSFLFYPFFSHI 
SS*ETLIDFALTSTDIWALWHDAENQTV 

VKY INPE /HVWVT * FTTHPT*; PDA T <5P * P 

DERPGAVSHACNPNTLRGRGRQIT 


573 


14474 


A 

# 


579 


369 


1 


AHLKGTLGGFPDFPFINLBFRGGLTSCP 
NPLNNWE *R/ RPP PPPGKI FGAFFFFFF 
FKVETGFHRVDLLVL IS * P CDLP AS ASQ 
SAGITGVSHRARPTPG*FKKKIMTF/CL 


574 


14475 

* 


A 


580 


394 


125 


AAEGEQGREGWRNRPRGRBRASDRERER 
VR/ ERG EERERG E ERAPQ * SERWR/E/R 
SRERERVRELWSDSDRALEREALLR 


575 


14476 

■ 


A 

■ 


581 


416 


1 


KGQDLYGEV*KVLLKVIKQVLNKWKDMS 
CS \KINIliI 3r PIRIALKFFF/ ELiDEI / 
CSFV* *NKC*RIAKEVLKKKQ* CLGGHL 
/TSPHIRRSYNTTFSQSIWCCYRTKQID 
* WTRTQNPE KDLR I *GDLI YDLRCCSSS 

KA 


576 


14477 


A 


582 


3 


403 


NCFSQFNVIIMEIPAKFFIDINKLILKC 

FCKGRS ILKKKR * EDS / QRRNFLVIKTV 

WYWQRE CH I DQ WDR I NNPE I NLHKYS KL 
TT.nvna vh to *t tree / t / vc v+i^* A-KnavT? 

CM*EK/IDPKLNPLYK 


577 


14478 


A 


583 


402 


121 


QSLIHSKALTLFNSMKAEEAAEGKVEAS 
Dr , TJT?MD i? * t?dktut TXWT tfTT/WrPS fPO A/Tin? 

AAASSPQD*GKTTDE/GGYTTQQIFNVD 
ETAFYWKTMPS 


578 


14479 


A 


584 


255 


517 


IYIFGFIFRGRVLL/CTHPS*SWLQS* 
LTAALNFWAQVS /LPSSWD YRCTPPCLA 
r r r r r ^lC\wo llJUrKuVuNoNrij^c r r 
LGTPCF 


579 


14480 


A"" 


585 


1 


414 


SSHSCCSKA*SSMGPSPAFYKLVPLFLC 
LFIWDGVWLCHPGWSAWQSLLTAASTS 
Q\VHPSS\CLSLRNSW/EYRYVP 


580 


14481 


A 


5oo 


392 


43 


PI»7r>VD/ -l T7DDr r r ft \ TPTVCOVTOrtS/IT rr 

V- WUxKL&Fir LXiA \ 1 ft iXsoo x 1 oUAUxicxj 
LASSNPPMVCHPKCWDYSHETMHSAQNF 
FLKGIS * F* LCCSHF IHNHLLWLGKVIH 

GAIK 


581 


14482 


A 


587 


1 


394 


GTR/YGINLFITSEVFFFAGIF*AFYHS 

<5 T .JVPT POT iftHHWP PTY3 T TPT R PT»RfJPT jT. 

NTSPLLASGVLIT*AHPSLIENNRNQGI 
QAIiLITIVLGLYFTLLQASEDFEDPFTI 
CDGI YG* TFFWTGLHGLHV 


582 


14483 


A 


588 


190 


3 


DG I NLKI PGGI F FFLKARSHS VTEAGMR 
W/P/NYGSLQPRPPGLK*SSHLSIiI*SSW 


583 


14484 


A 


589 


157 


387 


TKKRGRVGGWWI KILG INIiTKE I KDLY 
SENYKTSLKEIKER/DTNKWKNIPCF*T 
ERVNTVKMSILPKAIYRFNVIP 


584 


14485 


A 


590 

* 


891 


0 


PPPPFFFFliPALIFFFPPPPQ/PKTPKK 
KKNPPPPPPP\PPPFSHIP*QFLSPPPQ 
HHHFIHHPPPQPPFHPFYYNPFKFFFF* 
NINVPPPP*K*IYFF*TKFFFFIPNIFF 
F\TKKKIFLSPPPKKFPPLLTNLSLPIL 
IKNLLSKPPPPQI 


585 


14486 I 


A 


591 


1 


496 


GTRAYQI VKPTP* PLTGALSALLMTSGL 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G)utamic Acid. F—Phenvlalanine 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Meth ion ine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Tbreonine V=*Valine. 
W^Tryptophan, Y^Tyrosine, . 
X-Unknown, *=Stop codon,/=possible 

miMpntM p Hplptinn \=nn%cihlp 

IIULIGU1IUC UCICL1UII) \ pU«IUlC 

nucleotide insertion 














PMRVHFHS ITLLILRLLTNTLTI YQRWR 
DVSRESTYQGHHTPPGQKGLPYGIMLFI 
TSKGFFFARPL*APYH\SSLTPTPQIiGG 
HWPPTGMTPLNTLKDPLLNTCVLLASG/ 
VSIT*AHHSLIENNRNRIIEALLMTIV 


586 


14487 


A 


592 


175 


2 


PFFFFFLLRHSLTLAQAGVQWCDLCSLQ 

sk 


587 

* 


14488 


A 


593 

i 


107 


733 


AAAAAS KVLM * REGQLPGAT \GTGGVQA 
* APGS VA/ABGAS VBGPGFG0TAPAHQG 
LS PTRSHGQGGAGRAS / S SQQGSPGGRG 
DGASEVWSC5AL/SPGGGKDGASASVPRG 

QAPSGS / YTGPNARPAPWP IPGQGGGLR 
RDQAG+VSSWTGSTEPGAHTAHRAPGHG 
QKGQS PQQPHPQGPGQIPT 


588 


14489 


A 


594 


10 


435 


FKWLLKSHAI CFWTRS * S YCDNVCVPSL 
WAHHLGIRTEIPEFFLSKFIjCTSIIPHF 

GAAQFTLPGMDVFVCFVFCF/CLFEMES 
HSVT*ARVQWCDLGSLQPLPLGFKQFSC 
LGL 


589 


14490 


A 


595 


437 . 


3 


DEPKKWKTIPCS WIERTNI / VLKMAILP 
KAI YRINA/VP I KLPTS FFTEVGTFS QN 

*ITTTTT Yt?OMKTn\ tTDIl * T5\1T1VflV40D VMV 
w J\ i. 1J. JjJEvr KWINy \ JSivn w 1 Al\Ai iv 

ASS I T/ PDVASNYKTTVTKTARHWYNNR 
HVDQWUT I ENTE I KLHTYS Q * I LSKAGT 
SKQWGKEHV 


590 


14491 


A 


596 


2 


498 


FFPFLGKTKPPTHLFFFPPPPFPIFFFK 
I FFGPLFKKKNP PK/ PFFKRP PS FPKNF 

ITITWQ DDDDITKTDT ,T?Q iflTJOO T VWWVTfK VT 
C C C o c tr r cc VtrLiC a&JiJtrcrrXV ff rrivlll 

FFFPFPPIiFKKSPPKTPL* ILGFFPSFP 
FFPPGFPFFSLF*GGVSLCHPSWNPWL 
SRPSSKLASAFRMPPVEG/SPFPPSP 






a 




l 


Oil 


RCQRFRENSCVAPRHCKGPGGKGLQALF 
♦VPVAQLGEPGAQLGDPGAHGEATVPBV 
QGE / P AALLPGTAK\ PGGEGS PGFI PPR 
HCRGTGEBGSPGFTQGRETS KAQQRTGC 
LLSCRGCIjCLSVSGDS 


592 


14493 


A 


598 


302 


2 


FLRF FHH KKFT KALNRHFS KD I L I LLS S 
NHMKGCSMSLVIRD/MLLKI T/ MKYQFI 
PTRMA/ 1 1 KKXDRYWQKN/ CEKI *TFIH 
CW*\TfEKMVOPSWKAVWOFLRK 


593 


14494 


A 


599 


3 


386 


HTWPPPPRSPTAPPARTQPPSLQSALPA 
PQPGKKXKALRNEKG*NGSKKG/RGQPR 
PP PLRG PNGRTRS PAG ICKGGGGFCPGV 
SKAQGAPRPGGRBTVVPGFFGRRPPLPR 
GPGPPGVLWTSRHCPIiR 


594 


14495 


A 


600 


374 

• 


53 


EGFFFFFPRWVGGQRALFWS PHFPPPGV 
KI FFPPPPPGWGGLRGLPPPPFNFFFFL 
KKKGFSPFCPVFSQLPPAGVPPPPPPP/ 
TGLDLRG* PPGPPPFFFFFFFFF 


595 


14496 


A 


601 


393 


1 


VSPPKGCVSENKIPPPHNYYFAKTFVHV 
KLFGVAPLFIiQII>F*GPFIiTSIRGLIL\ 
WI PYS PGY\ TLTLL I APRHFLRTI I PVF 
RS VLP KT YIiGLSGMPRRYSD YPDAYTT* 
NrGAFR IARES WGRKLDL 
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sequence 


Predict- 
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nucle- 
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ding-to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=«Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine. 
G=GIycine, H=Histidine, I^lsoleudne, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W^sTryptophan. Y=Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=po$sibIe 
nucleotide insertion 


596 


14497 


A 


602 


1 


155 - 


FCFVT* SG VQWRNHDSLQPL6PGFK* FS 
/CSSWDCRHAPPHLASFCIFRSTRP 


597 


14498 


A 


603 


170 

1 IV 


455 


KPNASLKRVLILCIQLOSSROOKNNPT I~ 
VKKWAKDLNILFSTEDL*TVNRYIKKYL 
TSLVI KKYHFIPTRLSKMKKTEHSKCWE 
SCGEIETLTCSW 


598 


, 14499 


A 


604 


3 


449 


TLLSNLEEAKKKKEDALNBTRBS*TKLK 
BLPGVCNETMMALWEECKPCLKQTCMKI 
YARVCISGSGLVGRQLEEFLNQSSPPYP 
WMNGDR IDSLLENDRQQTHMLDVMQDHF 
SRAFSI IDELF\ODRFFTREPODTYHYli 
PFSLPHRRP 


599 


H500 


A 


605 


169 


435 


KYENTKINRR/ VAPN* JSLS PEKXKKKK 
KKKKKKKKKKKKKKKS ASS SKAS PSSSR 
GG 


600 


14501 


A 


606 


453 


29 


TSPPPPPGENFF*KKPP*K/HFFPPPQF 
RFFFPPFPLKIFFFPPSLFFFWGVFPHF 
PPPPKKVFFPKSPRGFFFPPPLXKKFFF 
SPPLFFFPPFGFFLSPPPFFFFFFFFFF 
FFFS P FFFFFF F FTVKNFYIiLCYFYKN 


601 


14502 

* 


A 


607 


170 


408 


NEYDHFSI IKERRDFIVCVFFFIFSR*S I 
FTWTQAG LS SLQP PLPGYKRFS CLS FL 
CSWD Y / RWHPQGS ANFFVFLVEMG 


602 


14503 


A 


608 


2 


448 


SLHPVIYSEGIKSRQSPCWR**KTVKKK 
KKKNPNOKPKKKKN /RPKNPLGGGKKFF 

♦PEKKPGPGKKIliKEKGKKICPFPPQKK 
NFKI L KKKKKG AP L KKNP * GAQNFPGMK 
KINFFP*REVKKNPIGIFKKKPLFWGGP 
IGANPPPENL 


603 


14504 

- 


A 


609 


441 

■ 


28 


GGTFLKGI KSAPKI * RGFPGWGGARFPA 
RDPPOFGG/PKGGGSLSPRGIJIPPOPPG 

* TLFFFKKPKLTGGGGPPPLI PALRGVR 
P*NSLYPQRGGGAQIPPGPPPPGGKRAP 
CFG KKKKPRRKEKKK 


604 


14505 


A 


610 


445 


1 


LWLKKNTGG0GG3PPLFPPLGEPKRD/G 
FPRGRGFGPPPPPIKNPPFFLKNQNNPG 
GGAQPGI PGPWGG*GGEFPLPP\SPGFH 
♦PRFGPFPPPRGTKREPPFQKKKKIANC 
FLLSDKSLLLEEAWGQWGPSPLEPTVA 
PKPNPRGKAQ 


605 


14506^ 


A 


611 


253 


437 

9 


KKKKVQDMFSENFKML/NBI KATLNKWK 
DISYS*VRRLCVVKMAILPT/IQSYRI < N 
IIPIKI 


606 


14507 


A 


612 

W mm* 


186 


2 

mm 


KKKKVQDMFSENFKML/NEI KENLNKWK 
DISYS + VRRLCWKMAIL\ P\KQS YRLN 
IIPIKI QA 


607 

■ 


14508 


A 


613 


37 


453 


KTPPPGENFFIjKKPQKKNFFPPGNWGVF 
FPPSPI»KFFFFP*TFIFFGGVWPNFPPP 
KKKFFFKNSPGVFF/SPPP*KKKIFFSP 
PRYFWPP/HGFFFKGPPP 


608 


14509 


A 


614 

» 


1234 


0 


KKPPKTPN* * WAKDLNRHFIKDSSIKDD 
QYY+SLGQCKLISQYNFTCTRIAIIKKT 
NNNKSW*GCGETGNLIH/WIECKMVQ 


609 


14510 


A 


615 


327 


188 

4 


FFSSSLLI I S I P KFNHVTSLLRNLK* AM 
LGSSKL*LQLIiRRLRREDHIiSPGVSGCS 
KL * SH \THCTP VWVTE * DGRKGLAGDGS 
HSVTQTGVQ 


610 


145H |A 


616 


103 


446 1 


KKKKKKKKKKKKKKKKKKIOCKKK^^ 
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G=Glycine, H=Histidine, I=Isoleudne, 
K=Lysine, lr=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
O^Glutflmine. R— A rpirnne. S=Sftrin<* 

T=»Tbreonine, V=Va)ine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=$top codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














G\KKKTPGGFWKKNPFLGGGNPAPPPPK •. 

KKKPLEKKKNF*GGRGBKPSPLPOQKEK 

FSHKKK 


■611 


14512 


A 


617 


3 


429 


glls 1 1 ykellqinkisnlvgkwtkdk/ 
nkqflkkeihrlak\ymkrystslvtie 

M*T V^DVVT7WDT.WVTtrWTWMTWP#*riVir 
Fl J-iJXl X. X I It Xlir JuN IvJ. rVTUJr(l\ ±XH_ "blA. 

EVGILHILLEA+ IGKPFWKAITVTVLNA 
PVPCL* PCFEMEFHSCCPSLKCRAPTSF 
IGAS 


612 


14513 

* 


A 


618 

* 


427 


6 


WGGPPPPPPIFFFFPPPP\PYFPFFFIiQ 

PPPPPPFCV\FFFPPPPPPPPPPFFFFP 
PPPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFLVENCVFINS I TSD IEQKIS I 
SC 


613 


14514 


A 


619 


68 


At A 

434 


I oisXJ FIN I jKJuuUUUUNluUUUilfcAr 
KPGGGKNKGGKKKKNFF*KGGGKKNPRG 
NFGKKTIiFGGGKKGEKPPQKKKSLEGICK 
KI *RGKGGKKT*NP /CGGKKFS FGGFF* 
KKFPPGGGGKY 


614 


14515 


A 


620 


454 


Aft 

90 


PFFFPG * KQRGGFXS PQKP PPQGKTI FR 
PHPPKKGGPQGPPPPGGKIFFFLEF\KK 
KGGFPG*PRRFFFPGPGKPPPRPPKKAG 
IQGETPRPGP 


615 


i in/ 

14516 

« 


A 


621 


432 


2 


TAKKYRVFLPPFPLKIFFFP/LRALIFL 
GRFAQ I FPPQKKGFFPKIPRGVFFCPP * 

WTTKTff P17T?T .Qfi * T T?^DPf2 TT7T?KYa& D DTTTTT? 
J\-tSJMN c r r JjOVj iriirroir r XUJxrur trf cf 

FFFFFLDRVWLCYPGWSAVARSRTS 


616 


14517 


A 


622 


422 


2 


GGGPPLP PPRGGLP PPK* * KA/ VGGGKK" 
PPPPAPL*NPPPPKKKIGGGGGGKNPPP 
P PLACFFFGFFLP P P PPSFWGGEKFFFF 

KNKKKNQTKKKKKKKRAAARDPRVRPRV 


617 


14518 


A 


623 


176 


401 


KFSMF I 1 LWKAY YTLCVCVCVCV CVCVC 
VCVIMYLFISSQ/RRLCFLGEGKICSIS 
LT VLRRGSRVCV * SQDLPVC 


olo 


« 


— 




i 
1 




VCVC^CVCVCVCLCVCVCL/CLYPGPQV 
PK*SKNSSTSa,LFHFT3ARTICP/CC/ 
CWFCFCFFA/CYQTCD* ILTSCL/AIC 


fii9 


14520 


A 


625 

- 


2 

• 


400 


HTRLI FCRDGVLLCCPGWLQTPELKSLL 
FS PGFLSLFFFFLERDPGSHGP \ VOGK36 
PKHjG* WKFWPLG* KOFSFLTLWRRGNT\ 
RGPPPPPPGIiVFWF*KKTGFCFVAQAGL 
EQRPFGDQPVWASQGAGITGVP 


620 


14521 


A 


626 


307 


3 


VFFRLPLTQGPGIiFACFCPPPPCRFFFS 
FFPGSNS PFFP/L* KLNFWAFEPPPFFF 
FFFFFS I L FKDRVLLCHPGWNAW* S * * 
P * TPGIiN* SSHLTLPSGRV 


621 


14522 


A 


627 


1 


391 


NPPPLGGAKEGGPPRARGSNPPYPYWET 
PFFF*NPKIFPGGGGHPVIPSFPGG*GR 
KFPLPREGGVPL / RPNFP P 


622 


14523 


A 


628 


191 


375 


LILIFKTIFNFLILNFNF*SGGATQAGG 
QGRNLG*LQPPPPWLKRFSCLNLLSSW\ 
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i 
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HUUIr 
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location 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

VT^WIjUIIVj Jl 4 X9UR7UI.IIIC, 

K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glu famine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 

\V=:Trvnff>nh an Y=Tvmunp 

T» ~* X 1 Jr DIVU II a II) X ■ YIWMIHBj 

X-Dn known, *=Stop codon, /^possible 
nucleotide deletion, Impossible 
nucleotide insertion 














DYRGWP 


623 


14524 


A 


629 


3 




ITWrYl T P T . T? T T .T . T Tfl Q T?\7T /PPPVT . PP PUT 

GPCS VTEAGVQ * CN/HLQPLS PG/PPTS 
AA*VAWTADVHHHAWLFVdiLI*DGSWS 
VTQAGVQ*RDPSSVQPPTPGLK*ASC\P 
AFQKCLDYR 


624 


. 14525 


7V 

A 


630 


396 


2 


QKLSLLAPQQWNAKESLLKEIRSALPL 
NTQMIRMQNSLSADMEDIL / WWIR/DP 
TNFNIPLSQSIIQSNA/L/TLFNFMKP* 
KvjbbAA w EJ\jvIJAIoG*r 1 IrKhKo \ 


625 


14526 , 

i 


A 


631 


47 


392 


LHSFFFLFLGKKIFTPGGGGGGQKPFF* 

NPPPPG*RNFFFLPP*KTWFMGPAPPPR 
irT wot tnrtrnoi5T /vmt /CTn'raonDnT 

PPPLPPRGGCYGGGPPPPPFIFFFRTFK 
NFSN 


626 


14527 


A 


632 


39 


382 


Lr FS P F F F KjnF r F F FGRr FrWGGVAPi F 
PPPKKIFFSQFP/I»QVFFFSPP*KKKFF 
FFPP* I FAP PKTFFS I PPP PFFFFFFFF 

r r rlfzfr 


627 


14528 


A 


633 


2 


213 


LDKPGKHSKISSLQKI /HNGVCLQS*Ldj 
GWIiRQEDCLSEGS * GCSEL * FYHCTPAW 


628 


14529 


A 


634 


2 


206 


QENGMN PGGRACS * PRLRYCTSS WATER 
D S VSKKNE / TNKTTLLREI * HFVGGPNG 
KKGLLKTVKGGLT 


629 


14530 


A 


635 


one 

205 


2 


KQISRLNIjLKKWDY+RGPSGLGKFW/IF 
1»* KQGFPQFFRWFN 


630 


14531 


A 


636 


3 


399 


QVQQTPAS CPIjDSDIiSEDEDLQLAMASM 
TFPKKKKKKKKKKKKKRGGAP* KKTRGG 

Ir X Kv*lvRJ\. 


631 


14532 


A 


637 


122 


373 


VSNILWTQSLLFFFFFLKGSSLFVPQPG 
GQG LD LG * TKFP P RGLKEFS CLTLRI SG 
NYGLAPP/HPG* FCFF IKKGVFPCCPGW 
F 


632 


14533 


A 


638 


390 


3 


ANAAMLPVQPAKP * AHPTFFFY* FPSFR 
FFFARMG*SS\YPVPNLTKLSEILKGPI 
NDMREVFFP I FRliLI LFFFGDRVLLCGP 
GWSSWO*L*LIAASA 


633 


1 ii C"^ il 

14534 


A 


639 


392 


ill 

42 


PLFPPKPLKNPKAPQNRKNLGP ICPPPK 
P /QGPLR IS PQNSQSGSL*GPNKGNCFP 
A*TWGPPGAP*RGPPKGPPFFFFEKVME 
AMGLA 


634 


14535 


A 


640 


390 


* 
1 


t c T?OTnrr*MCT , aT?a/^ir~kVT/2DT e*f\ fTQntrr 
lib e ociTAjrld 1 AciAoUJ^JJjr JjvJV/ i iaU V X 

NAKETFLKEIKSVIPVNIQMIRK*NSFI 

ADKEKILW* I*DQTSPN/IPL/SQCLI 

QSNVLTLFSSKKAERGEEAAEEKFINFS 

* RLRKEAAS ADREAAES VPEDL 


635 


14536 


A 


641 


368 


3 


KKGGYGVP P P FSP P PGFLGEGS FLP PKI 
KVQKTL/SWPPPPPPGGKKKTPFFKKKK 
RKNFLTGFFFFP *KKGPFFFFF* CP W* 
KFWGGGFKP I FFF FFF /RDRIiS LCCPGW 
SAW*SQFTPRV 


636 


14537 


A 


642 


134 


413 


QKDQRNRIESLE/IR/PHTHNTLIFDKG 
VTGFQYGKDSLFNKWCWDN*RB/ITQNL 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
c/~\7iuusiiiic nLiU) i ucnyiaianine, 
G=Glycine, H=Histidine, t=Iso leucine, 

N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S^SerineV" 
T=Threonine, V-Valine, 
W=Tryptophan, Y^yrosine, 
X=Un known, *=&op codon, /^possible 
nucicoiiue uciciion, v— possiDie . 
nucleotide insertion 














SLDPAHTTTNVNLKWTRCLKARP/VKTL 
VENMGENLRD 


637 


14538 


A 


643 

- 


2 


436 


GRVES INLPLGCRIjFKKRDNI KCGQ \G* 

IKLNIHLPRDSTILLLDIDLDGHPKTSA 
LFIITHNCCPIHNHP*LEQPKCPSTDVW 
MHKP WP IHSMEHYSAI KSAAVWDNFKG I 

T T. 


638 


14539 


A 


644 


446 


75 


LDLLTS *SACLGLPKCWDYRSEPPRPAF 
NF* /STTK*SLWSSHKQDY ICRFLLSYT 
ATQKFSTLTTY * SHLKTF *NAQMPRL YL 
QSIiILGSIF*KSKLPR*FQCAAXABNH* 
* SDLt YCQ XMLtr b FM 


639 


14540 


A 


645 


318 


1 


KGVPRKCKNSYVGPPSGPPIK* ILFF* I 
F*DRISLCSPGWNARVFS*LPV\T*TTG 
VKKWWPLSLLNI * G YKGVP PRPG I FFFF 
FFFRDR/ SIiTMLPRLVTWVQVTFQ 


640 


14541 


A 


646 


3 


348 


QFSSFFVLIYFTLPFMVSFCF/WCFSNL 
VLFCE I IFFLI FY\ YYSLVLSFFYCLLV 
L FLS KLS FF/C * YLNI TVF I CSMTSFF * 
RIL/CCIiNFIKTSFNFIiL* * FGMVLESC 
FFFFFFWP 


641 


J 4542 


A 


647 


2 


553 


AILIPDKIDLKK\VTGDKE*NVTMIKGS 
IHQEDVTTINIYASNSR/APKYMKQKLK 
GKEKQISP *K/ WGI FNTPFSI / DRAPR 
QICINQEIEDLNNIIRQIDLKDIYRTFHP 
TSEYIFFSSAYET/FSKIGHKLKHKTSL 
NKYK* TE IMQSMPS D / HHGMKL ENNE /N 
VGKLTNMWKSDTLVSDWIiGAVAHA 


642 


14543 


A 


648 


too 

388 


* 

1 


G PPR* S P P FGKPRGAVPQ/GGGGLKP PG 
PQGENPFFFKKPKITLGPGWGP/RNPPP 
LGG*KGKIP* /PPGGGGSNKPNFSPSGP 
PGKQKQNFFSQKKKKKKEKKKEKPES*C 
PGTS PQPVTDGSQAINASISS 


643 


14544 


A 


649 


387 


25 


PGMRGGFPPFPLKNFFFPPGP* /M3GGG 
GGPNGPPPKK3FFPKNPPGVFFSPPKKK 
KIFFSPPPENLGPPRDFLKGPPLFFFFL 
QKNPGVFFRAGQGNP PQKIAKI FPLPPG 
GGGGAVRHI 


644 

• 


14545 


A 

• 


650 


918 

■ 


2 


LGLKGLTI YKILKS T I ADHTFFS SSHGT 
FAMTDHILGHKIHLSTFSKE*EIIPSTS 
FQ/HHSRNIiW*K* INNKVNWKIPK\rWR 
IiNKTLLNNT*TKE/GLKRHKNIIiS*TKN 
I TYQ I LWDAG KAVLRGKF I VIiTT * IRKE 
ERSK\TTSFNTRKRNKKLIINIRTEINE 

lilNKJvyiJjKlI_iN\ IKSWi? biiKK-KilKJblK 

TLARLNKRKRKETQ I PKHQK* RRGI TTG . 
NMANKKI KRNIHEQPYTHKLGNLDNIGQ 
FLEKRYTiPKT.KOGE T * SGWAYIN /55KEM 

ESI INTL PKRKAQDIiKMFSQS E * YQTFK 
EKK 


645 


14546 


A 


651 


282 


265 


GVFFLKKRPTVF* KKKI * LGPHPQLKGP 
PGVFQ/HFP I *NFGI SGRGDLFFFFFFF 
FFFFFFLFLRQGQVSVAOTGAQWHWQGS 
LQPPGLK*FS 


646 


14547 


A 


652 


6 


223 

* 


LYAHKFNWIiDEMDQFLQRHirrP KLIQEE 
/HRLTVIK* IKSIVINIiPKQKA*GPDGF 
SGEMYQLLKEEI I YKVF 
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nitcl&itirip 
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correspond 

inn tO first 

■ W All * J* 

amino acid 
residne of 

nentirie 

sequence 


Predict- 
ed end 

otide 
location 

Vvl I wVVII 

ding to 
last amino 

nviu 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
OCysteine, D=*Asparttc Acid, 

In -torn i#* AriH T^Phonvlalanlno 
jl*^\jiuiuiihv /V.VIU, jc r licujruilauiuc, 

G==G1ycine, H-Histidtne, I=Iso leucine, 
K=Lysine, JL^Leucine, M=Methionine, 

TV— A ennrnoi ni* P=Pi*ntinP 

Q=Glutamine» R=Arginine, S^Serine, 
^Threonine, V=Valine, 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


647 


14548 


A 


653 


128 


337 


FXFFFXXFFFXFPXXXFPPXXPFXFXFF 
FXXXFFFFFPPXXFPFPPFPPFXPPXIjT 

FFXFFLFFFFFLFF 


648 


14549 


A 


654 

• 


238 


2 


MAAASTTLLALVI I STTF * LPQVNGYI E 
KSTPEECSFDP\SPARVPFSIKFLLVGS 
PIQTHALDIALFLSHTRALPHTHT 






ft 






5 


Peru vvkti re ■* t d a if ktwmtiwt .vuatmT w 

rhfsreniqvaekhi * i csas lglremq 

iqttkc/whpvwktv/wtilqn/lhtrl 

pyd/ pailllgi yqrjslktcshknmyts 
v 


650 

» 


14551 . 

i 


A 


656 


3 


1655 


F FLGMES PSVAQAG VQRCDLCS LQALPP 
EFK'FS WIjTIJjSI^xRRI/PPCIjAN 
VETGF\TMLSRMVX,IS*PCDPPTSASQS 
QGITQV3HRVQP ALLFFLS CLCVKM I VH 
APLVKTCRPSTVGSCL/SVPAFWE 




14552 




657 


519 


84 


SRVRCQSA\QMGGASHIa3YSGVRDPLEE 
AWPFSDLKLHAGITTTLFKAVRQGHLS 
GFAKDF*RPAEQ /GT* YTSKLA/STKQN 
ILHEE * ITFVLLLPKHEGI WMPSVPPKP 

♦HYVLITSH 


652 


14553 


A 


658 


38 


383 


BKCI FFFFFLERGFTFGGPPGGRAQFWF 
NKTPPP/HKKREFPALPPPGGGNNGLGP 

DBDT7HT ITT?* VTTD/"*IPI3T"./VVDr , T MMnAn 

NPPP* PPKGG/GNNGGGPPPRQKK 


653 


14554 


A 


659 


98 


394 


KPPGKKRETPSQ/NKKKKKKKKKKKPPF 
NFPPVMG*RGKNLPPPRKPLFPPPQGGG 


654 


14555 


A 


660 


214 


567 


GFSVINIGIiKMMMMIIIILRDSLTVTQI 
GVQWHNLSSLQPPPPGFK+FSCLHLLSS 
RDY*HAPPCPVNFCIIjS/BRQFHHVGQA 


655 


14556 


A 


661 


463 


392 


S*HFHPSSSSSSSSSSVINYY*/FGLLIi 
C/QVAHFYNS IDQQMIQSQRPMMXiQSAL 
AFEQI IKVNGLLILL 


656 


14557 


A 


662 


205 


242 


GTG WQGGGLDG * QMTLG WPGSGVGGS AA 


657 


14558 


A 


663 


63 


351 


QSETPSQFKKKTQIELLYEPAVSLLGIY 
QKBPJCSVLKKHYTPMFTAALFTI\VWDQ 
PR/CPKYMKR*NVMY/HVYTHTHTMEYY 

t 5*\TT?'RTVrB , Tr."B , \ PITTW 
OnlliiUiEilljr \£nX IN 


658 


14559 


A 


664 


1 


358 


FFLRQGL/ DSAAQAGVQWCNHS SQRAPP 
LGLGRS SH \ PAS LAGWSAVA* S RLAATS 
TFRVQVILLSQPPE*LGLQSCSITQVGV 
QQWRDLGSLQPPLPGFKRLSCLSLPSN* 


659 


14560 


A 


665 


10 


257 


GMNERGNI TKEI KKI I WEYYE * LGTSQL 
DTL / ELNKFP ER YKL / PRTNSQS I ENMN 
RTITSDYISNFKRLPKKSPGLDGFTGEF 
I 


660 


14561 


A 


666 


360 


2 

* 


ENKKI FNS APPPGS PFLFPPLWE I RPEG 
FLKPRVI*NPPWPPN*APPFKPPMGPPFF 
PLFRGVRVGDPLSPPGLRLP* PGIiPP\G 
PPG* PWGQNPTFFSKKKKKSWLDSSGFL 
EPNISVNF 


661 


14562 


A 


667 


328 


1 


NLFAENYECPENKKKNFQKKKIHQIWQN 
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Amino acid sequence (A=Alanine 
C=Cysteine, I>=Aspartic Acid, 
E=Glutarnic Acid, F=Phenvlalanine. 
G^GIycine, H-Histidine, l=Isoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asnaragine, P^Proline. 
Q=Glutamine, R=Argtnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y«^Tyrosine, 
X=Un known, *=Stop codon,/=posslble 
nucleotide deletion. \=DossibIe 
nucleotide insertion 






« 








SE FMKVTE YKLN YQN/SQLYLYMPEKNT 
*KV/ILEKHPL*PQITKYFGEILKKDVL 
NL YTDKYKI SLRE VKBEQNKWNNVS S 


662 


14563 


A 


668 


405 


566 


IHCQWECKTEQILWKTVWWFFTKLNILL 
PGDS AI MLIjGI YPKELKMY I HTETCT 


663 


14564 


A 


669 


218 


2 


LLMLESLMFVPPPPSPEKWAKL*IY/CH 
GAHANFLPSPFPSPPFFSLSFFFFYNRV 
PLCCPGWSAWRSQIiT 


664 


14565 


A 


670 


2 


210 


NFGQEENSEMNSLCSYLHNLEKGEQTRP 
KASRRKEIIKI*AEI*K\VQRSNRENK* 
KKWFFEKINKIDKPIARLTKKWRT*ITI 
VI? KETGTI TKD PADTKR IMKE Y /YICLTiY 

MHSF*NLSRNLKRYREVIEKINERSGSL 
KRSIKLTNL 


665 


14566 


A 


671 


200 


3 


S liCHL PHVAS KATLET \GLVEHM W *D PF 
CTFFETESHCIAOSGVOWCNTjGSTjOPT/PP 

RFKLFSCLSL 


666 


14567 


A 


672 


94 


374 


PKFRPQETTEQTSQ Fl»QQ INCKGRKRQG 
KRTYRLGE IQKQS Q PMATFELYLD/H * F 
* LAKETL* IjGAVAHTCNPS TLGGQAGWI 


667 


14568 


A 


673 


312 


3 


WKMGQLPGRPQLPQIiSQEKIIjSLNSPNV 
FKKKKLK* * *NILPKKKRPDCFTHKFYQ 
TFKEEIIPF/L/HKLT*EFBKKEILLKS 
F \ YEGS W* AAKLNADTNKKTTH 


668 


14569 


A 


674 


34 


438 


QLT*PD*HFIKYSTQQQKTFFSSTHRVF 
AKLGHNL VY\ KANIiNKFKWLQ VI SNMFL 
DRDRIALKINNKK/PSSPLKYLQTHLLL 
NDPRI KEGSKRE I I KD FALNDNATY* NL 
WCL * NGTIiRKVYTTKCLFR KEG POM 


669 


14570 


A 


675 


3 


349 


QKDRKS VRPSS *NPPIAVEKI PI/ LKQN 
PERFNGPAFFKIAHGLTSSLVFCLANSN 
YERTHSR I I ILSQGLQTGFAI KTFRKLL 
ASLANLALPPTINLLGELSI 




if j / 1 


a 






i 


KMVKIiTj* NI VWKTjIiK\ GI/TEVPHGSAT f 
PFFIP IFPKEMETNVHIKTCTWAFPAAF 
FPMAKKKKQSGNNPTTDE* IQKK\ w*th 
1 1 D YYS AI KRNE LLTHATS CN I TLS KRS 
QTKK 


671. 


14572 


A 


677 


357 

• 


6 


AAGDSELKPMFIYCPQKPMPL*IMIiKST 

llvlykwnhkawmtaclftawfte /hck 
pkletyfsekrfiisnit*lmkasghpra 

LTEOTKEINVVFKPSNTTCTLHPMDQEV 
ISTFNSY 


672 


14573 


A 


678 


401 


155 


YYSVAQAWQWRDLSSLQAPPPGFE*FS 
C\PASWDYRHKLLTSGDLPASASQSAGI 
TGVSHCAQPLSLFVPFIjTNQRFVAALS 


673 


14574 


A 


679 


292 


395 


Q*KISFEMLLIITNVPGHPRTPMEMYKE 
LNFF\MPANTS IP* PMDQGIVLTFKSYY 

a/rnvfckaiav/dsdsfdgfgqsklet 
fwkgft i * dvi kni ddl/ wrgvki p ilt 
gvwekliptlinnfev7kasleei 


674 


14575 


A 


680 


275 


3 


wproaslalnt*sihiyihtdmrthaht 
yi pgmks vtqagvqmhnhgsiiep * s pgl 
rrsshpslptsgehrrtppr/ladflhc 
l*r*gf™l 


675 


14576 

■ 


A 


681 


330 


507 


GSV* PVNLIRNCQPFVQSGCA/ SLHSHQ 
E\WYMCSPHPRHYLVASVIFIIAILFFP 
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Amino acid sequence (A=Alanine 
C^ysteine, D=>Aspartic Acid, 
E^GIutamic Acid. F^Phenvlalflnlne. 
G=Glvcine. H=Histidine. I s Isoleucine. 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=Glutamine. R—Areinine. S=Serfne. 
T=Threonine, V=»Valine, 
W=Trypto ph a n, Y=Tyrosine, 
X=Unknown t *=Stopcodon,£=possibte 
nucleotide deletion. \=nossible 
nucleotide insertion 














LRRSL 


676 


14577 


A 


682 


390 


43 


HHLPIRVICLSISSILCLSPIMYLS/IY 
LSIYLSIYLSIYLCQRCQIPLVFLCLLL 
S *IiS * 1 1 LTIiNFS CPFVORTTVLS 


677 


14578 

i 


A 


.683 


396 


63 


AD PTE I QRI I S D Y YDQRHTNKLENL* KM 
DKFLKTYNLPRLN* EEVETLIRP1ANHE 
IALI IKSLPIKRSPVLGGFSVGP*HIDK 
RELTPVTjGLPOirX TKKEEILANSFYBSP 


678 


14579 


A 


684 


i 


396 


EETLPLFADDMILYI IKKKQ2CTKKTTKE 
STKRY*N* *M\NVFGKVAGYKVNTQKSI 
N* Q YT I * KVKLAS \ FSS S PQKNKIGINL 
Tin? t nwwQPWYTprr >vt?t smt.wir* u<3 t d 

X IvEi X \^Vi V I. OHil X ^1 XjIVCiXIVXJXjJPIIv "AO A tr 

CSQIRRPNIVKMTVLLKLIYR 


679 


14580 


A 


685 


283 


3 


NCVESSNTHLWEPSQKPIiSSVSLTDVCP 

DMT a_Q A A T .FAT A \ T? Q VTT .DTP. Q Q T TO? * T IT 

KMWYIHTMEYYSAFKKKRIMSFLMPRME 
LKIFRVNKIR 


680 


14581 


A 


686 


53 


324 


HCPCDRIRAAFCFYILDCPFFISITHKL 

1^PLPPPJJRRFSCI*TLPRGGDYGLGPPC 
PT/KFCVF 


00 1 


I4JOZ 


*x 


DO/ 




ZUo 


CiXC C V^XCbvr/U^XjrUvXjVOiVO HnJrVjJW XXX 

IHW^/STLGGLGGRITQTQEFETSLVDA 
GA 


682 


14583 


A 


688 


47 


341 


SSGRVFVLFCFFETGSHS/VLFRLAYSG 
RTS AYf*fifHr.r,fififln^PTfiATP /htthm 

CCSAQLGFCVCVCVCF/IFVF*VKIGFC 
HVAQAGLE LLD S SNPPTS 


053 




A 




-Ley 


i 
i 


am vnnpn t Apnp t?k ar t a <? aottpiv^ 

V7I% VUUKU XnUUaf AAlUJnOvOnlJX Xvj V O 

HRAQPFFCFC/LIiFWFBMGSHSVTQAG 
VQWCAPGSLQPLPLRFK* FSC 


(LQA 
Oo*f 




71 






t 
l 


KKGP PGGPTKPEGGPPP RGP PGGGKEVP 

PPPP PGGKGEKKRGGAPPPFFFLKKKRK 
SGKGG\N* FFP PGGGKKGGQKKKKRVFF 

PFfyTfKXyK'tOfATJ^T,gT^T^TifiTJ^T>fiT.Tri 


685 


14586 


A 


691 


372 


0 


YDYLYIKKL*N/LREKIDKFLDTYNLSR 
LNQEEIENLNTPITSNKIETVIKSPPTX 
KQKSPGPE 


OOu 




A 




1 17 


907 


APLLESSSASYLHFSN*DLOMAH/KHTK 
RCSTSLAFRBM* I KTTRRHHFTPTKMAT 
SKRHT 


687 


14588 


A 


693 


122 


322 


EQTNS WS I DFFFQRCHSNAMGEKNCLFN 
KW* WDNYVAI \AKKMKLSLFKPYIKTNS 
K/WI KDLNIRAKT 


688 


14589 


A 

r 


694 


21 


342, 


RSHFRHLNEYYATQQGNSDVHPLLIHSN 
ISTAI IWQLQRQERRKEKRLKEVKNEKA 
TjKEI FFLRGWVLLCHLGWSPWGS*FIV 
TTiN\ FGIJROSSHIiSIiPSSWDYRPV 


689 


14590 


A 


695 


2 

mm 


343 


KFVPLHSGLGNRSQKQTLSQRKKKKGGV 
FVAWAKVQWHNHG*QGP*TGLRQSSCLS 
L*GGWVKL/PGGFJfflAWGFHHVKKVGLE 
LLTSDDAP S SASQKAGITGMSHTAPGPK 
NLV 


690 


14591 


A 


"696 


336 


3 


IPPVKSPKFPILLKKKKFSLFCFSGAPF 
PCYSPPRAQKKGW*KIFSPPPK\QKKPG 
ALFFS P KPAFFP P P FKKKKKKKKAKEGH 
YLMIKGLIQQEKI ILNI YASNIGARRRG 


691 


14592 


A 


697 


374 


2 


AFY*KKMVSRTFIAREKSMPRFKASKDR 



561 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: of 

sequence 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 
eth 

Ou 


SEQID 
NO: in 

09/515,1. 
26 


Predicted 

beginning 

nucieoune 

location 

correspond 

ing to first 

ammo sciu 

residue of 

peptide 

sequence 


Predict- 
ed end 

flu CI© - 

otlde 
location 
correspon 
uing co 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D-Aspartic Acid, 
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G=GIycine, H=Histidine, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
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v^Oiuramine, k— /trgintne, apocrine, 
Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, V=possible 

toft % ft Jft ■ A* a * » Mft m dft aft 

nucieotioe insertion 


■ 




■ 






• 


ltfllgagnfklkpvlvyhsentralkn j 
yaks tls vl yqrttkawmtshlptewpt 
bffep\tvetycsekkipskillltdya 

rKrlrKl iiriCiiyiZibU 


692 


14593 


A 


698 


361 


3 


PQMCPPPPR*QGIIFL*L*THCFFCTVK 
ARLFLFPPF/S *RKGGFFFFE *RKFFKY 
FGWELFQ I RFKVFFFCDRVS LCCSGWSA 
UAKl*LlAAbTbPAQASL»L»PIiTLPSSWD . 
YSHVLPRLV 


693 


14594 


A 


699 

i 


75 


362 


KDCAYGI CSKKQKGFCLTQTGAWQQYEE 
I*GLET/GFSRSVHSMGQRRYWDYRHAL 
THPANFVFLVETRFLHVRQAGIELPP/S 
SSQSGGITGVSQ 


694 


14595 


A 


700 


2 


277 


FFIiYTSNEQSEKKI / RETNWFI ITSK* * 
NRNVNKE I * DLYNENS KTFLKE I KEDLN 
KWKGILSS* TEKLNI IK/PTAI * *FNAI 
LIK/IPMVFDK 


695 


14596 


A 


701 


1 


407 


GTRKHFSKEDRQYHKSLRKQIKT/TVSY 
CFTPTKLTTVGKTANSK/ CVJAGCKVIGT 
LIHCQ*ECEMVWLLWT/SVQQFL/RNVK 
LYETAI 5 LLGICTRE I KTPLHI 1 1 \ I IA 
KKWQQLRCLSTDEWINNM*YIHTMEYYS 
A 


696 


14597 


A 


702 


209 


3 

i 


SLNRRLSFFFY*RQGLALLI/EPGWGAV 
ARSWLTPASN\FELQRSSCLSLRSSWDY 
RRVPLHRVSLRFSC 


697 


14598 


A 


703 


308 


3 


TTGLKRFARVGP PKCWDYRCEPPGIATS 
SVLKAFQPIGSVPPRIiSRILSFM+NQ\* 

•j ft*Vft. ft ^ft* *» M J"% aftft ft#* ■» ■ ft, Pft ""ft ft jpftt ft?^ft> "ft* aftf Wft «n /^tST Wftt ft* ftft 

YRMLITSTKYFPRMVSISRPCDLPALAS 
QSAGITGMSHHAELVPLV 


698 


14599 


A 


704 


603 


2 


PLPP\GLQVESP/CVSLPSSWDYRHAPP 
RPANFSVF* *RRGFTMLAKM/VLIS*PC 
DPLA£ASQSAGISGVSHHAWPKQT*LLD 
TDKTEGI FLTNHLLRIPPPMFYQYIIjIK 
VLiESVKETKDKRKSVSSCTLYRCL* ILF 
IS * AYI KFTTFFFF \ ETDSRSATQAGVQ 
WHGLSSLQPPPPGFK* FSCLSLLSSWDY 
RSKPPRPA 


699 

< 


4 a y r\f\ 

14600 


A 


f\ r 1 

705 


3 


366 


ARVTYYSGKKEPFG YIiGMA*AMIS IGFL 
GVI V* AHH I FTVGIDADTRAYFTSATI I 
MAI PTGVKI FS * LATLHGSNMK* SAAVL 
♦ALRFIFIiFTVGGLTGICLSNSSLDIVL 
n/ UTilVGAnr 


700 


14601 


A 


706 


358 


3 


GFHHVGQAGLKLLTS * SARLRLP KCWD Y 
KREPPHLAPDRKYFSPLVFARDNFVPQG 
P* \HVWINICLSQVKAATGIYPGEAGAL 

ISTLCSC 






A 




J DO 


0/ 


OT.MQTTWY* * K'NT/'R TnnWMKT ADMTCIUH 

yUOO llnl JCVJLM/ XvXU^I«i«^Xx>UJJt iOVR 

RKLISDKVTKPIQWSKDSLFNKECWNN* 
TS I CKK/LSIjDSALTLFTKTNSK* itep 
GSVAHTCNPTTLGG 


702 


14603 


A 


708 


34 


363 


RTVFFFFFFFFFFFPKKTPFFFPGPRGG 
, *IPPPEKKIFPPVFRGKFKKPPHLDPGG 
GNFPRKPPFFFDGP\ SFFFLGKTliGGGG 
WIPFPPPWPNLFFKKIFWGAGGPSPF 


703 


14604 


A 


709 


1 


380 


STCKRMTLDSYFTPYTKINSKWNKDLTV 
RGRTIKL IEENIGVNLHDLGFGSGFSDM 
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X=Unknown, *=Stop codon, /^possible 
nucieoiiac ueieuuo, \~*posoiijni 
nucleotide insertion 


- 


■ 










TPKAQAKRKIN/WDFIKYLKFCA*EDSI 
BRV/N/RRSTEWEKIFANHIQ*GN*NIQ 


704 


14605 

• 


A 


710 

• 


382 


30 


PGTTGVHHHTRLTFVF/ CGRAKISXCWL 
VWNS*AQSACLSLPKCWD*BPQHPARPV 
SFNTRSGIPIISTNPSLSLSAFPSQGFV 
FLFPHFPASHPBTALAEFFLGTFSLSPP 


705 


14606 


A 


711 


1 


377 


LLVICGMQMKITVRCN*TLNKIAIIEKK 
TAKpC * HYGGLKTLI HQWCKSNMVQPIjC 
KTV*QF* KKLNIQLLYLLK* KXAYIHPS 
NCTQMFIA/ALFSVLAPNWKQSAY 


706 


14607 

t 


A 


712 


J 


390 


TW*AYMALTRLEPGRGDHTAALQPQRQS 
TTPF*KKKAVPFRA/RPVKMREREI+KP 
FSPE\RTYSCAQEGPGRTFGSAQDLEAA 
GGRGHHRMGAVWQEPHRLLG 


707 


1 A /AA 

14608 


A 


713 


2 


393 


KVRRQI INVDTATDS I GI KWI I WKHYV * 
LYANKFNYLAEMDTYVERYKLP\KEIDV 
MNSLVPAKEIIVWKILLTKTPDPHIFT 
NEVYQTFKE YY/ PVLYKLFQKIEDERTF 

t O CCf'H'Ti/"* T7V7T.T C T/"" , VD'Cn , T? 


708 


14609 


A 


714 


219 


1 


PRPGAVAHACWPS TLGGQGGRI * TQSS I 
I KACCLI HOLAI QTQKQMT \ WPGAVAHA 
CNPSTLGGQGGRITRSGARDQPSQHSKT 
P S LLK I Q KKLAS WAG ACNPG YFENY I Q 
KLLRGPGT 


AAA 

709 

T 


1 AC 1 i\ 

14610 


TV 

A 


715 


381 


126 


X XAUUUy V L-l w by JUr iviCuKy JsIXHJjia rtj V 
RGYKEL* S YPCTIiAWVTE*DPVF* KKNY 
/TLNTSPKETNKK*GVRCIKTGKVIjIFV 
AAG 


; 710 


1 A C 1 1 

14611 


A 


716 


417 


1 


YKKNNWFVP VNTRS FVQLMDQG I ILNF 
XS YLRN/ TF/HKCIAVINSNSS * ESGQS 
KLKALWKGFTVLYAI QNI CDSGQVKMPT 
LTEV/ WKEVILPLMGDFVRNRTS E KQNY 
KLS 


7ll 


14612 


A 


717 


406 


122 


MFSRDAVSLCCLGWS*TPELKLSSSLSL 

RNI IRNSDAWQR * TTLVS YDRNPP * FPS 
PPPNSPLCPAA 


712 


14013 


ii 


"71 © 

718 

• 


535 


1 


LWVRR \LVKI PVLPKLFYSVI VFPIRSL 
VRYFIE VCTFTLKCI PKEKDLKQPNRF * 
KKKKKVHLPFIKVYY\KLQ*YIATVIKT 
VWY/W/ YSKIDKAKW YKTEKAEV* PHEY 

KNMNLNLSFTSYT 


713 


14614 

ItU I** 


A 


719 


755 


i 
* 


GKLQPNKGLI SLI Y \NL* KLVRKSNPKE 
3CWAKGMNR * LTEKG IQI AFRHMKRGS PR 
F IMKAMHI I TTWS YHFSS KRKTR I QRLT 


714 


14615 


A 


720 


381 


0 


KFIAVNEYI IKEIDLPINNLTLYLKEL/ 
EREELTLCKASRAEVI\KIRAKI IRELT 
PIRKTDNQSWFFLKIHKIGQ/PFKLD*L 
QKKKRVREKVP ITKLR YENRVITTNFTE 
/ 1 KRI IREYYDH 


715 


14616 


A 


721 


3 


381 


IYREWRDLTRDGTYQGHHAPPGEKG/VC 
RYGIILFITSEGFFFAGFF*PFNHCSLA 
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T=Threonine, V=Valine, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, v=possibIe 
nucleotide insertion 














PTPQLGGPWPPTGITPLNPIiKVPIiLDTS 
VLLASGESIT*AHHS\LIENNRNQLIQA 
LLITIIiLGLYFTLLQA 


716 

► 


14617 

• 


A 


722 


56 


455 


KTI INIKPPAI IQ xQTPLFv *SVLItAE 
\LVLLSLPGLAVGITILLTDRNLNTTFL 
DPAGGGDP ILYQHLF* FFGHPEVYILIL 
PGPGI ISHIVAYYSGKKEPFGY IGMV*A 
M I S IGFIA3F I G * AHHI FTGGIA 


717 


14618 


A 


723 

! 


398 


3 


HQLP VFWQ YN KKAWTTRPLFVDWLH* CL 
VFEVS KYPAS KGLPFKWLMLDNAPD/H 
PTQNSMSS /DS KG I EVI YLP PNTMS L I * 
FLDQGVIRTYR/RHYPQYSMQRS IRPMQ 
E I SNKENI I KVWKNSTTDDAIVA 


718 


14619 


A 


724 


318 


14 


TPPFFLNIPNLFFLKFPSPPPPWFLTKS 
LFFPLKIiKPKVFGK*SLSKF/CPFPKFN 
PKPQKKNFFFPPPPFFFFFFFFLRHNL/ 
VCHPAWSALAPSQLTATSAS . 


719 


14620 


A 


725 


2 


371 


APRCKRFSCLSLPSGWDYRCMPPRPVKF 
F/SIFSRAGGLES*PPMIPPA*A/FPK/ 
CWDYRHE 


720 


14621 


A 


726 


430 


97 


PKKNPFWKRA*NGGAQKDVPLSLP/SGW 
AGGSPWGPGF*PPLGPKGGPPFSKKPPG 
GVGPFLMPPTPEGGGGKLA* PWPPGFNP 
PVS I FLPRP PRAGGKKGTPFPKKKKKEK 


721 


14622 

• 


A 


727 


406 

r 


1 


GGAPPPPFFFFFFIKKGVFPF*QEGFFF 
PGLKN/LPP+ PPKKVGVKGGSPPPGGFF 
FFFKKKVFFFFPGLRTRAQIKVPSTLVG 
QG* KI LPPQVSQEGGVKREGPQAQLFFF 
FFFFEMESHFVTQAGLQWRD IGSP 


722 


14623 


A 


728 


98 


368 


KPHKNAKCP I RVAI VADMENNKYVJ * GYG 
EMGTL/LHY* WEGNIEQPWE /T/VWQLL 
EKLNTELPCDPATPLLGIYPKTUSSRDS 
NRFFIFFFSF 


723 


14624 


A 


729 


19 


404 


VCVISICXATEILFFLHTRPCVCIYCVC 
VCVCLAVWGTLCVYVCIESCVCVCFHLT 
GVLC/ V* VCVI S I CLATE ILFFLHTRPC 
VC I Y CV CV CV CIJww GTLCVYVC I E S CV 
CVCFHLTGV\CVCVCVCIESSVCVCPPT 
OGTVCVCVRPPGVLCVGVCVCQAAWVAV 
SMCLTP/CPC/VCVCVCVCV 


HI A 

12A 


14023 


A 


730 


432 


12 


FFHKPNFPAARKGRJjFFPIYPSKCIjISP 
KAL* F \ FGGVGP FFP PPKEGFFP KVPR * 
CFFRPLIRKKQILFLPPLNLAPPGVI *M 

A*S *L/TAVRTRGLK*SSCLSLPKCWEY 
KCEP 


725 


14626 


A 


731 


332 

- 


7 


NKTRKEKFLTRSISRF/CFPPFPXjKIFF 
fplra* ffwgglpq i fpppkkvf f/ p ki 
pkvffkwp plrkki ffffs pl ilgpprv 

LLKGPPLFFFW*VFFF*DRALLCHPY 


726 


14627 


A 


732 


70 


294 


FLRFCVETAINKIAIILANF*NFL*RWG 
\FTMFPRLVSKL*DSSDPPTSVSQSVGI 
TWSHHARLNFFYYQNSRR 


727 


14628 


A 


733 


1 


355 


FLLRHX LLCHPGFCS VATAYCSRDPGSS 
D/PLPPQAPLPDQ+PRLQA/WHRLAPPH 
SANFF* * RQG FT VLARMVS ISQPCDPPH 
SGPQGA 


728 


14629 


A 


734 


50 


395 


I PGLTRQWIiLDPCASPSTPP YT/ P* VQP 
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STPQNS S PS P KTHNQKGLPMPLS PTPKP 
STAWKKAILEkTSSSSSSSSSSSTCRNR 
NGYTYTVPEHRPARGHTASQTRKQVLAA 
THKP 


729 


14630 

i 


A 


735 


451 


0 


GPSSSPTNIPTPVSWNSFK*SSQVAGAT 

SAQLCQYFMIFLRQGVSLCPPGWSAVAW 
SKLTT/CLCLPGSWDHRCALAHPANFLY 
FWWR/HSC 


730 


14631 

i 

i 


A , 


736 


2 


477 


PSHVSKKRIjISKIYKELIHIiNSKKVSNP 
I*K*GQSLNRDFSKASFIiPP*KC\QMVN 
RYIKRCSTSLIIKBMQIKTTMRCHLTPV 
RLAIMKKSKDNKPSLVWDC* IRVLHLF/ 
CK/WICQYVQL/FWLLLSNFLVKQPF\Q 
LPAPPNSPTLG 


731 

• 


14632 


A 

■ 


737 


3 

* 


2344 

• 


AAGGP TAQS PAQLAGRALRIAR WRAVAV 
GACRPGAGS PCS VQGGAASEL S PRPQTW 
I GS LKP * TFGAAAG * AHRGCGGS ALLN* 
ATPRPAPGLPAS PTSSQALPAPLGAWGH 
SDHQPRAFP*SPQASTAIRKEKKQRAQP 
GRASVCPASNPFISSRAIiPVLQHGPPAI 
SGAGS AVASQAPG SS /GSHAESGS PALA 
HTP*GS*EPHSLIVESTRKS\ELPSSSQ 
GRLLLPLLTPGVAS / PVGTKLPGATAAT 
AGALHQPGLRLSSLQGVGGAKNKQTGCC 
CLQLPTTGLPQAPGALRPIiGRLGPAAAP 
GEHRQRTS PQGTVP YGGIRQGWEQPQRL 
RAYGTALPPPHTPPGSSGP/RQAPGCRG 
SGAGEAAGIRDTGGGGPGRRAPCSQASP 
GRGGWQAQVGCETCRGCAQSS /GGGAVQ 
P3LPRKPYPHS/AR*ENLWPFPCSPRT 
RAQEPQTQGEE/GVEGPQGSPCAPGAVR 
GRGIQLSSEPGKXiARQG/PASGDGP*BG 
TGQEPSQAFSSARHPWPSEARMPDTAIC 

QPNS YWDGGNS ATLGRTTNTRRHCGMSN 
FGARGDLLGTVPT * Q PLMQRRKE KPRVG 
GEPVQSHTVCG*\PAGVSRGWPLRPMLI> 

WR PT ■ <? BlQ ^G HOT? PGT iKP P <?T » A\ (TAP 

SSSPHRTCSP/GIiDPGLPDAGSIKPPSL 
VGAGQGAGS T /GLD / GPGLS LL S PGKSL 
L PPS SPATGLSGLG WAQSAF * SLLTVA* 
WLNPVPNG PS DTAD CTP AOAPTAPAMLE 
NQANKSDFFFH 


732 


14633 


A ' 


738 


37 


450 


NQKKWPPPRSSKPARPLFFFI* *QKYIKD 
P*KTYQGCFFFSPPPKKKNPPPPKGGGF 
FFFF*KKKGGGPPP/QAKKRGGGGPPQK 
GGAKKNP PRG FTRVFK3PFKKRGPP 


733 


14634 


A 


739 


2 


426 


QEFVLGSAPGCGCWVGGEGRVSSPWGDR 
KVGGPSNSRHWNKGAGPSCPPGPSLAAG 
LPGRRAGSWAPVLW\PGPP*GLAQLPSP 
LW/PPSPPQEREPPLPD 


734 


14635 


A 


740 


21 


392 


AQEFKSSLGNMERPLPYQKKKKKKRPGG 
GA/RPLETQPFGGPGGGEKKGGKKKTSR 
P / IRGKPPFL* KKKN\CtAWGGGPAL*P 
QFFGGVGEKKTFNPGGGGLGKPKPPPFT 
PTRGKNQTPQKKKKK 


735 


14636 


A 


741 


1 


408 


PSSQGG*EAGG* LEPRSSRPAWAI*RDP 
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Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
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K=Lysf ne, L=Leudne, M=Methionfne> 
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W^Tryptophan, Y^Tyrosine, . 
X=Unknown, ^top codon, /^possible 
nucleotide deletion, \=posslbte 

mirlpnfifip incprtinn 

UUtlCVllUv IllOvi uuu 














/ GSPEKGKKKKKXKKKKKKKKGGGGFKK 
TPGGAK I KGGGE KKNF FFKGG E KKNLGG 
ILEKKLFFGGGKNGEKPPKKKKGLREKK 
1 KF*RGKGGKKAENPGGKKI FQKKKK 


7JO 




a 

.«* 


7/17 




I 

1 


RQGHLSLQRliLRLFVCIiCPAPRGGAYRS 
RQAS LNRGGFHP VRAYLLLCLPKQASEM 
AGAPSPA*LPPCSLISDCCASNQ*DSVG 
/ VG PS E PGEGYNLWRRFLS 


/J / 


I HO jo 

* 

■ 


B 


7A3 

I 




1 
1 


PHPGRVFPP*DVNFNQWIFVSL/ISSLK 
DEPAFSSTLYSF*QHFFFITYLPPP/CF 
FLIES CSVAQGGVQWCNFGSLQPPPPGY 
KRFSCVGLPSSWDYRCV 


Tift 
/JO 








<> 
2 

■ 




TDT WOT cnzOT" T.MTC17T.T.aOrt\7GMa#arklJ 

SLIENNRNQ I IQALVI TILLGLYFTLLQ 
AS EYFESPFTISDG I YGSTFFVATGFHG 
liHVI IGSTFLTICFIRQLIFHFTSKHHF 
ur Cinriv x rvtir* vn / VKuLiritiuoi in 


739 ' 


14640 


A 


745 


393 


1 


PPPPPRGRGSPPPPFSRKGGPKGKPPPP 
GNFFFFFKKRVFPF\CPGGFKS * V*GVP 

PPFPPKI fkfkggtp"fpglws plks kks 

QFBWPKPGERKKLFPPFFFFFFLRPCL 


740 


14641 


A 


746 


2 


290 


KNLCRENYKYE * KKLKTMKDLCKE YYKV 
*MECKIKENM*KDKLMNFK\NNIKISTLI 

TriTTYlfPTTlVT DTKTPMTPPAP\ Tilf * TT.TfP 

V*NYKRP*IVKAI 


741 


14642 


A 


747 


59 


364 


CRFFFFFGLGGGGGGGGGGPPPFF * KNP 
PNPPKKI PNWGFFFRGPGFAPKKKKGPF 
LGEPPPWGAPLYNPPPPFGGGGAKFPPI 
FFPRAGGPQKRVG/RPPPPPPPPPPRPK 
KK^OJIiH*LECVEGTEEPPTRPPTRP 


742 


14643 


A 


748 


2 

* 


371 


S S CXiDLPKCWDYRHE LlJIIiAHWL VXiGI F 
liLLFCFVTiRW/RS CS WQTGV* WS DLT 


743 

* 


14644 

t 


A 


749 


339 


198 


IADMEKV* AF*TEHKTNHISLNQN* I*S 
jKVLTLFNSMKAERD*EGPEEKFGASTGW 
FMRFQE I SQLYN/ IDV*GEAAS ADGEAA 
ATCAEDPAKI PDEGGS PK* YI FNVD*AT 
FIRDIiSWI FCTSCCSFSISTCCFTLHVM 
L 


744 


14645 


A . 


750 


249 


2 


KIFKPPAFPVSFPPFPL*DSSSPPEP*I 
SGGRVGPI\FPPPKKGFFPKNPPGVFFS 
PPLRKKIIxLVPPPLNLGPPKGPLKRPP 


74 S 




A. 


751 


i 
j 


Zoo 


WNHMTHPCT^VSPT Y*OPT J nTi\ 1CRICTRW 
PVQKMTKCPKRRFSEEDEQMARKHKQC3W 
PASRAIRGTQSKIIMRCHFTHTRMARIK 
KEKEAGPGGSCL 


746 


14647 


A 


752 


494 


2 


ETGSWFGPQRIiEVQWWIKSSLQP*PPW ! 

APVILPPQ/LSSG*DHRHVPSRPG*LKK 

KFFL*RWGLTMLPPJXLNNLPTSASQNA 

AITGVSHCARSASVFLII/RCX3I.WyWEB/ 

GAQW*SAIiSHLPV* /LSR*WCQP*PLG* 

DSGSGSPLWRRLAYLPLCAIiWKEVTMCS 


747 


14648 


A 


753 


1 


533 


YIiSPRLECSGGITAHCN/LRPPGLK*SS 
QLSLQ\SSWDCRPAPPRPAARLIFFCRF 
EEGPJRSHYVAOACI*KLPGS\SNPPTVA/ 
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Amino acid sequence (A=Alanine 
OCysteine, D==Aspartic Acid, 
E=Glutamic Acid, F=Phcnylalanine, 
G«Glycine, H~Histidine, I=Isoleucine, 
. K=Lyslne, L=LeucJne, M=Methionine, 
N=Asparagine, P^Proline, 
0=Glutamine. R=Areinine. S=Serine. 

^mW V mm m* * — ■ ■ ■ * mm mm m m* m* m m±m Mm* mm mm Mwl mm* ^/Vl V) 

T=Threonine, V=Valine, 
W-Tryptophan, Y-Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 




* 










FPKCWGYRCEPPHTGLCS/GLSEFGFCL 
WWGACSRSSHtfNPGCGTA/CPAVFLHSV 
GYASHPSSTSRTAFTLGWVLVPVPRDKG 
LTVFSQAGSTVGVILG 

mmm m* W mmf mV%mr mmw mm w V Alw 


748 


14649 


A 


754 


1 


346 


els kstl p alcggndkawmtahrf iawf 
tdylkptvenyw* ekkirlkvlllidna 
pghpkallemyteihwfmpadttsil* 
pthqgvistlkfcyl*ntfhkvq\cyid 

SDSS 


749 


14650 

r 


A 


755 


1 


323 


EDQTSHNVLLSLNLIQSKALMLPNSMBA 

E/R/SBEAVEEKFEASRE*FMRLKERSC 
LONT KVOGEEASATX3E PPACDPRDT AH T 

TDEGDCIK*QIFIVDKTAFYSKKMPS 


750 


14651' 


A 


756 


1 


344 


ARQQPGD PARGGGRGRRL \QPRQAAS FC 
TGTLARARHTCEGAGVLRRPADKLASLN 

*HFSQKKKKKKKKKKKKKKKKKASSSSS 

KKTCFCDSS YSS<5SSRRRC» QGQ Q q QQpppp 
xvxvixxvu^w i ooouoivuvjijLJOiJiJijijijrr rr 

PPP 


751 


14652 


A 


757 


119 


317 


NRYFPFLRVKKF P WVEGPFF F LR* SLTL 
LTLVAQAGLQWRNLSSLKSPLPGFKPFS 
/PSA/LSRWDYRR 


ISO 




t% 


/JO 


z 




RQA/ 1 KRKNKTRRYQL I P VRMTL I KKKK 
RW*RCEEKGRIiAHCWF\ECK*RQPL*KT 
K*RFLKKLKLP\FITAIALLDIYPKQIK 
SE / CKKH / CATLF I ALFTTAK 


753 


14654 


A 


759 


358 


1 


KADFCFPPTFSPRVKKIFPPPFPKKWG* 
KGP/LPPPGENCVFFFFKKKRGFSPLGG 
G FFYFFP PGFPPPW P FKKLGFKGLTPPP 
GPPOKILVFFFFFFFFFFE /DGVUjC3RP 
GWSAVAQS 


754 


14655 


A 


760 


316 


2 


KATRSQD IRRIQVKLKE I EFQKPLQKTN 
KSRSYFF* KH* * NRTLTR VMEJtfKREK/N 
EIDPIONHKEDITTDPTEIOTIRKYYKH 

Mmt mmtmrn* mT ^yilj»i> tmt ti * mm, mm mt 4m* m\ mm mm9 mm, ym* mm mm- mV\+\ mm mm iui 

LYAHKLKKLEEVDKLLDTNTL 


755 


14656 


A 


761 


3 


321 


FPLGLWGSLGAKPEGQAPLYPPTLSKRV 
VPLC/DKGRSAVTFNTVYSSGSSRWFP 
P PCM* I YRLCI KGRS SDPEQKKKKKKKK ' 
KKKKKKASKKKKAPKDSSSPKT 

m\ mmm » •% M «*W mrmmf mmm mm* mmw mmw mm m> m mm 


75t» 


14657 


A 


762 


315 


2 


KTERW / VFEKINKMGQS * DGLTKKNREN 
/V/ QINKIQNERGGIiSTGSTEIQRVIRG 
PCEPL*AQILDNLEKMD*FLQPFHLPRQ 
NYKKKK1 I *KKPISKVIELVIKNL 

mwl mm 4 WVl\lV^ ^ iUVL AW * V W mm, mm»mm* W mm* &UI w 


757 


14658 


A 


763 


226 


329 


NPTTLEGQGGRISKGPEF/CSRDEVSPC 
WPGWFQTPDLWRSVRLGLPKWWDF/RR* 
APPPGLNLFFRMLSTHGSW/COHPCFKF 

■* m-m* mm mt *m+ mm+mtV mm* m mm 4 >A i4JW «* AAV f X#**> ^ 

PTSAFQYIYIYIF/SFLSRSFTFVAQAV 
VQWHDLGSPQPPPPGFKQ . 


758 


14659 


A 


764 


335 


1 

m\ 


QS QERPDS FQ VMNLTLPRAPSMFL VS PR 
TELSKSP* \PPAFLCPCVWPHSTLLSQT 
LKY* IKYP* SPTPFPSFVLF*RDRVLLF 
HPG WSTLAGS * LTAASNS WAQ VTLL 


759 


14660 


A 


765 


100 


310 


HFGRPKRENCLSP*V*DQTLHTHTHTHT 
HTHTHTHS RS ENGGMRLT PHI PERVRW\ 

Girc 


760 


14661 


A 


766 


315 


3 


I FPNPPRI F I QNPHPFLFGFFPKKXBGE 
KNWAMETPR FS P PPLGGKKAPTGFPPP P 
IGF*GPKPLIKGRGP*/PP*KNTKIFFF 
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K=Lysine, I>Leuclne, M=Methionine, 
N~Asparagine, P=Proline T 
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W»Tryptophan, Y=Tyrosine, 
X^Unknown, *=$top codon,/=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 

mm UVlf * • i IliJVI mm W ■* 














FFFFF*DRVLLCHPG*SAVARW 


761 


14662 


A 


767 


1 


309 


KE P YKLTELHNE IRVACGHLS KKKKITW 
VSDS * ASCPSKTRGNFIGPSAGSRAPSE 
KLEAHFHGCGS VNRVHLT/ CK* HKRRP L 
GVAHLEVSDOKSGRTSIiALD 

V4U Hill V 1»J L/>^A\4JWAV A I^AMftUl/ 


762 


14663 


A 


769 


1 


316 


GRAPPPPPPPPPTPRVHTSSKSESEPER 
/DGRE*EPTRSLERL/LFFASYFCL£AR 
QSTSALPPLPLSPRSGPT\VLLCPSPPL 
PGERPPNPTKLSREKQTKQTHRAR 


/OJ 


i A AAA 




77A 
/ fU 

i 


L 




AP R T T T T.TF* fJM T TPT.NTffnR T* <5T. T T. A 

SLI I FIATTN\LLGLLPYS FTPTTQLFI 
NLTMAIPL*AGAWIGFRSKIKNALGHF 
LPQGTPTPLKPILPIIETIILLIQPIAL 
AVRLTANITA 


7AA 


J400D 




771 

III 


L 


JM 


KfaPPVTQftlAWTPVPT *.QTTPTnPPT?ntrKT? / 

LNGLQ IGKEEE I FLFKDAML+ I ESPKES 
VQKRLD VI YKFSR I AVCKINIQKSNI W 
YVWNTQ FENEVKT I *DPQ/DIKHWGIIL 
SQRKKE 


/OJ 


J40DD 




/ AZ 


O 




X4T?DTJ* , VT 1 .KT.CT.T.T?Tr.aTVCT. , PW , CTT *C 
IXCtJrlAjlVJJlS 1 l&JjJjr XJ_tiix XoXl JL V XoJLJLt w o 

G*ATNSNYAIi IGALRAVAQTIS YEVTIiA 
I ILLST^LPJlGCFNLiSTLMTTQEHL* LL 
LPS * PLAI I * FI CTLAETNRTPFDLAER 
ESELLSC/FNIEYA 


/Oo 


\ACJLn 

1400/ 




/ / J 


1 
J 




HIT T T .ITM VT /5n AfYVT? T P PT . PPT7T /1 17 

HLK/HKSCSVTQAKVQRRHLGSMQPPPP 
GFNQFSVAEIKDVHHHAQLIFYIFRIFI 
F* F/NFLRHSLALVAQDGVQWRDLGTLQ 
PLPPPG 


7A7 

10/ 


\AAfSL 




774 


1 

■ 




RTR Y A AMr AT/5P T PTiPTaP RT .TOT VT.A 

NSSLD I VLHDTDYVGAHFHYVIiS IGAVF 
ALIP\GFIH*FPLFSGYTLDQTYAKIH\ 
FTI IFIGANLTLLPQHFFGLSGMPRQYS 
DYPDAYTTW 


J oo 






77^ 




joy 


ARCS T CMjPOTRIiKTVT A Yfl S TRH TOT. W 

TAILIQTP*SLTGAVTLIIAHGLTYSLL 
CCLAN SNYERTHS \ RI I ILSQGLETLLP 
LITE* *LIjAPJCiANIALPPTINLLGELSV 
LVTTFS * ANIT 


/ 








O 


IS'? 


ARGTGA* VDS * LTTLHGSNMK* CAA\ LL 
\WTLRCKILFTVRGLTGMAITNSTLDIA 
LHDTYYWAHFHYVLSIGAGFAIIRGFI 
H*FPLFSG YTLDQTYAKIHFTI IFIGVK 
ITFFPQ 


770 






777 
ill 


■a 


*?s? 


HEGLHL * LPKAHVETPMDG* lALDAGLL 
RLRGYGI IHVTLILNPLRKHILHPFLVL 
S 1 *GI 1 ITSS I CLRQTDLKSLIAYSSI S 
Hl\SLEVTAILIRTPIi\SFTGADILIIS 
HGLTCS 


771 


14672 


A 


778 


367 


2 


FCPI / CPNQKFMGVGAVGPALYPNPFGG 
LGGLV/ TPGAGVLNPAGP PG * TP P PPKN 
PNLGGGPALFFP PLKGLGWKIALTPEAK 
GSINPNSPPALPGGGPNQTFSKKKKRKK 
LMLVYSIELTSRA 


772 


14673 


A 


779 


3 


432 


HEPLHWLSSCVCPACVCVCVCQ/CLWLQ 
VGVQ* CNYS S WQHEPLH* VHTRVYVCVC 
LCHSVAQAG VQ * CNYS / CTAACNS W 


773 


14674 


A 


780 | 213 466 DDI L P VWNYS IS FL FF FENRAFFL PPGW 
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1^=Threonine, V=Valine, 

w — 1 rypiopnmi, 1 — tyrosine, 
X-Un known, *=Stop codoD, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














GAGTQL*LPAASN*GPKGPSRIiTLL*W\ 

Uvi KJtlKsr rt/ uJJJM r L c LfiJ. AJUM Li 1 JjJulr KJuA 

LN 


774 






7R1 
tot 




17^ 


R* VFPPPPLKKEPHPPPO AGGOMOBMG * 
LHPLPPGAG/RF* CLTPPHSWDNGEGPP 
RPPNFCFFRGNKVSPCGP 


115 


14676 


A 


782 


»■> n a 

389 


1 


PFPKN*CRvNSRPNVKHKTI/QIiIiEGNL 
GASVDDLEFGDDFLDTI PKVQSMKBRS / 
WD F I KI KNVC FAKDNVKRRG PATNWKK I 
SVKDLSNKGLLPKIYEELLKLNNTET*R 
LT * KWS KVLNRQLTRETRA 


776 


14677 ■ 


A 


783 


1 


383 


GTS P * P LTGALS YLLMTCGLAM * *HFHS 
ITLL I LGLLANTLTI YPGRGDVSRQS AY 














QGHHTPPVQKGLLYRIILFITSEAFFFA 














GFF*SF\YHSSLSPTPQ3jGGHWSPTGIA 
PLNS L * VPLLNTWRLL 


111 


14678 


A 


784 


1 


389 


GTS I VI PTGVKVFN* latlhgsnmk* sa 
AEL*ALGFIFLFTGSGLTGIVIANSSIiN 
I VLHDTYYGE AH FH YVLS I GAVFAI IGG 
FIH* FPLFSGYTLDQTYAKIHIiT\IIFI 
GVNIiTFFPQHFLGLSGNA 


778 


14679 


A 


785 


374 


1 


ELNAYWNVMNLQNLI WNAQPLS IMQ I FQ 
ILIKSQIQNTLWSISDTGYLPGIDKWN 
CI * IiKI LCI VKVT INRARMPVI DWENTF 
S TYTNDKGLI P KYKELKHS \ KQTNNL I K 
KWAKGLHSHSRA 


779 


14680 


A 


786 


* 

.1 


363 


GTRL YHANTN * KKLRVAI L I SEK\TD FT 
VKKIRNKEGHYIMIKRSIL*EGITILYV 
GTPSNRWNYIRQKLIKLPGBIDBSTII 
LRDFNTPLSVIDASTRKKISKNIVESKN 
IISQLDLID 


#<t t\ r\ 

780 


14681 


A 


787 


1 

1 


361 


GTLFS SERKNPTWVSLNQKLEMIiCLIEB 
GMLKAETGHKIiRLIiQQ/TSQVVNAKEKF 
I*KE I KS AT PVNTRKTRK* DSLLADTGKV 
li VAC I ED*TSHNVIjFS *SLI QS KALTIiL 
NCMKPSRGE 


vol 


14662 


A 


TOO 

755 


1 


352 


V»TKNXAJxSl. SSJ6 rJj x KWNiKAWM 1 AY Jjf T 

AWCTBYFKPTVETY CSBGL/SLKHiLLI 
DNASSH* RALMBMYKQINWFMLDNRI S 
LUJPVDQ*VILTFKSY1JOTFHKALAAR 
DNDSSD 


TOO 


I4o83 


n 


TOO 


365 


125 


LLGRLRHENLLHPGGRGC/SHCTPAWVI 
E * DCLKNQTKTKAPRRVS ISHSA 


783 


14684 


A 


790 


217 


257 


WSGGVAHP VI P ATQEAEAEELSLGGRDC 
SELRSCHS \CTPA* VTQAGVQ 


784 


14685 


A 


Trt 1 

791 


2 


363 


IjAIIjHQTVS * FVHAKEKFWKDIJj \KSAT 
PVAI*MIRNLNSLYNLIMBTV*W*KED 

GKGAVE* KFEATRGWVMRLKEKFCLS/H 
HIKVQG 


785 


14686 


A 


792 


2 


362 


GQKLGLLHQTVSKFVNAKEKFWKELL\K 
SATPVDI *MIRNRTS\LISDMETV* W* 
KEDQTSPLIPIR* SLTQSKAVNLFRAMK 
PVRGKGAVE * KYEASRGWFMQYKBKSC/ 
RMCNIKIQG 


786 


14687 


A 


793 


2 


360 . 


ARAGS TMAFKNYAKS TLPVLYKWNKKA * 
MTAHLFAARVTKYVKS/ YCLNKKI PFKI 
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ing to nrst 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- '. 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A«AIanine 
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IN—Asparagme, r^rroune, 
Q=Glutamine, R«Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion* v=po$siDie 
nucleotide insertion 


■ 


■ 


- 


• 






*VFIDIVPSHPRALIQXYKE/ INVFMPA 
NTTSTLHPTDQEVI S TFKCYY FKNRFRK 


787 


14688 


A 


794 


2 


359 


AREKLYIiS ILTPLQRSLVI AFAATELTR 

vvtt ttittti i hpt ttsot iv roTn*nft\ nnwiT >T 
X IX Jur JsTo. L»JL Pi>laAJ. b 1 K* GD \^Pi£RLiN 

AGT YLL FYTL VGS L P LL I AL I YTHNTLG 

S LG I LLLTLTGQELSNS * ANNL I * LAYT 

IAFIVKXP 


788 


14689 


A 


795 


157 


365 


GI I EEKGYLPEQI FNAKCSGMISAHCS P 
RIaPGSSYSTSA* TDF*CQVQWYDLSSLQ 
PPPPRFKLFYLSDPSTWDYIIiAPP/RPS 
* *FVFFVEMGFRQTPE 


789 


14690 


~m . 

A 


796 


224 . 


372 


I FFT F I F I YLFTE WRQS FALVA\QAGV Q 
WRNLGSL* PTHPPPRVKRLSCL 


790 


14691 


A 


r-m i-i 

797 


482 


1 

* 


NSFFFFFFLFPEAKDHKDFPSPPFPVFK 
KLKGFC*APP * VQKP VFYLKS F/ CLKDY 
MLSFPESKVNFPSPPFL/SPSPLFMVGD 
PPLPPETYFSFCF*DPHSINFFIiPPGPF 
L FHFFFFLRWVLLCHPG * STAV* SLLTA 
TS\VK*FFHtiSLPSSWVYRCTSC 


791 


14692 


A 


798 


1 


415 


NLGGGGCSELRSYHCj3PAWATE*DSISK 
QTKTL/NKDHTRAGWERA 


792 


14693 


A 


799 


2 


401 


VQTGFHHVGQAGL * I»1»TSGNP PAS AS / Q 
SAGITGMSHRARPQ 


793 


14694 


A 


800 


73 


307 


PMALEHHGCGMCLDFLPTFGKSHCFVLR 
CAEMETRSFLPSWSAGA*Y\CLLQPPPP 
RFS * LRLPSR WD YRHLLPCPAN 


794 


14695 


A 


801 


87 


401 


SLIEIWTLKKSTCHNLYNVVIiS*HSIJCA 
I VLARHS /VESLTHVLKMCLV+NFVFAF 
S*RSLCFFLKISRPVVVAHVCNPSTLRG 
* GGHITRSRDRDHPSQHGE I PS 


795 


14696 


A 


802 


3 


354 


LRHYTP\PG*QSETLPFKKKKKKKKKKK 
KLSFFPPPKFLKKKKGLFKNPFKKKGKI 
FFNPPPPKKRGFFLIVNPPPKKKNPPPL 
GGGGP PKKI YF *KTFFFAPPPKFENPFF 
FFSPRF 


796 

+ 


14697 

- 


A 

• 


803 


1 

- 


830 

■ 


VETGFLHVRQAGLKLLTSGDLPTLAYQS 
AGITGVSHCAWLFFFF * * CIAVTQTEVA 
PS * LTIASN\ PGLKLS S FFTL PHHARL I 
FKIFSRNEVIiLFSR\PSC2TPNIiMQSSGL 
SLPKCWDYRCEPLYPABSLSF\FIKLSC 
M L KCLE V KCNDVCNIjL * NTL I KW I NEW I 
EGWIGRLTCVKQIEKNVICRI*VMLKAT 

FP VFVSVFVLRQS /LCFVAQAGVQWSHL 
SSLQPLSPRLK*SSCLSLLSSWDLRARA 


797 


14698 


A 


804 


389 


3 


F I WPF KGAPENS FF VFMWI KGHKCP KKG 

RFOGPFFFFFEMEFHS \ IAOAGVOWCDL 
SSLQPLPPRFKRFS CLSLPSS WDYRHAP 
PRPANF\CVFSRDEVSPCW 


798 


14699 


A 


805 


161 


21 


RGGGCLQSQFR/RRLRHENRLNPGGGGC 
TEPRSYRCT? AWARE * DS VSK 


799 


14700 


A 


806 


405 


82 


FFLRWTL/ DTVTRGGIQWCNLGSPQPP/ 
PPRFKRFSCLSLSSSWDYRRPPPCPANf 
LYF* *RRGFTMIARI*V1jNP * PRDPPASA 
SQS AG I TGVGHRAWPMP I FENRFDL 


800 


14701 


A 


807 938 


2415 


KITFWETFWITTVHPHLCKEREAIAGIL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
Dentide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1. 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- ' 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E^Glutamic Acid. F— Phenyl alanine. 
G=GJycine, H=Histidine, I=Isoleucine, 
K=Lyslne, L=Leudne,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, . 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=posslble 
nucleotide insertion 




• 




j 

i 


■ 




RRWS * NNW/ MIHTHTKNLNP YLTPDTKA 
TFK* I IGLT*RAKTMQL/LESFCQKENT 
GENLS DLG VGKDFLRHKKHGS I KGEKIA 

KI FAERVSDRGCVSRRYKEL * L/ IELKD ' 
NPIRKGGNNLNXVHQRI *MANKHMKRCP 
KS*VIREI * I*TIMRYHCILPRMAVMN/ 
SD* SHGDKNGGSSGTLIH\CR* ECEMDQ 
LTjWKT IGOFLSK* M * RC3TTDS S I P FL 1 1 

QEK*NCISTKKTCTQTFTAA/MYLLVIA 
KNWKQIiPyPSSVWI/QQIWCIYT\MEYY 
S AI KRTVDTLNN\QMDLKI IRFNE * S/H 
NLRRVHCMY 




X4/UZ 


a. 


5U5 


414 


I 


«v rftMMip VrVKy V V iyitW^&lW ' i TV r. J-t, rv >-t 

PLN \RM I RKQTSLTADTEQV* VI W I EY* 
TSHSIPLSQSLIQSRVLTL\SMKAETGE 
KAAEENLEDSRG *FTRLKEKSHLHNIKV 
QGEAVSADGEAAAGYPEDLAKI IDER 


BUZ 


147UJ 


TV 






l 


/CNFFFFFFFTGAFLSLFGI \LV*CLML 
VWGNFSP* PFKYCFTFSFFLSFI FLIFL 
WLLLNTCL Y ICYTFCiS YLTVL * CS FLFS 
S TFFTiLYF* FWRIiIjILIHPPPTRP 


803 


14704 


A 


810 


l 


392 


I LLAGAI EDAEPGSG * TGYPTLTGS YSH 
PGAYVYLT ILSLHLAGDSS ILRAINCX \ 
TTLINIKPPAVARYLTPLFV* PVTiDTAM 
LLILSLPVLADGVTILLTDRNLYSTFFD 
PAGRGDPI T.YOHLL *LLRH 


804 


14705 


A 


811 


66 

■ 


411 


RETRAGAQGLSRFS PPWGNPSQKI FLGP 
GFKKNPGPQKKGNFFFFF*KKFPLVSPG 
GRGGGNFRSLQGPPPKVPPFFCFNPPRK 
GG* /MGAPP 


805 


14706 


A 


812 


3 


392 


LSVPQFPPFFLKKTFRNCPPFCFSPPNI 
LGFPLKGS KK* GS FFPFFPPPNYS FFWF 
P S Y<3 F * NO P CI/TRGP F * TIOKFPR'L * KGV 

PILRL*SPLFFGPPPHSF/IFFFFDRVS 
LGHPGYSAVAQS/LLTAASS 


806 


14707 


A 


813 


690 


153 


IISIDAKKAFYEIPPPSVISSSSPTDSL 
YYNIiG FLKNFKKGREVKYLNI I KYI YEK 
PTANIIFISGKLKAFSLRSGTROICPLS 
PLQFNKVLEVLARATWQEKEIRNIHIBK 
EEVKLSLFVSNILC IENPLKMP * KNS SN 
* * IQRVAG YKTN I * KI S \AFLYTKNKLF 
CCLSHPIYDIPL 


flu/ 


LH/UO 


n 






J fO 


Ii FGIARS Y I TEGGRLPEWPTIPHG * REF 
WELCNKCD \ TMRPKPS LHCSRCGHCVTR 
MDHHCPWINNCVGEDKH* LFLQLCFYTE 
LLTCYALM I S FCHYYYFLPLKKRTLVRN 
VYI 


808 


14709 


A 


815 


2 


423 


YPLYPFKIFIFPKGFNFCREVGPICPPP 
KIKVLSKNSQVGFYTAPYKEKNNTLPAR 
VNFGPPKDSLKRPPLFFFFFF*RDGILL 
CYPSWS*TPGLKQSRCLGI.LKHWDYIC* 
PIJIPASF*KKHYSLRILP/SLLAbAW 


809 


14710 


A 


816 


404 


54 


SLAHFFPPPKKGFFPKIPRGVL*PPP*K 
KKFLFS/HPPVNLGPPRDFLKGPPPSSS 
SSSSSSSSSSPGG*MIFY*/P**FGPAIi 
VQPMACYCKASCKVKAPFFSTHSGLMVA 
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SEQID 
NO: of 
nucleotide 
seat! en ce 

• 


SEQID 
NO: of 
Deotide 
sequence 


M 

cth 

od 

- 


SEQID 
NO: in 
USSN 
09/515,1 
26 

" 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

neotide 

sequence 


Predict- 
ed end 
nude- 

MM MM %i •* 

otide 
location 
correspon 
ding to — — 
Jast amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Aianine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F^Pbenvlalanine. 
G=GIvcine, H=Histidine, f=Isoleucine, 
K-Lysfne, L=Leucine, {^Methionine, . 
N^Asparagine, P=ProIine, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W»TrvDtoDhan, Y=T\rosine. 

v mm ■ j mm » w mm mm, mm m m m> J m will! V v# m\ 

X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPTTAQS 


810 


14711 


A 


817 


3 


410 


DAEVGGSLEPG\SGGCSEP*SCHCTPAW 
VTG*ETVSKKKKKKKKGFFPPFFFKRGP 
PFRE IEKKFGKKFFS KKKKKKKKKKRGG 
P FKKN FNGGKNLRGG KKKF F F FLGGE KK 
KPRGFFEKKTF FWGGKNWATTTQK 


Oil 

oil 


, 14/1Z 






i 
i 




DNLTRTI PI *NINF* I YKQNTGPDGS IS 
KFY*TIRKT*Q*FYTSFQKIKNEEVLPN 
S FYEASTPLAS PKKKKKKKKKK 


Oil 

812 


1 Alt 1 

14713 

i 


7\ 
A 


©19 


421 


*> 


T.DDlfOPPPQDTf DDRf2ti , PPIXPQ*fiinflJT / 
AJtrJr JVTvivV».r Ox'IvJrlrjrior rrJvrO vHVJM*^/ 

CFPPR*NLAPPGVFLKGPPLFFFFFFFF 
FRGRVG*SWVPAASPLQGQVFLPPQ/SS 
* VGGSPGPPEHARVNFFFFFFPFF*F+ * 

GLQ 


813 


14714 


A 


820 


84 


418 


ILTCHNARLGEKSQIVTMQASQNNPKLV 
NICEVFLVFFFWKP I FFGTQVGGQYFNL 
SSL*VRLPGYTHFFCLTLPKSWDYRPIP 

P«3 V P T PFPPT7NKTK3VSPGVP(3Gf!fJrTTiVP 


814 


14715 


A 


821 


290 


3 


FSPPPENIiGPPRNFLKGPPPFFFFFPPF 
FFFFFFFMFCVLIY\FIFFCTNliLFIiDF 
CIT* +T*NWFISSKAVCLFLFLPLQPAS 
QKTKNRCRKSFF 


OK 

815 


f Alt £. 

14716 


-M. 




416 


•> 
2 


YTAWLRPQSMITLKDSSERKAPHWGEL 
NW\HVHHFIWKER* PEV* I YIES *AID 
GGLAS WS GPE KNKI GKLGAKKV *RRDMK 
TDP LECTQNIN I FGFVCLFGDRVS LCW 


010 


1471/ 


zx 


oZJ 






PPPPFLPPP*F* K'FTK'TCAPRRflSOGPKF* P 

PPPPKGKPPFFKKKKKKPGGGGGPPNPP 
PPKGGGQK/QFFPPQGGNFNKPKSPPPQ 
P PRGOK3GCPP POK2CKKKKKKKKRLNVTC 
PQFLQLESGTY 


817 


14718 


A 


824 


1 


408 


I FS EHNG I KL * INTKRNFRNCTN I WELS 
NMLLNNH\ * VNKEN*NKKFI KTNE/NKN 
ITCONL * DTTKWLREI YSKKCLLOKKK 
KKKKKKKKKKGGGALKNYIiGGGQFYGGE 
GNFIFFFWGGAKKPCGWIFRRRPFFW 


818 


14719 


A 


825 


259 


274 


YL* YLLL YRRYSA* I PL/LGYYLKKRKL 
LYQRDTYTPMFIAAIiFTIAKMWSRP*SP 
W 

WW 


819 


14720 


A 


826 


422 


105 


FFFSKIPWGNFFPPPPKRIFFPPFPPKI 
FFFPPPPFFFWGGFSPFSPPPKKVFFPK 
SPPVFFFPPP*KKKFFFP/HPPLIFPPP 
GFFFKPPPPFFFFFFFFFFFFFFW 


820 


14721 

- 


A 


827 


15 

- 


402 


IKSLGKNIGINLGDLA^LNNGF*YTLPKA 
OATKEKIDN/MDIiAKAKNICA*KDTIKE 
VERQPKK* QQTFVNHLFDKGLVSKHV/N 
EL*LINKKTNH\HL* *WAKD*NRACSKB 
DVHMTDKLMKDVP SLVIRE I 


821 


14722 


A 


828 


420 


3 


KDAHTCPPGCKGG FSGHRALKAI CRNQG 
FLLTTTEFLTHKCPLPGQEPWGQHQG*R 
DS PRRVS P APGTWQ P PCHRERACLAPSA 
VE/GPS * IQEQEKSLLFFFFSIiLRWSL/ 
NSVAQVGVQWRSLGSLQPPPPRFKQFSC 
LS 


822 


14723 


A 


829 


1 


269 


I KI KNLARHGDS CL* S PLLLEMLRLEEN 



572 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: of 

IIUVlM/UUC 

sequence 


SEQ ID 
NO: of 

pcpuuc 

sequence 

• 


M 
eth 

• 


1 SEQID 
NO: in 

09/515,1 
26 

- **i 


Predicted 
beginning 

II U LICVIIUv 

location 
correspond 

ino tA fircf 

IIIk IU 111 31 

amino acid 
residue of 
peptide 
sequence 

* 


Predict- 
ed end 

II UVI™ 

otide 
location 

rnrrp^nnn 

Cvil 1 wUUli 

ding to 
last amino 
acid 

residue of 
peptide 

sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D=»Aspartic Acid, 

G=Glycine, H=Histidine, I=Isoleuclne, 
K=Lysine, L=Leucine, MHMethionine, 

11 iUUHl ItKlllCj M. * • VII •■■ 

Q=Glu famine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan,Y«Tyrosine, 
X=Un known, *=Stop cod on, /-possible 
nucleotide deletion, V=possib1e 

nucicoLiuc insertion 


- 






■ 






IiS PGSRGCS EL / CQPARQGETLSQTN KQ 
TNKKPHQVQLP CFE I INFKEFI TCLVLR 


823 


14724 


A 


830 


432 


1 


G YNNQQI FDVDQTAFY * KKI P SRTS I AR 
EEKSMLGFKASKDRLI LLLGAKVAGDFK 
LKPMLVDPS^TPRELNN*ARCTLPVL/V 
KWN/ KAWMTAYLFTV* FIQYFKPTVDTY 
C/FKILLLIDNAPGIjPRAVMDR*EBSNV. 

Vnir YIN 1 J. Jj 


824 


14725 


A 


831 


1 


233 


KKLQ I KYLG INLMKYT I HWS I CQ KLLM 
KEINEDLNKWRDVLCLWI*RLNNMSILH 
KLIN/RINLLQLRI 


825 

« 


14726 

t 

r 


A 


832 


3 


477 


YQTCKEDLTQMLLKIFQKI KGEGI I PNS 
Li YD AS I AMMP KS DKDRTKKF \N xRF 1 S I* 
MS I DAKI LNKI LPNQ IQQHI QKFI FI PE 
MKG * FNIRK* INLIHLITKMKTKTPMI I 
SIGTEKVFDKIKY/PFPITTLTKLGIDG 
RSLDAIMVARE 


826 


14727 


A 


833 


3 


402 


RYyTPHj V \ * TVJjITTaUjliGIjStiPVIiTG 
GIT I PLTDRRLDTTFFDPAGGGDPILYH 
HLF * FFGRPEAHILILPGWGI I SHIGTY 
YSGKKEPFGYTGMR*'AIISIAFLRVIV* 
AHHI YTV* IDRDTRAYLPSLHN 


827 


14728 


A 


834 


1 


395 


DFOKVoCMAJjSbrJor r rife WaKKbJjr r P 
GREGRGQIWVNGTPPLQGKRNSPASPPG 
GGGITKKAPKLHPPKKGWGGGLEKPP*N 
QRGGFGI * SPGGKKGFAGG* KQGQKSKG 
GGKKKGRA\DP*AYIPLK 


AAA 

828 


14729 


A 


835 


At 

41 


AAA 

444 


r\T3D\rD VVCDDKIT VDGU0TVPTT7C\7T*MT? 

UrKVKlUvxiKoJNJjrKo VoAivbXr»bVJ.*«r 
PTKKI *GLDGITSEFIQIIiPKKKPK\ IK 
KERTFSNPFDKTNITLI /SKPETTFTKK 
EN/VRPVSLMNI / DCKILLKVLANCTL\ 


829 


14730 


A 


836 


3 


415 


HAYY I VKPSP / W I P KGALS ALLKTYGLT 
M*MQFQSIKDLRLGLLTNTLTIYQ*WRD 
VTRKST YQGRHTPPVQKGL * YGI I LF IT 
SEVFFFAGIF*AFYQSSLAPTPQLGGHW 

PPTfiWTPT^MPT.PnPTT tNT^OTJ ^ITflUQ 

tr tr 1 VjJL» 1 IT J-lLMirUIZjJJJr XUTl X &\31ALA3 X\3 vo 


830 


14731 


A 


837 


378 


0 


TP PKGPGGKI FLKK/SPGRKI F* PPGNG 
/ PFFS PLSPLKFFFFPKAFNFWGGGGPQ 
GPPPKKGVFSQNPPPGKKRPPQKEKKNF 
FP/PPGKMGPPQGFF*RAPPP 


831 


14732 


a 
A 


838 


383 


38 


SPLKNFFFPQGVFFIiGGGGPKRPPPKKK 
GFFQKTPRGF * I APQKKKKKNFPPRGKF 
GPPQ /R I F * KGPP P FFFFFFFFFFFFFF 
WSL 




14733 


ii 


835/ 


59 


418 


AEI IDALIFPFNLPIWLHKKPDCSWKVS 
VYCKLK*WSLISATGLDIINT\LLBEM 
KRVSGTWYIATDLAKTFFYI /PDQKBFA 
FSWYG*KDIF 


833 


14734 


A 


840 


5 


335 


IRHLF* INKFENLDKMDTT* KLTYEGIE 
SLN/RII*NF/PWKSLGSGSFTSEFYS 
LFKEE X TAI YKP I WRVEKGGI \ LPTCFA 
TRI TL I P KPETC I MRKKNCGPVLFMNGH 
F 


834 


14735 


A 


841 1 


38 


RRLKLEDHLSLY*KINSKWIRELNIR/P 
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SEQID 
NO: of 

mm\ mm jkIaA m ■ ^4 ^fe 

nucleotide 
sequence 

"i 


SEQID 
NO: of 
peptide 
sequence 


eth 

00 


SEQID 
NO: in 

09/5154 
26 

• 


Predicted 
beginning 

mm mm. mmm Jk JkAa J jm. 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

mm mm mm B A 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A—Alanine 
C=Cysteine, D=Aspartic Add, 
Hy— ijiutamic Acid) r^rnenyialamne, 
G=Glycine, H=Histidine, I-Isoteucine, 
K-Lysine, L^Leucine, M=Metfaionine, 
N°Asparagine, P=Proline, 

/^fc A*~* 1 _ fc X _ , _, *_ -fc Ta_— * A ' m^jmw i mm mmm, *m O Q | 

v2=Glutamine, K— Arginme, iy=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 




■ 










ET I KL V KKD KGNV \ QDTD PGKD FMVKTT 
KTKIDKSDYIKLQSFYTANETVNRVKLQ 
PIE* GKI FKNYS SGKGLLSM I YNELKQQ 
HRNNNL I * KLEDHLSL Y 


835 


14736 

* 


A 


842 


2 


424 


W 0*m mmm mm Wm, W &m A 0%m* mm* *m. Mmm mxmmm mw mmmmm *m - " a+m>m^km « mmm\ 

GRVGS I PLSQSLIQSKALTLFSFMKAET 
GQEAAEKKVEANRGWFTRFKERSGLCNI 
KVQ /GAG GDTAAS S P ADPAKINDEGGHS 
/KQQ I FSGNETAF Y* KKMS S GTS IAREE 
KSMPIFKASKDRVTLLIGANAAGNF\KL 

m* m mmm. m mmm 

KPMP 


834 


14737 

■ 


A . 


843 


281 


3 


FFFFFFFFFFFFFKTFLFFGYWFSIFFP 
LLFSISUiLSPSGSFVTFAFQPFF*VSR 
YHYQHFNSTF*FL\KL*PIF*IFFLISI 
LFLFHGYNF 


837 


14738 


A 


844 


409 


1 


RCGGTCPWSPVLRRLR*KDHLSLRG*GC 
SE P * LHHCTPAWATE\DSVS KKNKKKG I 
S CRQHI VGS FF FHLKMCLLNGLLSFFTL 
NVI IYWEFKFTILLFLLYSICSLSHLP 
SFPAKSWINSKFESLFVCFNSFS 


838 


14739 

• 


A 


845 

- 


63 


451 


KNQEQESEETLPNLFYKASITIjIPKIiDT 
QK*KKRKEKKRKKKK\EN*KPITHRNIY 
AKI LNKI LAHQ IQQ YIGKI IHHDQVGCI 
P/GAFDKIQYRCMIR/TLQKMGIEGTHL 
NI I KAI YIRPTDS I I ENREKP 


839 


14740 


A 


846 


60 


460 


RNNEPFLDQI VTYFEKWILYDNR* * / PT 
QWLIXJEEGPKHLPKPNLH/QKKVVVTVW 
WSASLTHYSFLNP\ETITSENYSQEIDE 
MH* KLQSLQLASVNRKGQIPLHDNPRLQ 
VAQPVLQKINEIiGFEVLPHPPYSPB 


840 . 


14741 


A 


847 • 


344 


2 


NFLKFHPGGETFIiQKGYTGYFSPFTP* K 
FFFSLKPLNFLGRVGPFFPPPKKGFFSK 
IPQGGLISPPLRGKVFPPPPRLNLGPPR 
VFLKAP/HPFFFFG I \ PHGLQPRPMI * K 
PTRP 


841 


14742 


A 


848 


439 


132 


RRVAAPPPSKNIFFPPGSYNCGGVWPQK 
SPPPK*GFFPKP\SGVYKNPPQIEKVFF 
FPHARIVPPPGDPIKTPPPIPFFCFNSI 
NFS LRGAQLYVWDFLGSGC 


842 


14743 


A 1 


849 


94 


470 


V ^ mm* m mmm* mm *mm mwm mmmmr 0m mm^^^mv 0m mmm mmmmmm mm- M wm 0tm 0m mm m*mtmw 0mm mm 0m mmm 

LNHFLSFFLS FFLS FFLSSGSLSItSLSF 
FFFWKGSPPPP*GTK/QKPCLKKKKKKK 
GGGGGQNFFT PPGG KKNPPPQGGGGGGS 
PAQKKKP / BGGGP PGS KKKNPGGGFKKK 
KPPKKPRGGGGKPFFP 


843 


m A 0+ m A 

14744 


A 


850 


2 


*m. 0* ^ 

396 


*m\ tmf m-m m*m^mr mm 0m. m mmm m> -m m mymf mwmmwmm] wwt m* *mr am w mm mm- Mmtm mm* 

FFKKENIQMANKHVKTYTTLLVIREMKI 
PTSMRYHFTS IRVAKIIiKTDNTRC* QEG 
RTIRILMLC*LQNKMVIiLL/WKNWQFIiT 
KIKI FLF\ FDPTISLLGI YPREMKTYVH 
S S F IHGNS KNNRTGNNLNVHN 


544 


14743 






Soy 


io 


Q VCL W OlS T t .1? V DD T .V *> T?\7TtlT .DTD Q D DT /Z\Jl 

KMP/FNYKGGPGKGISLLIFPKKKFIPP 
GGHPPFPFP/PFFFMDRVLLCHPGWSAV 
AVSQLTTTSASRL\K*FSCLSYSEG*DG 
RIS 


845 


14746 


A 


852 


1 


367 


PPPPKIKSPSGPGPPSKRGSPGAPPRGG 
KFWFLSQKGWGGP* FPPPPGVKPE/NIP 
* PPRGRVPL/HLNLAPAPPPGPPK/PKP 
PSP 


846 


147 47 


A 


853 


1 


378 


AMLATL I SNS * PQ / CDPPALAPQS AGIT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

- 


M 

eth 

od 


SEQID 
NO: in 

IIC CM 

UoolN 

09/515,1 
26 


Predicted 

beginning 

nucieotiae 

location 

correspond 

ing to first 

4"k W* V Wm. m*\. *f*k t+\ 1 fl 

ammo aciu 
residue of 
peptide 
sequence 

i 


Predict- 

ed end 

nucie- 

otide 

location 

correspon 

Aft k V*. mfV 

oing to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
iv—vviutamic aciu, r— rnenyiaianine, 
G=GIycine, H=Histidine, Wso leu cine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N-Asparagine, P^Proline, 
Vtr^vriuiamine. iv^Arginine, o— oenne, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, V=possib!e 
nucleotide insertion 








■ 

• 






GMSPCARPTSCSFYP*ILLTVRTFIKIR 

QPPGPKKSFNLSLPSRWDYGGAPPCPVI 
LCIFCKDEVSSCGSG 


847 


14748 

a * 


A 


854 


369 


1 


SACFGLPKCWDYR/ R* ATTPSLEI I IY I 
ER* EKTTS PDNANRFDVTKLHHYDCS VM 
DF*KAIj*NGDDFPVIjMS1>1w \ 
FVFCFLRQGIiALSQ/AGVQWRBHGLPQP 
QLPRLQPSSHLSL 


848 


14749 


A 


855 


380 


335 


NPGGRGC SELIiLCHCTPAWAT \ S ETLSQ 
TKIQP KKQE V*LL 


849 


14750 

* 

■ ■ 


A 


856 


.373 


1 


LRI KILNKKGLANLI P * *R * KI IHSQVE 
F I PEMQA* FI I RK/ TPS IKWP I VRTKKK 
KKSQMLISIDTEKIFDKIQHLFLVKPVI 
KIjGIEGTFINFIKSIY\KKSTTATIISG 

IjNAS plrlktrqdf 


850 


14751 . 


A 


857 


369 


44 

• 


KVQNLYFEKKLLNKI KVDLNKW/NSLCT 
RFRKLNIANT IMFKIiIHRYN/ SKVKI P I 
KI PANCFGE IDKL I LKFI * KFKRPQ IAK 
1 1 LKKNKDGY I L PDFKTYYI YMQYHLC 


851 


14752 


A 


858 


1 


348 


W r.W V -H W*K mm. jmm J mm. mm, ■mmm, m-y. mmm. 7-^ f > _A_ -mm -mm* mm mm, m* fMtT 

QWHAPAS / LARPPPPRFK* FSCLSLPNS 
WDYRHAP PRIiANFVIiFCFVG * GG IHL 


852 


14753 


A 


859 


1 


366 


CTSPTFNQKLEM I KI*/ EEGMLKAETG * K 
liCLLCQKV/QWNAXEMFLKE I * SASLV 
NTTMI \RKQSFTADMEKV* W+ IEYQTS 

HNI PIjSQSLIQS kaltlyssmkaergee 

AAKEK/LEASR 


853 


14754 


A 


860 


12 


366 


P S TLGLRRASCLSLLSRWD * RHMPPHPA 
N*KNFFCRIX3/SLtMIjLRLVL\KSWPQA 
ihspwppq 


854 


14755 


A 


861 


424 


78 


NPRPLEGQTASFSIiAPI^*TTIjGKKVNP 

ff/slknppppppppkiwapqgpfgpps 
pgglnggfflpprvkapiihkgapppqf 
ppqkkkgprvfpkkkkkkerkkekgmtr 

AFCYP 


855 


14756 


A 


862 


327 


3 


shwffaav\greism*amapdqtkkxcp 
rsaedai kyfltqatgs i il irailfnn 
rlseq * s i tnttnq yssli 1 i mai ai kv 

GMAPFHF w VPbVTyvsSFLi I SUIjJjVLi 


856 


14757 


A 


863 


1 


341 


ydrnkwkdipgs* ierlnivklsmlptv 
iyrfnvilikipmtffaeteksivkfsr 
dye*pkqs*keqnwkthtp*fq/qfftca 

TVIKMVWY*NKDRYIY/DQWimi*SLGI 

1nPL.± . 


857 


14758 


A 


864 


32 


324 


LVWPFFFFFFFF*KKKFFFFPPF*RKGK 
NFFFFTPPPPGLNLFF/ CPHPLKKWEKR 

PPLETPPLTPPKGLN 


oDo 




A 

n 


(jOJ 


■ 


J 


RVKNPRPFWGF*MVLKPLSFFSKKTN* I 
LFPLKIFSPPKTVPWGKIFLGAL*NPFF 
CFKNPLWFFGF*KL/SFFFPPLYFF*KP 
LAPLKRRFS FFFFFFL* DGVSLCRPGWS 
AVARSR 


859 


14760 


A 


866 


342 


118 


GSVTQAEVQWCDHGSLQPETPGIK* S/H 
PPTSSYQWGTTGVCHPGLAMLSRLVLS 
SWPQAILSPWPPTVQGLQV 


860 


14761 


A 


867 


1 


354 


VKPSP* PLTGALSAJULMTFGLTM* IHFH 
SITliLILGLLSNTLTIY\Q*W\RDVTPE 
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Amino acid sequence (A=AIanine . 
C=Cysteine, D=Aspartic Acid, 

E=(*lut»mie Acid. F^Phenvlfllnnine 

G^GIycine, H^Histidine, I=Iso leucine, 
K=Lysine, L=Leu cin e, M=Meth ionine, 

N=Asnn rapine P = Pro!lne 

11 119110 1 flEI II V, J XI UIIIIm 

Q=G I uta mine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W«Tryptophan, Y^Tyrosine, 
X^Unknown, *=Stop codon, A=possibIe 

nurlpntiHp ftplpfinn 

UUUCVLIUv UvlCUUIlf r^|lwo9IUM5 

nucleotide insertion 








■ 






STYHGHHTPPVQKGLRYGI ILPITSBGV 
FWAGSV* TFYRS SLSPTPQLGGHWPPTG 
ITPLN . 


■OOI 


> 


A 


ooo 




1AA 

+J t ¥ 


OIKNPDNI SC / W I RCG ATGMPI1HCWWKC 
KLLQPLWKTT*HQ*SRR*AVPIiSGMFP/ 
NRYS STCTPAS I * KTFLSALFMMTPL/ C 
LSRVEWIK*SWCNQKL*Y*SVMKRSBEF 


862 

■ 


14763 

i 


A 


869 


345 . 


1 


KQAGLKNVKITG / EGASVNQSR* VS / YL 
GT T TOTT TPRTfHVT ,P P P. VPNAO A QAVT? * d 

KKLPQRTFIS K/ EE KQAP /GSEVGKDRL 
TLLFCANMSRFMISTALLYKAADIiQSLK 
GKDKHRL 


oOi 


14/04 








AK 
4j 


SLPKC* DHRC/AA/ VPAWFQRCILEKKN 
AT Y* PAERTIi* SPR PHTRFTTiDQSTVNT T« 

PCPLSKVI FWKKKNQI Y 


864 


14765 


A 


871 


209 


72 


KLWDIHTMEYYYSAIKRNKPLKYTTWMD 
LKDIMLCEKANLKRSHTV 


865 


14766 


A 


872 


3 


353 


SFFLGPPPFFFFFFPPPQPLPIFFWRFF 

PPPCDDWDDD /PPPT.*f2T!CPPODP+ , D/l 

FFFF*KKKGGVFPPPFFFFFSPKVSPPP 
PKKGGGG/ PPPPFFF 


500 


I47o7 




oVJ 


3 


2o7 


G*GGW ITRSGVRDQPGQHGEI 


867 


14768 


A 


874 


1 


346 


PRRPFFFFFFFLIVFIYFKCVYLF*DK\T 
LVCHPGWSAVPHHGSLQP* PLRIKQSSH 
LSLSSS/WDHRHAWLVFVSFYRDEVSPC 

TAMPPTPTi*PAnPQMT.PROyVT.VATPT,P ' 
umns lit jj " t\\^L*r oniirKy/ v juxvtVXV^lix^ 

WPPSL 


868 


14769 


A 


875 


1 


172 


KLLSSGS PPASASQSAGITGVSHRARPG 
TtTiKrppr , *r , APQVpnpr , T*rvDnTPTH /t.p 

PPSFHQSVIASLDFSCLS*L*MSP 


869 


14770 


A 


876 


1 


34$ 


LLFCFNNYSFFHGVKPKIPAFFVFGPGC 
IjPHPPP P T TP AP P P PT iDRVTjTjPPPGWRP 

MGPF*APPTFSPQG*GVFPPPPPQ*LGP 
RGPP PQPGVFFFFCI FGRDSA/IAILPR 
L 


870 


14771 


A 


877 

■ 


1 


203 


GFHYAGQASLEVLTS *STCLSLPKSWDY 
RRGPP / * PGLSYFIiYPSLRS * F I CAMS T 
HIPFIKKKKASD 


871 


14772 


A 


878 


343 


102 


EWED CPS PGGRGCSEPRSHHCTPAWVRE 
TJjS P KKYLFLOP * KLKNTKI YLS I S IP * 
ELET \ ILKI LHFKMMNPLHS YNFFF 


£72 


1477"? 


A 




1 




RDI TKGDMOMETKHM * SFS1SLTSGI \H 
NSPIRPSKMKNIDSTRYWRGCRAIRALI 
HC!WWRC*MVQPFGE*ISSFSKKLNMYLS 
YDERPTFRYEK* K/ P * VHPKI CV*MFMA 
AFFLISPNW 


873 


14774 


A 


880 


205 


1 


FFRGVTEGL * EPP YVES VI / AGGTTARR 
PLFFFFFFFFFFLRWS FALVAQAGVQWR 
DLGSPRPPPPGFK 


874 


14775 


A 


881 


150 


2 


CRARVDGVPWRNPGSLKPPSP \GSSDPP 
TSASQECGITGAHHHTRIiT*VF ' 


875 

* 


14776 


A 


882 


345 


1 


KGNQPWKTEKRAIiFC/TLKKKKIFKNSP 
PRGNF* KFFVKKKI PLKKEKG FS PTWGK 
KKEFFFKKKKKKKKK* *AKDLNRNFSRE 
DVQMAKKHMKRCSTSLIIREM+IKTTIR 
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C=Cysteine, D«Aspartic Acid, 

J-r— v>iu mill iv> /vim, i" * IIC II J lit 12x11 lilt?, 

G=Glvcine H=HistidinA I=Tcn taurine 
K=Lysine, L=Leu cine, M=Meth ion ine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T^Threonine, V^Valine, 

X=Un known, *=$top codon, /-possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














YHF 


876 

* 


14777 


A 


883 


2 


355 


DRLLFSASHLDLGTbYIiLFGA* SGVLGT 
SF\SIiLIRAELGQVSGCLI»*NDHIYNGI 
ETAHAFVMDLFI VRPII IGGFGN*LGFP 

ASAIVEA 


877 


14778 


A 


884 


262 


2 


PTCEQSEREIKKTIQFTTESK* IKFVGI 

♦KNI I YV\SWTRRFNNFKMLGWAQWIiMP 
VIPA 


878 


14779 


A 


885 

i 


16 


318 


ILRADCADLFFFFFLGGKKGFCFPRC^" 
RGETLLN*TLIFR\VKGNF*PKLPKIW/ 

LVLNS* IKEFFHLWPPKGVA 


879 


14780 


A 


886 


298 


322 


KRRTP * Y PPGE PHFP PP PPQG * KARGAP 
PPPPKKGFF\PEKKKKPGVMVQGTTQKP 
PxvR.1 K.VKV1J 


880 


14781 


A 


887 


353 


3 


FPFWGFLKTLFGKALWLFKGFFFFAHQK 
FFFLPSLPPPP\LGFFLRGLT* FKIPSP 
LF* KGAPQKGISHPLFFFLISPKPPPPF 
tfrlfb vc PFfirr r v Fr r crerS \RUKVLiL> 
YCPGWGA 


881 


14782 


A 


888 


22 


341 


IPCTCLKLHGKVHDHDQSQLWWPKPQEE 

MUTT TPXTRTT TVlfT ACTf r\MT T\TPT T3VT\"nl\ T 

ICAREIKTYV\QQNCT*TFMLALCITAK 
KW/ KQLKCPSTDE WVSRMWYTCTR 










51 




GGRAWAI FKI FEFLA\PK* KQFSCLTLQ 
TI * VYGLNP P P RKNFVFLKERGL FHVGQ 


883 


14784 


A 


890 


1 . 


225 


GRLRPENHLNPGGRGCGELRLCYCTPAW 
VTERDTI S /RHTHT *NYFY *GS I *QLAN 

o v. \_i* 1*1 uiN xvc n v x v» x unn v* 


884 


14785 


A 


891 


357 


164 


GKGCSDSRLCHCTPAWAT\TKTLSQKKL 
I LKKQNKFRS * LDD * INMRCTVWPCFHV 
FLFIRAAPIiFSDWLYNK*MNRNT 


885 


14786 • 


A 


892 


207 


302 


EPFSGI I INESIHQEGIIVLNVYTSSNR 
THLLVIDRTSR*KR 


886 


14787 


A 


893 


86 


332 


VMRVSCCCLKD* ISLCHPTWSAWQS*L 

TVASNSW\ VKQS SFLGLPALWEAEIiGGS 
IiEVR5>IjP. PVWAT*TP\ FPTfAnrifT. 


887 


14788 . 


A 


894 


2 


336 


FFFWPPPQTFLKNPGGPGPKREEGPGGS 
PRGQKQGPPGLHLTGFGGQPGG* KNPEP 
GPGGRAPKGETRGEGGGPTPPOT P /r»TiT 

NGKSGKPPKVTItNGAWGTIKIFIilKTPV 
G 


ooo 


i<t toy 




©SO 


i 




aRFQT.aQVT.PPCJT.pppT.T/^T PPT.PQTTT P 
iUULoiiAor l-»Jtr rDUr re l.i 1 Kjxjt\.r Jjir oni-ilr 

FYFSFLPSFLGFGSFLPST*VPSFLPSF 
LLFFPS FLP WLWFLPS /FLPS FLPSFLP 
SW 


889 


14790 


A 


896 


131 


352 


TLHESDSESVPRDFKISDALAVEDDQRS 
PGTLNAAELS/SSVRERKKKEK/KPEPG 
L*DQSIKESDSYMVSGGRIQ 


890 


14791 


A 


897 


3 


184 


CSVAPAGVQCHDLTSLQSPPP\SSSNPP 
TSSNPPTSAF * VAGTTGMCHHAWLI FVP 
LVDAA 


891 


14792 


A 


898 


327 


264 


NRAVSLMNLDAKVL \ KLISASQ I YVYIK 
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OCysteine, D=Aspartic Acid, 

f,— CZltttnmir Ariff P^PhpnvlalanlttA 

JL> VIIUUIII1II* /%1>1U, A A UGUV|a|jt|||UC| 

G=Glycine, H^Hfstidine, ]=Isoleudne, 
K=Lysine, Lr=Leucine, M=Methionine, 

N=s A enaraainp Ps=PrAlinfl 

Q=Glutamine, R=Arginine, S=Serine, 
T=Th reonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 

Y— 1 T n If n n wn *s:Stnn rnrinn /znncclHIo 
/V %J II iVUU W1J j — ljIUJJ vuuun, r*~pussiPlc 

nucleotide deletion, ^possible 
nucleotide insertion 




■ 










KMIH/HNQIRFTMRM*GQPII*TAINIY 
HKLIK*GRKHMIPLIYAVKIFEKI*YHN 


892 


14793 


A 


899 


3 


313 


TKAASHSQ \RANLQFLVGRIH* HLKS RT 
I S PGRVGATAAVNS TAI LEYLTAEVLEL 

.HJvLN I\X/F X V Jul 1 irnl Jjy x\.VJiJ r> rS 1 1 

GSLFKSTIAGKGVIPHIHKS 


893 


14794 


A 


900 


340 


1 


HLCPHKEHLY / LLQTSCPIAI I * FISTI 
AEHTKAPFDMAKVESELVSGFNIEYAAG 

DPfcT TTT? T Ti'CVDTVTT T TTMTT. r nrrT nT.r , 'o r n 
rr.rt.Lir r lAiiXrrJl J.111n1Jj1 vile i-Ajf 1 

YVALS PELNTTYFVPRKI VWRVFI I IM 

FV 




14 Jyj 




yui . 


1 




birr ir yxiJji-AjXiO^n FKK X fc>U I f U/ilAJl w 

NILSSVG/SSFSLPAVILIIFMI*EAFA 
SKRKVLIVEDPSINIiE*IjYGCPPSFPSF 


895 


14796 


A 


902 


326 


1 


LGPPPPLKEPPGVFPIFPF\*FWNFGPV 

PPF*GKIPFF*KNTPPFF*GIGGSIPHP 
PLFFFFFFL*DGVLLCHLGWSAVA 


896 


14797 


A 


903 


1 


352 


KGIFPVWSC3G*KGTQKKGGLSQGGQGNV 
rtrQAJlrir V ruirl/Jr FCsAQJjIiIo^uQRIjI 
F*KGPKKGFLP* TVFG/RRAQFEAPR 


on "7 


1475*8 


rx 




325 


2 


D IQKRPDQTWVQCDACLKCRK* PYGMDQ 
HLEK\ WY WCNNHDS Q PR YCMVPKDPEDM 
DLVHPNYGKPYKKPSKETYWFSQMP 


898 


14799 


A 


905 


1 


329 


I GtiAI HGAERI WGQTAKSLAVHN\ VCE 

y ivlxX ru\JUU3<aJr^Kiir yWJjl\W HIM IrruNrVrur 

SF*RGLGPQSNFFYLKQ/RIiGFFF*KKP 
PKI * I LAMEPPNKK I F* KNKNKAHFFF 


899 


14800 


A 


906 


148 


2 


DVDKLFLLRSLPTL * RPEYGS YMIEGT \ 

XuvJcr JtVjvalrloiirr* lljiUWIFLKxK . 


900 


14801 


A 


907 


384 


1 


ESKRSIFGPPNPPPPGFKGFSRHAPLWL 
RSREPPHKAGVKKGFPRFRPGS*MPIP* 
SFLQNCLPQRVGFPSLPPPGAPFIiBCISIi 
FLRQKSRSVTQAGV* \ YGLLQCRPPRLK 
SFSCLSLPNSWDYNR 


901 


14802 


A 


908 


3 


297 


TKIKSI** INHLI RAKTVKLLEENMG INI* 

IMKLS FVKMGS FI SQRTPLRKERQLTK* 
EKI FADVYLI IGRRG 


* 


14803 




yuy 


1/CO 

ioy 


373 


AoNliioAl JJJ.OJN 1 r oFirtjoytar ooxaAi 

VEAGTYYTNFS CLGQVKVF* YWMQLI VS 
IVLL*LL*VTQKL*L/GPRQEKFV*YVP 
AST* ASIiPLKPCD PGGPKVFEMS YADS / 

AYTYS PLEKKLFILLGCFVMICFCFCFL 
RRVL 


903 


14X04 


A 


Q10 


132 


368 


GRIFLFVGOEKGARVSFLFLF*DRVSIiC 
HPGWSAVAQSQLTTS/TWTQNNS SHLTP 
P+VAENHRCA/PHTPNVLFLC 


904 


14805 


A 


911 


2 


339 


NS WAQE* AG ITGS CHHTQLTIjEF* AHDR 
QMMFFLFFLKTNFTFCPPA*RPWPGIiGL 
TE\PLLPGLKHFF/ CLTLPSNWDKGHLP 
P/HPYQI/CGFLRKNGASLISG 


905 


14806 


A 


912 


3 


381 


LNFCDTHPLTPRPVSIQRQCLPLVEAGI 
RWRALISPQLHPPT/YSLLSSWDYKHAA 
PHPANFFFFFFKKGSLTMFPRPGGS* IF 
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K^Liysine, Ju-Jjeucine, M— Methionine, 
N-Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














I YLLI YLERES Y/ AVAQAGGQG PNFNSG 
QPLSPRFKQFSYLSIjIiR 


906 


14807 


A 


913 


394 


1 


MS KGK I G * KLG * KFAQXVS QF VNAKEKL 
LKEIKHFTLMNT*MI*KRNS/LIADLEK 
ILVVWIENKTGYSIP*TQNRIQSKALIL 
FYSMMTERGEEA\ KKFEARRD YFMRFKE 
RSRLQNMKV* DEAANADGEAA . 


907 


14808 


A 


914 


326 


200 


HNWLS I WKKVKSNCFLMPHTKINSRWI K 
DIKI /R/LHRIKILQENVGNHI YKI * RQ 
KRS * PRHNFLHKNCKCLVWHKTNK*NH* 
KIYL*Cr*KTFAVFM 


908 


14809 


A 


915 


3 


334 


L WRVKGPKFKFWLHHQLGLYC * V I S FF 
INKMKEYVLIHRI K* DHKNI F * MGYKKL 
*NVNIW*LVLFGFSSKNHL/WSGAVAHT 
FNPSTLGDRGRR ITQGQEFQI SLTNMVK 


909 


14810 


A 


916 


14 


299 


YQKLPQKKSPEPNAFTDKFYKIFK*LTA 
ILQ*LFQKl/E/KETPLPK*SNESSNAL 
I *KPDK0IT/SKNYKPISVSNIHAKILN 
KI *GYQVQQKQLYSK 


910 


14811 


A 


917 


3 


339 


SLQPQSCRIiKCSSCl^SLSSCWDYKHKPP 
CLA/NFF/VFF*EKGJCTVKN*NGDYPL 
GRRVT KRDHERGF *NAGKVLLVGW / CGF 
ETQS R FVPQAGGQGRNLGS LQALFPGLM 
PFSC 


911 


14812 

■ 


A 


918 

- 


2 

■ 


321 


GI I SG I YKELSQPKMTDSS I KK\KDLNR 
NFTKE YVQMACKHMKRCSASLVIREI K\ 
IKSTMR*H*\TPTRMTKIKD\DKNTKCW 
*GYGAVGMLIHCQW\inT(MVQPI/WKN 


912.; 


14813 


A 


919 


51 


326 


FFVFCLFSLVFISLNCGLF/PTFFRIPS 
L/PYLQIil * I YLFFY* LKPI FLKFFRDR 
VSLCHPGWSTVA* 5 *LIEASKYW\VQ*S 
SHLSLLSS*DY 


913 


14814 


A 


920 


239 


2 


DSLIIiSAVQ*HDLGSLQPPHPGLKQ/FL 
PSNWHYRCMSPCLAYFLLFLVETPFCHV 
TQASLELLGLSNLPASASQSAGI 


914 


14815 


A 


921 


1 


242 


PRPRRRLljKYFILFNFLRQ/RSRSATQA 
EVQ*HDHSSL*PQTPRIiK\HPPASET 


915 


14816 


A 


922 


249 


1 


ALFCALKKTLCGTLMF IAALFI IAE \SR 
HNQTQPTCIiSTDEE INKMWHRHAMGNYS 
AIKRNEVL I YAKI * MYLENIMLSE 1 1 1 


916 


14817 


A 


923 


404 


1 


SRPQKKLGELKCS * RPKGLNKKNQRKPG 
TFFFFETKFHSFCQAGVQWVYGC*SHPY 
LLHIIFP* *VF*MFLIIiITV/PSRNKDL 
IGNSBKQIHN/WSSLYFYF* LFFlaRWSL 
TLVPQS GVRWRNJjGSIiQPPIjPRFK 


917 


14818 


A 


924 


6 


358 


FFCSLLGHGGETPNNS PLPHP WL * ETKS 
PPKPPHPLKKKKHSLFFKIFILKKKKNG 
PPr , FPFAGPNPRG*NPPfc , fc*Q/ Ri»t»KlQ£'P 

PPKEKINPPPPPQGKNFPKGKRPPLGPF 
PKKGGGY 


918 


14819 


A 


925 


268 


2 


HM YAFVKI DLTAHLRFS AGKKE I PAGKL 
YF/L* RKNPKNQ I GNQKKKKCLPRAGRM 
VAHAWKPSNLGGQGAGTT* SQALETSMG 
NMGKP 


919 


14820 


A 


926 


74 


338 


I AG ITGVSH * ES KNS Y * KQFL / WPGTVA 
HSGNS SNLGGQGEW IT* GQE FETSLANM 
VKP 


920 


14821 


A 


927 


339 


1 


lM,tACPT>LP^WDYR6kPPb<ilt'E 4 StflH 
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OCysteine, D=Aspartic Acid, 
fi=Glutamic Acid, *-rbenylalanine, 
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Q=Glutamine, R=Arginine7$=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Mop codon, /^possible 
nucleotide deletion, V*possible 
nucleotide insertion 














FFKKQGFICPIEKGRELSNCCFQKEVLI 
*KKVILKI*PFFFFFLMESCSVA*AGVQ 
WHDLQPLPP \NS WD YRCP PSRPAKFC I F 


921 


14822 


A 


928 


3 


416 


RCWWENKIN/PFWK/S/VWPCVTKVNIH 
SP YDLT ISLQYLP YRDE YLCSYKNLMFL 
rVLFI ITKNYKQPKCAS VG* WLRILWYL 
/ YMENYS AI KMNK 




14823 

i 


A 


929 


344 


69 


YKRS PLL\KSA*DGPANWfCSLPRS * YKR 
S PLL I KAKPDQ I I S G WEAS PL I I \ PRAL 
LQKLKPAASKF*W/NPKKKRALIS/KAI 
PSKKNKAEG I MFPNFKV* YRDTATKTA* 
WWYKNSSSL 


923 


14824 

i 


A 


930 


378 


1 


HGKKIDPRFIPYTKI/NLKWFIDLNVKP 

ktikfle*nige/tifvidhkk*np*kk 
klvn/wdfikiksfcfvh*kpavgk*kr 
/qp *trrnyvqthtsdrglisrickblf 
* lnskvi ns i rkwaqsln 


924 


14825 


A 


931 


479 . 


83 


SRKGLGRGLKRGGFFPPPPPKGLPGGVA 
PPGFEGFFGSL*KG/DRG*PFI*KGVKGE 
KGKRI *EGG/EKGGGFLPPPPPQKGFSK 
NFLGFYAGKKENP*GGSPPPPLFGPAQK 
ISLGKGENKVPPPPGNLFYLFIF 


925 


14826 


A 


932 


36 


440 


TTYASLDEAQLPRAKFNAFLTTF*HI IK 
NGP ILGKIGRKYMIADQ \GHRMKNHHCK 
LTQVLNTHYVG PKR I LLTGTPLQNKLPE 
LWALLNFLLPTIFKSCSTFEQWFMAPFA 
MTGERVHLYEEET I LI IRRVHKVL 


926 


14827 


A 


933 


8 


398 


CLQGKEETLTTNWHLRYFLP FFFFFWPK 
KGARKNGGRP PGGS PKNQRAPCPKPFEN 
PGEKTPQKRGFFPPKPCPGPGPGPTGGQ 
GKPP/HPQRPQILPFVPQAGPQGRKGG* 
WNPPPPG*KGFPAPTPPRM 


927 


14828 


A 


934 


1 

r 


441 

■ 


TRSHPRALKEVYTE INI VFMPANATS IL 
QPMD*GVILTLKAYYLRSTFCKAIAAVP 
SDS/SSDGSGQSQLKTFLKGFLILDAI* 
NIGDSWEEVKISTLTEMWKKLNPIFMND 
FEGFKTS VEEVTADWKI TEE VQVESED 
GTEFLQSH 


928 


14829 


-m, 7 

A 


935 


3 


383 


PGFKASKDRLTFSSGNNAAGNFKLKPVL 
IYHSENPRVLKNYAKSILSVLYKWNNKA 
WMTAHLFTAWFTE \ P YVETYCS \ EKN I F 
FKILLLIDSVPSHPRALREI * KQMTIVY 
MP/STTTS I LQPMNQG 


929 


14830 


A . 


936 


417 


1 


CFFSRDEVSPCGPGFSPSPD/HHDPPPP 
PSQCWDYRRDPLWVPPHICFL IHKKKRS 
SHMGSSMYDP* KPPHKWMKSPPPVSVLY 
GSIPVOVQIAPPPETNPVYFFSPPPPFW 
GGGRVCVCVCVSSEVCDFFGGDENVIV 








* 


412 


3 


SPFFGGLGQKNGFTPEAKGS INLKFPTA 
PKSWGTQQKSLFKKKKKKKS I LKFIWNQ 
KINPKQIKQS*GITLPE/FKLYYKEATV 
TKTAWYR YKNRP IDQWNGTDRNKATH 


931 

- 


14832 


A 


938 


1 


416 


KNRHIDQCSGIERPEIDSHKYSQLIFDK 
GAKVI * * RNDNLFNKWYFNNWMSTC\RK . 
INLDTDLTYLFQNGSQT*P*NIKLIiENS 
MGGNLGDL/GMSQ*VSSSSSSRRIHDKK 
LVS/ WDFI KIENI CSEKGI I KRMKSQTT 
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SEQ ID 
NO: of 
nucieoti a e 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 

TIC CM 

uy/jMjji ■ 
26 . 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 

nU Cie- 
onoe 

location . 
correspon 
ding to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Atanine 
OCysteine, D=Aspartic Acid, 
Hr-ijiiutamic Acid, r— Jroenyiaianinej 
vj — oiycinej-ri— rtisiiaine, i— is o leucine, 
K-Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
* Q=GIutamlne, R=Arginfne, S=Ser(ne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^yrosine, 
a— Unknown, *=Mop codon, /=possiole 
nucleotide deletion, V=possible 
nucleotide insertion 














D 


932 


14833 


A 


939 


4 


443 


DFDYNHDHDYAELGTRPGSVGQGSPDPQ 
FTPSRMGREGEGTHS IIi * CS S LGMGVTA 
DLSTD PTELE KRALEVAGPIX3QASAI S P 
ASPRRKAGDGGHRRALPGCTSLTGETTG 
KSGEAGQDGKPPGD/GPIGPYSLPGSGP 
GSGESMMG 


0\ 4 A 

933 


14834 


A 


940 

i 


3 


404 


LFMFLEFFF * KPRS C* VS QAGMQGC YLK 
ALQAPPPRFTP/SLLSS*DYWSLPPPIiA 
NFLYF * ETRGFTGLTRMIS I PQPTEMPG 
LAS QTAKIiI FF * K/HRVLVES /HG * SAR 
AVHRGDLHILEP * TPGLK* YS\CISL 


934 


J4835 


A 


941 


397 . 


1 


FVPNSQVANAKKKFLKEAESTTPVLSGI 
RKQNTFAADME * V* MVWIKDQPS YS I PL 
SQSVIQ/SRALTLFDCTKAERNRRGKLQ 
WEPAV\EGSKGWLMRFKERSHLHNIKVQ 
DEAVS YPEDLDKMDAVNTKHQI • 


935 


14836 


A 


942 


441 


29 


SVTLCKHTVHVPTFLRRGKRCPLFGQCQ 
PWPSPG/PHPLSMSPV*PRPQ\PIPLHS 
APPLALGPFIPMLSMAGLPFPFQGSGLR 
NWKPPFPQPSLLQ* *HFPKPLKPIRGLR 
PFPPSTLSIPLPGTRPRAKLGVMRSRL 


936 


14837 


A 


943 


2 


397 


ARDAAPEPGEHLLQGLSARHGLRPPRDS 
RPGPD\PP\SPPHPLPLPAVPTFVFSFG 
DSWLL* S P FFPAP WASEGGACAPNPHAIi 
VPS PSGQASGLRGGAPSWKAGLDTDGQQ 
AGRQSPGPAPPSPPLPPSQDC 


937 

* 


14838 


A 


944 


3 

E 


449 


LRQVWHEGEMPNKTTLIYHYTPI * ITNI 
KNTDNTKSLWGCAEARSFTHC/WMRIKM 
VKPLWETVQ*YLI* *KLQLPFNPAVALW 
\SICPRKVICrY/CHRKNKTRS*MFIV/A 
LFVTAQTKN 


938 


14839 


A 


945 


380 


175 


RAPAVPATQEAE VGGL/ H * TWEDRLNPG 
GEV/CSEPKWCHCPPAWATKPNCVSKKK 
KKKEKRINHQHDEK 


939 


14840 


A 

■ 


946 


21 


331 


VAPLKYGPSKETI / IQS VRQSTECKKI F 
AYYPSDTGIilTRIYEEL*RIiNRKEKLNS 
SVYKYAKDIjI*PFSKEI7IvMAwRR/H*K : 
KCSTSLI IREMRFKTTMRVDAV 


940 


14841 


A 


947 


365 


1 


GGGPFWFFFFFFLGGFLKIGKIWAKKKT 
GPSFPQKKKRGPKKKKPPGFFFFFFF*K 
GFVAPGGGQWEGFGSLQPJjPPGVKQFFC 
PRFLRKW/DSRFFFFFLIKSNAEQFPTL 
YAGVRMRYIK 


941 


14842 


A 


948 


357 


137 


ISAHCELRLPGSHHSPAIiLCSPGLSRTP 
NL KQS S L FS LPKCWD YR \ LATVPGQIiLL 
F* ARHCIS IDPSLHLNNM 


942 


14843 


A 


949 


53 


352 


RE DNHHKC I RNKM I NRKWANE VNRCFSE 

VTf\M\TVXnJWT.T .'h. QTJOTDQMnTinJWV T- 
■CtX^nv JMSjla tSJjJLiHJlriri.fwri^ J.HKX I \ii 

NSLA* KKEKKSD\NTKLWQGFGETKSV* 
RYIIN/PYDPTIPLLGIY 


943 


14844 


A 


950 


937 


3 


KVSPYKINIHK*VAFLCTNNIQAEKK/M 
NMVLFTI ATOKKI / S YLVTH* IKEVKYL 
YNKNYK\ LLKKNRDNANK * KGTPCS * I * 
RINI I KMSVLPKATYRLSAISLKL* MQL 
FMELEKE \ T I LKVI WKNKTAHLAKATLT 
\QSNPGGLTLPGLYYKATVIHITW/HCY 
KNTHVDQ* /NRRESPE I SLHFYTQLFSD 
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NO: of 
nucleotide 
sequence 

• 
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SEQ ID 
NO: of 
peptide 
sequence 


M 
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SEQID 
NO: in 
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26 
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nucleotide 

location 
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ing to first 
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peptide 

sequence 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
acid 

residue of 

nentide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G)utamine, R=Arginine, S=Serine, 
^Threonine, V=Valine» 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, Impossible 
nucleotide deletion, Vpposslblc 
nucleotide insertion 




■ 

• 










KVNRSKQ/WKNCLSNKWYWESWtiiCRR 
RKLKLYLSPYKKCNSK*IKNLTVKLKAI 
FGiVEHQKSTVIiVTGIjWKGPMTNTIKVNA 
TKIKVKN*HLIKLKNFCTPKBT/T*QIC 
RQQRVWNKLFSNF 


944 


14845 


A 


951 


246 


1 


AAS TKTGVQT KTCT * ML I AALVI TVKRE 
KQSKCSSVE\B*TMWYIHTMEYYSVTKR 
NDVLIRTTWMNLENSMLSKKDSHKG 


945 


14846 


A 


952 


2 


255 


QGCGEKGTFLHCWWEYQPLWKTVW* FLK 
DPETDIPFDQAIPLLGIYPKEYQSLYYK 
DT\AKTWNQPKCQLMVDGI KKMWYIVDA 

A , 


yHQ 


mis**/ 

i 


• 


. 


j4j 




GS T YTKEME PK\7NNliOK* KAS CTDQL* G 
E FY * TFENE I P I LYNLPQKTEAKGTLTN 
SRDEAS 1 I LI PKPEKCTRRXENDK/ P / 1 
SLMS I DAKLLNKI LAN* IQQSRI YSRHT 
RLIQ 


7** J 


14545 


a 
■t\ 


VD4 


1 


1AQ 


LKTT* FQGFIFCFFFFFK/QAKSCSRIK 
GRGPI XAKGTPEIiPGLRDPPTS PT* VAG 
TKGGGPHTQLKFIFIAK*FYISF*HNGK 
FCSRRGR 


OA ft 
745 


14047 








3D I 


GT.KNYAKSTTjP VT»YKWNN\ KA* MTTHTiP 
TAMSTE*FKPTVETYCS/EKKIPFKILQ 
LMDNAPSHPRALMEMYKEINAVFMPVNT 
I S ILOPMDOGVI STFKS YYL/RNTFCKA 
IGAIDSDS 


949 


14850 


A 


956 


336 


3 


PTKENFEPDGF\TG*FYQTFKELIPPQT . 
FLKTSREYFSSSSSSSSSSSSSSSSSSS 
SSPYRPI /SLMNTD /AKILNKILTNQVR 
LHIKRNTHHDLVGFI PK\ WFNTRKTDQS 
NN 


950 


14851 


A 


957 


181 


1 


RWDLTMLPWLVSNS \GLRRSSH/LPKCW 
D YRFE P P CTA* GWFL IGPHWNVS TWGS 
PRISRQ 


951 


14852 


A 


958 


3 


293 


GGLTS PHVKTY YQ ATVI KTAWNG * RGVC 
MDQ/YNKTENPETDPCKYSQLMFSEVTK 
AIQ * RKDS LVN IENWNN*MS IHKKSS SR 
KHLNQYLTPYTKL 




14853 


A 


959 


350 




NKKKGQSLVFRSPTLFFFFFL* IEMGXF 
IMLARLV*NS*RRNMTTSGSQSVGI1X3V 
SNHARPKRKLFLFSI ITTIGWAGV\WWL 
TLVIS ALWDYRRE PPRPAVLLPFH I QRS 
RMPLSN 


953 


14854 

i 


A 


960 


349 


1 


GGFPPPLFFFPF/PPPWGKPPPPFFYQX 
SFPPKKKGGPKPPFFFGSPWGVHKKQFL 
VKSPLPFFWKKKKPPPPF* KKFFPP\VF 
QKQPGRVKKPPYERNSWLLGPPTKKSPK 
KSTGS 


954 


14855 


A 


961 


379 


160 


PGPQGEPPFFLKNPQKLVGQGGRLF*AP 
LLRGVRQKNSLNPGGGGALKPGS/HLWP 
PSWGKKGDFLF*KKKKKKK 


955 


14856 


A 


962 


184 


2 


TFFSPKKIFPFFFFSKKFFPKGTFFSPR 
ENFFFFFFFF* DGV* L\CHPGWSAWPS 
RLTATS 


956 


14857 


A 


963 


1 


328 


ARLVLKS*PQ/CDLPASASESSGITGVS 
HCASASQS AG I TGMSHHI RPKWI S L YLG 
FWSFNKNVLHFFCASSLEGESMNNELLS 
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nucleotide 
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SEQID 
NO: of 
peptide 
sequence 
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M 

?th 

od 
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NO: in 
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26 
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beginning 

nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
dine to 
last amino 
acid 

residue of 

peptide 

seauence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 

E^Glutamic Acid. F == PhenvIftlflninp 
G— Glycine. H=Histidine. I=Isoleucine. 
K=Lysine, L^Leucine, M=Methionine, 
2V=Asparagine, P=Proline, 
0=Glutamine» R=Areinine. S=Serin& 
^Threonine, V=Valine, 
WeTrvDtoohan. Y=Tyrosine« 
X=Un known, *=£top codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














KKNWEFLDKJCLHGSTKLGGNHVCVNP 


957 


14858 


A 


964 


1 


334 


HPLGGRVGGVPLGP/VVLNPPPPQ*GTP 
FFLKKPHNPGRGAKP VIPAPRGG* GGKF 
LLPPRGG/PSMNPNCPPAPPP 


958 


14859 


A 


965 


206 


360 


QKSMSLHQQ*QDIHSSQAHKNSSW\PGA 
VAHTCNPSTLGGOGMQITRSGDOD 


959 


14860 


A 


966 


10 


332 


NFFSPGAPPPPFPFGGGFFLKKGPPFF* 
KKKILGGPPPPKKNPPGVFPFSP/AKNL 
GFPPPPPPFFFF 




JM-OOl 


a 


i 


i ■ 
i 




LRKSTLKPRDMPHIY*ND*LYKKNDDIK 
CW * G YGVTG AVFH FHWSRMVQ PLLNNW 
TVPRNVKFTL 


961 


14862 


A 


968 


339 


195 


FFFCRDGVSPCCPGWSRTPRPKQSTCLS 

Lttr ]\\-ri U I rCJJ Al V xroi-iU X XjT 


962 


14863 


A 


969 


3 


342 


IKKGPHPQLKKPPGVFPVFQFKKSYFPP 
PPPLFNNPFFFR\DKASLCHPGWSAVML 
S YFTAAYTS W\ VRS S SHLS LLS S * E WRH 

X Ve'v^Xjf r w X* w ±* Xj/ <wi<o A. 


963 


14864 


A 


970 


384 




GRKNVAGFPLSPLYTPPHGHGLGPPQTF 
GAGPPAHKSHQKVGRpKRGPGFVPPRPP 

ziirr«ir??pir]??f , i?*C'i/T.irv , HT4»7WfWT.vMGr cav 
ac urrc r r ivji w o vxj£<i\JSxiixi\xjXjVourAi 

KI LNKVQGETQCBGRAHIC\ VCVCVCVC 

VSVCTCVHVCALAIC 


964 


14865 


A 


971 


427 


109 


LNSQFELQEKRISNLEDNEKI I *SEKBS 

ECPEGBEK/GKRKRKTIKKKMAANFPHL 
MKNINLHIQKAL *TPTRITCTGPYGG 






n 


7 11. 


1SIQ 

JO J 


1 A* 


QT.D'D*TPPT.MT^QCHrT.QT.TJC:cwnVT3 ("TTDtD 
OXjyr"* X JrKXjTliJOorlXJOiJijOO WLJ IKLV 

HLANFLYF* *R*GFAMLGSSNSSA/SAS 
QSGGITGVSHHAPWFTFKKKLVADT 


966 


14867 


A 


973 


170 


419 


VS FLFFFFFLERQF YFFS PGGGEGGEFG 

PPLPAYFGFLI K\ TGFPLVGRGGLDLRP 
SR 


967 


14868 


A 


974 


212 


3 


QNSMIRYSRSMSRSCLCTTILl3t,S05R5" 
LKSPR* / WPGWAHACNPSTLGGRGGRI 
TRS ED * DHPGOHGETP 


968 


14869 


A 


975 


28 


356 


VFETSLFRNKKEIVKGTSxxPDFRTYFKP 
LI I ES I * CLHGIGQNNAWNRVGNLQIDA 
/ECAENLIYEKGGISIQW/SLFNK*CWS 
ITIHKEEKKIRPLHCI KI KGTKDFKKLS 


969 


14R70 

1HO/U 








i 
i 


TjPPKETCVQI.tiPPRR FFPPniMKTCQTT.T JQT. 

XJf WM^VO UiJV«JrVjt^f Csf XT wlTinivO X 1.11 hj>jj 

P KMRGYR /RLAPP PQliE I FFHFFLTSKA ' 

TPIi/CLGRSLKRLNSQMFSPFFSEGVSL 

CSQAGIQWRDFGSLQPAIE,WFK*FSCPS 


970 


14871 


A 


977 


4 


343 


PLHSSLGNIWPHLLNNSNKNDTTKCW* 
GCE KPGFLIHCI WE YMMV* tiWKTDWQF 
LIKPNIHLPYDSAVAHLGIYPR*MKTFL 
QKL/CP * MFIAAVFL IAKNGKQLRCPS V 
SK 


971 


14872 


A 


978 


3 


424 


KRMRRQATD WEKI FAKGI SDKELLS KI H 
KELLKLNKKKS PKQVACFKMGKQT* TAH 
LH*TIYR/ CRRVTHRKDHSTRSWLLREL 
QILNT 


972 


14873 


A 


979 


114 


325 


QQQQNN * I KKLAEDSNRDFSN/EDIQMA 
NK * RCSTS LVI MEMQVKVTVR YYYLTLI 
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SEQID 
NO: of 
nucleotide 
sequence 

'i 


SEQID 
NO: of 
Dentide 

sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

n ti ciefit idfi 

1 ■ U VlwllUv 

location 
correspond 
ing to first 
amino acid 
residue of 
nentirie . 

sequence 


Predict- 
ed end 

n nrlp- 

11 HVIV 

otide 
location 
correspon 
ding to 
last amino 
acid 

ft* V* V* 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=»Aspartic Acid, 

r.cfwttlfcltTl I** A(*iH ¥*llf»n vlfilan Snck 

■j 1 u wi 111 iv, /%viU| a j ijciijr Kiiuiiuic, 
G=Glvcine. H=Histidine. f=Iqrt leucine 

K^Lysine, L=Leucine, M=Methtonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S»Serine, 
T»Threonine, V^Valine, 

W=Trvnl t nnhnn Y=Tvm«ine_ 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, Impossible 
nucleotide insertion 














SIAII KQKKKQNPKK 


y 13 




n 






Z 


NPPFFFGPQKGFKPFFPPPYLNPGKNPP 
SKKPLFFFF* KKGFFKPPPFFFFPFFF* 
GRVSLCCPGWSAV\A*SRIiTVTSTSHVQ 
T 


974 


14875 


A 


981 


2 


377 

♦ 


QKANKHMKSFSTSSAFMKM* IKNKMRSH 

SW/WECKMVQLL*TMV*KFLI KY/DTYH 
DPAISLLGSIKKKSKFCT* /T/CKILYL 
NVIiFIMNPNWKQSNWYSMG 




146/6 

1 


A 


000 


1 


457 


INK I bAKNT I i»Kr N KTEJUTS S S SS5SS S 
PLVGLTKRKKSQN\NEGD I TTDTI E I I / 
TIIKDYSE* IYA/NKIEQASSSSSSSKT 
VNLSGPHHEGTESLNRPIMSBEIESLSQ 
NLPTNRS PRADS FTGKFL* TS KDELPPT 


976 


14877 


A 


9S3 


332 


2 


TPKAGQMQVLFPQYGSKVARAPFSVVAL 
VVVitKr r rlr r l_r FHL^JuPPQIPPPr S 
RFFLQVKSSQQGPPPSLLGWGEQ/NTLP 
FFFFF* SHSLALLPRLECSGAISTAA 


GT7 

y/t 


. 140/0 




9o4 


So j 


2 


r^DXUjrlNiFJt'ijr r r juKJNrGJrJwtK/SGrr 
FCPFKNFFNPRVPGPNFNFKGFFFLKKG 
PTVF * I KKFWWGPL PPLK* PSGVFRF FH 
FLIWDFRPRPPFFFFFFFLVERGFHHVG 
QASLELLTS 


y/o 


maty 




nor 

y&D 


1 




AGITGVHHHTRLFFFFVGFFFNF 


979 


14880 


A 


986 


225 


1 


LYIYRERERERDRDRDRDVSHTHLYIHT 
RTHLDNILFCHPGWNAVA*SWFTVALNS 


980 


14881 


A 


987 


2 


375 


GTDEYILIALIWMVSWVCLFFFFEKKA 
PFCP PAGKTGANFGLRAPP PPG I KE I FW 

GSPL* PRLFG/HPRP *GNRPP+ /PPQRG 
GNNKGDPPPPPGGIFF 


OR! 

JO J 












HTVTHVW Cf3CLJT.T TT7/TCT T T?T T V\TOr r \T \ 

NVGAL*ASG\QASENRISVSDFLLPSFY 
LPKAGL* S FPAFMMMGHKI LIP EKVI PH 

MTiRRP.\ TYTRBP 


982 


14883 


A 


989 


295 


369 


I LVTRNS E L* VLPEQRTRFLEKAMALRS 
IPLLKR/NGRPGTVAHACNPNTLGGQGG 
RIKRSGV*DQP 






a 




A 




T*KLQNRQIYK\TERIYIYIYIYIYIYI/ 
FFFFFLERKFLFLFPRGEGGGPF 


984 


14885 


A 


991 


463 

• 


0 

■ 


GFRQLS CLS \LPS S W Y YRHMS PHLANPF 

\ 7*P*T7PP^3PPXTl7V^nar3T.PTT.TPMnciDCC 
\ i r vjbnor rvn voy/iuuCiubi JrJN.Ujjro j 

ASQSAAIH/ GMSHCTQP 


985 


14886 


A 


992 


82 


365 


E I CHQLYRS FLCSLFC/ DHPSNKSH*DT 
^CVHFIIRLiIiNFSFFFFEKKFRFVPQV 
EGQGSNLG* LKVLPPRLKPFSCLTLRRS 
WGYRGPPPPPVN 


986 


14887 


A 


993 


3 


317 

i 


GLKQSS CLGLPKCWDYRHKPPCPTHIIF 
NTH*IIKVLNVSFSFCI/PLCWSIALSD 
HVQ/ PV* LYNMLVS S FLLLLLFFE IEW 
ILACLFCPVGSPILFFF*YILTF 


987 14888 


A 


994 


350 


139 


LFRRLRWEQIEjS \QGGQCCSEP*SHHCT 



584 



WO 01/64835 PCT/US01/04927 



SEQ n> 
NO: of 

nucleotide 
sequence 

■ 

< 


SEQ ID 
NO: of 
nentidf* 

sequence 

• 

■ 


M 
cth 

■ 


SEQ ID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

n 1 1 f Ipnti f l<» 

location 

ing to first 
amino acid 
residue of 

peptide 
sequence 


Predict- 
ed end 

nti<*l#w 

U U VJ C 

otide 

Inrfltion 

correspon 
ding to 

last ftmino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystcine, l>=Aspartic Acid, 

XfrziCl 1 ■ i f <-> m tr AfiH T<z=Phprivl;»lnninp 
M-j^\j i u mill ik- /sviUf a ji ji at j id iijj jfJC, 

G=Glycine, H=Histidine, Wsoleudne, 

K&l .vcinP I J=T .PlirfnP [VT=IVfpth innino 

N~Asparagine, P=ProIine, 
Q=Glutaminc,- R=Arginine, S=Serine, 
T s =Thrpnnine V— Valine 

W-Tryptopban, Y=Tyrosine, 
X=Unknown, *=Sltop codon, /^possible 
nucleotide deletion, \=possible , 
nucleotide insertion 














PAWATE * DPVS KI QQNKNQDTFSHTFI L 


988 


14889 


A 


995 


219 

4 


1 


RPRRPLIQKTKAKNHHFTPTRMPVIKDN 
NKC*QGCGEV/WN/CRMVQPLWKT/IQF 
LKMLNTELP*DRAIPLTGGKE 


989 . 


14890 


A 


996 


3 


498 


CLPWAAVARQPPSAHPFPQPGP*LRTLA 
PDTATAVNQALQRQE S * TGC * S P SGWPA 

VtrxifVAirKa/ Jt , ±rbAv7 w V»VVlJl^KAWPKiiX«G 

PPGQHPKEKTYQPVPAP * VQIiS PTRQTP 
APMACSSLLLPALPPPAPAARPPCPAPP. 
FLPSRTSTLPPRRLSSTIKSKTPPGP 


990 


14891 


A 


997 


2 


419 


ADTEAAAS YPDNL \ANDESSYTKQQI FN 
VUKTALYW KKTP SRTS IAREGKS KPGFK 
TSKDKLTLLLGANAADDFKLKPMLTYHS 
KNSRAHKNYAKSTLCFRN*NKAWTEHLF 
IAWF/EYFKVTVVTNCPEKK/VLLLTHN 
APG 


991 


14892 


A 


998 


388 


405 


G*FFPLSASKFFFCP*ALKLWMEGCHIR 
PPPQVRFPLQSSQA/GFISASLKGKGFP 
ASMPANIGPPKGYL* PAPPFFFFSFLRQ 
SHSVTQAGVQWRNSSIiQPQPPG/LRRSS 
HLSLPGSWDH/MPPdPANFCIFCTGG 


992 


14893 

_ 


A 


999 


1 


423 

* 


WILHD/NLRQPAQWIiDQEAAPKHFPKPN 
SH/ QJOCVMVTVCWSAAGLIHY SFLNFGE 
TSTLEKYVQQIDEMH* KLQCLQPALXNR 
RGP I LPQDNARPRVTQ PMLQRLNKLGCD 
VLPHPPYSPDFLPTDYHFFKHFDNFLQG 

VTJT? 

luir 


993 


14894 


A 


1000 


2 


406 


3DLFRAI *ETTSVIVYDVSMDYLEIDSE 
M I RDLLNTFPV\HLELQQDS SGV I QGAG 
I TEASTINAKEIMHLLMKGNRQRTQEPT 

EVRQGRLFMIDLAGSERASQTQN 


994 


14895 




1001 


290 


3 


VTQLP ETVPHFLRDR VS FCCSGS / DHS S 
LQPQTPGLK\ YP PTSAT* VGGTCHHAQL 

PPPGFKRLSCLRNR 


995 

i 


14896 


A 


1002 


143 


408 


IKKKIKREI*KYLDISKNK/DTYQNIiWN 
ATKAVL*GKFIGINVYIFKNRKN*NK*F 
NSTSGKLKKLEKKEQTKLKISRKETIKI 


996 


14897 


A 


1003 


84 


409 


DKLAALPSSWTLHPGSPLGVTRAPPPPP 
PPPPPPPPPRVLEPVPRSLYPGLAVPW 
PRALH* P PHPGSLPACPAPGLLGGTRQC 
S\QTILPKKKPPPLDADPANEPPPP 


yyl 


14898 




1UD4 


J yZ 


2 


aw a v xir Li v o J. w a v f i y v tiu 
NKTNS WG I AGYP FHFLKRNQRFFNALKV 
PGPFSK * TIHLVNLKNLFFF * DRVSFCC 
P SWSAWKS * FTAASTS lA I KQS SCIiNH 

P S S * DCRHTPLLFFVRMRS 


998 


14899 


A 


1005 


2 

* 


421 


F VS LLLL I TS AI VAAGAPTGRTVYP PLS 
GNYFH PGAFVHLT I FCLHLEGVS S I LRA 
INFIAS I IS 1 1 PPAITQYQTPLFG+ SGL 
ITAGIiLLLSVPVLAAGITI \LLTEPSLF 
TTFFDPGGGGDPILYQHLF*FFGHPEVY 


999 


14900 


A 


1006 


1 


428 


D IHYG* I IRY JaHADGAS I FCI CLLLHIG 
RGLYYGSFLYSET* SMGI ILLLATIATA 
I IG YVLP * GQ I S F * GATE I TYLLS AI P Y 
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NO: of 
nucleotide 
sequence 


SEQ no 
NO: of 
peptide 
sequence 


M , 
eth 
od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 
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beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid- 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, F=OPhenylalanine, 
OGly cine, H=Histidine, l=lsoleucine, 
K=Lys ine, L=Leuci ne, M=Meth tonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonfne, V=Valine, 
W=Tryptophan,Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 

I x ■ ft ■ ■ i * % * ■ % 

nucleotide deletion, \~possible 
nucleotide insertion 


- 












IGTDLVQ* I * GG YS VDS PTLTRCFTCYF 

ILPVI iarlpalhl\lflletgsnkpig 

LAS 


1000 


14901 

• 


A 


1007 


1 


419 


S PIVP I VMAMACLMLTERI ILGYIQLRP 
GPNVGGPYRLLQPFADAIKFLSKEP\LK 
PATST I TLYI SGPTLTLT IGLVL* TPLP 
I PNPLVNLNLGLLFI LATSSLAG YS IL* 
SG*ASNSNYTLIGALRAVAQTISYEVTL 


1001 


14902 

* 


A 


1008 


411 


27 


GQPGQHRKTPS PLS I FYF/ELAGGGDAS 
T * FQLLRRLR * EARLS LGI * GCSEL* LP 
PCPPAWVTK*DPVFGGRKKKERLSQLRK 
LRPRBVKSLVQHYVAHKGGNWDPNPGPM 
ATQPGRS CTTLCVPLCHR 


1002 


14903 

* 

■ 


A 


1009 


21 


377 


MSLGGQQDSAQTTRSPQFAGFFGQRSIP 
DRTGLGFHLCVCKCVCACVQVSVCKCVH 
MCACACMCTRTPMCECVQVCECVQVCVR 
VSMCVQVCACVRVCACVCSGACG/CTCV 
T*MSLACG 


1003 


14904 


A 


1010 


1 


412 


HKANTSTYLLTLVNTFSG*VKACPTTHK 
TAEWASTLIEQIIPRF/GPAFISKIVK 
QVTTTLDVNWKLHTPYHPQSSGKVERAN 
SLVKQHL I KLALKTRQSWVTLLPFALAW 
LWAAPQS PTG INP FELLY/ SAPLPLSN 


1004 


14905 


A 


1011 


1 


410 


FRAVAGAS RQENGA\ T VILRD IARAREN 
IHKSLAGSSGPGASSGTSGDHGELWRI 
AS LEVENQS LRGWQE LEQAI SKLEARL 
NELENS S P VHRS TAPHTQQVS P IAPVEP 
PAEKPAT PT * DDVDDD IDLFGRDND 


1005 


14906 


A 


1012 


269 


32 


NISRIYKELLQLNNKXT\SISKRAQYLN 






RNFTKDI*MAKKPMKRCPTLLVIREMQI 
KTTMRYYFTPARMATIKKTDNNEY 


1006 


14907 


A 


1013 


21 


443 


RIRKKNSYPHYVKS IVAYTFI IRLFPTT 
I FMCLDQEVI ISN* H* ATTETTQLSLSF 
KLDYLSI ILIPVALVATWAI IQFSL*YM 
\NSDPNIKQCLKDLLMFLMTILVLSTAN 
NLCQLLIG* EGA* MISFLLIS * * YARAD 
A 


1007 


14908 


A 


1014 


I 


309 


S S RAAA I HGGACL WLQ I LHRLR WEG * LS 
VGGQGCS KP * LHH* TLAWATERDFVSQK 
KKKKPQKANP\HLETSCKKKKKKPMLGP 
PKSQRKEGPKAPQGRTLWP 


1008 


14909 


A 


1015 


3 


256 


E KNQ I LKWTMDLS RHFS KK \NMQ VGNRH 
MKKC*TSLI IREIQIKTTMKYHFTPVKM 
AIIWPGAGITGMSHRTQPSPTLLLTHW 
A 


1009 . 

• 


14910 


A 


1016 


397 

■ 


3 

• 


LFFSPPPKWGFFFFPPPRFFAPPPFFFF 
PPPPFFFFFCEKPFFFFFFPPGGGAPPL 

mm mm- mm* mmm mm— mmm mmm —mm. +*-nmm, %^ T"» *■» *>wj ■■■■ / m ah ■\ak«h 

SPPFFFFFFFFPPPPPPFFFFK/SPPPP 
KKKKRAAARDLELVGRVGGRV 


1010 


14911 


A 


1017 


2 


164 


EKERRE*KREE*KERK\RKEGEREGGRE 
RERERKKKRKRERKKDRCMFIVHFRE 


1011 


14912 


A 


1018 


1 


406 
« 


NSSPPKNKN*NK* KQHT* * EKNFANYIS 
DIELVFRTYNELLQLNKTGNPIKNSF*K 
NQFKNKGK*PNFYITQSF*RNKPI*KF/ 
AKDLNKRFSKEDM* IANKRI KTCITPLA 
IRKRQ I KTTVKSS FKA/ 1 YMWVKET 


1012 


14913 


A 


1019 


302 


2 


IVEPLFLGWLCLFLQNPRVKGPGPYFSK 
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SEQID 
NO: of 
nucleotide 
sequence 

- 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

in t? to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
corresoon 
ding to 
last amino 
acid 

residue of 

npntidfi 

sequence 


Amino acid sequence (A=Alanioe 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid. F^Phenvlalanine 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leuclne, M=Methionine, 
N=Asparaeine. P=Proline« 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V«Valine, 
W=Tryptophan,Y=Tyrosine, . 
X=Unknown, *=Sitop codon, /^possible 
nucleotide deletlnn Vsnnscihfp 

nucleotide insertion 


• 


■ 










GKANPWFPTV\SKQPFKKPAFPVYQKQ 
L PLF/ PALVDKKKVL/ RFFPF F * DRVLL 
\ CCPGWSAMVOS WLTTTSTS 


1013 


14914 


A 


1020 


47 


382 


AKIALLHSSLGDRVSHTHIHTHKRSNEI 
ELVI KS S P KKKS SGIjNGFP I E FYCSFK* 
ELIPIFLKLFQQ\VEAVGILPNLF*EVS 
I TLIPK / PROEONSNXTI YRP XSLMNIG 
GK 


1014 


14915 


A 


1021 


273 


1 


QS KERRQRTQEEMGKTFHNNRENGTLFG 
FSQSCLKDE I i I I I FETGFRS I SQAGVQ 
W/P/NHGSLQPRFPRLKQSSCFSLPSSW 

TTVW*T.PW 


1015 


14916 


A 


1022 


327 


1 


KFLKGWP I YLKKGFPF VPQGGGQWANYR 
S LVPQ I YGVK\ YP SAFASKINSTMGMCP 

LVAQAGVQWHDLRS PQ P PPPG FKRFSC 


1016 


14917 


A 


1023 


3 


407 


THLTQR* /HNIQTPP/ WV*TGTLSTVLM 
NFYLWM* FHFQSI TLF ILRLLTTKLTWY 
QGWUDVTQQRSYQGHHTPPVQKGLRYRI 
ILFITINTFFLEGLF*ALYHSSLFPTLH 


1017 


14918 


A 


1024 


3 


425 


LPPQKKRGFPKIPPREF*KTPPKKKKII 
FPPPGKNWPPTK/RFLKGPPPSRFIPFP 
FFPLPSSPSPP 


i m o 
lUlo 


1 /fOI Q 




W2.D 




1 


nr , PDDPPPirKTPPT7PT.CP*TT7fl /Pf3VPOV 

FPPPKKRFFSKNPQRGFNNPPQKKKKIS 
SPPPVNFAPPRDLLKRPPPF FFFFFFFF 
FFFFFLFMHPSFPTSVPQWQITSFLCLL 
LIIiQCCVCVLIGMWGDQSESSPSHHLQ 
PRWL 


1019 


14920 


A 


1026 


463 


494 


RKTGFPRLT * HLDLOFFFFF FFWKGIBy 
APRAGGRGRNLG * LKP P PLGWAP FSGLT 

PP ETQWNWtft&PDDPP PMT? /r*VP* P JCTYJE'D 

RLTR 


1020 

■ 


14921 

- 


A 


1027 


606 

■ 


1 

* 

■ 


AGGPGP P KWPGAFGWG / S LNG * S * LKG 
KGSKGAP I PQSGPTVFPGGIPPKES PLR 
PPGGMQGRRGPRWPGASGC/VG+DSVLR 
QP /GVTGEAPCPGASS PQASQAARRGSM 

G * / PGRRGGQSGCS CPKG * AWWRPCLAG 
SHSSAKHKGMSSEGPAPRGRAISPVSAG 
S IGPIAARGF 


1021 


14922 


A 


102S 


436 . 


1 


GGAPQGVLKRAAPFFFFWKQNFNFVARI 
I KGKGGVLS PROFP PLGFKKFWGPTPL / 
SRWGFKEGLKPPGS FWFFKKRWGFSMWP 
RWV*\IPGPKEL\PPPFSQRGGIQGGT/ 
HRPPIIPPSLF/CFFEMETHSVAQAGVR 
WRDLGSLQPL 


1022 


14923 


A 


1029 


1 

A 


436 


PPKKI I FSPKP * I FWGGGGPNFPPPKKS 
FFSQNPPGVFFSPP*KKKKIFPPPPENW 
GPPKIF/YKRPPPPFFF 


1023 


14924 


A 


1030 


28 


366 


EDHLS PGG WGCNEL * LCHCTPA\ WRQNK 
TLSPKEKDHK 


1024 


14925 


A 


1031 


3 


142 


FDCSALQEFGTRLYCPSWSQIPELKRCT 
HLSLPKCWDYR/R*ATTPGLRIVLELQK 
KLRR*CRBLLYTPRS VTPNINDI * HWGG 
TFVTINEI ISIH* YIALAGLRFLSSSDV 
LTSASQNVGI TGMSHHTWP 
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SEQID 
NO: of 

ii micviiuc 

sequence 

"t 


SEQ ID 
NO: of 

sequence 

• 


M 
eth 

nrf 


! SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

11 u ncuituc 

location 
correspond 
ing to first 
amino acid 

1 vSIU UC vl 


Predict- 
ed end 

n i if* low 

otide 
location 
correspon 
ding to 

laSl a III HIV 


Amino acid sequence (A= Ala nine 
OCysteine, D^Aspartic Acid, 

t? — (~1 1 1 ■ i i m i r* A rJH C — hpn vlaln nlna 

ju — vviuiiiiiiiL A.1.1U, r x iiciijmiBnine, 

G=Glycine, H=Histidine, I»lsoIeucine, 
K=Lysine, L=OLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
a = x nreonine, v— valine, 










peptide 
sequence 


acid 

residue of 

peptide 

sequence 


W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \ppossible 
nucleotide Insertion 


1025 


14926 


A 


1032 


7 


361 


PALR P S ARGQQ YYDBT KNKTLCRNAQND i 

* KAP FGLYWNCGARVCR \QG I SAK* TEA 
CGLGTI KPS FFIi I PLKQGELLG YPA YNE j 
NLKKKXKK 


1026 


14927 


A 


1033 


472 


3 


TKQQMLNVDRTAFYKKMSSRTFIANAEK 
SMFGFRAS KEG * LLGANTTGNF\LKAML 
1 xHbKNPKTPKNK?UGSTIjIi v P* K*WNKA 
LVAAHLFTA* FTEYFKPPIE IQIITADN 
APGHPR\MYKEMNVFMSAN\TSIL*PMY 
QGIISCRIPAARIEVKGA 


1027 


14928 , 

f 


A 


1034 


488 


0 


PLR* ILAQRSGIHSMKT*SGEAAAE*KS 
EARSSWFLRFKEKSHLHNIKAQDKAAST 
DGKAAES YLED\TD*DGNTK* ICNGDKT 
AFY* KEMPS RMFPAREELMPGFKASKDM 
LTLL LRANAVGD VNLKPMM I YYTENPRA 
LKNYDKTQLC/PVLYKW 


1028 


14929 


A 


1035 


114 


A\ r\ i 

491 


NYFKNFTFSNKLTL/ CLL* IiFLL* KLFN 
F*TF* LLC/I *HLA*NTNTLYSCTKIFS 
FFI FLFYKHFLKFI YFFYFLDFI I * GAY 
* S LCYY* WYFS CQFCQVLFHVFWGLCC 
HVLVRLLYFQTDEPFY 


1029 


14930 


A 


1036 


1 •* A 

114 


496 


LLRAILTYLKYKISAMNLVSA* FI YLHL 
TYHCVF /DHPVQGR* LLNK* INELFCYR 
S FG F* WVFS YSHLS ED * ALEE KYLRERS 
RWVK/DLNVTSETVKILEEILGICILLDI 
V9JJV9 KJi r MMK 1 o QAM X 1 K.f 


1030 


14931 


A 


1037 


381 


45 


KNRGRKKNS * LGGF\SDKFYKTF* KLIQ 
IIYKLFQKNEKEGTIj\*NSRPISLINIE 

aniqn/ravlami iqkhi kki irhnqw 
f/ipgr*g*sntcksi\nvihhikqrri 

KSIEF 


1031 


14932 


A 


1038 


134 


394 


SFCDKFTNCISESLCQ/SYSPTYIIKIF 
VLFKIRSGSITQPGVQWHDHSSPQP*TH 
SWAQTIFLLSSQWGTKGMCHHTWL/IF 
tiFLFVF 


1 032 


14933 


A 


1039 


483 


30 


YDGGSE I INYVLESRLIGTEKFHKVTND 
NLLSRKYTVKGLKEGDTYEYRVSAVNIV 
GQGKPSFCTKPITCKDEL/AYVSTTIYT 
SETCTFVDL* * D INKNDLPL * 1«Q ILAPP 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
DeDtide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ine to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 1 
otide 
location 
corrcsoon 
ding to 
last amino 
acid 

residue of 

nenfide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E^Glutamic Acid. ^^Phenylalanine. 
GKJIycine, H=Histid ine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asoaracine. P=Proline. 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V« Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=£top codon, /^possible 
nucleotide deletion. V = nos<;ible 
nucleotide insertion 














AHIISQEKEIKKIQI*YKEEKLPLFTSS 
SLFI*VKNLMEFAKKLIiELINEYNKVER 
YKINIKNILLAKNTWTLKF 


1036 

■ _ . 


14937 , 

. - 


A 


1043 


50 

K 


457 


TKYQLANKNMKICSTLPI IRE/MQIRPH 
E I SI/TPIMMAT • <3 EEOKNN1CYWGGCREVE 

TLVHCWW* CKTVQSLRKTVHRFPK* LKT 
EL P * ELPPK / PGS ORD I CTPMFIAALFT 
1AKRFWKHPKCPSTDE* IKEMWYIQS 


1037 


14938 


A 


1044 

i 


2 


469 


LKQSSCLGLPKCRDYR/R*ATVPG* FIL 
*NTTEDPNLTKN* ILNTNH/WCYSVQLI 
LGESNSTIKTSLSFSQASSHQNTTVQFL 
STS P S FKNCQWLAI LPRK+ GLTLS FCFA 
FEROGTj /Nfi VAOAG AG WTJIHGST .'KAOP P 

RLKQ/FLPNCPTLMECYRLQP 


3038 


14939 


A 


1045 


42 


200 


LICT*MLIVALLVIARNWKQIRC/SST/ 
DEWLNKLW* I PMEYY*AIEKEPTIGTC 


1 AT ft 

1039 


1 j< ft/t A 

14940 




1 r\AC 


475 


io 


CLS \QQSRF I FKDTNRLNI KGWRK\ TFH 
ANSNQKIA/G VAVYVSNKTN \ FTRNKE * 
NYILIKGSIH**CIIIINICATNS*NI»K 

I *NKLTEMKEE IHCSKITVED/ YNTLF* 
TMDROKFFHRMO M 


1040 


14941 


A 


1047 


3 


326 


FFFFI I I I I FETERDSVSKI I I IIKNKK 
DLF*FLVTQVTPEITNQYISTLPLQSKT 
KWDRPGTVAHSCWPSTLGGRGRRI/TLR 
SGVRDQPDQHG 


1041 


14942 


A 


1048 


403 


15 


EVKNS SFSYRLMTE I MP FGYVSKLYN CD 

FKSIiLLLMWKLLHSAIFTSA*TPRGL* I 
YF I YMYVLETWSHS VAQAGMQWHNQGSL 
QPS PPGLKEFHRTDI VYAR 


1042 


14943 


A 


1049 


25 


457 


YMWWNSHHSCMFTLLFDSS FP P PSLCFI 

MSVFHLILKLQ *NIHNIYFTILTIFKCT 
VQWH*VYSHFSVAKTAT\lSRTCIiIPSS 
WD YRHMPLCLANFFIi* RQGLALLPRLVL 
TSWIiQ 


1043 


14944 


A 


1050 


14 


478 


FEPYCRH* TTRWN\CWWG YELVQSLWKA 
TEOYVLKPDICVF /LEPGVSLLGMYSKE 
LCTtiC\YQKTRMFIATLFALVKS* *LPK 
SS*MVE*ITKL*YFHTMEYYTAM/KNE* 
ITI * TTIWMNI /TI ILNERSQTQK 


1044 


14945 


A 


1051 


2 


465 

• 


GIDQPSKPIPL/TQSFTQTKA/LTFFNS 
VKAERGEEAAE /ER/VEASSD* FMRFKO 
RSHLHNIKVEGEAATADGETAGS \SYPE 
DPAI TDEGSHTKQQ I FS VDBTACCWKI S 
SRTFAAKEKSMS VS KASKDRLTLLLRAS 
AAGDLKEFPRLV 


1045 


14946 


A 


1052 


392 


24 


DFAPRRKKRETRRSKINSLTS* *KELEN 
QVKTNQKGSRKQEITKIKVELREIETLK 
TFPKKKVNKSRS WFYEK\ INKLTXJTLAR 
QINKKRKENQ/DTIRNNKGDI I TDPME I 
*TTIRGIPPLWR 


1046 


14947 


A 


1053 


204 


488 

■ 


CAVCYTQISLTEWVDL* SLDPGFETVSH 
LFDRVNQP * D CKRVTDMLMVF* NTaHFVI 
LFFLFWF/DFVFVLRQSFPVVTQAGVQW 
RDLGSLQPLPPG 


1047 


14948 


A 


1054 


498 


2 


SVWCLFLSPGITSHTYVPMIFKIGAKK 
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■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
Dcntide . 
sequence 


Predict- 
ed end 
nucle- 
• otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
neotide 

mj%*M&WrM\M *-* 

sequence 


Amino acid sequence (A=Alanine ■ 
OCysteine, D=Aspartlc Acid, 
E=Glutamic Acid. F^Phenvlalanine. 
G=Glycine, H-Histidine, I=Isoleuclne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
~Q=Glutamine, R=Arginine, S=Serine, 
T«Threonine, V«Valine, 
W=TrvDtonhan. Y^T v ros i n c 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion. V^nossible 
nucleotide insertion 


■ 


• 




■ 


• 




VHWWKS I LFNKS CWRNLISTCIRMKVDP 
M/LTPGTKINtWIKDLTPSAESIICLK 
ENIQ /THFHDIRLGNAFWDMTPKA*ATK 
EN* IPWITSK*QT\FCAAKNTVN*VKR* 
PMD*E/RIFAKHISEKRLICVIYKEQLE 
LN 


1048 


t 14949 


A 


1055 


28 


209 


IDGRSRGLWW\CWLASQLLWEAEAGGS 
LEPKSLRLQ*AMIM\DCTPPW 


1049 


14950 


A 


1056 


163 


479 


TSFIiGSQSAGITSVSHCPSREVFFLKLI 
HWRQGGQVALLVATPHSPCCPQYRLAPI 
PRARHDFACASLI FVCILLVHVIiLMPRS 
VAGRGVGSGLLG/CQAGR*I>TC 


1050 


14951 

* 


A 


1057. 


2 

- 


1258 

• 


ELFPWHFQPSRAASLWAKAIRDKVPGT 
RRWPVPTDGRPEGFAIiSEPHHS CLHGRH 
QGPERCRHMAASP*EAGNK*PKDXDGHP 
GERTETAAGVHTGCGGKGPAAAGGR*AQ 
GGSGEAGAAAGGAGAAVGGAGAAVGGSA 
GGRWPAGLI/3QHGAG * GEGP CRQHGPP P 

GRG * GPAGPRGGAAAERAGAGAMPAQGP 
AGAAAEPAEGEARPGAGDYGPAANHPGA 
RTG AGG AEG AGAAAGGLPR P AQAHR\AQ 
IHGPVPlrGTGGRSSS/GGIPDNMSHRLW 
QRHRSHGEASAGQRHPHPGPTGGER\GL 

EEAPIQQQVFRLCPGNLWR/RPPHHMRP 
* AVLL*NI FHISRRREDVMDPMPSSPIC 
PL 


1051 


14952 


A 


1058 


2 


437 


ERSVRTACCKQSRGLSSXiLCPPPAPRIiS 

luorv xuoo/uiarAcrnfVs \routurn 

SPGPQPLHPTjGPRS PS PSPEQ TVPSHPA 
RLPSLSPERTQG\PKAQ*SQLPPPS1AL 
AQPAPAVGGREEVAS PACPS WKDKSRLR 
AVPGSA 


1052 


14953 


A 


1059 


47 


456 


TRCYGTLLRLDAPGKLWTRKN* WASTY 
ARTDPJCSSASFRPT^PLPRGFOT.STjHFS 
LTSPSCLAFSANTHRGLRGENYHITK/C 
DMAPKV/HN*TQAWQWHDFGSPRPPPP 
GFGRSSCLGLPSGWDYSPLWHHEQIL 


1053 


14954 


A 


1060 


484 


13 


FPTSASQNTGTTGVSQHAQP 1 1 F/LFVE 
MGS PQ V/ GPAS LKLPTS RDLPTPAS RAA 
ESVSAHHH*FLNFL*RWASGFVAPSWS* 
TFLIj\ KOSSCLS1jPVC*DC!RCYNHWAWP 

HSVFSSQFIiSLFPFNFSFLFFIiSFSFFK 
DRISLCRPGWEFHQTVQWSR 


1054 


14955 


A 


1061 


1 


494 

■ 


GS PGHP VCGRRRS ALENPGQ PCSLQPGV 
VSGATGAI VRVQRTS SAWATAAAMGAYV 
E /TTRFRYT* KAGVGCDRLWGAWLKADG 
LGETKAEHTLHDG/ PPEDALYGLIEGDD 
TNFTIQGEWHCWYTGTLPDGTVFDTNI 
QTSAQKMM/NAKPLSFKV/ GIGKVSH/G 
WDE 


1055 


14956 


A 


1062 


3 


395 

* 


SDPSPCGGIRFDEMEKFLENYNLPKLKE 
/DKINDPTFVK*IGSVMKIFHTVEP/PS 
HKCFTSEFY*TFKE* IAPI SHKLF* SRG 
IMGTATPPVMFHLQENVIPK\PTEKK*T 
YRPVSLMNIN\KILNKILTYK 


1056 


14957 


A 


1063 


402 


260 


WYCPPKV/LLRFSVYSSPPEVWAVGSIK 
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SJEQID 
NO: of 
nucleotide 
sequence 

- 


SEQID 
NO: of 
peptide 
sequence 

- 


M 

eth 

od 

- 


I SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
'amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence. 


Amino acid sequence (A=A)anine 
C=Cysteine, B=Aspartic Acid, 
E=GJutamic Acid, F=Phenylalanme, 
G=GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, L^Leucine, MHMethionine, 
N=*Asparagine, P=-Prohne, 
• Q=Glutamine, R=Arglnine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y^yrosine, 

V T 1 rf*^ a m m mm m 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














AEL YMLRP LF PRTREVD * I F * ICPV/LK 
G S P KNKD F CP * GAKT FE YFYPATLGLNL 
KI*IFRQRW 


1057 

> 


14958 


A 


1064 


66 


379 


EFGRGCGEPKLRQCTPAWVTEEDPVSGG 
KKRKRKEREKT*REPSVSRPESH/PPSQ 
EAF WWQALSR PVL PNF I SGRS ASGLGK 
S EAI KS LFL YLAVCS TTEEQA . 


1058 


14959 


A 


1065 


223 


437 


KS CM S CDSNKI FFLRQGLS CSLTQAGVQ 
WHYLGSLQPLPQGPK* *SHLSLLSNWDY 
RR\PPR 


1059 


14960 

» 


A 


1066 

i 


414 


65 


NKSPDRNELAS * KCTQ* THTS IARDFFA 
KTIQWGKNSLFNIWC*DNWISTSKRIKTj 
DPYI I PYTK\TNSKWIRDLHVKAKTIKL 
LEKKGPGAWCPCNPSYSGGLLEARSLR 
PAWAT 


1060 


14961 


A 


1067 


418 


1. 


LWVFFFSRQSL/ DSVAQAGVQRCDLGSL 
QPPPPRFKRFSCLSLLRSWDYRCRPPHV 
ANFVFLVET\GFTMLARMVLNS * PCDLP 
AAASQS AG I TGVSHHARL * FCGF * YIHR 
VAQPSSQSI*DDHPIKKFCIL*QPTPFR 


1061 


14962 

• 


A 




24 


440 

• 


EV*NL YS ENYKMLLKEMRGDLNKWKSI P 
*S/WI*RIjOTVKMAVIiPKLIYRFNLIPI 
RI PDEFFVKSTS * L / CKFI WNCKRLR1A 
KTIF*K*RTK/SGDLILTDFKTYYKTMV 
IRTI *Y*QKDRCID*WDRIES/PEMNPY 
IY 


1062 


14963 


A 


1069. 


442 

i 


26 


IiRNQVSTPSSKTPRFFFFFFIiKRQTGAQ 
RHNHSSL* PRTPGLKQPSTSASGVSRTI 
GASHHTPL I FFYFFN* GKEGWGGCS CFV 
AQAGLQL\RLQMILPPQPPLKCWN*RHK 
PPSI/TYPIFLKRTP+SCFSKEKATRIPP 


1063 


14964 


A 


1070 


294 


46 


KNKLKRKERKENNKKKAKIND I KNKSML 
EKIHKDKN* FFERWKKIDKLLATN/RIT 
KKTQ I VI FMNKIRD I TTYPTDI KNVKR 


1064 


14965 


A 


1071 


141 


469 


PKKQG VQLTQNAPL P FR VGGKRGLCPKK 
XXERERDRE IISIIIIII KLPTKKTPGP 
DGFTDBLYQTYK* LTS ILIiKVFLKKRRE 
CFL/PYSMDPAVPAIALNPPP 


1065 


14966 


A 


1072 


49 


472 


EFQI IKK*NSFFADTEKALVW I ENQNSH 
NMPLSQSL IQSKALTLFNSMNAERVGKD 
AEEKLEASRGFWFMRFKKPSP*HKVQGE 
AANLDVEATASYPEDLPEIIDEGGHGKQ 
/ QIFNMDEIVFYWKMPSRTFPAREERSM 
PG 


1066 


14967 


A 


1073 


82 


419 


LTVGFRETRS SS WVCS S S PKRALHALRF 
TPALKF* FFFWSFSRLFFFFIjRDRVLLC 
CPGWSPTLGLK*SSCLGLPKCWN\YRYE 

PP 


1067 


14968 


A 


1074 


1 


407 


LEDGFFKITQSDKKEKKRI \KKCK*NIjQ 
E VW\DYVKHSNLQVI E I PE* EVKCLENL 
FEEVIEANI / PHLASYLDIQIQAVQRTQ 
RGYIARQTSPKHIWRLSKVNMQEKIR3 
LPKEKHL I TYKGNS I RLTAKPS PKPT 


1068 


149*59 


A 


1075 


3 


396 


LHAYHIVKLSP*PLTGALSE\LLMTSGL 
TM * FHFHS ITLLI LSLLTNTLTI YQ*WR 
DVTRESTYQGHHTPPVQKGLRYGIILFI 
TSEVFLFAGFF*AFYHSSLAPTPKLGGH 
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SEQ ID 
NO: of 

ii u cicuuue 

adj HCULC 

"l 


SEQ ID 
NO: of 

pep Hue 


M, 
eth 


SEQ n) 
NO: in 

26 


Predicted 
beginning 

n u cieonuc 

Jl/LilllUfl 

correspond 
ing to first 

d 1111 11 u m.lU 

residue of 

peptide 

sequence 


Predict- 
ed end 

n 1*4*1 a 

HUCIc- 

location 
correspon 

H ■ nrv 4n 
UlUg w 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

l?^/?|i|4 , nmiA A fill! PsPKahv InlnntnA 

Hi— v« i u u ni ic aciu, jt— r neny laianine, 

— li/pi n TT- Hicf iH inn T — Ycr\ latt/*SnA 

\jr—%jtiy%>mVj Ji—jniiiiuiiicj i— isoicudDC, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P«Proline; 

— 1 ii 4o tn ma RsAroSnina C^QarlnA 

v^^vviuianiincj Ev Arginiiic, o— ocrine, 

T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucieoiiue insertion 














WP PTG I TPLNPLEVPLLNTS 


1069 


14970 


A 


1076 


250 


47 


VAACQNAFCTFSCYWL*NLLCRK\MDLD 
ISI,LP*TKVNSRWIAGPNVRALTIKILE 
ENIGSAFGNRHSQ 


1070 


14971 

1 


A 


1077 


2 


393 

• 


PLTGALSNLLLNSGLAMRDDLHSITLLI 

LGLLTSTUTI yqg\ wrevtqestyqghh 

TPPGQEGLRYGTMLFITSEGFLYTGFF * 
AFYHSSLAPTTQLGGHWPPSGITPLYPV 
ESPLLNTSALLA*GVSIT 


1071 


14972 

+ 


A 


1078 

■ 


2 


500 

» 


LHVIGVSEGEEN \GMKQNKI FEEIMGPN 
F/PNLVKYINA* / IQEMQ*TPNGIHLKK 
TVHRYIIFQLVRTNNKERILIVAREKWH 
GIFGGTMIi*MNDDFSSEFIKARKIEMMF 
LKYKRGISSSAKHC*SRILCVAQISFVS 
KG/ E ITTFSDKRKLRAFI I SRPAHRNAK 
GK 


1072 


14973 


A 


1079 


419 


1 


LRWS FALAAQAI VRWRNLGSLQPLPPSS 
SNSPGSWRLQ*ANTAPLRSSLGNKNETL 
SQKKRK* KDNSEWEKIFANHASDKGLVS 
K* I KNS YNS I TKN\ DI * KWTKYLNRHFS 
KKD I KMANKHMKRCSTS 


1073 


14974 


A 


1080 


468 


2 


KLPPPRGAGSSAP/PLFFPPTKKGFFSP 
PPP*KFFFSPNPLIFDGGFFPIFPPPKK 
NFFLKNPKGFFFSPP*KKKKIFFFPP* X 
FAPPKFFFKSPPPLFFFFFFFFFFFFFF 
FFFFFFFLRTRGRTRGRTRGRTRG 


1074 

• ■ 


14975 

* 


A 


1081 


467 


1 


PTYKFNAIPIKIQ* *FFKKIENCILKFL 
/ WNLKE PQIAKTVLK* NKVEGFTLPDFK 
\LTVTKTVWS * HKDRHRDE / WK* TESLK 
IKPYKYDQFFFFFF* FSFL/HFF* YLTY 
S QHHKL I FNRMPKP FNKERLFNK* CWEN 
W lb 1 CKX I KjuDHYijTPxTK 


1075 


14976 


A 


1082 


387 

• 


3 


LKQP P PHKSF VKNKNG VSLCCPG * F * T P 
GLKRSS \HFDLPKCWDFRCEAPVLSLLN 
NIKL*L/CGRGLIKPQFP/SVK*K**AP 

orjfi 7\T_IT TlDTOBPPB/VtT mT T Tii/rnritfrtMti 

b W S Atii4iU£ I ire r r RQGJjTIuj PWPQ V QWH 
DPGTVQPQTPGHK* SSHFSL 


1076 


14977 


A 


1083 


2 


435 


FHPPTNWGGFSPPPP*KFFFSPKALNFG 
GGVGPNFPPPKKRFFPICNPRGGFIYPPL 
KKKNFSFPPPLKLAPPRV/SFKRPPPFF 
FF 


1077 


14978 


A 


1 no A 

1084. 


447 


47 


Pt>RKFr FLiAPPKRGFFPPFPIjKNFFFPIi 
GGLFFG/RGVFQFFPPPKKGFFFKKPRG 
VF*PPPKKKKNFFFPPRENLGPPGVFLK 
KlrfFr rr c rr rrr t Pre rbsCtiJfvX. 


1078 


14979 


A 


1085 


438 


2 


AKNHPKGFS PFKKFS PPFGGGG I FIRGQ 
L*KSFFFFFKKPKFLV*KPPFKI FFF/ I 
FFFKQRVLAL/CPPGGNKGVPQGPFPIF 

FFFFCS FFL*RQGLTTSHRLVLNSWA* A 
ILPLQPPK 


1079 


14980 


A 


1086 


43 


265 


I KHKLE Y I KIKNFCASNSNR/ TERQPME 
WEK\MFANIjISDKGLIPRIHKQLL*LN/ 
TKQQNS PI* KWVKDQLQWP YL 


1080 


14981 


A 


1087 


11 


412 


IASGLHDFFNKKKKKKKKKGGGPLKKTP . 
GGPKFNRGGKKKI FS FKGGEKKTPGGFL 

eknpflgggkmgqnppkkkkpfgekknf 
* gerge kkp kfpggkkmsps pqe / nkap 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
nentide 
sequence 

• 


M 

cth 

od 

■ 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

f ng to first 

amino acid 

residue of 

peptide 

sequence 

* 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspnrtic Acid, 
E=Glutamic Acid, F=PbenYlalanine, 
G=Glycine, H^Histidine, I B Isoleucine, 
K=Lysine, L=Leuclne, M=Methlonine, 
N=Asparagine, P=Proline, 
Q=Glutemine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y«Tyrosine, 
X=Un known, *=Stopcodon,A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RDDPRPPE 


1081 

* 


14982 


A 


1088 


2 


292 


RS LPAS ADS S S LVAAS LAG VRDRVSTCW . 
PGWS/AV/VQTL*SARLGI»PKCWDYRCE 
PPCLAEAPSFMRSGKASCTLETVWEDKH 
KYE E AERRF YEHEAT 


1082 

• 


14983 

- 


A 


1089 

* 


376 


3 


H IGLY IASCKTLLKEI KGS * IDILCSWI 
ERLNTVKKAILLKIiI YGFNTIAI /KILS 
GliFIETERMILKITGKCNRSQTANTILK 
PNKVGRLALPNFKTYYKATVIQTVWSWH 
KDRWYGFFCVPTOI 


1083 


14984 

* 


A 


1090 

i 


277 


407 


QIKAERSHHKQITSMRNVKHCSSENYDS 
*RKPAPGYIIIK\F*K*LIKRKTLNSGR 
LKNRPIKEE*RYAYQDISETIKVRR*EN 
DIFKVL\KEKKSCQPRLLYPLKIS* IEV 
KYFFDKSKLKEVITNRS PV*EMIJ^I VXjO 
/INYDSPYEP 


1 Aft A 






i noi 




z 


CTHPGVIiFKG* * A*"KTRKSL*NC!S FFNY* ^ 
SFSNMQTRIKNV/WPSTVAHACKPSTLG 
GRGGQIT*GQEFETSLANTVKP 


1085 


14986 


A 


1092 


369 


3 


FLKEIRSVTPVNT* ENLIAD/ IGEKVLV 
VWT F-KTOT^HNI PT.GOST.T09 KVT. /TFN<5 

MKTER * /EKLQKFEASRGWFMRFKERSH 
LH/NIKVQGEAESADVBAAGVYPDPAKT 
I VKGGYTQ* Q I FNVDV 


1086 


14987 


A 


1093 


71 


506 


FAE DNGL * LHPCS CKRHDLAVF YGCTS F 
VLTFGL* PWFLTQS / FLNPLEFS 


105/ 




a 




1 15 


JO J 


KLRS FFSAQ * TKQSTE *RRQSTE/ WKEK 
IFADYSSNRGLISSRQETQTTQWLKKIN 
KNALCTL 


1088 


14989 


A 


1095 


419 

i 


1 


DAKI PGQMVAR/RI PRP IKK I FCPDQVW 
FI PGMQGGFHIGKS INVAPHIQMG *KSF 
NNIOHPFMI * KKKNLSTGTTHEGDITOH 
/R * MLHEHTTNAI LNRE KLKAFPLRTG I 
RQGCLL* PLLFS IAVKT* P* AMRQEKEI 
N 


1089 


14990 


A 


1096 


34 


464 


NSSKKEKRKI PHDLGLRFLDMTPKT* ST 
NGMMD / KDF I KI K/ FCAS KDT INKVKRE 
♦EKLOI I YL/DKRIiVPRIYKDSYYPlRO 

mimJ *m\m mm\m at 1 ^ V f 1\ >*» 4 ^ ^ tarn 

LKKWVNNLNKHCFKEOTQIANKHKKECS 
T\S I RE*QTKTR*RSHFISTRVTKIKMS 
G/N/NKYWQGC 


1090 


14991 


A 


1097 


458 


28 


RI I KVDLKMCTHNFDSLEEMNQFFKKSR 
WPR*NR*FE*SYNH*RN*IHNLKVSKSS 
GPDAFTGE FYOTCE EELAP I LGNLFOKM 
ESIRSNSFYKVRITLIP*GDNGS/TKKE 
NYSWSFMNLnAKVLIKILAN* I*VFIK 
REFH 


1091 


14992 


A 


1098 

A W *J 


41 


500 

^^^^ 


FWIRHFLSRHKQRKNR* MDF I KI KNLYA 
S KDTVKEVKR* ST * WENYFQ I / RMFDTG 
LVPRVYKEFI *LSNK\DNHI * QWGKKAV 
SRGFSKGNTQMAKKHMQRCSMFFVIRKM 
* I \ KP KMR YHFTPS SMT*EKNKDNTCFK 
SLX5EYGILIYYYWGI 


1092 


14993 


A 


1099 


1 


484 


RRAPFFFF FFKRS FPFWARAP / LQWAFF 
GLVQNPPPRQIPPPGLHPF/SCPNPPKK 
WGYKGPPPTPGKILGF* * KGGGPRLNKK 
IFISRPCDPPAPTP\QKVGIQGGTPPPR 
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SEQ ID 
NO: of 
nucleotide ( 
sequence 

■ ■ 

i 


SEQID 
NO: of 
peptide 
sequence 


M. 
eth 
od 

- 


SEQID 
NO: In 

09/515,1 
•to 

- 


Predicted 
beginning 
nucleotide 
location 

A A m+ MA|I mm. A wm. ## 

correspond 
ing to first 
ammo acia 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 

Iaaa^iah 

location 
correspon 
uing to 
last amino 
acid 

res jo tie oi 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
£/— uiuramic Acia, r— x'nenyiaianlne, 
G=Glycine, H=Histidine, ]=lsoteucine, 
jv— i>ysine, iy-L*eucine, lvi— ivjetnionine, 
N=Asparagine, PHProline, 
i^=vyiuiamine, jr.— Argimne, serine, 
•^Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
A-unKuown, — drop coaon, /^possiDie 
nucleotide deletion, V=pqssible 
nucleotide insertion 














P FFFFEMVS CS VAQAGVQS CDLRSLQAL 
PPGFEQFSCRGMPLKWIGSHPSPHP 


1093 


14994 


A . 


3 f sin 

1100 . 


144 


260 


tt tt near PUDCDUcraiTLV'Tninn/T r»r 

NTMYIIQPLDQ 


1094 


14995 

i 


A 


1101 


265 


457 


DPAAPLLVTYTRE I \ * MY I TKRYTAALF 
IKTKNWK\RPKCPSKGE* INKLWYICTM 
EYCSAIKNE 


1095 


14996 


A 


1102 


387 


40 . 


RIJRADILRILVYNNFLQRLKKQRYLFTG 
PQYRFLFLELWLCVC^CVCVCVCVRANF 
*IISRDKVLLCCPV\CFCFLNPPCVCVC 
VCVCVCVRACAI INKLLVETRSCSVAQS 
GLR 


1096 


14997 


A 


1103 


465 


208 


QNMTKTFQDYRPISLMKIYARKQQYRyR 
Y*RMLHNYQMGFTLGIiRGWFNI * K/S 1 1 
IHIN* *EKKIMIILIESEEAFDKC*QSL 

■mm 'M* 

II 


1097 


14998 


A 


1104 


57 . 


250 


MTCSRMQ/RNI KLLRYKSBKL *EENSTS 
RNEITILSEEDSFTNVKLEN*MKTVKQK 
KKTSKKTGVYKMAIKKQISEFKGKTSNF 
Y* EQINT * EKKNFKENWSIi 


1098 


14999 


A 


1105 


2 


378 


YVDPRQCGGILKGVLKGEFTISNBKANP 
GRGSPSSVSKD/L+ CQNIKTIESKKTCL 
I QKLI KVKAEI YERESRKTMEEINETK/ 
SQFFEKI /NKID* / PIARLTKKKKKTQV 
LKSEK\GNI TTNHTE I KRV 


1099 


15000 


A 


1106 


291 


1 


VSIYVSYNLDSCKGQIRIKSCDDQYIFK 
RIiYFFEITFLKFIjN* k* vlkkirdnvll 
C/YPGWPQTPGLKQSSO^LPSSWDYRH 
VPLHIiAP INYFLW 


1100 


15001 


A 


1107 


24 


397 


DIDHVVIEFETVFYHVGQAGLKLIjTSDDP j 

TALASQSAGITGVSHRTRPSSGHFNNTP 

EPPl^SIiS9HPKIiHKSPVT*NGA^i/Y 

GSSKLLSCVLNGPISLVHSTIiRLRKGGP 

QGNISQISIiMAPP 


1101 


15002 


A 


1108 


64 


414 


AQLTPTQPQGRAA2jHKGHV*RKTAPTCL 
FMAEKNQAASFFIiFPFPSICINKE/HFK 
KKKKKKKKKKKKKKKTRQKKKKGPEKPR 
KQPGGGLLISKIIiVYAPPPGFFLTREGP 
PPIFF 


1102 


15003 


A 


1109 


62 


475 


FEPLFYLMCr.iTtNLFPLQLPRHPFLFLTV 
DLVOTWGCPLPSSPQ*EWLLAAPHRSTP 
PPLSSGFPAHRQLEPGAGARGP/HHTQA 
LHLSFFFVFLRRSL/DSVAQAGVQWRGL 
GSLQPLPPGFV\ILSSPLSLPSLTY 


1103 


15004 


A 


1110 


16 


477 


EEWKKIiYRWRENISNLISDKELIC* IN/ 
NQTFNI QPLKTKKPTKDVNDLKTFTNED 
TQKYTYIHQ*R\SHMKR*SP*LFI\KMQ 
MSTALRFLYPPITMDRIQIPBNIKFWRG 


1104 


15005 


A 


1111 


1105 


3 

* 


KEERSETLAKGKLIAAGA/ YN* KEERSQ 
I NNLTLHLKGGKKTNDKPS KR/QE / I *Q 
IRAEISRIRPRKKKEKNNEFVFLKINKN 
FKTHSYIKKKREISNTKIINKRGDSTTD 
ATEMKKITRD*CEQPSSSSSSSSSSSSS 

ssssssssssssssssssssssssssss 

SSSSPPKKKIPGPDNFTGEFYQTFQEEL 
M PALCKI FQ\NVQKPEGD I /PNPFYQVN 
IYLVPEPDE/QPFVIKTLKELVIYGN/Y 
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NO: of 
nucleotide 
sequence 
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SEQ ID 
NO: of 
peptide 
sequence 
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M 

eth 

od 


SEQID 
NO: in 

I ICCM 

UoaJN 

09/515,1 
26 


Predicted 

beginning 

nucieotiue 

location 

correspond 

ing to first 

amino acid 

residue oi 

peptide 

sequence 

* 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
Hi— Glutamic Acid, J<=PnenylaJanjne, 
G=G]ycine, H=Histidine, J=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
QrG fu tarn ine, R-Arginine, S=Serine, 
1-1 nreonine, V^Valme, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














L/K* IKIIHEKLTANIILNGEKLKSFPL 
GSGIR*QHLFLPLLFITVLEFLLRIISQ 
EKVIKRMQIGKEKV\KILFTDNR1M*WV/ 
ENP/VNSQNIGKIMKFNKVSGYSNTHKS 
VAFP * TNNEQS ANKI LK 


1105 

• 


15006 

• 


A 

■ 


1112 


37 . 

■ 


466 


DRAAEFPTENPLELISKASKVEEFNVSI 
QRPIVFLCISNKQLENEILKIQFH* /PI 
S VAS KNS KYGG INLRKYL* DMH IDNK/ P 
LNKNRGGLCSWSATSVLPKLIDNFSAIS 
GNYSRFFFFGEIEKLNLKPIWKYK*VRV 
AKTVSY 


1106 


15007 

i 


A 


1113 


33 


436 


PGLN* * CWKNRISTCKRMKLDPNLAPYT 
KITSIWrQDlNIRPVriKILKENRGiCaii 
QH*MWQ* FF* I * PQKQE * TNRTKSDFIK 
TAQQRKQQNERQSVEWEKI / FANQISNK 
ELLSKIKFRLLPLDNNKQLT*KWA 


1107 


15008 


A 


1114 


2 


391 


PLTHSRMAIKKETK\NYKCWQ*SGGIGN 
LGNCW * ECKMF * PLWKIVLQFLEKKKRN 
TELP+WPLIPILGIH/YKRNVCIQMFTE 
ALFIITK/ SWKQPR 


1108 


15009 


A 


1115 


3 


458 


I RD PLEEAVCP FADIj^CLHAGRTTTLFRA 
VRQGPXiS L * KLLLP FVQLSHVPRGGVYR 
GSQASLSCGGLHPVQAS*LLCLPTQASA 
MAJDAP \PPASLPPCSSISDCCASSERGS 
VGMGPSEPGMGYNLLVCRLLRPLEKPSI 
RVGVSLFSRYHM 


1109 

♦ 


15010 


A 


1116 


204 


13 


PKMKTDSRWILGLFF*FLRQSL/SSVIQ 
AGVEWRNPGSLQPPPPRFKQFSLENQGI 
PPHGLWS 


1110 


15011 


A 


U17 


34 


454 


IHISWEFLTSKLYGrLTSQYNKKTNIL 
I K*MGKRFNAHFT* EGI* I VNKHMKRLP 
TSSVIREMQTKTTVGFRSVLTREAGIKQ 
TDS /NQAWVRKWSNSTIxLCCWWKHRVLQ 
PTWN\TAWQLLIYKMIWQLHFYAQKDIY 
VN 


1111 
Jill 


15012 


A 


1 1 1 o 

U18 


83 


451 


CFLKFFLYRELI S PFLL *Q\ FVRPS PAF 
RRKPPPWVAFFFSDPS / FS FLVFFPKGI 
VFFLGDALKKVLT * KKNFFFFGRD* VLL 
CCPGWSRTPALKHSS CLGLS KCWNSTTR 
P 


1112 


15013 


A 


1119 


316 


441 


F I FFFFLKTNFN FFAQVGGHNRNDG* LK"~ 
IiPLPGLKQFSGLTL 


1113 


15014 


A 


1120 


67 


312 


RQIDQWTRIES*ETDPREYSQLIFDKGA 
NPIQWRK\LFNKWFWHNWTSTCKQIKKN 
LDIVIASFTKTSTSHRLRENLYRIHI 


1114 


15015 


A 


1121 


27 

• 


490 


GTQLHSREKKNSPFNK* YWEN* ILTCKR 
MKLDS YF I AYTKINS KWI KDLNVSLLGL 
LAKIK* SPKCKI QNY* TSRRK* KGKDLG 

KNFCIA\NKTINRMKRQPTVWEEI IANI 
ISDKVLISRICKKTNR 


1115 


15016 


A 


1122 


490 


48 


EHTNNKRDTLTVDIGKVLVI * TEDQTSC 
N/IPSNQSLIQSKALTLFNS1 *AERGEE 
ATE/EKLEASKGWFMR* KERSHLYNIKV 
QGEAASADEEATASYQEDLAEIIDENG\ 
KQQI FNSDKIALHQKKM/ PRBKSMPHFK 
VSKDRLTLLREF 


1116 


15017 


A 


1123 


2 


259 


I YGQL I FNKG VKT I Q * RKNS VFEJRCYWD 
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■ 
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M 

eth 

nd 
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USSN 
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26 


Predicted 
beginning 

II HVIvVHU* 
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correspond 

ino to first 

IV mil 
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nentide 

sequence 


Predict- 
ed end 
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corresDon 

Wl I ftJ \r U 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
OCysteine, D=Aspartic Acid, 

E=rj Intnmir Acid. F=Phenvlaianinp 

G=Glycine, H=Histidine, £=Isoleuclne, 
K^Lysine, L=Leucine t M=Metbionine, 
N^Asnaraoine P^Proline. 

Q-Giutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=TrvotODhfln Y=Tvrosine. 

X=Unknown, *=S>top codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














N* I S I WKRM \GLDP YLTPHTKINS KW I K 
DLNI RTKTLI CS S WRKKKRENPLKKKRK 
KL 


1117 


15018 


A 


1124 


206 


24 


CS I FFC I YLF I YLFI YI»F I YLRQSL/ DS 
VTOAGVOWHYFS SliO PTjP PGT \. GLF* RT 
SYSWDF 


1118 


15019 


A 


1125 


3 


351 


RRGRGFT I LVWAQT PDLM /HP PALAS QS 
AGITGVSHCARP I CLFFKRQGF/NSVAQ 
ARVQWWDHGSLQLQPLTLR * YYHLS\SW 

nVPPM /PaMT/5T?"B^7T?VTiYPnTTT.T.DPMP 
u x rv.ru. v i / r/U'iLVjf r vf iniruiiLurKVfc 

WISGLNELP 


1119 


15020 


A 


1126 


2 


349 


STNHKDIGTLYLLFGA*AGVLGTDLSLL 

T D A R T /2H DfSNT .T jCZ MTiHT VNVT VT IVU a PV 

IIFFIVIPIIIGGL\GIWILP 


1120 


15021 


A 


1127 


362 


2 


RVSLHCPGWSQTPELKLSSCIiSIiPKC*D 
YRCE P PHPAYLTS F I TEDS KNFEYVTTL 
F*NNIY*TMFIEIPP*VHYIGAHIiF*DQ 

\VQ*CDLGS 


1121 


15022 


A 


1128 


1 


349 


IYFLTRSHSVSRAGVQWHSSGSLQP*PP 
SSSFSSSLFLRSGWGGKGVLWS 


1122 


15023 


A 


1129 


367 


24 


INCVFYPCKGVTKKPWYT/HF*RKGFII 
INI FKGPFKLTF VCP EGG KNS ETLYP FC 

QGNYGVGWVGFF FFS AFVI F I KKLVLS P 
YYKN 


1 1ZJ 


1 CIV) /I 


7V 


1 1 JO . 


ol 




S VTQAGA* WRDLS LLQ PLPPGFKQF\SC 
FNl^KNHCYRDVPIiLLNTLCI IFKT 


J iZ4 




n 


1111 


z 




c c c CUV c oJ? c UUKUC i-i JrJ >j tr tr tr>j v c 

FFLPSSFSL/CLVSRDLLTLSQTKVQWR 
DLGSLQPLPSRFKRFSCLSLPSS*DYGR 
TPLW 


J 1ZD 


i ^noA 
1 jl/ZO 


A 
n 


1 1 to 
1 I JZ 


z 




FFYRFF* 9^nMR^P^TJXriAfSVHWPnT/5/ 
£<f IE ec ^ oyriu 0 c is jL/u^rvj v nniu/Uw/ 

S IAPYAS WVQVI ILPQPPPPGFESFSCL 
RFPSTWDNRHAPPRLAKKRNKI 


1126 

> 


15027 


A 


1133 


2 


221 


PS LLKKIiARHSGRCL* SQLLRNLRNENR 
/CVPAWVTK/GETPPSKKKKKNPPPKGP 
GAOTFETPGFGKPKGKIICV 


1127 


15028 


A 


1134 


331 


2 


RG PPP PLKE PSGVFR I FQFKNLE S RGGG 
ERGKQSPPLGPPIRRGLSPKPPPPEKGL 
TVFFFFFFRCKDGVSFCCPG /WISRTPG 
LK*SSCLGLPKCWDYRHEPPHPAPAL 


1128 

■ 


15029 


A 


1135 

■ 


3 

• 


391 


AVNTQMMRK*KSLIADVEEV*WWIEDQ 
TSHNI PLRQSLVQSKAIi/T*RAKPSMKA 
ERGICEAAKGKIiEASRG* FMRFKENM\ RI» 
HNI KVQGDTAS ADVEAAAS YS * DQAKIT 
* GGCX3YTKQQIFNTDETAFY | 


1129 


15030 


A 


1136 


107 


350 


TYSIVAI VMRFYFFI IFYFLRWSLAWI 
QACCCHPGNWHDGSLQPLPPGFHKFSCL 
SLLSS *GYRCPPPCQA\ IFIFLVEMG 


1130 


15031 


A 


1137 


35 


297 


TLMHYTNRVKEKNI 1 1 S I EAEKAFDRXQ 
HPHM/IKYLNTIKAVYNRPTASIILSG* 
NLKAFPLKSVTQQGCPLSPLLFNI I LEV 
LSTRP 


1131 


15032 


A 


1138 


378 


1 


PFTKINIiKWIIDLNIKHKTIKLLEDIIG 

ENLDNLGHDYDILDPTPKA*AMKKIIiIN 
, . — 1 
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lllg IV IliSti 
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ed end 

n tl a1a_ 
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location 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 

In ft) mif* A f\t\ ITssPIt An«rlr%t ahIha 

£/-*v>iui2iinic aliu, r— j Dcnyiaisnme, 
G=GIycine, H=Histidine, I=Lso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 

Q=GIutamine, R«=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
/v—u 11 Known, — oiop coaon,/- p *uossiDie 
nucleotide deletion, ^possible 
nucleotide insertion 


• 












VALC*NHESFCSENDTVIRMRK*ATDRK 
NP I YKWAKDIiNMLL 






xi 


* 


Z71 


JbJ 


MTKQRCQGNSM/WAKDSLPNKWC*NKWI 
PF/C/RI*NLNSYLTPYRKIK*KWIRDQ 
NITVTT KLLE ENMR FS FAPG I G KD 


1133 


15034 


A 

• 


1140 


28 


340 


LLYF* YKRGFTMLSRLVSNSRPQ/CDPP 

rSAb HTAla XADGSHHARXiIj F r LiE K \ EPA 

FGPPAGRKGANFG* REPS PPG FRGIPP/ 
LPPPGNWD\YGGPPPPKPNFGFF 


1134 


15035 


A 


1141 


94 


128 


GSQMPRHLVD*MTRHIATLRES\CYSR\ 
VYPRFIEFLHFDIQSTGQKSHR 


1135 


15036 


A 


1142 


85 


492 


VWVGLLS LEGS P S KFGN F I EFGVLLSSG 

gfsawrlfffvyflrqsl/nsvaqagvq 
qwrdi»sslqplppgfk*vlkqrgvclfv 
cfeteshsiaqagtqwcdlgslqplspb 
fkrfsclrnsrslrrdiepsegnqc 


1136 


15037 

< * 


A 


1143 


372 


3 


kmnr*pisikeidfivknlpk\kfgpdg 
ftgily\rhfkkei iqtvcnlfqkteke 
gmls \ i ykas iaqi pkpekdkqtkatd/ 
ykpsgikakiinrhiangiqqyl\hdqv 
rftpgmqgwsniqi 


1137 


15038 


A 


1144 


56 


A AM 

482 


TMKTLLYWQKNG * VDQWNG I ES PE IDP Y 
K\YIQLVFDRKAKTCNV/RKDSLVNKWC 
TYAKKKKTLNIiFFTPFTKITSKTYIiKNW 
N/SVNFLNrPLGKTLGDLGFPRDFKFLN 

IPBTT71VCT nwri / immr \TmvnVT/ l <e<*VMU 

VKKNKKT 


1138 . 


15039 


A 


1145 


190 


482 


RQGLALS PRLECSGVI IAYRSLKLVGS S 
DPPTLASQIAGITGIRHCPWPKTLFLAS 
V ±rie AHN b * irb lib F V PbP \ b JjP b c ruuA 
3QSKRSQ 


1139 


15040 


A 


1146 


138 


485 


IFFPSV*TIFLLLALFFS**MYL»IFVKS 
SLSIYSVYQF/SFLCFWSPISDPKSQRF 
SLLSFIVCFFVFPRNRVLLCHPGWSAW 
* S*LTAVSNS \LLKQSS 


1 1 Aft 

J 140 


15041 


A 


1147 


101 


426 

* 


GJJWKKr i IYKnbbbKSPijIIjFEKKKGVlj 
EEYSSFDI * *AIKVIYHISRKRK\KNQL 
ANLTDAEKALNKI * HP FMRKTTQQ FSNR 
SFLHLRKGIYKKPTANMIIKEQI I F 


1141 


15042 


A 


1148 


458 


15 


VFVCLKI CKHRKGTAKKWYYNFMGLPLY 
MRSWDQNGTM* KMTI YAPNIGAPKYIB 

yiLiMLxAWKb VlXbJN 1 X LiliCjUrb J. r Lib XL/ 

DKELPELNFT*NKTDLTDIYRTFHPMAA 
KYTYFAGTHGTFS KMCP VGFD IRVI LVB 
FRRSTYGLDW 


1142 


15043 


A 

■ 


1149 


1 


507 


KGPPAPPPYYKKKFQGGPKKFPP*SHPF 
TH5PPGKDP /VTf PPMT.VPP&JZVTnnrWPDPP 

KKKQRG KGP KNP PPWKVKPE KS F * PS PK 
KTKIWPPPPPGGQKKKKKPPPPKKKKKK 
KSVSSSPKKI/YRSLFCTNLKKSFPT*F 
*NLMP/TKLPV*EFPPPPYNSF*SAP 


1143 


15044 


A 


1150 


436 


32 


NLCMYVYMYIiIYVRTYVCIYSIYVSIYL 
IDLSLYHHHYLSICMCVSIYPIYLSLIY 
LCISLIYHLFNLCIYISIYLIYLS/YVC 
M/SSI*SMYVCIYLIYVCIFSISIiYYLY 
IYLSIITIIYLYIiCMYCLFYLSSIEF 
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SEQ ID 
NO: of 
peptide 
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eth 

od 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

nil itlnAiir^A 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucie- - 

otide 
location . 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A- Alanine 
OCystelne, D=Aspartic Acid, 
n» — ijiuiarnic aciu, r — r ncnyiaianrae, 
G=Glycine, H=Histidtne, I^lsoleucine, 
K=Lysine, l>Leucine, M=Methionine, 
N=Asparagine, P^ProIine, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /= possible 
nucleotide deletion, \Fpossipie 
nucleotide insertion 


1144 


15045 


A 


1151 

• 


3 


657 


QDCKIQNEHKIQNTLFbYTRNAHIKIKF 
HIiHSKKI IjAVNLTKHV*NFYAENYTTLM 
KEKI * I ERNKDLHRKQ \NLNP C I W I VKL 
W1VKIPVX\PWR*KSjJjI lPlllPAK\er 
GDKEKI ILKFIWKGKGIR*LRF* KIRIK 
GGLNLPN\LRLTVVTVIKIV*YWWKERD 
RYLNQ\ WKKIENS /EKLKPRKYVQLISF 
VFDTSG 


1145 

i 


15046 


A 


1152 

i 


430 


438 


TFWVKKFFLLNLPK* INPGFKKKKNKRG 
G*KRP/SGSQVLKKLRGKIALTPEGKGK 
I RDCFCPPPPPLRKKKKPPIRP * KKKKK 
KRKRD P P FGNH YSNS YRHTS CYFTLLHL 
TLQIFFTSRSPVAMSCRINIj , 


1146 


15047 


A . 


1153 


104 


478 


ELLLEGSPCLRAESKAGPAGRLVPTIiLS 
WRVQSPGVLCAWEEECEEPPSTGEDAPS 
LAIRETQ I KFI TRNHLS PI*SMAI FQTNN 
DN\ RTKRK* CWHGCGE IG/T/ LLHCWWB 
/CKLTFP 


1147 


15048 


A 


1154 


171 


476 


NSSDYYYYFLRQSFTLVPQAGVQWHDLG 
SLQVTSDDPDLR* FARLSLPKCWDYRRE 
P / RMSDSNY FLK* VIJ VN*KARA* LLFI P 
SSCQAG/ASAGQTLSVS 


1148 


15049 


A 

- 


1155 


3 


562 


PDYQTARRRQAPADRCQPGPAATICPR* 
PKARSSSMNRSNPTVTRTAGP * PSWKTC 
AASKSQTGDGDRKE LI PPNRTPNN* VST 
AAI K* QVRP YLTQKVPAG * T FCPHQKLQ 
GVRPLCRGP P\RP FGS PAVTDNRQAKLK 
TPTQ/HPADPPEQKRPPARPPVGRHAEV 
KQ PGP PFPAGPRPAPS TDGL 


1149 * 


15050 


A 

■ 


1156 


67 

* 


417 


TLSCETQPGQHGEAPS / PTKNA* IALPL 
S WARP L Y F PAP AR FTARGG PVFT S KRVP 
SPPNRRGWRPPP\QFCHAWNPSPKFLAP 
KIPPT/GPLPPNKEPSKENVNPG/PLCP 
L 


1150 


15051 


A 


1157 


446 


87 


PDCINVISCFLTKI YHVFFFL* RQGLTK 
L PRL VSNS * AQAI LPP / S ASQSAG ITG V 
SLHTRPIMYFSSYYIIFGDTVLF\ETQS 
HPAAQAGVQ* CNLGSLQPPPPGFKRFSC 
LGLPSRRMA 


1151 


15052 


A 


1158 


1 


938 


FFFFI FAT YLFNKGK \ LTGKRQSD *S I C 
KHIjKGY i 1 KKUXJjI Ao KJi/ V/ jloJj 
AI REMQ ILTG I IKCC * R CEGNG I FTHYW 
*ARALAQLLWKNV*QHILKLSMCITYNP 
TI LFLGI SNRKVYM/ CCPK/DPCSRMSR 
AAQFVI 


1152 


15053 


A 


1159 


370 


3 

i 


^DVAATlTPlV AMOX7nV^/*»/^n 1 tTT.KIOTJT7Trr Tin 

At jWViliuAm^bVi(juyrVbNbWVVJLw 

VTTTT/CPTTFIMPDNTV\RGDILELTR 
\ PLGAQ VAMAVHQWLD I PEK*NKHK/L V 
VTEEDVEIjMYOKAIJSINWVONKKKAAGIM 
YTFNAHAAIDEFHR 


1153 


15054 


A 


1160 


55 


471 


SPPPGLPKKIFFWPLSFYVWP*PQRFCP 
LFAP FKQESNPLKVWPPPGALQNKRS KG 
CSPSM/ SARQES*PLFPKNLETPPFGGK 
FLKFFFFFF*DGVSLGHPGWSAI / 1 DSL 
QPLPPGFKQ/SLCLSLPAS 


1154 


15055 


A 


11^1 


3 


467 


GYTNQQIFPVDKTAFY* KMPSRTFMARE 
KSMSGFKPSK/DRAGDFKLRSMLIYHSE 
NPRALKNYPKSTLPVL\KNKAWMKHFCIi 
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Amino acid sequence (A=Alanine 
OCystcine, D=Aspartic Acid, 

t?.=l"21iif a m if* A<*!r) F^Ph^nvlolanina 
cr*viiuiaiiiii. stviu, a ueny lalaDlac, 

G=Glycine, H=Histidine f I=Isoleudne, 
K-Lyslne, L=Leucine, M=Methionine, ' 

nls A end ratlin a P— Pi"/\ltfift 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 

WeTrvn^A n h a n VnT vrnci n a 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibte 
nucleotide insertion 










• 




QPPVETYCSGKKI PFKILLVIDNVPCHP 

GALMEMYTSETNVVNTATNTTQ TT.+ D /W 

SGVIS 


1155 


15056 


A 


1162 


216 


464 


SSEGPGSPGELPYGAQEFVMKAPQGILV 
IR*T * FFETESHSVAEAGVQWCDLSLIjQ 
LPPLRFNQFCLS LP \ S WDYRRP ERHPAN 


1156 


15057 


A 


1163 


29 


433 


AVEFGGDSCSPQHPLNPPLGSPQHSPPP 

LiILjIC 1 / Kxit'trWOUGxjx^VPuPPAxjP z P 

STPSWGGICVCYGRGGVAPPP*HPPLT* 
LGGGSTPTSALPAPGAPRHPGHPLMVLQ 
TLPWGPHTPARKPYINKVLSCVDF 


1157 


15058 . 


A 


1164 


486 


11 


STCLGLPKCWDYRREPPRPALRCPFFLN 
IxjxjFJjKFTLiSEINI PTPAFF*LVFAWYI 
FFSLFS F * P /DLEHYI *SG ICVDTM* VG 
SSCFFNPT/WSISSF*LVCLDHLYFFLF 
FFFLRQSL/DSVAQAGVQGQDLGStiQPP 
P PWFKRFS CLSRI PAHFEWSRA 


1158 


15059 

• 


A 


1165 


467 


131 


NRSLE WAKMYKTYDE I KLVTNLPT/RKS 
PGLiCKITAKFYHI YKXKjLVLLLNILQE I 
QVGFHPNS * YQ \ 1 1 LI PKYSEGTTKKGN 
CRPIFLVNIETKILHEIRAI * VHKQIRT 


1159 


15060 


A 


1166 


455 


32 


EEGVLKAKRGCKSGLIxHQTVNQWN/AN 
LEKFLKE I KSDTPVUI *MIRK*NNLTAD 
KEKVA* IQDQTSNNIPLTQSLIQSRALT 

KEKSHPHKHKS AS KAAS ADVEAVAS YLN 
S 


1160 


15061 


A 


1167 


26 


418 


KIWDYVKQTNIi*ITGIPERGGEKVNNIjE 
NI FEAIVQ/ES VPSI FKEVDTQEQEIQR 
ASSSSSS 


1 161 


15062 


TV 
A 


1 168 


64 


313 


KWCKCjINiS i X oiUJxjiU*jxjKx 1*j V /OLK* VQ 
/DL*SBNYQILLKEIKEDLNKWKDIPCS . 
*IRRLNIKMVIFSKLIYRFNAIPIKILA j 

A 


1162 


15063 


A 


1169 


464 


36 


QQAEAAESL\DPGGRGCSELRSCHCTPA 
WATEQSINK*SINQSIK*KSKLC/CWEN 
TLVKHI FHKRLTSR I YKEL\QQLNKKTN 
NSL* K* EKDVNR YF I KKI YEDI *MANEN 

FVLFVF 


1163 


15064 


A 


1170 


290 


487 


GSLPHHTPKR * PFLF/ CVFETGSRS VT* 
AGVQWHNHGSLQP* PPGLKRSSHLSLPV 
LIFLFSVEMGV 






n 

n. 


1 1 *71 

1171 


Z 


413 


l>A.Vc I'fa Xxxrl/ tjv-iNiyiJv x bKo \xj1>*AxjK 

FIFLFTTARGLTGIVIiANSSLHITLHDrY 
YMGTHFHYVLYIGAEFAI IGGFIH*LPL 
FSGYTLDQTYSKIHYTI ILIDINLTFFP 
HHSLGLSGRPRRY*DYPDAYTTRYILS 


1165 


15066 


A 


1172 


381 


2 


P P KLQKNF FFS STG KF FLGGGRAFS P P P 
KKGFFSQI PRRFFFPLPKKKKIYFCPPP 
CFGPPPTFFLRAPPFFFFFFLLLFCSFL 
VGVKFILFL/FYxxFIFRDTV+LCYPGWS 
AWQSWLTAALTPRP 


1166 


15067 


A 


1173 


9 


405 

* 


NLDKKGRNRTPQS WFQANPMASMTFS KK 
KKKKKKKKKGGGAL KKKP WGAQ KKPG KK 
KKNFFLKGG\EKKTPRGILEKKPFFGGG 
KKGPNPPKKKKPLREKKKF*GEKGEKKP 
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SEQ ID 
NO: of 
nucleotide 
sequence 

- 


SEQ ID 
NO: of 
Pcd tide 
sequence 

- 


M 

cth 

od 


SEQ 10 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

seauence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

E^GllltfUTlIC Acid F=Ph**nvi!»l«inin» 
mu> v»i u uniJ«fi» null) m a iJCfljr laialllHCy 

G— Glvcine. H=Histidinp T=lcnlptif>inp 
K=JLysine, I>Leucine, M=Methionine, 
N=Asparagine, P==Proline, 
Q=G!utamine, R-Arginine, S=Serine, 
T=Threonine. V=Valine 
W«Tryptophan, Y«»Tyrosine, . 
X=Unknown, *=otop codon, /^possible 
nucleotide deletion, ^possible 

nnrlpntirip tncprttnn 














*KFFFKKKFPPPSKKTP 


1167 


15068 


A 


1174 


410 


0 


FSYYPPPTRGGCPSSPPQIFLPPPLLGV 
FFPFSPLKIFFFPRGFKFFGGVVPFFSP 
PKKKVFFKNPRSVYKNPPKKBKK*SLQP 

PG*VWAPPGTFTTR /PPPPFFFVTFQ 


1168 


15069 


A 


1176 


319 


3 


! REARS VFRVEGRG* KNFSASGPPGGGNP 
GGGRPGRGNFVFL I KKGGP P PGPKG /S P 
FFDF/GGPPPPPPPSGGGSGVNPPPPPF. 
FFFFEMESRSVSQSGVPDAWADAW 


1169 


15070 


A 


1177 

i 


385 


1 


SAS FGLPKCWDYRHEPTRPASVSVTVTS 
SRIiS GA * ARS GKGTL VFWAQMVFK \RI P 
LTKYSDHSREPSSLQLCMQSTPSKAEFT 
VAKADKRLCCTAKSSTAKSIAQIKCQND 
ALsKo Jr U Ctb Ljxi Vb(£K V 


1170 


15071 

■ 


A 


1178 


147 


808 


KLKEIKKLLEEKAGINLYDLRLGSGFLD 
MTPKAKQQKKENLKWDVIRMKNSCASKD 
TITFYN\YRSDKGLV/SKKYKEI*NSITE 
RQPWF * KDLNXDFS K/EQMAKS T*KTTE 
KJjr IKhJlKIKl i /MJE/ xHr iiSTRMAKI * 
KDSNR / RL CRKTGTL IH CW * GTAPANLL 
KNCQPLFPF* GTVWQFLKRLNliELPDDP 

aipplvicpkemkt/c/cyteictqmft 

AA*III 


1 1*71 
I I /I 


I DU/Z 


a 
ii 


1 1 /y ■ 


I 


3yo 


oKoKxlL. 1 JPAW vJKv 1 ij VijKKKJilCEiKEiKBE 
KKRCPCYIVSHQSL* KPAWQFL/RNVK/ 
LELPYDPAI /PLLGKCLKE I * KYAYTKT 
CM*MF I ALFI I AKKYKQLKRLSTDEWIN 
KMWYIEAI 


1 1*79 


1 jv 15 


a 


1 1 Cfk 






n VTiCV T T3KTD * BT TTl'A \ T DAT T T T»/*VT 74FJI 

* FYLQSMWVLILGLLTDTLTI YHRWSDV 
ARKS TYQGHHS P PAQKGLR YGIMLY^5TS 
EVLLLGGLF* AFYHS \SLCPTPQLGGHW 
APTGMTPLDPVEVPLLNT 


1 \71 
VI to 




2k 


1101 

Hoi 






■ aMConRPTT rM^tiDar dt/^t onnnvotcn 
AIno o JJ/iv J. ijiJ^HKiVLilr ijy Jjf ir L«v» i b X o Jr 

AAQSSTVPMPLLLIPPPHCNRTPSIjWHY 
SPASNPTNPSPY\*TAPSPPPPIPRNNP 
FS*KFPYYVWVYTSLTTCSQPPVSSPNF 
TQPSVL 


1 MA 




A I 




"5 
J 




NYARS I LPMLCKQEKKEKKKK\ AWMTAH 
LFTAWFT*YFKSTGET/CAKEKIPFKIL 
VLIDSACSHPRALMEMCKEINVVFMPVN 


1175 


15076 


A 


1183 


1 


410 


PPLP * KFFS PGGVQS * KGGRFWPPP* VG 

VLPSSSP/IIFFTPGNWGCFSPFSP*KF 
FFSPKfiFI FnGGVGPFFDPPFTnJFFQTTT 

PPLVFFSPPFKKKIFFFPPPVNFGPPRV 
FFKRPPS I FFF 


1176 


15077 


A 


1184 


406 


62 


PPI INMLCSLAPPFFSPPPLRGVLPPFP 
LKNFFFP* GVYFWWGGGPHF/S PPPKKG 
FFPKFPPWVFNPPPLRKK/YHNFPPPGK 
FGPPRVFFKAPP 


1177 


15078 


A 


1185 


337 


383 


RS Y I S FQK* VKDLNKHFS KDDTQMAN/ K 
HI KR YS TLVI REMKI KPMTRYHFTPTRM 
AVPF 


1178 

• 


15079 


A 


1186 


3 


471 


LAPSDK* ENRLPGNRPL*EVRS PSARQP 
PHXiRSEEPLRPAATPSGK*GASPPSSHA 
VREGGGGQPPPGQPPRPGGEGRLCPAAP 



■ ti t 
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SEQID 
NO: of 

n UC'Ci) uuc 


SEQU) 
NO: of 

UCUIIUC 

sentience 


ML 
eth 
uu 


SEQU) 
NO: in 

09/515.1 
26 


Predicted 
beginning 

11 UklCUllUC 

location 
correspond 
ing to first 
amino acid 
residue of 

• nentifte 

pvUllUw 

sequence 


Predict- 
ed end 

fill J*ln_ 
II UUC 

otide 
location 
correspon 
ding to 
last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystefne, D=Aspartic Acid, 
K=Glutamic Acid. F=Ph en via Inning 

G=Glvcine, H=Histidine. I=Isoleiicine. 
K=Lysine, L=Leucine, M-Metbionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginlne, S=Serlne, 
T=Threonine, V=Valine, 
W=Ti*VDto o ban. Y=Tvrosine_ 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














TGK* GAPLP/ EP P PGIiGG 


1179 

• 


15080 


A 


1187 


2 


406 

- 


FLVETEFGYVGQAGLELLTSRDPPASAS 

tvOJ-V*Jl i 1VJW V Sc»- W O JJff L / (OO V 

EAGTS FGLS FLSSS WALS AQEGCLAVPS 
/SGSRGLLVGALLLWTKPSPQLSPVPAS 
ORIiS S IjS LMPPLPOPOHTiTHTS IET 


1180 


15081 


A 


1188 


160 


459 


NFMTIDMLCSAWIHFCSSGLDFQL* KL 
FKSQ*ENNLINKWAKDLNSFTIFSYG*K 
LAHEKMLNI I RE I YFTAIMS * YTLTRTV 
KVKKTDTKCW*GCTATGNLIHCCKNV*P 

T.PWTAWfA FKTCTT.NTYT.PYHPILTPT.SDM 

NSRE * KNI /CHAKMCAWIPI 


1181 


15082 


A 


1189 


232 


2 


KAP P P F FFFFFF F FFFFC * SGSHSVS WA 
GI*W/PGV*WHHHGSLQPQSPRLK*SSC 
LSLSSSWDYRYVPPHLANLKK 


1182 


15083 


A 


1190 


2 


402 


VFSWPSY*TPCLSALTtASVLSLL*QRS 
PRTL FI TNKC/ D F / PASHSS CRI PAGL * 
ALGRQGLFSCFFCFFETESHS FAQABVQ 


1183 


15084 

► 


A 


1191 


19 


390 


WCVPAVPATWEAEPRRSR \ RS KPLTGRQ 
SKSLSY/NKKKKKKKKKKKKKKKTGGGP 
* KKLLGGPKYKGGKKKKFFFFKGEKKKS 

GKKKFLGGRGGNPP . 


1184 


15085 


A 


1192 


485 


112 


QRPDPR \ SAEAAI KYFLTQATAS I ILL I 
AILFNNILSGQ* TI TNTTNQ YS S L II IM 
AIAI KLGI AP FHF * VPEVTQGTPLTSGL 

T J ,T.T *• fiTTT AD T S T T YO T <? PfiT .WV 


1185 


15086 


A 


1193 


55 


385 


THAFADAWADAWGL F KG I LT ENFPNLQ K 
YINIQVQGG *RTSSRFNTKKTNSRDLI I 
ILPKVTOKLSKKKKKN\AENKKILKYRG 
GP F * * KPGGAQF YGGGRKS F FFFFGG 


1186 


15087 


TV 

A 


1194 


a no 

408 


3 


Lyon \JjJjUun^U-uu V £\±JtrLjnE*£\. 

VCPFSDLQLHAERTTALLKAVRQGHLSL 
PRTiTJ iS F\VCLCPAPRGGACRGRQASLS 
CX3GLHPVRASRLLCLRKRAWAMASVPPP 
A<? IiP % Pr5 IiTSDCCVSNO * DSTV5RV 


1187 


15088 


A 


1195 


5 


371 


LFSTNHGDFGPLYLF P \A* AG VXiGTDLS 
LLIRAELGQPGNLLGNDHMYNGIVTAHA 
FVIIFFIVIPIIIGGFGN*LVPIiIIGAP 
DVAI PRISNI SF*LLPPSLLLLLACAIT 
EAGAGTG*TVY 


1188 


15089 


A • 


1196 


76 


404 


PT PLRTHDQS S KVSRYKVNI QKVGAFLY 
MT^STRTTGI * \ IKSTSFPFASPKVKYIiT 

INLTKCVQ/DL WKE KKL/NE I KEDQNK * 
/NVPFSWIKGPNIVN 


1189 


15090 


A 


1197 


so 


334 

• 


ILHMVSIPSISIY*HIiLPAG*AGTHIG* 
LP PA* FFWVMG/RDRVLLCHPGWNAWQ 
S*LWASNSW\VK*SSHLGLSKY*D*RH 
EPSYPASGTKLN 


1190 


15091 


A 


1198 


250 


1 


QY YLVS SNT * S 1 1 D FLQLPQKC I FTVGF 
SK* *CICGFIFKNI *LFF\FCLFESESC 
S VAQAGVQWQDLSS * QPLPPVFKQFS 


1191 


15092 


A 


1199 


2 


402 


TDARHHTWLI FVFLLEMG FHHVGRAGLG 
LLTSSDPPRPAKLNFL*R*GHAVM\CPG 
WPHE \TCLGLPKCWDYRCE/HTAPKPEH 
FLFFFFFSBKNFIPVPRRGGG/WKNLN* 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

- 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude* 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

seauence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 

E=Glutamic Acid. F^Phenvlnlnnint* 
G=Glvcine. H— Histidine. I=lsoleucine 
K=Lysine, D=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R-Arginine, S=Serine, 
T=Threonine, V=VaIine, 
^V^TrvntoDhan. Y m Tvrnsine. 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possibte 
nucleotide insertion 














MEPPPPG* KGFFCPGP PKYWN* KAP 


1192 


15093 


A 


1200 


50 


437 


STLYLEREKEQISLSVIVSSPLPSTSTG 
PTAAAQSS ISGPSLPLPPHNGAGDAPAG 
LG/YGQGPSGPPWGPS/SGPSPR*ALVC 
PTDPSGAARGGRGG\ SRGSCCAPAGPAG 
LGDHRPGGMGEGPAAPPKSS 


1193 


15094 


A 


1201 


86 


313 


PPPPGGYEJHFSFXXXXXXXXXXXXKSLL 
PPGKGPNPPKVGLPPFXXXXVSPKXXXX 
XXXXX*NPPHSPPRAPGGPP 


1 1Q4 






1 


1 rn 




dvlk/yidtqtiilrdfntp/lnsvmsw 
rqktn3cdnldlnlmlgqldl i d i srilh 

VSTR/ YI FFSSLHE I YSKSGHMLSHKAC 

ijwki p * TT T T? T T T . T T T .T . VUP 


1195 


15096 

• 


A 


1203 


1 


476 

■ 


PHFGPPPPPQDPRF*G\GPSTRKGFFPQ 
PPGFFPQAWPRAGQPPPGGT/EPVFPPP 
KHP*RRNPYPKFGSPPLPGKIHQGSPGP 
SGQFR\PPRWGKKRAQPTFWAHPPPIFF 
FF*GVLLCHPDWGTVARSR\PPQPPPPG 


1 1Q< 








1 


2o9 


lnTPUPTVHT J^TV / UC If rtMVTJM invert?!^ T M 

V vara 1 1 ULAsl. J\/ WolUJMJvKN If X ivbiJJLiIi 
MADEHPRRC\ * HHLTP PRTAQVKT iRT iE P 
NE/WQGCGEAGPATPCWGDAAPVQPLLQ 
IGRRFLKK , 


1197 


15098 


A 


1205 


459 


42 


KLKPMLIYHSENPRAFQDYAKSMIiSVLY 

CSE*EISLQI*LLI/GHPR/SLMEMYKE 
M3WVFMPANTTS ILQSMDQGVALTYEYH 
YISNILYEAIDNTENPLIDLGKANAWVG 
PG 






j\ 


i on*; 




1 


EMHRKLQCREKR / 1 KRN/ E INVQE L* YH 
YKRCNHHVMRMPE KEQRKEE I FKVIMAE 

YIYIS*LQKAKEKESILKETCQKTSKPK 


1199 


15100 


A 


1207 

■ 


7 


383 


LDIGCFWWEDKYSSCYFSLATSGKLKYF 
S FS CL THCFT jVCVCVR AHK \ C T * <SR Afi * 

CVCVHK\CI *SSAS * CVCVCARTNGI * / 
CQCFLVCVCAQV/CI*SSAS*CVCVCTS 
GI*/CQCFLVCVCVCVC 


1200 


15101 


A 


1208 


405 

TVJ 


Ait 


AEAGRSIiCPGGGGCNEL* S P PCPSAWVT 
\SETLSQKQNPHHHKKKTGQVRGQSSYL 
PLWEVNR 


1201 


15102 


A 


1209 


384 


2 


GVTHH/ARARFFC/LIiDTGFHSVFQAAV 
0*A\NHGSLOPOPPGLKPSSCLSLPS<3R 
DNK 


1202 


15103 


A 


1210 


403 

♦ 


2 


VPS IQTNGKSLTYFDFFFKQLFKTKLKL 
DASFVKVCVNVEPFRTFALSKTQSUiCL 
SLFIiLI F*TINGNFCFIjRON /LRSVAQA 

GVQWRDVGIjLQPLPPEFKQFC 


1203 


15104 


A 


1211 


156 


405 


eskqmliqcksqmlyc*ri*tv/clcvc 
vcvcvcvcvcltcgtkrvcivflfiwl 
nck/plcepcc*cnrqgwapwllpvist 

RW 


1204 


15105 


A 


1212 


70 


408 


KAFSLIiPPGVSPGPLCNPRRPNFIGGGP 

kkkpppppqrwtqgffnpppfffflwg 
g*gqd/pppvaqg*mdppprppk\grgp 
rheppppaf 


1205 


15106 


A 


1213 410 3 


kksmpgfkas kdrmtllleaqaagdf\n 
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SEQ ID 
NO: of 
nucieouae 
sequence 


SEQ ID 
NO: of 

»-i An w\ a 

pepnae 
sequence 


M , 
etb 
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SEQID 
NO: In 

09/515,1 
26 

- 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acio 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
umg co 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
n^uriuiamic Acid, p— r nenyiaiantne, 
OGJycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MNMethfonine, 
N=Asparagine, P=Proline, 
v^vviutamine, k— Arginine, o^oerine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Vn known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LKPVFI \ YHSEN PR I LKNYAKYT / PVLY 
KWSNKAWMIANLFI A* FIKYFKPTVBTC 
YSDKKIPFKILLLIENAPDHLRALMEMY 
KEIHAVFMPANTTS ILQPMDQGVTLT 


1206 


15107 


A 


1214 


3 


AAA 

340 


KIRVEVNKMYNRKLLEEIKKI KSLFFEN 
INKIDQFLPRLRKTRQKMQINKIRNEQG 
GISINIMEIKYSYKEML*AIICKKLNNL 
HEMNKFL/EHRVPKLTQVBIENLNYIYK 
K 


1207 


15108 

• 

i 


A 


1215 


1 


57 


RGLKIQNVNEI KS * CFEKIKI DKFLARQ 
NFKRKRFKFRD\*KGDIMADITEI*GII 
RAYY * QL * GNYLENLEEMGKFLVTYNLP 
KLNH/DIENLKKPVTT/REFKS VI KSLP 
LKKSPLHDSFTAQFYQTFEEE*VSVLKR 


1208 


15109 

• 


A 


1216 


350 


1 


QSSFFSIYPNFHLLSFLFCFKDFLFFYA 
SVLATNSLPLFFPQISRYFP\PFERERE 
RE RENWL \ CCLG WS AVMQS * LTPALNY 
W\VRQSSLLSLQAS*SYRCGPPHPANIF 
HFIFCR 


1209 


15110 


A 


1217 


139 


358 


KNTFFVWEAKRLFVPQAGGQGGNLSFK 
FPAPP/NLRG*SASSPSSGDYRPTPPCP 
ANFFFFKKNRISPWGSGW 


1210 


15111 


A 


1218 


361 


50 


WGDHGSLKPQPPLRSKNPPPLASIiVTGT 
Q* LG/HPVAPFP IQLI FFFFFLRRSL/D 
S I AQAGAQWCDLSSLQAPPPRFTPLSCL 
SLKKKGNP VFVI TWMNLEDIMLS 


1211 


15112 

• 


A 


1219 


1 


361 


LKQHVSNVEKTAFYWKTQSKTFVARE*K 
GEILSSNRGECLLQSFKGQPLLLGANAA 
GDFKVK\PMIiIiDKSEN/ PKALKNYTTS 


1212 

• 


15113 

- 


A 

i 


1220 


292 


362 


HI PWPAIQQAEVGGSLEPRRSRP*PPR 
LK*SSHLSLLNSWDYRNVLP/RLANFCI 
FLCPVR\ FKLLGSNHQNASVSQSARITG 
VSHHAWP ILKLKKLFVCLFEMGSHYVAR 
AGLK/PPGLK*SSRLSLQNCWESRREPP 
HPDTIH 


1213 


15114 

* 


A 


1221 


372 

* 


3 


RLFFLLPPRPKGDFFPTLLIW*RHGFSP 
PHVFKPPPLNLILGALKKKFFLPS/LPY 
VKFYFFKRAPLFFF/IFFFRDRVSLCWS 
+ PPGLKQYTHLGLPKHWDYRHESACLAN 
NHNNNlvTNr FFBTE 


1214 


15115 


A 


1222 


3 


291 


RSGDQDCPGQHGETLSLLSLTFVNLSLI 
CNL*TLSLIFVKIQKLAWHGGVRL*SQL 
/LRRLRQENHLNQEG \ D C I PAWATEKDS 
VSKKKKKKKGGPF 


1215 


15116 


A 


1223 


379 


19 


HMQILTIMR z YTP IRMAKMKKIG YTK/ C 
W * GCAATG YI \ WECKMVQS L/ WQNTWAD 
S*KLNTHLSYDIAIQFL/GFYSKEIKAY 
IHTKP CT * MFTATL 1 1 KSQI /MKQYKCL 

<? TD T TfrtTJPflTV T 
o x u x jvyivrcvs J. i jl 


1216 


15117 


A 


1224 


387 


40 


KKTLSTP\PEKHPTPPLFHPPPQKGGKA 
PPSFKPPPPKHPPPPKGISPPPP* IPP? 
PIYPPPPPAKPPPDFFt,*SPPPPPPFSP 
PPPP*TPPPFFSPPLFFFFFFFFFLLIR 
LLV 


1217 


15118 


A 


1225 


21 


477 


IVSRLITVRLQKPRLDPRVRPRVRRKEN 
YSSV/SFINIDRNVLNKIL/SSQIQQYI 
KKL I RHE * FWF / 1 PG IQGWFNI LKSV I V 
TYVNK\QKWKKHTIISVDTEKAFDKIHY 
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SEQ ID 
NO: of 
nucleotide 

sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 

- 

• 


M 

eth 

od 

— 


SEQ ID 
NO: in 

09/515,1 
2o 

•* 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
£>=vyiutamic Acid, r— Phenylalanine, 
G=Glycine, H=Htstidine, Msoleucine, 
Jv^Lysine, JL— Leucine, M— Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X= Unknown, *=Stop codon,/=possible . 
nucleotide deletion, ^possible 
nucleotide insertion 














TUT *TT T T^tFTAWt^T^T* T\ T T"^ T 1 ^V*T^T Ti»W 

LAXiXLI KKKKKKKRALPQSF IPGDLFKK 
PNGGFPGLKMMGRAPGE 


1218 


15119 


A 


1226 


1 


398 


ER INHTLF FLAEAQKRLPTGI SGRGRPA 
I PHNTPQRAPDHAYL PAALAAQHRAGGQ 

QAP ppg\sspss p yde vkdregdvtash 

GLRGNGWGS P * ATSLVLNNLMYMTAKYG 
DEVPGPEMENAWNAIANNEKW 


1219 


15120 


A 


1227 


3 


238 


dawvaghdg/ rtp* sqllrklrwehrfs 

SGDRVSDP* S CLCTLAWVAE* ds ss kkk 
KGG P FKGTKFNS RGGGRNY FFYGA 


1220 


15121 


A 


1228 

! 


207 


2 


nrvspccpvqwhdhsslq/prtpglkgp 
\sasaf*vagttgvhhhaqlifhffffy 
*drviqagvqwrnl 


1221 


15122 


A 


1229 


15 . 


413 


RKSVNVI IHHSE I LKKKNYMI IS IDTEK 
SFDKI*YPFMTKLSEN*DSLNLINNiy/ 
AKP / VANT ILHS KRLNAFP I KQGCLQGC 
\PPFLFNIILEVLASTKGRKEIKSLQIR 
VEEI /KPLSLFADNM/ IVYIENPKBS 


1222. 


15123 


A 


1230 


363 


1 


AADP FACRP VS PHPPLYLVTS SQSLLTS 
VS PKKQPPS AGMQLGVGDS LSGWGWGRT 
KKTRLFFFKTRS LS AAQTR RQWCEHRS Q 
P*PPGLIKRSSHLSLPSS*NHRHMPPHL 
AN\LCLFV 


1223 


15124 


A 


1231 


3 


402 

■ 


QANS CI FSRD FGS VGQAGLEP LT PGD P P 
ASASPRCRDYRC\GPHAQLLIiTFC* IS I 
LILV/CIS*MKDYFITCI YFFNYS*QI I 
FYRRASDFFP FLRQGLALS PMHDHGSLQ 
PHPPRLN\HPPTSAKGVTGT 


1224 


15125 


A 


1232 


3 


396 


FLSQHGFLFLFFAGIDKLILKFIWKDN* 
NNYEKED*RGVI TLP/QYKAYSVATVIK 
MVWHWQSDKHVDQ WNRE PRNRPNYYMS Q 
* FPL* YVSQRNENLPLHKNPYMNVHNGF 
ICNSRKQSRYPSTGEWVNKL 


1225 


15126 


A 


1233 


3 


395 


LPKCWDYRR* PPHPALF/LFF* KHPKFP 
KMQVK/WRKH*TENRIjVMYFSEKFGD*L 
KDQHKL YYLDMAFQFIFTFTYVMRYS IL 
FKFNIR*LTLSSVKMAVFLVETGF\TML 
ARMVL IS* LRDLPTS AS QNAGI 


1226 


15127 


A 

• 


1234 

■ 


416 


2 

• 


KIDKLILTFIWKCKRPLLAKIVLKKKNK 
I *RLPLPSFKTYGKBMV/TK/TKWGWPK 
NRPIKGTGF*VQKKKTPHIYGQLVFDSG 
ART ITN/ WY * DN * I ST * KRMKLDLYLT I 
YKINSKWIKDLNVRAKTMTFLEENIWVI 
LD 


mi 


15128 


A 


1235 


384 


22 


FFQEI*NAIPVNTQMVRKQNSHPANIEK 
VL/WWIEGQTSNNI PLSQSLTQSKALT 
LFNSMKA/E/RGDKAAEEKMETCRGWFM 
RFRERRHVHNIKV*GEMRSCGSIAAGS 


1228 


15129 


A 


1236 


379 


2 


SPYTKIHSKCLKGLNVRPLTFKPLEENP 
G/VMVPDLGPGKKFIS*APKAWATKTKI 
IQWGYIGLK/AFCPAKEAIPRVKTWPSE 
S EG I PAGHAFYEGF I FQ I FKGLQ PFHS K 
KKKDLILKLDLLSADIS 


1229 


15130 


A ! 


1237 


1 


407 


YRVGOAGLKLLTL* SAHLGL P KCWDCGR 
EPPCLTI IAL* SVFSLVLPAVTjIKLINF 
F\CRDGGQTMLPRLVSNSWPQMIC/LPW 
PPKVLGLQ 
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NO: of 
nucleotide t 
sequence 
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SEQID 
NO: of 
peptide 
sequence 


M. 
eth 
od 

■ 


SEQ ID 
NO: in 

1 TCC7V1 

,USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
Jb-Uiutamic Acjd, ^Phenylalanine, 
G=GIycine, H=Histidtne, I=Isoleucine, 
K=Lysine, L=Leucine, [^Methionine, 
N=Asparagine, P=»Proline, 
Q=OIutflm]ne, K-Argmine, S= 3 Serine, 
T=Threonine, V=*Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


1230 


1513J 


A 


1238 


3 


380 


ELSQRTCCMDJMS YYKAI VITMAWY* FK 
DKQRVQQNEVESPKTLLYIYRLWIYYEG 
DTTDKG*TF\NKWYVJDSWI / ST/ CKNM* 
FDYHFTQCI KIN/ SQWM 


1231 


15132 


A 


1239 


1 


396 


FADDMI VYLENLKDSS KKLLELVNQFS K 
VSGYKS KV/ NVHK3 VALLYANSNQAE / Q 
IKNPTPFTIAAKKQK/ YKQKNNLGI Y* T 
KB VKDL YKENYKTSLKE I I DDANKWKY I 
PCSWMGRIDIVKMTILPKAI*RFS 


1232 


15133 


A 


1240 


392 


58 


SFSMLARLVSNSR/PSRVLPASASQSAE 
IIGVSHYAWPSKLSF*LTIPQTHLSCNL 
FIYVFFERRSCSVTQAGGQWYGHSSLQP 
♦TPGLKQSSCFGLPKCWDYRLEALPRLM 


1233 


15134 


A 


1241 


500 


204 


SIiSLSFS*DGVLIiCLGWSSTPGLKRFSC 
LSLRSSWDYRCVPSSQTNF\VFLVBMGF 
HHVGQAGLELLTSSDNARLGLPKCWDYR 
RVPPRPAAFFLFFKG 


1234 


15135 


A 


1242 


2 


397 


NFMINNLPKKKALCPWFTGEFYLPFKE 
ETIP*VIRISLSL/IFSIIFQKIETEGI 
LPNSFYEGCI ILI SKP YKD I * \ ENYRLT 
SVMHIDARFLDSILANPIQQCIKVIHCS 
HVAFV/SGTQDWFNIQKSM 


1235 


15136 


A 


1243 


12 


362 


AGFYHVGQDGLKLLTSSDPPASASQSAG 
IKA*ATVPGLSPLNFCEVRFMESWSWKR 
P*RTS\VCSALAKCWDRNASQWDKLPS \ 
FPT/RM* SPCLHPAVSAFCGSASVLPGI 
IVTTSV 


1236 


15137 


A 


1244 


183 


383 


KQAGRGGSCL* /LPRCWDYRREIjLHIiAF 
MPG * LKK/ FFCKDGGLTMWPRLVLNFWP 
PVILLPRPPKVLG 


1237 


15138 


A 


1245 


410 


2 


FSRDRVSSCWPGWS/PNS* PQSDQLALK 
CWDYRCEPLHPAFC*MNFLKVSG*HGTS 
/DSKY/CQHVSPKNKDISLCNYNIIIIP 
KKFNI L* YI WFLDFFFFETE FCS CCW* D 
LGSLQPPPFGFK* FSCPSLPSSWVHRHV 


1238 


15139 


A 


1246 


440 


46 


KTEGANINKNTTYQ/NLWDADKAVFRRK 
FRALNAYTRK* ERAQLNLSSTLKTRKKE 
QNKLKADRR/QIMQI * * KLGKVENKQT K 
TIQQIH* TKSWFFEKISKIYEL*QA* *R 
KKGEKTT I TNVRNKRYSGRVG 


1239 


15140 


A 


1247 


266 


340 


RKCW PGAVAHT CI^XTLGGQGGRIT 


1240 


15141 


A 


1248 


3 


391 


DAWADAWGSRSRAVALFFFFWGFLGGGG 
LKTGF YFI PQVERRGLNFD * WNPPPPGL 
RGS S P PT / LP KQWE PGGGP PAPSNFW FF 
FE KRGFPQVTQAGFKLWN * GDPPAGPS K 
GVGI TGGTPS PHPLFLKKR 


1241 


15142 


A 


1249 


3 


323 


MHHHA*LVLVFFCGDKVSLCCPG*S*TP . 
DLKRLSHHGLP KHWD YRC/RATTPGFFS 

T PT7T?T?FT?KrDt3T7PT.T.DW2VDPQVCIT.irnWTf T 
iurrrf r r r* ±r ir'xjr Xjjjr" Jr I oijx\^1Nx\.J. 

FLKR FLE P LAP P KGLPLKRAKGR I 


1242 


15143 


A 


1250 


3 


405 


QPDS YS P QPGRLS AP PEQEGG PWLILPH 
ACAPSQI CVGPHGAS ILS YQERKGTKVL 
S CEGHCKLS S PVGLVGQSFCWQQPDAVQ 
WVPFRRRTQTPGTVAHACNPSTLGG*GR 
RITS /RPG VRDQPEQH 


1243 


15144 


A 


1251 


407 


2 


P FLKKTKKELP * GPAI PALGVPSFFBFF 
FLPYKRKBIKGIRTPPMRIAALFTIAKI 
LNQPKYPSVDI * LHKRHTSNTTLLSRMK 
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sequence 

■ 

• 


SEQID 
NO: of 
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M 
eth 
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SEQID 
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nucieoiiue 
location 

cuixcs po n u 
ing to first 
amino acid 
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pepuue 
sequence 


Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
last amino 

QCIO 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=A spar tic Acid, 
ji^oiuiamic Aciu, r—i^nenyiaianine, 
G=G lycine, H=Histidine, I=Iso leu cine, 

\f — ¥ weirta Y ~\ Angina IV/l" - lV/f a+liir%n»nA 

iv~jbysine, ju — j-icutine, ivi— jvieujionine, 

N=Asparagine, P^Proline, 

Q=G lutamine, R=Arginine, S=Seririe, 

T~Threonine, V=Valine, 

vt — i rypiopnan, j — .tyrosine, 

X=Un known, *==Stop codon, /^possible 

nucleotide deletion, \= possible 

nucleotide insertion 














H* WIKKT* /HTDMMEYYSAIEKNGILSF 

TV A TP M O Ti VT\ T MT G TP T r*r\ TV f\ VKTD V 


1244 


15145 


A 


1252 


1 


253 


VADVRES LDPGD * GCS ELRLCHCTPAWA 
T\SRTTSQQQLKKQNENTT*KTYGIQLM 
NCQIANYCSKLLYFRKMSNLQSNLKKKK 
K 


1245 


15146 


A 


1253 


3 


372 


j KKKAQKSGTIVLPCNPSYSGG*VGENA\ 
WARSQPGLIS*KQQ*QQQNQQQTTSKTD 
PYI YG/YLIHDRGGK\NTQGWDRLLNRW 
GWG I WLMTQRKI KLELYLTP YTRTNSTW 
I KDLNVAGCGGSCL 


1246 

. 


15147 

* 


A 


1254 


2 

• 


410 


KNLTPIVGLKAATN*ESIPPQT/SRKKK 
KKKKKKKKRGGALKKKKFKGGGGGKKNF 
. FKGGKKKKLGGGVKKRGEGKKPGGKKKK 
RFGKKSFFSRGGKKKKNRSSSSSSSSSS 
SSSS 


1247 


15148 


A 


1255 




487 


GRPGPTHAFWIDSFFMGNEARFINHSC 
DPNCEMQKWS VNGVYR IGLYALKDMPAG 
TELT YDYNFHS FNVE KQQLCKCGFEKCR 
GIIGGKSQRVNGLTSSKNSQPMATHKKS 

grsy/re* kknkkkrggrfkgsqfsspg 

MQG 


1248 


15149 


A 


1256 


118 


5 


MFIAELFI IA/RRWK* PKCPLTDEWINK 
M*YSHTM*HY 


1249 


15150 


A 


1257 


390 


1 


TQ KNT PTPKPKPTS QQHHQKNKPT PPG F 
FFFSPPGK\GGFFPSPLFWVPPGFFPPP 
VFKTRPPEFIFGAP*KKFFFSPPRSLNF 
FFJaRGP PS FFFFFFFFS * VGKEGS S PSH 
ENPLFVPTEGW 


1250 


15151 


A 


1258 


378 

i 


3 


GAFFFFFPPPKKGFFFTPFEFFGPGFFF 
S PPFF I PPPQIFFFGPKKKKKFLPPPPQ 
KIFFFLRPPPLFFFFFFFFFFFFFFFFF 
FFFL\DLD*NVLSGTSAPEKNQEIjLGMV 
AYACNPSTLiGGRGG 


1251 


15152 


A 


1259 


462 


287 


RDG\FHHYGQDGJjHL1jTS*STRLSLPEC 
WD YRCDL P CPAI PAVTL YQIQ YRPLGLE 
SKA 


1252 


15153 

■ 


A 


1260 


446 

- 


34 

■ 


NVRAET/ 1 KTLEVNTGYNLiUDLS * /GKA 
FLDARP KAQ VAKE KSRQ PSLHRRLR/ FC 
ASKDPIKEVKRQPTE/WEKIKHVSDKGL 

IflJVTtlVCT T PITT CMTiTn T * WltlVWT^T MPUra 

SK/ELQIASKHMRRCSASLVIRKCNEIV 
DPD 


1253 


15154 


A 


1261 


2 


383 


GSQRKWFLEMESTPGDNAIDIVEVTTKG 

SKILSNS IACYR* IFCKKKSQLMQ/QTS 
LWFYFKKLPQPPQPSAATTLISQQLYTL 
RQDSPPAKRL* FTDGL 




1C1CC 






I/O 


JV 


* PPGP\SDPPTKSSTREF 


1255 


15156 


A 


1263 


143 


472 

• 


TGAVPIRPSWN/RPPAMIFF*NAQGIIjF 

aeflasqrailweffekayhesvlrksa 
kglabkcpgklhqrvlvqydnalahfsh 

QTRTTRSSTRQFR 


1256 


15157 


A 


1264 


179 


489 


YI FFFLSLFFFLWPRPEYRGAITVHCSL 
NLPGSSDPTASASS VAGTKPH* YI FFFL 
SLF/CFSVAQA* VQGGNHS SLQPQPPGL 
K\YPTASASSVAGTKPHPLIFFFF*KKK 



606 



WO 01/64835 PCT/US01/04927 

i ■ 



SEQID 
NO: of 

mirlpnttfip 

sequence 


SEQID 
NO: of 

sequence 
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SEQID 
NO: in 

09/515,1 


Predicted 
beginning 

nitdpfittflp 
llUUCUllUC 

location 

ing to first 
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residae of 
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Predict- 
ed end 

n iiflpw 

otide 

Inr&tfnn 

IVmI HUM 

correspon 
ding to 
last amino 
acid 

residue of 
peptide 

Sequence 


Amino acid sequence (A=Alanine 
(Xtysteine, D=Aspartic Acid, 

IT — (~1 1 1 1 frt m 5 f* A /* iH (7=PHon vrla In n In a 

vjiu uiiii jl s\nu> ft QenyiHiBninCf 
(^Glycine, H=Histidine, J=lsoleucioe, 

*v~jujf sine) Lr*ijcutiiic ) iti— iTieiniunin&j 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S«Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *~Stopcodon, /^possible 
nucleotide deletion, ^possible 

rt li *>l nAfi/1 a i iieiw^iAii 

nuciconac insertion 














FL \ C F VPQAGGQGHDFG * L* P P PP SLKQ 
GGGLTLSRIGDYRGPP 


1257 


15158 

t 


A 


1265 


I 


536 


FRGGWGSVRRAPGTASCYLALAGGPPGQ 
CPPAAISCPSPLPWGS*TE/PYVPRRPG 

UriAAr/ raw w V FKirlj V FKH VFVjRLjKVIj 

QDGCGQAQDI QAAAS LMADTAPDSPGSA 
• GSVRALPACVPEISGS/SGLPPGAALP* 
VAA/RPTPGRHVDTHPRQHSFCGGQBOD 
IRAFPTLYLEVYPGPP 


1258 


15159 


A 


1266 


403 


2 


TSIoHPRGiMRLLRQGFISAPCGYMRLPR 
GSSLHPRGYMRLLKTGVHRCTPWVHAPS 
QTGVHLCTPWEHTPS *RFISAPLGAASH 
VT/GSLSIQHIYIFVFCLLRQGL/NSVT 
QAGVQWCNLGSLQPKLPQAQVILTK 


1259 


15160 


A 


a 


2 


iAl 

401 


FVLNPGGRS CS E PRSIjHCTPAWATRAYL 
QLGKKKKE KKKERGEENKNQGP PRPLKR 
EVGTPGQKKPL/WGGSNGAGQKQPA/QK 
KGGKKKADHKGQRG KKF* KRKREGGGSTS 
KNNSRGTGAKA*NPTIWGGGGKKI 


1260 


15161 


A 


1268 


49 


416 


LRGRALD PRLLRE CGDLGAP PAP EVALR 
AGTCWTR/CTL*APPRGA/DRSPWPPRS 
PMCKAG/DECQDGI PG\MKAWSCGLRTR 
QCPWP * PKLPCGPGHPAWRT* PLPQTA/ 
CGPGPAAPCAG 


1261 


15162 


A 


1269 


420 


47 

» 


GPPGWASFRIiNFPKARR*GEWKTPGESG 
GAFFS PPGKNF \ AGQN*GRPPNPPpPGP 
GKGG I QTRGGAGLG KNPFRFWGGFPNPG 
NK*GGGTKKKEGPPPPIiFFFFLNTDFCN 
LIKRDRLGVGAHL 


1262 . 


15163 


A 


1270 


404 


1 


AEILELKNVlDILKNVSESIfNSRIDQAE 
ERISELEDRLFENTQSEESK*KRI*KNE 
ARQQDLENS FKKANLRVIGLKEEVEREM 
GVESLFKWI 1 1 ENS PNLEKHI HI VQEGY 
RAP\FNSNKKTSRHSIINSHTK 


1263 


15164 


TV 

A 


1271 


387 


2 


TVCVI/trnfT T TDTrT T.T T*ST V*El/*»XYT/"\TJt»TT 

KKkKKjkbVXIXr VIjLiIiPIjK* rQNIQVWIj 
VRELEKKFTGKYW\ FA*RKI LPKPTQK 
RCTKNKQKRPRIHAPTAAHVAILEDSVF 
PGEIVGRRIRVKWTQQLTRVHSDKAQQN 


1264 


}5165 


A 


1272 


1 


393 


FRMGRL PGGGAPHFSDGVAGQRRSSPHR 
RS RGRAEALLTS QTGRLGRGAPH I SDDG 
RPGRDAP\PS * TGWQPGRGAPHFPD\GQ 

foKorVrll J. FJJiAaKr'vjlvL/HtrJlr r* I va vflnb 

QRLQSRHFGRPRQAAGRWRL 


1265 


15166 


A 


1273 


3 


317 


SSYETKGVMIASFSSREADNHTAFIRIK 
TNASDSTEFIILPVEVEVTTG*WKTDEI 
ESCFVFSIAYSGKVKNNC*I*FFFP\APG 

TVGQTRMT.TlPnTT "D TT^TIVTCT T? 
X I DO 1 Cil y i±JUc\j X Liii. 1 ^Apl\J\±r 


1266 


15167 


A 

• * 


1274 


92 


368 


LCPPGGRAGQGGNXiN*GVPSPPGPKGFW ! 
GPTYGGGGGKRTTRPGGGKPEKSFD/GP 
PG PPPGGTKGNPAP KNFFAQ KERG P FFQ 
NGGPGNKLF 


1267 


15168 


A 


1275 


401 

• 


3 

# 


DMCDWFKKEFSDTTPKA* SIKEKQLTSF 
IKIKSSCSPS\KNTIKGLTRQEKLWGKI 
FANHMSDKGLVSRIYEELSKLN\K*KLN 
F * KWAKQ * DHLLKKTH* *QICKERYSLG 
KGKITMRYHSTSI IMCI IKTIHS . 


1268 


15169 


A 


1276 


298 


426 


GATMLVLLF * LAFDLRQSL/NCFPQAGV 
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nucleotide 
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correspond 
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peptide 

seauence 
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Predict- 
ed end 
nude- 

mm U wB W 
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ding to 
last amino 
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peptide 

sequence 


1 Amino acid sequence (A=Alanine 
C=Cysteine, D=AsparticAcid, 
E=Glutamic Acid, F=Phenvlalanine. 

■ m ^ - mmi * m ™* M m mtm mm y m m MMmtUm 1 II1IUII 

G=Glycine, H=Histidfne, I=Jsoleucine, 
KNLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R~Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown. * s =Stoo codon. /=Dossible 

M mr m^y mm mm\9 mmf Tl lli mm* kl/ 9mM m— \J \J \f mm m § LF vdw M %J m mm* 

nucleotide deletion, \=possible 
nucleotide insertion 














QWRDLGSLQPSPPWFK 


1269 


15170 


A 


1277 


421 


1 


VSFKDQKL* INKENHPIKKWKKICNQPL 
PQ IRYMDANKH I KI FSTS LAARETNMKI j 
TT/RILWKTVWQFFKKVSIYLPLEPAIS 
SYLFTQEK* KH I DGSCS PSRVATAKLliA 
LQWRRPSWNCMHQRAGSWKQIGAPPFSK 
L 


1270 


15171 


A 


1278 


1 


218 


TRSGVQDQPNQHGETLSLLKIQKLASYG 
GVCV* SOLLRELROKNClANGPRSHHCT 
PAWAT \ EQNS I * KKKKG 


1271 


15172 


A 


1279 


88 


396 


TFLFS S S FFF FGGGVP PCP PGLRPG \ PN 
FASLHPPPPGFGGFPSLTS PRA*NWGPR 
PP PQATLGAFGLEGE * I FPPGRPPEP * L 
WGPPPPPPQRGGMAWPTPL 


1272 


15173 


A 


1280 


94 


399 


TDFLFL*TDFLFLCLCSLKNKIWVNEFR 
YGGFSLGVSNTQALPPSQEVNDAIKQMK 

TCHT.TCT.AK' /TiY TQTVRrTTKMOM*TjT^5YT? 
rvLiu cv Unix./ uivioiv rvv- j. 1 iuriV7i i I u kt i jv 

SS WQVKRI TWQL I VLNMLA YRAVFG I 


1273 


15174 


A 


1281 


236 


2 


TQSRLVFFYMTGPAVYLNHHALRTHQGS 
HliPFF PC FEME S H Q VTOAOW\ VWMPDT iS 

SSQPPPPRLKRFSCLSFPSS*DY 


1274 


15175 


A 


1282 


37 


410 


KKKTIiKKI KKIiCP PGVKGE FNPPF YVLS 
KVFPKKERGLFFKVTFVSLTPFFKKKNF 
KIGGGSGFPPPLWFFFFFY/RDRVLLCH 
PGWNAVTRS * LTTS VDSSDPS / CLRLPS 
SLDY 


1275 


15176 


A 


1283 


404 


232 


lsswdyrclpphpanplyf* +rrgftvl 
armv/gprds pas s sqsagi tgmshcaq 

PC3 


1276 


15177 


A 


1284 


3 


402 


MQIKITMRNYCTPVRMP*KKK\SRTKPR 
C*GGGGTTKILIPCWGDYKIGE/PLW/K 
S VWQFL I K*NNHLL YDPAI L PL X FYTRE 
MKTYVHT ITKYCNHNEK/CMOGFTEALF 
/TLPKTE/SQPKHSST 


1277 


15178 


A 


1285 


1 


253 


LRGKFMTLHS / S ILKRG* SQINN*TATL 
IK*KKKGKIKPK/RSRRKVI IMNRNK*D 
* KONNNKFNETRSWFFKNMKKI YKSLVR 
VT 


1278 


15179 


A 


1286 


400 


105 


FNFKKKKKKKNREVS E * QY * QAKYLKRN 
FTKNI * VAEKHMKNAOYVIKEMIiI * LTM 

m> V m aWA«4 At IA\A*4 *aK A IWm mmt my M M 

RYYYTPI K\MAKI KLTDNHKCQQLKPS Y 
MVGKEFGKFLIKLNM 


1279 


15180 


A 


1287 


400 


119 


TVFHHIGQAGLELLTSSDPPVSASQSAG 
I TGLSHW \ PAS S S FHSNHS CPVPLEPHQ 
VAQLDS FGYEKVCSFSN* VPGAQDKNDL 
SEN 


1280 


15181 


A 


1288 


157 


386 


NDLQFHP F YCLF/ DLFCQSLKKAGMQWH 
DDGSLQPLLPRFRRLLCLSLPSSWDCGH 
VS/PMPG*FCIYTHIYFLVKMGF 


1281 


15182 


A 


1289 


400 


124 


FDAPAKGQGGDFGSPPPPPPG/AQKIFP 
PflFSQ* IG * KEG APMGQP I FVS FS KTGA 
PPPGQGGFQIPTPCGGPPQNFGIQGGAP 
APGPGGGFF 


1282 


15183 


A 


1290 


419 


3 


KNFF FLERGFF F F PPGGRAGGGF* FPPP 
FFFQGKKISGPPPPKKRGPPGAPQKL/R 
EIFLVLKKKGGPPLWPGGFLNPAPKNFA 
RANFPKRGGSR VG PPSGGPNL FNLGGG W 
FFRFFFFFFFFFLVFETEFRFCCPGWSA 
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SEQID 
NO: of 
nucteotiae , 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M. 
eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino add sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, Phenylalanine, 
G=Glycme, H=Histidme, Msoleucine, 
K=>Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptpphan, Y=Tyrosme, 
X«Un known, *~Stop codon,A»po5sible 
nucleotide deletion, ^possible 
nucleotide insertion 














M 


1283 


15184 

• 


A 


1291 


397 


39 


WGQ VWAKD W I HGLGP KTQG C P P PLGVFG 
CP /Shi PPNLFFYGI PGGLKI FVHKNPP 
FPIKKEPANWIATFFPPGVPLFFL/CPP 
ETRSHSVTQAGVQ+RDLVSL*PTSGFKR 
FSCLSLPSSR 


. 1284 


15185 


A 


1292 


399 


1 


LNFFFLHRQKLCNILGSDDKVPAFS I YP 
EPRICYVFSLLFEIIFSFDFNKKIRIYK 
KHVS R I YKDLLKFNNKDNPVKKWAKDLN 
RHFSKDGIQKVKWHIRNC+ TLAIK/EMQ 
NTTTMR * HLMPNRMAKI RKT I 


1285 


15186 ' 


A 


1293 


412 


1 


ARMVS I SRPLGTP I WRSQKGGNKRVSPP 
TRPNPLFLKNPKKCPKSRD*NKPPLLKK 
KKKWSGPCPAQ/SCPGQNPHP/SPPKQL 
PPSPSTPPSRPSPTLAFPMAS*LAISPA 
HTHCCWRELSGTHTLS I P CLKPLRGP 


1286 


15187 


A 


1294 


229 


1 


FFFETESHSVTQAGVQWCNPGFKRFSCF 
GLSSSWDYRYAPPRP\ANF\*FLVETGF 
YYVAQAGLKLLS PGDLPALAS 


1287 


15188 

• 


A 


1295 


401 


2 


STL P VH * KWTTKAWMTAHL / FTAWFTE Y 
FKPTVQNYCSEEKIPFKILVLIDNAPGH 
PQALMGMNKEMSWFIPV/NTTSILQP\ 
ADQGVIFTLKSYCIRNLFQHQCTPAWVT 
QLDSVSKK* INI FQPGMVAHACN 


1288 

* 


15189 


A 


1296 


1 


336 


KKTPRRKTHKEHHNGEK\LRAFPLRSGI 
KQ/GCAF*P/YILNIWA/VLAKEYGKE 
I KAS FRKEELKLCL FIDM 1 1 FVE I PENS 
PQKL\ IN* FSKVAG+ * VSTQNSVAFLFF . 
FFFF 


1289 


15190 


A 


1297 • 


3 


384 


HTNMQKVLV/ VKTEDQPSHNI PLNQIL I 
QNRSLTLFNS I KAERSEEAA* EKCEDSR 
GWFMRFKKKK/RI S ITKMQGEAASAWQ 
AGTSYPKDLAK/DEGSYTKQQIENVNET 
VL Y * KM I / PTEDFWRKDKSM 


1290 


15191 


A 


1298 


3 


395 


SRSVTRAGVQWHDLGSLQSPPPGFKQFC 
LSLLRSWDYRLS PPHQQMGGI FL/VFLF 
FFFPGKGVFLGGPQAGGKGPNLG*WNPP 
PRGLGEFFGLNPPRGWGFQ/HPPPHPQL 
F/CCFFRGKGGSPRRAGRAPNP 


1291 


15192 


A 


1299 


260 

• 


4 


GFIVSSAVFPLLKCIiLDILVSSSVLLVC 
LWNFKIQKSFRFYNLSPLVSSFKNSH*N 
SNI/WPGAVAHACNPSTLGGRGGRITRS 
GD 


1292 


15193 


A 


1300 


91 


219 


LSRIiSAWDY*HVPTHVFFVEMGFHRWQ 
ASLELLS'GDPPALA 


1293 


15194 


A. 


1301 


277 


441 


TVE F * G VQ SMKE E I \E LCLFEDDMS VYE 
ENLKELTKN\LLK* ISNYSKVAGYKVNI . 
*KAIAVLHTSNEQ*NFEIGNTILF/TIS 

KELNTW/RDILCS 


1294 


15195 


A 


1302 


256 


482 


YNVYFKICIGPGTVAHACNPSTliG\G*G 
GRI/TLRSGVRDQQVQHG 


1295 


15196 


A 


1303 


182 


460 


VGREFLDMTGKEFIYKFLYIRKIN*SSS 
KLKTFVLPKTLLRD* KAKLQTGRKYL* I 
TYPVKGLVS RLCKKLS KLNS KKTTQLKK 
WAKDMNRHF 


1296 


15197 


A 


1304 


1 


468 


FKQFSSLSLLSSWDYRHPTGNFCEIAPY 
KQNAPS CSNVFTDNVPVIST * QGLQALS 



609 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: of 

n iir IprtttH ? 

sequence 


SEQID 
NO: of 

sequence 


M 
eth 

UQ 


SEQID 
NO: in 

09/515,1 
26 
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beginning 

IIUVICU LIUC 

location 
correspond 
ing to first 

ontinA opifi 

dllllMV (1L.IU 

residue of 

peptide 

sequence 


Predict- 
ed end 

nuplf*— 
11 u u™ 

otide 

location 

correspon 

Hi no tn 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteinc, D=Aspartic Acid, 

1 1 i+<j mif Arirl I7^=Phpr»vl«i lonma 
i^^KjmialliiL ntlUy * a llvUjridutnillCj 

(^Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Meth ionine, 
N=Asparagine, P=Proline,' 

0=f? In famine* R=zAraininp C — C«fi»»A 

1 U liJIll IIIC, 1\ YA 1 gllllUC) 0~*iJVriUPt 

T^Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














pnvneqspwretkktavdlcgviwkpcq 
vflrkkpktktgq/heqqifgf/ cffse 
teshs vtqagvq * hhlgsl * plpppfkr 
nsttrtsdydgp/hvpp/hpa 


1297 


15198 

» 


A 

• 


1305 


880 


1026 

• 


EEHGAGLSGSQDAAGGVPAG * GGWAQLW 
VTRRASLFLDKTHWP VDEQNLGSL YTI E 
ATAYGIiMQKLELGKYNErIHA.IAKW 

QELGGGFRSTQ/SDGDPRETTWALEAL . 
TRFREAVPFKGIQDLHVQIRAPKTALNV 
NWYIDHSNAYQQRSAKFLAQDDLEIKAS 
GNGRGT I S I LTMYHKS PESREDNCNL YH 
LNATLHSALEENKKGGETFRLRMETRFQ 
NN* EATMT I ME VSliLTGFY PNQDDLKQL 
TSDVERYAFQYKTKTSTSDSTVVLYLEK 
LSHEKNTELGFRVHRMLQAEFLQAALVT 
I YD Y YE PSRRCS TFYNIjPTEQS SIi 


1298 


15199 


A 


1306 


3 


726 


RTDHYQFQSLKHCLTGGEALNPDVREKW 
KRQTGVEIiYEG YGQS ETVVT YKWELSYE 
DANTYIVK* KTL* TQKEEG \ 1 1 CANPKG 
MKI KS VSMVKESLPYYVHrvDDEGNvLP 
PVEERNVTVRIKPTQLLCILNCYLDKCE 
KTAVS * QGDFYITGDRARMDKDGYFWFM 
GRNDD VINS S S YR I G P VEEE SALVEHTA 
VLEST WSSP YP IMGEVGKAYI VLTRAY 
S SHDT * ALTRVLQEHVI K 


1299 


15200 


A 


1307 


230 


486 


NAI CPKGS KeSLGLGS CS VTQTG VRW* D 
HSSLQPRPPGLK\YPPTSAS *VA\GPQA 
GVQWHGLDS LQT P P PG / FKRFS CI*S LPS 
SWDY 


1300 


15201 


A 


1308 


463 


440 


SAI PVHT * RRRKQTS LNADME KV* WW I 
SUQ 1 & Hjn 1 F L»N Ko li 1 QfcxKAb aJj rrfGMKA 
KRSEEAGEGKFEASRGQVMKFKERSHLY 
NVKVQGEAAGADGEAAASYPEDLAQSTD 
EGGYTKQQNFS IDV/TTFYWKKMSSRIF 
IAEFHHTD*GDNGSVP 


1301 


15/02 


A 


1 ^ Aft 

1309 


A/LI 

463 


59 


RTF IHCWWG YKLVQPLWKR V\ YTFP YNL 
AIPIiIGIYP*/NMKIYSHKRTCTKMF/l 
NSLFIISKNWKHLRYTFTREWIK/M*YS 


1302 


15203 


A 


1310 


438 


29 


PGGKGGGGPTPAPFPFPPPPPKVGVFW* 
GFPRPFLFKPPPPNKAPFPGAPHFF/SL 
PPFPRVFKKTFPTKKKFFPPGGGGKKKP 
PPPLQGGKKMGSALKFPSGGKKPWGISP 

fiT 


1303 


15204 


A 


1311 


2 


310 


HNQKNEFGP YVTS YVKVKEAK* IAKTRK 
LLEENIGVNIHDIELGSGFLAMMP\EA* 
AI RLK I E KLDF I KI KNS CASKDT INKVK 
KTGRPGMVAHAYNPSTLGG 


1304 


15205 


A 


1312 


19 


348 


RGQQDVLPG * RGQQDVLPGRGTYSE YGL 
IFSARNPSMEV*SVWHRPFHHHGVP/HQ 
NCSDSGSYFISKECGN/WVSACGIH*CC 
LVPCLPKAADLGE * WNGDFSLSGKEKKK 
EFYLIMTFSQ 


1305 


15206 


A 


1313 


2 


424 


ENKLTNHGKTGNGGAQSQ/ PPECEPRTH 
LQRGLEGRGGGEPWGQGQPD\ PPSNSSL 
KNPQAGVPPFSSLKGKVKRDRSVSVDSG 
EQREAGTQ * TFVNGKKKKKKKKKKAKKA 
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SEQID 
NO: of 
nucleotide 
sequence 

"i 


SEQID 
NO: of 
peptide 
sequence 


M , 
eth 
od 

• 


SEQID 
NO: In 

26 


Predicted 
beginning 

ftjm. am a|a #v^2 #S n. 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 
oitoe 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A =A la nine 
OCysteine, D=Aspartic Acid, 
£/— iviutamic Acid, r— Phenylalanine, 
o— Glycine, xi— Misname, i—iso leucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=*Asparagine, P=Proline, 
Q=G1utamine, R=Afginine, S=Serine, 
T=Threonine, V=Valine, 
w— tryptopnan, i — tyrosine. 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














AYSPDSKA/P$SKSSPKSS 


1306 


15207 


A 


1314 


375 


3 


KKRTLL I PLMNMDAI /MNKI LAN* I QQH 
IKRI IHNDQVQFVPGMQGWFNIQKNINV 
IPyY*LFY*CYINRIKDQTLIIISIDTE 
KNDKIQNS FI VKTLRK\ I KENFLTL I KR 
IY*KSSNNKKNID 


1307 


1 1520$ 


A 


1315 


403 


3 


LEGVSFFFPRVASQGSILGSCNPPLPRF 
HHFS / CPHLLS KWG YRFPPS PPA\FF FF 
FLKTGFFFFCKNFALCSFP*KQASPPPP 
LTVFFFS/HHPFFFFFFPLFFFLRWSFA 
LVAQAGVQWHDLGSLQPPSLGFKRF 


1308 


15209 

i 


A 


1316 


38 


427 


PEXPARPTRPRPSAWQPPRLRSEEPLRP 
AAAPSEK*GASPPGSHPIWEVRSVSARQ 
PPRPGGRWWGVNP P PGQ P PRPGG EGRLC 
PAAPTGK* GAPLPGQ PPRPGGRWGGYPP 
ARSAASYGR*GAPLSSRP 


1309 


15210 


A 


1317 


43 


399 


LTFFFFLGEGARPPPPGWGPGAHQGI TA 
PLFWGGQGNPPP*PPGEGEPS/QGPPPP 
GGNVFFWKKKGSPRAPGGPWTGGPKGLP 
RPPPPKGCE*RGNPPPPTPKFLGFTTFQ 
KKGKNSGP 


1310 


15231 


A 


1318 


437 


54 


MNEQKGERGLSSLGLPAR*D*GKGNVKE 
GR/EGVTFSPREPKEESLRWSTPQKEIV 
GVINHPRGQRVDRQGAGSGCEGFALRAQ 
TGGP PA* EITRAGEQKAGGGSKGAQTIiQ 
RETRPPRGQRGGGRSASL 


1311 


15212 


A 


1319 

• 


351 


133 


GGGWAEVPI/TSQVGRSGRGTPQLPDGAA 
RQRRPPPPRRGSWAEAPTSQTGRPGRGA . 
PHLPDDGRPGRDAPHLP 


1312 


15213 


A 


1320 


474 


11 


KINSPSQKKKKKKQCSSIRKLA* /DQNR 
HFIEEETQIPNKHMKRSSIS/LAIKEMQ 
IQITKSYCIIIRLCK*LIKNSDSIKCWQ 
GCREI /GSLIHCWRKLKMNSTALQMVYK 


1313 


15214 


A 


1321 


485 


33 


C PASRVAG I TGAHHHAWL I FVFLVEMGF 
HHVGQAGIiE/LPVSWGNGGHL/RLCLLF 
GYCGQCCYKPS*RRFFCVSFYIRA*K*G 
FVCLFV* MVSCCVTRLEFWTHRCNHS K 
JjQP* IPGIjKCS \PASGF*VARTTGIjYHG 
VWHVSNSTALTSGLQ 


1314 


15215 


A 


1322 


451 


145 


THPFGRPRGGVFKVRKLNPPWLKKENPL 
FIKKKKKKPGPGGGPFIPTPSEG* /gkk 

KPPSLKKKKKKKKVYSLIF 


1315 


15216 


A 


1323 


470 


151 


GKAEERHDPGSRACSEPRSCHWTPAWAT 
\Si^PFKKKKKKLCISwPGTVESCFSVR 
T* * I * LRQYVLYLFLFQPKNVI YLFICL 
HDKANMVMFLDFKYYFCFLFLD 


1316 


15217 


A 


1324 


64 


367 


TWEGEGSWLTSQDRTTAPLHPSLDNRVK 

T .P T .Tf T? If If R VnPT P Q TT .T . ATfNfJ T K"Pf P2VP 

FTMA/S K/R I KYLG INLTKDVKDLYNEN 
Y*TLMKNLKGGAMCSFTSL 


1317 


15218 


A 


1325 


193 


381 


AQLFKTSMGNSETLSLQKKKKEKS*LTL 
PSSWDYRHPPPRPVKF\*FLGEMGFRRV 
AQAGLEL 


1318 


15219 


A 


1326 


410 


7 


GFFPYP* LPK/SLRC*GKDFYNQGSGQA 
RWLPPVIPPPLEAQAGGSP*GRSLRPTW 
PAWSNPSFLKKPTPLFLKFS *GRRIALT i 
PKAKVSVNWDSPPALQPGGPSKTFFPKK 
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location 
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. 1 q ct o m inn 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 

C=Cysteine f D=Aspartic Acid, 

Hi vjriuiHin re /\ciu, r— x nenyiaianine, 

l^s^IvcifiA Hs:Tf ictiHinA T — Tertians*! n a 
u vijiiucj ai.~ nio mui lie y j|~i!>uicucillC} 

K=Lysine, l^Leucine, M=Methionine, 
N=*Asparagine, P^ProIine, 
Q^Glutamine, R-Arginjne, S=Serine, 

W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, V= possible 
; nucieoiiue insertion 














; KKKLRIIGPGWAHACNPNTLGD 


1319 


15220 


A 


1327 


295 


3 


VCHFGIYFCVCSLYFTCLYFPFLIFL*V 
TLTFLVF YFDFTVL FI IS L * YFNDCSRD 
CNIHM*LLSLPVLIFHCCC\FVFRDRVS 


1320 


15221 


A 


1328 


2 


392 


RQASRPP * SAPPAPAGKEGSGE* PPSPK 
/PPPPPPKWPPRGFPFPPAGPPPPPPP 
KKSPPPKTNPPPPPP/IKPPPPPPPPPP 
PPFFFSPPFP, 


1321 


15222 


A 


1329 


378 


7 


TDDL I LKF I * ER Y KPNQF * KRT KLKDS L 
PDFENYYKAKWKTRW* * YKE \ KHIDQW 
NRIESR*QIFNIASII/HIEKEYLCNKW ( 
[ \ CYRTTKMEEKNMNLKPLP * TI KFNSK* I 
TDLNIKTITIWLKIi 


1322 


15223 


A 


1330 


403 


2 


VETGFRHVGQAGLELLTSGDLPTLASQC 
WDYKHEPLRLGELRSLTAAWAMQQDPVS 
PNK*IKIK*l/P*SEQFSGI*SIHNVVQ 
NRHFYPDPTHFHHS KVKPLTHYWS PHS 
FLSPTPGNHQCLLSVSMDFSIL 


1323 


15224 


A 

■ 


1331 


1 


389 


KMKSQAIDLEKI FTKHI SDKGLGYGLYK 
ELL* FN/R/RCQTTQLKMGER*TNSSTM 

GMSFEWMVSKHMKRCSCAFPSLVTREMO 
VH/TTSLFVERYHYTPTRVAVIKQSDQV 

\W*ECGIRTLIHC*WECKMIAT 


1324 


15225 


A 


1332 


392 


3 


NNFMPSSA/PPPPPPFFFGGPRVFSPPP 
FFKP P PP F FFFG PQKKKI FS P PP PLKFF 
FFLRPPPPFFFFFFFFFFFQKNLGKGFF 
F*PRPEKKKKPPPGGVFFFSLFFLRCSV 
ALSPRLECSGAISAHCNLCI 


1325 


15226 


A 


1333 


394 

* 


1 


SQWGPTAIHYCAWLVFKFLGHFSKTTI 
SAPLLK*TSTRAETS KS FS \CPRKMERA 
EGRRMFLGKSLLKQIVLLF/LISGSCVT 
AS*FL/CFFQTESHSVAQARVQ*HDLGS 
LQPLSSRLKRFSCFSLPSRTRG 


1326 

■ 


15227 


A 


1334 


416 

• 


1 

. 


LTLLPKPTPYNTRKENYRPISLINIDAK 
IL/ NILAGKIPQ YI KvHN \KMGJjTPJSMQ 
GLFNI * K/RKS VNVI CHINRKEEKKILL 
INA* IVFDKNPTMI *KNSWQ\GEYKEI Y 
SNI LLNG KMLKACHlA KTR IN* GCQLS P 
DAW . 


1327 


15228 


A 


1335 


391 


57 


WQR VCNWDKNL YQ KEKRQDNVREK/ WAT 
DLI INFTHEEMQSVNNLQKDTHPHSLVI 
REMQIKTTSY/HLLAKILNSDNSSYW*G 
CX3GMGTL*CCWWVSKEIHALWEIVWFIY 
Six 


1328 


15229 


A 


1336 


285 


2 


I FFFPWRGEI I * HLSLINKGSRQNERAH 
\ KDNGD F S Q 1 1 S KLKLMYK YKI IjNQTT I 
KYMFFPKKHRTS IKIEHILGSKGSLNNC 
OR I S VLOTGRV 


1329 


15230 


A 


1337 


34 


391 

• 


AKIASLYSSLGNRANSVSKQNKKTKNQN 
LKN/IKTAGGLGWAHSCSPSTLGGLH\ 
DPGV*GCREL*WCRCTPAWASG\ETLSQ 
K* INKGKERKKKMKNCWWMNNSLIQLTS 
FSFTKTDWSS 


1330 


15231 


A 


1338 


292 


1 


GGG I S KT PGGEGA I NPKKPPALP PGGKK 
EAFSPPQKXKKGERA*RDISSEDI*MAN 
KHMKSCPPLLMIREMQIKSTMRYRFIPI 
RME\ILKKQKNS 
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SEQID 
NO: of 
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sequence 
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SEQID 
NO: of 
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sequence 


M, 
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SEQID 
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¥ 1CCM 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue ot 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

vjriutamic Aciu, r— rnenyiaianine, 
G=G!ycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=*Leucine, M=Methionine, 
N=Asparagine, P=Pi:oIine, 
Q=GIutamine, R=*Arginine, S»Serine, 
l— I nreomne, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


1331 


15232 


A 


1339 

i 


369 


16 


DPWQNEQLSRy 1 CPVs PRRA5MFRIT*YK 

VIIAALMAYSVGPRAVSCIRALWTTYG\ 

IMDNYKRHERRFLKPL*WPGMVAHACNP 

STlA^nGRHEKVGKVT<5JP^ 

VKASI 


1332 


15233 

* 


A 


1340 


394 


1 


TQSGVQRHDAGSLQPQPPRLKQSPHPSL 
PSSQDHRRVPPCPARPCSSHDSYVLREG 
PCX3RW/HESRGRVFLVLPS***LSLMRS 
GGFKKGEFPCTSSb/ SCLiPPST*DVTCS 
SLPSAMNVGPLQPH I FMHGF 


1333 


15234 


A 


1341 


400 


161 


KI GPNLP CFSTKRP WKIDKI QFWP PGKV 
F*/SPPPPKKILFFPPPPSGPFF\PPLP 
PLPKSFFPPKNPNLPPSFGFDIPPP 


1334 


15235 


A 


1342 


377 


3 


NRHFPKAETLKASKHLKRHLPLLVIREV 
Q I KTTN I CHN IATRLAKM * KADNTKCWE 
SCEPVLTFY\C* WEYKLVQPL* KMVWQK 
P KVCVP Y YIiAI LLLQHI PEGTCAS / CY * 
DIHKRLLTAMLFAM 


1335 


15236 


A 


1343 


170 


369 


RCNENKITKEVYFGEIFVRLDNKBKNTS 
FFYFYFLFY/ CWR*GIiALLPRLVLNS*A 
QvvLLPWPPKMIj 


1336 


15237 


A 


1344 


385 


2 


SDLRRSTHLGLPKCWD YRC/ RATVPGRL : 
FIFLMVSLKHKSFKF* *RPI YLFFLSSL 
MLLVS YLRNH * L I QGHKDLL PMFS S KS F 
MILVLTSRSII^LLFCFLFFVLRQGL/N 
S VTQAG VQRHNHGS LQ PQP 


1337 


15238 


A 


1345 


1 


281 


HMAT KHMKRC * / S EMCTLKP QLNTTTPD 
HQNG*N* K* IiVNFKGWQ*FGAVWNLIHC 
W*DCKLAQPFWKIDLS IRAKI SMS YSHV 
VRKIFKLKNK 


1338 


15239 


A 


1346 


176 


1 


SVIWNIKEP* IAKI I LKKKT I VGGLT VP 
DFKT YYKAMVI / KT V*S WLKDRQRNQWY 
RED 


1339 


15240 


A 


1347 


398 


2 


PRPPGPVRRRCPS/LTATSGSSSPPSSP 
FYLG YPQGFPLPAVLNRGPG ILFWGPHK 
KNXTJjPARGR* IGSS * 1 APPF Fr / SjjF S 
FLLFS FLLFDTG S YS VPQAAVQWFNHGS 
LKPQPPGLKLSSQLLRRLGW 


1340 


15241 


A 


1348 


9 


395 


GLQNPCVGFLVSGFFFFFFFPPKKGLGV 
PP KKK/ RGPNPNGP VGE FLATGP FFWTG 
PIjKKNPAPPRALFWGPPPPPPWGG*NPF 
PFLARGSPFFKKFFGGPDPFFYRTNPRG 
PNKRGPPWPLKWGNPPPK 


1341 


15242 


A . 


1349 


119 


1 


ENVFRNM/ W PGMMAHTCNTSTLGGQGEW 
XTR£GV*DQPGQ 


1342 


15243 


A 

■ 


1350 


58 


396 


GIRVGKVCF I IFFFGFFFFFGKGVSIiPP 
GRETGGPPL\LIKPPPFGVKGIFLPPPP 
GGGG\CGPPPPPRVIFVF*GKGFPP*GP 
rjrji?* p p AP KGS TiPP P P PT .WKNWGNR I Ft5 

GR 


1343 


15244 


A 


1351 


396 


1 


GPTKGPGPFLiDGQGFPPPTLKQNFFFPA 
LFFPPLGKRLILGGF\ PKPGGLLQPPT* 
KPLRFKPGGESDC FP KFF *GRI VCFQQS 
LCFFPPPKKKKTAFGKRFKKKTLLFFFF 
SDGVSLCHPGWSTVAjQCRX. 


1344 


15245 


A 


1352 

_ 


1 


250 


RRL* SQLLGRLRRQNGVNLGSGACSERS 
\ CSERRSRHCTPAWAT KRDS VS KKTKCR 
RHQHVGS LTVRLRPGY W I RHCGCHWWP 
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SEQ ID 
NO: of 

__ _____ 1 fyii £ — B A. 

nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

- 


SEQ ID 
NO: in 

HCCXT 

09/515,1; 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing co nrsi 
amino acid 
residue of 


Predict- 
ed end 
nucie- • 
otide 
location 
correspon 
ding to 
last amino 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 

1? — Plii4iiintn A t*trt IT— DkonulnlnnifiA 

rj— iviuiaroic Acia, r— rnenyiaianine, 
G=Glycine, H=Histidine, I=Isofeucine, 
K=Lysine, L=Leucine, M=Methionine, 
fN^Asparaginej i^Jrroime, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine, 










peptide 
sequence 


acid 

residue of 

peptide 

sequence 


W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, v-possibie 
nucleotide insertion 


1345 

• 


15246 


A 


1353 


249 


3 


CGATLCPKKYMKITPSFFVEIDN/AILK 
FIWKFKGPRIAKTTLK*KQG*RTHTFHF 
HN I LQKAI I VKTMW Y* YKHRQS PGRVAH 


1346 

* 


15247 


A 


1354 


15 


416 


LLTLSSSLKITIHTNKGRT*SFIRENII 
IFIATTNLLGLLPH* FTPTTQLFINLAM 
AIPL* AGAVI IGFRSKIKNALAHSLPQG 
TPTPLIPILVIIETISLLIQPIALA\AR 
LTANI TAGHLLMPL I GSATLTI 


1347 

• 


15248 

* 


A. 

• 


1355 


2 

• 


416 

• 


IKYLNVRPETLKLL* /ENIE*NPHNIGL 
GSDFFNLTSNSQGI KEKIDESDYFKLKS 
CCTESDTTNRIRQLKNERKCLQITCDKG 
LIF*KKLK*LY/KQKTNNH*K/WSKRLK 
YFSRKDIHMAKRY/MKKCSTSIIRENKP 
K 


1348 


15249 


A 


1356 


426 


85 


HARliVL* FVF/ LFETAYLS VAQA/GGAM 
AQSAHFSLELPGSSDTATSTSQVCYHRN 
T/RLLF* IFCGYG/RLCCLGWSRTPRLK 
QSSHLSLPKCWDYRWEPLYARPYLSCFP 
ENARLG 


1349 


15250 


A 


1357 


2 


301 


GGLLEVKVQDQLGQiJNKTPSL/ IKIYIL 
P I YTQKN * KNLTRHGpMCL * SQLLARLR 
*ED*SSPGI*GCREP*WRHWTPVWTIQQ 
DSKS RGE KKQ VI FTHYT 


1350 


15251 


"a" 


1358 


2 


389 


FLHVGQSGCELPTSVDLPASASQNAGIT 
GVSHRSWLKFSLLDVPSPLTP/VSSAVP 
LISYLATGWRQAAIAASPIFLHQLARPA 
QPAREAAADS * LP ADS AFP PKQWTC* * M 
FTGALFKKKKKPCFVAKK 


1351 


15252 


A 


1359 


309 


3 


KWDH I KL KNFCTAK * TINKVQRQPMEW * 
K/ 1 FVNYPVDKGL I TRI YKELKQLYRKK 
KSNNLIFKMSKS\SSLAIREMQIKTTMR 
YHLTPVR I LVYVLP KRARS 


1352 


15253 


A 


1360 


300 


2 


KRAFH * KKMP S RT FTAREGKS MPCFK / A 
SGWATPVI PAL * EAKVGESLEPRS SRPA 
WATRRDS CLF * K* * INK*KLTLLTjGANA 
AGNLTLKLMLI CBSKS 


1353 


15254 


A 


1361 


396 


60 


HKVTRKSDGMLCHSGFCS INQLAFCCLF 
FQPGGPRGTPLP/PYKPPPQKTPKKTGP 
GGGGLYS PPFGGGGRGNPFFPGAQGFF* 
PPSPPPPPPPGGRKKIFLPKKKKKSLLF 
FF 


1354 


15255 


A 


1362 


37 


385 


ALF S FS FF FG FF WGKKI r FFGQGGRACJC5 
HSNLPEPPPPGGGAFPRPNLSGGGE*RG 
PPIS \GE I FGFLKKTGVPPGGRGWFKPP 
APKEPPPPAPPRGGISGQDPLPPPVLPG 

WGKKW 


1355 


15256 


A 


1363 


12 


421 


E P WE PU i J-ivjJ. V- 1 KLiSj JL Lir rlu X U V IiMJnMJj 
VVTVRYHLTPVKMP FIRKTFDTAGM* KK 
KKGCYQGSKKGGMLI /HPPWGAL*MSIV 
KKKTWGTFFKKIQKELPWDPAIPWLGMY 
P KEGNS VFQRGMGLPS VIGTLFP I AG 


1356 


15257 


A 

_ : . 


1364 


278 


3 


CCTGEKLETFLLR * ATRQGCPHLFDNVM 
EVLANAVRYEKE I KG I QNGKKKVKT\SL 
F PGDE I V/ YAENPHKS * PKNSL / KLKS D 
CSKVNIQKPIAF 


1357 


15258 


A 


1365 


265 


330 


W PGAXAHACNP S TLGGRGER I T 


1358 


15259 


A 


1366 


379 


1 


KRKLMYCWWECK* IQPGWKTVWHFLKKL 
KLELP YDSVILLLYTFLKES / PVYARDI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M . 
etb 
od 

■ 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D= As par tic Acid, 
EXviutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, ^Leucine, M=Methionine, 
N=Asparagine, F^rroline, 
Q=Glutamine, R=Arginine, S=Serlne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V= possible 
nucleotide insertion 


• 












CTPKFIATLFI IA\RCSLTHD * TKKMWY 
LSI ME YYSAI KNE / YLP FATTQMNLEDN 
I FS EES QA* KHGVF I FP 


1359 


15260 


A 


1367 

• 


1 


376 


HliPGAESQPPP V/ DNS WDRPAGRTQLLW 
TPA/DPHSYG*GGAGPHPCPSQPGCCAP 
VQSCS * APSEAQSLGAADS \GPAATLPA 
RQLITKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKDDSGKG 


1360 

• 


15261 

> 

t 


A 

■ 


1368 

• 


2 


409 

- 


EEAMPKAKMG*RPFAPNSQILNAKKKLL 

_ - _ _ _ _ __ _ mam » * « mm «m mm » « mm*m mm mm m± mm.m. AVAWV » * 

KE I KS VTPVNI * M IRK* NS L IADMKEV+. 
VIWIEDQTSHNI PLSQSQIQSKVRMLFS 
SMKAE RREEAS EE KLE VSR /GWKS AGFM 
RFKERSNLHNI KVQGEATSTDGEGA 


1361 


15262 


A 


1369 


220 


416 


PQPLFDWQMYTYTLHNDILVNDGRLSPG 
GQGCSELSLCHCTPI*MT\SETIiSQKKK 
KKMGQARGLS 


1362 


15263 


A 


1370 


310 


289 


AFR*AFHSKGSK**H*EKKVN/WNFTKT 
* NF CTAKD 1 1 KRM KRQPTKWEVI FANH I 
YIiTGILIS\KI YKELRTTQ* PETQSLK 


1363 


15264 

• 


A 


1371 


3 


322 


HASERTHRRGKSTETTTPAWATERGSVS 
KQKTNKKTNK/RNTGSS IHNMVSDFEKQ 
VTQT F * S SMQMSNKP LKRYLTS V I IEMN 
IKPEYHFTPIRMAI IKKTDNTKG 


1364 


15265 

• 


A 


1372 


3 


332 


GKE VS * E D I MMLHVYAPNNRAS KHMKQN 
L IEMQGE I/DE /HP VI IGDFS /TPLS 1 1 
DRSGQKI IKDIV/ELE/STVKOliDLl/D 
IYRTLYLKRVEYVF\FTSLSKTFATINC 
ILGHKV 


1365 


15266 


A 


1373 


381 


2 


CSVAQAGVKWCNHSSLQP*TPGVK*SSC 
FSLPSHSBYRHE\PALFFKFFVQMGS\T 
ML*FS*ESCIKGSRQPCSYTYLHSPPIF 
SSLFPSLAPESIKRQEPSFFFEMESCSV 
TQAGVQWRHLSSLQA 


1366 


15267 


A 


1374 


1 


385 


YKKG INAKI FNKI * * V\ESAI SHDQVGI 
VL VLQG FLS I /RNKS I S VTHHVNRUKN/ 
HMLI * IDAEKAFDKI QQSSEGKIGIKGN 
FFDLLKSIYRKPMANVILNSEKMKCLPC 
KC\KTSQGCQLSPLFFLFFF 


1367 


15268 


A 


1375 


1 


357 

• 


CSGVI S PHCGLKFLGSNDL PALASRVAG 
I IGMTP \HAQL I KNFCCCWWR * CLAPGG 
* RFKTMCRP I VK* NKS INK*NRTVCMSG 
CGDS/RQLLGRLRWSPGLRNVQGCSEP* 
WHHCTSACL 


1368 


15269 


A 


1376 


1 


375 


HRPKFKS ISLLEENMGESFHDLGL/GSD 

- — _ _ i— _ _._ ^_ . MdHk » i i ■ mm mmm W9^9* mm mmmmgrnm mm mtm 

LLDMAP I VQSIRDKTSDFLE I KKSCSSK 
GTV* I MKKQATDWE / RTFVKHTSNKGS I 
MYEEL*Nli* KL*NNP I * KWAREFSRHL/ . 
EDTQMATKHMK/KCSTAL 


1369 


15270 


A 


, 1377 


260 


a r\r\ 

400 

• . 


TCP f5T jCI VTVPPWPKTCKTOfJPPPP FPKTTSK 

GRKTLMVNPE* PKYYS VKKKKKKKGGGL 
PFPCKI FFPNLTIQTTFFWPKTOTLTLK 
KRKKPPKKTPTFWGHFFFERGTKKTQGG 
KKNPFKKGGWEKLFFLGQKKKMAPFFFF 
QKPPQR/WRKTLMVNPEP*NLLKKNM 


1370 


15271 


A 


1378 


403 


1 


TGVSQALSLFFFKYFKPRMVEFLVEPSR 
* KGPS VYGLQI FFS I PKVPFPCPFWAF / 
SLV*SSPPCLSLHIiLPAFGVLPKRSIiPR 
PMLQS FSPMFFARIG * FQILLLNFQSLL 
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SEQ0) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
DeDtide 
sequence 


M 

eth 

od 


SEQDD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=*Alanine 
OCysteine, D»Aspartic Acid, 
E=Glutamic Acid* FHPheoYJalanine. 
G=Giycine, H=Histidine, ]=Iso]eurine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Clutamine > R— Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W«=Tryptophan, Y^Tyrosine, . 
X^Un known, *=3top codon, /=posslble 
nucleotide deletion, \=possible 
nucleotide insertion 














S * PFFFFGDRVSLCRPGWSAVAP 






A 


1^7Q 

U Is 




JOI 


AERYGTS RSTADRI SCLF I * SAE I / VF P 
DSGKKTGGNNNNNNNNNNTNNKDNNNNK 
ISLSLKP 


1372 


15273 


A 


1380 


361 


39 


PPVKETAKDVNRRFPKQPMTNST*SDA* j 
VDEWCL*FOLIGRLK*SRRIA*AOEFK 
AS /HG QHSETLS LKNYLKKKRFS TLLFL 
REMQI KTTTR YHFTP IRMAKK/ SHHTKC . 
* QGLAMLPRLALNS WAQAILLLHFSLP I 
SWNYRHTTSSTQASLHVLFVIGCLGNRL 
LTSFAVSFTGG 


J J JO 


1 

f 




1351 




on 


RLVSNS * PQDPPTSASQSAGITG VSHHA 
WPAFFIMWFSSS YSR/TMH* *QHPHV*K 
IiHICPN**KS\LSCPQVPTHW 


1374 


15275 


A 


1382 


1 


431 

■ 


DNI PKKSAVYYW I TLFKKG* DNVEDEAY 

sgtlaisic/eenihlvcslsdeeqhst 

nOTT AWTT D TWfJT H VTT T.TAIf Q KT .NTTT . 
A .LaIN X 1U1 1 VouHI 1 J. -Li X iuOMjnIUj 

CT* *MPKLSYPLF* KIL* KWKTKNI V\ P 
NQLQIKAELPVEILHNCDQDPETWLGGV 
AHTYN 


1375 


15276 


A 


1383 


2 


432 


ELSADVS FFFI TTPLSGVL* QNWGIAAF 
IPIELRSPTEVTFSFDVGNGPFEISLQS 

LTPNTQPAPADGHVLLQLNSHLFVGGTA 
TTQRGFLGCI RAL/ RMNGMTLDLEERAQ 
VTPEVQ 


1376 

■ 


15277 


A 


1384 

■ 


1 


421 


NPPALASQRAGIADICHCAWPLLRLSKP 
O FSNIi * S KKF ATiGDV* SSNT L* YT Y T FF 

EMESRSVTQAGVQRHDLGSLQP\LPPKL 
SLLPPIOjSliLPPKXSASaiRLAGSWNYM 
HVPPRPANFCIFSGDGVSPCWPGRTRTP 
G 


1377 


15278 


A 


1385 


2 


416 


I FSVDETALYWKMMPSRTWYL/RKEKSM 
S GFRDSEERLTLFL \GLNAPGELELE I L 
LI*HPEI1^PLNYVKFTLSVFY*WINBA 
L ITAHMFTAWLTE CF KPSVETYCLGEEM 
PFNILLLIGNAPGYPRAPMEIQEINIIV 


1378 

■ 


15279 


A 


1386 


170 


1 


DGVFLI FGGQNEKLNKNKDG/DLTKLPR 
^♦NSWARQSSCIiAFSKCWDYQREPPCL 
A 


1379 


15280 


A 


1387 


47 


418 


FWGFFFFFIiKKKKAALGPWTPPPPGQGG 
PPP PPPRGPGKGGPNP PROTI YGFWGOR 
GPPPGGGGEPGPPPPGEP\PAGPPQTGG 
PQNWPPAPGP*KGF*GASKGAP*KGVDP 
GGERSQVPNRGA 


1380 


15281 


A 


1388 


58 

* 


503 


RPTRPGNYIMIKEFIFQKDRTI*NVYAP 
KNIALKY I KOKL INLKGKRDKLTLTVAN 
ISTCP*VTDITSR*KICKGTEEIiF*QPA 
\DLI0IYRTLYPTAA*YTFFSSAYNYSK 
IGNS IGHKTFSN/ CKRNDNIQWLFSDHN 
GIK*EINIII ( 


1381 


15282 


A 


1389 


2 


401 


LVSQWNSKNRILKKIKNSTPVHT/RQM 
I RKQNNL I ADMETVLWW I EDQTSHKI \ 
SQSMTQSKALTLFNSVKAERGEEAA\EA 
GRGWFTRFKERRQL* NLQVESEPPSXAD 
VEAVAS YPEDVAKI I DKDGYTKQR 


1382 


15283 


A 


1390 


294 


3 


KI VI FDFND VKNCS S * KI VI FDFNDVKN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


eth 
od 


SEQID 
NO: in 

09/515,1 
26 

- 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
it*— ijriuiflmic /vciu, r—jrneny {alanine, 
G=Glycine, H=Histidine, l=Isoteucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
v^ijiuiamine, K="A.rginine, a— oerine, 
•^Threonine, V=Valine, 
W=Tryptopha n, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


• 












CSS KTV I EGKVSDCKKI FSKH I S DKYL I 
S 1 1 YKBLSNLNNNQQ PN* KWAKDL /NNM 
SSKHRKICSISFGKMQIKTQIRNHYIPT 
DAW 


1383 


15284 


A 


1391 


3 


422 


PLFKKEEGTNFP FPP FWGRFFPGGKS FS 
PFFFPPPPFIKGGPKVSQKGPRFFFFFF 
F*DGVSIiCRPGWILFPPQVTEAIiLRGFL 
ALSSLCFRLDKFFFFF*R*GHLLSRWDY 
RHEPPHLRIS*RKTKE\KGLIMLTRLVL 


1384 


15285 

i 

I 


A 

■ 


1392 


3 


400 


FLYI YKDI YR* YKTFQINNKLKI IYFK*. 
EKDLSKLFTKDVQMTNMHI *MGSTSH I 
KQMQIKTTMKHHSSIjPECLHFKAANITC 
WKRLGPAGTL I LRW * ECKS VKPL/WETF 
* QCLIQLSMQENYH/DPATPLIG 


1385 


15286 


A 


1393 


409 


2 

» 


IVRHFSKEDIHSTNEHMKKGFLSLVKBV 
QIETTMGHHHITVRRVKIKTDIFASA\N 
TKC* *GYGKTRTLINS/YWECKI VQPL/ 
WKN I LAVAYKI KHTFFPPRIHLSGSQKS 
HFSIYSKEMKTLRQHKDIFMATIiSQN 


1386 


15287 


A 


1394 


409 


24 


KSP FFFFEKGF PS P P PGGAQWGGFP P PE 
PPPPWK*FSPPTPPNKWGNRPPPSW/P 
G*LFFFCSPGGFPPLPNWFLTPPLR*SY 
PPGLPKRWGFKQKPMGRG*RAFFCPPKI 
S KKL WS GKKKKKGRPLAI 


1387 


15288 


A 


1395 


83 


411 


QRDS VS KTKQKKI NNKSRTDLNRHFTKE 
DRWM/ ANKHLKRYST* LVIREVQ IKTTM 
RYYYI S I KKSKI NKPDHTN CWQGCGRPG 
TIjI YC* WE/CW>fVPSLK*TFW*FIiKRIiN 


1388 


15289 


A 


1396 


308 


14 


NFFFFYSWSNI I YCQKGTAFTPISDKID 
FN* KI ISRQK*QYMLI KGS IKN/DITI I 
NTYTSNNSSPKYI KQTLTGLKGEIAPST 

IVVOVf Nl 1 Lio J. J. & 


1389 


15290 


A 


1397 


321 


1 


KQ* CS IE YS / FKYTR* * *SKL/ WPGAVA 
HAYSPSTLGGRGGRIT*GQEFKTSLA 


1390 


15291 


A 


1398 


183 


48 


KWPGAVNHACNPSTLGGQGGRI/TLRSG 
V*DQHGQHGE.S PSJj VGKVR 


1391 


15292 


A 


1399 


18 


422 


KAGMAILIDKVDFRTKNIITE*/EKHFI 
ITKGSVREENISVLNV/WPHYRASKHMK 
Q/ILIELHVELGEHTI I VG/DFNISFS V 
LNRIDKESARRR*NI *NNTNHQLNLVDI 
x*KTKI}TTNL3Kl I r r r»AriKJuJ? 1 llXril 


1392 


15293 


A 


1400 


28 


339 


YSCDHNSVQPQTPGLKQSLCQPPE*155~ 
ITDARHHIWPIFF\LKRSFVFIAQAGTQ ; 
WRGLL * KKKKVGG FKKGS CLTLPGRGD Y 

DDTOPPVIT T T") T5 O T? /"^VTC T T 

RKrK&bKIxjXPGKGrCNEJjl | 


1393 

■ 


15294 


A 


1401 


390 


1 

■ 


FGLGKHVLHMTS KAQS IKEKTD* LDFLK 
I KNFNFS KDTIKGMKRQARNGEKI FPRH . 

np /r\T\T?17+M1V TTYT »T»'OTiI^CT , »VtrM7WTVtIT V 

y S / DDFF* WAlDb TFPiUliDKVMAN lUiXK 
RCLRLLIjL\SMRYHYH*NKKTDHTKC*R 
ERG\ELELLCCW*EYKMVQ 


1394 


15295 


A 


1402 


149 


402 


RFWIHLYRKRVFPPLLNPVITWKEKRGF 
VFVFVF * DRVLLCHPG WSAVPWS *LTAA 
STSWA\K*SSHLSLPSY*DHQHASPCLC 
N 


1395 


15296 


A 


1403 


1 


403 


KRKSVPELIC/APRC*APCSYRPNLPSS 
PGPHLDPVGPTPTPSPPRPGPARGPPAA 
RPKEPAEPGKEERRGL\PAPGG 


1396 


15297 


A 


1404 


342 


3 


KEDNL3DKGGSPKQQI FKVN* TALY/YT 
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SEQID 
NO: of 

f JUlfcUIfUC 

sequence 
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AT .Y<? VK T PPRTT. T AGKEKIiMPGFKA<?KN 
R/TLLLGANAAG\DFKLRPMLIDHSEL* 
/RPKSTLPVL*KKA*LRAHLFIALFTKH 
FKPTVE 


1397 


15298 


A 


1405 


498 


66 


KNYAKSTLPML * KWNSKALMTAHLFTAW 

I DNAHS H PRALME I YKE INI AFMS AKTA 
STLKPMDQ\KVILAYKSSYLRWI\FHKT 
TAALSSDSSDASGQSKFK*IWKGFTRTR 
GRTRGSAR 


1398 


15299 

i 


A 


1 A f\£ 

1406 

* 


•* 
3 


All 

411 


RHEPLYIASMGVLNCPISNLGQCIPP*A 
AWWSLLLGC\P I LMLNLVQTPDLPSVLQ 
PRAPGPR\HPPVSAS *VAGTLGTHHYAW 
LSMLLGSTL+DVFPFLSPSLIPSEE 


1399 


15300 


A 


1407 


361 


2. 


I RVPQ KKKI KS PGRGHKI r S F * RAGP P F 
FFSFFFLRQKESHSVTQAREQWHDHGSL | 
QSRLPGPK*FSH\PPPPHPPVTGTTPLF 
IYYYYYFLRQSL/DSVALAGVQWHDLGS 
LHPRVRPRV 


1400 


15301 


A 


1408 


393 


2 


FGFN I o VJjK.I\Ao r r fa F JbCaKKJbr rfa 
QPFFFGSPRVFPPPPFFNPPPFFFFCCP 
L*KIFFSPPPGLKFFFFKRAPPFFFFFF 
FFFPDRVSL\CCPGWSAMVQSRFSATST 
LRAQAIL 


1401 


15302 


A 


1409 


393 


60 


T \n TTvTmvTT +CJVT T*l l/TITi O WIT VUTT AOVT?DT 

S KTS KAQAKE VTLCKWDYNKL KMFCTAK 
KTL/IKVKR*STE*KKIFVKY/STTK/G 
* T YR I FWKLNNKNNTQFKNRAETSRSKM 

iUi 


1402 


15303 


A 


1410 


422 

* 


2 


VNDRKMDGWMKfGWKEGR*TQKKKGGRRK 
REKERSMFLEGRKEGREKEREBREKESQ > 
KERQKRGKARKEQRQEARKQGGR/EREK 

PP FPPPfiJP PPPfiPPGP PfCJQQ + TRCHn/W 

1 1 LLAVKMLVHLKS LHLQKVCVFS FSRS 


1403 


15304 


A 


1411 


398 


64 


PGFFFKKI FVENFFGPHFQFFFPPPPKG \ 
KNPL* PPQRGPG* \RFPGVFPPLKKGSK 
KKFPPPPRGTQPRWGKKPLLPKRPGFPF 


1404 


15305 


A 


1412 


391 


125 


EPAPFFFFPPPGKRGFFP\PP*FGYPPG 
FSPPPFLKTPPGNFFLGPKKKKFFFPPP 
GEKICFL*RAPPHFFFFFFFFFWPPRQA 
ICFIIE 




IDiUO 


A 
*\ 


1413 


i 
o 


ittn 

DOf 


FFKPFASLGSISM*YHSPVCLA5KLRIL 
FDSSFSPSATAKHPLTSAFAAAWLSPFA 
LIQPHSSESSLYSSYCPVPILLF/IYLF 


1406 


15307 


A 


1414 


409 


1 


KRSGPFFYF/SPPFKKGSFAKPFFLGSP 
VFFPPPSSPP\GFFFFSPPPKRGFFPNP 
FFWGAPGFFPPPLF*NPPPDFFFWGPKK 
KKNFPPPRGKKFFFFKGPPPLFFFFFFF 
FFFFFFFFFYKI I S I KGEIGRSSWR 


1407 


15308 


A 


1415 


3 


385 


HRPPPVMGLS PRPAAAEGVPGP PAVLAH 
QHHAQFLARP * LP PHGAGLRTCS P PCLS 
LPNPPPWAPVWPEPP* *ALPPAP/AVPG 
PIN 


1408 


15309 


A 


1416 


2 . 


238 


PGGRGCSELLPRHCTPAWVT\SKTP3QK | 
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- 






RKKRE KPGS L * EKTL FFCQTLVFGPKKK 
NGPPLLGRVWAKRKGGFHPPLKN 


1409 


15310 


A 


1417 

■ 


3 


394 


NKHLKRCSTSSVIREMQIKTTNRCHFSE 
*KHW/IINMCW*ECGQTELSALLIRC*\ 
WDSEMVQPLWERKEYLKEITAESSWDPA 
I PLLD I \ YPKRNEAS C 


1410 


15311 


A 


1418 


307 


357 


LETRKTAEKI KRRAG FEK *NLETNKNGN 
TISQNLRDVAKEVRREKFIAMNA\*SKK 
KEKP* INNLMLSLKELEKDEQTKPKVRR 
KKKI LN I RAE 1 1 RD * KNSG KD * KKS W F 


1411 


15312 

i » 


A 


1419 


341 


.3 


IFVFLVGRGFPPLTRMVLIFWPHDPPPL 
PP+VIEGLFH/LPAFRPGAFLRKMVPPY 
*KNKIPPPRKNTFFFLKPPPFFFFF*DR 
VLLCHPGWRAVAQS WLATASTSQAPAI L 
PPR 


1412 


15313 


A 


1420 


405 


1 


WGEELVPSGRVHELFSSALRSERDTDSV 
CSCGGQCHCPGAIPGAALKAACRGDPAS 
RVGVETGSTMGNNG F FS FF I FKP YFFRD 
RVLPCHA\AMAQS*LTAASTS* \VKSSF 
CLSLRS S WD YRCVS PHLANFKTF 


1413 


15314 


A 


1421 


1 


399 


QPTLLTELGLRPVPPHPATMNILY* * IN 
T ILHYS YKNY FLFLFLRQGLQ PLQAGVQ 
WQNHGSLQL*TPGVSFLSSWDHPNPANF 
FI I FFLERQGLTLL PRQVLNS * AQ/CNT 
PASAFQSAGI IGMSHCSWP YKN 


1414 


15315 


A 


1422 


168 


414 


EWREKAGESLEPG\GGGCSELRLCHCTP 
AWVTEQDSS YKKKKKTPGGGKGVF/C * K 
LGGG INF* KRVFCFG KG PTKKKTGGEA 


1415 


15316 


A 


1423 


369 


1 


EHFCKSQDRDS VPKKK YI Y I YPMS VFMP 
I NLTNW I KFLERQKLS * Q I EEAI ENLNS 
I Y* /PNESEFI I * SLPITKAPGP YGFTE 
FYYS FKGEAMQ I LHKL FQKRG * RHS FSN 
VIKTLKSKPEK 


1416 


15317 

- 


A 


1424 


403 


1 


KNTEHSKRYSIPQRYCGFSSREFFHFLV 
HMKVMFPLFKCAIALCLKQVYTVFFETB 
SCSVTR* /EVQQQDHGSQQPQPPRFQ\H 
P PTSASQ IAETTAILENFRKICI YFLRQ 
SL/HSATQA* VQWRNHGSPQAPPPG 


1417 


15318 


A 


1425 


401 


3 


FFFFFKKNFPFFFQVEGKGRKLG*QPFP 
PG\LKNFSAPPLPGGGN*R\RPPFARLF 
FFFL*KRGFSPLNQGGLGSPPFFSPP\R 
APKNVGFQGITPPPGNFFFFFFFSEAES 
RSVTQAGVQWREPGS\QPLPPGF 


1418 


15319 


A 


1426 


386 


27 


CRINGVLI LC* CECKMAQTL* I *FGNFL 
QKKLDLTCDSTI PLLGIYPRDMKTYVH\ 
KKTCT * MFTAALL I ITKSWEKLLRPSEG 
EW INSRHP YDRI VLS NYETHNVNKIQRH . 
YAEAKKTV 


inly 








IOC 


L 


ubu VriAi. 1 jvt rivr r vn ltjov^i lAy/is^Jac 
LLVSSNPPG/SSFPKCWDYRCETRRWPF 
LFKFSLPIFVF*LNHLFCLHLLPLFIIjK 
LNLPCFMYYLFIPPALYSLSLLALFRND 
CDLS P I HLEVKKYFS KKDRL 


1420 


15321 


A 


1428 


401 


70 


HHAQLRCFFNSFVEVGVSLCCPGWSQTP 
GLKRSSHLNFSKCWDYKR/AAITLPSFF 
* IALF* ENPPNCHSLPTPLS PPKLAALG 
NSCLHPWLPQPPVKGLSDSFVCMHPGS 


1421 


15322 


A 


1429 


388 


131 


RGGGCLQRH \ RG FKQENW FN PGGRGSS * 
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PRSMYCPPAWGAQGGSLSQKKKKKKKKS 
KNYQKIIjSTQKTQKSRVF IVwSKHCTRH 
CBK 


1422 


15323 


A 


1430 


184 


413 


VS FL FD F YFL FF * RW / RSHS VTQAAVQ * 
CDCGSLQPFILFYFLRQSL/NSVAQAG/ 
VGSLQPLPPGFKQFSCLSLLSIWD 


1423 


15324 


A 


1431 


76 


533 


SCKRTTGRQFPDLPTRP PTRPQGFALVA 
HAGG* WRDLRSLQPPPPGFKRFSCISSV 
LWCTKAFNFDFQEIYSFSS*I*GFDSFE 
VNFFFLEMGVSlA FARAGLKLLGSGKLP 

mm. mm mm ^m mmm 9 mkm mmm m*m mmm mmm mmtmr w^MV M * ^» 

ASASE /SGGI IGVCHWAQLL 


1424 


15325 


A 


1432 


373 

• 


29 


RQG FP PMGRWLDS LKKKKFQI PNS KI F 
PGGGAPLVIPP$REGEAGKFF* P\GGKG 
AIKQNYC/HCPPTWGKEGNFVSKKKKKK 
EKKMHQK *RCVKDQQ * T *MLLFPSLKS I 
YYFK 


1425 


15326 


A 


1433 


389 


73 


TDCPAIGRNHRDPVRPLLSS PHRKI / YA 
NLYYYIIVRILTLIQSTDLM*ISPWLV 
LTRVCVCVCVCVCVCI ILPS /CYPVCRF 
MYPTTTVKVQNI S IGQEKQKTKT 


1426 


15327 


A 


1434 


402 


39 


QAGLQLLAS S DLP ALAS QS AG I TGVSHC 
ASPSISL*APLGLDTFSDFPCFNDLDSF 
EEHWSGML/ SECP * TGI * DLS FFS * VYW 
GYGFLEDHRGKVPFFITSYQSYTPPTQL 
I IADVKFEQLV 


1427 


15328 


A 


1435 


105 


387 


EFSKVAGYKINTYKSIVILYNSNE*FEN 
\KKI IPFIIASKRIKYLKI /QFAKAVQD 
FYAENYKTW*RIIKEDLNK*KNVLC*WI 
GRL/NI/LKMVLLP 


2428 


15329 


A 


1436 


242 


2 


FGGGFFWVGTPNKMGPFPQIIKNLF/TR 
PPENF*KPP/LPFIAFFSLGFLGVFFFF 

1 mm mt m mm, mm- ttmm mmtmam. mm m tm mm mm mm mm. mm mm mm mm. mm mm mm mmm mm ^ »K mam 

*DIVSLCCPGWSA*AQSLLAAALTSPG 


1429 


15330 


A 


1437 


411 


0 


LLFVAQECLKWFSALGGPGVPCSXLPAN 
QQPS VWVP*QS \ EHVTVLQRFCLWPVLG 
PAYGQ I WEPVPS TIiK/ PPPPGFK*FS * V 
SLLSAWDCRCTP / PTDPANFCI FSRDAV 


1430 

• < 


1533) 


A 


1438 


41 


412 


FFCTD *GSLCCSS WTQTSGLKRS^HLS F 
PNCWNYGH/RASVIiGLIFFFFIiRKKT/C 
P* KKFL* NEDLLCCQIKSLEIiLAPRDS PA 
LGPQSVGTANVTPRPW/LLFL*TC2NYLN 
SSRVRFSPAPLKYKEV 


1431 


15332 


A 


1439 


I 


406 


HA YN IV\NPSPVS LTGAL S S LLMTYGLT 
MGCHFLSITLLILGLLTNTLTIYH*WRD 
VTRESTYQGHHTPPGQ*GPRYGIILFIT 
SQVIFFTRFF*AFYHSSLSPTPQLRRHW 
PPTGI TPLNPLEVPLLNTCVLLR 


1432 


15333 


A 


1440 


3 

■ 


422 


MEKVSWWIDQTSHNIPLSQSLIQNSAL 

b mm mm m ~ m mmmm, — - * «k «k « vkmb V ^ 4**\ mmm mm * a 

SLFSSMKTE* GEEAAEEKFEASRAWLMR 

PDDLAKI IDDSGYPKIiHIFNIC*KPTDT 
DrVDETALC*KKPSTYFLFFFLKQSLAL 
PP 


1433 


15334 

* 


A 


1441 


375 


2 


kvss praenvi lyncpplffflgqd/ ch 
svtqaarqwrhhsslqp*tgl\asqvag 
tagahlypsqnligrgirsla/ svaqtg 
vqwrnigswqplpvrikgfsslslkssw 
eyrcpppcpdNlfifl 


1434 


15335 


A 


1442 


397 


122 


GQAS LELPTSGDPPASASQSVRI TGMSH 
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CARPILGSCISLFAYPFCFLKCFLF/CC 
REVILL* CSGWS *TPGLKQSSGI*SLPKC 
WDYRCEPP 


1435 


15336 


A 


1443 


3 


299 


WEKMFAMYLSD*ATVSRPYR*I*LQLNKT 
KYKLKLSQS /MNRHFTRGDVAMANKHTK 

/SVKCWQECG*GGGKCG 


1436 


15337 

i 


A 


1444 


10 


410 


CTLFLS ISPDSAGGICTMLLFQNTLNFK 
ISIALGVSDKCFSNFLIMTIVRNVFYIH 
GKKCILYCSPGCMTHMHTLMHTHIHLK 
QFFLRRGLTLLPRLECSGAITAHCSLDL 
LGSDS P P * * EM YFI YMV \ EMYFI LFPRV 
CTHTYAHTDAHTHTPETI FFETGSHSVA 
QAGVQWRNYCSLQPRPPGLRFSSYLSLP 
GS*NHRHVPP 


1437 


15338 


A 


1445 


431 


9 


GPAQSLDSSVSPLGTISSTVKWRHNAYL 
AGLSGGI /PCDCFLGPRVPWGSGICQWH 
WAVSPTLS\PTVRPPLFRBTLYFT*V*V 
PPLETCPQ* HI SHCRCVINWCEDTHTPK 
KHLVCVHACVCVCVCVCVCVCLGRSRQS 
EH 


1438 


15339 


A 


1446 


85 


416 


PGCLSLPKIWD YRCEPPRPPHFFIFSS I 
QGPLTDFFLTPLEQVQISPTAJG3FLKKI 
PFFFF * * RRG VP I LARWFKS * PHDLAA 
SASQKFGITGVSPHVHPIFLY/CSSIQG 
PLTDFFLTPLEQ VQ I S PTAKGFLKKI P F 
F FFLiRQGIjT \ KQWSTL HLiGJjPKC WD* GCc 
P/PMPGLHCEFFKG 


1439 


15340 


A 


1447 


376 


3 


I KSEI IETESRKPIEKIS/RAQSCFOGM 
VYE I DKLLARLDRKKMKTQLNN I RN *RD 
EINTDFTDIKRI I RENYKQD YGTKFNNL 

IKDIEIIV*NLSTG 


1440 


15341 


A ■ 


1448 


2 


192 


SKWIKNLNKELKP*DS*KKPEGNI*HDIR 
FGNDFLDVTPKN/MATKGKIDN* TLPKF 
KMLMRIWRN 


1441 


15342 


A 


1449 


411 


3 


V FL PPLP PFCF IjNFF * GKGGFFFKI FF* 
KKKGVEG PQLG P P P P FGPFKKF F FF* I F 
LKAPLFVPPTLKPLF/ cppkkknwennp 

ODT?l?T?<^PP r l7VV T rAT7V1?K r l3'K'T? , B , T.P1 l JlJ\7T»T. 

xfrrc z r v*\ivj.yr i r rrr Jrr jjJvriKvijjj 
CHPRWSVMV*SCLAVASTSWAQAICP 


1442 


15343 


A 


1450 


413 


1 


EA*TGEWO^PGGGACHEPR*PPCPPPWA 
TERGFVSKKKRKQEK/SRIMECFQDNLP 
GFFQ I FS VMKNKQEGR* FWTK/ VKMTKY 

VTA ♦'KTXrMr'G <k VTaT TTiTmrT? r'VTlTTTTV If T T? 

NA IN Vflbb w x WiNJ\ivc»iNiLuDJiU X U V viSiSJjr 
QSILS * YVKKAR * KPGCWLMAA I PALW 


1443 


15344 


A 


1451 


4 


384 


DPAIPLLGTYPREMKTYVHIKTCTQMFA 
AVL FTVAETSKRP KCPTDE / VNKI * CT Y 
T /MTYYSAMKRDE 


1444 


15345 


A 


1452 


3 


390 


LPDHPGSSVSTPRGVIITGRGFWLLFPW 
VSFFFFFFFLKRISLLPPNWRGGGQNLIi 
In1CSPPPRGF*NFLA*PPQGGRKKGPPHP 
PGAPGDKNP / QP FFFFGEKKKI FNP PTG 
GEKKKPPS P FKWGGGGPN 


1445 


15346 


A 


1453 


37 


381 


LILYIIWCVCVCVYIYTHTQMGSHIVAE 
AGVQWCNLGMW * P * TPGLK* / FLLS SGD 
YRLTL PHLANL Y I FFFFFEQRG\ FFFLL 
KLV* NSG P KATL PRVG I TGLTHT PRP YR 
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GDLFF 


1446 


15347 


A 


1454 


3 


400 


TEMVLHTEMVIiHISVWDPLLDRPGRGVT 
WLMLTAEL FL YGGTEAFSSS FDV*6PCS 
ETVECFSD/L*A*GPP*NT*SPIYRFAS 
FL.TjA.FC I YL.LETRS P \ SVTOSGGK/WOD 

LTSLQPLPPRLK*SSCLSLPSSWE 


1447 


15348 


A 


1455 


77 


398 


RAEIVPLYSSLGNRVRPPSQKMYIYYEQ 
LHAHTFGNLAEMDKLVERYKVLSKFT*B 
EI\LNSPVSIKEREF\AVISLPQKETLG 

pn q PTnn FYT FKFFT iMPTT iOR T.FW 


1448 


15349 


A 


1456 


397 

* 


69 


CPPGLSGLPWVA*AVPP/RRPAPPPPPP 
AGIDAGLGDDPLQQTHHA/PPAAAGSAP 
AGCAAGPARGAPRGRS PPRRGSAGAAPP 


1449 


15350 


A 


1457 . 


400 


2 


ALFFWAPKKKKNFS PPPGKKFFFFKGPP 
PLFFFFFFFFFFFFFPFFFPFFFFFFFF 
FF*PJRPIFILFYFLNQKNIPRDFFFFFF ' 

GLKQSSHLS LPS S *DYRWVPPC 


1450 


15351 


A 


1 A f O 

1458 


"i At 

343 


11 


TNRHFTKDNIQMENKNMKDVQARCSGS * 
F*KN*NFKSYAL+PKCSEI/KKSVTGTN 
WRNSQIYDKLSTLQNN*WIFSRSIiYNFE 


1451 


15352 


A 


1459 


109 


401 


QNYRQLGQWDRIBSTVIDPCKYGKIilFD 
n Q ZV Tf2\TP wp frno * TJUflTQ /WWTTF <5 FTP V 

U^raXvrvl CinlvIUJC lillylO/ HnALOc lf\ V 

KALEE ITKRKVNSLAS ISVKTD VHVTKV 
i * LKKKKKKRGGRFKE 


1452 


15353 


A 


1460 


42 


424 


CP7\N*NSFSRDR/SLPMIiPRLILNSWPQ 

\ ATT .T . / DTjJT? P K"\7Mf*ZT f\ 


1453 


15354 


A 


1461 


419 


69 


IFPLKKKKRGGGVGPPLYPPPLGAKRGG 
SP * KQNLKPPRPHKENSPFFYKKKKKGG 
RGGAPFFPPPLGGKTKKFFLPPK/IKVS 
FNPNLFPPPPPPGGKKKFFFPKKKKKKK 

^PWTVWF 
o ru.i i vnr 


1454 


15355 


A 


1462 


1 


277 


CHTNFLCMSFPPAISHHPLMPPHPVSRS 
VAQAGV* WCDLGRG /CS ELRS CHCTPAW 
MTERDSISQLIRKKKKLIKIKKKKNKNK 
SVIFLGGGS 


1455 

< 


15356 


A 


1463 

• 


382 


3 


KAGGS INQNSPPPPPPGEKKKKPPPKKK 

V Tf KKTCP F W(2fSPP T T .PTTMPPPFFP * P * ff T 

FLGPKFLTPIjGPKI KPLFFFFLKKKKKK 
RMTY/HSVAQGPLLNKDTLQAG\LSKA* 
RS PPKS KS MG OS FHRN 


1456 


15357 


A 


1464 


561 


86 


ND P I LSLKAE KTFGK I QDS FLI VS S 3 L\ 
NKPGIEGNLLS * * KASTKQNP IMNLRV\ 
IjNVFPLRS * TRQVCLLS PLIjFNI VLB I L 
AHI ISQEKEIKKIQI* YKEEKLPLFTSS 
STJFT+VKNTJMEFAICKI.LELINEYNKVER i 
YKINIKNILLAKNTWTLKF 


1457 


15358 


A 


1465 


3 


221 


RFHRVSQDGLDLLTS * STRLGLPKC / WD 
YRX J EPPCPAkTCLILNGSC*VFLCYLSC 
FKAQERP S QNS WGAFVTL 


1458 


15359 


A 


1466 


2 


396 


WW P AWTHT PG LKQTSRFS LPECW D YRRE 
PP\ PGLVKFLL/ IQYRWTQHVGLGDWV 
QDHHPHI E ICLHKSVI *NGI VFAYNLRT 

SSLTLFYLRRGLF/ sapqagvqrhklsl 

LEPLPPSFKRSS\CSSLLSSWDYR 


1459 


15360 


A 


1467 


349 


413 


RLGL* PRKDITD/HVSLRKDTG/ WPGAV 
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Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
iv- vriutamic acio, r— r nenyiaianine, 
vj— "vriycine, ri— nisuuine, .f^iso leucine, 
K=Lysine, l>=Leu cin e, M=JVletb ion id e, 
N=Asparagine, P=Proline, 
Q=Gluiamine, R=Arginine, S=Serine, 
T=Threonine, V=»Valine, 
vv— i rypiopuan, t— tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AHACNHS TMAGHRRQ IPRS GV* GQPGQ 


1460 


15361 


A 


1468 


403 


2 


LPVEWKGGIFGSFHPPPLRLKKFFSPTP 
PKS * D/ YRGGPPPFG YF F FF FLKRGF 5 P 
FGRVIFKFPPPGVPPPPPPQNFGFKGRS 
/HPSRALFFFFKRVFF* IYPFFFFFEME 
PRSVPQAGVQWHDLGSLQALPPRFM 


1461 

■ 


. 15362 


A 


1469 

• 


423 

• 


3 

■ 


IEFSFFAPSKDKGAILGPSTLPLPRFNN 
F FC P P FP INKD I RRGPPPRPNF FFFRKR 
GGF PHWAGMFLS FNLQEGP PPS P PQ / S V 
GF*GRN/HPPRAQFFFFKKFFF*CYPFF 
FFFEMEPRSVPQAGVQWHDLGSLQALPP 
RFM 


1462 


15363 


A 


1470 


13 


427 


RTRGLVFDKTEFKPPKIKKKKKA\GPFL 
RGNRS I LKKKATFPKNLFAPNPGPPKFI 
KKVLSDLQKNLNPPPRMGGNFTPSLLK* i 
NKSSKQQITRDFRDLTSPLDQGDL*KIY 
KTFYPKTTEYTFS SAPHGFY/ S * FDHKI 


1463 


<A V A 

15364 

* 


A 


1471 


378 


1 


FVRPPFFFFSSSRPFFKWGGPLPPAPQ 
FFFKTPRGNPLLRG* KPPPQTPVGGAKG 
S PPW/ VPPGFPKKGGGVFFFQQGIi* KI S 
PPGPAPP*IiVWGGESPPFKKKKSRPGW 
AHACNLALWEAKAG 


1464 


15365 


A 


1472 


412 


53 


SRLSFFIiSSEPHGPPSPLGPFATNKVAF 
C*/PPPPF*PSPPPKFFFF*GPKSVIYF 
YQR*PPTSRIFPPKGGAGPP/PPFWGFV 
NP KKK\ LKPFS KP S P PAAKFKK P EGTKL 
GFFKKKKKKL 


1465 


15366 


A 


1473 


116 


34 


DWNIKPVLISNVMTGLMSMIITIiLLQLF 
LL*LFFFK/WELTFFGGKFHQRVDPSFQ 
GS IX3RGRQFFLLTKTE YHI FFL I KVFNS 
KTEGAGLAT* I K 


1466 


15367 


A 


1474 


42 


428 


E I IMES I NRFDVI T F * NFCL * CYLMSKI 
NRQAS SW/ ENV*NQYVGEKILI FLI YKE 
C I Q IKKK\ KLRP KI DTQVKDKQ I SEEGM 
QMANQHWI QC \Q PS li VXiNKKQ X E XAcGK 
HLPYQINKDSKMMVEERRQ 


1467 


15368 


TV 

A 


1475 


87 


433 


KIRKIHKS PGNNBYFGGFFLFFS FLFFF 
Hi Av?ri\ r v r yAvj v yuyuLw " Jjy r Lr i-uf is. 
GFS/CPQPPRTFFFFFFLRWGGFIQKK 
KFLFV 


1468 


15369 


A 


1476 


212 


430 


SLIIiMTSNGIHFIYFLFLLLLFFFF*DR 
VSACHPGWRAMS* S YFT* ALTSQVK\QS 

VT . C T .C C C *"MVDTI r P7A T MD 
IlidJuoOO MIKnliiil'Ur 


1469 


15370 


A 


1477 


493 


2 


PGAAAEACNPSTLGSQGGWITRSKIPEH 
PG\TLWNPRS Y* KTKMCGIiEAP 


1470 


15371 


A 


1478 


454 


492 


HRVGEERF/CLFETESHSLTQDRVQWHD 
LGSLQPPP PRFKQFS WYHI * PHAW 


14/1 


J J J /a£ 






i. 




/ RNR YRERERLRG \ RRERRKRRDRMT* M 
P RE REREN LNSL YRET YRDFETEWVMDR 
ERQRLKRRLL*AVIVPSHSSLGSRRKTP 
FQKKR*REGERKSEFLIQRDI 


1472 


15373 


A 


1480 


77 


453 


S FGDSLTLSPRLAVQW VYLGS L * PPPPE 
LK\YSPTSASQVIHYLLFFFFFFGKKVS 
FCPQGGGEGP PFGLLE I FAPGLMP FFCL 
NPPKGWVWR/RPPTMPKLFFVFFIKRGF 
SPGEPRGVSFPBPGT 
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NO: of 
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SEQID 
NO: of 
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sequence 
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M 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location . 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-A)anine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G^GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Metbionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R^Argmine, S=Senne, 
T=Threonine, V=Vallne, 
W=Tryptophan, y=Tyrosine, 
X-Unknown, *=Stop codon,/=possibie 
nucleotide deletions \-possible 
nucleotide insertion 


1473 


15374 


A 


1481 


398 


96 


KRP WCFFSPPPKKGFFPPLI FVGPRFF 
SPPPVFK/SPPPFIiFFFPP*KKKLFSPP 
PRKI I FF*KPPPPFFFFFFFFFFFFFFF 
FFFFFFFFFFFFVFILLLS 


1474 

■ 


15375 

- 


A 


1482 

■ 


484 


64 


QKRQMANKYMKI CS PLF VI REMQTNTT I 
RYDD I PTRVAKNKTNK* KSNSTKYCQEC 
ESARP* IYCWWKYKMVQPLWK/S/VWQY 
LLKLNI * LP YDPVI PLLSMYPSQIKTCI 
PTKTCAQMYLVTLFI ITKNQKLAHSARV 
G 


1475 


15376 


A 


1483 


2 


400 


YNKKGQGVAPRYD VDTAANF P KELAK 1 1 
DEGGYTKQQS FN IDYTALYWKKTPCKS Y 
SWR* IH/SLASKDRLT\LLVGNWGDFK 
LKPVLIHCSKNPRVFENYTKSTLFVLCK 
WNS KAWI TAHLFTAWVAE YFEPT 


1476 


15377 


A 


1484 


459 


65 


GGPPPPHKTIDFFFFFFFRGESPPPPKK 
KKKKKKG * DNAEDEAQS * RPP P S I * EKI 
MHLICALTEEY* *LTAEIIASNIHISTG 
SAYI I LTEMLEL S KLST* WVPKLFYPNQ 
LRT / RAEL*MVI LNK* DQQF 


1477 


15378 


A 


1485 


2 


518 


PPPPPQRFGLRGGAP,PPPKKKG*DKAED 
EAQS * RPHPS I * EKIMHLI CALTEEY* * 
LTAE I IANNIHI STGSAYI I LTEMLELS 
KLST*WVPKLFYPNQLRT/RAELSMVIL 
NK*TSDP 


1478 

i 


15379 


A 


1486 


122 


501 


PRDPPALALQNAGITGLKQSYFLSIiLSS 
WDYKHKPPCAPG YLK\VFFLEN * YLAL Y 
VF* * S YQGTSVLTQPPSLIPRHSSPGVS 
VQLFQKK/DLQHGCLLTPS I SGYSVTWD 
GVQWRDHGSL* P* PPRF 


1479 


15380 


A 


1487 


491 


425 


I PKNDI * AEP * RINKR/ CAKRKMVRKGF 
EABSTICKALRYEEIVCSRNTEFVHGWL 
*ENRGQIMKGFGNS VEKLEHN* KFYEG * 
LLYRFE *LAAGF I VFQQ* VS KE I VKAWP 
GTVAHACNP S TLGGRGGW ITRSGDRDHP 
G*HA 


1480 


15381 


A 


1488 


441 

■ 


17 

■ 


KKTNIYDQLIFNKGAKSTKLRKNSLFNK 
WCQDKIS/IIQKMKVDPYLISNI\NLKW 
LKDLNVTAKT I KLLKENTGAI LHDLGFC 
NAFLDGKPKAQTTKKKQVK* TSS KFKSR 
CQWLIPIISTLWETEAGGSPELRSSKPA 
WPT 


1481 


15382 


A 


1489 


413 


1 


LF WKEMVFLFWPGRVGNPGPK \ GWLRLR 
PPPF * FSFQPFWGPPVGLLFPKNFYY* P 
PGGAHGPNPPPPGGKG+GPPRVGGGKFN 
* PKFGPCPPG WATKQKPVFQKKKKPNKI 
KNRTVKVPKVTKFI Y YVP YLNGVSN 


1482 


15383 


A 


1490 


359 


407 


RHRMITFFT* IVTQNFPYL* KKMDIQIQ 
ED*RTPN/ RDQHKHTPRHI 1 1 KLSNSQX 
NSKRILKTRKK*LWY*GTHIRLSTYFW 
PKS 


1483 


15384 


A 


1491 

• 


397 


39 


RKFRGRS FAKTLFSGPGQG/ PKGNPGLI 
PEGPPPFFFWPAEDGGFFFPPGQLPPQG 
PIRGPGPFLGFPVPPPEGEGPPPTGP* K 
PGQKRPKKRGWSPQKKGPKKKTLFFFFF 
CQMESRSVT 


1484 


15385 


A 


1492 


378 


1 


FGVFWFLAPRRKGGFFPSHLIWVPPGFS 
PPPGV/SNPGPGIKFGGPIKKIFPCPPR 
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peptide 
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Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H~Histidine, I=IsoleucJne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Gfutarotne, R^Arginine, S 5 ^™^ 
^Threonine, V=Valtne, 
W=Tryptop ha n, Y=TTyrosine, 
X=Unknown, *=Stop cod on, /^possible 
nucleotide deletion, \=possibb 
nucleotide insertion 














GGKFGPFKGAPPFFFFFFPQDGVCSVAQ 
AGVLQ*AEIVP\FPPT*ATEQDPVSKKK 
PTRPPTRPPTRPPTRP 


1485 


15386 


A 


1493 


3 


428 


IKSP* PDRFIAEFYQTFKBLVPILLKRF 
HKI /EEGTLPS * FYE/ACVTLI PKPGKD 
TTK 




' \JDQI 


A 




j 


417 


ILNNARLKPFLLR*TTRQGIIiLLSLLFN 

ki*vleflagaisqekeikviqvrneem 
sktlfsqtt*s*grnpfikiplpkmsw 
n/nfikvteykvsih 


1487 


15388 

i 


A 


1495 


429 


4 


pfclgskrfppffnpprggkf*kkkkif 
gpprggpplfpplwapkgggppraggsg 

P PP PKGGNPLF I NPKKN/ PPPPGGAI FQ 
SRFLGGVNPKI FLFPGGKFLTDPGFPPS 
LPPGGKKKKPRFQKKKKRKKRKVKEKKY 
FL 


1488 


1 <380 

I j Joy 


A 


140/% 






FSKEDTPM VNKHMKD / CCTSL VTREVQ I 
KTTTRYHLMPTRMAMCIYIFNYILFLK/ 
SKNNKC* *RYRE IGTLIH/AQWKYKMIiQ 
LLW / KT I WQLLRMLNTKLS KDQE \ I PLL 
GI *KKKKKKKKKRGGRFKGINFTDPGVE 
RINFYNSAPK 


1489 


15390 


A 


1497 


3 


326 


WPACL * AVAAVALLVPEATRLTMGNNLT 
VCTPHS IAELLS S KG /DLWLTDNR/LLK 
YQALLIiE \DLQLRT FTCLNPATF / VPEE 
TGEPEHDCEWVWQTGKRNNKDHCLYSIj 


1490 


15391 


A 


1498 


345 


22 


KS KWL I DIjHGKCKTIjKR * YRKPTK* \LG 
HGND FLDTTSNAW F I /RKIDKLDFI INK 
NVCSGKVTVKRMKR*TTDRDKISSKGIS 
DKRLLCKI YIKI LTTQQ 


1491 


15392 


A 


1499 


194 


432 


PWCVCVCVYVCMCVFETQS/HVARAGM 
Q*HNHSSLQL*TPGLKQVSCLNLSSSWD 
YRHTPN* FFFF/CNFYLERGGVS 




j jjyj 


a 
*• 






417 


RD*FMRFK\EKSYSCNIKIQREAASANV 
ETMAS Y * EDLAK I INEGS YTK\ SQI FNV 
DEVAL\KKMPTRTFIVRERKSIPGFKAS 
KDRL ILFLG/ANAPGNIKWKPVL I YHS T 
NPRAFKNDAKS I LLVL YKLNS RAWRXAH 
LF 


1493 


15394 


A 


1501 


414 


0 


SSSSSSSSSSSPKWTTGALQP/LLSRTP 
LKDSSEEESSQ*AEL*AVHLWHFAWKE 
KWPDM * LYTDSLAVASGLAGWSGT\ WKK 
HDWKICDK/DWGRGMWMNLS 


1494 

• 


15395 


A . 


1502 


3 


125 


RIiGLPKC*DYR/RAATTPGLH*F*SWKB 
QRPQ*LALGRKPVNRNKNTQKYIHIYTY 
TH/P I PTNVYICI YTHIHTPV* YTHIIY 
1 1 YLFKVLVFGFLRRSLT/SVAQAGVPW 
RDHSSMQPRPPYKQLTCH* *LASQSARI 
TGGSHHTWPAL I LKLERTEAPVI S FRTE 
TSK 


1495 


15396 


A 


1503 


406 


172 


DIILDR* RQKQRLRLRQKQRETETE/RR 
DRGRERQRYRQRQVQRQGRRQRQRLRHR 
Q * QRQ IDRLRQRQ / RAE RDRGRGRDRDR 
GRDRGRDRERQRIiRQRQR*RERQRHRQR 
/RERDRQRQRQRDRGRRRDRETDRGLDR 
G* DRG I DRGRCSDRGGDRDRGLD IGSDR 
DR 


1496 


15397 


A 


1504 


3 


420 


ITGVSHRAWLPS* FLKFFWVEMESHSV 
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OCystelne, D=Aspartic Acid, 

£r— wiuiumic >\liu, r — r nenyiaianine, 
G=Glycine, H=Histidine, I=Iso!eucine, 

TV— MajsinVy Xj JUCUUIIV, 1V1— lYlCiniUlllDC, 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y'^Tyrosine, 
A-unKiiowD, — oiop cooon, /— pOSSlQlB 
nucleotide deletion, ^possible 
nucleotide insertion 














P\PHLAFFQFF 


1497 


15398 


A 


1505 


407 


1 


PISIYHSKDPGSLKSYTKSTLPVP+KCN 
NKTWLTAYLFTAWFPKFFKPTVETYC/S 
RKKSPFKMLLLIANAPSHPRALMEMHKE 

LRNTFCKAIAAI DNDS SGGSQQ IQ 


1498 


15399 


A 


1506 


I 


408 


PPGFKPFCLSLPSSWNYRHPPP/RPG*F 
FVFLGGTRFHHVGYTASQKOTSRETHAY 
ALU 


1499 


15400 


A 


1507 

i 


2 


416 


EPRSHHCTPAWQLSKTLSQKKKKKKRKR 
MVFTGRKKGLFFGN/LKNLGTLSREYPL 
GPGIKNRLAQKRKPLFYKKRF*NINPGG 
GAH PGG P KS WERGGGRKI * TPVGKYASN 
P E YHT C I P PGKRNQNP FLQKKKKRRAD 


1500 


15401 


A 


1508 


276 


H 


SPPPYFLLIR*GRKGRGQF**LRLFFII | 
NLRQCL/DSVSQAGFQWHNHSSLQPRTP 
GPK*PSFLSIiPSSWDYSHAPQLIiAFYGA 
NCFN 


1501 


15402 

■ 


A 


1509 
♦ 


1 

■ 


391 


NIFKEIMSENFPSLMSSNFPSliGKEIES 
Q IQEAQRT PNKMNF K£S TPKQVI I KF* V 
RE/MLKTAREK* I VI CKGTTLRPAVDFT 
AETLQIRRE*DDTFKTLKGKKKKKKTLP 
AKLTFP S I LVFKKTRGGSR 


1502 


15403 


A 


1510 


2 


419 


PR VRS RATNVIS KY KQHKKKNTS K* I KD 
LNVKPEP I KLL\ EKTTGEKLLDI ELGND 
FLDS /TPKTRAPKANLTP*NYFKLKSFL 
TAKETFN\KTPTKRGANHISDRGLISKI 
!AEIjTIQ*QKNNIjlSK*AIUjljltKKb bK-S 
DV 


1503 


15404 


A 


151 1 


392 


122 


ENPLTSAGGGCSEMRSHPCTPAWVTQQD 
SVSKK*INK*IK*/Q*LRQNVKIKPSPI 
FCEQGKRRK 


1504 


15405 


A 


1512 


2 


281 


GGCSELRSCHCTPAWTT\SETLSQKKKK 

VSCTCL * WS LGRS LKRS LERHQKNLNDP 
YPIFQNETNIH 


1505 

i 


15406 


A 


1513 


242 


382 


QGPNLG * LHPPPGGLKGFSPLTIiLRS WK 

KTP T .DDflWD TT VTTP WPP TrMTTV 
IVKlJr'i'yxir'JP I r r KAWIW 


1506 


15407 


A 


1514 


1 


388 


RTRGTERDT/ RFRERRRERDRGREG YRW 
RYRERERHRERES * V * RDRVRDGDRVXD 
RERERQSERERLR/RERDWERDTYIVPJB 
TETERETEGERDIERERDIVRDRYM/RD 


1507 


15408 


A 


1515 


4 


285 


TRXQICNGDKTALY * KEMPSRMFPARVE 
LM PGFKAS KDTLTLLLRPNAVGDVNLKP 
MMIYYSENPRALKNYDKTQLCLYSTNGN 
KAWMTDYRFT ! 


1508 


15409 


A 


1516 


415 


2 


MGDRES L P P S RVFLF FLAPCP KG I LFHP 
V* FGS VRS FS IERCYR* GQRNEFWGPVT 
RVKAS S SRAG * WFS S TAP PLFF / CFFE 
TESCSVAQAGGQ*CDFSSLRPLPPKFKG 
FLCPSLPDAW 


1509 


15410 


A 


1517 


1 


324 


P TRPEMG FRHAAPAGLELLS S SDLPTS A 
SQSGGITGVSHRTWPIIiASNNY/SMDKL 
CAI/CERFILFFLKNIIiLWILILFIFIY 
FCIiIK*CITINFSQNSIjLPHSIHLYI 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
Hr-vjiuiamic Acta, r— Jrnenyiaianine, 
G=Glycine, H=Histidine, l=lso!eucine, 
iv— juysine, ju— j^eucine, ivi— ivietnionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

V— llnbnAiun C^ nn _%_-__•]->_•• / .i_j_ju.ili.lj_ 

A— un Known, w — atop coaon, /—possible 
nucleotide deletion, V=possible 
nucleotide insertion 


1510 


15411 


A 


1518 


2 


163 


ACRYPWLNFVLLVEMGL \TMLASDLERS 
G*PQ/ CDLPASASQSAGITAESHHGWP 


1511 


15412 


A 


1519 

» 


A A F 

425 


52 


VLRGAQFLFQEARLKWDFGPGGI^PPC_^ 
PPPLGV* KGPLPKG\GGPNPPGPPRGNP 
VFS KKPKI TPPGGGAPE VPPPWGGGAEK 
FFYPGGPRVQGTE /RKTPPPPPGRQKGT 
PLPQKQKKKKPWSL 


1512 


15413 


A 


1520 


3 


404, 


THASGKSNIRGLTLPVFKTYYKAAVIR/ 
TVWP/WLRANTNRQNRL/DGPEVDPYKG 
SQMI FDKGAKAIQ* RTDIWLIiNKW* / ES 
* TSTCQKKKKKNQFR/DPAPLIYTKFNS 
NGRKNLRV I GKT * KVLKKNI KQNWGDLG 


1513 


15414 


A 

■ 


1521 


332 


39 


TSRETGNFQLDLTAFLNEVGVLFLWRKG 
K3KIDIGIGTKEMP* FPI*IiIjMFSYIVI 
ERERKPE / LS L S P S LECSGM I LAHKKNS 
LPGSSDSHDPASRVAG 


1514 


15415 


A 


1522 


482 


0 


rartsgvllcspgws*tpdlr*slclsf 
pkrwdyrt*atvpgllyslysrfsp/de 
lkgceksrtspa 


1515 

* 


15416 


A 


1523 


1 


417 


NKC* *RCEEKGMPVYYWYECK*GQPFWK 
TKERFFKKNL/NIELPYNTAIALLNMRP 
q*I* S q*RKVCSCMLCAT\PTPNK*IKT 
MW* VCLVEYSS PLKKN\LLFSTTWINLE 
DISSNK/LRHRKSS ISRFHLLL 


1516 


15417 


A 


1524 


I 


397 


RDST YQGHHTPP VQ * GLRYGVI LFITS E 
VTFFAGFL/WSAFTRSSLAPTPQLGGHW 
PRTG I TRLNRLE VPLLNTFVLLASGVS I 
T*AHHSLIESNRNRIIQftLLITILLGLY 
FTLLQASEYFESPCTISDGIY 


1517 . 


15418 


A 


1525 


2 


376 


L1<AKTGQKLGLLHQTVSKFVWAKEKFWK 
ELL\KSATPVDI+MIRKRTSLITDMETV 
*W*KEDQTSHPIPIjS*SLTQSKAIiNIjF 
KAMKTDRGKGAVE * KS EANRGWFMQFKE 
RSC/ RFCN IKVQG 


1518 


15419 


A 


1526 


3 


386 


ESMLKAKTGQKLGLLHQPVSKFVNAKBK 
FWKELL\KSATPVDI *MIRKRTSLITDM 
E TV* W* KEDQTSHHI PLS *"SLTQSKAL 
NLFKAMKTDRGKGAVE *KS EANRGWFMQ 
r J^KbC/KFCNlKVyQ 


1519 


15420 


A 


1527 


127 


388 


KRKSQINNLLLQFKELEK/QEETKPKAS 
RRNKKKKIRVDLLKI KKGKPLEGVKKKG 
GFF*RTNKRDKPLLKLPKKKGGGRIKTF 
HKTS 


1520 


15421 


TA 

A . 


■m mm #_k 

1528 


3 


402 


HENHMKI CSTSYVIRELQIKTTMK*YYT 
PVRTTAIQNTDHTKCWQG*REQ/GSLIY 
CW*QCKMVQ/PLWKRVWQFPTKLKHSLN 
I*SAVVLLGIYP\KSGKFNVHTKTCT*M 
FLAK* PQCSSVDEERKKM/WYMYGFE 


1021 


J 5422 


i\ 


1529 


2 


* 


GLKQSSCFGFPKCWDYR/R*AALSGRLIi 
LTISSH/REQIEQEFTHYPEKSAKLFMQ 
DGPPRRK/H PPTRP QASNIQNKFS FFLR 
QSL/NSLAQAG 


1522 


15423 


A 


1530 


1 

- 


418 


GTDTE * ACDKIQKPF PDKRLNKLG/ IRK 
KHLQLDKYL / WKNP IAS I ILKSRRLKS V 
RLRPGDGRQGCSFPAWLFNIILEGIiARA 
IR*EKQKQKQGQ\QIGKKEVNLSLFIED 
I MLYI ENLKESTKKP III INBFSRI^IDF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 


M 
eth 

00 


SEQID 
NO: id 

26 


Predicted 
beginning 
nucleoli cie 

lnr»<it inn 

correspond 
ing to first 
amino acid 
residue of 
pcpiiue 
sequence 

* • 


Predict- 
ed end 

*% ft ft aIa 

nucie- 

OlIUc 

location 
correspon 
ding to ~~ 
last amino 

n i rl 

aciu 

residue of 

peptide 

sequence 


Amino acid sequence (A«A!anine 

OCysteine, D=Aspartic Acid, 

kj— vviuiamic aciu) r— r Dcnyinianine, 

f^=r21vrSriA H=HictirfinP l±=fcnlatir»in» 

K=Lysine, I>Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 

U/sTruntnnhan V^TvrncitiA 

vy—i rypiupnno, i~~.ijru»iiie, 
X=Unknown, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RP 


1523 


15424 


A 


1531 


381 


3 


ILQGFVIRKKSKI FQKACHTSDLWTLTN 

MRKTTLSCCFLT*DFFLPLLPPCFFR*F 
ILLIF*LYFCKDKVSLCCPSWSKLLP\Q 
QSSCLSLPKYQDYKA 


1524 


15425 


A 


1532 

i 


38 


479 


DEACGPQDPYIiTP Y VKTQW 1 JsD* TRNKG 

I qfleengkn\ fd igfs sdlldmtpktr 

ATKVKL\NDIRLRNFCASKDTINLSLIjC 

r\kr* pvewekisanhisdkgli sgiyr 

QPPLNSKTSHLI * K*ARDFNRHFSEEDI 

ftn»T \mLTT rr 


1525 


15426 

< 


A 


1533 


105 


447 


L IFCRVFE YLHSLHLPQE I CLS LALFSR 
FTFCVI ICEVDWSVIFKVPFCSKRNKV 
AVHTMLYIQIFVSLFI * PQNWKQPKCPA 
TVERINKMwYIHI V/ EYYSANKR 


1526 


15427 


A 


1534 


76 


471 


VWVCLLS LEGS QS KFGNS I E FGVLLS SG 
GFS AWRL FFFVYFLRQSL /NS VAQAGVQ 
QWRDLSSLQPLPPGFK*VLKQRGVCLFV 
CFETESHS I AQAGTQWCDLGS LQ PLS PE 
FKRFSCL\SLSPPWP t SG 


1527 

■ 


15428 


A 


1535 


45 


338 


SNNEPFLDWI VM* RKVDFIRQPAMTSSV 
VGPRRRSKALPKAKLAPKKVMVTIWWSS 
ARLIHCSFLNASETIASEECTQQIDDMH 
* KLQRLQ AAL I NRK 


1528 


15429 


A 


1536 


425 


1 


FFNITFHSVSFSPLSQKCILTL* ILLKS 
I VKNNMRNFQRS L VRK* AKDMNRHFTDD 
HVQMAS / KHMKRCSL VIGEMQ I KN \TVS 
YHYPPIRMIKVRNSSNTKCW*GCGQTGS 
LMRCWW/NK*NQLLWKTGIPPHGLVSTR 
SRVP 


1529 

■ 


15430 


A 


1537 

* 


4 


443 


ETFVDHYQCGG I RP FS DLQLHAGRTTAL 
FKAVRQGHLSLQRLLLSF \ VCLCPAPRG 
GAYRGRQVS L»o CGGLHP VRAo WL JjCLPK 
♦AWTMEGTSTPASLPPCSLISDCCASNQ 

DneTrPTPnecDPTVPVWT t.uodct ooootr 
RDSVGIorSfirGAli xiNl^VirKr IjbiroJSJV 

RSIWVGV 


1530 


15431 


A 


1538 


487 


3 


TQNGG VLLSAPRS VFS PTTLR/ CTLOAQ 
C*AFWWGGTQQAASSTAAMAAMKPLGIW 

ELSIGRQRAWREQPDQSAEPPFSQAPRP 
GYPPS PQPLSMRRGPGANPRLARPLRGP 
VRVRLRRAS SERQKRSRGGSGPLG 


1531 


15432 


A 


1539 


394 


489 


IYLFI FETESRS VAHAGMQWRDLDSLQP 
SPTG 


1532 


15433 


A 


1540 


475 


202 


T}/~iC /~< T»TT? CT3T3PIKTT30T VTTMCfVlrtDDVQU'DD 

PPPGNGEGGKT PGAGHSG I * NP PPPLKI 
N/LEKGGRGGPPGPLLNPPFPKEKPRGG 
KKKKKKKR 


1533 


15434 


A 


1541 


14 


468 


LSMWWNS RLKARRLVLVSQWNA\ KERF 
LKE I KS VI PMNTLMVRERNS L I VAVEKV 
LVA* VKEQTSHNI PLCQSLVQSKALTLL 
HS VKAKRG* EAAEEKLEAGKHWFLRFKG 
KS RLHNI KVQGEATS TDGELQQVTYPDL 
VKIVGEGDYPKQ 


1534 


15435 


A 


1542 


479 


141 


RKTDS WDL I K\ S FSTAK* AINE ADGQTT 
EWEKT FANYASDKGL I S RKELKQ INKKK 
ANS P I KK* AKDTKKQ PTNMKKCRTSLI I 
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SEQ n> 
NO: of 
nucleotide 
sequence 

'i 


SEQ IB 
NO: of 
peptide 
sequence 

■ 


i M, 
eth 
od 

• 


SEQID 
NO: in 

09/515,1 
*>< 


Predicted 
beginning 
nucleotide 
location 

rm — _ ■ - - - — mm mm. t% mm. *M 

correspond 
ing to first 
amino acid 
residue of 
peptide - 
sequence 


Predict- 
ed end 
nucle- 
otide 
iocs uon 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
bj- Glutamic Acid, Jr— rnenyiaianine, 
G=Glycine, H=Histidine, l=I$oleucine, 
k— JLysine, l#— JLeucine, m— metnionine, 
N^Asparogine, P=Proline, 
Q^Giutamine, R=Arginine, S=Scrine, 
"^Threonine, V=Valine, ■' 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














REMQ I KTTMR YHLTTESE * LL I KSQQRA 
h 


1535 


15436 


A . 


1543 


17 


474 


NPKKKKKKKI DLLDT YYTL YPK/ AEYTF 
SSE IHKNWLYIN* LRANLNKFIN* KIKT 
I LSDHNRRQPE INKCNKTKR/R VT/TQK 
FKNLILSNYWVKGELQTQIABFLKRNVN 
KNT I YLNT * NTI KVLNSGGST / ALNTYI 
DKNFKKRVPS 


1536 


15437 


A 

• 


1544 


89 


481 


ICLKVISLRQENATSVACIVSCLSEGSR 
ASENLVLHNAS PNS CGLG I AFTCLCNW 
SSGNKITDCYD*LSSS/WLOFFFFFFLK 
KS FL FVAQAGGQGGNLG* PKPLP PGLKQ 
FLPPPRANFCFLEKTGFFLF 


1537 


15438 


A 


1545 


1 


300 


PPPPAPXXCRPY*XPWXXPVYHS SWRHG 
SGAAQGAVLAGFGGVGRQGPGAASVSIP 
LCPENQGCREPGPSHAVPAPSALPSLRS 
LTGGQ I GGTRAAQAVG 


1538 


15439 


A 


1546 


2 

■ 


436 


GAPE I KS 1 1 KGYND / RLCTTKF YNLDEM 
D KFLVRHKLP KLI * E * IDNLNR W I TSQE 
TDW* I *QQSSSSSSSSSS/PSSSSSRPN 
GFTTES YQSFEDKL I PI ICKLLKKIDKE 
\GHFPIiQL*GITQI pkpdi yh/ienyrp 
ISLM 


1539 


15440 


A 


1547 




419 


PATWEDHLSQGGRDCSEPRLHHCTPAWV 
RE*DPTSKKKKEK/NDQ*LLIP*S*RRL 
+RSSHQTSHLMDWNSEAGR*EATFLRSY 
SKL\KEELELLLILDNNDDHNNSD*YLL 

m*m mm\^ mr m^m mm* mm. JH flHK^ 

SIYSIPGTVLS 


1540 


15441 


A 


1548 


37 


339 


KRWKCLRA* LLMRPRHAD * LNPDGVGYS 
EL*SRHCTPAWVTEQDLVSISTNRKNER 
HTLEYSHQHYSG*P*TG/EESEYPSAL* 
QRE I IDYS FIQGMTDQIj 


1541 


15442 


A 


1549 


477 


1 


PGSKDLGSYT*PQ/WSSPEVTSRDAPS 

v my mm. — ^_ m _ ■ ■ ^ w ^_ — m* m* mmm mlm ^m. mm mm. m*mmmW mm\ mm mmm. 

HPSAPKFCSNPCRGRWLTS SKQPKLRLT 
SAVPGAPGARGTFHALGAGAAEAGGHSA 
SRPEAALCRPLPPLPMTLTSHPLXjSGPG 
RLAWGCNGRR* IKGGG 


1542 


15443 

- 


A 


1550 


430 

- 


8 


CWPGSSGTPDLK*STRPGLPTCWDYRHE 
PLCPAS KT FLS PQ I ETP YPLNNNAHSPR 
RPALVNYSLLSVSMDLPIIj\AFHTNGIT 
GYVASPAPPVRWGSHSVAQAGVPWPNLG 
SL* PPPPRLK*S SRLSuLS SWDYSHMTP 
LN 


1543 


15444 


A . 


1551 


2 


419 


ETSPSLQGWLGVLFPKRGAKTSRFLIIR 
PQGGSFKDGDFFNPPGEIKTPPAKKKKK 
KDSARS P P ARLQA* GAGLWDARAS FRRP 
FQAPVPLSAQPRAQPIiAVGTSRDGGSSG 
PTPGQESAV\PWREKHPQQPPPQPG 


1 CAA 

1544 

• 


1 G A AC 

15445 


A 


1 j 52 


*> 
2 


Jo7 


T7Din/an2inT.PT.TyZ\ CCHDQ'U'QAQnQ'TYlT 
i 1 Kn V>iy>ibJjCiJjJju \ ooi/ro Vo/ioySioJ. 

RGMT YRAQ PGL * LFNS KNS I LSGP KV/L 
QDYMW 


1545 


15446 


A 


1553 


393 


2 


KNPIFFFLKQGFPFFP*LEGRGKFLiGPC 
HLCFPGSKKG\LPHPPQLIGAPGPPPSP 
GFFFLKRGFS /LFCPGGSFS LRRKGPPP 
PALPKF\GFLRVTPLAGPGNPSFFFFFF 
RDGVSLCHPGWSAVAASRLTA 


1546 


15447 


A 


1554 


1 


427 


LLLL I L YAI VEAVT * TEGAG Y P P L PGSY 
SHRGASGHRAIFALHLTGGCCI *GAMHC 
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SEQED 
NO: of 
nucieouuc 
sequence 


SEQU) 
NO: of 
pepnue 
sequence 

• 


M 
etb 

OO 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

n ii c 1 c*f\t i /I f» 

nuLicoiiuc 

location 
correspond 
ing to first 
amino aciu 

residue of 

peptide 

sequence 


Predict- 
ed end 

illicit!"' 

otide 

location 

correspon 

UIDg W 

last amino 
acid 

residue of 

peptide 

Sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 

mir A i»iH T?sPhpnvlQl**nirtA 

Hi^^jP 1 U L2x Hill* ntlllj * 1 I1C1I V1a1«UI1IIC, 

G=GJycine, H=Histidine, I=]sofeucine, 
K=Lysine, l>Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

VJlUianillH;, x\~rVI gllllUC, 0~OCl IIIC, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucicOiiQc inseruvn 














ITTMIDIEPPAITQYHTPLLA+SGLIT* 
GL\LVLSLPVLDAGITILLTDRNLITTL 
FNPDGGGD P IL YQHT J i * VFGHP EVYIRI 
LPR 


1547 


15448 


A 


1555 


•> 

3 . 


382 


1 JuX loJjr AA*ALjv7ii/ <^oiYJj& JjJj 

PGNLLGNDHI xOTIGTAHALVI ILFIVT 
PII IGGPGN* IiDTL I FAAPDMALSRLNN 

GYLCLAGNYSHP * AS V 


1548 


15449 


A 


1556 

i 


380 


3 


EVSPCCPGWS*TPGFKQFACLGLPRCWS 
YSREPAPPAKACFLDI*VK*WFCK*GSP 

LVLFFCFFVFVFVFVFF*HRIS\SVAQG 
GVQWCDLGSLQPPP 


1 r" At\ 

1549 


15450 


A 


1557 


100 


254 


IPTVPTl* lPIl^rTlKARliSoSGPvjiSW 
PVDSAK*VPLAVVSLDSR\RDSGNLVHP 
LMRVTN* MKRHL VTLTQS \ CYS 


1550 


15451 


A 


1558 


2 


289 


APGVS S TMEDEMGGGLEPQRRGCS KQRS 
HHCI PAWATE * DCLKFNNNKKNYLLL I S 

GEKEH*SQKRKSK (( 


1551 


15452 


A 


1559 


24 


354 


PLPSASPGPEGATPVPTS/ACPNKIKLY 
HLKKKKKKKKKKKKKKKKKKKKKKKKKK 
KTKKKNKKRKQQLNE* ETG 


1552 

* 


15453 


A 


1560 


376 


2 


AARGSGVRD PLEE AVCijFSDIiQ JjRAGRT 
TALLKALFK\RQGHLSLQRIiLLSF\VCL 
CP APRGGAYRRRQASL SGGGLHPVRAS R 
LiLjCLP KQAWAMAGAPP r ASJjF FCo WXoD 
CCASNQ*DSVGVG 


1553 


15454 


a 


1561 


3 


408 


AASTVFLPFLERKGIDFGFLFFFFGEKK 
FPFLAPGGAPGGLFSFPEASSPGLNPFF 
WPNPPEK*KKGGPPPPPGFFFFF*KKRG 

rrCio > IrvjviAtrr xrutr !\1 wJrr^FlriVAjvjCirK 

GGPPPPGPNFFFFFLKGKGGGGPP 


1554 


15455 


A 


1562 


355 


161 


FKPGDGGCSEPRWCHCTPIWVIMRDSVS 
KPKQSKTKKRNVFT/C*D*VF*NNNINY 
INCFS*RALYILYEKQFM 


1555 


15456 


A 


1563 


410 


1 


•PD DDTJ DVTIIT7T /ITDT 1?DlfMVBf27/JT.l?OT)D 

1 etftrirclsxic lAji irlir irJ\X«l>-fUjlvvjijc irrc 

LGFPKGGQWPI PPQRFPPI*FSPKKRAD* 
KPKPPPALKIRGPNPGVFPKGFCGFSFF 
PPPPFPRGGGLIFFLPP/TK*SGG*KKP 


1556 


15457 

• 


A 


1564 


2 


374 


ADRNLNTTFFDPAG/GVGDP I LYQHLF * 
FFGHPEGYILILPGFGIISHIVTYYSGK 
KEPFGYIGMV*AMISIGFIjGFIV*aHHI 

FS*LATLHGSNMK 


1557 


15458 


A 


1565 


396 


0 


1 1 FLI flrhgfawaqagmqwcglgslq 
plfpgfrlfs /cllss wdyr* rqgftml 
arlvlns*pqai\saspsqspgitdvsh 

CA 


1558 


15459 


A 


1566 


186 


452 


kqkcnfktlntkyqplpffflernfcfc 
ppgggegadftfleplpsgakgff\clt 
lqrm/we*rvspptplnpgflvxkgfsi, 
cgstgf 


1559 


15460 


A 


1567 


453 


37 


KTALYWKKTPSSTS IAREEKSVPGFKGQ 
AHFLFRGKHKFKLVSKLIYHFENP/R/A 
LKNYAKSIL/PYKWNKAWMTAHLFSP\W 
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SEQID 
NO: of 
nucieoiiae 
sequence 


SEQID 
NO: of 
peptide 
sequence 


etb 
oa 

• 


SEQID 
NO: in 

09/515,1 
26 

■ 


Predicted 

beginning 

nucieonae 

location 

correspond 

•ng to iirst 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucic* 
otide 
location 
corrcopuu 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
C=Cysteine, D=Aspartic Acid, 
Hr^vjiuiainic acio^ r — x nenyiaianine, 
G=Glycine, H=Histidine, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 

NaAanaf ooinA P^Pi»a lint* 

i^^Asparagine, sr^r roune> 
Q=Glutamine, R=Arginine, S=Se rine, w 
T=Threonine, V=Valine, 
W=Tryptopban f Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucieouoe insertion 














PIKYFRPTVEM*CSYNKILVLIDTQLDH 
PRPLIETYK\FINLHVCVLFNTTPIVQP 
LPR 


1560 


15461 

* 


A 


1568 

• 


453 

■ 


39 


TALYWKKTPSRTSIAREEKSVPGFKGQA 

KNTYAKS / TLA YKWNKAWMTAHL FS P \ WF 
I KYFRPTVEM * CS YNKILVLI DTGLDHP 
RTLIETYK\FINLHVGVLFNTTPILQPM 
PRAQ 


1561 


15462 


A 


1569 


1 


352 


FGTRRER ERERKRERE RERERERGRGGA 
6HQ I PS I EGQHKI VGAI I Y \ CREC\GB K 
ISGSTSYIKVWDIRDSAKCIRTLTSSGQ 
GISGDACAATFTRAITSAQGEH* INQIA 
LRPSGT 


1562 


15463 


A 


1570 


394 


3 


TLDWPQSRGGSTGKPVYPSVCCCCCCCC 
CFNFCQEAEKNNAEGLLHNQAGRTKDGS 
FAPSHDH * A/PRGTEV/DLLESTIjQTS I 
KQVES KPR \ EQARTGAGGQKEKATQNPE 
KSVLTSMYTKSQGSEGRLPGNR 


1563 


15464 

* 


A 


1571 


399 


2 


KHQLPVFWQYNKKAWTTRTLFLDWLHCC 
FVFE VRKYLAS KGLPFKWL I IiDNAPGH 
PPRTP* VQY\KGIEVI YLPPNTMSLI* F 
LDQGVIRTYR/ SHYTQYSMQRTISAMQE 
NSNKBNI I KVWKDSTTDDAI VA 


1564 


15465 


A 


1572 


37 


400 


RGT I TGEAASADQE I ADKLS DAINKXME 
EKG\ Y* LPEQVFNLDESTLFWG\KKKPQ 
RTLLSKE KKKAPGFKTOKDRLTLLFC /A 
NAVKLI IRTALTYKAENPQALKENVKHQ 
LPVFCLTTRGL 


1565 


15466 


A 


1573 


84 


485 


AGHKDS PRPHQTQE PS WLHLWDPAPGLQ 
VELPASP\GRAIALLSPWWDGTGRPGA 
GGGTRRGGSGPTGAHGAGGRLMHGGLQV 
PSPAPREGS*GPARYQAQRMWARTAGGP 
STPSAGASRVPSPHCPGP 


1566 


15467 


A 


1574 


3 


463 


TPAQGLRDPSl^RKHAYCGCCV/ GITIjC 
VGAQNIOJRAVCGLYSTCPRLCVYEH32HI 
CVNE*VCEHVCERESVRVCESTH/LPLC 
A*T03PIFGCMSEKHVFSYTPCVHRVCV 
CVCVHLGCCVC/VCVCVCVCVCVCVCV 


1567 


15468 


A 


1575 


1 


383 


FLSFGFAPQAGGQGHNHG*GP P * P /PKA 
KGX r P PHPPElCRSQRVHATPPGl^ljUFr 
FFWKKGGLNLGPKKNLGPGGKKNIiLVS P 
PKGGGKKKETPGPGGVFFWGGI FFFFP P 
PCSPG*PLSLLKKPKGG • 


1568 

t 


15469 


A 


1576 


35 


469 

■ 


RI PRCHQP VGPLGCREGAKPQG PG PDAA 

CRRNAGAG PRAHTAG I CAPQPNS * SRHH 
PWAAAPAGDDSPTTSLGSHVPVGRKSSD 
GCRORALCT PGS P APS EAEVGGS PELRS 
LRPAWA 


1569 


15470 


A 


1577 


473 


62 


SLEINP YI SGQLI FNQHAKTNH* SMRKE 
\ S FFNKWC LDN * I S TGRRMKLD P YliKPH 
I KLNS K / LKDL1N IRDTTMINI*HD3jGFC3N 
GF*TMTTK*AIKEKIN/WDFIKI*NFCA 
SNDIIKKVKRPNVTAYTCNPSTLEGPR 


1570 


15471 


A 


1578 


59 


426 


LERRSFGAPLSPFFAPQFEMKKGFGSP* 
KFFFSPKALNFGGGWPFFPPPKKRFFS 
KNPQEGFI PPPLKKKKKXSQPP/ YKFGP 
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SEQU) 
NO: of 

1 1 U»-ICUIIUC 

sequence 


SEQ ID 
NO: of 

nentiHp 
sequence 


M 

eth 

nd 

• 


SEQ IB 
NO: in 

USSN 

09/515,1 
26 


Predicted 
beginning 

niiclfntide 

location 
correspond 
»no to first 
amino acid 
residue of 
peptide 

JCUUCUkC 

1 


Predict- 
ed end 

nu de- 
otide 
location 
corresoon 
ding to 
last amino 
add 

r&cidue of 

• GJIUUG Ul 

peptide 
sequence 


Amino acid sequence (A-A)anine 
D=Cysteine, D=Aspartjc Add, 
E=Glutamic Acid. F^Phenvlalanine. 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I>Leucine, M=Methionine, 
N=AsDflraeine, P=ProHne t 
Q^GIutamine, R^Arginine, S=Serine, 
^Threonine, V=Valine, 
W«Tryptophan, Y^Tyrosine, 
X—Unknown. *=StOD codon./^DOSsible 
nucleotide deletion, ^possible 
nucleotide insertion 








■ 






PKES FKRAPP FFFFKGGPFFPIiVSNQWT 
GKPRLPPPPPKETQNPSSFQIGGQKKGK 
GGPQSFSFPIKKKKKKGGHSRSRTSPRV 


1571 

* 


15472 


A 


1579 


2 


419 


KANKTKNAYFEG INKMDKPQVI LKKKY / 
REKT* IPSIGEEKGIMSP\NSEDTKRI I 
KECFKQTYAHRFYSLKB/MDII^ESHKL 
PKLTQ/EETDSLNSPV 


1572 


15473 


A 


1580 


129 


403 


YMFFIPINCQDHPK*KKKKKKKKKKRGG 
/RP FKKTLRGP KLNRAE KNKFFF* KGS I 
KKKCIiEILKKKLFFGGEKNCKNPPKKKK 
PSREKKKF } 


1573 


15474 


A 


1581 


317 


76 


PRFFFFFPPPKKGFFSPPFFFFSPRFFP 
PPPFLKPPPRFFFFGPFKKNFF\PPPRP 
TiTPFPP*APPPPFFFFPFFFFFFV 


1574 


15475 


A 


1582 


259 


377 


PREMKTYLPTKKLGYE I SH YH* W WGCKM 
VQP VW KTVWQFLKGLN I KLP * DSAVPLV 

IANK* KQPKCPRMNKW 


1575, 


15476 


A 


1583 


1 


415 


ptrpitssiclrqsylkaliayssishi 
algv\tailnptp*sftgagiliia\hg 
ltcsllfclansnyertrrri i ilsqgh 

nTT .T . PT . T a P* * T .T .Zl P T .JJkWTi JIT . P PTTWT.T » 

GELS ELRTTF S * SN I TLLLTGLNILGT 


1576 


15477 


A 


1584 


216 


406 


LNLLPLVLGGSSCLPPPCGGN*KPPPPP 
G\LFLEKKGFSPCGPAGF* PPALRGPPP 
PPT.PRGT.T 


1577 


15478 


A 


1585 


383 


3 


KKFGYP F YWIG * KI LK* FPG * K* SLPHR 
KSPFFFFF*GRVLLCPPGWRGTTKGHGS 
fWT\ r,PAJVAMTFOVV K"*SSPI*RIiPSR*G 

YRQASPWSGNFFF/CLVESLSMLPGIilL 
NYWAIAI KPSGPPKVLG 


1578 


15479 


A 


1586 


126 


413 


NPTLKK* KMKENRMKKNEQSLRDL*DTI 
KLTNRCILGIPKEERKKGAEGIFGEIMV 
GNS SNliT R \ EN TNLNI EVAO * TIiSRIN * 
KRATLRHVTMKM 


1579 


15480 


A 


1587 


242 


409 


GWMI FR FN FF F FLRGS FTL VTQAGG * GG 
DFG/SLRPPPPGLKRFSCLTLPRSWDYR 
H 


i eon 

1580 


15481 


A 


JD06 


I 


"WO. 

■ 


PTE KKC5TCG KKR RG * R SNKKKRGGRLEGO 
KY*SPPCPRLSFFLVDEKGPEVLQDSLG 
GWWKTP/SGCPEMTDSSQPY/YRAFYVL 
KN/QRVGFSVDVGE IEKDQDVEKNQDPS 
CPRL 


I JOl 


1 </tQO 

154oz 




1J07 




ft 
u 


NNFLEOLKFP PKKKI TTDS YKAPRPFFF 
FPP SQKGVFP PTLFFGFPPGFP P PPFLN 
PPPGFF/CFWAPLKKFFFPYPGG*TWVS 
LKGPP/L/RFFFFFF*DGVSLCHPGWSS 


1582 

m mm ^Mmm 


15483 

• mw ■ %mrw& 


A 


1590 


75 


412 


VEGQHCNFCAAQET INRVKRQHTELLET 
FANWS PDKGI* I PRT YKELKHLNRKKHS Y 
* KWADDLDRHFSKETYTPPKTYSCTQYC 
*P/SITIEN*KLKTSNIKTRLQGGHKRL 

ER . . 


1583 


15484 




1591 


309 


1 


FSTQGAHMQVCYMDTFHDAE I WTS I EPV 
TQIVNMLPNR*FFNPCPHFFPPPFCSP/ 
LVSFLLLLRDRVSLCCPGQSRSVGFK*S 
TCLGLPKRWDYKCEPPHVA 


1584 


15485 


A 


1592 


2 


415 


LS X S LS X FS FLP \ FS FliLPS LSLSLFLS 
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Amino acid sequence (A=Alanine 
OCysteine, 1>= As parti c Acid, 
E^GIutamic Acid, F=Phcny lalaninc, 
G-Glycinc, H=Histidine, J=«Iso leu cine, 
K*=0Lysine, IHLeucine, M=Methionine, . 
N=Asparagine, P^ProIine, 
Q-Glutamine, R=Arginine, S=Serine, 
T=Tnreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, * Ba Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














FFLFLSFFPPSFFFFFHRASLCHLGWSA 
MA*SWLTAVSTSQ\VKQSSHL 


1585 


15486 

* 


A . 


1593 


3 


396 


HSAFLFFL*DRVSLCHLSWKAVAQSQLT 
ATSTLL/VLKQSSHLNLP 


1586 


15487 

i 


A 


1594 


300 


62 


KMTGVLKSSCGKSPKQVGWLCVCVCVF/ 
S FKRQGLAS / VAQTGMQWGDHS SLQPPA 
PGLR*SSCISLISSWDYRRVPRRAPG 


1587 


15488 


A 


1595 


1 


412 


FDRPAADQKAAS ALKAS GVQAQMAKGTY 
HDWS LQDWKVLWMTH* VS \QQEQDPTNL 
YISNLPLCMDELQLENMLKPFGQVISTR 
ILRDYSGTFRGDGFARMESTDKCDAVIE 
HYNVLL I MTPPGVSAPTE PLLCKFAE 


1588 


15489 


A 


1596 


428 


2 


QSESREPENFLLPTSTSSLLLVHCLSRT 
CWNSTLSCVPHFFHLTFNSHLLLTSQP 
RRVHFC*IiSLSSI I SWKLLNTLPGVPIR 
\ASEI FGLRTIRNFPFLSHS VLFPFSLS 
ESHSVTQQGVQWHELGSLQPLPHGOQWL 
SCL 


1589 


15490 

i 


A 

** 


1597 


7 




QGSL* PHPPMLK*SLTSAF\NYRHVSPH 
IjANY FLFFF* KQGLAVLLGLALN* AQ/C 
HLPALAS QSAG I TGVSHGAWLFFFFLNQ 
C YLI * FL IL I F \ERRNS PVAHVLVNGGD 
LGLLKPPPPGLMGFSCLNLLRSGKYKPP 
ATSPGYIFC 


1590 


15491 


A 


1598 


419 


121 


NliGYPRVSPPPPFLNPPPEFYEGPPKKK 
FY/PPPPPAQKIDPP*TPPPFFFFGTDG 
SHYVAQAGLEIiIAS SDLPASALQSTGIT 
KHE PPHLAET I FLVFL 


1591 


15492 


A 


1599 


45 


397 


DRVSLCYPGWSAWV * S*LTAASNSW\V 
K*SACFSLLSRGDYMCMTLHIANIKKNF 
LGRVRGSHL * S YHLKLCLKK*RTKBGFL 
S FFWKGKRHFLGQKNI LNPRLKVFFLPG 
PPKGLGY 


1592 


15493 


A 


1600 


3 


397 


S RR PGRFS LMLTLS WHS * VCRAAIaAARE 
EQWSGCFKSHF*LEVNFWVSGIDREV/C 
GY/LKTVLGEDLNDYVSTQI *D*LMKPR 
CPEKQDESLLKEFGGGA*RLNWHRPGA 
VAHACNPNALGAKGRRIPRSG 


1593 


15494 


A 


1601 


244 


2 


KTKTSFHSLIDSCGYLLSYSNFQI*KKM 
I FK/YLHLRAVPRHVIVRFTNLERQEKV 
LRAAREKG*VMHKGKPIRLTADLSA 


1594 


15495 


A 


1602 


435 


2 


PQEAIYTPTQHPTYGAICRIARIHGSRD 
Q/SVEMKWHPQLTITPSGP/liGKSLLPI 
P VALC FAD LEVF I P KGGMLS PGVT 1 1 SL 
NWKLRLPWVILQ * RITPLARVI PPDYQR 
E I RLLLNNERKKS YVWNTRDPLGHLVVL 
PCPMVIK 


1595 


15496 


A 


1603 


.288 


8 


EFPLIREMQIKPTMRYRLTPV*MD\R*W 
* GCGKRGTLVHCWW* CKLVQSVCERSSK 
NITI ELPCDPAVLILGMHTKERKLRPCG 
WTRGVGPGYC 


1596 


15497 


A 


1604 


411 


236 


LIPSEHFFFSG* F* AFDRSSLAPTPQLR 
RHRPPTGLLPLSLLQGP\LLTPSVLLS* 
GGRFKGPNFTPPGGQGKIFFMGPPKSNP 
GPGV 


1597 


15498 


A 


1605 


2 


437 


KCLRISPCAGPRRPWCPSFEPRVCVWPL 
GVPQVG/ PEGP *GEG\GFEGGDVRQLWQ 
GKKKKKKNTLGKRGPWFPKEKRGFSLTP 
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OCysteine, D=Aspartic Acid, 
ti— Glutamic Acid, r==Pheny (alanine, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














W f XjPQGLiWE F CLN PGGPjjF L I GG FQKGP 
KVLIGKGKTPGPKKKNNRPRKRGSLGPP 
GKDDFPL 


1598 


15499 


A 


1606 


31 


467 


EFPGRRFRGGGCSELRLRVCTPAWTTEG 
DSISNNI *LVNEN/HTVNSWKCSDS 


1599 


15500 


A 


1607 


418 


32 


VGQVGLELLTSSDPPASAYPK*WDYKR\ 
D PRAQ PNPNI LKAQSNRSP PRQR* PASS 
FSNQNS CMHRCI SLP * VLPL I FFKNRP * 
NNS FGYNS SGWWSGLNILNSC* FWFLFG 
NSDTE I R I CRS FHRGREV 


1600 


15501 

> 


A 


1608 


379 


3 


FPILSPSYNPSSTELP***SLQSPLFBP 
YDFFS VNMFS FL FFNLKSHTVAQAGVQW 
RNHGSIiQ PRS FQLK\HPPAS AS AGTTGM 
CHHAW/LIYLCVYLFLEMRSHCVSQD*V 
QWHNHSSLQPETPTK 


1601 


15502 


A 


1609 


456 


99 


PFPTPFPPPHSNWGLGLGCGL/EPTMTP 
SGLGLKGS PS * S P PAHRLSGAQLCFVLR 
AP VLG P S QMP PG R KKKPTARG S P WRKGV 
FVMSGQSDPLGPSSCQELGPRQSTQGTF 
PGA 


1602 


15503 

• 


A 


1610 


1 

• 


420 


FRFSDGAAGQKCSS P.PRPGRGRAEVLLT 
SQTTGRQGRGAPHISDNGQPGRDAP\PS* 
M*WRPGRGAP\PS+VGWRPGGDAPHF\Q 
TGQPGRGAPHI PDDGRPGRDAPHFPDGV 
GAGQRLQTRHFGRPTAAAWKVKVVTSLR 


1603 

■ 


15504 


A 


1611 


426 

- 


3 


KNPFLLEAKVSFNPKWPPALPPGEQRDS 
VSQKKKKNLL IHKKAHSKH * FF I CRE \ C 
ES ALljLHQJTIHAGGKS YVCNK *GRGFRN 
KSHFTYQRTHSGKKAFl4*KECG*DFL* K 
AILTAYQKTHSGKKSFVCKECR*DFTQK 

in Tr 

TK 


1604 


15505 


A 


1612 


428 


273 


HHA*LIFKIF/CVETRVSLLCPGWS*TP 
E LK* S S LLG CP KCWD YRRE P P I RPTT 


1605 


15506 


A 


1613 


311 


4 

* 


ANKFKNLNEI KFPEAHNLPKFTQEGGLN 
NPVSLY* KN*TYSLRLFP\KKKKSGPDD 
FTGEFN*TS KEEVPSLHKLLQKI * ERNT 
LPNLFHKARVTQVS KSEM 


1606 


15507 


A 


1614 


338 


88 


PNP P P P S KG K \GFP PPTPGRKKK TRPP P 
PPGKFLGF * KKRGFPPFERGGP \NPAPG 
GPPPPKPPKGGGTKGEPPSPEGGVFFFF 


1607 


15508 


A 


1615 


2 


162 


KHGGTCLYSLLLGRLRHESCLNLGGGGC 
RE/PB/ SHCIPAWATE*DSVSEKKKYL 


1608 


15509 


A 


1616 


3 

* 


399 


PEVREYLTSRGLPFKVLLILDNAHGHSE 
PQRFNTEG INVFYLLPNPRS L I QTLDQG 
VTRTFKSHYTWYSTERIANAMEENPDRT 
S * KSRIMTPL/ IDAI VMTEKAMEAIMPK 
TI ISCWRKLCPDWHDFTRFT 


1609 


15510 


A 


1617 


390 


1 


KRNCFGPFLTPPPGQRLGGFKFIjK\HFF 

£ XKVJiUCuuf rf orf nAur trctr trirr ri\*rc 

LGALKF*RGVPPSKPP?PGPGKKFKLKP 
FPPRKSYGGCFKCFLGGPFPFEGPPPQK 
KKKKNFTAARDLE PNAW 


1610 


15511 


A 


1618 


468 


0 


MKLVNIWLLLLVALL*GKKHLGDRLEKK 
S FEKAPCPGCSHLTLKVE FS STVAEYEY 
IVAFNGYFTAKARNSFISRALKSSEVHN 
WRI IPRSWPSSDYPT* WP/VALKKKKKK 
AG V/LPLENYS I I TRV 


1611 


15512 


A 


1619 


421 


2 


SSRLSLPKCWDYRREPPRPAQPRILKKI 
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Amino acid sequence (A-Alanine 

C=Cysteine, D=Aspartic Acid, 

iL — ijri uciiii ic aciu, r— r ueny iBianine, 

G=Glycine, H=Histidine, l=Isoleucine, 

K=Lysine, L^Leucine, M=Methionine, 

N^Asparagine, P=Proline, 

Q=G!u famine, R-Arginine, S=Serine, 

T\ — k fnnn inn V— Volino 

i— inreonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 




- 




■ 




■ 


o w ^ bFr c LiXXjJU-1 w JUK IvGijOKonK&jKLj 
NS TYVLHKFMD* * LGQLMNGIiNMLDESS 
FCNVLKHTKQ\WRDYYFVCLFVCFVRQS 
Li/ JN F Vb yiib VUKKXJiiob XiKAF FFGF AFF 

s 


1612 

• 


15513 


A 

* 


1620 


406 


2 


GT KKASSNGF INGPGAAKKKSQTFAFliA 
IvtrQjAr FoXlji^bK^KFyXlUrrACr F/ SX 
RESQTSYPVFFFWSPSCGFFFFWGKPEP 

7 T T.T/T7 a/^T?-» 7T5T TpT.TST.WSWT #nf2trCT.rV\t3 
ixbijtVclAVjlN. V FXir Liff Lfbr Jj w jJIj Vi>JjV-Ulr 

GWNAVLRSQLTAESNPHASAHAS 


1613 


15514 


A 


1621 


411 


0 


PPPPKKKPSGPPPP/PSSSSSPSPPRKF 
LGGPRVFFPPPFFKPPPPKNFWGPPKKK 
K± FFFFfcHjKKrr r *KAFFF 


1614 


15515 


A 


1622 


2 


403 


TARCGLNFPCSSSLPFIAS * VAGTTGTH 
HHAQLILLI FCGDELSLM\CPGWS/ PNS 
SHLGLPKCWDYR 


1615 


15516 


A 


1623 


298 


411 


LIINVCWPGLVAHACNPSTLGG*GGRI/ 
TMRSGVRDQP 


1616 


15517 


A 


1624 


263 


2 


DSVSKKKKNFKE*LIPVLLKLFQNIEEE 
GILPNSLYKAGV\liIPKPDKDT*RKEIY 
RPI SLINIDAKI VS KILANKIQ* FIKKI 
TDAW 


1617 


15518 


A 


1625 


3 


281 


PFSCLSLPSCWDYRRPPPRPANFF/VYF 
YKKNTRTQGFTVKHGFTVLTRLVLIS * P 
CD P PSIiASQS AG X TGVSHR T QPHT V r r Jj 
NNPALPKLQT 


1618 


15519 


A 


1626 


300 


20 


NPG PRGF P P PG P PKRLD FRGGAPRPGF * 
YFLKNFLGFFWFQKNFLVFFLGNKTPFP . 
PQFFFFF/ RDRMSLCHPGWSEVAQAWLK 
AAXiTSQT PA I 


1619 


15520 


A 


1627 


394 


40 


PQ P AAAS LFSPGLFF FFPP VFS P PP FFK 
TPPRIFFFWPP*KNFFF\PPPAFFFFFF 
LGAPPFFFFFFFFFFFFFF * DRVSLCRP 
G W S AVAQ S * LTAALNSQT 


1620 


15521 


A 


1628 


386 


3 


IFPTRCTHLHLGTNKLPTCSFFDQAKKN 
PFCS \Hj^RGVGLRARLFCERLT XEGAG 
TPACPAP*FPGEPTRP/PGRVRWLTPAI 
PALWEAETGS * YVARTGLE LLVSSNP PL 
SAS QSAR I TG VSHRTWP 


1621 


15522 


A . 


1629 


401 


93 


ARGVLPLNPPPWGGRGGGS P*GKNSKPP 

KPKUEN FFFr liKFQKIjFPt'CaUCaFFFFJjr 

LGG*SKKTPPPPKGGAPINQKPPPGLPP 
PGKKGAP FPKKKKKKD KN I RTKKKARRS 
GSLLQSQHPGRPR/RGGPPLTKNPPLAS 
PPQEKRGPPFQKKKKKKTKI 


1622 


15523 


A 


1630 


417 


47 


P PPGT X S S PNP / QKNLKKG PGPGGN PRN 
PPPLGGQRG/RGLWAKKSRPPGPPRGNP 
PLFKKKK INGGGGP P P WP P PRGARAGK 
SLYPGGGPPO*POMGPPPPPPGAKKGFF 
PKKKKKKPKKPRKT 


1623 


15524 


A 


1631 


416 


54 


EYWCKGR * IDQWNRM * FKMDQHLHSQFI 
FN*ATKAIQWGKESLFNRLCLKNWLTIR 
DK\ I YLDACLTT Y \ * KINSS WHSGAFL* 
SQILERLRQEASLSPGI*VQPRQHSKTP 
SLKKFFFKFFS 


1624 


15525 


A 


1632 


2 

• 


373 


LVFLDNMLKLLRHSALASACFPEDIiAKI 
MDEGGYTKGQ I FKVHGTAFCWKKMPSRT 
FVVREQSVPGFK/ATD*LLLGANAAGKr\ 
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KLKPMP I YHS EN PR v F KNYVKP I LPALY 
TWNS KA/QMAAPLF 


1625 

* + 


15526 


A 


1633 


3 


383 


NI WN* KAWMTVHL/ PIT* PTEYPKSTVE 
NC+EKKKILFKI/LLLIDNTFGHPKALM 
EI YKEINVIFMPANTIS IL* PMDQGVTA 
TLNS YYLRNTFHKTIAFINYDS SDRCQQ 
SQLKNFWKQFS ILDAIR 


1626 


15527 


A 


1634 


2 


182 


GACTQL IGRLRQENRLN/ SGDRGCSEP I 
LHL CTP PWATE * D P VSKKKKKNPCPKLK 
KGPPP 


1627 

* 


15528 

» 


A 


1635 


331 

. ■ 


1 


LFPPPPVLKSGPGPN*N* PPLKGKSFGP 
LKKKFFFFFF\FRDWVLLYCLGWSQTSG 
LKRSSCWDYRCEPPHLTPNF/SYF/CRD 
S VSL * PRLEGSGS I IALCSLKLLDSSS P 
H 


1628 


15529 


A 


1636 


80 


381 


KLSKCHAHHSRGFYKYSLFQLGASQFPQ 
VLRITHPHKGSWAAPRS*G*SQC/SHFF 
SFFFETKSHFVVQAGGYGRNFT*LQPPP 
PGLKRFSHLSLPSSWDYS 


1629 


15530 


A 


1637 


381 


38 


SKRQGFHHVGQ AGLELLS S S DIiP TMALH 
PTCPLQ KCWD YRC / QATAPNLS S TFFMQ 
EKCLAFL\PLFSSAPHSL* *PKAVIEKD 
CPGL* IW* *VARSQRMNVFNFIK 


1630 


15531 


A 


1638 


3 


295 


PGPDGFTAEFNQTFKE*LIPILVKLF*K 
IQEVR\FFKFSITIiTPKSNKD 


1631 


15532 


A 


1639 


48 


380 


ILGKAISFTIE*KGLKYLGIYLTKEAKG 
LHTENYKMLLKELKKDTN I W KG I LCQ/ W 
TRRLNMVKISV 


1632 


15533 


A 


1640 


343 


23 


SWLTAT/ FCFLGSSDSPAS AGVELLTSG 
DLPALAS *TAG ITG VSHRTRPALSINTS 
TLLCSSPYCPPHLQSLQGTTQPVPFLKA 
QRTG CLLQGVI LTS QAENFCNKH 


1633 


15534 


A 


1641 


3 


397 


LELFSSAHCCPSLTVMQYYPP\RPTSHD 

*m<m jmm. <mm. mmm -r f —mm mr*. -m m-m <m. J B> m T W WW WW f W W'% W W™t M^ m t' m f WW^t^f J r ^t * V w m W TTfmV 

CQREKSPHRTKKKKPLEGVFIjG*KIKTN 
LENPPPLPFFGGGPPPKGGGPFKIV*GG 
WPWPWGLKIPTLPKGPTPRPPWGPLGTF 
GGEGKPLRAFPLQRFFRGPE 


1634 

* 


15535 


A 


1642 


2 


308 


NKWR*GNWISVKIDNFNFYLMPYTNLS* 
IRDDLNAKATTI KLVGENI GENLGIGKN 
F* ERTLKA\LRGKKMDKLD F I T IGNFCF 
SKDRI KNKNKARRGGSRL 


1635 


15536 


A 


1643 


16 


386 


EKKKLS LFTENGI PYL*NPKESAKRLLS 
LINDFSKV*GYKNN\DEKSVAFLNTNK 


1636 

• 


15537 

• 


A 


1644 

- 


533 

• 


3 


PLSLSLSSLFLSFFIi*DRACFVAQAGVQ 

l.l^ Wm^mm, -mux mm mm jtm, ^m. <mm. mmm ^mX mm* m> m* ■ 1 - mmm Mrm. t —wm^m\ » mm-m m} Wmmm mmm. 

WLDLGSLQPPPPGIK*FS/CGGNVAVTP 
RLSPLTLPAMTEVRLPSSKIQTNKEKNT 
VMSEIYQSLM*MGWG/M*VRMCVKFKKP 
WVDN* * VSVMGCLLPSFLPSFLSFLFLS 

G*LQPPPPAVP 


1637 


15538 


A 


1645 


341 


19 


GIGGRPP * FQLIK5RLKQEK/HLERGKGF 
NEPKSRPCISAWATKGDCFQKKKKIHMT 
LQQIQPKQSWKTDISQFRTDYKTTIIKT 
VGHGI W I D I F ING I ELSENLNQCS 


1638 


15539 


A 


1646 


279 


3 


TFYHNEKDNNKCWLTI \GKI * MLTGCYW 
ECKMVQLPWKTAWQFL/R/DVNI * LP YD 
LAITLLSIYTRKRKTYVYTKTCIQMFLA 
VLFTTAKRWKQP 
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SEQID 
NO: of 
nucleoli lie 
sequence 

'i 


SEQID 
NO: of 
pepuue 
sequence 


M, 
eth 
oq 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

II (1C1BQIIU6 

location 
correspond 
ing to first 
amino acid 
residue oi 
peptide 
sequence 


Predict- 
ed end 
nil ci 
otide 
location 
correspon 
ding to — 
lasi ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
Hi — oiuihiij ic /vLiu, r*~i ncnyimiinine, 
G=G lycine, H=Histidine, I=Iso leucine, 
K=Lysine, L^Leucine, ^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=»Arginine, S=Serine, 
i — i n r eon id e, y— • valine, 
W=Tryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucieonae insertion 


1639 


15540 


A 


1647 


392 


l 


QKLE I MLRE EGMLKAKTGQ KLGLLQQAV 
THLVNAKEKFLKLNVIiLjQGTHKW*ESET 

WFM/RFKERNCLHHLKVQGBAASAWBA 
VANYIEDLVKKIDKGGCIN 


1640 


15541 

• 


A 


1648 


425 


148 


SQSL I * SKALTLCNSMKT/ERAEEVAGK 

SLDGEAAASSPEDLVNFIDEGQ YTKQQ I 
FN*DKTSFFF 


1641 


15542 

i 


A . 


1649 


271 


462 


RRQKKRHKRCILSPLLFDIVLEVPARTI 
*QEKKIKGIQIGKKEV/KIIS1*FADDW 
LYL 


1642 

* 


15543 


A 

• 


1650 


70 


398 


RPEASLRHMCLNAGQLIjSKRAKLiGALSL 
SFFFWKKSLAFAPQPGGQGGNLG**KPP 
LPGLRGFSGLTLLRN/WE*RWPVPPPT\ 
NFGPLI KTGFPLVGQAGFDLRTLGALR 


1643 


15544 


A 


1651 


425 


3 


FEGFPKVGFPLGPRFPSRVPPFGTLPPP 
PKRGPRCFPQPGAPPPKIWTP PGALPQG 
GVGPALPG/ALQKFGPKNPGGFFSGPPQ 
MAP\GGFPGGP*RPPRGGAPFFFFLRQG 
FP VAQARVHIjPGS S DPP ISAPQVAGTTD 
VC 


1644 


15545 


A 


1652 


385 


2 


KGNN PS P PE I KFF F F FF* KGLLPL PQGG 
GQWGYFRS LQPP PSRLKLFS CPNLPSNW 
EYRGP* \RIjL*LITGkGTSGSKTKVPTP 
CGPFNLNQLGLGTQARNFS/RLVFFFFE 
TESP FVAQAG IQLRDLHS j 


1645 


15546 


A 


1653 


242 


3 


KNKNFGINRGFFFTFKVPGFFFFLGKVK 
LFFFFRHLKFFLKAKPP* \VFFPIGPSF 
FFFFFFLDRVTiLCCPG WNVWQS L 


1646 


15547 


A 


1654 


3 


285 


HF I I YTKDLNRRFS KEDI * I I I KHV/ KK 
CS PbliA VREMQ 1 ICTXVRPGTVAGTCNPN 
TLGGQGRRIVQDQLKQQSKTSSLQKKIL 
FRLARHGGTCP 


1647 


15548 


A 


1655 


1 


373 


KVSLFFFFEQGLLCCPGWSAWSSLQPQ 
CPRVKQFSHVS/LPSN*EYSCTPNTFSL 

nimTO TUWT+VTVTVTDtnJD //-rr> /nDT 

UVL.vblaK.1 1 w Xa i X i J.bKJ?r /CK/ UKi 
LARLHRLVSNAWSQAILPPWPPKVLGLQ 


1648 


15549 


A 


1656 


189 


2 


VQPGQQERNSISKTKNKTKQKLPTKKSP 
GPD*FKEELIPXIiHKL/F/HKIEDKX5TL 
HNSFYWTI 


I Aft 

1649 


15550 


TV 


1657 


385 


13 


bCaPJb'P WCyARb PPKJj *ivPPPPKiUtPPGPP 

PPPGGAPPLGGFFLFFPPPGGPPPGGGK 
/ S FSPRFFFFFFFLGGGGQIFLFS PGGG 
PRGVFSPPAPPLVPKPAFFSKKKKRGSS 


1650 

- 


15551 


A 


1658 


352 

■ 


2 


HLSLLSNWDSIRAPP/RPCVIFKNVPUI . 

I IDVSHWAKLRRSV/CYVFETGSGSLSQ 
AGVQRYNHGSVQPQPSRVS * SSHHSLWK 
YRYPPPRR 


1651 


15552 

♦ 


A 


1659 


265 


3 


HS GQRDEGRMRCGE WLESHGWRARS CM 
TLKTSFLFTMAKI*NQLKCLFMDEWIKK 
MWHIHTME Y YSAI KR /DE I PS FVATWME 
LEVTM 


1652 


15553 


A 


1660 


1 


163 


NQQNENRKT I EKI IGTKT * LFKKI KKFD 
KPLARWT/RGKKNIQITNIRN*RRDVI 


1653 


15554 


A 


1661 


56 


320 


KFFMYSAGESTKIRCLF/ SC3jFLFLR/Q 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQ 10 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to urst 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A«= Ala nine 
OCysteine, D=Aspartic Acid, 
£j=\jiutamic Acid, r— rnenyialanioe, 
G=Glycine, H=Histidlne, Msoleucine, 
K=Lysine, l>Leucine, M=Methionine, 
in— Asparagine, Jr^Jrroiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Try pto ph a n , Y= s Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ES RS VS QDGG * S E PRS C YCT PAWATE * D 
S VSKKQNKNKPKKNLNKCFASSLL I TL P 
PARHKA 


1654 


15555 


A 


1662 

• 


3 

■ 


421 


GIITDTFPNLEKGINIQVKKV/RRPPSR 
FNPKKTTS RDL I I KLPKIKDKGS * KQKE 
KTSKSARLPQPHGLLGLGLGKSASSPIK 
KERKQ\QITYSGAPIFLVTDPSVETLQV 
RREWHDVFKVLKEK/DFYPRIVYLVKIS 
F 


1655 


15556 


A 


1663 

i 


362 


2 


VI FADEAQ I LKKEDKLDF IKMKTS VHQK 
TLSTEYKDNHREKI FVSQISDKELI SRV 
YKGLLKLNNTDKNLILFYF*RLGHSVTQ 
GRVQWCNHSSLHPQTPGLK\NPPTSAS* 
AAGNTGVHL 


1656 


15557 


A 


1664 


79 


355 


IHLPSLIGDFNLFTGISLLICLVLFVFI 
FETGSCSVAQ/S /GVQWHNHGLLQPRPS 
GLRQSSHLSPLSSWNHRHGPG*FIYF/C 
VEIRAHYHP 


1657 


15558 


A 


1665 


47 


384 


KEKASGPLINFFFFPKLCKLAPFFFLPF 
FLWGGGGGKFSRNPQKHFPR * KRVFVNF 
FFFFFFLRHCLTVSL£HCHPGWSAVA*S 
QLTAGSNFW\VKQSSHLS/LPSSWDHRH 
APP 


1658 

■ 


15559 


A 


1666 


163 


601 


I FCKGGVLPCCPGLADLHFSTSNS ISFY 
YS SGLLRMTNKTET PMSTI PKG VGVAWR 
FGNSECIFQELPLTLHHLLSTMLASFIH 
SHEAS ANAL VGRSLTVGWGCRGVGVSVD 
PAAWLWRDLKGCF* DKSHS VTQSGGQWC 
NLSSLQP*APRLKRSCILSLPRSWDHWH 
VP P FLAN F* I FCKGGVLP\ FAQGW 


1659 

• 


15560 

• 


A 


1667 


418 


3 


SVCLGLPKCWDTGRKPLCPAPSFFY/EG 
SITLIPKSEMHLPRNENYRSGFLLN/M/ 
DAXILNRILANCISN*I*NH*\*KVKFT 
PGKKDWFNNRKPTD I I YPH*QNREEKSL 
VSS TDMAKVFNKIQ PVLRELTI B I KGNF 
LNL 


1660 


15561 


A 


1668 


411 


1 


LRLHVGRTTT \ LFKAVS QGHLSLQRFLL 
PSVEICPAPRGGVYRGRQASLSCGGLHR 
VRASQLLWFLTQASAMAGPPPPVLLPPC 
SLI * DCCANNKGGF IGVGPFEPCVGYNL 
LVSHLLRPSEKPSIRVGVT*FSRC 


1661 


15562 


A 


1669 


151 


1 


PLEKEAEITGP*PHAWLIF/CFF*TESR 
YIAQARMKWHNLGSLQPTPPGFK 


1662 


15563 


A 


1670 


1 


389 


TFF * KLKM INLSEEGMLKAKI SQKLGLC 
\TVSQWNSKEKFLKEIKSATPEST*MI 
RK*NSSVAOTENV*ERSRTSHNILFS*S 
S I QN KDL I r rxJ S LKA WRO QE VABE Kb EA 
SSGWFMRFKERSHLHTIK 


J bo J 


13564 


n 


lO/l 


1 


JW 


£»V- 1 IjJr J\ J.>iA. J. ± XjC/ JViviviv V LrVjiJir J. / irXV r JV 

I*YKAPVI*FWLKVPVIPSSSAILMKTV 
*Y\YFKDRNQDEWYRLRVWKINSHIYGQ 
LI\FSKGTKTIQW*KSLFNK*CWNNWLF 
TCKRMKLD 


1664 


15565 


A 


1672 


203 


2 


ALNKRTDMPS * * IRRCNI I/KCLFSPKM 
N* VFNVIT/ IQCPSGHFFTETDKS ILKF 
IWKSK*PRLAKRTL 


1665 


15566 


A 


1673 


15 


378 


NYHHNQNNEHMYHS LPNF FF FFFGKGAP 
PG \ PKGGGRG * REP * I PGGKGNP PL* P P 
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SEQ ID 
NO: of 

niif Ipfttifle 

sequence 

'» 


SEQ ID 
NO: of 

sequence 


M. 
eth 
od 

uu 

■ 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 • 
26 


Predicted 
beginning 

nuHpoHfte 

11 U HvvllUV 

location 
correspond 
ino to first 

lllg W 111 

amino acid 
residue of 
peptide - 
sequence 


Predict- 
ed end 
nu tle- 
otide 
location 
corresoon 
ding to 
last amino 
acid 

residue of 

npntid a 

sequence 


Amino acid sequence (A a A la nine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid. F^Phenvlal&nine. 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M^- Methionine, 
N=Asdq rapine. P=Proline. 
Q=GJutamine, R=Arginine, S^erine, 
T=Threonine, V=Va1ine, 
\V=Tryptophan f Y=Tyroslne, 
X=Unknown, *=Stop codon, /=possible 

nucleotide insertion 










• 




KEGG P PGGTP P PGKPGKI WD FWEKGGFP 
LSPGGAGTPGPKGDGPPWPPKGGGKPGG 
TKOPGPKKFF 


1666 


15567 

* 

i 


A 


1674 

i 


2 


349 


LFLTVLARLVSNS * PQ/CDPPALASQSA 
EITGMSHRAQPRGIR* I * CG SYTTCHPL 
IiLT\YLFIYFSTESHSVIQAGVQWHDL 
GSLQSQLRRLT/ CILTLPSSWD*PHVTIi 
PG*FCLF 


1667 


15568 

i 


A 


1675 


348 


78 


LGPWVPPVIPAPLGG*GGRSPRPGFLTN 
PGPKGKPRVFLKIQNLPGV/RGRPPLFP 
/GSPGSGGPKSP* PLGPSFPLT*NFSPP 
FWGPPGGPK 


1668 


15569 


A 


1676 


2 


385 


1 oKKUi Kf \oJrtfcifjyj_ia.r / orUUJUAoJj 
CCPCWSQTPGIKKSSCLVFPRCWDYRCE 
PGL* I *MGKNPT\LFSNGL* CDCIPLIH 

CSVTQAGVQ* CDLGSLQP 


1669 


15570 

* 


A 


1677 


386 


1 


KSTRPVLYKWNNKA*MTEHLFTA*FTEY 
FKAT I ETFCS EK* I PLKI LLL I GNVPSH 

QGVI STFKS YYLRNTFDKAI AT/DSDS S 
DGSGKNLLKTFWKGFTI 


3670 


15571 


A 


1678 


2 


193 


EGGRIFFNSFFEVILTLI/PKPKKWER 
K* SYHP I S IMNGDVKILAQTLLNQIQQY 


1671 


15572 


A 


1679 


561 


830 


TLLLGT \nawd fklkpmi*t y\hs \ens 
rgl * ks winlgltvfykwgtnnawgddr 
hhlvyrhgftgif*gsqlktycsenipf 

IvlULir XL/Li 


1672 


15573 


A 


1680 


415 


2 


TSCAWLSLYPVYLYRSSSLPRFIFCHFK 
HLWYKN I TKKAAE I NTNFCCYINRVLLC 
HX^WS/ATWSS*LTVTSKLLGSRDPHT 
IiSTjPSS*E(3RCITPRLGDL1iNLFSRDGG 
GIRGTSTLPRQVLNAWPQAILLVRIT 


1673 


15574 


A 


1681 


1 


78 


RP R I RHE VGQAGLKLLTSGQT PAS VP * C 
WDYRPEPPCPALHISYK*NHAMCX3LKCL 
AVSA* RHVLGF IRG\ WHVECCFPFCS * A 
GLKLLTSGQTPASVP 


1674 


15575 


A 


1682 


414 


162 


GGPGGPIPGA/AGIiRPPPPPLGNPPPP* | 
KAKICPGGGAPPVFPGS*KGGGESP*PP 
RGRGPFIGVQPLPFGLGHKRGLFPKKKK 
KT 


1675 


15576 


A 


1683 


378 


129 


QPFGPRNFYQIFNFPG/PPV*HSPLFGP 
KFPPFSPGGGQWGPLGNPRPPGAKGGST 
LRGPRTGGSRGGPPGPGKFFFFLKQSLA 
L 


1 4L1C 


15577 






3 


in a 

J/4 


fzy c:\n •PnTf^RPCiWK'P FT .FRF VKT .YNNTVT? 
EREKYDNMAELF/AVVKTMQALEKAYIK 
DCVS P S E YTAACSRLL VQYKAVFRRVQG 

seissidefcrkfrllx:piamerikedr 
p it i kdd * gnsls 


1677 


15578 


A 


1685 


2 


373 


PFI RPET I KLLEE I PGGKLLDLGLGKDL 

LARTPNAKINTWDHI klks fcti ketin 

TMKKQPTDCEKI FSRIilSDKGLM/SQMC 
KELVQLN* KK/TDDSVKGWAEDLDRHFS 
KKDIKMANRPGKVTj 


1678 


15579 


A 


1686 


378 


201 


HATCLANF\CSYG*DRVSPFCPGWS*TP 
ELKRSTRLGLPKCWDYRCLGRSLLFPGA 
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SEQ ID 
NO: of 
nucieoiiae 

CA/1 IIAfll #» A 

acquciHB 

• 


SEQID 
NO: of 
peptide 

con nan t+Ok 

acq uence 

- 


M 
eth 

OO 


SEQID 
NO: in 

no/si s i 

26 

■ 


Predicted 
beginning 
nucieouoc 

IU L« LIUI1 

correspond 
ing to first 
amino acid 
residue of 

rvonf iH A 

pepiiuc 
sequence 


Predict- 
ed end 

tin aIa 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

IT - rZ Into m it* A P^PhAnwIotinirto 

sii — o mm hi in aciu, f — x nenyia iun me, 
f^=ifilvrinf» H^Htstidinp YsTcaIaihmii^ 

K=0Lysine, L= Leu cine, M=Metbionine, 
N*=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 

W=:Tr\;ntAnhan Y^TvrncinP 
w— i rypiupiimi, p~- xjruiiiic, 

X=Unknown, *=Stop codon,/=possibJe 
nucleotide deletion, \=possible 
nucleotide insertion 














nnny 
Viz Lin 


1679 


1 5580 


. A 


1687 


1 


404 


MCVNPGGGACS EPRS CHCTP AWVTERDS 
V* KTNKKKVKRRKKI FHANS / KLKRVGM 

QEDISITNTYTLNT*APKYMKDTLTBLK 
EEIDIYAIIVGDFNTIxLLI ICRQT 


1680 


15S81 


A 


1688 


307 


33 


DEGSCHDAHAGLKLLDSSGLPASASQSA 
GILGVSHGARPLISSYNGTSHGGLGPAL 

t rT" d r?i\TT c tut cwt»t t er/iT /inr*nc , *r , iw^T 
v ISrNJjbHlar JtUJJ-iijbL^Jl/HV Ic^IjMvjxj 

GFAGAKLSL 


1681 


15582 


A 


1689 

i 


11 


394 


IFILEARTRISSRTFFKS * INIHAPYNS 
AI*LIGIFPREKK/ STCPYIYTQMFIAS 
IjFyTAQTRKQPKCPS T\3E \ Wb KNLWN 


1682 


1 5583 


A 


1690 


20 

> 


391 


SEGKGWYSCTKWQSMKLGGITFFFFFPP 
PKNPPPKKSGP/QKGPFF*GKGPPWPPK 
KRGHKN \RGFPKQRARF PKPVFFJjI FGK 
RG FLLGP KGG* NPG E KRNP PP WP * KGGK 
NPGNP KGGPHLTL 


1683. 


15584 


A 


1691 


72 


392 


IKMIGSIjFFGFAFFFFFGKKTPFFPQPK 
R\GGKP* IT*TPPPGN*RNSGPSPPQKV 
GZ KAP PP LrPK \NF * if,FGKNGVTPFP PGG 
FEPPTPKEPSPPVSPKGGKTNPAP 


1684 


I5585 


A 


1692 


389 


161 


HGGACLRSQLLGRLRREDCLNQGG*GCS 
EPCTPAWVTE*D/SSQKNQKSKIKKSGIj 
DNS FS IG * G ILGIiSTCDR YS 


1685 


15586 


A 


1693 


286 


363 


DGI SL* PRPPGLRQPSQPS \IiLSNWGCR 

STPLCLAGFFVFVETGFLHVAQACLRQG 

FTMVAQS /ASQKIHIT*GAFETIQVLFY 

WGGVGFGQWRFFFFFETESHSVTQAGVR 
pact eeTyMronDP 


1686 


1 5587 


A 


1694 


1 


356 


ELLE PRGRGCSERRPCHCP PVRVAEQDS ! 
VSKKKRERKYLYFLRFNWRSLRIFIYFC ! 
* /HPLQHNIQNI S FTLQNS FGFFSRQYC 
PSPLE 1 1 FLTPLTENLLGLFMKG I IQNW 


1687 


1 5588 


A 


1695 


3 


298 


KY F E TNENKN IQ YQNL\AVKL VFRENLT 
VVNACVKKEERFQVNNLALYPKN * EKSM 
LNPKGKI IKVRSEKNDIE*KNDEENQ*N ; 
*SWYFEKITYWQTLD 


1688 


1 5589 


A 


1696 


3 


405 


QLI FLTKVQKQVGEGQPFNK* CGGTWAP 
TGKT/MEQPPKASSSSSSSSSSSSSSSC 
KM*NIVFFKMGENLWDH*AKSYEVRTKA 
♦TIKGKVDKLDFIKXKHFCYGKN 


1689 


15590 


A 


1697 


6 

. 


392 

. 


PRALKKTYAKSTLPVFYKWIKIAWMTV* T 
VAAWLTE YFKP I FENYCS /EKKI PFKLF 

Jjjj x.£iEiJ±jr\*t%lr X urXuTli&rIX A X X XM V V70Xxtr>Ui 

TTSVLQPMGQRV1 STFKS 


1690 


15591 


A 


1698 


390 


3 

■ 


AI I E SDFLTTS RE VAKKLRVHP FMVLWV 
LEPIGKVKKVITKWGP* KLNK/NKKNHCI 
EVSSSLFLCNNEPFLDGIVTCDEKWILY 
HNW* *SAQWLNREVAPK\HFLQPNLHQK 
KVTV I VWWS AAG L I QNKH 


1691 


15592 


A 


1699 


1 


245 


GGGGEYS KI IAI KTALKNTT YLG IYLIK 
\ DVPDL YTKNYGTMLREIK YLEK *RARP 
CS *TERFKI VKMS I /LPNLI YRFNTI 


1 692 


15593 


A 


1700 


2 


324 


GTSGTS GTSGTS QT CR ISRVRSTS SWTS 
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sequence 


SEQID 
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SEQ ID 
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nucieoiiue 
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correspond , 
ing io iirsi 
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Predict- 
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nucle- 
otide 
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correspon 
ding to 

locf oinlnA 
lo$I niTIinU 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cystelne, D=Aspartic Acid, 

XT' 1 lit*) rYl i r* A /•iH 1T~pliAnuli> InntnA 

iv vj iu iu ni ic aciu, r — nenymianiDe, 
OGIycine, H=Histidine, I^Isoteucine, 
K=Lysine, L=Leucine, M-Methionine, 

M= A enorooina lif% A 

i~ /\5paragiue, r^r ruiinCj 
Q=Glutamine, R=Arginine, S=Serlne, 

W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














fZ'O HT ./^OWVG W AT? G C W T?\7T .Q & Q G neTT C D 

ISLSGS/GSSIiSNSSKNLTLPCLFSSIiL 
S*P*ANPVDSAFKIYPGLSSVGSCL 


1693 


15594 


A 


1701 

» 


183 


372 


PGVLLCWPGWSAVTVHQCDHSALQPRTP 
RLK*SP\STWDYRYTPLCPA 


1694 


15595 


A 


1702 


29 


382 


GKRFPCSWEPKFPLTQGGFPPIjHPGQKK 
KPGSKKKKDKS KCW * G/ CREA/GTLTHY 
RQELEMVQSFWK/TVWQVLRKLNVELPY 
DPAFLLGLHPTSTQKR\DTMFMA/AITL 
/ ISJsRWKQPRCTS 


1695 


15596 

i 

i 


A 


1703 


1 


382 


KKVKIIGEAAVEFPDTIKKIIEEKEYLP 
L*VY\NADESGLFWKKKLQRTFISKEEK 
\SMDRLTIiI ILCKCS WVYEQDGP* AuKE 
KGEHQIiPVF*L*NKKAWTVRTLFLD*FH 
QCFVPEVRKYVASKRIi 


1696 


15597 


A 


1704 


2 


330 


KLNNLLLNNS*VNTEIKAEVSSSLLEIN 
EYEDTTYQNLWDAAKAVLKGKHVAPRHF 
LQEVKK/ RLKRFQINNLiTLYIiKEL/EKE 
HINLKASGRK+MTKIGDLFGLYFVLNG 


1697 


15598 

■ 


A 


1705 


100 


342 


APKWSIVCPELVGSWSH*/PSRMKPWTL 
TRQGFTMLARLVLNS *RRDLPALASQSP 
GITGMSHRTQPLLINLMEIFTEILS 


1698 


15599 


A 


1706 


600. 


211 


S CS VARLE FSD VI KAHCHL/RTPGLKQS 
SHLSHLSS\WD*GRVPHDLANF+ IFCRD 
RVLjPRLIjQAGIjEL \LAS3DPPS * ASEKC 
WNYRHEPTVPRQNLGIjLKTTYGWVFLKK 
YILTVSVFSLMCLDHCLLM 


1699 


15600 


A 


1707 


409 


1 


RGPFFFS PGKAFIiKIiGSNSFFPI KKTRA 
PKKPIFSPVSP/LNFPPKTCFPPVFPPQ 
MGGFFF/ CFPSFLFFFLPPPPPFFFSFP 
PPFFFFFFFFFFFPLFFF* DRVSLCHPG 
WSAVAIiSQLTAALTSMDSSNS PTCV 


1700 


15601 


A 


• MAO 

1708 


154 


2 


IGKPItAGLiTK/ RKRENT * INK! RNEKJ3D 
MTADNTE I QS I IRDFVS * RTAHQ 


1701 


15602 


A 


1709 


263 


37 


G/WPGAGAHACNPSTLGGQGGWITRSGD 
QDHPG*HISV 


1702 


15603 


A 


1710 


390 


42 


YAGGFRAI FFFPLPREGAKNPNFPVGPP 

GGAKGFPNARGPFFF/G* KKKKGPKNKT 
PGFFL/MGPPNPGGPPPRGEGGKIGAKK 
KKKSIRL 


1703 


15604 


A 


1711 


3 


167 


YTCVFVCLCLCDCMC/CVCACMYICVCV 


1704 


15605 


A 


1712 


116 


391 


KRNFFFGPQGGGEGPKFN*RGPPPPGVK 
GIFPPSPPEG/GKKKGAPPPPGIIFWFF 
KKKGVPPCGPGGV*TPDPGGPPPPGPPK . 
GGAPRQGPL 


1 /ID 


OOUO 


TV 
n 


i nil 


AC\\ 

4U1 


4/ 


QVHT/HTDTHTHTRKVSSVCVLIMAEQE 
RPCPHC* GGEAGAECGVCAWLGLSS *TW 
RNRKGPAHTVRWGQEQSAGCAHGIiVFP 
RSYLrWT 


1706 


15607 


A 


1714 


1 


400 


CVESCEVDI EMVSCCV/ CSG* SAVCSGT 
AS AHCSL P I PGSRDS PAS ACQVAGTTGM 
PH/LYPGVPLKPREGLQFTELPSGQLEI 
Q PTCENKR * HVPCAL * VQIjTD I RPN* RY 
QFRVAAVNVHGTRRFTAPS KHFCS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid— 
residoe of 

mt m. 

peptide 
sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
fc,-GJutamic Acta, r-FnenyJalanine, 
G=Glycine, H=Histidine, I=Isoteucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=*Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y^yroslne, 
X=Unknown, ^Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


1707 


15608 


A 


« M « m* 

1715 


413 


3 


LNTFEPQRSPFI VWSSEWILLVECLFVT 
SGWIYHERFLNPGREIDWATCYSETGPC 
PVTQDGVQWSNHGSL*PQTPRLK\HPTT 
SVFPVA/RQGL/NSVAQAGVQWHDLGSL 
* CRLPSLKGSSHPRLPSSWNYRYAPPPC 
I 


1708 


15609 


A 


1716 


421 


144 


RLE CSGG ITAYCS LNLPGP S S S PASAS * 
VAETTGLLHWKKKIVETGSHFI VQTAFK . 

FJjDSSDLPAZjAC/ swdyrsellcpacfy 

NFCLFINIPC 


1709 

* 


15610 

• 


A 


1717 

i 


3 


384 


YSACVCVCVWVDVSVHRCMSISGH/ARI 
IRVGPAHKAQSSCRPVPAGCCTSAQTPP 
WCSAATADPPPR* GESLPGAYPGSHSTC 
CPGCCLDW* HSLPSTEPAACRAGAPGGG 
H*AWLGCGGRAGGRPG 


1710 

■ 


15611 


A 


1719 


3 


615 


PVGSWARSSGAGWPPGSPQSVSDGEAGH 
S IPAPRGQCSRHRAEGRTARVCLSHCSF 
SGPRPGLVPIR*SLGRPDAVQAIVPDSQ 
EGR KTG I HAEAVMF PLG PG KG VKGCAGG 
RLCPSSPPSRCLDLGRRGMPPSSGPAGP 
RPSGVGSDLGRPGAGAATSSSSSSSSSS 

*m\ *m +m? mm mm. mm 0mm mm\ mmm 0mm. vk. mmmm mmm 0mm taB V fatjta «k 0mm 0mm mm mm 

SSSSSSSSSSRDGEGP\TGSVEAPGSLL 
GPWLPSQPS 


1711 


15612 


r 


1720 


320 


I— 


GIiKWLNL KDNS L YL I LAKWDDCLDEK* 
YADKMLQ YTKS VWVS RSRGCQGQVGEKK. 

-mmm w mmmym* m- m— 0m. mmm. f, mm hi, mm* m*m \ m-m. t-m a mm » pfc yJW» *m v <_ 

QATEAAQ E WELR K \ RL YWR KE CDALRAA 
REEQKELRDVRKAKKVYVCVRV 


1712 


15613 


A 


1721 


44 


373 


KAMGQTLWKTVWQFLTKEEIGPM*QSCP 
T/DLALVFLGICTIDLKAY1HTETCTQM 

mm) mmp M a— «^ W «M *m* m- ■m*m>m>m m. -m- m-mj 0m 0m mm m-mm mmm g m. mm. M ■ ■ ■ M mmm 

1 1 ITLLI I AKNRKKAS CSSVGE / WNKKL 
YYIRTMESYSSLR*NELSSYKKHWGGGS 


1713 


15614 


A 


1722 


135 


396 


AQGLFCTSVKLASEQPLRILFQLDKRNK 
\ FE I YGTSG*L* S III CQNNLQS KFQMY 
HHK I M S S LGAVAHACN PST LGGQGGQ I T 
RSGD 


1714 


15615 


A 


1723 


4 


383 


LNLRAKAIKLLEENIGIHLPDLGLDDLF 
LDITPKSQATK/AKIGN*GFIKIjKHFCA 
AKNI I KKMKRQ YKEWKKI FGNHVS DKKL 
VFRIYK*HJjPLI IKNSSS 


1715 


15616 


A 


1724 


2 


405 


NS RTSL I LNQNLE / 1 KLSEKGTWKAKTG 
* KLGLLQQK\ I S KI ANAKEKLIiKEVKSA 
TSMNM*MMRK*NNFIPEMQKVLWW/I* 
NI/PLCQSLIQNKALTLFNSIKAERGEE 
A \ KLEATKRWFMRFKE S CIiHNVKVQDEG 


1716 


15617 


A 


1725 


90 


400 


SQLLRRLKQNSLNPRES SSKTTTRTKVS 
HWHKNRHVNQ/YNKIENSGINLHIYG*Ii 
TLNKGDEASQYSS /DILFNKWCWQKKKK 
\YLDPYLTPCTKISSTWTISGFL 


1717 


15618 


A 


1726 


390 


1 


'PCPDVT^UVIfTT * T?TtJ^'T5D'Dt5VT IWrm?!! 

Irr f Ki \.Cti\.i\J.Jj x r ± WvjFKKFIxIAjn vi? F 

* QNKPKIEGI PLPGFKI YYRALVTKTAW 
F*HKNPPIGQRNKVENSETNFHPPSELN 
FFFF\YKGAKNIHWGEDSLFNKWCWENW 
I S I * RRMKLGPNIiTPYTK 


1718 


15619 


A 


1727 


3 


365 


HASAKSNLRWIIKLNIiRAKTIKLLEENM 
GENLWDLELSRGFLDRT/PKIiYS iklkl 
RK/ WNF I KI KNFCFSRDT* KS * MAGKN I 
LSGKDLYS E YLKK* CNLIIRQSKTS T*M 
FIAALFI IAKH 
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SEQID 
NO: of 
nucleotide 
sequence 

* 


SEQ ID 
NO: of 
peptide 
sequence 


etb 
od 


SEQID 
NO: in 

I ICCM 

09/515,1 
26 

• 


Predicted 

beginning 

nucieonoe 

location 

correspond 

ing to first 

amino acid 

resiuue or 

peptide 

sequence 


Predict- 
ed end 
nucie- 
otide 
location 
correspon 
ding to 
last amino 
acfd 

residue of 

peptide 

sequence 


Amino acid sequence (A<*Alanine 
C=Cysteine, D=Aspartic Acid, 
H(— oiuramic aciu, r— rnenyiaianine, 
G=Glycine, HNHistidine, )=Iso leucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine T R»Arginine, S«Serine, 
i — inreonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1719 


15620 


A 


1728 


368 


2 


RNCLNSEGGGCSEPRSRHCTPAWATEPD 
S VSKMK*NKKINK* IMFLY \EQQSBTEY 
FKLPLTIA/SRNMKFLGINLSKGVQDLB 
TENYKILbQEIEEDLNK*RNKTY*WIKL 
N 1 V KMK£> I Ijb KK 


1720 


15621 


A 


1729 


326 


30 


NPFGGPKKGGSQGREIKPPLPPMGKPLF 
F*KTKNKWVGGAPPVIPPS/SGVLSQKK 
TFTLEGGGPNKLNS P PALP /ARGPKKNF 
FQKKKKKrKQNPS QREE 


1721 


15622 


A 


1730 


1 


374 


I FNADKI A\ FWKKF * KPQGTSVGREEKQ 
APGFKAGRNRLTTI/L/GASAVGFMIRA 
ALICKAANPQVLKGRDKHQLPVFRLLYN 
KKAWTTRTLFLDCPHQCPVPEVRKYLAS 
KGLVFKVLLILDNGPC 


1722 


15623 


A 


1731 

■ 


389 


1 


FPPKIFFFSTLFFFFPRFFPPPPFL*P/ 
SPPIYFF/ CAPKKKNFFPPPPGKNFFFF 
KTP P P FFFFFFF * D * VCLCCPGWSAVAQ 
SWLTTTSVFRVPVI 


1723 


15624 

* 


A 


1732 

• 


118 

- 


422 

- 


D I TTHLFKWL KFKK* EKGLN I LFTKEDM 
QMEKNLKRCLT* FWKELQIKMRYHYPP 
IQMAKI * KN/STIS I AWQGYRTXGTLP/ 
HC* * EQPFW* FLS KLNMILPYNPA\ IML 
LS I YPNALKKHVHTKTCM * MFIAALF 1 1 
TKNWKEPRCPSICEW 


1724 


15625 


A 


1733 


407 


1 


NIKGPFRGPLIQWGLLIWPKDSFPILGY 
PPFPSPKISFFFLARSCSGAPNHFPLPN 
QSPCFPQPSFFLGEEKKEFLPGYSLAFP 
FN/RFGGS * RVRKGNGGPI PMGESFLFF 
F * DRVLLCCPGWS AVA* SRLTATC 


1725 


15626 


A 


1734 


322 


362 


TAIYIYI I * liFTDN I P SHPRTW I E I YKE 
INI FVPANTIPI VQHRNQGVI CTFRKTI 
TVTDCDPANGSGQSKLKTWKGFTILDAV 
KN/lRDSWEKVKIG/TIN/GVRKLiIPSL 
KNDFKRPKT 


1726 


15627 


A 


1735 


49 


395 


RGGPGFFFFFFFFSKKSQILPPGWKGRG 
EPRVNGTPPPRGKGNPPAQPPQEGGKTG 
PPHKPG* FLCFLKKKGVQKGG* GGPQ / A 
PGPKGAPRPGPPKGGEKREGPPGPTRPM 

T WJVIT 

LiiiArl 


1727 


15628 


A 


1736 


417 


2 


FLFFFFFIiFFFFSFASGPEIIiFTCL*HT 

HIHFLFFYSKST* *PPVFAGGMFQDPQW 

t DCPVfjnroxTnnrwcTTT t /vto* * trr 
ijlrcri isl*! V r i\JVrJ<. i lux/ X ir* w KlivJS 

CDLSNFFCFF*DRLIiLCRPGWSAVA*SR 

LTATSTFQAQANRTRG 


1728 


15629 


A 


1737 


316 


338 


t7T?ct euvccroevr tru'D+r*T?Ti7irT o irtrv 
r r r JLtot ZirlSilcioXJjrulir'ur JLVj\JjoJ\VK 

DI EN\ LKKTARGNYQVTYKGAS I RLAAD 

FSAE I SQAWREWDNMFKVLKEKTNWQPR 

IYKTLFVPHF 


1 TOO 




IV 


1 / JO 


1Q*7 


j /y 


OKRAO IDKAI F I CRD I AL / P * FO / MYYW 
AT VTKTAW * WYKNRHI EQWNRS W \ PPE I 
KSQSYSHL 


1730 


15631 


A 


1739 


4 


401 


RG YRHAPPSLANFCI FSKD/MGFTWLN 
S* PQ/ CNLPASTS * SAGITGISHCTRPQ 
MATFLIGPHKIIPWSVLWPNL 


1731 


15632 


A 


1740 


94 


117 

* 


KDRPMVPPVGAGEDQADEPCRGHASLWS 
QLVSAPTTPIPLPGRDVPSRATPFPAZUj 
AAQQP P * AS P YPLP PGLGAGHASAS PVT 
VPFSPISESTGS*ESAL/PAPRPGGSG 
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SEQ ID 
NO: of 

rv ii r IqA^i /■ Ok 

nucicoiiuc 
sequence 


SEQ ID 
NO: of 

pcpuQc 

sequence 


M 
etb 

OQ 


SEQ ID 
NO: in 

1TCCM 

09/515,1 
26 


Predicted 

beginning 

nucieuLiuc 

location 

correspond 

ingto first 

amino acid 

ruiuuc ui 

peptide 
sequence 


Predict- 
ed end 

mi j*l A— 

otide 
location 
correspon 
ding to 

lflct fltni'nA 

■ lilol <SIIIIIIV 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
C=Cysteine, D=Aspartic Acid, 

F^fillltamif Acirl l?=PhpnvlalaninA 
vji u in in iv .rvvnj, X I IIC1I Jr lalall 1IIC, 

G=Glycine, B=Histidine, I=Isoleucine, 
K=Lysine, l^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G! uta mine, R=Arginine, S=Serine, 
T=Threonine. V=V»lmp 

ji * ii » v« ij iii w, » t n 1 1 UC) 

W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


1732 


15633 


A 


1741 


3 


390 


VDQFLI SHDLSKLTEDEVHNLNSSTTIG 

V EVF.FKVfTKTA * VTC K <?QfiPtV3TTf3TTT?Y* 

tvreevtpil/ sylf/heiekeetlvns 
fyear 1 1 l/l pkpd knkp * tnidaktts 
kvlpnriwqyvki i iqhn* vgfv 


1733 


15634 


A 


1742 


3 


442 


DLHSRVEPRVRPSVRKQ*WLLKVLICA 

oJVU 1 ljr*JXJ^J\t\\Jc' ±\jW Ci)\± E ViNrir'JirJJrJJJj 

IPEYKINMQKSQVFLFTNNSYSSTTDND 
INKWAKDLSRHFSEEDIQMANKHM\KR* 
S VS LV I RE IKI KTTGR * LFTPKCWQG*G 
EIITLVH 


1734 


15635 


A 


1743 


41 1 


2 


KCISLKGPPFFFFCRYRVLLCCPGWFST 
PGLKQSSHLGLPKWWDYRHEPYCTQSSF 
SLSFFLKQTGR*WFDLSNFFFY/ cycfi 
YFSRDRGLTLLPELVLNS WPQAI LL» 


1735 


15636 


A 


1744 


1 


393 

4 


RPGGPPKGRSREQGREGERSRRRRPRAP 
WARSHMWGARVFS VPRS FBQRPQEKCVQ 
SSYXQQLPARAPJX3TGNLIRGAPLFFF* 
DGVLLCRPGWSAGFKQFSRLSLPSR*DY 
RRTPPHPANF*ML/CLRRSLT 


1736 


4 m ^ mm 

15637 

• 


A 


1745 


395 


0 


np 7v rif L'i?c"i'UT +t /^Dnn/^DnnTiDPT VTiTk 

PSAPSFFSTRIj*ljGEPPGr PPPPFIjKPP 
PRN/SIFGAPKKKFFLPPPRGKKFVSLK 
GPPLFFF* DGVLLCCPGWSAGFKQFSKL 
SLPSS*DYRRTPPHPDNF*MF/CLRRSL 
T 


1737 


15638 


A 


1746 

* 


397 


1 


UCiiN r ijivKhjlSJNr liAKKr ijyl\. w AAKr KIN XK 

*VTPEITAFCPL*HVASFLVFLTPOTPT 
1 IPQIiYCLEPLGEMGGSGSKLPPFPTSKT 
PNPLISVNLCFP\AIKWENFFF*DGVLL 


1738 


15639 


A 


1747 


392 


2 


FTKKGGRGGNLS P / LPPQKI F * KKKTLK 
KPFFWQRVWFFSPLFFEQKGQGPFFFKK 
PFF*KRPPDTPQPPPPSFIFFFLLFFFR 
/HLVAQAG>5QWRHLGSPQPPPPQLTQSS 
UUciirn 1 it irntr U1X J? uirutUJK 


1739 

i 


15640 


A 


1749 


33 


403 


IKGFKKKRGKGGAPQKKGGTGAKPPPPP 
PQGF* K * QKRKNGS P PNVFFPNPGGPPP 
PPPFWGDKRGGPPRGGGAPPPRGKRENP 
FFPTLA/HGKKEKKK 


1740 


1 C£ A 1 

15641 


A 


1750 


3 


39o 


TTPnTTMWT?ir\7Pa*TrNr\ A^f^lT.TQT.TPT^K'^ , 
fUtyj 1 IviYiEj 1\V r jH JWi \nL/^7JjXOJJX£\I\X\.\« 

LKVKK/ W * R/DMNTQ FTDKG I LMTNKHM 
KRS STS Ii I MEMQHKAG VI FHPSDWQKC* 
STDNTQS*QGHEEMYTIKHSW*TFDYQQ 
PF/SESNL 


1741 


15642 


A 


1752 


30 


Jin 

419 


INJ CAj J. Ijr/iO VCf /^>VLM I\.UXt O l ¥ iKJVIX V a V Xin 

LNRRLDGELLSGLTSTQALPGWAYLHIiL 
SHHAVRPL FLCFKRGWVLLCHPGWSAVA 
GSQ\FELLGOVI LRPHLPS S* DYRS IPP 
CLANFKNFFRERARYACR 


1742 


15643 


A 


1753 


16 


410 


VGP KKS LQQ VWAAVQATLPLES YBLAHP 
IILKVSLADRDAI*NLWQIPIVAS*YIP 
LGF/YSKAMPSSVDIYSSFEKKKTHFFL 
TGGGGPPLYSNYLGRLGGANHLTPGVKN 
QPGQFGKPPPLQKVQTLAGRG 


1743 


15644 


A 


1754 


2 


17 


NSSLIKHRRIHTGERPYQCSECGRVFNQ 
NSHLIQHQKVHTR*RMYI *SR/CGKDFT 
QKSTLI+H 
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SEQID 
NO: of 

it u Lieu im c 

sequence 


SEQID 
NO: of 

sequence 


M, 
eth 

uu 

• 


SEQID 
NO: in 

I ISSN - 

09/515,1 

26 


Predicted 
beginning 

location 
correspond 

tn<r in first 
■tig iu in ai 

amino acid 
residoe of 

sequence 


Predict- 
ed end 

mi j*l a_ 

ULIUC* 

otide 
location 

cor nnn 

ding to 
last amino 

»wu 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Add, 

l^rri^lnf a mil* Arid 1<=Ph<>nvlalanSnA 

G=Glycine, H=Histidine, i=Isoieuctne, 
K=l»ysine, I^Leucine, M=Methionine, 

ll rlJl'nl agillV) % & l villlC) 

Q=Giutamine, R=Arginine,~S=Serine, 
T=Threonine, V=Va!ine, 

AV=T"rvntft nhan YsTvrnfiine. 

YT — * • J piU p 11 4 11} 1 1 Jl VftlllVj 

X=Un known, *=Stop codon, /=possible j 
nucleotide deletion, ^possible 
nucleotide insertion 


1 HAA 
1 /*W 

4 




t\ 


1 / JJ 


1 JO 




X PX XX 9 P X P PP FPX PT i 1 ? QTiFTjTXCQ AY * 

TPAHRPPPPGPFVPPPKPCCPPPPPLRP ! 
WPPLPLDTNPIPPPTLSFHPGPALS 


1 HA^ 




a 


1 / JO 




Q 


WT.CI P TRHTTFOOPttRPF /III TWSOPVfi 
PLFCLFVF * DTVS LCG PGWNAGVQSGLT 
AASTSQAPSL . 


1746 


■ 15647 


A 


1757 

■ 


2 


403 


RVLFSPTLAYTYLLFYLGATILMGVNIW 
KQPNCPTK/ GQQKIKtiQYIYRMEYYSAL 
KKK* ILLFAIR* VNLGDIMLSEVSQ\SR 

JNJUM. J. V Jjo liTQW 


1747 


. 15648 


A 


1758 


398 


65 


FFFFFFFFFP * TKGLG CIHRCDHGS PQP . 
RIH\GSSNAPSLAS * VGGTTGAFHHARF 
TLIQSSSVHVSTHTLHPYPSSLP 


1748 


15649 

■ 


A 

- 


1759 


456 


31 


FAKRITDKKXiLSLIYLAT*KGEITKALI 
KQ* \IjKAQAKNSIjKJ^TIVIiNHMKLi/CI* 
ILLIIKEMQIK/STLRYHFFFITLAK\I 
* KLGNTFCWC3GL *GTL IHCWWECK*HNS 
YVGG I WQ / FSNKL YVQ I * YNS AI SNLDG 

KVUKJrU 


1749 


15650 


A 


1760 


3 


378 


Q FQ YF Y YNGSVI KAVW YWSKNRK\ IDQ * 
NRTES PDKNLHXHMQL IFDKGTNTPQWR 
KDDLFKKWCWNN*TSTCQKKKKKKGGGP 
♦KEQNLTPPGWEDIIFLFGAPKNMPGAG 


1750 


15651 


A 


1761 


69 


384 


YTSASWGGARYTASAAGWKTLLLLFLFI 
*DRVLLCHPRWSALTQPRIIAASTFQ\V 
KQS FWLMH I GDWDYRRCMPPCRANFFNF 
LCKKKKTLRRQ E VNQ T P ALVRV 


1751 


15652 


A 


1762 


390 


1 


CCPDWYSTFGLKQSP/CFSLPKCWDYRR 
ESPPQAFFVFFGLPSIiPFSPPPAPSLSQ 
SSSFFF*MEFHSFAQAGVKWLN\LFGSL 


1752 


15653 


A 


1763 


2 


390 


PRVRGFFMRKF VDS YLVP YTKSKLKWT I 
DLHVRAKAIlSrLLN*NVRIHLYDRELCNG 
FLEM I KKTKAAATTTKLDFI KI KNFCAQ 

VS RKYVKKAYNS S I RTQSH 


1753 


15654 


A 


1764 

• 


334 


3 


WSKRSGPPLSKNQTKKKNATKPQT*KNW 
INEIGPI I/NTSPSKEKTGSNGFTD*FY 
*TLKEELIiS ILLKLFQKTEKSVI FPKS F 


1754 


15655 


A 


1765 


259 


1 


KSTLFLMKKYWKGSFLKKHFFLETRSLS 
D/SPG/GVQWRDHSSLQPRTPGLKQSPH 
LSLLS SWDHR* AP PC PANFRFRLKTRIG 
RDV 


1 ICC 

1755 


15656 


A 


1766 


A /VI 

402 


386 


BYEGHYTKQQIFHVDERAFYWKKMPP/R 
TFIARKKKSMPGFRASKDKLTVLSGANA 
AGDLKLKPVLTYHS /ENPRALK/HYAKS 
TLPELYKWNTKAWMKIQKFPS * FT * IFM 


1756 


15657 - 


A 


1767 


2 


406 


PRVRPRVRKIjITLLNVSQRWSSEKKKKK 
KKTKKKKKKKKRGGGRL* KKKKKKPRRG 
RVNLFFWGPKKSTPPRVFKHRGGEK/PP 
PPPPKKPRERKPSLGVGSTWHGISPXKH 
SKKTKSTK 


1757 


15658 


A 


1768 


14 


409 


IASGFLFFIYFGVIGRPPKRGGVFFPGG 
GGAPHPPQGGC*KKKRGGGGGPPPFFPP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M. 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
■ amino acid 
residue of 
peptide 
seauence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


• 

Amino acid sequence (A B Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown y *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SLGGLGRPRPMSKI KTPPSAGGKPP/ PF 
LRNQN*TGGGGPPQ*PPLFGGPGGKNFS 
PPRGGGGGGPKTPPPPPPGGE 


1758 


15659 


A 


1769 


305 


1 


TKTGPFPSNRAKKKKIVFQKKKKEKKIN 
CFVTAPFKGIKTEATD*EKI FAKHLSVK 
G/LVYL* YI* ISKIYKELLKLNNEKTTI 
PIKK*AKDLSRHFTRYTD 


1759 


15660 


A 


1770 

* 


148 


402. 


FSPVSLG*GRGNIYAGMSNV*EVPPEID 
FQHE VKRALQTS FQ VKLVKI I FFKST I * 
KSLAK / WLAWAHACNP STLGG * GGW I T 
RSGVKSEPGQHGE 


1760 


15661 


A 


1771 


250 


2 


KKKKEKKI ICFVTDTFKGI KTEATD* EK 
IFAKHLSVKG/LVYL*YI* ISKI YKELL 
KLNNE KTT I P I KK* AKDHS RH FTR YTD 


I fol 


1 jooz 




1 nry 
1 / YZ 


Ann 


i 
l 


KKI RRGGGOP PLFP LIGGGRE KKFFF PG 
KRGFY*TKAGPFPSSWAKKKKFVFKKKK 
KEKKI ICFVTATFKGIKTEASD* EKI FA 
KHLSVKG/LVYL* YI * ISKI YKELLKLN 
NEKTT I P I KK* AKDLSRHFTR YTD 

A 1 ^J^X A A ^b> 4 ^B» A «* X * bbbtIbb* •*» A 4* * V 


1762 


15663 


A 


1773 


1 


406 


KKKKKKP TTFPGLP FFFS LSL PPRVPGA 
RGPPSSFPSKPGLGC*IKKSGVSPAPKK 
KKPRKTNPKNFPRPNPKGGGEKFFNLPT 
HPPGGTPPFF*RRRENPPGFAPQKG/EV 
F S PGGRNSRGB PRGG PKKXKKKGG 


i /OJ 


J DOOH 


a 




X 


J fO 


AAGE WLHOGS LOSLP PGLKQS APLGVSK 
W*NPWHDPPPPAPRFVWWGOWLRWS 
FFLGAQAGIFFFF* IEMGSHYVAQAGLE 
/PPRLOGSS * LNLPSSWDYRPVP 


1764 


15665 


A 


1775 


1 


431 


QQMRDKRNIiFEHNK*GIRGIYLNIIKAR 
HEKPTVDTILSGESFSSKIKTMLISPFL 
FNTVLGVLA*ARKRKDI * VGK*EVKS YM 
FTNDMI \LDNPKDS TPKKKKTG YFWGGP 
GSKPPPQRGGAFLSLTRDPLEREFPKTA 
LFTLGQKKIKGPKFFS 


1765 


15666 


A 


1776 


334 


402 


KGGGGVGG*QGPWRLAHCTDK\KEERKR 
ERERKRQRKKERKKBRKKEKKE *MKNNK 
KKNDK 


1 /DO 


1 JOO / 


A 


1777 
iff* 


*twu 




SPSSSSSSLFSPPPFFWGGPRF/SPPPP 
VFKPPPPFFFLGPPKKKFFPPPPAV*FF 
FF\LGPPPPFFFFFFFWEAGFPFFSPG* 
GP/SGPMAGFRSLPPPGNSLSKKKKSEG 
IjGEGGNS VLTRVLL I SS YQ I PGNPR 


1767 
I fvf 


J JuOO 




177R 


70 


409 


LISFLVSSLIVRLYRPLLCLFPDPSKGH 
C I PCLL PLFW ILLH I FLFMHS F I YCLIN 
DRVLLCLPGWCAWRPRLTAASATQ\ I K 
RSGSHLSLPSSK* \ WDHRRCPGFFFFFF 
FF 


1 /Oo 


J J 007 


A 


1770 




11 


SHLS LPKCWDY / RL * ATTPS OKI LVFTH 
G*VLSLLSLSFLIPPD*TF*KMSLMRPS 
LTQKSSI IQHDLLDKVPITIFLRQSL/D. 
S /VTQARMQWHDLGSLQPQPPGLKLSSQ 
PQAL\SSWDYR 


1769 


15670 


A 


1780 


357 


1 

t 


LTLFWGAQYLPKKGKGPFFLLSLPSPSV 
WGPLPQKKKSPPLCFFYFLNRVLLCHPG 
WIAWQSWHSSAHFSL/VLTRFK*SSCL 
SLLSSWNYRCTLLHPPNFLNFWYRQGNA 
VLPKLVL 


1770 


15671 


A 


1781 


122 


254 


RKNE \ WSGAVAHACNPSTLPGPGGQIMR 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 

• 


SEQID 
NO: in 

1TCCM 

26 


Predicted 

beginning 

nucieotiue 

locauun 

correspond 

ing to first 

amino acid 

residue of 

pepiiae 

sequence 


Predict- 
ed end 
nucie- 
onoe 
location 
correspon 
ding to 
last amino 
acta 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
tLr~ Vjiuiamic aciu, r— rnenyiaianine, 
«j» — oiycine, n— msuoine, i—isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=GIutaraine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
w— iryptopnan, tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














buiJ y JJ & Pu w rl£j r rK.ir 1 U3 r sr u x r brl 


1771 


15672 


. A 


1782 


45 


387 


TQT P KLRRF FHFS LZiKS WH YRCSPL \ PQ 
HNGYFLTNLDIATSMFFFKDRVSL\LAQ 
AGBQWCDPS SLHPQTPGLRR\ PPASASQ 
EGETTGAHHHTWRNLI YFFYYTYKF * GT 
YPMA 


1772 


15673 


A 


1783 


1 


401 


FATLARLVSNS * PE / CDLPAS ASQS AG I 
TVSPAHLAISPSFPAMPSSGLGLSYPAY 
HPGLGLRCHLPVLTSPWTSDTGPSSVLP 
DAGALHCPPEPQHICPL\LSGWLQTP 


1773 

■ 


15674 


A 


1784 

i 

■ 


432 

- 


1 


FLFLFFFIKKKTPFFGAKPKNRGF*KPP 
EFFQIF/S/CPVFLGRFPKPQKEFFFPE 
REMGFFFLFSPQGGGPARGFPPPPLNQG 
EAPRAGHKKKGPNLGGRTFFFFFFEMES 
\FSVTQAGVQWHDtiGSLQPLPPPFKRFS 
CLTKAS 


1774 


15675 


A 


1785 




434 


RLSIiS CCGREEHSTLPGAP WRCTE IAWA 
DSPDPAPSPPSALPSLLPFHVYRDVCPV 
LCLRGWP* *MVERGRLGISPTWLLGWPF 
PGGA/PHIKPE*YFIjFAYTILRSVPNKL 
GGVLALLLSILILAIIPILHISKQQSII 
F 


1775 


15676 


A 


1786 


1 


258 


CWPETP VLK* SHLDLTKRGEDRPJ2PVWA 
ASTT I FFFETGL /NSGAQAGVRWVHLGS 
LEPLPPIPSLFMTP*GPCPGLGSSWPLR 
EF 


1776 


15677 


A 


1787 


399 

- 


63 


SLHNQWKSTPTLKTSKXISHI *STWPG 
WAHACNSNTLGHHGGRTA* /RSGVQDQ 
PGHHSETSSQRLRNP YIKRC I KYIiAKS K 
CCINDSFTVSVTSRKLIGKREVSPNNIT 
FR 


1777 


15678 


A 


1788 


3 


474 


MSISPVRWNSKEAGRAANRQFPSFSPWK 
DDSRDASPPEPASPTIG\PIRRLAESSW 
TWGSPCAEHPRARAGRRKAATDCPWAAG 
SQWRGPAGQGAPRS CLFPGSRTAARAQH 
PRVAPPPPPAPLNTRASALRSQLPRNPL 
* VMTPRPPAAAPRS P VGP 


1778 


15679 


A 


1789 


66 


395 


LVQPLFDFIWHSRLSLWSGRDLLWSGKE 
T^PCLNHSIGVLQEWQGSDVKKRRRLM 
ESLTGPAADVI R I LKSNNPAITTAECI»K 
AIiBQVrX55Vu£>o RDAQIKrxiNT YQNP 


1779 


15680 


A 


1790 


413 


1 


PSPRALI TDYS * EEGPRFWQV\EKKGQP 
LKPHPGLGSPHQESPRVGPPKRGGYNPF 
* KKRAKFFAPGENKGPPFWTGRDPTF* G 

LRRS LAWS PRLE CS AAXSAKCKLRL 


1780 


15681 


A 


1791 


314 


1 


KTKPFFIjKKTPPKKKNKKKGSC*DKARY 
QTRKG/ I VNLGH/HPSFLFYFILFFETE 

RHSKPSRKDYRHPPKRPQECVQ 


1781 


15682 


A 


1792 


104 


409 


E KQS FAFDWHFDL YFDNVK * KEGEESKA 
GEFNASTGWFGNFRKRL/RFKNVRVTGE 
PASVTQEAADEFPDNFKKITEKKGYLHG 
KFLMYHEAAKYLNFWCPTK 


1782 


15683 


A 


1793 


392 


2 


GERERDRL* REREREREREREGERERAR - " 
EPJ3RARQGTSTVESRF/HS YCRRDQDVA 
RPKAKGEVAAGRGS PDGLQVGRGQ * PGP 
S LR PG P WREWG F ATYLACGGP I PTGEVE 
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SEQID 
NO: of 

- ■ _ — ■ A. Vm at a *M am 

nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M. 
etb 
od 


SEQID 
NO: in 

1 TO CXI 

yssw 

Uy/315,1 

26 


Predicted 
beginning 

a a a 1 am 4a *J #m. 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue 01 
peptide 
sequence 


Predict- 
ed end 
nucle- 
ouae 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A*Alanine 
C=Cysteine, D=Aspartic Acid, 
tv-uiutamic Acid, r — rnenylaianine, 
ur— ixiycine, m— jwisnoine, j=isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=G lute mine, R=Arginine, S^Serine, 
I s J nreonine, v^vaiine, 
W=TryptopnaD, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














SGEEWAQQGEWGQALGPP 


1783 


15684 


m 

A 


1794 


405 


119 


1 XKNAX VPKAI YRIN 1 ISIKVPMAFFTK 
\LKTI IKFI *N*KRARIAKAILSKKNKA 
GSIPLPDFKLYYKAIVSKPTWCWYKNRP 
I VLVHS RTARKK 


1784 


15685 


A 


1795 

- 


455 

. 


3 

■ 


CSVTQAGEQWCNHSSLQPQPPGLXWS/C 

"*» aa. \ f am. aM favMf* a ■ - avf% arm *a am fft A m»at am. _, am am a av 

P SQL P K \ WNYRHVP PRPALTAH PALTAD 
F*RRKEYKLMRHRGKKWHDFTLRSKKMK 
AMRHE YHS FLTYPNGHLYLHWQFFFFDT 
ESHSVSQAGVQWHDLGSLQPPPPGFKRF 
ERVGPGGN5GAD 


1785 


15686 


A 


H96 


2 


134 


PRLQHCTPAWATPQD S VS KRKRDMTTDP 
AI * KG* * ATTTN I Y IHKFYNLDE I DQFL 
KKHKLPQLTWYE I / DNLNS P I TR\ IEFV 
IIiNSKKKYPGSDGFTGEF*D*FCIjETKK 
R YD YRP CNLKR I ISDYNKKLHT 


1786 


15687 


A 


1797 


1 


404 


PTRPLTGSS ASGMMVEIFPKTYL I SSAC 
WVKISKIDLKLFSPFFFFGFLEGGFYFC 
PPNSRGGSPGGKFGLIEPLPPGLKGIPP 
PPPKRGGEFGPPPPTPAYFFFFLWGGGV 
/PP/AVGGGGKPPI *GNPPPWPPQG 


1787 


15688 


A 


1798 


2 


383 


SGWLWACRS PDS EPLACPAGPRQ * GAYH 
PGQQLVIMVR*VSQQCWPTPPALHSIS\P 
RP*LPPRRAGLGTCSPP/VPESPLLPPW 
APAWPKPPRRALPPA/PPVPGPIDRPRA 
EECRRMVRDWQAAPPAAP 


1788 

* 


15689 


A 


1799 


35 


410 


am jmaaw aav ^a a* w mm ai^ma ft m am *■ a* j am* u J P avm tana tea Ada a* a>amm 

ATGPSLGKVCS * AFS FL* PKLDFWHLYF 
QASGFLL / CHNS P SQTNSCS S FGKHYGM 
VLRVNSL*PDHPWNEFYFLNFFIFYRDS 
/SLTILPRLVSNS *AQTILLPLP/PKVL 
GIiE 


1789 


15690 


A 


1800 


65 


415 


■ rj*UT V-V a- a- _n- i — — m>m, ^m. ana a— ^haW m- • _ _ _ a\ am m~w -* amm _ a<w mv 

KKGVLKGGP PPQPLGV*GP PPBKGGPGP 
F\GAPKEKPPLAIiGEPKGKPF*RGPKGF 
FFKKGLEPGAPPPPKPRGEKPPVFRGPH 
PFCKKGGGDKREKNGGL*RKKPTELGNG 
PPPPG 


1790 


15691 

• 


A 


1801 


417 


3 


NLGPNI FPPRAPQNWGPAPPLFFFLKKN 

Mfrtf ■ -am ^aj ^m jm. am am* a* / *» *a)Mam a*"l YtfW/'IV ffA* 

FTWGGGSTPLFP /LNLGGLGGPFPGLEV 
YAPPS PHG* PRFFFKNQKLPPPVWGALY 
S PFFGGWEN\KKAPPGQTIjF r FFFFLK* 
GQDLPMLLQLVPNYWAQVILPPWPPKV 


1791 


15692 


A 


1802 


1 


431 


QPCTPGLK*SSCLSIaPSIGDYRC/RTTV 
PS * FFFFFLEKGVGFI PRGGIKGLDNC* 
LGPHPPELK\NPPPQNSQEVGTTGPPPR 
PG*IjFFFFETFFFF*KGGFFIIjAQP*MK 
WGALKKTP PAFS KRGG YRGGTPMEPTHC 
FLKEN " 


1792 


15693 


A 


1803 


256 


399 


AIKNIHDS*EEVKISTLAGIWKKLITTL 

J. LiUC Uur Mo v EiEt V X V U VfLCi 


1793 


15694 


A 


1804 


407 


2 


FE KANLFP FLFKNSG P PNVPGRGMAGVP 
KFCCPSKRKVPGP I FFLRRLFLFYPKCR 
RPPLFWPAFGP* KFFLPQVFAPAF* KPS 
PKKKRAPVFFFFFFF/RDKVWLCHPPWS 
AMA* SQLTVTS VS WAQAI LLPQR 


1794 


15695 


A 


1805 


429 


116 


LLTKKKRKKKPLACGESGLGGP*GTVTO 
V*QAEDTHVIW\VLSSAPSLSSEBMTDS 
M P GHL PS BCDS R YGMEMLTDKKWTWDGGA 
WD S S PQGANG KRGARQASGFS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
nenfide 

sequence 


M 

eth 

od 

vVJ 

* 


SEQID 
NO: in 
USSN 
09/515,1 

« 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

* 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, J0=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine. 
G^lycine, H^HLstidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methtonlne, 
N=Asparagine, P=Prollne, 
Q=Glutamine/R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X^Un known, *=3top cod on, /^possible 
nucleotide deletion, \=possible . 
nucleotide insertion 


1795 


15696 


A 


1806 


2 


408 


FVI FVFLVETGF \S I LARALACAISNS * 
PRDLPTLASQGAGITGVSHRTWPRRSCV 
F*EAFLTMLMVLHIPPSLLWHSVFTHLL 
ALSS * CFLFFLRQPWAVTQAEQQGHDIiG 
SLQPPLPGLKWFSCLSLPSS*NYR 


1796. 

< 


15697 


A 


1807 


1 


196 


FRLGASLDLSGGCSGVSWTPAPPLPPPP 
PPSAASGIS /SGSTSAAAGL* SCRTAFF 
SFLSSFFFFFLKKINPFPFLGGI*PFKG 
GPRLV*GNI *PPPQG\DFLGGTRGGKKP 
WGGGKFGQGGNFPLPLKKPPGPQKNPPP 
\PPPPPP8AASGISMAAHLQLPACDRAA 
QLSFLSSHLFFFFF 


1797 


15698 

i 


A 


1808 


395 


3 


LGKKMNNP KF WQG CEATGTL / M / HCWW V 

LGMHSMPACTFVHQKT*TKMFIAALFIL 
PLNWKQV/RCPSVI \DGY I PTMDQSTAM 
KMNKLHAKTWMNLRNLMLNKKPR 


1798 


15699 


A 


1809 


7 


454 


IPGSTISLQPPPPGFG+FFCLSIiLSSWD 
pp»pappp /rANT.pp*T*T,n^A*r.wi?\pR 

PILDEMKS FMCNCNSLTEGVKGRVEMMS 
QNGRLLTKFCHVGQACLKTPDLK* S ARL 
GxxLKCWDYSCEPPCIjx , QMPFSFFLFF*D 
RVS/PLSPGWS 


1 TOO 

1799 


1 C7AA 

15700 




1Q1A 

JolU 


OA 




SMCM05EVSAGWTIiLCLCLST\CSGAYA 
CDWGCG/ CYSACVCARVCACVCAQLLTC 
I GMWE * GQAGQGEVLLDIiPLCLCS CWAP 
Q 


1800 


15701 


A 


1811 


3 


414 


S S KNDNNS LQ E FMDK I AGM KKNLGNLTE 
LNNTV* EFHRATTS INNRINHARERISD 

J-UXL1 X V UC muii X O XWIRXHImIOORXOI* 

LBNWLSE I \NRQT* KIVTRNEQKLREVW 
DYVKRLNL* I IG \ VFEREGGKAYYLQN I 
FEDIVHENFPRFARDANSQIQEMQRT 


1801 


15702 


A 


1812 


1 


443 


AGKS PS P KKFKKGPGGGRWS * FLGGAGG 
GVPFRPGTOGSNYRFF /SPPPPPLGEKK 

KPPFKKKKKTPQKLQNGNPHPYFNPRVH 
GC3eEQQPWEKAPDATRQPHYADKHVEAG 
EPREPPKPHSEPLFSSPRKRPLFTRSST 
/SGASPP 


loUZ 


I J /to 


a 


1R1 3 


A1 1 
*ri I 




WKKNVFNKECLBR * I F/ 1 IOKKKLDPX F 
LTRYIKIKSK*IKDLNIRLEIKKTPGKE 
SVTLAKWLIRSPYLSFP*QIQSKQ*INS 
YVLI KI TKEERWS TS KEYQKP WRERKAM 
YKIK 


1803 

- 


15704 


A 


1814 


369 

• 


20 


QEVRPSIxTJSSSNRKYVKDSNARFTKEA 
IQIANIHMKNCPTSLIVGETHIKTSKGY 
hyvpirmakilkdX CNRRCGETGTLIHF 
*WEWKMVQPFQKPF*WFPKKRKIH*QFD 
LATRS 


1804 


15705 


A 


1815 


2 


675 


GLAI LGRRLRGEACTRS P FS S I L I FMVS 
MGEWPGVP/ GIKGTRTGAEAVPTRRRKS 
S I W PQTGGAGES SG / PLRGLPRQGKPGS 
PRGPSGPPQNCARWWHPQAAPLGACCFS 
GPEPSRLPPWRQGNWCLPSTPSSSA*EG 
WR/PVLQPGFL*SPSSLASICPGAERGP 
PGSSRP/GLRGAPGG 


1805 


15706 


A 


1816 


273 


2 


MESHIMWPl^riGFFIiLVQCXiHGPSMIjQC ; 
I AVLHSS YLFI YLFI Y * DRVLLCHPGrWS 
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SEQID 
NO: of 
nucieotiae 
sequence 

*i 


SEQID 
NO: of 
peptide 
sequence 


M. 
eth 
ou 

• 


SEQID 
NO: in 

09/515,1 

AO 


Predicted 
beginning 
nucleotide 
location 

cu ires puna 

ing to first 
amino acid " 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
"ding to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteme, D=Aspartic Acid, 
£j— tviutamic Acia, r— rnenyiaiamne, 
(^Glycine, H=Histidine, I=Iso leucine, 
iv— juysine, u— Jueucme, ivi— ivi era to nine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vallne, 
W=Tryptophan, Y=Tyrosine, 
A= Unknown, *=atop codon, /— possible 
nucleotide deletion, ^possible 
nucleotide insertion 














AVVQS*LTGASGSW\LRRSFHIiSLP*C* 
DFRHEPR 


1806 


15707 


A 


1 AIM 

1817 


40 


404 


GLiHE \PGVQGCSEJj* shdctpawvtset 
LTLKKLKI KKS KKKKRANPRGFERPTFG 
EAGPGGLLKAIISKPRVIQGTQKSGALL 
GFNSNGGENHPPQKQPYLGGLYAKAHSP 

KtjvaijJfir o JrvJk? 


1807 


15708 

• 


A 


1818 


392 


3 

■ 


EKYNMSYDIKSTNHRKNCKLDPIKI/RN 
CCSLKDTINKMKMQASNLEKI FAIHMPD 
RGLIF!OlKNSCl^VRR*QPPPPKEAKDL 
NTQKSQ* TNG/HGSKETSLI.IREMQIKT 
T/MwYTTSIPT*MLKIKKMNI ( 


1808 


15709 


A 


1819 


311 


431 


EWGRAWWLTPVTPSLWEAEAGESRDQE 
*KLCTI^K*KTISN/HDVPIRSSWTGM 
VAHACNPFTLGGRGG* ITRSGVGDQPD* 
HGEGL 


1809 


15710 


A 


1820 


68 


410 


AKKNQGPGPMVFGFGGIKPPPKQKKVRG 
GFFAICPKEQVFFFFFFWRAVVQSLNHC 
SLQPQPPGLKQFKQSSHLSLLSS*GYKH 
VLPCPANFILFSLVETGS\SIYFPGWSQ 
TP 


1810 


15711 


A 


1821 


408 


1 


TPFFFLRVLRLTPPLLGNFFGPGFPFWG 
GFS PGPLLKGPRP / C FPIFKP I FQPGKW 
GLVFFFFPPFFFYPRGSR*NIJCKIFPNF 
P FFFPVFF* ILMPPFFFFFFFFFFFI»*D 
RVSLCRPGWSAVARYRLTAS STSQ 


1811 


15712 


A 


1822 


362 


76 


SEIAPLHFNVGDRVSLHLETNKQSNKQT j 
KKTIiIFRDRVLLCRPGKNA/VEVQ* *LP 
AASN/ FLRIiKQSSC ISLSSNWI YRHAPP 
HLATVFNFtillFE 


1812 


15713 


A 


1823 


314 


2 


VISKPCPRBLTCITYGVSLTQCSMPGRM 
KGLLL I WPPVCEVRRASGRPPLMGSEB P 
LCPAATP SGRCTQQ / LH* ERAMMTMAVL 
SNRKGGNVGKR* RNQI VAVS 


1813 


15714 


A 


1824 


57 


389 


NLHLLQLPTYTDADSTGPTLSGMNVKNL 
HWSYEYKYRSITGVQWLILGSLQPLPPR 
FKCS CLS FLRRWD YRCAPPRRATF\ * FL 
VETAFLERLTSCDLPTSASQSADITGV 


1814 


15715 


A 


1825 


410 


70 


VPIMSATQD YRHEPPR PAGRFLIuUjKME 
PPHNPALLLVGI * PKNMKSLHKDVCTPM 
FSGTLFAIAKIQ KKPNCPS S MDE W IN CR 
NY\MHIYDGICYSALKKNEILARRSGTR 

T 

U 


1815 


15716 


A . 


1826 


2 


411 


FLVEMGF\SMLVIAGLKLPTSGGAPASA 
S ESAG I TDVSHRAW P VFFFLKRCIjVLVG 
RS**A\WPHTNLIPPLPSGIKGDLCPNS 
AGGWEKGAP PPS PGKF/ CEF * GRTGTTN 
FAKo/ W 1 lvTPiJ 


JoJO 


• 


n 






• 


SCSVTQAGVQWCNIiGSLQPLPPGLQ*FS 
CV\ K* FS CLGLLS S WD YRHMP PHLDNKS 
IFSRNGVS 


1817 


15718 


A 


1828 


1 


391 


IiEPRRRFLQt^QDCATALQPGQQSKTIiS 
QKKKKKKGGPP/ S * YQKGQGCPSGKKGR 
GVAGKGAFG PGFGGENKTP PGGGPTGEG 
PFPQKGWGPSQGPTKGPNLWGPGGPKL 
GGKGGPPGPTKGGGGPSSF 


1818 


15719 


A 


1829 


2 


134 


DHIaSPWWNQPGQHSEAPSL/LINIWKL 
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SEQD) 
NO: of 
nucleotide 
sequence 

• 

* 


SEQIi> 
NO: of 
peptide 
sequence 


M 
eth 

OO 

■ «— ■ 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to iirst 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucje- • 
otide 
location 

corTcspon 

ding to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add. 
jLr=\jrjuiamic Acid, r — r n e ny j a ja nine, 
0=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
r^Asparagine, jr^jrroiine, 
Q=Glutamine, R=Arginine, S=Serine, 
1^Threonine,V=Valine, 
W=Tryptophan, Y=*Tyrosine, 
a— un Known, — otop cod 00,/= possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 








• 






AG CGGAHLS S HL / LREDHLNLBVQGCSE 

f w liiiAL. 1 f AW Al £■ w IJFVS ul 1UjN w Jn ± WK. 

LAGCGGAHLSSHLLGRIT . 


J819 


15720 


A ■ 


1830 


427 


107 


VQHQPGQQRBSPSIIIIQKLaRGGARCL 
♦SQLPGKLRLENRFN/SGSERSHQCTPA 
WVTDRD CLKRTG I TRAS SQRLS VG I KQS 
ULNrKTAQIjQVSAQSPSTVSTNlj 


1820 

.■ 


15721 


A 


1831 


3 


540 


VQF PNFKI Y YKATAI KT VL \ QHKQR * ID 
E/MNKI * TPEINS YVYGYLNFNKDAKAI 
QWGNDSF\FKKWC *DNWI SACKYSQTSV 
SASSASSSSS 


1821 


15722 

i 


mm 

A 


1832 


385 


2 


AGRQS ETP YHNST I MKVSLQVCVDLS CL 
LGPGSLVQDSPNIPAQRKRFIiRLGSFWL 
PLSRFSRVGWPIiPHWGQRSSGFSLPR\P 
P* SQI PAPRS PPPAGFVPARS WVCGPRP 
QTRPLPAERPS PRPRRL 


1822 


15723 


A 


1833 


7 


399 


RISRSYLSEYGGSGKEHPTLGASYARIM 
VFG / VF I I YF * RQCLA* AQWYSHSSLLP 
QTPGLKHPP\AQAS*GAGTIGTHHHT+L 
TFAF/lFVLGCFFLfNKISVTQAGGQGC 
NFGSLQPPPPGLKRYSCLTLPR 


1823 


15724 


A 


1834 


2 


306 


LARI.VSDS*PQ/CDPPASASQSAGI icv 
SHRAQ PAE E I LJCVFG TLCLE P QR PTPD I 
FI I PYLFC\LFETOESCSVAQAGVQWRI L 
GSLQPLAPGVKRVSCLSPP 


1824 


15725 


A 


1835 


12 


400 


KKGMVPKQLKVGKKPPFSCWGPNMKKRD 
SPVFHHQDPIFPIPFFFFGNGFtjFFPP/ 
LAGGQGGNLN* PNPLPWGLKEFPPPTPR 
GRGEKGGAPPP P INF VFLKKGGFPLGGR 
GGLEPPPLGDPPPLPPKRG 


1825 


15726 


A 


1836 


220 


401 


KGS FVF I PQPEGEG P FLG * L KPR FPGLK 
QFSCLTLLRSGNYGPLPPPPVIF/CGFL 
R 


1826 


15727 


A 


1837 


12 


357 


GLLGGQMNGSIiGTQTSYEDLMSSS\FKP 
NSPPPTPS/VRTGHLPK*PLESSNGPPP 
PQVSHSFQGQWARGHPSPPPQWNTPFSP 
PQQYTQCSKTDP*PPPSPPYLGQEGSNA 
PSLA 


1827 

> •> 


15728 


A 


1838 


8 


380 


LMKTHAKLLNKRVAN* IQQYKK/ HH/NQ 
MGFI LGVQI YFNF* KINLI QLINSVKKK 
KN/HSSSSSSSSSSSSSSSPPVLIKSLC 
NLGKNKRNFLCLTKG I YKNKTE \NSMKI 
ILNGEQLNAFPLRLGTK 


1828 


15729 


A 


1839 


2 

■ 


444 


VPGDAKWFSVLHLKDAFFFIPLVPESQY 
PFAFEWENPNTREK\TWAVLP*GFWDSP 
HFFAQPLERDLRGLQLEDGS ILQYVDHL 
LVYS PTQEASDQNT I KTLHFPADRS YKV 
S KKKAQ ITl^^VHClJGY I IjTPGTCK/ I#S 
PERVQAI 


1829 


15730 


A 


1840 


1 


642 


E I KG I Q I GKEE VT / S L FADD I D YLRIDS 
TKKLLE VI CELNKVG * KINM * T * IVFLY 
IGNEHLELEI/R/ELMPCIKTSSTMKYL 
EINLKKDV*DLYTENYKVFPREIKIT*A 
NQ*EILCLCSRRLNTKM/STFPQVFCVY 
YAIPVK/ I PSRNFLVLVDKLILKFI *KY 
RGPRTAKTTLKKKKKVVRLTLLI FKSYY 
KTIVITIGWYGFQDRQVD*WNRIE 


1830 


15731 


A 


1841 


3 


423 


HRITSE+ DLHLARELI W\SMYGSLDHKN 
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SEQ ID 
NO: of 
nucieouue ( 
sequence 

"l 


SEQ ID 
NO: of 

pcpuoc 

sequence 


M. 
eth 

- 


SEQ ID 
NO: In 

09/515,1 

« 


Predicted 
beginning 

DULlcOLlUC 

location 
correspond 
ing to first 
amino acid 
resiuue oi 
peptide 
sequence 


Predict- 
ed end 

niijilil 

IIUCJc- 

otide 
location 
correspon 
ding to 
last amino 

acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=A)anine 
OCysteine, D=Aspartic Acid, 

— VJl U La ill IL. AUUy J M UCliyialiiUlMlCy 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leu ci n e, M=Methionin e, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S==Serlne, 
a— j nreoninc, v — » uime, 
W=Tryptophan t Y=Tyroslne, 
/v — unKiiuwii, — oiu p luuuu, /— jjussi Die 
nucleotide deletion, ^possible 
nucleotide insertion 


• 


■ 

• 






• 




YDAELDRMAMP CLCAI AGALP PG YVDAS 
SSSKAEKKATVDAEGNFDPRPVETLSVI 

WSYGENIDEELKTRPMKR\ P Y \ KTYSEK 
D 


1831 


15732 


A 


1842 


33 


179 


YMLGGRGCSDLRSHHCTPAWVT/TA*LS 
KKQYQRQQNRLQYHLQEYERKK 


1832 


15733 


A 


1843 


349 


10 


LAGGLNSMEGER\LR*EREECTQQQMVH 
DKYCKDLMGFGTKPRHITPFSSFQAVQP 
QQ S NALVGLLG Y S S HQGLMG FGAS P S PA 
KSTLVBSRCCRDLME EKFDQRK\QWVIiK 
CR 


1833 

• 


15734 

- 


A 

■ 


1844 


15 


856 

• 


AAEQLSFIYKLPQNPSPSTPGSSLSGTH 
GMQTMIiGSTHLPNliTDMLGPELGVQGI P 
S PGCACQRGRG\GGGRE CCS PPGVS PQG 
\SAVGRGAEGPGGLTRSGSGAASALVRP 
GE KGCWCRTASGAG PQRRQRTRGPGSWG 
LSFSQTS EEKCPS PAGSAGAGP VOQRRQ 
SSFAGGCGTGAGAPGST \GDAHPAQGGS 
GGPLRSLPAVGGPRPGPSFLKTSGSGSV 
PQGVP I LWSLT * RALAAPGS QGPAGLAV 
S CTGGRG YRDPQAPGTGAG * HGNSTRLR 
GP 


1834 


15735 


A 


1845 


402 


2 


SKARRQGRPL»RQG*APG/AARIPEQKRI 
GGP/EERRRPSARGPRATRVAGEGPKPK 
GQTAMAGGGHDPLPLPPARSRSQESIGA 
RSRGSGHSQEQPAPQPSGGDPS PPQERN 
LPEGTERPPKLCSTLPGQGPPPNV 


1835 


15736 


A 


1846 


446 


32 


TSRKIS * KTGNQFLMKECS CNSHHKAAF 
TKKDVLNI \LAVVKHVNTKASETrHFFQ 
SGQAKVQQGFVKEGCELINEALNLFNNV 
YGAMHVKTCTCMRLLDRLQYIMGDYAEA 
LSNQQKAVLM\TERVMGTEHPCIRPL 


1836 


15737 


A 


1847 


440 


4 


VDGRHVE VSKKGGQVNYQAG \ KTVE I WA 

AITAGGAPIAFGPVIDGNVI PGDRQILM 
EQGEFLNYDIMLGVNQGEGLKFVDGIVH 
NEDGVTPND * KFS VSNFVDCMRPRRGPN 
YSRFQ 


loiv 


15738 


> 


1 OA O 

1848 . 




0 


tr i\K.u tr r if JftJXU X Jr JL r 1 XIV o lr V W £iJllJtrnV7ir 

APLRPDHPSLSPCPAMG*K\PGLPRGCP 
QTQISPLFNRSPASPLICHHHPPSEP\K 
PGPEPPPTPSSSIPS1ARFTRPGESSPL 

ir fJry l rouf r* 


1838 


15739 


A 

■ 


1849 


417 


31 


QATGQECGCHRGGPPPGPAGETEPQAPL 
RLPGGTGIPWGAGILCPS*LPGPGSLSP 
AAGRG/ SGPSAGPGAAWFSSP* /PACPS 
SSRS AVPGGAGS FRRAGPGLF YTLPAPP 
WCGRGASNKI IOMPGLVC 


1839 


15740 


A 


1851 


3 


285 


YTVCECVCVCLCVCLPVSL\SLCLSVSV 
CFFPSLCGFVCVCPCACVSLAECALCAT 
KRFV/CMAACLW*ASFCVSTWVMRPAVN 
R FRRGGS ALGA 


1840 


15741 


A 


1852 


128 


524 

* 


KI PGLGQRS EGVGQGKDDLHVTAPVPTH 
GWGEGAASKPTVLPPPPP\PDAPTVFFF 
FFFFWEKKYFFGPPPKGAGPQIYLLGPN 
PPGFKPFFPPHPQAAGI*YIKPPPWYKC 
PLLKKRGVSTLAP\GFPKPPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
pepuue 
sequence 


M 

eth 

od 


SEQ DD 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartlc Acid, 
iv-oiuiamic aciq, r— rnenyiaianine, 
G=Glycine, H=Histidine, J=Iso!eucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, ^Proline, 
Q=Glutamine, R^Arginine, S^Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1841 


15742 

• 


A 

- 


1853 

i 


1 

■ 

■ 


1648 

■ 

• 


MTVP LHSGIX3 KGVKP ERKTTVRRP FVSA 
GKKYACP F KKAETP Q WRRLRMRP KAP AA 
SLLATFPISQRLIPLTPARKHCPSESET 
STWVVS KAPATPRTRGAGPTS PPR PTRR 
R\ * ACSLQKL FAVE EEFEDEDFL SAVED 
AENRFTGSLPVNAGRLRPVSSRPQETVQ 
AQSSR/PAAVTPHCSLRGFGPARLGP/P 
ASLPPARPVLTA/GPSCIGAAPLRPVST 
SSSWIGNQRRVTVTEVLREPARPQSSAL 
HPLLTFESQQQQVGGFEGPEQDE FDKVL 
ASMELEEPGMELECGVSSEAI PIIiPAQQ 
REGSVLAKKARWDLSGSCQKGPVPAIH 
KAG I MS AQDES LDPVI QCRTP \ D PP * DL 
VLWVTFLFQ/ PALTVPTQQLHWEVCPQR 
SPVQALQPLQAARGTIQSS PQNRFPCQP 
FQSPSSWLSGKAHLPRPRTPNSSCSTPS 
RTSSGLFPRIPLQPQAPVSS IGSPVGTP 
KGPQGALQTP I VTNHLVQLVTAASRTPQ 
QPTHPSTRAKTRRFPGPAGILPHQQSGR 
SLED I MVS APQTPTHGALAKFQTE 


1842 


15743 


A 


1854 


235 


223 


ihkf ihq i wlaki tcprtkv* kersvll 
ptsfspvpsqghytckq\lcslasdlsq 
pdlvykfmnlavlham wns rkvs cyp wt 
miyflhanrt 


1843 


15744 


A 


1855 


373 


3 


I KDG I YR Y F YKALDS FCLCCE FMNQF FS 
CMDRBLSQRCFLNSAY*FPSNL*CYLCY 
LFFFF* LKKYFLTFFLFLRDRVLLCCPE 
*SAMVHS *LTVPLDFW\ VKGSSCHSLLS 
SWDYRHALPHLY 


1844 


15745 


A 


1856 


378 

* 


1 


RQRHS PAGNTGR PQVTPGG * I S WPS ITK 
DRTSMS S VTSGALGHTAAS PHARLLPLA 
LPSVRTQHGSPPPGQEQPTIICPSNLPT 
HPSLPLGMHPSVRASPPLCK/P/SPPSI 
PASVHASKHPSPPVY 


1845 


15746 


A 


1857 


3 


379 


YMVRKVI E VWFLLKLLLFFFFRGGFLGQ 
GWGP PAPG FS FGKKNPQGLGS SNLRGPW 
* TNP * PVPGGALFLVGPPTPAEFPFKNF 
SRGFLLV/ALADF * TRPRIiVS PHGG/RG 
AKGTPAFLESMPPWMP 


1846 


15747 


A 


1858 


452 


1 


GTHGLLLGSGPF \RQVFKPDNFVFGQSG 
AGNNWAKGH Y I EG AKIiVDS WD WQEE * 
ESCDCLQGLQLTHSMGSGMGTLFISKIR 
EECPDCIMNTLSWPPPKVSDTWEPYN 
ITLSIHQSVENTDETYCIDNEALYDICS 
RTLKLTTRCI 


1847 


15748 


A 


1859 


1 


385 


NTSSDYIFPFFFIjFRNSIHSVTvA!GGQW ; 
HNQGSLQP WPSRLK\CPTASAS I CLHMP 
PFLANFLI FFVEIGSPYVAQAGSRDPPA 

SFRKKCSPWPPGGGPPIF 


1848 


15749 


A 


1860 


470 


17 


IEMDSRRVPRDKLACITKCSKHIFHS1K 
I T * NELAS AD / DF F PTLI YIVLKG /NPP 
CLQ YNI Q Y I TRFCNPS RLMTGEDG YYFT 
NIoRLGTHCS WLMMTMWTCFRAFR I FS SQ 
WMSYVLRFRNGITGVSHRAHP 


1849 


15750 


A 


1861 


3 


790 


CSRPEFPGRRFVEAVRSKPYLSIjPF* SR 
* S FFNVPAE* TSAKDILASSEFI KQNND 
VSS\LQKFMPQEVK*LDHIHTA*ADGSW 
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SEQ JD 
NO: of 
nucleotide , 
sequence 

'a 


SEQ ID 
NO: of 
peptide 
sequence 


eth 
od 


SEQ ID 
NO: In 

itccxr • 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 

Artist C—DkA>k>tii>i*-ii>«. 

£/— Glutamic Ac id, r— rnenylalanlne, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, JL^Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S^Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion i 














KAELERGPDG£APSPFPQKPAASRQPEL 
GELATFLGRVDPWYQSNWTLCPAIHKL 
AEMPPSLDTSRTVDPFILDVITYYIRMG 
TQPIYFQIYTVKIFFSDLSQDPTEDIFL 
IEIiKVKIQDSKFPKDGFSPRRRGVAEGP 
GAELSLCYQKALLSHRPREVTVSLRATG 
LILKAI PASMY 


1850 


15751 


A 


1862 


192 


3 


SSGSHS VTQA \G VQWHBHS S LLP/ LLT * 
PPGRK* ASHLTLAS S * D YRRAP PHPANF 
♦IECREGV 


1851 


15752 

■ 

* 


A 


1863 


82 


370 


SLCQKRAPV6EKLVH6titV$t>Sd6kWS~ 
CPDPWGCRPRFEAIAVYSFLKLRWIPE 
VSILPEDLEELYDLFKVRSSGKMRGSRP 
♦AGLSRG/DPACP 


1852 


15753 


A 


1864 


2 


325 


IQVYS ISHLSI YLF\I YHLSRGSMHVSM 
SLS I * S IYPCMDV*MYLSNLCMHVYIYL 
FYGS I YRFYLS I CLS VYLS IYLSIYLS I 
YL I GUPS I FKTVT DEHAI FATWRH 


1853 


15754 


A 


1865 


3 


377 


YS P WCKLFRBLCKINVFD \LDS PLLSGK 
EFNDTTHNTFDHMWRTKEHNEAGWLLLS 
SVDKVMKENDELRDSNSWLQKQV\ * PLK 
SAKTALSGSLNSCREKAEIVEKQTQSLT 
M*VADLQRKMHVQP 


1854 . 


15755 


A 


1866 


10 


378 


GWKNGEFIDAL* KVYGHKAPNKSAVYKW 
IT/ * FKKGQDDIEDEDHSGRASTLRKKI 
HLVYALI DKD / * RLTAEAIANTIDX S I S 
LAYRILTEKLKLSKLSTQWVPKQLCPDQ 
LQRRAELPMEILK 


1855 


15756 


A 


1867 


346 


1 


DILWKLQKPQPNGKMLKAAKBRKKFAF 
KGVPVRMNADFS I AAMKA/ RRRWNS I FS 
F* KENNCHLRLL YS AKNI F P / E IKTFS D 
RE FVT I RS AVKE I LKDVLWAEERLSHVK 

jtaiMtt mm m 

SRNV 


1856 


15757 


A 


1868 
• 


I— 

* 


377 


GTFFQRTQCKGI KQ\YWGLI IKASSDP 
TCVEKEKVYIGKLNMILAQMLKQEWTEH 
WPAFISDIVGASRTSKSLCQNNMVILKL 
VSEEVFDFS SGQ I TQVKS KHVKDSMCNE 
FSQI*Q\LCQF 


1857 


15758 


A 


1869 


90 


384 


QWLLFTEYSSLYHPVLFFFFFFPGGRTG 
PNP PAGGEGNETG / PNGP STPGGGGNP P 
PLPPGGLGLLSMPPPPRQILLMETKKRP 
PP*TNKCCSPGYSPP 


1858 

■ 


15759 


A 

■ 


1870 


2 

• 


578 

- 


FWKHALLLGLDFLPGKMAPWSGQGCSL 
GHTEGGTSWDFAVGGASWRLKWCVKGD 
SHKGPATPIASCKGPLGRPCPLLAQSKA 
*GS * KRG/VAPGS P*LALGMGGG\DRLT 
LISQVHGNQVTQI IPFSTEGETKAQRSP 
SLPPRDLIRGRHSWNLDSTQLLGYCPLL 

trirr LttirJMjtr JjFV F c lJNofci JLUaJSJNoIuS 


1859 


15760 


A 


1871 


1 


382 


S GQDAGS CLL YGAGSGAMVS GAYNP YI E 
I IEQPRQRGMRFI YKCEGRSAGS IPWEH 
STDNNRTYPS IQ IMNYYGKGKV\RITLV 
TKNDP YKP YPHDLVGKDCRDGYYEA* FG 
QERRPLFFPNLG I RCA 


I860 


15761 


A 


1872 


490 


1 


ADSLSKDPGRPLHPPNIWGDRGRQ*PGE 
TPQ*HRRSCASQDPGRSQ*PGKILPSPK 
PGRPPPMTGEDAGPPK\HGNPSPNDQKK 
NPPPK* /PQGNRIHRDPGRFPPQ*PRKK 
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SEQW 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 

• 


SEQIJO 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
'amino acid 
residue of 
peptide 
sequence 

> 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalamne, 
G=GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M-Methtonine, 
N s Asparagine, P=Proline, 
Q=*Glutaraine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion [ 


■ 












SFPNNLGDSNDQRSRFPVA*KNLPQSSS | 
GTRAIGRELYLPPQPGSSAFPN5TT 


1861 


15762 


A 


1873 


373 


1 


GGGVPGALFSHKKKSLFFFPPPPFFFRK 
GL*NF\KIKGCGSPFLPLLFFLEKESCF 
VP\RVECGGVILGPCKVCLPGSPPFSAS 
AS * VSGATGAC/RPRPGKFFSPFFFFYF 
LVEMRFHRVSQDV 


1862 


15763 


A 


1874 

i 


3 


374 


YMLGKE I VS KTKIGQELGLLNQ/TSQ W 
NAKEKFL*ETKSATPMNT* IKRKQNSPI 
TETEKALLVWIEDQTSHNILLTQNVI*N 
MALTLFNS I KAERGEEDTEEKLEGSRYW 
FMRFKKKKAISIT \ 


1863 

■ 


15764 


A 


1875 


2 

■ 


364 


IHSGKGESLWDLGLDTEFLDLPSKA*HI 
KAKNDKLDL I KM * NFCSAK/ ET* / 1 RMK 
TQAIHWERIFVNNI *NRKLVYKELLKLR 
N\ KKNT IRR* AI DMDRHFTK/ E KMPMTN 
♦YTKGCSISLVIL 


1864 


15765 


A 


1876 


41 


461 


GFLYLCSSEIDPYPSPSKKIKSEWIRTY 
WMKLLSENATIMLKDSDLSQV\FCVNIP 
FVQAAKAKIEE*DYIKLKWFCSAKGAI + 

kakrqpte/w/dri^aitcpcvyglittt 
cmeftqltslittptitlpwnprqsyhh 

NPT 


1865 


15766 


A 


1877 


2 


185 


VRPTKLDPLERTQYTLPLLYKWNNKAWM 
TAHL / FTA* FTE YFKS \ S VKPDE+ VRE I 
DYRMLYL 


1866 

■ 


15767 

* 


A 


1878 


1 


491 


IHRLLW PPLS AAR PPS RESGLRCRAPRR 
PASAAAATAAS PS \ PTAPQGP PR/RRRL 
LIQPPLYPRGLFTPGVPPL/APGGSREP 
S * SLT * DARTLPLP VLG PRRARLLGACP 
AVQAEEGVDPVGHLSEVLAAPCRQP * T P 
PPACHCLDGEGRPSGVQAPLHKAKLYP 


1867 


15768 


A 


1879 


24 


449 


LQPPLTWALLLQPPQPKSRAAFFFFFFF 
FGKKSQFLFGPPGGGEGEKLG*REPPPP 
GTKGMPPPPPRERGKKRGGPTGRENLGI 
*RKGGVPPGGRGGGQTPNPGGGGAK\PP 
KGGK* GGGRLPPPQIKGPKRGRPKKRKG 
RP 


1868 

+ 


15769 


A 


1880 


190 


2 


PLYCHKVGQVGLELLTSGDPPPLSLPKC 
WDYRY\DHHAQPSF*LFLSVQISGIKFN 
HSWQPCI . 


1869 


15770 


A 


1881 


1 


458 


FAIRAGRNLPDKEFRYLRTVIVTAAVYW 
GLNS KLRCLTS \LLTFQHRAGVSPYTSP 
FGFA+TCVFAKQLLEPILC/RPCFHRAP 
IiLPKLRGHFAE/FP *QCFFR/QALGFS p 
/RSTCVGIjRHG 


1870 


15771 


A 


1882 


3 

■ 


392 


YMMRYHYILIRMAKVKI\SSTNDOTQ*L 
KLIHC* QECRMVQPLWKMV*QFLIKLNI 

*T 15*MT IVTT T.Hr*TVT TPMCTV WUU /B7T 

ciqmfmtdliviakywtq/pqcpsvagw 
ikq/irsihtveyysavkrnql 


1871 


15772 


A 


1883 


2 


473 


IQGGIAAYRVDRQERSNGRQGNRRNLSG 
TNMRRNKQKGLRRCKPRW /RSRGMAWKV 
DKPRDAR* KS DRDRQ/ RGETDG * RVSGG 
LTA* EAYRH\HRGQ\EAASQSGRSRQAG 
RQT P * QEQAGNEAE I RRE EE KSRGQERE 
RE \ KDRDADRHKG 


1872 


15773 


A 


1884 


138 


444 


CYLTLIKCRIIYINDKVIVLTIVWY*HK j 
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SEQID 
NO: of 
ncfvicoiiuc 
sequence 


SEQID 
NO: of 

pep Hue 

sequence 


M, 
eth 
oo 

• 


SEQID 
NO: In 

09/515,1 

■ 


Predicted 

beginning 

nucieouuc 

location 

correspond 

ing to iirsi 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

nucfo 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

C A 4*% 1 1 AVI 4* A 

Sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
Hf— oiuiamic aciu, r — .rnenytaiamoe, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
i^^/vaparagincj r^jrroiinej 
Q=G!utaniine, R=Arginine, S=Sertne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucicoiiue insertion 










• 




DRHID* WNRLQS P * LNPS I WVQLI LLNN 
AKNTQWVKDTLLDKWYW/ENWIYQCMRM 














PLGS YTI PMNQFKMNYGLS 


1873 


15774 

< 


A 


1885 


27 


470 


TICCYIFLFFSFFSIiFFPFPGKGVSP*S 
PGGENTG/ ANLG * WNPPPPQKG I S PA* P 
P KE PGMEGAGHPPGT KS * PvJKFGPOPGG 
GPSFPYRGPNKTWGKPGPKWFPGGIGYP 
ATIPQKKQPEGKRGR\PGGPMSPYNGPP 
KTPTHGKGPG 


1874 


15775 


A 


1886 


478 


1 


KNVQENSALQAAMCRKMLIVCQTQCVI I 
SGESGAGKS VAAKY IMGx I S KvSGGGEK 


■ 


\ 

\ 




• 






VQVRRGKQKDKLGRPHLHGSAAPPICTP 
ASV*LIjPTPPASAP\HVKDI ILQSNPLL 
EAFGNAKTVRNNNS S RFVS LCR PAWSS C 














SLRADLSTHTPTHTHTRV 


1875 


15116 


A 


1887 


1 


402 


HSLERPHYIG* LFKNIFSRD/ RVFAMLA 
HWSRTPGIiKQSTPLSLPKCWDYRCEPQH 
PAGSFFFFFFLKNGFWGCSLGGRAGGQQ 
*LKS*WRPNPIiG*GNPPC*PSKEVGTTG 
AHKKIANREIRTRACGGTNFSL 


1876 


15777 


A 


1888 


511 


124 


GTRRQHFAGAHVP PEGP* S \MLDPKIjK\ 
DDRPASDMWIREPGLLLPRAPAQDAGKY 
YCHRGNLTMSFKLKI TARPVL WHCLLRT 
GGWKVSAVTLAYL I FCLCSLVGILHLQR 
GESCPQWVCPNPTPSSPG 


1877 


15778 


A 


1889 


667 


310 


QLKP* ATKSV* KDTAFGIDVGNDFLAMT 
PKAQAMKGKIDKwGFI iR*S I CTAKETI 
NRVKR*PRK*EKIFAKPTWQKGQIS/RI 
HKEFQQLN/ KQKSNNLIERQTKDLNRFL 
SKECSKDL 


1878 


15779 


A 


1890 


462 


3 


KWFFPLGPPFLPPP/PPPI*NPSFQKNK j 
KLTRGGCARYFPPLKSPRPRIPFPFFEK 
GEGSPNSKHSPAP PFWGPKETFFFRRTP 
PPPPPPPIiCSFLKNNLFSYIPKGFLGGK 
GQNFHS FLPSFFFS IKKKLLGLGVHVRF 
YCRVNS CTGFFVQMY 


1879 


15780 


A 


1891 


1 


455 


NTCLGFGNGFLDATPKA* SMKKI INKLD 
FNETENFCSVKDTVKGMKRQATHWEK/V 
RKTHILYKDLILKIYNQLLKHHNKKTST 
I KQ * ArLDLjN RED I QM rN KxiMKKCbGWGR 
WFTPII RALWACKVEGSLELES S LGNIV 
R PHRKRERE IHCY 


1880 

• 


15781 


A 


1892 


1 


537 

- 


RGGIQAPKEVSPEGRQEPARKSLI *TA* 
ETPP*SQ* /PIPEEP/rGVFMKKPVSVS 
IiETGKHAVWPKVNGKELPDNPTI KWFK 
oKWJjnJjObKoviArtr SFKEoiiflSAoNVxr 
VELHIGKVVLGDRGYYRLEVKAKDTCDS 
CGFN I DVEAPRQDAYGQSLE S FQRTSKR 


1881 


15782 


A 


1893 


2 


514 


VRCQRRCHE * RACGS SLVNAKKL YEDAL 
MARKVKQSLFSLDVETDEDKFQMMSLQ\ 
CSLAYGTLTKILSEKRSAKSYGMSSVRM 
RSAGQTSKAHLHQPRRVSQVLQVPAVNL 
XjPFRKKGQTKD palnts LPQKVLGTTEE 
I SGKKHTEDTI S VAS S LHYS P PAS PQGS 


1882 


15783 


A 


1894 


473 


2 


VMGESRGFSPPPTFTGGNFVFFWKQGVS 
LFCPEGFKTLCSSSPPPPPPQKAGVLGG 
SFHARPPPFFFFQI*IPFFFGIKKP\LC 
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SEQID 
NO: of 

nil |t|&A^ino 

Ducieouuc 
sequence 


SEQ ID 
NO: of 

pL.pi.lUt: 

sequence 

■ 


M 
eth 

uu 


SEQID 
NO: in 
USSN 

09/515,1 
26 


Predicted 
beginning 

II u ticuiiuc 

location 
correspond. 

ino tn tiTvt 

111 g IrLF 111 31 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

II uwiv* 

otide 

location . 

rnrrpcnnn 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

F^falutnmir Arid. K^Phenvljiljinini* 

G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, I>Leucine, M=Methionine, 

N=A cnflrncnn^ P^Proline. 

Q^Glutamine, R^Arginme, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptop h an, Y=Tyrosine, 
X=Unknown, * i =Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SPPP*PIi*KKRGIFPVPKGIFCEMESCS 
LV*LSS*DYKCVPPHPMY 


1883 


15784 


A 


1895 


613 


11 


FRPRS PACGLHAVS S PKLPGAQALSSPG 
LNLVFTAGSWEAG/LLRLSPRANNPRVA 
LPRVHTGPSST/DLSPSCPLGLACGFGT 
SQPACL*SPFLLLAPARPSGCAWPGI,PP 

LRPCLCPALPLSREPPSPLLNDLVLPRM 
PRRAPVLPLGRTPHSLLLCHVQSPSEQL 
PSRGPERL 


1884 

- 


15785 


A 


1896 


20 

. ■ 


449 


KFGYS SAAARRQQLGWEAWL* YS FPLQL 

EVSPCVACGIQAALSMGSTSSVKIiIiSHP 
QAPLPQWHQMVFARCL04CGAQLNVPP\ 
BbJtrTrU VS i KiJ vyJuAJuWftwvbKlv/ KA1V 
FRQPL 


1885 


15786 


A 


1897 


393 


3 


RPTAQS KGN I * VRVAS \ EALS PKLLDFL 
PGKVLNGEKVDVRPATRQNLS QFEAQAR 

KRECVRVPRGGIPPRAHSRDSSDSADGR 

i 

A 1 sronst Li V Jr sbnK V V Ixir f£> V utr X r JN kCrlr 
SALPVMGLP PPP I PPPPCI 


1886 


15787 


A 


1898 


395 


217 


RER/ CKSFR/PPA/ HLQAKIKGAQ*QVN 
QAAAAQAAAPAAAMVSRDIS SLL VSSQ K 
SKVSNYM 


1887 


15788 


A 


1899 


1 


375 


NTVLVQ * NNKAWMT VHPF / TAWFS E YFK 
x«rtrDTvr*e /pvtotjvtt.i xtp&duuqudd 

TIjIE I * KE I YAVF I P ANTPS I LQ PMDHG 
IILSSKPYYLRKASRAQRLTPVIPAIjWE 
AEAAGS PE VGSSGLA 


1888 


15789 


A 


1900 


47. 

i 


326 


VKSI IQFNSKNEPGKHDKTPSPEKTQIV 
WWLGS EXiRS PS YSGAAGGAEMGRS LES W 
RSRPQRAETAPLHSSPGSGSEMLSL*RQ 

FCPSQHHPQPPSSWDYATRCQATKQFVF 
FIjEMGF \ VMFARL I LTVELNN 


1889 


15790 


TV 


1901 


181 


837 


ASRWGRGRWRLGQACRVPQGLPLSTFHL 
GAQAKARGGTPLACS SHLPNS HVGSLKA 

+ *GLHIVAARRQQGREEEPPRTTAAPET 
LCFQQTSWASSCSLEHSAQPSEVQVRAL 
SVPSHSPMWV\PQLSLPRDHRKPPGE 


1890 


15791 


A 


1902 


1 


385 


YTWGFRGKKPLIHCL* E YKLVQPLWRAA 
WRFSK*LRVEL* FNAAIS PLGVYPEENK 
LF YQNS TCTCWFI TALFI TGKT/WNQP 


1891 


15792 


A 


1903 


207 


3 


FREMEFLHLGQAGLELSTSGDPPTLASQ 
NVG/HYRREPLRPANTLQS * PLGLKQPS 
CLSLPSSWEYRHMY 


1892 


15793 


A 


1904 


2 

* 


391 


IQPLISQRKYKTLGQNSIiS/CCAHP/PP 
HFFFVLDS YFHSLLSQLKNAP FT * G * CP 
YWYSKSYSLIHSWGSSSItiPCPLTSSGF 
PSLPPSYQPLPCPSLFLIi*NSLSTLCLL 
FFSLLSQPSFSNRWPSQVYL 


1893 


15794 


A 


1905 


3 


424 


YRAGCXiQSIiLiPPPLLLFIjLPCDVPFPPS 
S\IiP*VKASCGLIRSQQNVGTIPCLQNR 
KSNKPLHKLPSLRHSLRAMQNRILPILR . 
IGKFF/ 1 FFS *DG VSLCHPGWS AGVQIiQ 
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SEQID 
NO: of 
nucleotide 
seauence 

( 

'i 


SEQ10 
NO: of 
nentide 

mm ^* MM «• w w 

sequence 


M t 
eth 
od 

■ 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid , 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

mm m 

residue of 

peptide 

sequence 


1 Amino acid sequence (A=*Alanine 
OCysteine, O^Aspartic Acid, 
E^GIutamic Acid. fr^Phenvlalanlne. 
G=Glvcine, HMHistidine. I=Isoleudne. 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R-Argimne, S^Serine, 
^Threonine, V=Valine, 
W=Trvotonhan. Y=Tvroslne. 
X=Unknown, *=Stop codon, /"possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FTALHLL/ VLKRFS CLSh 


1894 


15795 


A 


1906 


2 


389 


IQRGLDKSCLITDNIPSSQSLIQKKGVN 

EKBRSHLHN/ IK* VQGKAACYPEDLAS I 
IDEGGYTKQ* I FNEDY/MWKKM* FRS FL 
TREKLTPGFKASKDRLTPLV 


1895 


; 15796 

i 


A 

■ 


1907 

• 


458 

■ 


13 

• 


AKEETQSVWDFPM/VPQGVYVNFPVSR 
NANLSTIKOLLWHRAOYEPLFHMLSGPE 
AYVFTCINQTAEQQELEDEQRRLCDVQP 
FLPyVRMVDCEGARVNKLIiSSQISLFIG 
KGVRE LDS LSDPEVSDFSTKMCQFCE KS 


1896 


15797 

i 


A 


1908 


409 


3 


EKTDGLYRASQRGKDLRRLCAQSFHASW ! 
KDGMALCAL I HRRRQ\DLI G YAKLRKDD 
PIGNLNTAFEVAEKYIiDIPKMTiDAEDIV 
TTPKRDEKAIMTYVSWL*IVIAGAEQAE 


1897 


15798 


a 


1909 


116 


379 


HSGPRREGALLLPKCLPHAKRCLLLFKM j 
CSDGATALCCPGWSSAAPS*LTQSP/AS 
TSQAK*PSHLGLPSCWDYRCIPPHPANC 
LDYYY 


leys 


1575/9 


A 


1910 


At O 

418 


3 


UU T lAiAfJKWr J.u£uijt>UAv-Jc LifoLt^irAil 
LL * L * QLMLFS FPGTP VFS YGDE IGLBA 
SALPGQHMEAP VMLWDESS FPD I PGAVS 

ALMIVKGQSEAPGSLLSLFRR\LSVQRS 
KRP <3TiT.HGDFIAFQ AGPJrr»FS YTPPMY 


1899 

■ 


15800 


A 


1911 


394 


3 


I LEAYPEVKDPAVKGASSKKEMYGH* / a 
AEQALPYASEQEQQRHERSEKKQPQVKE 
GNNTNKS E KI QLS EN I CDS TS SAAAGRL 

QENEEKTNVNIMSKKNREDV 


19C0 


.15801 


A 


1912 


499 

■ 


141 


PGLGERD W TS KYGQGAGEGS TREWASRC 
G/IRPGGDAGQCXJPRPE*SVCPRGAHSP 

GPGS WKAS PAWHSAEPGGRCSGLGVQKE 
GFPGHIVLO PGGRTPGPGTF KERPSGYTjO 


1901 


15802 


A 


1913 


127 


387 


ISFVFPPTLPKMPQLKPETISMTGLNLF 
QHLCNLARLATSAYDGCSNSEV/ CDLDL 
LY*AALPLKLDYQIRFPNYFSTKYRIYY 
LCLY 


1 Of)? 


1 JOUJ 


A 


1Q1A 


J\rt 


V 


PGPGORKHSTAPMCIiLiNIYACFIIjRFVIiA 

nqiykcikriihhdqvgfipimqswfni 
qnq* \inlihhinrln*knhmi isfdkt 
hhlfi iktfiklgiegnllnliknif/s 
knpaani ilnsekv 












"1 J 


YTSNKOLO IO I LKT TYNSTKKYKTIiTNW 
IKDVKDQYTENQKILLREIKDLNK/YRD 
TSCSW/NIVQMSMFS/KLIYRFSEPPNK 
NLSTL* ILTTSF* FAWKYTGTGIYKVTM 
KNR VGRLS IiPNFQNY 


1904 


15805 


A 


1916 


420 


1 


ENAD CVERARKS PDS I P * \RGGQI S VTM 
VSPNEQEKAGQLAIGVRAVRYNGV\LLA 
KMWRKKLHLTLSANLEKIIAIGLFFSNF 
ERKPPENTFLKLTAMATHSESNLSCFAQ 
EDIAICRPHPAI KMPEKAEQYKPLTASV 


1905 


15806 


■A 


1917 


384 


3 


TRT I TS GQ Y STHVI RAS RVPTRS * VPVF . 
RSCTSNRRFSQAIEPRVMH*KVHIRAST 
VR YD S GGHVAVYPANDS AL VYQLG KI LG 
ANLYVVMSLNNLDEESNKKHPFPCHTS/ 
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SEQ ID 
NO: of 
nucieuuuc 

jCUUCIItC 

• 


SEQ ID 
NO: of 

pcpuuc 

JCUUCIILC 

• 

■ 


M 
eth 

OU 


SEQID 
NO: in 

26 


Predicted 
beginning 
nucieuLiuc 

■ lUwlllUll 

correspond 
ing to first 
amino acid 

rcaiuuc oi 

peptide 
sequence 


Predict- 
ed end 

lJllClc- . 
uuuc 

location 
correspon 
-ding to 

Ida l allllllU 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
jir-vjiuidiiiiL aciu, r— r nenyiaiantne, 
*j ■ajij'i.iiic, n— ".niaiiuine, j.— isuicucmc, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q**Glutamine, R^Arginine, S=Serine, 

T=Thr*»nninf» V=Valin» 

nrcuiiiijc, T~Taiinef 

W=Tryptophan, Y=TyrosIne, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 
















1906 


15807 


A 


1918 


405 


105 


KAEAQRRREI CSGPHSQADSAWHPLVTW 
WH* PPPVARSSQEPGVHSPVP/HVCRKI 
LLIRPKMALAKEGNYRELRWFTPWSRSR 
* VGCLTTP6MCVKHLRCV 


1907 


15808 


A 


1919 


409 . 


3 


GGQEEGVFRVLNSKEFETERGKP* IPPV 
KDQE 1 1 H \ PTKr NHVAHMG PGDGKQ VL M 
DLPLSAVPPSQEESPGPAPTNLARQPPC 
RNKPYISWPSSGGSEPSVTVPLRSMSDP 
AQDFDKEPDSDSTKHSTPSNSSNP 


1908 


15809 

i 


A 


1920 

! 


9 


470 


APARNPLPRPCTWPTGP*CLRPARPPVA 
SACLCGGTWNLALWLCSPGT/PVPFLTP 
PCS/SOBVQQPASHSVASNQSKEPAKSA 
AVAHECPPGGTGSADPGWPPGATCPESP 
GPATPHTLGWEPGKSSPPTMEEEPWAP 
QGSPCWTVRQRMTMMM 


1909 


15810 


A 


1921 


556 


1 


| IQLWAALGGILRRVGRDPPSHRIGKEPS 
AMAGQAGGNGDGEEGSS \GGLAAVPHLP 
* SQHAAHPLLIGPPGQQNLGDSKV*GFP 
S P RL EENTLENGGWGS KQLHGS PGSQHA 
GGS WKNGETSLKG/ PH* ADGAGRHTMPQ 
SPPSPPFKPHSV*HNPPAS /PPPHGSPA 
PSGTSPLPMSAVSLLPPGSL 


1910 


15811 


A 


1922 


567 


41 


GGWGETFSRLGNDLQAHR* SRFNAQAQE 
ETSR\VLAVSLINEALDKGSLEKTLSAI* 
LLPAAGLDDVSIiPVAPRYHLLLVAAKRQ 
KAQVTGDPGAVLWIjEE IRQG WRATQDA 
NTAQRMALGVAAINQAI KEGKAAQTERV 
LRNAAVALRGWPDCANGYQRDLESAMA 
KIQRPAAV 


1911 


15812 


A 


1923 

> 


2 


405 


IQGCG I TSS S VLHGMVFKKETEGD / VTS 
VKDAKIAEYSCPFDGMITETKGTVLIKT 
DEELMNLSKGEENLMDA*VKAIADTGAN 
VVVTGGKVADMALHYANKYNMMLVKLNS 
QWDVRRLCKTVGATALPKLTP PCL 


1912 


15813 


A 


1924 


510 


37 

■ 


LLGHAr HVQSSSGRPQLAEASGHSHLKK 
GECVQQRTGNVGIjSPNTARWGTPLiGPS I 
SSSAPPWSFSAFPGPPGSGK*AAKDGPG 
CSLKSGLKR\RSQGGLR*TRGSGPSPMP 

PKINTACVCAAD X S PGLBPV 


1913 


15814 


A 


1925 


74 

• 


429 


ATI PGHELLLLFFFFFFFFFLGKGAWPL 
G*GKG\GGPIRG*GNRAPRG*REFPPPT 
PGKRGNTGGGQPGQPI FG FLKKKGAP PG 
GPG VPKTRGQ I E P PP WPS KRAGVTG WTL 


1914 


15815 


A 


1926 


515 


304 


ALAASLALALNGVFTNTI K\ * I VGRPRP 
DFFYRCFP/GMG* PHSDLMCTGDKD\ W 


1915 


15816 


A 


1927 


1 


433 


NTVGSNKKSKKLYFAEIEKSYLNFIWNL 
KGPQ+VKI ILKKKESVTKCTLIHSKA* Y 
KV/LKI VWYWQRDTHM/DHVINS I KQHMV 
KQFFDGDTKTVQWAKDCLYNNWCWEK\W 
ISTSNRMKFNIYPTP*ANINCHQDPNSK 
ELYRYKN 


1916 


15817 


A 


1928 


316 


356 


GGT*PPQAATPIS*LYLPITSSL/TLPP 
APSLPLQIAPIPSDSPSPRYS/TPTPPI 
SLGLAPPTTLL I P VPSLPVS PRPLNSTA 
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SEQID 
NO: of 
nucleotide , 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M, 
eth 
od 

• — 


SEQID 
NO: in 
XJSSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide * 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alantne 
OCystelne, D=Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M^Metniontne, 
N s Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \ppossible 
nucleotide insertion 














PP/TFLTGSG^AVTWSLAVSLS PAPAS 
AMDKAQNM 


1917 

• 


15818 


A . 


1929 


490 


11 


PTRLTCPGSHVPGPVTFPPYETHSMP\P 
PCHVHPLVQPPGSSHVPGPVTSSPSPPS 
ATHIHPLVQPPGSSQLPNTWVHPSQRMR 
TWLQQHPPLPAPPQP / PSPGS ICSSMTH 
L.TEYLASFPAPQ/PREQAPQGGVPTSLP 
QW*APQPTAPCRATQSQQLLDG 


1918 

■ 


15819 

i 


A 

■ 


1930 

- 


1 


552 

* 


RNPRKRAFPSPPIPAPPS*NQENERSRH 
PQSLL\PFVKSRKRAFPSP/LSPCSSPV 
KSRKRAFPSPPVPAPPS*NQGNBRSRKP 
QSLLLLREIKEMSVPVTPSPCSSFVKSR 
K*AFPSPPVPVLLRDHTFLPLPBPRQPS 
TVPVGCFGFSRIPRRWNHTGCAFLLICH 
SARLLCDPRVSSCWRHS 


1919 


15820 


A 


1931 


414 


3 


RVPGESRK*ERVLDSRHKD*EGRRRGNI 
\MGKRVDYTT*TVITDVTNLSIDQVGVH 
RSIAANMTFAEIVTPFNIDRLQELVRRG 
NSQYPGAKYI IRDNGDRIDLRFHPKHSD 
LHLQTGY KVKRHM CDGDI VI FNRQC I 


1920 


15821 


A 


1932 


521 

■ 


103 

■ 


ATEPAGVRLKEGGNI TESFVTVGNVISA 
LADFSQDA/SRYS * KKKQVLVP YRDSVM 
TWLLKDSLGGNS KAIMMATI S PADVNYG 
ETLSTLRYANRAKNI INKPTINEDANVK 
LIRELRAEIARLKTLLAQGNQIALLDSP 


1921 


15822 


A 


1933 


1 


490 


NTG WWRKGEGNGTLMVRRLP CVPLCS S 
AGG LTEEDE GE L CAAG FPLLAED FGQAL 
QQLQTAHSQAVGAP/KBGDQVLGGGTGA 
P PTAYPNHALS IS* I PSVSWHDEGALPA 
VKIEILGI IHLLPERHALLSLVQARSGL 
• LLHGP PATG KI LLNKAGTTECCLT 


1922 


15823 


A 


1934 


383 


151 


E VAFLENL I KDD I ERGRLP LLLVANAGT 
AAVGHS \S * LGRMKELCVQCVNRPKMEG 
YMHVSQHPVPESHKMRAKAI F 


1923 


15824 


A 


1935 


379 


1 


YVPWS KDKE YF FN S Q * GLTTHQ I LP Y I 
DGFRHVQKISAEADVKLNLVRIAIQILL 
* VGLO S YLGQGHQPGKS / CRGPGCEGWE 
GMVLRS*AQLSLSGTTAL*HWCPSSRKL 
IQFGIIKNLIRRLCI 


1924 


15825 


A 


1936 


376 


2 


GOjFIYMKPTAMSSSQVARSGBVSPFTA 
VPA*S* K/QGHGAVLGCHITSEACFQAC 
FPFLRPGRSTCFA* SGC*DVERSSSHSH 
GTAHS P \HGTAHTP MEQHTHSHEQHTHS 
HEQHTHSHGTAHMY 


1925 

■ 


15826 


A 


1937 


426 


1 


KHGE I IDDLLKVYRDNASKKSAI / YSKW 
ITHWKKRRDDVGEEVHSSRPATSVCEET 
IHLVCAL I * ED*LIAAETIANAVG I PRC 
SAYTILT/E/KLKLSKLSTRWVPK/P/L 

IIIAGLY 


1926 


15827 


A 


1938 


2 


469 


KRRKLVSS IAAAIHPALSS*TRQRFIdbL 
LFLFH I VREDLVQLR I KKKHT I Q IGNVE 
VKLL* PTDNI I LCG KLYRFHRSTQKVLQ 
LI/NFS IVAAYKINMHISLVFLYSNDE* 
LENKTKQTSSFTIALKRIKYVGSNKKSK 
KFYFAEIEKSYLDFIWN 


1927 


15828 


A 


1939 


468 


920 . 


I PLSTHS PGMGTSTHLSTRHHIjS FWTYH 
ERLLNPGRETDWA/ IMLFRWGLIiMLPRL 
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SEQ ID 
NO: of 

niif Ipntirip 

sequence 

• 


SEQ ID 
NO: of 

nentide 

sequence 

■ 


M 
eth 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond- 
ing to first 

amino nniri 

ClllllllV CI w 1 VI 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 

■ ■ U Ml* 

otide 
location 
correspon 
dinp to 

\M m MM IV 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=PhenvIalanine 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, I>Leucine, M^Methionine, 
N^Asparagine, P=Prolinc, 
0=G!u tannine. R = Arf»inlne. S=Serine. 
T=Threontae, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


• 




• 






■ 


VSNS * AOVI PPTSASOSAG I TGMS \HAS 
GHLI FNY \ CSM* WLHLRNSQ * KRCTGQE 
TGPCPVTQDGVQWSNYGSL* PQTPRLKQ 
SYHVSLPSSWDYS 


1928 


15829 


A 


1940 


448 


474 


GIPGG+GSTTRNALYHVMNGEDWILTT 
CKHGKDWQKHKDSRCDRDNTEYBK\YDP 
GEMLHNAT PCLVHRGRRHGS PRPLEAMO 
AACVPVMLSNGWELPFSEVINWSQAAVI 
GDERLLLQ I PST IRS I HRDKI IALRQRS 
QFLWESCIATAIEDPNLIGVVR 


1929 


15830 


A 


1941 


1 


423 


NTTL I FAGGM *NVC PG PLCARLLHRSLT 
HATPD PPLLTLP * I P TPTP S S RCAPKAP 
PHT . T PPT <3 PPI >CS KPHPWP P / CSS CY> 


1930 


15831 


A 


1942 


48 


417 


RLTMYQVL YRLHCTALQ PL P P RVX* F S C 
LSLPSTWNYMHMPQCPTNF\CIFTRDEV 
PPMLP\SWWRTP 


1931 


15832 


A 


1943 


450 


2 


VHRRSNRQNMDTSR * RGAMQS VES V\GV 

P Y\EQ * T IVDG I \NSGVWEG I AYA* I EE 
pvppppnT.pnnTJirvT.vpvpri/iF /nvnrw 

T I KLNVEAVNTRRDKPTN I PAKNQAPVR 
MRRNS FTPLSS SNT IJIRPRNYS VGSRP I 
KPLSPLRAQECI 


1932 


15833 


A 


1944 


451 


3 


GRITRHLPRRAEDDRERERE PS PLPSRH 

G/PPPVSLPRKPAHRSC\PVFPP\ASGG 
DTS IHSGKTVYVKRKS QPAWP VLPPGGL 
♦APSHGAPSPSPPDQHRHCPEI \R*DLL 
PAPAPSPFSIPPLY 


1933 


15834 




i r\A c 
15*45 


41)2 




LTGSCVNC I VI L I LNF FYEKI S AW IAKM 
EIPRTYQEYESSLTiKMFLFQFVNLYSS 
WFYVAFFKGKFVGYPGKYTYIiFNEWRSE 
E CDPGGCL I ELTNQMT I IMAGDV 


1934 


15835 


A 


1947 


1 


405 


NTGWRVFAILCSLKGRPRGIERV/GGKK 
TnfKTCTfpr^GPTdRPanGK'TPTrrn KGGfszvoo 

IV_P».I\_r\J\£r v3VT.tr i-ftJJ £r.n\^ulv.i. c XV X \J IVV7VTvXr\\^VJ 

NRGKPPLFSGGNFGNGRI FGNPLPAPGP 
GGGGG PR * KTR \KRKNGNWDLLKGGGKL 
VFCPMVGKKLSGVPGV* QKNFKGGW 


1935 

* 


15836 


A 


1948 


443 


1 


LTPGAANASLLG\CCMEDLSVWG*RQGL 
WF AT ,T jTWMMVAOCPIiRK VDNAYGHYEAF 

S TLAPKAWLS VELAEPCVPE PGLPPVFA 
NFIQLLS A/ PVWTEGGTAWLE WWHVQP 
MLALMBAELRKSQVI^VT*GAHYSDCI 
AAALRIKIT 


1936 


15837 


A 


1949 

• 


396 

• 


2 


GNGRGGSVPPNSLNEDGISCAI * /RHIN 
WIiNG*TPTIYCL*ETHLICK/DH/HRLR 

VTCGKTCTC T I .TTD WTWOKPAGVATjIiT* GOTD 
v i\uuiu\ji unrj.1 v3x> y ivr/ivj v nmj j. vyiy 

I K* KAI KSNKEGHYE I KGS VKQEN I TVG 
NT YAPSTRAPRYKKQ JjLDLKGVN 


1937 

■ 


15838 


A 


1950 


419 


1 


RWQ/PSARPPTPSGK*GASLPARPSSGT 
*GAIiLPGCPVWKVRSASSWPPSRLGSEE 
PLCPA\PSHLGSEER\PSRPPSHIGSKE 
R\PARPPIA*DVGSASAPP/LPSGM*ER 
PAAPPSGR*GASLPGRPSPEMWGVPLPR 
RPVWDVRAPSRAPVWEVRSVSAWPPHLR 
REKTLRLATAPV 


1938 


15839 


A 


1951 


422 


3 


QNHVIMSE/DAKIAFGKIQYPFKMESLN 
TL * MKVNFLNIi I KI TQKNP IANTMFTGE 
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SEQID 
NO: of 
n u l ico ii tit? 
sequence 


SEQID 
NO: of 

peptide 

sequence 

• 


M. 
eth 

OCJ 


SEQID 
NO: in 

09/515,1 
26 

* 


Predicted 

beginning 

nucieonue 

location 

correspond 

ing to first 

amino aciu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
umg to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartic Acid, 
itr-ujuramic Acia, r— Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=*Asparagine, P^Proline, 

l~\ — Into m in A Dq A ^r.: n : n « C-CapI-a 

V£=uiuiamine, K^Argmine, y^erine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon,/=possible 
| nucleotide deletion, \=ppssible 
nucleotide insertion 














MLB / AFLLRQGKRS C/ CSLS P I LFI ILL 
EVLPS*VWQEK*/IQG*EREE*NCDFVD 
DM I VYVEN* KKPLYNLL* I IS * VARFKV 
ILDTCI * 


1939 


15840 


A 


1952 


2 


409 


CAHYSKNPPQRPLRTDARLPAV / DFWSL 
GAILFMVGCGQPPFQEANDSETLTMIMD 
CKYTEPSHVS KEGKDLI TRTLQIDPRRR 
ASLQ* IENHPWLKGVDPSPATKNNI PLV 
* YK\NLSEEEHNSI IHRMALGDITDR 


1940 


15841 

+ 

i 


A 


1953 


2 


367 


IQ CVRL * VHVCLCAS SCVC I CLYVHATL 
CVSTCL*CVAVCM/CLLCACATVSACLC . 
/V/CGCVSTCVCVCPPVCTCDCVCMSAC 
VCT/ CLCVCVTVSTCVCERLCVPTCVCR 
CPCICSSLNGNEWMG 


1941 ^ 


15842 

• 


A 


1954 


374 


2 


EAP WLLRAGRGALP P CWS KTP PS PLLF \ 
PPLLPGTLVYQPWVPLPPWNrLACAVTL 
ST*ARAGTSNPSWHLPPVSTAPQHPSTW 
QSPGAGTMGDPSLSPWTSHLLPGCCHNP 
CHCPPPHHKPTRV 


1942 


15843 


A 


1955 


1 


411 

■ 


NTPSPELHPC*PGL\PPLPSPPQQPPTW 
APPRTSTQQKLP I LCLLKPS AHTDAPCT 
QPGS TLPLHTPHTQQAQGTAYQ I HTT* A 
APPPGAKPG*RCPPPPPSRQPQRMQTPA 
PGNPQ P CPRXiTTLTRVLVPAVP PLP I 


1943 


15844 


A 


1956 


33 


451 


RGRNTPGPLQS PPPRFK * FS CLSLLRS W 
EE\RDYRCMPVHPANTMLASLVLNS\CD 
LPALASQS PG I TG VSHPTRPHLS * FLMP 
LE * GHHHLLLVLLSQPPNMP P * FQRFST 
GQIiECSSFRTNLIVPSPSLiNFHSDFPLT 
S 


1944 


15845 


A 


1957 


3 


399 


YMQVRTTMSDHSSHSISKINTDNTKSCK 
G* GS TES LFHDRWE Y KFVQLLWETVWHY 
VR*TFTILYNPEIILKR/lFRHTYIOJVC 


1945 


15846 


A 


1958 


47 


399 


AANPTLPAVFFFFFLETEPPFGPPGGRA 
GPQSRLTEPPPSGVKPIPR/PPPPGEPE 
KWPD\QGGGGGGDPKPPGGPPP*ATGQN 
SAGKKKKKKIPRPGAHGADPLLPGKAGG 
EEWLDPA 


1946 


15847 

• 


A 


1959 


407 


238 


TQAFALI / KDGGVIGGICFRMFRTQG IR 
EIVF*AVPSNEQVKVSGSPRRQPQCTAH 
S 


1947 

• 


15848 


A 


1960 


106 


1854 


NEAVKSKPNQTRNHKEKNNYVHLPDAGP 
SQPASAGGSSSACRRSTKATLHKRWVSS 
PAGPGVQPLSWQHPPMMA*GLQGNPSPQ 
AAAP PARPGL VSGNCLT *EMAQAGA\GT 
GGSLSASLGKRCPHI PVPCE\PVL*GLC 
VPGRSSI^EL^NPVTVQSFPGPGAESD 
PW* GDAECCS LLQAS FGDRAGWS IR/RG 

LGGFCCPEPLSREAESGMVQE* *GRFWT 
GQERTPTGR* GCKVAG YLHS S ATVGHSG 
AGAGGSPGKTSATLDVGQGLGGT/PSGP 
AWDRTNGYMEKAALPLCNGKVTGNTQCG 
AQPPASGPS PWVWQPLLLRQRLRQSTGL 
* LLF PGVGGGLQPGEG\GQP FPLS PWGS 
LTGRKNSNKARPGGNEGTEG * GARNGAS 
LTWVWTVPNGGYC PQAGRD FWLVDSRFK 
PSLGLRAMV*NSRKAPLSFEDGRMGTV* 
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SEQ n> 
NO: of 
nucieotiae 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 

■ 

■ 


M 

eth 

od 

■ 

• 


SEQU) 
NO: in 

09/515,1 
26 


Predicted 

beginning 

ijuciei/uue 

location 

correspond 

ing io iirst 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 

IJUCIC- 

otlde 
location 

vurrcajjvii 

ding to 
last amino 
acid 

residue of 

pepciue 

sequence 


Amino acid sequence (A»Alanine 
OCysteine, D=Aspartic Acid, 

IT 1 — {"2 1 1 i+a m I n Arirl 17— Ph onT/loIonino 
JEr - VJlULaUJIL /\IIU» r — X/Jcnj'JdJaliiriC, 

G=Glycine, H=HLstidlne f I=lsoleucine, 
K=Lysine, L=Leucine T M=Methionine, 

Q^Glutamine, R^Arginine, S=Serine, 
T-Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine> 
X=Unknown, *=Stop codon, /^possible 
nucieotiae aeicuun, v— possiuie 
nucleotide insertion 














PQGQEGRASHRKGDSGLSPHSGKG*DP* 

fl/pnrtlTEPQTCPPT.PRRT,*fiP/Vri'P r RrtB . 
\jf ruvjoxitvQXorJriirnijjj \jr / v \jE±C\\j£\. 

NGDWPW 


1948 


15849 


A 


1961 


76 


427 


QRQEPTPGHTTSGTRHPPGPGGRGPIQT 
HSHSHLF P WLETSAGSS PPNFLCSLP PY 
TPLLRLLPSPFLCPSQTTQRMPI PAPHP 
SPAS PCKVP*DARGAPAGGAPS TL/RGH 

rltMrriir 


1949 


15850 


A 


1962 


375 


131 


FLVGRIRLLYCLRELFVYLLREQKAKRN 
SQWVHTLPISSRHVDAVPCSPSINRNRM 
GRDKKRP F PL\ CVDA* LHS PS WIQP 


1950 

• 


15851 

■ 


A 


1963 


89 

■ 


538 


GLILLQKWHPGAVSSMVERG* LLHSGLF 

TH T"\ Y\T7"I IflflTn T/~V 1\ Z""tT //*\f*1TTT^T POT /MITOTIC 

FSF PDKVS S WQAGVQWHDLS SLQAPAS 
GSKRFS S LGDHRHAP PHLAH PC I F/M* T 
QDFAMLARLGTS FI YLFGCCC \ FETGSH 
S VAQ AEVQ WHNHS S LQPKS PRLRRS FHL 
SIPGSWDHRNAP 


1951 


15852 


A 


1964 


402 


42 


CCHYPGTPGLQGRLSASHQAS/SGPQLP 
APLPLQVLPFLALGIGVDDVFLLAHAFT 
EALPGTPLQVGPCP \QGSSEAAQLTG*E 
PLGS SDLGLLMNLGAS CPHL * TGS I IVL 
CPKGYCLDQ 


1952 


15853 


A 


1965 


8 


14 


SSRAAACPPRPALSIiPRPPVFTRRMGPQ 
GSGLPKLPVSAPSSRMGSFPAPSPIiLLA 
SGDRHLCPDAAALPLLAIETGCDS\PPG 
H*PPPSPGGLHPCSPPS*PLIL 


1953 

* 


15854 


A 


1966 


521 


2 


YBL YG 1 1 KRHRRE VSNNQQ VR YKEHSI P j 
PDYVSSVPTDPTWQPERREBESSGHrnV 
DHTGTAAGGGGGM I LAS PKLGATPLPPE 
*APA/PPPPPPPPPPPGVGSGHLNIPLI 
LEELRVLQQRQIHQMQMTEQICRQVLLL 

LPLTV 


1954 


15855 


A 


1968 


111 


395 


YLMRGFLIiHHNMVKGKEDERKTKGARL I 
LFFFFFFKKESPSPPRVEGRDP\NLGT* 
NPLPPKVKKLSRPTPPEKWEPGAVPHCP 
NNFLFFRKNGGS 


1955 


158:>6 


A 


1969 


413 


lOl 


YHFS PTRMGKIKN\ KKENKFWQGYGETG 
I FTHCWWECKMVQ P I KKTVWQFFLKVK* 
LLVFGPACKELGSYYSLVTKS 


1956 


15857 


A 


1970 


411 


l 


NLTPALCTKVHFKWVMDIKMKGKTIRLL 
ENNTRKYIiHNIKIWKGLNRTE\KP*TIK 
KKTDKFDYDTI KNLS S S KNKTRQT I DWE 

/ KKTNNPVGKWARDLNGHLI KND IHMAC 
I 


1957 


15858 


A 


1971 


3 


828 


GQACHFIFRSAQAGGSRLRIWRAKQLGL 
WRVCAIj IMPALEHS FPTRLLOGNSVPGP 
SISLDTSCL/CNRCVWEGGRG*TGPGLP 
SLGKVLVEGIPSESPGPTASHPCCSPRP 
DPDQ / LS CI SAPS ATPTVH* SHLTLPMG 
WPGPQ *GLHLSDQPGPRKP* PGCSGLGG 
GDAAPRGMEKP\PPPQLP 


1958 


15859 


A 


1972 


398 


3 


GPGCY FS VRLQCP / RKI PAWKRAVCSAT 
LI \SLQGPSLSAPHVLGLAALAVHLGES 
RSALPEVDVGPPAPGAGLPVPALFDSLL 
TCRTRDSLFFCLK*ALLPQSRLLSSLSK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


ML 
eth 
od 

■ 


SEQID 
NO: in 
.USSN 
09/515,1 
26 

* 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

mm 4m. mm\ mm m*. *A 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
J£r-l»lutamic Ada, t»= Phenylalanine, 
u=vyiycine, ji— Histidioe, l=lsoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= unknown, *=Mop codon, /-possible 
nucleotide deletion, Impossible 
nucieotiae msernon 














SDTCSFDKIR^DRHADIPPLY 


1959 


15860 


A 


1973 


2 


433 


QDPTKMTKEELNALKSTGDGTLG/RASE 
VEVKNEIVANVGKRE IIiHNTEKEQHTBD 
TVKDCVD I EVFTAGENTEDQKSSEDTAP 
FLGTIAGATYEEQVQSQILESASLPENT 
AQVESNEVMGAPDDRTRTPLEPSNCWSD 
LNGGS 


1960 


15861 


A 


1974 


373 


580 


T I FS RQVLRIQNALSDKPNVST VYSNNG 
SELHGTSEASV/YHWKILISTEYHKTL 


1961 


15862 

* 


A 

• 


1975 

■ 


47 


413 

• 


KWKHLRGS EHWRPQENQVHQR IAELRKA 
GLWS QRR / PAEAAGG PRPKSHWDYLLBE 
MQWMATDFAQERWKVASVKKMVRAVARQ 
IiQDRTRREAGARREEPSRLRQTSPVLPE 
KSSVPGLVLRR 


1962 

• 


15863 


A 


1976 


444 


3 


GYERSRGTTSGTHSSRDYDGIFGQATVK 
WAMLDQFRMLSPCSKEVMHQPFYLKRVE 
IMAQCEEWIADIQQYSSDKRVGRIMSRH 
SAAIKRRTGQLREELLKLPCPEGLDPDS 
GDAPEVCRAATGAEETIMHDQV\QPSSS 
KVLPSDFQ 


1963 


15864 


A 


1977 


281 


567 


PRSSLQGQELSRPWGRKGNFPGPPVPPP 
QRMFYDSELFSG I S DPS CC F / RS APTAD 
QVYGDQDMHEVVRKHCMDYLVRKWRPLG 
KGQGGKISSPHRP 


1964 


15865 


A 


1978 


357 


2 


KQI LG P PHPQAQPGRWPP PHGP KDAPL 
WS SRAAP PGRGLGRAG PAAGVEAGATLR 
DSS PS T WTREGLHVQAQRKRPSHVHKG / 
SGPGCItEDGEFPTS IiRLQAQLAE IGRGN 
GLSVRRQ 


1965 


15866 


A 


1979 


29 


434 


VQAEAEGLPGDTEHPQPQLMSRSLEGQS 
DVTI KHVACGDFFTACLTDRGI IMTFGS 
GSNGCLGHGStiTDISQPTIVEALLGYEK 
AQ VACG AS NVLALATE REL FALGRGDS G 
RTGARTKESH/YLPQQVPMP 


1966 

■ 


15867 

■ 

r 


A 

* 


1980 

■ 


3 


2082 

• 

• 


SSEGYLRGNMSENEEEEISQQEGSGDYE 
VEEIPFGLEPQSPGFEPQSPEFEPQSPR 
FEPESPGFESRSPGLVPPSPEFAPRSPE 
SDSQSPEFESQSPRYEPQSPGYEPRSPG 
YEPRS PG YE SESSRYESQNTELKTQSPE 
FEAQS SRFQEGAEMLLNPEE KS PLNISV 
GVHPLDSFTQGFGEQPTGDLPIGPPFEM 
PTGALLS TPQFEMLQNPI/3LTGALRGPG 
RRGGRARGGQGPRPNI CG ICGKS FGRGS 

ri -i -i- ^_ —* my mm mm my my m m~ m mm>mmm-m m ^m mm mm m. mm Amm m^m^Amm mm, 

TLIQHQRIHTGEKPYKCEVCSKAFSQSS 
* DLIKHQRTHTGERPYKCPRCGKAFADSS 
YLLRHQRTHSGQKPYKCPHCGKAFGDSS 
YLIiRHQRTHSHERPYSCTECGKCYSQNS 
SLRSHQRVHTGQRPFSCGICGKS FSQRS 

VLAIHARTHLPGRT YS CPDCGKTFNRSS 
TLIQHQRSHTGERPYRCAVCGKGFCRSS 
TLLQHHRVH\SGERPYKCDDCGKAFS\R 
ASDLIRHQRTH 


1967 


15868 


A 


1981 


2 


188 


LPETNFAELFLPYISQHNLIRKYKK/WP 
GAVAYACNPSTLGGQGGWITRSGDQDHP 
GLHEEWP 


1968 


15869 


A 


1982 


3 


424 


egqaivermnlclkqqlqkqkgenryyr 
tphkqlnXalltlnflslpkgrii^saae 



664 



WO 01/64835 PCTYUS01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
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sequence 


M 
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SEQID 
NO: in 
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USoIN 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
ii>=tjf]utamic Acid, Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M»M ethionine, 
N»Asparagine, P=Proline, 
Q=Giutamine, K-Argmine, S=Serine, 
T^Threonlne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


■ 


■ 






- 


* 


QHLQKPAAKTEAEQIiVWWRDLITESWEI 
GKI ITWGRGYAYVS PGL/NPSRHLKPYH 
ERMLRKRFREDPEDP PSCSHVKTDABED 
PN 


1969 


15870 


A 


1983 


3 


f*. jm. 

399 


YSKLSFKGTLTKFRR I YSSS FY KEPQGC 
I SDLCHP/R/ TLTN CLRGELLKENL VWG 
AFGCHPHFPCYYNKSQERNIiLQALRHPT 
TVAFGETGFNYSYKCTMPVPEHHKVFKR 
QLELAASLEK/ PLVI LCQQADE 


1970 

- 


15871 


A 

- 


1984 

! 


1 


405 


RRH IGGGVRLYY I GGE VFAKS LSDSAI F 
AQTPNCNQRYGWHPGTVCKI PPGCNLKI 
FNNQEYADLLDQSVNQGLEADY/QLTRM 
CTILMSLIiKGWGAEYRRQTATRTPCWIE 
LHLNGPLQRVDKVLiTQMGYPS ILM 


1971 

• 


15872 

• 


A 


"t r\ n 

1985 , 


27 

. 


452 


QGREHAQGGQS PGAGHLGPTP EPQPEPQ 
PRPSSQAVPAGRWEPAQE/PTRHPHPRL 
S PASRPLGS PPAPLVRSS PGRCRLHEHT 
IWSSTVGTSVEPAPSLGRPQAPLEPGTV 
TSSVRLQQPHMHTPGKIMPDPSKRNGKF 
TFT . 


1972 


15873 


A 


1986 


414 


220 


GAEQEELLSP/ GSGG£SELRSCHCTPAW 
ATRAKLRLTKQNKTKRESYRQGNQNWGR 
ILGELLGRI* 


1973 


15874 


A 


1987 


52 


412 


TRERKLFFACDNWKHLKRYLRKNSFGE 














NLWSSRNIKKKKKKKKKKKKKKTKNFST 














PKNFFFFKACQCMGKKNTKHDFHEQNFI 
EHALLQQLNINFSTNYIEGRVF\HPGAP 
IESLLLMCH 


1974 


15875 


A 


1988 


2 


143 


EKRRGLDKRTP/AQAAFEKMQEKRQMER 
ILKKAS KI PPPFVCMWSVDS 


1975 


15876 


A 


1989 


3 


163 


TEFQPSEK/ WGEDLGDNTWEYI FAIDLL 
CCHQKW I CHPLFLVG WRAGAEVSGVF 


1976 


15877 


A 


1990 


1 


439 


DKTAAEDAI RNLHHYKLHGVNINEEAST 
NKSKTSTKLHVGNISPTCTNKELRAKFE 
EYGPG\ IECDILKDYAFVHMERAEDAEE 
AI KGLDNTE FQGKRMHEQLS TSRLRTAP 
GMGDQNGC YTGGKDGHWS KECS I DRSGR 
VADLTEQ 


1977 


15878 


A 


1991 

* 


1 


145 


WAASKAMK/MGDWKTCHSFI INEKMNG 
KVWDLFPEADPVLLKRIjRESR 


1978 


15879 


A 


1992 


2 


425 


NI STLKKTLESDCT \KLFSQG IGGEQAQ 
AKVDRCLSDLAAETNKFRDLLQEGLTEL 
NS TAI KPQVQPW INS FFS VSHN rvEEEF 
ND YEANDPWVQQL I LNLEQQMAEFKASL 
SPVIYDSLTGLMTSLDAVELEKWLKST 
FN 


1979 


15880 


A 


1993 


3 

- 


449 


VAGPAPGAGARPGLDLQFLQRFLQ I LK\T~ 
LFPSWSSQNALMFLTLLCIjTLL\LKSFD 

YYTLNVLRDDIDNPDQRISQDLERFCRQ 
LSSMASKLI I S PFTLVYYTYQCFIiSTGL 
LGPVSIFGY 


1980 


15881 


A 


1995 


1 


410 


SSRRPFI ALKTKSMRDLNPEDI DQLI TI 
SGMVIRTYQLIPEMQEAFFQCQVCAHTT 
RVEMDRGRIAEPRVCGRCHTTHSMVLIH 
NRYLFSDKQMIQLPESPEDMPAOQTPHT 
VI LVAHNDLDDR/ VQPVDRVNDSGFF 


1981 


15882 


» 


1996 


1 


154 


LFFFRLLVRYTKKVAQVSTPTLGKVSRN 
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od 

• 


SEQID 
NO: id 
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Uy/515,1 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
oitae 
location 
correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanioe 
OCystelne, D=Aspartic Acid, 
tLr-oiutamic Acid, JP Phenylalanine, 
ij — vviyciDc, jh— riisuaine, j^isoieuctuey 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=»ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
A=Unknown, -Mopcodon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














UGKVGKKS WKtfPEAK\KRPCGKN 


1982 

- 


15883 

• 


A 


1997 

• 


2 


475 


KKLRRQTRREAQKE \LPDKV\RWGLVTP 
SETPVGISNLS PFLGPRCL/ QNPAELKP 
PWRFPWAQRPKAH\ ERANAGPKTPABQR 
KVKKI KKLKEDI S QGVHI S VYRVRNLSN 
PAKKFKIEANAGQLYLTGWVLHKDVNV 
WVEGGPNAPKTITRLMPLRI 


1983 


15884 


A 


1998 


3 


431 . 


QLRTRDRGWPSRRPEREKRTSQSARRPT 
CTESRWKSEBEVESDDEYLALPARLTQV 
SSLVSYLGSISTLVTLPTGDIKGQ/SPL 
EVSDSDGPASFPSSSSQSQLPPGAALQG 
S GD PEGQNP CFLRS FVRAHDSAGESSLG 
SSQA 


1984 


15885 


A 


1999 


1 


400 


ALDLRGLQI LVG FP KRRVTTCS YPTALQ 
S P I E YQRKERSTAVMRTEPDSAYQAS PR 
PYSAGPADSKKPTKGYCYNPTLP\RI»EI 
MTLEGTTG 


1985 


15886 

* 


A 


2000 


1 


372 


QNIDLVISFFSSRLLQAGAELSVERVIjE 
I IKQGWALPKDRL/RGSCAPTLSAAGR 
SSGGQSPCMPGLCVCSFWVLTVSWLVCQ 
KFPELKFKYVEEEQPEEFFI PYVWSLVY 
NSAVGLYWNPQDI 


1986 


15887 


A 


2001 


393 


1 


GG I GRGGGAGGG VGAAGS ASGGVGRRGA 
GGVIADSGAPGGGVEGGVGASGGWRB/G 
RGTSGGVGGSGGACGSV/GGSGGAGGGV 
GACGSTSDGVGRSRGTIGGLGGSGSAGG 
GVGACGGASG YVG I RGAGGG 


1987 


15888 


A 

■ 


2002 

• 


2 


362 


WVTFISLLFLFSSAYSPGVFRRDAHNSE 
VAHRLKDIiGEENDKAXiLL I AFAQ YLQQC 
PFEDHVKLLNEVTE FAKTCVADE SAEDC 
DK \ SL \ HTLFGDKL CTvATLRETyGEMA 
DCCAQQEPE 


1988 


15889 


A 

.* 


2003 


2 


358 


EANRGWF IRLKEGS QLYNI KVQGEAAS A 
DVEAAASYPEDLAKITDEGGCAKQQIFN 
VDK \QTAFSWMKJtPCK. iijliUUiaKo V Fl» 
FKL/SKNRIJ^Ia3ANAAGGFKLKSVLI 
CHSENSRTF 


1989 


15890 


A 


2004 


190 


1 


DQTCFliSFTvKAVi. r WoWVWWJjj. fv J. FA 

LWD YRH\ RPANF FVFIj VE TG fhhvaqag 
LKKLrGS 


1990 


15891 


A 


2005 


1 


132 


GMCHHAQLIFVF/CSRDRVLHGCSQTPC 
ijl\yroUiA»J_»r'J\vJVJ LI IK 


1991 


15892 


A 


2006 


2 


134 


PMTFFTEIiEKTTLKFIWNQRRIQIAKAI 
LTNQK\NKARGITNIC 


1992 


15893 


A 




315 


127 


SEIAFFFCSFLKIILDTR/FSFFARAGL 
KLIiASNDLPSSSSQGAGITGVSYGTQPV 
CFEYNVG 


1993 


15894 


A 


2008 


3 


325 


RCSMLAVREM/QKATKRCHFLPTRIALI 
K/ND /GSNKCWK\HCWWECKMVOIiIiWKl 
VWQFLKKLNIELPFDPEIPLRDIYPKQL 
KTYVHTKTGGQMF I AVLF I LNVPHLMNV 


1994 


15895 


A 


2009 


369 


3 


VGQAGLEFLTSGDPPASASQSAVITOflS 
HHTQPIFCIFGFA\GCPDWSQTPELKQS 
AHLSLPSSWDYRCMPPHLANFYFCRH/R 
VCX:PGWSQTPGLK\YPPALVSQSVA*rrG 
TSPRAWQDTILPV 


1995 


15896 


A 


2010 


114 

* 


287 


APLCLCLRHLRLL I KRLTVLGTEAHTL \ 
NP ITTRGRGGQITWGREFETSLANMVKP 
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K=Lysine, Lr=Leucine, M=Methionine, 
N=Asnnrnpine. P=ProIinft- 

Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine. V=Valine^ 

A A III vvlllllvf T ▼ HIII|S| 

W=Tryptophan, Y=Tyrosine, 
X— Unknown. * = Stoo endon. /=nossihle 
nucleotide deletion, V=possible 
nucleotide insertion 















CF 


1QQ£ 


J J07 / 




£SJ 1 1 


J. lo 




I MKFNIiIjKFYF I Yli FSROS Ii / S S VAOAG 
VHWRNLGSLQPLPPKF 


1997 


15898 




2012 


310 


1 


SSSSSNTHFGIPKYLINPDTCFLA/KVN 

nocu iujjci iiui jlj i\r uit\ 


1998 


15899 


A 


2013 


3 


301 


SDCCASNQRDSGGVGPSEPARKHTLCV/ 
CNSLDLIE" 


1999 


15900 


A 


2014 


332 


3 


LRDPLEEAVCPFSDLQLHAGRTTALFKA 
VRQGHLS LQRLLLS F \ VCLCP APRGGAY 

AWAMAGAPPPAWPRPCSLISDCCASNQR 


2000 


15901 


A 


20J5 


2 


325 


LLLHHAPPVNLFLtRDRVSLCGPLOGPGC 
SQTPGLKQSSCLSIiPKC/WDYR\RATAP 
GL 


2001 


15902 


A 


2016 . 


m% mm A 

373 


3 


wAHGLQPGKPKVDSLSKKRKKKKGKKIiN 
TFPLRSRTRQACLLSPLLSNIVLBVLPN 

MIIYVDIPKQSTKKNQGSYSVARPGAVA 
HACNPSTLGGLGG 


2002 


15903 

• 


A 


2017 


343 


1 


EFPFVSGSRATGKSSDIRATKYIWRVIiE 

IJjRAWPRGU« KLiKSb-XHlfaJbijCjo zHPCjA 

FRGDKWS CCHQKDETGGRGEDEVLLCCP 
GVLGCSAIiCRSGVRTKFGIRGRPWKERE 


2003 


15904 


A 


2018 


139 


2 


NSISTKNTKNWSGMVAHAFNPSTLRGRG 
GQl\RGQDFKTSLANMVKP 


2004 


15905 


A 


2019 


3 


322 


ARELVF F FGAYRKG F FHRDMKP KNLLCM 
GPKLGKIADFGLARELRSHPPYTDYGST 

SGS S I WNI KTSDSNF I FFKKS KG 


2005 


15906 


A 


2020 


205 


377 


DE/WPGAVAHACNPSTLGGQDGQITRSG 

JJKJEi 


~205S 


15907 


A 


2021 


3 


324 


KKWGKR\LNRAPNGRRYPETRWALEEDQ 
CHICKELHIKTVRFHCTPIRMAKIHTTD 
NPQSWPRCX3TGTLIHCRRGCKTV/R/PI, 

WKTVRQ/FIi/RKLNIPLPC3>PAVLSLCI 

v 
* 


2007 


15908 


A 


2022 


2 


382 


RVSQDGLNLLTSRSTRLGLPK/CWDYRC 
EPPSP 


2008 


15909 


A 


2023 


1 


421 


RWNPGGRGCSELRLHYCTPTWVTERDSI 
S KIAKNK/NNKRPRNNCR 


zuuy 


J591U 




ZUZ4 






LTSG SQSAGIAGVSHRA\GQKHQFRPEH 
RFLKFGFVFRDRVSIiCCPGWPQTPRLKQ 
SSRLSLPKSWGPATALDPDSSYLEEMVL 
o 


2010 


15911 


A 


2025 


2 


146 


NTFGRSRQEDHLGPGGQACS ELRSHHCT 
PAW\VI EQDPVS KKKKPPKP 


2on 


15912 


A 


2026 


3 


380 


RLE CS GG I S IHCNliR / LPGFKRFS CLSL 
PSRWDYTRLP/PFVFLVETGFHHLGQAG 
LELLTSGDPKCWD YGC\ DHCTWP 


2012 


15913 


A 


2027 


32 

* * 


296 


DYMNSLMYFHSLVLASTDEGFLPKTVST 
QSAGITGISHCARPWIPF/CFFFFKNRK 
TRFVAQAEGQGGNFGSLNPLPPGFRGFP 
CLSLT 


2013 


15914 


A 


2028 


2 


187 


FTLL P RLECSG \M I LVHCSLNLPGLRWS 
S CLSLLS S WNYTCVAPFS I FYYF I FLTW 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamlc Acid. F^=Phenvl&lanine_ 
G=Glvclne. H=Histidine. I=Isoleucine. 
K-Lysine, L=Leucine, M=Methlonine, 
N=Asparagine, P^Proline, 
Q=Glutamine. R^Arffinine. S s3 Serine* 
T-Threonme, V«Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion. V=Domihlfi 

nucleotide insertion 














GGTLNC 


2014 


15915 


A 


2029 


197 


3 


AAGLS QAGMQWGDFGSREHL P S RFQRFF 
CLKVPNNWDYRHGPPRPVG I FFFSF\LV 
ETGFHHPGQ 


2015 


15916 

i 


A 


• 2030 


88 


362 


KFGINLKKKEKKGPQKNQNFFPTFLFFE 7 " 
FFETBSHS ITOAGVOWRDLSPLQPLPPG 
FKSLPSSWDYRCLPPCPANFCIF\IPDS 
VSLCWQGW 


xuio 


1^0,1*7 


a 


ZUJ 1 


7 ■ 




GTIMAWAPGBGGCSEPRSCHCTP AWVT \ 
SETPSQKKKKGVLL 


2017 


15918 

* 


A 


2032 


1 


343 


Legse prlchctpawV/ Igds v£ kkkkk 

KKKK 


2018 


1591? 


A 


2033 


243 


14 


QKLAPWPPQSAGNNRRVPPHPASMAFLT 
VGS I TL FNS KIHYMTTVT KT I 


2019 


15920 




2U34 




i 


TPTjvT DVVT/^TTRr^PPUPA<3MAl?T.TTT , R 

DTALKFIQNHKRPQIA\KPILSKNRAES 


2020 


15921 

* 


A 


2035 


345 

- 


2 


IiVKIQKEMNVFMTANPISIPQFMDQGVI 

SWQSEMKTFWKGFI ILDGIKNTDDSWEB 
VKISTLTGIKKKLIPTSTDDFKEFKTLV 
EDVT 


2021 


15922 


A 


2036 


1 


140 


GRCC \ HELRSRHCT PAWATRAKLKKKKK 
KKREKKTOKGKNGLGFWAF 


2022 


15923 


A 


2037 


199 


348 


RSSNEGGRDCVFCFGDRVSLCLPGWSAV 
AQLWLTATS TSQ\ VKS LSHLGIi 




1 SODA 


"a 


ZU jo 






NGTJJAP I KRLRIiANW I KfiDDPS VCCIQE 
AHLTCRDTH /RCYLKG / WYKAF 


2024 


15925 


A 


2039 


116 


337 


SKLLKILPRLCWGWWQAPVI PATQEVEA 
EDHIjNPG/RSRPAWATQRTPVS I KKKX I 
LRPANGK / CHGPSW 


2025 


15926 


A 


2040 


368 


252 


VFFFFALFYLFFFFFFFFFFFFFFFFFF 
QFLGFYFRFKF 


2026 


15927 


A 


2041 


204 


399 


VSSHKINGLTVCSTSPFFt*SLLPPSEES 
AC FP FAFCHDCKFPEAS /S VMLPVKPVB 

T. 

mm 


2027 


15928 


A 


2042 


103 


356 


WHFSPPQPLPPP/PLPNPPPPPTPPPP\ 
PPPPPPPPPPSPPPPPPPPSPPPPPPPL 
PPPPPPLSPPPPPSTPP/PPPPPPPPPP 
PLPP 


ZUZo 




rl 




J 




TYC/SGKKKIPFKILLLIDNTPCYPRAL 
IxFJ4CEBISIVFAPATTTSS/LKPrllX2GV 

NIIS 


2029 


15930 


A 


2044 


2 


349 


PRVRKSPGPNGFTANFYQTFKELIS ILL 
KLFQKKKKIKKGENPPNSFYGAIIPRIP 
NPi^I-lDLSKK/ElYGPVSGRlSJMEAKIFTK 
FIiAGHFKQSFGRE IHHDQREFI PGIQGG 
FNIGN 


2030 


15931 


A 


2045 


280 


462 


CXFFLVWLVWCWLLFXVWLWLCFFG 
FWCXTWFC^CGVWFFVFFWCVLCGVL 
LGCWC 


2031 


15932 


A 


2046 


3 


284 

* 


PSPSFSLLLPPSFSLLFPPSF/SPPPPS 
FSIxIxLPPSASLLLPPCTSLLIHPPTSLQ 
LP PLPS FYLLLPTS I SHHL PTHNLPTTS 
IQDPSTPCSIK 
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Amino acid sequence (A=Alanine 
OCystei ne, D=As pa rtic Acid, 
E=Glutamic Acid, F=Phenvlalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=JLeucine, M~ Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V= possible 
nucleotide insertion 


2032 


15933 


A 


2047 


95 


1 


GRVDAILAHCKLR/ LPGFTLFSCLSIiPS 
TWDC 




1 WXA 


n 


70/10 


Alt. 


1 
i 


EPTTMTGAKWGS SQTTTNYHMLQSLRIN 
VRVDFFFFFCTDR/SLSMLPRLVLNSWI 
PAIILPHPPQVLGLQ 


OMA 


ijyjj 






J J 




EVRRPS TWGP PRLRS EEPLRL \QP PRLG 
NIGFCHVGHAGLKLLTSSDLPTLASQSA 
GIT\GHSYRRASEEDKKBSSMSS 


2035 

* 

- 


15936 


A . 


2050 

j 


281 


3 


QSNFKRALLSILLKAIYRSNIIPIK/VP 
MTFFTEI /EKVI PKFIRNLKRLRIAKVT 
LS KKNKTRRITLSD FKLYYY\ IAVFVAA 
WYWKKNRLSDOWN 


2036 


15937 


A 


2051 


251 


12 


iLRFFLPKFLGFQKPPSIiBKTHFFLPAF 
GNSLFWPPEKLGQKKVFFFFFFFFLRQG 
L /NS VSOAGVOWHNTjGLLOS PLPRLR 


2037 


15938 


A 


2052 


2 


325 


ADHLRPGVQDQPQ PGQNGKTP S PLKXEK 
\LAGCGGGHPRLRE ENCIjNPGGRG CSBP 
RSRRCTPAW/VNDSKTLARKKKKKKGVE 
KNECX3RKVMRV/LOGPKAKVKPWGENL 


2038 


15939 


A 


2053 


3 


166 


SLLLPRVECNGAI S AHCNLHL \HNLHL P 
GS S6s PAS ASOVAEVRGSLEPRSSSLA 


2039 


15940 


A 


2054 


227 


3 


LNENIGRITGMSHHARLIIiliFCE/YRVS 
ITOAGV/OHDPGSLOPLPRVFKQFSHFS 
LQSSWDHRCAPIRIAIFWVFC 


2U4U 


1 KQA 1 










ORKSHMFLT INOKLEMI KLSEEGMSRVE 
TGQKLDLMC/QVSQAVNAKEKFLEK1KG 
DTSVHTQMIRKQSSINVDMEKVGIVWIE 
DQAP/HNHIPLSHLLMRAR 


2041 


15942 


A 


2056 


1 


109 


RPLRRLRQENRLNRGSRGYSEPKLC/HL 
CTPAWAT 


2042 


15943 


A 


2057 


313 


250 


PQPPPPSPPPQPSPPPPSPPPPSPSSP/ 
PPP 


2043 


15944 


A 


2058 


134 


2 


EKESRSVA\RLKGSGAISA\HCNL\CLP 
GS SNS P PS ASRVAGS SGA 


2044 


15945 


A 


2059 


120 


326 


NSLVADME/KVLWWIEDQTSHNIPLSQ 
S /LIQSKALTLFNS IKAKRSEEAAEKI F 
EASRDWFMRFKXKQK 


2045 


15946 


A 




102 


413 


ERTGFRHVGKSGLEFMTSGDPPTLASQS 
VGIT/VHEPRTRPG 


2046 


15947 


A 


2061 


311 


1 


FFKKQRFCKSGCKTPGKPRGPKKLGANQ 
FSGPTPLKNGVGFSPGPKGGFFPPPPGG 
FPAGRP KWliTR I LG KGGS LLRG / RYP KK 
GFLKP I LGDNS PQRAPKRGP 


2047 


15948 


A 


2062 


229 


380 


WYDLGSLQPMPIiRFKQASRVNLPRSWDY 
RHPPLSRLN\IVFIVDTGILHVG 


2048 


15949 


A 


2063 


142 


383 


PQS CFSTHWQLLQKQEETAGAVSVCV CT 
S \VCVCV(^CVCVCAGAMCVCAGA/CFC 
VCVCAGA/ CLCVCVGA/CLCVC 


2049 


15950 


A 


2064 


408 


200 


NLIQIKALTHFSS IKAERGDKSTE/EKF 
EGSRGWFVRFKERGHLCN I KVKHEAANA 
YAEAACSQLSRRSS 


2050 


15951 


A 


2065 


348 


3 


WVSPYSPCVCVCVCVCVCVCVCRDWVSP 
CCPGVCMCVCVCVCRDW/ CFTM/CAQVC 
VCVCRDWVSPCCPGVCVCKDWVSPCCTS 
WSILKLLSSGNPPTTVSQSAWITGRSHS 
AWPARA 


2051 


15952 


A 




73 


286 


"NLIRGLLESHlLlSMRYGYCKSYTLSMR 
DIPEALNKWKS / 1 PMFCS WSRRLI VSMA 
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NO: of 
peptide 

ft? AA ■ a Aa* m%A 

sequence 


etb 

OQ 

- 


SEQ ID 
NO: in 

26 

■ 


Predicted 
beginning 
nucivoiiciv 

correspond 

inn fa fi rcf 
mjj iv nidi 

amino acid 
residue of 

sequence 


Predict- 
ed end 

*1 II J<"> 

uuuc 

location 

tui I capun 

ding to 
last amino 

AI.IU 

residue of 

UCUUuC 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

Ij 1 -—/^ In +n m if* A/*i/t IT— Pluirtvla Ian in A 

— vji u m in h. /^ciUy r— j^iivn jioinniiiC} 

K«Lysine, L=Leucine, M= Methionine, 

Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 

X«Unknown, *=Stop codon,/=possible 

iiutivuuuc uciwiiuii, y~ pusoiwtv 

nucleotide insertion 














VLSKLINRFNAKDVQH 


onto 
2U52 






ZVJO/ 




fi 
u 


pccTPT.T >PT7PRPPP<3PPPPPPPPPPSPP 

/PSSSSPAPSSPHP 


2053 


15954 


A 


2068 


221 


3 


ELKVILHCLRDFSLQSSIMKVLILKILI 
LSVCCVC/VCVCVCAVCSCVC/VCSCVC 


2054 


' 15955 


A 


2069 


1 


167 


GTRENPLNPGGGGCSEPR \SCHCTPAWA 
TKSETLSQKKKKKKKISNYKTPFKSYRI 


2055 


15956 


A 


2070 


2 


354 


ARACJjGIjPSSWDF/ SVENRFHRVGQAGL 
LL 


2056 


15957 


A 


2071 


20 


341 


CIVTVNXRGENINICWSPDGQTIAVGNK 
DDWT?IDAKTHRF/LKQNSSSS/SEVN 
EIS WNITONNMJj Y PDTGiyGClWlJjo I PI I 
ESRAI YQRP I FHVHVNQDLPHGDVLS 


2057 


15958 


A 


2072 


353 


58 


LQLLTTSDP PASASRGAGI ADGVWFTQ / 
S LNGAQAGVQWRDLGS LQPHPPSRL/LL 
ASQSAE IAASARP PPRIK5S EERIjCLAAH 
RLGCEEPLCIiAAQSGK 


2058 


15959 


A 


2073 


I 


338 


GSRLQRVCINYRLS FFPFLSQGWINFTW 
LFCLCVCFIjRDRVSIjCCPGWPSxSGFKR 
SSCPSLLSRWDYRHMPQHLASHTLFKKL 
/TILPR 


2059 


15960 


A 


2074 


1 


322 


GGGREAGEARGGGEGGQGSGRRRRGRGG 
PRTGGAEGGRGAGETPGGGARPERBQGR 

rinitn cn at 1 n>r / rj r\T 1 i/nnv r P^Ti^r\T Ortisrv 
t^RH^>riR^C3i , 1 / KU 1 KKfc'Kl K I K-UIjoUW Jv. 

NT 


2060 


15961 


A 


2075 


450 


225 


TPVGRGCSELRSCNCTPAWVTD\ETLSQ. 
KEKRGGVKIGWKKRRTR 1 1 SLHLPGSHE 

V TSXTKTT .13 T>\ TXTJ>3 /T 7>>TtTT?/^D T . Wl» 

K.rririJjE\L VivWCVIVrli? CKJxNT 


2061 


15962 


A 


2076 

■ 


2 

• 


470 


TPQNKPHPTTKNTQPQPTQKK/ PTQTQP 
/ 11 PKPK71 ivxvWT 1 irNFAirFJS I Unflhi 1 A 
PTTQNPHKTKQKNKTKPNTTQQQNKPTN 

TS PNP ISTKKI KKLAEP 


2062 


15963 

• 


A 


2077 


304 


1 


NSATPPCSPTA/KPHPTPPTPPPTSFIP 
TCQHSPPTKICPQARPRTPPYAHPPRCP 

IWRGRVCVCVCVCVCARA 


2U63 




A ■ 




i tin 
10/ 


i 


TT T^TMQTW QTJVT.T .WnOJlVAnJVr'\N<3 <5T 
X X J-i X1MO A rVOXi v iJiinUon vny/iu X 

LGGQGGRITRSGDRDHPGQHGETLSRA 


2064 


15965 


A 


2079 


3 


364 


HETRSRHSCAWCSAALWRAAVASRCPSS 
/IPVTPPQCIiYWP/WKVPIiQCPPDL 


2065 


15966 


A 

* 


2080 


303 


2 


RTPRRLCKSQLDRCS PLKERHKFGLSRS 
VLSAMTQSGIY/WQPPPPEFKRCFSCIiN 


2066 


15967 


A 


2081 


4 


326 


AG I TGMSHRAWIiFL YFLNKFAFT\ YGLV 
LNFFLHKIGEPSLG\SGSGP /lscns 


2067 


15968 


A 


.2082 


1 


343 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 

PPPPPPPPPPPPPSPPPPPPPPPPP/PP 
PSSPSSPPPPPHTPHP 


2068 

■ 


15969 


A 


2083 


330 


88 

t 


ACDRSIISRIHKEIi\EQI.CKQK\SKNPI 
KKWPKUM I R YFSKQDVQTVKTMNKGSTS 
L I I I EMQ I RSTMRYYLTTVRTPHPS 


2069 


15970 




2084 


1 


340 


RVRSHGTTHLAQIil FVLLVQTGFHHDGQ 
DGPDLL/NLVIRPPQPPKVL 
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SEQ ID 
NO: of 

Jl UHCUlJUC . 

sequence 


SEQ ID 
NO: of 

sequence 


M. 
eth 

Art 


SEQ ID 
NO: in 

1 ISSN 

09/515,1 

26 


Predicted 
beginning 

n ucl&ntidfi 

location 
correspond 
ing to first 
amino acid 

rp^ifliif* nf 

I CAIU Uv Ul 

peptide 
sequence 


Predict- 
ed end 
nude- 

MM HImw^ 

otide 
location 
correspon 
ding to 
laet amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenvlalanine. 
G=Glycine, H-Histidine, I-Isoteutine, 
K=Lysine, L=Leucine, M-Methlonine, 
N^Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T^Threonine. V=Valine. 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop cod on, /^possible 
n ucleotide deletion, ^possible 
nucleotide insertion 


zu /U 

* 




T\ 


ZU5D 


L 




ARAL S L»G AN AAQNF KLR PLLRYH SONPR 
ALKNYAKYTL P VLHRWHS ES LETAFT E C 
FTEYFKPTVETYCSEKD/I/SYKALTiLI 
EKALGHPRALMEMYKDIS IVFMPVNTTS 


2071 


15972 

i 


A 


2086 


369 


2 


KairPPSPANLFFl»SVY/MGFPMFTRLIIj 
ISCPCYPPPSASPSAGIKGLNPPGWPPF 
S FF YOKFVRFFVLARNRTFGFVD YFS I L 
YLIYGCSNLYRFLLSALYFFVSVFVFVL 
TQGIEAMVPRA 


2072 


15973 

* 

i 


A 


2087 


314 


3 


LLESVGPTRNSRPFKGLIGRILLDPEFY . 
PSLVSDDLPASASPNAGRILPFFFFETG 
S Hfl \ TO AGVO WRHHGSLO\ LKRS 3 YLSL 
TSSWVYRHTPPHPANICIFSRA 


2073 

i 


15974 


A 


2088 


330 


16 


CPCFFLSALSVLVGWCFAFWCGCV/WW 
CVCFWVFVCVCFLGCWLCLCFVACL 

v 0 v t**v*> v vrr v9V9U>*v v 


2074 


15975 


A . 


2089 


1 


337 


GTRTFLPPSYKDPCEY/IWAHPDNPGSS 
SNCNMIiNFSTSD/PPVHSSGNWKVLSSP 
NRPYYYSYTATPHTDPTPHLPSPNPSSP 
SPSYPI^SDSTICQTTPTTTPITSSHTL 


2075 


15976 


A 


2090 


350 


84 


QRKENKRTRKRATERRDESREEKAGRKE 
GENQEKRNKEQQGRQRRSRBRT/EEEKE 
EAKRREHKNPKKKKTKPPPQKKKKTEKK 


2076 

* 


15977 


A 


2091 


1 


355 


SDPPTSASQSAGITYVSHRTWPliLEFSG 
TS IRLAGKPAGVLVEVTGK/ S VCGGGVT 
KTHWNECHTGYPKCCWSSQAGESSLQPP 
PPttFKRFS CfTi <3LPS S WDYRLLPP \ONFC 
IFSRDGL 


2077 


15978 


A 


2092 


27 


345 


ASIPCLKKRKKKKKKTGGKPPP\GGPKP 
KRGGGGG P PPT KRTLFGPPQGNQTPGGV 
Q /RPPTTflDPPPSPT PPnORKSPPFEKGG 
KKGPPPWRGVKREKGHFLKNFQQK 


2078 


15979 

■ 


A 


2093 


3 


301 


HEHVAQAGLKLLGSSDLPTLASPKCWDC 
kr\dycawphifpi sgplyvffplprms 
PHTHTHKYTRTFFHYAHKHCACTIiNIiTiTi 
RAO FOCH F FKG WPD P 


2079 


15980 


A 


2094 


76 


342 


WFXXFFFLLFVFFVFWFVCFCVFCGFFF 
FCFFCCVWFVFGFCLFLCFLFWFCFFVF 
FGFFGFWFFVFFLVCFLVFVCFFGCFFF 
LFCFF 


2080 


15981 


A 


2095 


115 


325 


MDERKKIRGGGRQGKEC2CIHCKKKLSPG 
IRSYPVEN/F/VDTMYDYLQPAYYKLND 
LTNADPCAVRYLLFDQN 


2081 


15982 


A 


2096 


80 


227 


SCXiGN/CIHLYSHSPTLSFTHTHTHTHT 
HRERERERERERE I CICMSVYA 


2082 


15983 


A 


2097 


308 

mm WW 


1 

Mi 

H 


NSTVTMENSVNIHYRTRVFTEAQFTIAK 
SWNQPKCPSILEWIKKL\WIYIYVCVYI 
CVCVC I CVCVCVCVCVCVCI Y I YI^MEYY 
SAIKRNEIiAAFAVTWTRA 


2083 


15984 


A 


2098 


2 


361 


ARACLGLPSSWDF/SVETRFHRVGOAGL 
ELATSRDPPTSASPECRDCRH\DHHTWP i 
LL 


2084 


15985 


A 


2099 


1 


221 


LLWRIiRHENHLNLGGRGCSEPRLHHCTP 
SWMTR/ GKTPSQKKKTKQPMBWQNIiCLK 
KVFGNTG VRE I FNG I KLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 

■ 

t 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 

26 . 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Phenylalamne, 
OGIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
lN=Asparagine, P^Proiine, 
Q=Glutamine, R^Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2085 


15986 


A 


2100 


186 


2 


ERVYVCVCVCVCVCVYQCYLPQIRSIi/V 

r^rtT TJ" H ITT. TTT 1~J~1T TJTT 1*J~T1 T-TTT I.TI r j~|T T 11 I_~I¥ r_~t¥ ¥ 

AS S LKANARVC V Cv CvCV CV CVCVCVCV 
SLWHPLTA 


2086 


15987 


A 


2101 


305 


2 


HNILLSQSLIQSKALTLFNSMKA/B/RG 
EEAAE EKLEASRGWLMRFKE \RSHLCN\ 
TKVQGEVASADTEAAPSYTEDHSKITDE 
GGYTKQQ I FNVDKKAS YWNMS 


2087 


15988 


A 


2102 


6 


166 


EQTALEILARAIRQEKEIKGIQIRKEEV 
KLCFFVNDXMVLYIiENSKDFLKVDAA 


2088 


15989 


A 


2103 


53 


176 


E I KNNRPGMVAHAY/NPSTLGGRGRQIS 
WGQE PETS LVNMVK 


2089 


15990 


A 


2104 


3 


267 


FRHVGQGGLKVLTSGDPPALVSQSAGIT ! 
GVSHCAQPIVGDFNTPLSIL/D/RSTRQ 
KI NKD I QDLNSALDEADLLD I YRTLHPK 
/STEYTFF 


2090 


15991 


k 


2105 


268 


3 


PPKEHGSSPATEQSWMENDFDELREEGF 
RRSNYSE/LREDIQTKGKEVENFEKNLE 
ECITRITNTEKCLKELMELKI KARELRP 
ECRSLR 


2091 


15992 


A 


2106 


3 


170 


GFHHVDQAGLELLTPQVTHP \LGLPKCW 
DYRREPPCLASPHFHQIAISQKRHREAK 


2092 


15993 


A 


2107 


1 


398 


SARGPDGFTAEFYQTFKEELVQILLKQF 
QRIKGEILL/KNHYVKPSITLIPK\PG\ 
RDITKKLLTRSFCVSLCPVLSPLQSLQS 
RPSSLSMlSIiHSVCFSVLSASAHPHVHL 
CPTCPTLVLSGSHCVCCVSLFFF 


2093 


15994 


A 


2108 


3 


370 

• 


HENWNNKGWGDSTI YSMKYLYFKPMLRP 
Y/C/SQKKI PFKI LLLFDNAPGHPRVLM 
E I MYKDMEVF\MPVNTTF/ ILQPMDQRV 
I LTFKS YYLRNT FHKT IAAINS D YS DGS 
GQSQLKTFWKGFIVL 


2094 


15995 


A 


2109 


1 


213 


HFP VENESAPG/ FKAAGDLLTLLLGGNA 
AGDFKLKALLVYPSENPCFLKGSFKPNL 
PLVWCSHKKAWVQLG 


2095 


15996 


A 


2110 


391 


3 


KKKKKN PHRKKI KKRWKKI FNAERNKKR 
AGVHHILDRLDFKQQTIRRDKR\GYYT 
MINGTIQQEVITILNIHAANTKAVRYIK 
QVLLKLKTELGPNTIITGDTNTLLLSTL 
NRSSRQKNQQTLDIilCNI 


2096 


15997 


A 


2U1 


188 


365 


FQNTIHICVCVCVCVCVCVCL/CVCFCV 
FFFCLLCGGFMCGCWCDFCILFCFYGVG 
FFFL 


2097 


15998 


A 


2112 


3 


340 


RMESALDRIiKQFTTWGDTGDFHAVDEY 
KPHDATTNPS L ILA VAQMPA YQ ELEEEA 
IAYGRKLGGSQEDQ I INAI DKLS VLLGA 
EILKMITGRVSTEVDARLSFD/SDAMVA 
TA 


2098 


15999 


A 


2U3 


11 


305 


FLFTDFCLFMTHILGHXINYITN/CKRN 
VTIl/SYfc SPHNRIKljKIblKKIbKKSS 
NTWKLNNRLLHYPQ I KDE VSRE IRKYLE 
LN INENTNF /QNLWDIHK 


2099 


16000 


A 


2114 


3 


387 


QTNH/NIPLSQNLIQSKAITA/NCMKAE 
RSEEAAGKFEASRGWFMRFKKSSY\IKV 
QGEAASAGVEAVAIYPDLAELIDE/GCY 
tt/qi FSVFQTAFFWKKKPSRTFM/REB 
KLI POLKAS KDSSSLLLRVHAAGD 


2100 


16001 


A 


2115 


152 


393 


VYCPlCWIiVFFFCFCSVLILFVMFV/CL 
CFSFFCFLGFWVFSS FYLFIGVFFFIV 
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SEQID 
NO: of 
nucleotide 
sequence 

■* 


SEQID 
NO: of 
peptide 

PA/i \\ An A A 

sequence 


M , 

etb 

_ j 

Ou 


SEQID 
NO: in 

26 

■ 


Predicted 
beginning 
nucicoiiuc 

lUCHUUIJ 

correspond 
ing to first 

ttniiuv uliu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucie- 
oiiue 

location 
correspon 

Ulug it! 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alaninc 
OCysteine, D=Aspartic Acid, , 
jii^vviuiamic aciu, jt — jrnenyiaianine, 

vj— wiyLiiic, ri — noLiuinej i— iso leucine, 
K=Lysine, L=Leucine, M=Methk>nine, 
N=Asparagine, P^Proline, 

T=Threonine, V«=Valine> 
W=Tryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














.T/2r"T PT.P\7\7TTnT.'T?T.T?\7T.1?UT.r , T.T?Pl7Q 
* Uu^IjC ±j\* v v j? 1/Jjjt jjr v ijr vjjwjjr r wo 


2101 


16002 


A 


2116 


2 


373 


ARACLGLPSSWDF/ SVENRFHRVGQAGG 
LL 


2102 


16003 


A 


2117 


3 


449 


HEFDHAMLQAHRAHHIAIDAYHEPEETY 
I PKDQKYS FLHDSQTS FCFSDS I PTPFN 
MBETQQKSNLELLRISLLLIESWLEPVR 

trr Df?uf7i vrvrr TJVT\fPononnvDOT vr\r uc 
VLiKbMr ANNJjv iJJlbUyUUiJtlr LiKDIdSr* 

GIQTLMGRAEKRHCRTVQNLKQTYRR/F 

DTNS/HNHDALL 


2103 


16004 

■ 


A 


2118 


146 


15 


FFFFYFFFLFFFFFI PFLFFFFFLFFFF 
FFx F I FIIjFIYSLLVF 


2104 


16005} 


A 


2119 


406 


3 


LFSVNETGFYS KMLSRTFTATEBTS I PG 
F KVS KDKLTL \ LGANAAGD FKLKPML I Y 
HS/ENPRALKNYAKFTPPVLYKWKNKAL 
MTTHLFTARYTBYFKPTVETY/ ILLLID 
NAPTHQRALMEM YKE I NVFMPAN I IS I 


2105 


16006 


A . 


2120 


109 . 


396 


YYFYF YLVLFYFFI FLFFYFFFFFILLL 
FYPFPS FI FFFPLFVYFI FLCYFFLFFF 
FFFFYLFFYSFFFFFIFLFFPFFFFFFF 
SYYLIFFIiSYII 


2106 


16007 


A 


2121 


15 


413 


IVLARNTNFWLSFliFPVALGILIVIiKGV 
KYIFWPLEYCQRLKMFVSYSFHY\FFLG 
SLLFLKYGFHMYL I LLL / C I FI 1 1 MCFF 
I KYSFFFCCLYHFFFS FYLPFLYFLIF\ 
CYLV I L FFS FLL FLFLSS YCFFF 


2107 

* 


16008 

■ 


A 

• 


2122 


1306 


429 


SSSSSSSSHVLRIIKDEDFKILEQRQW 
LS EKEAQALCKE YENED YFNKL I ENMTS 
GPSLALVLLRDNGLQYWKQLLGPRTVEE 
AIEYFPESLCAQFAMDSLPVNQLYGSDS 
LETAEREIQHFFPLQSTLGLIKPHATSE 
/HKRGPSMVMILTKWNAVAEWRRLMGPT 
DPEEAxUjI^PD&IRAQFGISKJjKNIVHG 
ASNAYEAKEWNRLFEDPEEN 


2108 


16009 


A 


2123 


3 


206 


LRRLRQENHLNWRGGGCSEPRSQHCAAA 
w / vsNS is rpQiyyuuvius iuuuuuvnliPJNo 
ALKKTYSQRGKLF | 


2109 


16010 


A 


2124 


23 


401 


I ASGRP FFF FFFFFFF WPPP PGG YPHFS 
Tpj mfvvrDprj/yr'iroT.T DorzuriDxrD'DvX 

r ijlVKJvAJ\Jwwi^LjJS^ f fix \ 

WGFPLFPPPLFPQKTPPPPFFFYKTPPI 
PPQGPRGGPLKFTPPGGGGGNPPTXLLD 
KKRGVLGPPPFFWTN 


2110 


16011 


A 


2125 . 


3 


439 


Mr UvbJjJ-il c IrioJjr lir a V\jr rixCJvL. tS.Ua 1 
LCDGGELRDHILLPTSICPITR/DKCSC 
PGEGC 


2113 


16012 


A 

4 


2126 


1 


213 


HFPVENESAPG/ FKAAGDLLTLLLGGNA 
PLVWCSHKKAWVQLG 


Al XL 


lOUl J 




9107 


\f\A 
Wit 




NS FFFEEELYNPFGGLGKKTFFGGGEEP 
GHTPPBNEALGGKNKFFTGEGGQTFSNN 
GEEKSVS PWI S IEKILHRALL/AHALCK 
NCWELNFGQKEEPFFPPPEEF 


2113 


16014 


A 


2128 


399 


140 


PPPPKNFFFPPKGKFFG/RG/VGPKFPP 
PKKRVFSQKPPRGFFYPPLKKKNNPFPP 
PGNFGPPRGFFKRPPPFFFFFFFFFFFP 
FFFFF 


2114 


16015 


A 


2129 


10 


457 . 


KTSWTWCPVPWPATQKAE/AGGSPEPG 
RSRLL 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 

09/5154 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ammo acid 

residue of 

peptide 

sequence 

* 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
L=Glutamic Acid, F=Phenylalamne, 
OGIycine, H=Histidirie, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=OIntamine, K=Arginine, S=Senne, 
T=Threonine, V=VaIlne, 
W=Tryptophan, y^Tyrosme, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2115 


\ 16016 


A 


2130 


409 


0 


PPPPSPPSPPPPPPS/PSPSPPPPPPSP 
PSSPPPSPPPPSPPPP/PPPPPPSPPPS 
PPPSPSPPPPPSSPPPSPPPSPPPSSSS 
SSPPPL 


2116 


m ^ #» « mm 

16017 


A 


2131 


317 


« mmmm 

177 


FFFFFFFFFFFFFFFFFFFFLRLQIFFF 

m TJ T T TOT T?T OCT PPL'I DD 

r ul? xulb ub Ltrr Lit? br Lift 


2117 


16018 


A 


2132 


3 


542 


EP.WSVAQAGVQWRDLS PLQ PLPPRFERF 
SRLSLPSSWDYRRLPAHPANF\QFLVET 
VFRHVGQAGLELLTSGHLPALTSQSAGI 
TGMSRTS \ RPGFLFKV 


2118 


16019 


A 


2133 


146 


439 


LKSVTSIAKTWIQPK/CPTNSEVDKCPS 
TVKWIKKMWYI/YFTMEYYAAIK 


2119 


16020 

* 


A 


2134 


95 


1 


GR VDAI LAHCKLR / LPGFTLFS CLSLPS 
SWDC 


2120 


16021 


A 


2135 


2 


380 


THTHHTH/THTHTHTHTHTSLS 


2121 


16022 


A 


2136 


4 


337 


KRNNKAWMTVHLFTAWFPEYFRPTVETY 
CSEKKIPFKIL/LLVDNAPGQPRVLVEM 
HKEMNWFRPANTASILQPMNQGGISTF 
NSYYLRNTFHKAIVAIDSNSSDGFGQNK 


2122 


16023 


A 


2137 


2 


356 


PVSSSQVRASVYLKKKKKKGPPPEGITN 
TAGE KP PQS FRGKG\ P P F PL I S PKKE P V 
I\SFLKNFGPCTIGKKKPPHPPAKNGGP 
LQ/RPPTQGGGRGKKKGKNNQRPLSGLG 
GNRSPKPPF 


2123 


16024 


A 


2138 


2 


357 


FLGSSDPPPSAS P VGRATG / RVFFFFFL 
VE WSHYVAQNGLELLDTSNP PAVASQS 
VRITCVSHRTWLLSPLYKIIQVCVSK/S 
PE LEQS EDKSLKK 


2124 


16025 


A 


2139 


379 

i 


16 


LLQVRCFVSTWRGSSCQKTIQVYYVQE 
AIPPSFLLSPFLM/PYTKINSRWIKDSN 
VKPKTIK\TLEENLGMPLNI FFTYQLLW 
LYLHPESQLEICNSFRALQEGNLIIFFI 
GRVGRPGTTGL 


2125 


16026 


A 


2140 


73 


411 


NYLLNNLFFFFFLERGLTFAPRAGGWGG 
NLTSWNLGPPGPNKPPPFPPKRP\GTPK 
PTLKEGLFGFFKTTGFPPGAQKGPELPG 
LRGPPGIAPPRGGNKGGNPPPGProKTFN 
G 


2126 


16027 


A 


2141 


46 


421 


AGVS WRDHSSLQ PCLTS RARAV\ SHLS F . 
LSSRDYGPMPPPPKKKKKKKKKKKKKKK 
KK KKKKKKKKKKKKKKKKQ 


2127 


16028 


A 


2142 


26 


479 


LAYLLPS PVMYKPQNWRTILQS I SNKGL 
I CR I YKETLQLNHKKTNNP I FFKWVKKS 
KHFTRGAI QVANK \ HI KMKLN 1 1 9 VYGN 
AQKNTMRYHHTP IKMKKMGCALWLTPVM 
CQEAGY I LKMYPGWTPYPGRWGRPTAP 
ALP PRGTFPRD 


2128 


16029 




2143 


9 


166 


QNRLLARLTKKKRGKNQAHP/ IKNDKGD 


2129 


16030 


A 


2144 


1 


328 


LEKESRPPPPPPPPPPPPPSPP/PHPGS 
LP KP LGLQ PLPRPE VSFLTE YLPKLKAC 
E GGG VE I AAAS FPR I YLMGMCS RQDR I Q 
KDIDWIQKSPJVEDCLFAGVKAILKA 


2130 


16031 


A 


2145 


1 


440 


KTFLRSLWQLVB/ CYISSGLIDDHRRPM 
ALTPRHPRSGITAPRPRPQPPGRVGIPE 
PTALSPSPGPPPPPCSTPGRCQVPSLBR 
RRKEGREPPSVGRGCGGHGISPSSDIFF 
HLNFCLAHPPDLFVFVCSISNQLYIFHE 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M, 
etb 
od 

• 


SEQID 
NO: in 

26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otioe 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anlne 
OCysteine, D=Aspartic Acid, 
jj^Criutamic Acid, E=PnenyIa!anine, 

yin/llllfl U-Uis^l'JlHA 1-ToAlnHAlnA 

vi=ijriycine, /i— wistidine, i^isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
Q=G\utnmine, R=Afginine, S=Serlne, 
T=Threonlne, V=Valine, 
W^lryptopnan, Y^lyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide; deletion, ^possible 
nucleotide insertion 














KKKKKGGR 


2131 


16032 


A 


2146 


23 


422 


IASGRPFFFFWGAPGGEEGKKNFFWGPR 

GGKKKIRGAPTPRGKGNPPPKPWGKKGG 

rdtivmt/'t wwp a AOPtmr'iJfvrar'inr 
AFPl\WXjolibT31^GAQRijFPGWJe KTPCaiUt 

KSPPPGPXKGKKNRGRP\GPPPQIFFKP 

FFSTGEKNPLGKKTPNWRLLPW 


2132 


16033 


A 


2147 


3 


372 


KKNTPFFLTPRGPPPPPRGPLSPPPKRA 
TPP PP PFKKKKPGFPPKKKNFFPPPGGG 
PPPP/LPGEKKGGYF/WW 


2133 


16034 


A 


2148 


363 


1 


FNCRWFETKSHSVPQAGEQWHS FGPLQP. 
LPPGFK\EFSCLSL 


2134 


16035 


A 


2149 


2 


361 


FFFLVETGFHQVGQGGLEHLTSDDPPTF 
GLP\GGWDYR 


2135 


16036 


A 


2150 


387 


2 


KKS F FLVS PARVQ WGDPFNPP I PGSNNF 
PFSTPPKTGVIRGPPPARKIFVFFIKTG 
\FPQLGRGVLKSLPQ/CDSPPPAPPKGG 
VSGGNPPAPPFFFFFFEMBSHSVSQVGV . 
QWRNLGSLQPPS PGFKQF 


2136 


16037 

• 


A 


2151 


23 


421 


IASGRPFFFFFFFFFFFFLGGRGVFFYP 
PGGGGGPNFFKKTPFPPGKKKFFPPPSP 
KGGFLTPPPPPPFFFFFLKKXGVINGGG 
GGVKI SPPGGTPPFLPQKGG/NKKGGPP 
PPRKKKI FLFFFPGAI KQRPPP 


2137 - 


16038 


A 


2152 


3 


366 


VIWAHCIFHLLGYTDRPCIiKKKKKGFPG 
GEKMGLKNFFPPLGKKPPPPQIPNFWSV 
IEKPPRGFYLGGAAPPKKFFLLKP\GPP 
FFKEPPPPKKKGGASPPLKKKFFKGKKS 
PPFFFSDPTS 


2138 


16039 


A 


2153 


423 


253 


FFWGGGAP I FPPPKKGFFPKNPPGVFFS 
PPKKKKIFFFP/HPVNFGPPKDFFKRPP 
P 


2139 


16040 


A 


2154 


287 


I 


KEMNKKKNP KKKKVS KKI S TAF YNKTLN 
NINFF\ FAFFFFFFFFFFI FFFFFFFFF 
FFFFFFFFFFFFFFFFRFLFLFMYKQTA 
QLMQHSAPTRP 


2140 


16041 


A 


2155 


433 


2 


RRVLFIFPPL/HPFSPPPLPFSPFPPPP 
PMKFFSPPPTFFYFYKSPPHPP/HPPQV 
VFSPPTPPMPFITPPPPPHTSPSFPSFF 
FFFPFFFFFFFPSFFFFFF FFFFFFFFF 
FFFFFFFFFFFFHI ISIKGBIGRSSWR 
VMSRTRG 


2141 


16042 


A 


2156 


69 

• 


A**%. A* 

606 


LWWPPLSRHAAYRQWPGPTAPRGLGHKV 
KGPGAS PA\ PGGAAAGSTAQG \GGGAAC 
LPGPAAGAVTWPAGPGGGRATGPVLQR 
PAGAGQPTQQGQHDHAGRVLWQHGIGRP 
GAHRPGPCAP/DSAPRAPGGRCG 


2142 


16043 




2157 

• 

• 

• 


449 


0 


SSPGSSSSCSPPPPGGGVGFFL/YKPPQ 
KKPPPPPVGGPGFFPQKKLKSFFPLPPP 

P PT /IfJfVJD T5 DDT) DDTTT TTQ T7VP PPTfWQ T? 

PPPPKKAFLPPPPPPLPPPPPSPPQPPP 
LF 


2143 


16044 


A 


2158 


241 


5 


KKFSFFPPGGSQGGFFGSLQNPPPGFTP 
FFCLNLGKKWGQRGPPPRPNNF/SFFFF 
FFFFLVETGFHHISQDGLDLLTS 


2144 


16045 


A 


2159 


434 


2 

* 


LGSASQLGCSGVRDPLBEAVCPFSDLQL 
CARRTTALFKAVRQGHLSLQRLLLSF\V 
CLCPVPRSGAYRGRQASLSCGGLHPVQA 1 
SRLLCLPKQAWAMAGAP/PPASLQPCSL 
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SEQID 
NO: of 
nucieouue 
sequence 

■ 


SEQID 
NO: of 
pepuuc 
sequence 

i 


M 
eth 

OU 

■ 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

IIUllvulluC 

location 

r nrresnond . 

ing to first 
amino acid 
residue of 
peptide 

cortitpnf*i> 

■ 


Predict- 
ed end 

otide 
location 
correspon 
ding to- 
last amino 
acid 

peptide 

ca/i llPfl ft* 

ffVU UVII K- Vr 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartlc Add, 

K=CiIutamie Acid F=Phpnvlnlfinine. 
G=Glycine, H=Histtdine, I=lsoleucine, 
K=L/Vsine. L=Leijcine. M=Methionine. 

*■ A/ JWIUVJ m,m A'VMVIflV* 1 'Jl i"lWII|| V • J * 

N=Asparagine, ps=Proline, 
Q=GIutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

V=I In Irnnvvn *=Stnnmffnn /=nnccibfe 

nucleotide deletion, \=possible 

niirlfntiH^ in<;ertinn 

IIUVIWUIIUw IIUV1 UUII 














ISDCCASNQRDSVGVGPSEPRAGYNLLV 
RCFLSP 


2145 


16046 

■ 


A 


2160 


448. 

- 


84 


PLGGFFFFFFWFPPPS PPQKEAL\QKKI 
PFVFFS PESNQKNFFFSFSERGGPPNFL 
FjrnciGPX fs pwfijFFFFFFWFffTjWFf 

FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFIjGKC PIjKYT 


2146 


16047 


A 


2161 


3 


156 


RGCSEPRLRHCTPAWVT\KETVSQKKKK 

WTiTT/^rj If TFKTJFFF V JMRAPGTFP 


2147 


16048 

» 


A 


2162 

i 


412 


37 


FFHSPPPPPPPAGGAVFPPKKKKKSPPP 
PTPLLL/RGGGGPPPPPPPKRGPPPQKP 
KRGFFSPPKKKKKFFPGPGGPPGPPQKT 
ppppppPFFFFFFFFFFFFFFFFStjFFF 

CCCCrCCEE E E E E E E K EEL tL.EE t?J-IJ. J. E 

FLRTLWRKGRGRGI 


2148 


16049 




2163 


5 


inn 


n«3 QCVfl EJT iVPWPTTDRaEBRCJPT .RPR S 

ERLQ/ CSHCTPAWA 


2149 


16050 


A 


2164 


437 


36 


FFFFFSEAESCSVAQAGVQWHARSLPQP 


2150 


16051 


A 


2165 


3 


623 


RQGFTLVAWAGV QWYD LGSLQP P PTGFK 
RFSCLSLPSSWNYRHAPPCPANFVFLEB 
TGFLHVGQAGLELPT,SGDLPASASQSAG 
ITGMSHCA/RPKVCSYHLFF 


2151 


16052 


A 


2166 


337 


0 


)*Vrtree&rB'trir&/ jrolrlre'r'xriror'r'O \co 
PPP 


2152 


16053 


A 


2167 


350 


156 


AEATSKIRCQKCY IM IAGHSGTRLQFQL 
LQRLRQENHLNPRA/RGCNEPRSHHRTW 
PTWYISKSFLA 


2153 

• 


16054 


A 


2168 


1 


337 


DERSLSQRSRSWSYNGYYSDLSTARHSG 

PLKTRKKPRGGLLKAHPFWGGPPPGFFL 
TOEGAPPPVFFKKKKKKPPLGGKGGFLW 
G 


2154 


16055 


A 


2169 


2 


606 


K. V uKA V AAtltSCi r JL/ Jvoo lUSJMrHiu^VolrKvr 
VKRQRRSS SGGSQEKRGRPS QEP PIiAP P 
HPJ«SQPP\HPGPLPPTNAAPTVPGPVB 
PT.T.T .P PP PP PSTJVPAGPIWAAPIjPAPST 
SALFTFSPLWSAAGPKHKGHKERHKHH 
HH\PAPMVIPAPAEPI 


2155 

■ 


16056 


A 


2170 


311 


2 


GIiQPLGIiGSVKQCMDI*ACVPETVCVCVC 

V \_ V WUUU1 X IT V i \JS3\* V CutJ A " ~« «** * 

C/VCWACTCVPVCMHTCVGVGASVCMQ 
RNELGRWAWENDAIRQQRC 


2156 


16057 


A 


2171 


78 


337 


NTPNS IYKAMSLKGPITGTFLPNYPGHK 
VCVCDTYL CVY QHTHTHTHTHTHTQ AF P 
HI/HIYTHT 


2157 


16058 


A 


2172 


2 


278 


KN RLN PGGGG CS KLR S CH CTLAW AT \ S E 
TPSQKKKEKRKRRGRKRKPJCKKIVYTKI 
EKGNAKMTGESFIHFCKKSSSVPQVALS 
AEYRCSST 


2158 


16059 


A 


2173 


2 


365 


f yhvgqaglkllts sdp palglpkcwdy 
rre\d caq pnvks 


2159 


16060 


A 


2174 


195 


389 


FRVFSKLKYYYDFFRGRISLSCPGWSTT 
PGFKRLSCLSLPSSWDYRRPPPCPAN\F 
CIFSRDGVSP 


2160 


16061 


A 


2175 


23 


374 

* 


IASGRPLIFFFFFFFFFFLFCRGGGPPF 
NPRGKERGGFLIKGPPPPGGKKNFWAPP 
PGGGE I KRTPPPPGVFFFFLKKKGFS FG 
GGGGKK/PPPPGEPPPPPPQKGGKKKKK 



676 



WO 01/64835 PCT/US01/04927 

■ • ■ 



SEQID 
NO: of 

DUClcUllUC 

5 CtJ UG1JLC 

• 


SEQID 
NO: of 

3u|UCULC 


M, 
eth 
ou 

• 


SEQID 
NO: in 

09/515 1 . 
26 

t 


Predicted 
beginning 

n II r bkAtf do 
II u cnsu UUC 

correspond 
ine to first 
amino acid 
residue of 
■ nent id e ■ 

sequence 


Predict- 
ed end 

11 U LIC** 

ntirlp 

location 
cor res Don 
ding to 
last amino 
acid 

residue of 

peptide 

sentience 


Amino acid sequence (A**Alanine 
C=Cysteine, D=Aspartlc Acid, 

F=fJhltamlc Acfd FsPhonvlfllnnino 
i/**jiuuiiuii AkiU) m. a net I y lal all Ills, 

G=Glvclne. H=Histidinc I=Isaleucine. 
K=Lysine, ^Leucine, M=Methionine, 
N=Asparaeine, P=Proline» 
Q»Glutamine, R*Arginine, S-Serine, 
T=»Threonine, V^Valine, 
W=TrVDtoohan. Y-=Tvroslne^ 
X=Un known, *=Stop codon, /---possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TRGPLF 


91K1 

2101 


\\J\J\3A 




^1 /□ 


i 


J Jo 


FFPLKDSLSVK)AGVOW\PVDSIiOPIiPP 
RFKRFSCLSLPS SWDYRCVPQCPAN 


71<9 


1 AClfxl 
1 OUOj 




9177 






PRFHFLASPSEMSOMTVKAKTTVP^SKd 
AYPKIEPFFPF\DPRGSBS\FDKLGVHH 
HPLFPLSGEPLMILHKNTH 


2163 


• 16064 


A 


2178 


3 


256 


YLKKDLNVNQKTLTLLB/EKQQK/HIiHD i 
VQNFNAVKD PVKRMKRQASD / WENI FT 


1 1 

21t>4 






21 /y 




i 
i 


SWDC 


2165 


16066 


A 


2180 


1 


143 


ARGERERERERERERERBRERERERBRB 

IvQJtsV f V / tUSKiiftJJO KJ5IttJii>XW<M BKOwJJJr 


2166 


16067 


A 


2181 


1 


304 


ARGQGHLSLQRLTiTjSF WCLCPOPTGGA 

WAIAGAPPAALLPPCSLISDCCASNQRD 
S VGVGPS KPCVGYNLLW 


2167 


16068 


A 


2182 


3 


163 


FQRRSTKSCGWDKDARSQSRSPPRQQAH 

nuuc /M'i > ui'i'i-i'i'tJAW T rH T T T u7m i rmvtyT , pp 
vjrulo / nlnlniii/ulln irUinixiAji iJtvr 


2168 


16069 


A 


2183 


3 


205 


NFKLFCKGFFFLRQSL/DSVSQAGVQRH 
GVQRHDHNSLQP 


2169 


16070 


A 


2184 


228 


88 


WFFRIYDSFFLVFIiFVFFFFFFFFFFFF 
r cLv Ltrctrv i r xyivv i jl 


2170 


16071 


A 


2185 


287 


3 


S LFLAIPQWE FQRKI YLRKQKI VULACQ 

£roi\.^jjj^± vyv v i\.o\^l_iTio ir V Lil\jj\j,Uc x Jr \ 

PRFKRFLCINFLSRWSYRHVPPRPDSFV 
FLVETGFLHLV 


2,1/1 


10U72 




2 loo 


i^i 

202 




CTHTHTHTHTHTHTRVELQGPA 


2172 


16073 . 


A 


2187 


267 


459 


KHVPGSSYTWL/ IFFFRGNFLKKGANFV 
PQGVLKGGNLSSLYPPPPRLKQYSCLTL j 

ui\ xuh x Ann 


2173 


16074 


A 


2188 


170 


15 


RSVMKDLNSHFSKENIQMANKRTKECSA 
LWFRE I K/ TTMRCHLTP I RMATIKK 


1 1 1A 

2174 


lOU/J 


A 


21 o9 


*> 

2 


DU4 


n\/T t q Trtin. <3 7x K\m r m t .VTTnwvnif t tov 

l/V X Xk3 1 LilAOiiiV VuX t%_Li V i; XJH X UTlvX XUfV 

T PYPHPRGMTVS VKQLFSTLP VRHKE FQ 
RNIKKGLG\RRSCFCFDF 


2173 


lOlF/o 


7V 

ii 


11 OA ^ 


i 
1 




SDSPASASRVAGTTGACHHTWV/ IFFVF 
PVETRFHHVSQDGLDFL/NLVIRPPRPP 
KVLG 


2176 


16077 


A 


2191 


142 


382 


NTPPIiLFFFVIRDRVSLCCPRWSGVAQF 

lffr^QliTV*l\CPV\KT?R^rTJ5T,P /^Mwrivpr* 
tt i rin i iadaV \mir ovuour/ onnuinv 

VP 


2177 


16078 


A 


2192 


138 


365 


KHQYHHCC/ l^KKKKKKKKKIOCKKKKKK 
KKKK1KNKKHGGPFKKNFW/EGQPRNWA 
GGV 


9!7£ 




a 

■ 


710? 




71 


RGEI FFFKTRRKKFS POGGRGG VFPPS P 
PKNFFFPQGGKPFGG/ EGGPKVPPPKKG 
GFPKKPQGGFKVPPKKKKKI ISPPGVIG 
GPPGNFLKGAPPFFFFFFFFFFFFFFPF 
FFFLVRAVKLS 


2179 


16080 


A 


2194 


415 


56 

• 


PPPPTTAPVFSPPPPPRPFFSPPPPVPF 
FRSFPPAPPPPPLFFPPPSPP\PSFPPP 
PPPTRPCPPPPPTI PPPPPPFSPPPPPF 
FFFFFFFFFSIiLP FFFFFFPFF FFFSFI* 
FLFFPSRLW 
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S£QU> 
NO: of 

II ULICUIIUC 

sequence 


S£QH> 
NO: of 

pcpuuc 

sequence 

■ 


M 
eth 

CJU 


SEQDO 
NO: in 

09/515,1 
26 


Predicted 
beginning 

n it rl Anttfto 

II U tICUIIIIC 

location 
correspond 
ing to first 
amino acid 

i*p%iftiiA nf 
I volUUv VI 

peptide 
sequence 


Predict- 
ed end 

uuvlt?- 

otide 
location 
correspon 
ding to 

1 <k o 4- «mSfiA 
■ laai aiiiiuv 

acid 

residue of 
peptide 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 

1 1 1 trj m if Arid K^Phon via Ian t no 
e/^^jiuuiiuil /\uu ( iJciijriuiaiiinc, 

G=Glycine, H=HistidIne, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine f P^Proline, 
Q=Giutamine, R=Arginine, S^Serine, 

Tfc=Thre«niiie V=sVnlineI 

A 1 III SUIIIMC) T f dllUC) 

W=Tryptophan, Y=Tyrosine> 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 

nuvicotiuc insertion 


2180 


16081 


A 


2195 


35 


369 


MKRPSPPPPTPPPPPPPPHPLLPPFPEK 
KTOPP /RGFFKGGERGPPPPKICKKDPPP 

QKQKK.KKKWGGGGGGGGQRKKKHPQKNP 
PPPPGRGGGKNFLWGRPRGPPPQGGGGK 
K 


0191 




A 
A 




o 
Z 


zoo 


KGLPLGGPGEKTI FFSLKPGQKAQPKNK 
IPFKKKIPFFSTPKGKKKKNFQGGPKKK 
KKEI 


2182 


16083 


A 


2197 


1 


277 


ARGERERERERERERERERERERERERE 

O IT CCD 'DUO /DDDI7DI7 


2183 


16084 

i 


A 


2198 


1 


285 


ARGERERERERERERERERERERERERK 
RERERERERERERERERERERBRGGAPP 

TJTDT VO'PDtTT WrtDnDT7T.t>DVDS r Din? 7dQ 

f KJj JC KCitCV Li KJXUKvKf lurtr V K£i JUvtv/ Ko 
PVQTLPGY 


2184 


16085 


A 


2199 


346 


1 


VPRHGLPCFCFPHQLQPPKRKKFNNRKR 

AT moTITTtPin T L.' L l n l PPPC< T7\ /OPPOOHTO 

ALiTisHlvr r VrhllfioLoVA/ KfitabAiiS 

AHCHL\ CIMRSSNS PASGGAGSSELRFR 
HCTPVGQQTE IWSPKNKPIKKEEGCPYC 
QFLSC 


2185 


16086 


A 




3 


389 


HbQMlli/^UoJjNLir^pUL'/ AoilvAKl 1C 
VHHHVQIiILF IFVETK/ FSHYAVQAG/L 
ELLSSSDP/PTSQSSRITGMSHH 








2186 


16087 


A 


2201 


3 


204 


HECHCTPAWAT \ SETLSQKKKKKKKKKK 
LFHLKRGPLKLF 


2187 


16088 


A 




69 


379 


KKKRVFFWGPGGSKPPT/SGNPPPWPPK 




GGGLtRGIiPPPPGPRGVlFKNFGKRGPPP 
PPGLKS WGPRDF FGIiALQRGGI SGLNNG 

Ar ir If r r A3 V IjIVAJjF r Lie If V o ftAa 


2188 

■ 


16089 


A 


2203 


81 


379 


KKKKKPPPPKKKNPGPKKQEGGPLRGPP 

PP PGAE KKNP ALGGE KKTLGPNGRGPKR 
LRGLAKKGNPPSS FGGK 


2189 


16090 


A 


2204 


1 


341 

■ 


GQ S L P VS P RLQ LS NGNRTLTLLS VTRND 
VGPYECE IQNPASANFSDPVTLNVLCEY 

T.T T?\ DVBOJiTQPWDDQnCDnTJIXTDTJRQTr 
Li Lit? \ tr Vii^Hl Olrlv ir rCC>V:K±rdJ-»o V rLwOR 

NADS PPTPRNPARHDFLPQANMGRPSLD 
QE 


2190 


16091 


A 




2 


370 


DAPPRPANFVFLVKTGFPPVGQAGFKLP 








PPGDPPPLASQITGKG/HCAQPPF 


2191 


16092 


A 


2206 


362 


Oil 

24 


DDPCOCBCCCCCCCCBODCCCCCCCCCC 

SSSPF\PPPPVLKPPPPPPKKSPPPKKK 














GKRAKPPFQKKKKKRKRESSFYQQVIIH 
M 


2192 


16093 


A 




357 


3 


AGQALWLARVI PVLWEARS YYVRISWYR 








NI VAMDS DSLDGSGHS KLKS FRKGFPVL 
YAI KKTHDS WEGVS \MSAL I VWKSLIPS 
CVDAFEELTS SAEEVAAQWG IATDLEL 1 
LVGCARA 


2193 


16094 


A 


2208 


280 


411 


GNDVYFLVFLF/CLFEKESCSVAQAGVQ 
WHDLS S LQALPPRRMA 


2194 


16095 


A 


2209 . 


3 


392 


PllYEKYTOQINEMlPRKLQHQQlAL^R 
NGPILLHDNAQLHITQC/LQKLNELGYR 
VLPHLPYSLDLS PT0YYFFKHLDNF\LQ 
GKHFHNQEDAENVSQKFVKSQSMDFYAT 
/GNKL I S HWPNCVD CN 
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SEQID 
NO: of 
nucleotide , 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M. 
eth 
od 

■ 


SEQID 
NO: In 

dssn ■ 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

■ A. ^ m A. 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~AIanfne 
OCysteine, D=Aspartic Acid, 
E=Giutamic Acid, ^Phenylalanine, 
OGlycine, H=Histidine, Msofeucine, 
K=Lysine,L=Leucine,M=Methk)nine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=»Arginine, S=Seririe, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

%^ If 1 A *T*% A * ft •« ft 

X-Un known, *=Stop codon, A=possible 
nucleotide deletion, \= possible 
nucleotide insertion 


2195 


16096 


A 


2210 


248 


417 


GILANCAS IKpLISRI YKKLKQIN/KQK 
TNDPLKKWAKDMNRYFSQBEIQGVNKHM 
KK 


2196 


16097 


A 


2211 


213 


493 


GKRFY FACPGKI GALQGFFKGGGPFFFF 
FFFLTPPPNTNYLTNKLKIKSPLCTFYP 
SNILGFYPSITHRTIPDAWVTA/SQSGP 
VLMRSIS 


2197 


16098 


A 


2212 


159 


413 


liAPRVI FGPPKKPPERPPLFFFFFBTES 
S S VTQ AGVQWHNLS S \ P PRF KRFY / CLS 
LPSNRDYKRPPPHPANFCTFSRDGVS 


2198 


16099 

r 

i 


A 

• 


2213 

■ 


455 


60 


NPRREVGPICPPPKIRVPPQNPQVGFYS 
PPLKEKTFTSPAPVNI/3PPRDPFKRPPP 
FFFFFFLDFFFFYFTIYKTSAACPRSPP 
AQPRVARRPLVPSS \PPLPSLCLAPAPR 
GPGSLCPRGSLEGDNGSSPG 


2199 


16100 


A 


2214 


2 


243 


LTLS PRLKCNGMVSAHCNFCLLSSSDS / 
RLR/QENCLNLGGGGCSELRLRHCTPAW 
TTERDSVSKTNKKNSLKIFTCFFVDAA 


2200 


16101 


A 


2215 


1 


286 


FSQIiLRLFIC/SQGGRLLCCESCPASFH 
PECLS IEMPEGCWNCNDCKAGKKLHYKQ 
IVWVKLGNYRQVFPRTKRKYSIIVQTSF 
ILWIQSDLEIDR 


2201 


16102 


A 


2216 


173 


2 


I T I FFFVRQGLTQAGVQWR \DLSGETLV 
ILPPLEAPCSLQSSWDYRRVLPHLPNFC 
IV 


2202 

• 


16103 


A 


2217 


216 

• 


4 


VYPPSFMVFSQVFLSSTHISLSFSFFWN 
YLF I YLF I YLF I YLRWSL/ DS VTQAGVK 
WHNLGSLQPLIiPGFK 


2203 


16104 


A 


2218 


14 


228 


KRSSHLFTDDI I LYMENTKHS TK/NLLE 
LIKEISKVTG/YQKSVAFLYVNNKQAIK 
KTIPLTIASKRIKNSGQA 


2204 


16105 


A 

* 


2219 


244 


3 


EVLNQNSGliPRWPNILRAKASLRVPRQC 
SRGWFSANGAGTTRYLYAKEWGGIjGGG 

GCS klrs chctp vwat \setlsqkk 


2205 


16106 


A 




162 


2 


inmvgnlfffgdrvslchagwsavapsw 
ptvastsl\vkqssflslpsswnhrh 


2206 


16107 


A 


2221 


146 


3 


grvdgvpwrnpgslqppsp\gssdppts 
asqesgttgahhhtrlifvf 


2207 


16108 


A 


7773 


239 


2 


syiskpdkfphdsspeikpvtvnwrnsf 
sfifppfflppekeshsvtoagvqwrnl 
gvssywp\aglklltsgdppalas 


2208 


16109 


A 


2223 


2 


159 


lnrdlggggcselrs chctpawat\ses 
psqkkkrkkkkkkkkggrgrnski 


2209 

• 


16110 


A 

■ 


2224 


3 


345 


RFKLFS CLSLPSS WDYRRVPPRPA/NFF 
VFLIET\GFAILTSSKTERQSRLECIFG 
FYGLP CRE KRAS ERRS VEG/HERKILFS 
FDFFFLGGTESCFVTQAGVQGCYLGSPQ . 
PPPPG 


2210 


16111 


A 


7775 


103 


319 


rotblRNV X NF JjNXj irlbtJo Clti LiVK/ C / 
WSEGYMATPCTILUjLFFQRHCLTLSPG 
GVQWCSHSSLQPQTPGIK . 


2211 


16112 


A 


2226 


2 


110 


FHHVGQAGLELLTQWS IHLSLPKCWDF \ 
RHEPPHPA 


2212 


16113 


A 


2227 


2 


178 


IFLIFTFLEMSSHYVAQAGLEFPG/FKL 
TSRLSLLSSWDYRRPPPRLANFFAFIAK 
GDAA 


2213 


16114 


A 


2228 


173 


3 


FIFIFISLFFIFFLRQSIVLSPSAVQSR 
IiQPPPPRFKQFS\CLSLLSSWDYKRVPP 
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nucleotide 
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M 
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SEQID 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
odde 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenyla!amne, 
o=Glycine, H^Histidine, I=lsoieucine, 
K=Lysine, L=Leucine, M a Methionine, 
N=Asparagine, P^Proline, 
Q=Clutamine, R-Arginine, S=Serine, 

T_.TL — ir_.ir.ii_ 

T^Threonine, V^Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, Wpossible 
nucleotide insertion 














C 


2214 


16115 

• 


A 


2229 

■ 


333 

■ 


1 


HTPPPPPTFHSLCTEGAPPPPQNT/PNP 
PRNPFI IPLSRSHTRANEPSLSTPRTHP 
HSPRPPLLTHPHPNPRASAPPGSRYPPR 
ARHRERPRERPTQRERERERESVCVCV 


22J5 


16116 


A 


2230 


350 


3 


KKKD I PPVNI YAPNTGAHKY I KQ I LLDL 
NRDGP \HTITAGDFTTPLS VUNSSAESR 
GSRPFI ISPSLWGLRE 


2216 


16117 


A 


2231 

i 


277 . 


1 


QMHPARGHLPQALI PVQKPALISQGMSA 
SGSTQVSPPLSPCFWVEVNCSNTKVLT\ 
PPCGAGTGSHS FAQAGAHWCNRGSLQPH 
PPGFKRL 


2217 


16118 

* 


A 


2232 


3 


335 


ETRFHHVGQAGLEPLTSDDPPASASQSV 
/GITGVSLRARPGES/GSWKTVCCNNMS 
EPT 


2218 


16119 


A 


2233 


29 


448 


CPSLRQAWHEAAI DEVRTGT YRQLFHPE 
QLITGKEDAANNYARGHYTIGKEIIDLV 
LDRIRKLADQCTGLQ\GFLVFHS FQGGT 
GSGFTS LLMERLS VDYGKKS KLE FS I YP 
APQVFTAWEPYNFILTTHTTLEHSDCA 


2219 


16120 


A 


2234 


1 


365 


GARLILVFLEETGFHYVGQAGIjELLTSS 
DPPASA/FPKCW 


2220 


16121 


A 


2235 


28 


460 


DRL I OH I S KLGTRGLQGFLVFHS CGNLG 
\1\3FGFTSLLMERLSLDYGKKAKLEFS I 
YPAPQVSTAGVEP YNS ILTTHTTLEHSD 
CAFMVDNBAI YD I CRKNLD I ERPTYTNL 
NRVISQIVSSITASIiRFDGALNVDLTEF 
QTNLV 


2221 

■ 


16122 

■ 


A 


2236 


614 

* 


84 

* 


LAASIiAACAQLSALAASHRMWALQRLRK 
LLTTE FGQS INI NRLLGENDGETRALS F 
TGSALAALVKGLPEAIjQRQFE YEDP \ IV 
RGGKQLLH/SPHFFKVLVASRIiVTLEAG 
HFCP CCAETHKW\AWFRR YCMASRVAVA 
LDKRTPLPRVFLDEVAAVRVCGHIIjQLG 
DTELQQHICHL 


2222 

> 


16123 


A 


2237 


1 


393 


GPMLAQLSVFRCGSTSAPNDLWYHFIBL 
PYHGESITMLIALPTESSTPLSAIIPEM 
STKTIDRWMSIMVPKKVQVILPKFTAVA 
QTDLKEPLKDLGITDIVDSSQGHFCQIT 
KAENLLV/SHILQKQK 


2223 


16124 


A 


2238 


3 


402 


HVGQAGVQI GKACWELYWLEHG IQPEGQ 
MPSDKT I GGGNDS FNTFFSBTGAGKHVP 
RAVFVDLEPrVIDEGRTCTYRQLFHPEQ 
LI TGKEDAANNYARGHYT IGKE I I DLVL 
\DRIRKLA\DQCTGSQGFLGFP 


2224 


16125 


A 


2239 


2 


478 

* 


GRGGLHR I P WT PLTPS FARGLVPSLAR 
GVEARNGAGPIKSYPRPGSRLKMQNGSK 
GSGLQNKTFHWE I CDAHVNSKIQLKQ\H 

1 o oKKn KJJK V ALi I\ir J_» JVfcr J\ I Kir i IN xvU^Ko , 

PS ILAAKLAFQKDMMNPLAPAFLSSPLA 
AAEAVS SALTLP PRP FCFV 


2225 


16126 


A 


2240 


255 


2 


FLFVKPHQI SCPTKKG I KSFLVLCPLNF 
FLFWRQDLAHPGWSAVTQSWLTAASN\Y 
GLKQSSYFSLLSSWDYRCIPPHLGGKRP 
L 


2226 


16127 


A 


2241 


365 


11 


EPPPPGGEKKKRGKPPPKKTPKKDGPQK 
KSAAFFGG\GKIKKKKGAFRKKKKKGGG ! 
PPPKKKPPPRKKKKKKNFSPLPPPKHTP 
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n miiio acid 

residue of 
neotide * 
sequence 


Predict- 
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n ii d pi- 
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location 
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last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 

Es=GIutamic Acid F^PhAnvlnlnninA 

G=Gly cine, H=Histidine, l=lsoleucine, 
N=Asparagine, P=Proline, 

0=Giutamine. R=Araininp S=S*»rlne. 

T=Threonine, V=Valine, 
W=Trvoto D han. Y=Tvrosine_ 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SLPQKPPTKI FLFSPPLLFLNILFFFPL 
SPFIF 


2227 


16128 


A . 


2242 


3 


143 


PNFLSPRLDCRGTIRGHCGLYLPGSCDP 
S/CLSPPSSWNYRSTPP 






A 






7 


MSAHCNLHLPGS SDS PAS VSOVAQ IMQA 
CHI F\VFLVQTKPHHI 


2229 


16130 


A 


2244 


334 


43 


TS KHMKRCS I S LATREMQNKTTTGGHFV 

NLASPLWKTIWHYLVKCLPYSSAIFTLG 
\IYPEEVLASSVPGT 


2230 


16131 

t 


A 


2245 


1 1 o 

312 


2 


WQWRDIiQPLKPPPPESKQPSCPTIiLSI 
CVCVCVCVCVCVCAVCIYIN 


2231 


16132 


J% 


2246 


3 


231/ 


AGLKLLAS SDP PTS ASQ S AO I TG VSHHT 
WPQPLTFCPHAKSLPFINQI 


2232 


16133 


A 


2247 


2 


214 


GRVDLVI QAGVQWHDLGSIiHPPLP \GSS 
FVETGS /HTQVSNS YD VL 


2233 


16134 


A 


2248 


3 . 


139 


EAGTAGSHYHVWLIFLLF 


2234 


16135 


A 


2249 


3 


160 


EGVQGCHHSSLQPPTPG\SSDPPTSAS I 
vjnsj muoninv viijxr ixu£v~r Duunn 


2235 


16136 


A 


2250 


319 


3 


GQKR YKI FLCFFLFGWLVFIiRWSLTLS P 

KEVLAKYGTKFFFVFFGLFVWFFLRWSL 
/NS VAOAGVO WRDPGS LGAS PRP 


2236 


16137 


A 


2251 


15 


394 


FVSFSFFPSFPSHLFFSSSSFPPFIiPSF 
HFSFLPSDRPS WPS /FLPSFLP 


2237 


16138 


A 


2252 


60 


306 


GRERILEEIMAEDFIaNLMKDLNISIQVA 
QQIPSKMNSKRPHRYQHFRSQSQRILKA 
TREKOLATYKGSSVT /VSPSPGPOTVNS 


2238 


16139 


A 


2253 


3 

V 


351 


G FHHVDQANLKLLTS S DLPAS TSQSTGI 
TGI\DHCTQPNFPI 


2239 


16140 


A 


2254 


150 


2 


RPRRPDHSRLGAGDQPGQHSKTPS \HQK 
KTKTS QAWRHAPAI AGTRQAEA 


2240 


\C\ At 

16141 




2255 


357 


i 
1 


KFLGGKNGANLMDTRLDNAFLDLTSKZ^JQ 
LTKEKIDKLNPTKIOTF /CSTKDLTjGNI. 
KRQAPE E KKI LRNH I SNKEL VS R IHKE V 


2241 


16142 


A 


2256 


397 


1 


FSLFPPVGGQGGFFSSCKSPPPRFRAFF 
CPNPFRKGGNRGPPPHPGKGTLGFFFFF 
FRQS FALS PRVEYNLCLPGS SDPPAS AS 
QVGGAPGLPPPAWVNFGIFPF/CFFlxRQ 
SRSVAQARMQWRHLGSLQAMP 




10145 


a 
n, 


99<i7 


2 




TLTLTPRIiECSGTIS /AAHCKIxHLPGSR 
HS PAS APRVAGCGGGHL 


2243 


16144 


A 


2258 


214 


347 


KISLILGVHKICCEF/ CFFEMKSRSVTQ 
AGVQGHDLS SLQPPPPGL 


2244 


16145 


A 


2259 


347 


2 


FFSFFFFFSEAESRSVA\RLBCSDTVSA 
HCTLHLPGS 


2245 


16146 


A 


2260 


333 


1 

* 


SDQRWTENAFVELRDEGFACPSFSSL/R 
STPSTSGEEVENFEKKLDECITRITNTB 
KCLKE LMEL KAKARE PREECRS LRSRHN 
QLEERVSVMEDQMNEMKREGKFREKRT 


2246 


16147 


A 


2261 


2 


357 


SPRSCSVYGIAILLFLYFLYKIiAFALLY 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartlc Acid, 

P — f^lnfti m 5r» A /*iH P' — -Ph/^n vlnlonino 
Hi — v*JU(<JDllC ACIU, J 1 — f ncnyialttlJIIIt:, 

— f^lvpinp V¥=Hist irlinp T=lcnlpnr , inp 

K=Lysine, L=Leucine, M=Methionine, 
N=As da rapine. P^PVoHnft. 

Q^GIutamine, R^Arginine, S=Serine, 
T=Threonine, V=VaIlne, 
W=Tryptophan, y=Tyrosine, 

X=TInknawn *=Stnn rnHnn /=snnccih!p 

nucleotide deletion, V=possible 
nucleotide insertion 














GLALNS PLHE IOEP / SLASGSGPLSRNS 


2247 


16148 


A 


2262 


355 


0 


PPLPSPliSPPLSLSFFSLPPFPFFFPFF 
PLF/CPSFSFFFPSPFFP 


2248 


16149 


A 


2263 


348 


35 


YFLKKGFSFFPRGEGRGKDSRSLKQLTF 
GFKKP/SCPSFLRKW\ETRLCPPAQKIF 
FFFFFLIiEMGFCYIAQGDLKGSSQSSGI 

io voininr ir iunjjinv.jjru\.v. - 


2249 


16150 


/■i 


ZZ64 


270 




nPDaPPIfMPPTY«J<lPTiPPPPFPPPPPPP. 

FFFFR\ FFPFFFFFFKFFFFLFSRWSLM 

TPTQ 


2250 


16151 


A 


2265 


1 


417 


FRPPAGVQ WRNL IS LQSLP PGFKRFLYL 
SLPSS/WDYRCMPPQIA/NFFVFWFDV 
VVVVVVVLVEM^RHVLARLDIVVLICI 

C T .MANfiMP VT .PT.C Q Ptt THT i P <5T.T .JfCP FT 

YFAHCLFLYYFLETGSHSVTETGWQWCI 
I 


2251 


16152 


A 


2266 


344 


427 


IilGLCEDTNLGAIHAKRVTLIAQDLQLA 


2252 


16153 


A 


2267 


310 


417 


RVPYGTLGSGPLATMAGFDDKLKPHMEG 
AAAPILVR 


2253 


16154 


A 


2268 


186 


464 


NSCLSHNQRQLLFPJjE KGMGP I SAQEKL 
VLHTLSG FSGLV\ VGWLVFEMES CS WQ 
AGVQWRVHPP PRFKQFS WLS F P S S WD YG 
HVPPCPANF 


2254 


16155 


A 


2269 


1 


427 

t 


PEPPSLPPDGAKKQKTQKSKKWRICFLR 
KKKKKKi^KKQKJuCKKKRGUCjSQi^^ 
GPKLTTETNII ILIKGGLKKMNYREIBK 
KLLFGGGGVIGPTPTQDIKGREEXNYLE 

avgrekqrfislvk/tnva/heatrdti 

FRGYL 


2255 


16156 


A 


2270 


399 


it A 

44 


X S r Q 1j LiIjP 1 TV LiFi? HVR VliMfcM I Jvb V ii V 
VFVP\STTTFMLQSMGQGVTLTFKSY\Y 
LRNTFNKAVAALTSDSSEESGQSQLKPV 
WAAFSSLNASKNI /RDSWEAVKI PAL KG 
VWKKWSLKM 


2256 


16157 


A 


2271 


19 


420 


AAGIRHEERERERERERE/QRERERERE 

FFFFKKKRRGFLVPTP PGGGGAQKKKTS 
LEGGKGVFFKGGGKKTPLKNPGGWGEPP 
QKKNGGGGPPPRDPPPPPRPLF 


2257 

■ 


16158 


A 


2272 

• 


463 


20 


S YNI PLS qs l I q / s RALTL fdstkaern 

HNI KVQDEAVS YPEDLDKMDALNTKQQ I 

FQAS KDRIi / LLLG/ ATAAGDLKLKPML I 
YNSKNPRVPRAEF 


2258 


16159 


A 


2273 

• 


474 


82 


VG VWAD FL KNT S QAQArKAKMD KWDP I K 

TSDKGLI /SRIDKELKQPYPJCNPNNPVL 
KWAKG WAQWLTP V I LALWEAKAGRTQGQ 
E IEATLALFSGLFCQVFLC 


2259 


16160 


A 


2274 


152 


3 


AEGRNGDG I XQKS S I \RTLLSNDKNPQN 
IHRRPTRFLTMLYQQNLCHLGL 


2260 


16161 


A 


2275 

• 


32 


361 


LGASARYEKPTVNLILNGERLNIFPIR/ 
S KTRLG YLL S LLL FNI GLA I LAS AI NQ K 
KEIQI IQIVKKKIKTQRKKKKNKTKKKK 
ALFKFKGGPEKEKGPPKNPFKTPPVVF 


2261 


16162 


A 


2276 


329 


487 


EFVNITIKIATSLHYKAIVIKA/AMVIW 
YWHKNRHIDQWNRIESLEINPHICNK 
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NO: of 
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SEQID 
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sequence 
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USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D-Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=G!ydne, H=Histidine,I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=»Serine, 
T=0Threonine, V«Valine, 
W^Tryptophan, Y=Tyroslne, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


2262 


16163 


A 


2277 


2 


466 


AHHSCSHLLTTAAHYSCSPQLLTTTAAH 
TYSPPQLLTPAHHHSCSHLLTTTAAHHS 
CSPPQVLTAAHH\GAHTCSPQLLTPAHH 
SSPQLLTPAHHSCPHLIiTTTAAHHSCSP 
PQLANS YRG \ S P YCS S WSQTPGFRKSSH 
VGLPEP WI FHGLQW 


2263 


16164 

• 

■ 


A 


2278 

• 


4 

- 


467 


KNVTQQGKIHIRDKLDEMWGNTSVPCTN 
HMKHQTNFNAKKCNVFKECX3K\TACNFQ 
IiTQYQISHANQKPYECQICGKPFRKRAH 
LTQHNRIHTGGKPYECKECGKVFICCST 
IjIQHKRTHTSBKPYECIiECRKTFRRSAH 
LIRHQRIHTGEKPYK 


2264 


16165 


A 


2279 


383 


3 


FLCVCACVCVYVCMCMCACAC\CMCMCG 
CVCVCVCVCVGLRGLGWGAWCRSWGPP 
LCFLLLG I L PLKS RLLWLPRRT I S I CTL 
PS AQGPLPAPG FGKYASNTTGVKGS SSV 
FSLLSRITALSHLHW 


2265 


16166 


A 


2280 


47 


219 


VCSELKS CHCTP AWAT \ S VTLSQKQNKT 
KQRRTLGSIFFQHTFMHLKKEKSLILQK 
W 


2266 


16167 

> 


A 


2281 


294 


160 


NKTFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFLFLF 


2267 


16168 


A 


2282 


68 

• 


490 


RTFAYPKPPVFGSHSGKTTIPKNARGSF 
PS/PPLPFPCPSPSPGPSPVSCLPPRLP 
SPSPSPSLSPISRLPSP/SRSPSPSLS 


2268 

■ 


16169 

■ 


A 


2283 

• 


3 

• 


454 


CQSAPLGGASQLGYSGVRDTLEEAVCPF 
SDIWLCAGRTTTLFKAVRQGHLSLQRFR 
LPFVWLCPAPRGGVYRGRQASLSCGGLH 
QVRASRPLCLPTQASAMAGAPPPASLPP 
CSLISDCCASNERG S TGMGP/S E PGTG 


2269 


16170 


A 


2285 


1 


452 


LKDSGRD YVS QFEG CAIrGKQLNLKLLDN 
WDSETSTFSKLREQLGPVTQEFWDNLEK 
DTEGLRQEMSKDLEEEKAKVQPYLDDFQ 
KKWQEBMELYRQKEEPLRAELQEGARQK 
LHEL/QEEIiSP/LGOBMLDRARAHVDAL 
RTHMAPYSDELRQ 


2270 


16171 


A 


2286 


3 


266 


NSSPPSSGHSTPRLAPPSPAREGTDKAV 
SAIiKSPQPNRGMGRGQR/PGLPS/DTAT 
ITPHTSGFPKQPQLSLKVQTQRAKGRLS 
HWDLEP 


2271 


16172 


A 


2287 


274 


453 


I YTFVKS SSKTLRPRHIDQWNRI ENP E I 
KP/EYSQLIFDKANKNIKWEKDTLLNKW 
CWDN 


2272 


16173 


A . 


2288 


81 


487 


TVYFKPTVETY \CWDKKKI PFKGLLRRD 
NTPCYPKSLLEMCEKINIVFAPATTTSS 
/LKPMDQGVIVTFKSNYLKNTF\RLGE3 
RKKKKKKGRKKSFHKALAAINSDSSDGA 
GQSKQKT/FWKGFS I PNAIKNI QDPWE 


2273 


16174 


A 


2289 


75 


469 


S RGVAGAPPKPS PS TPS PGPLD VTPGPH 
SSHHAASPGP\PPPPEPTASSMASAP\P 
PAPQPTPL\PPATLGPPSAGPE/PSPGS 
CTSTGWG YS FCCPRCRRWMRWQPPQQGP 
AWHWWPREFP/PPPRVSGS 


2274 


16175 


A 


2290 


256 


55 

# 


PTQPLRVELTTVTLRCDINKWDYIKLKS 
FCTA\ KKRQ PTEREKIFPNHVSNKRLIS 
KIYKELIHRI 


2275 


16176 


A 


2291 


497 


29 


S LTHR VAGGAAVT PLAHAG ARQ I FFLGD 
PHPTSSLLGWGPAWDPCAFQVSDHPASS 
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SEQ ID [ 
NO: of 
nucleotide 
sequence 

■ 

• 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 


SEQ ID 
NO: in 

T I COM 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ammo acio 

residue of 

peptide 

sequence 

< • 


Predict- 
ed end 
nucle- 
otide 

location . 
correspon 
aing to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

IT |n+n mt If* A*IH 1?— Dlinniiln 1 nnin a. 

e/— vjiuiaroic aciq, r— menyiaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Vj=V7iuiafDiiie, it— Argimne, &— oenne, 
■^Threonine, V=Valine, 
W==Tryptophan. Y=Tyrosine, 
a — unKnown, — oiopcouon,/=possiDte 
nucleotide deletion, ^possible 
nucleotide insertion 














RVGKTLLSCPGGGRLPWVTCPGGGQLS C 

GMCPRCMCVCVCPCRDVY/ICVCVCVCV 1 
CVCMCLCPCWDVFTEL 


#4 ^ 

2276 


16177 


A 


2292 


141 


392 


PQKKQTTSTSSLKSLLRLSKVRFLFNIV 
LDVLARE IRQBKE I KGIQ /IiGKEE 


2277 


16178 


A 


2293 


14 


291 


xvGTEGISr xSr MRV1TJ XM 1 KxtbAI JoKx. 
S\CYSR\VYPRFIEFLRFDIQSTGQ/RI 
TSRQHP PR / DLRD ALLYLNRR I TL VRTR 

u VAKK f Kab ir 


2278 


16179 


A 


2294 


110 


293 


LLSNRSIAIASLCGGCNELRSCHWTPA\ 
WRQRETVKKKKKKKKKPGWGFLNPPPQR 
GKLCF 


2279 


16180 


A 


2295 


96 


313 

> 


WNGCYLLSNRSLAIASLCGGCNELiRSCH 
WTPA\WRQRETVKKKK3CKRKKPGWGFLN 
PTPQREKLGFLKRGPGF 


2280 


16181 


A 


2296 


234 


2 


CCLBTTRSLFDKGTKBNTQWGKDSPSNK 
RCWKNWISTCKRMKLDPYLIP/YTNINV 
KY/IKDLNLRPEITKTJ.KBNLREK 


2281 


16182 


A 


2297 


334 


6 


KLF S P PGGGGGP P P P»PP KKG WVP PETP \ 
KRGGGAPNPPP PKGGGAPKP PPQKKNS P 
PKKKKKI F CP PPKKKGPKGVFLRGPRGP 
YRVFLKGPPLFFFFLKKSWRPLAMYA 


2282 

• 


16183 


A 


2298 


467 


8 


LPGFKASTDRDN\VAGDFKLKQMIjIYHS 
ESSSALKNDTKSTLPVLYRNKEAW/VTA 
HLL I P wCTEYFKIj IAETCCSERK IS FK I 
LLLI \DNAPSYPRALMKMFKINVFMSDN 
TTSIVYSTDQGVILTCNSYYliRNTFYKA 


2283 


16184 


A 


2299 


1 


449 


SIYLS IP/ FNLS INLSIYIiS IPIYLS I Y 
/HI SVYI YL/S I YLSS IHLSAI I CLS I F 
QSISLSLSINLSTYLSTimiSLSSIIYS 
SI IYHLSIH/LS YYIiS 


2284 


16185 


A 


2300 


1 


445 


QAGLQLLTSGDPRTSGLPQCWDYRC\DH 
RSWQT 


2285 


16186 


A 


2301 


241 


22 


KWI^TCACVHVFICVYMCLYVCAWV/C 
VSMOjWCAC^CMNQC/VTVCMCVCVCV 


2286 

* 


16187 


A 


2302 

• 


3 

■ 


478 

• 


GGQTETLLiTSQRKGGWPEAIiLTSQP^GRP 
five a pu t DTinirn vcq n zvimrr . dtvs & ivfin \ 

SAPHLPDGE \ PGRGAPHI PDGAAGQRRS 

PLPRWGG/ELGRGAPHIPDGAAGQRRSS 
wpp ypfipp^pnapWPPDTJATO /o/rviAP 

HFPGIPPDGT 


2287 


16188 


A 


2303 


440 


41 


KSHHLS FLS FLLFFFP / TKSHSVAQ/ CW 
SAISAHCNLCLPGSSHSSAIiASRVAVTT 
GVYRI 


ZZoo 


loloy 


A 




2. 




FFLVKTRFIiHVGOAGLKLPTSGGPPAIiA I 
SQ/SL/RITGMSHRTQPE 


2289 

■ 

• 


16190 


A 


2305 


184 


2 


S I KKP KPLVNLRNH/ NT/ WQGAVTPACN 
SSTLGGRGGRITRSGVQDQSGQRRESEF 
LRLGYGLD 


2290 


16191 


A 


2306 


86 


472 


I IKLCKWQNK/RFEAGSLVPEMGFYYVT 
QAGLEIiLFSRDPHTSASQSAGITGAAFH 
QRWSVGTVLLQVDRGTPPVGDCGSRTPQ 
WPGQAFLRTALKS EAHP PHS STDVTPVL 
WSEGSPCLIiSPPSI»SFTG 
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SEQTD 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 

. m -m 

peptide 
sequence 


M. 
eth 
od 

■ 

■ 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

ft A* 

location 
correspon 
ding to" 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanme, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, D=Leucine, M=Methionine, 
N<=Asparagine, PHProline, 
Q=G!utamine, R=Arginine, S=Serine, 
T^Threonlne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

>ftJT f 1 1 A i § mm m 

X^Un known, *=Stop codon,A=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


2291 


16192 


A 


2307 


133 


2 


RVIHVWRSHPAIP\TTREAEAGESPEP 
GGGGCNEPRSCHCTPAW 


2292 


16193 


A 


2308 


3 


369 


LTTAFPDFPGSTHPLTSASQVAWATGAH 
HHSWLIFVFFVETGFHL/ SELLSSSSPh 
AS AS QSAGITG / REPPCLTS P PFP 


2293 


16194 


A 


2309 


253 


365 


TD YFYTjFI YLRQS L/ S S VAQAGVQWQDL 
GSLQPPPSGFK 


2294 


16195 


A 


2310 


362 


0 


TKAQQCVHENHFKLKDANTLNIKVWRNI 
CHSSPNQKKYGLAILNLDKSGFRSRKDT 
GDBE/HFIKIKKSVIQEAI 1 1 INI YA . 


2295 


16196 

> 


A 


2311 


167 


2 


LC Y CV I FVFF I F WRQS LTLS QGGMQ WCD 
HGSIiQPLPPNGIiKRSSHLSLPNSWDYRR 


2296 


16197 


A 


2312 


67 


482 


DHLIPGGGGCSELQSHPCTPAWVTA/ET 
LSQKKKKKRBK 


2297 


16198 


A 


2313 


188 


437 


AGFFPPENQLTNMKVRKANESDPWGVKP 
ESDESETHGSLSLSLSLFFLFIFLRQSL 
/NSVIQAGVQWRNLSSLQRISP 


2298 


16199 


A 


2314 


41 


325 


TTTLFXRVRQGYSSLLMFLLPFVCLCPA 
GRGGVYRGRQSSLSCSRLHPVRASRPLC 
LPTQASAIAGAPPPALLPPCSLISVCCA 
SNEQGSVG*DP 


2299 


16200 


A 


2315 


417 


0 


SP PRNWDYRCVP PRPANFVFLVEMGF/ Y 
HVARPEPGLELPTSGEPPTIiA/FSKCWD 
YR 


2300 . 


16201 


A 


2316 


98 


288 


LMAVVPATWEAEAQESLEPGGWGGGEEG 
CS ELRS CHCT PAWVT\ SETLS HTHTKKK 
KKNGAAL 


2301 


16202 


A 


2317 


1 


410 


LNH I PNLSLTKRKPS PHS LNLKKKKKKK 
KKKKKKKKKKKKRGGGVKKKPRGGQKKK 
GGEKKNFPSKKGGKKKKRGEFGKKNFFG 
GGKKREKTPQKKKS PKGKKKNLR/ BERG 
EKNPKRGGEKKRSSSPRNNNLRGEKK 


2302 


16203 


A " 


2318 


249 


3 


PLKASSPPKAFNFCREVGPICPPPKKKV 
P\PKIPKLVFIPPPIRKKLLPCPPPLTL 
APPRVPLKRPP 


2303 


16204 


A 


2319 


2 


393 


AHLGLPKCWDYRHEPPRPAPLFLLNSYF 
GLDLLT /S/GDPPAIASQSAEITGVSHC 
AQ/PEYVY 


2304 


16205 


A 


2320 


389 


2 


RGNSNIGGPGPLRGKKFSPPPPLKNWGT 
KLGPPPPPPFFFFFF\GKPGFSIFFKPI 
LKFKGPKKPPPRAPKNGGSPGPGWFFFF 
FLSNFTQIiPGIKLSKGGKKNFASPPPPF 
FFFVHYKTGYP S ATPDAW 


2305 


16206 

• 


A 


2321 


389 


206 


FQWRWGF/NHVGQANLELLTSNDPPASA 
SRSAGTTGMSHHAQLKNYFLMVRMWRNW 
IAVGM 


2306 

- 


16207 


A 


2322 


371 


42 

■ 


SFPPQSGFFSPPPPHEFFFPPPPSFFSW 
VGVRQIPPPPKIFSSPFPPGGFFSPPPE 
R/VDr FrSPPPFFrPPFSFrXibfc'FPPrr 
FFFFFFFFFFFFFFFFFFFSFFGVKKRK 


2307 


16208 . 


A 


2323 


82 


386 


PFLTQKYFFTPPEEGFLKKPNRREGPPS 
PITDPTLWPNMMKGI VPKAPPI IFMGGG 
INMTFSG FVTTKAPFFRPLRFNPMLRQG 
FDLLTLKAS\WGSSASWY 


2308 


16209 


A 


2324 


1 


413 

* 


RGSGDNRHGPPCRVNFVFLVETGFLHVG 
RSGLELPTSPALASQSVGITGVSC/RPP 
PQASY 


2309 


16210 ' 


A 


2325 


58 


400 


SETLVSKKKKKKKKTPPPPKNPKKKKNP 
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SEQ IP 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

■ 


SEQ ID 
NO: in 
XJSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleutine, 
K=Lysine, LHLeucine, ^Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine> 
T=Threonlne, V=Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stopcodon,/=possible 
nucleotide deletion, possible 
nucleotide insertion 




• 


• 


■ 






PPPPPGAKKGPGGKF/PPPPIFQKKRGG 
FKK/SPPKKPPKKKKTPPPLGKKGPPPP 
RGGPLGGFPPPPPQKKNGPPPPKKKPPF 
! FFPPK 


2310 


16211 


A 


2326 


462 


306 


AHHNLRLPGS S DS PASASG VAGTT/GMC 
HHARPILYLSGDASAIiLHCFSSAQLF 


2311 


16212 


A 


2327 


1 


393 


SRPPSRTEKIRNFFFVETGSCYVAQAAL 
KLLDSSDPPTSASQSAG/ITGMSHRAQP 
TS 


2312 

* 


16213 


A 


2328 

i 


391 

_ 


2 


AWFKETKAGWIIPRSWDRGSWVSQPYSA 
LTSSPESGFHSVTNGDSMTCLEDGKHSL 
VAPHGI PQRGSSLQDGVSFCPAPTMKPY 
LASTKM\RLHEARELGFFSFLSLRQGLA 
LS PQLECSGAI LAHCSLNL 


2333 


16214 


A 


2329 


2 


115 


GCSELRSRFCTPAWAT \SKTASQKKKRG 
KKKKGGVFYF 


2314 


16215 


A 


2330 


2 


406 


AAAPSALAIiRDGWAVRPELDLLPPCGEE 
VAPGAHCLGCGPSPCLFLSPSHTRQSPP 
APTSSPGLSTSPPLVPTHVSAPHSSKGP 
PSIPGAQALRGCX5IjGGW0R\PSSPSLP/ 
PDVSPKPLNFAP 


2315 


16216 


A 


2332 


226 


377 


KRKS KH ITFLFKTLS WPDTVAHACNPST 
LGGQGGRI/TPRSGVRDQPDQHG 


2316 


16217 


A 


2333 


3 


191 


CLS PGGGGCGGLRLC YCMPAWVT\NBTV 
SQERKKERRKERE KE IKKE RKKRKKE RK 
KEKKKKG 


2317 


16218 


.A 


2334 


295 


81 


FFKPFFPPKKTCPEFS\FFWEKRGFPPP 
PLRSFFQNPPPKGGPPILLEGPPPPPSL 
GWPPPPPLFFFFFFLR 


2318 


16219 


A 


2335 


375 

i 


8 


TQIVPLPSNLGNKTRIjRLKKKKRNEQGN 
IPTOTIDNRIKQI IQ/T YYBQLCANK\N 
LDKMDKALESHNFPKIiKQRESLNl/HSA 
KEIHFI / ILNISTKKTPDPTGFTGKFLQ 
IFKEKKMAGHSGSHL 


2319 


16220 


A 


2336 


399 


97 


FFFFFFFKQNLA/SVTQVGVQGQYFRSL 
QPLPPRVKPFSFPNPLSNRGYRGPPLGR 
VRQENCPKXiKSKRFQLNKIPRLGKKKKL 
RFPKKKKKEKBKKIVKT 


2320 


16221 


A 


2337 


411 


57 


KKPRSFSSCSSSPPFFFFFSPPPKKKIF 
PPPQIFWGPPFFPPPPFFKPPPPPFFFS 
PQKKKKI \SPPPPKKKFFFKTPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFF 


2321 


16222 


A 


2338 


97 


354 


AKAPSLSLVLFSFSTFFLGIQGLALLPM 
L ECRGAI TAYCS LNLPDS SGP PTSAPS P 
TPY/RIAGATGTHHNALLLFKFFSRDGL 
PL 


2322 


16223 


A 


2339 


391 


45 


LMFFHLSHKHRSGEAPS IHWS I YLS IHP 
S IHPSLYHSS IHLS IHSTI YPSTCLS IH 
/ CIS IHPS I YPS ICPS VHLIiAHS FIHST 
ICPSNHLPIHVSIQHFLSAQILPVSVFG 
EVSDI 


2323 


16224 


A 


2340 


506 


0 


RDHEQLGIVRADKKKKKKKKKKIKKKKK 
KKKSSSSSSSPGTFFRG/VPLKDPVG 


2324 


16225 


A 


2341 


203 


1 


VELRVRATEPGFNFLKGI YCTSMVDWI K 
KMWYI YTME YHAAI KG/ DR I MS FVATWM 
ELEAIFLSKLMQ 


2325 


16226 


A 


2342 


402 

* 


40 


PYPPKKKASPTDA\FSSSSSSPPPFFFF 
SPPPKKGFFSKPFFFFSPRFFSFPPFLK 
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SEQW 
NO: of 
nucleotide 
sequence 


SEQID 
NO: Of 
peptide 
sequence 


M, 
eth 

Ou 

- 


SEQID 
NO: Id 

T1CCM . 

09/515,1 

■ 


Predicted 

beginning 

nucieoitae 

location 

correspona 

ing to first 

amino acid 

resiaue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

IT— flirta m it* Ai*tf1 TT' — Phnntilnlnnlna 

Ej~vj imam ic Acia, r~ rnenyiaianine, 
G=GJycine, H=Histidine, I=IsoJeucine, 

I^zT.VcinP T =1 .PI 1 f»i n a IVIof li innina 
JCv— Ajjainc, ir-iiVUUDc! irieullOUluB, 

N=Asparagine, P^Proline, 
Q=Glutamine, R^Arginine, S^Serine, 
i— i nreonine, v*— valine, 
W=Tryptophan, Y^yrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possib)e 
nucleotide insertion 








■ 






SPPPLFFFFFFFFFVEAGVLLCYSG 


2326 


16227 


A . 


2343 

♦ 


181 


1 


QLKDQASDLLGKNGDE VKETI PS FLPS S 

T T5lnT7\ T DT T7T Ur\T?T T>C?TPT DCDO&DOT T O 

FLS 


2327 


16228 

• 


A 


2344 


2 


407 


FVASQLGCSGV\RVRDPLEBAVCPFSDL 
QLRAGRTTALFKAVRQGHLSLQRLIiLSF j 
\VFLCPAPRGGAYRGRQAFLSCGGLHPV 
GASRLLCLPKQAWAMAGAPPPASLPPCS 
LISDCCASNQRDSVGARPSEPGAGH 


2328 


16229 


A 


2345 


405 


178 


IIKFIYEKATA/LLNG/EKLKAFPLKLG 
TRQGCLHSLLVCNTVLBVLTGTLRQEKE 
IKGIIIEKKEVELSLFADRIL 


2329 

* 


16230 


A 


2346 


2 


412 


FKASKASLSPLLGANTAGDFKLKPGIjIY 
HSLH\LKNYADSILLVIiCQWNNKAWMIA 
HLFTAWFTEYFSPPLRPAQKKISFKMLti 
FIDS APS QPGVLMEMYKEINVVFMP ANT 
TSILQPMDQGVILTLKSYWLRDTFH 


2330 


16231 

» 


A 


2347 


2 


397 


hi bJjfci P<jKRKLjU('AR J. rlr Jjno ouDNR VKIi 
CLKKERKKEIGVLIRCWQECKIVQPLWE 
IVWYFLKKLN\ESPYEPAVPLLNIYPRE 
MKIHVYTNTCTQI FIVALFTIAKSGKWW 
GTVACACKPSSSE/WLRWVDYL 


2331 


16232 


A 


2348 


3 


423 


EGCSELGSHHCTPAWAT/AESVSQKKKK 
PKKK 


2332 


16233 


A 


2349 


49 


262 


QMCKGSNRRRGKRVGS RQ I S KKKTNAP I 
KKWAKDMNREF/DIQMANKHMEKCSTSI. 
I IREMQ I KSTMRYHLY 


2333 


16234 


A 


2350 


356 


2 


FVTAPLH S S LGNRARS YCKE KKKVQ VAA 
KAVLRGKF I IAYTVFKKRKI SNINLS I S 
LKTLEKEEHTETKADGAQYVTKI \RAKI 
NKIETANETKSRSLEKTSKTVEGKCLRD 
ItiSAQI 


2334 


16235 


A 


2351 


360 


19 


LDAQFLEVGLGELLFRSTVPTLQPPGCG 
ASFPVLNTLPF\SLSPSQSSSSPASLVR 
P WVAPPFLCPHGE PDGGPDSTTSLPFPG 
PAATGPAARLIQHPASRQPRPASHTHCG 
V 


2335 


16236 


A 


2352 


360 


0 


NTFIiAAFRLVFCQMTSYSLALLSHKLAL 1 

MPIiNLSDLLTRWTHCMGELFFIi^ 

NPFHTVFFLGHPEWGMESRFYVQAWQW 

PDLRQLQPS\PPGSMRFSCLSLPNSWNS 

PSYGRL 


2336 

. 


16237 


A . 


2353 


34 

• 

• 


474 

• 


EG^PCKEIiAARQVGCPHSCFS PHWQIjIj 
QKQEKTAGAVSVCVCTSTLCVCVCVCVC 
VC3\QAMCVCAGA/CFCVCVCAGA/CLCV 
CVGA/CLCVCAEAVSVCVQ/VAVSMCVC 

CGIPPPVXjCXN 


2337 


16238 


A 


2354 


297 


16 


KFFFLKSFFFFSFFFLTPPRFFFFFPKK 
KKIFFFPPRKIFFFFLIPPPPXFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFXRHGG 
WFEEITILTV 


2338 


16239 

• 


A 


2355 


3 


315 


PVTPATRETEAGETL\HDLGEPGGRGOG 
ELRS CHCTPAWVTEQDS VSKKKKKKRGA 
RFKESNFTTPGLQRNIFFLGALKIilSGA 
GVLKRRDGKTLGFPQFNRPWG 


2339 


16240 


A 


2356 


399 


154 


PGQRGE IPSLPKIQ/ ELAGCG/ GHLN PG 
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SEQID 
NO: of 

■ i ii vicwuuc 

sequence 

• 

• 


SEQID 
NO: of 

sequence 

♦ 


M 
eth 

uu 


SEQID 
NO: in 
USSN 

09/515,1 
26 


Predicted 
beginning 

n ii dp n t i fl<* 

location 
corresnond 

ing to first 
amino acid 
residue of 
peptide 

( 


Predict- 
ed end 

nfif*f A_ 

otide 
location 

lUtallUII 

correspon 
dinp to 

last amino 
acid 

residue of 

peptide 
sen ue nee 


Amino acid sequence (A B Alanine 
OCysteine, D=Aspartic Acid, 
us— vriuidiiuc Acidy r — mt nenyjaianine, 
G=Glyrine, H=Histidioe, I=Isoleucine, 

N=A5parogine, P=Proline, 

0=Gliitamine. R=Arf»inine <v=3Serin*» '"' 

\£ V* lUUJili lllb, IX /»i gill 111 v, i^nJvllllC| 

T=Threonine, V=VaHne, 
W=Tryptopban, Y=Tyroslne, 

X"— Hnlrnnwn *=Stfin enrfon /snnccihlp 

nucleotide deletion, V=possibIe 
nucleotide insertion 














GRI FGEPRSRHCTPAWATEQDS I CKSHS 
RSGLWCYKSSSVYLSTRGVWVRGIASV 






A 








FFFFFSETESRS VAOAGLRTX OWRNIjGS 
LQAPPPGFTPF 


2341 


16242 


A 


2358 


209 


2 


KKNLVPWPRGGYFKSLQPPPPGV/SCPN 

PPKKLEYRVLFPQPSNPFFFFCIFSRDG 
V^PPPPOW^PTPn 


2342 


16243 


A 


2359 


279 


380 


RGYNP / WPGAVAHT CNPSTLGGRGRQ I T 
noouyun 


2343 


16244 


A 


2360 

i 


415 


224 


FFFVFFFFLLI TF I LMLNLLPCHKFLFL 
QFI,IiGYLFLLYL\CAFFLLVALNIFITS 
FQQLDYYTF 1 


2344 

• 


16245 


A 


2361 


278 


2 


T OntlT?DT7UMT VD TUT^T TCT TrtTOCM^Tr 

S CQKFQI LEHFRFQ IRDAQSVLRK\RKA 
WTGAVAHACNPS PLGGRGGWI SRSGDLV 

UDflrtUC FT D 
ntr\3\^aaBt i. tr 


2345 


16246 


A 


2362 


139 


282 


KKKKGGRGGGGGRGGGGXXWGGTKKKKK 
GGE KKNLWGGG KGGGKGGGS 


2340 . 


16247 


A 


2363 


19 


409 


Pit* 1 F5 VLAaGoOP AR I PS PJjGGPNRP VP Jj 
G PGVGAP PGP PG KT P FFLKI KKI YP ARG 
GPPVIPASPGGEGKK^PLPPRPRVPLTQ 
IFPP/PPPPGGPNQG 


2347 


16248 


A 


2364 


159 


383 

i 


HSHFKNLSS 1 1 KKLHRNNTFTEHFSLSS 
£>.Ur4y t»r J-uNJj X V r x ortbuN r JSX* oJN/yWPG 
AVAHTCNPNTLGGRGGQITR 


2348 


16249 


A 


2365 


400 


221 


GRURQ ENCLNS GGRECSEPRS /HLHCAP 
AWAT \ E QDS VS KNKKQNKQ KQTH I YTVL 

liLAKrl 


2349 

• 


16250 


A 


2366 

• 


383 


14 


GGRGCNAPRSCHCTPAWVTERASPQKTK 
LQCPRAEGPSQKAARMELMEKQEKNQGP 
TPAQGLSLLICK 


2350 


16251 


A 


2367 


274 


1 


PPJCI LKARGKEHLAS RGTM I RMTS DFYL 
OTMOARREWSKIL/NVLEEKIHOHRIL 7 
PVKSS FKS EEE I KTFSDKQKLRGLVTSR 
SDLGKDVK 










i^i 

101 


£, 


P FFFF55 ETR scr \VTnaRVnWPITT/5<;TiP 
GSSDS PAS ASQVAW I TGTRHYAWLIF 






a 




JOl 




NTGRIil FVFLVEMGF V TMLARIASCDPPJV 
SASQSTGIRGMSHNSQLKCFTEFDSFC 


2353 


16254 


A 


2370 


116 


300 


HLNGDAVEERDFMKCTVSGI IVAHCNLE 
FLG/SSDPSASAPRVAGTTGMCHHIWLI 
FVTLVEM 








/ 1 


o 




l T ix\ii-iL/ Jrnu o tr i J. j\ v jw oicw x ivl/IjIM xuczrix l x 

KXLEPNXR/ ITLLGIGLGKDSMTRNPKA 
IAIKTKLAR 










1 
I 


i n 


QDSSPEKKKKKKKGGF 


2356 


16257 


A 


2373 


43 

• 

• 


403 


LHDSPAIiASQGAGTTGVSHHARPAAGIN 
S RIGQAEDR I S ELE D WLSEIR \ RQS GRN 
MDKRMKMNKQNLQE I RDYVETKSM/NTR 
LI GVP E R VGENG SNLENI FQDL IHENYK 
GKPIRLMVDL 


2357 


16258 


A 

• 


2374 


404 


215 


GQGGRITRAQKFETS1X3NWRPGSEAQB 
LLE\PDDRGCSEPRSCHSTPAWTTEQDS 
VSKKRRKC 


2358 


16259 


A 


2375 


3 


397 


SKQLEFTQLYTKLNQLNKTKISDLKKEK 
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SEQ ID 
NO: of 

nucicuiiuc 

sequence 

"t 

* 


SEQID 
NO: of 
pepucie 
sequence 


eth 

00 

■ 


SEQID 
NO: in 

09/515,1 
26 

■ 


Predicted 

beginning 

nucieouue 

location 

correspond 

in a tn fil*c4 

ing iu iirai 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
iiucic~ 
otide 
location 

ding to 
last amino 
add 

residue of 

pepiJue 

sequence 


Amino acid sequence (A«=Alanine 
C=Cysteine, D=Aspartic Acid, 

IT— fl 111 ♦ami I/% A fir\ C^-Plianiilal nn t ntt 

OGIycine, H=Histidine, I=lsoleucine, 
K-Lysine, L=Leucine, M-Methionine, 

Q=Glutamine, R=Arginine, S-Serine, 
T^breonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucieonae oeieuon, v^possiDie 
nucleotide insertion 


■ 






■ 






YPIRLFSKENTOMASRYRTKY9'IWjT TP 
EMQ I KTTFRHKIiAALAS S RCLLGLG \ AT 

OAnVvl VUvUrJJVi 


2359 


16260 

• 




2376 


413 


1 


PKKGLFPKPIIWVTPGFFPPPRFKKPPP 
KKI FGAPKKKKKSPPPPAKKFFFFKGAP 
PPFFFFFFFFFFFFFFSGDSQERVRBAM 

PVQTHKAQSPKLPA3ECPPPTTPLS 


ZioU 


16261 




2377 




247 


fCKr rixi/Ujifcii-Ujijiiijiji ooUJjrv4l_iffcoU \£>A 
GITGMSHHAQPSATHFQKHTjVS 


2361 


16262 


A 


2378 


2 


143 


QBNRLNPGGGGCSELRS CHCTS \AWVTT 
AKLCLKKKKKKKFFGKGGG 


2362 


16263 


A 


2379 

• 


417 


90 


FFFFFFFP<3GGRLQVCPPPGILLFCFI,Y 
KGGSPPGGSTIFPPPPPGEVGPPGPPPP 
GGFFFFFLE/ QGGVSPDGPGFFVLPTPK 

K.PF r P AFU KGG Jfi i* KJp KFQ V WGPPWPTF 


2363 


16264 


A 


2380 


36 


427 


VHPLNHHDQKGQASSTQKiCKKKKKKKS - 
KKKKKKKKG/ WKGGGAF 


2364 


16265 

* 


A 


2381 


402 


2 


NFFIiKGRGWGPLPPFLPRGPPPRGGPQK 
/ RGGI/3GPPP FFFGSKTP PAGGSRTPMG 
QKKKGPPLPEGPLFGGAGQAPFLPPPVP 

KGiWPbFiUXJSJU*/ APFFrFPFPFFPPPP 

PPFFFFFFLRYNLALLPRLGCSGT 


2365 . 


16266 


A 


2382 


166 


5 


THGHVIYVDQEMQMMIENM/WPGTVAHA 
CNRSTLGSRGGW ITRSGDLMVKPRLY 


2366 


16267 


A 


2383 


2 


457 


TSQPSLLSS WDYRSTSPRIjANF\ ILFYF 
IYFFFAFSVETGFHRVSQDGLNIiLTS/S 
/IPSIPRIPKHWDYRHDP\RTWP 


2367 


16268 


A 


2384 


2 


417 


GRVGFSQSNGNPSSLSFTLLKVDFEVTI 
PGEG KDRI FKVS X KWLAI VS WRMLHBAL 
V5GQ I P VPJjBS V/ UALDVAtoRHLjASMRY 


2368 


16269 


A 


2385 


318 


32 


TME IMIiDKKQ IQ VI F/ FEFKMGREAVET 
THNINYTSGPETVQWWFKKCCKGDESLE 
DEBCSGRPEVGNDQLRAI I AHASADAWV 
DRDSGRCCCACP 


2369 


16270 


A 


2386 


409 


*> 
3 


ISQAPSTPPRKRGFR/ PPQKTSFIiGPPF 
YAAFYQEKKFLFFSSDPPREtGDKGKQK 
GFPPPKVAPKKKGFFKKGPPPGKKKDPP 
SFFKHKSQPTRPPRAPALEGPRSR/SAA 

LKi FoD KKK / FFoUJUSJFiKFFlKFKlKP 


2370 


16271 


A 


2387 


415 


10 


KKGFFPFGPFIFLFPPPGFLPPPPPPIF 
»TSGFCPPPPPLFYFFFRGGPK/HILVF 
PLFFLPPPFFFFFPIFAKESPPKKGPGP 
uHXr ifr r (jFrvnXi / o fxrirsrcC c e dSPlcr tt 
SC^PGWSAKWHDLGSLQPPPPVFK 


2371 


16272 


A 


2388 


410 


95 


KAITOKSDHIKLKSFYPAKETPTKVKTQ 
PPEWEKVFANYPSDKGLI PI I YKELKQP 
YG \ KKSNNS I KKWAKDLNRYFS KD I QMA 
NRC3*4KRCSRPGAVAHTYNPST 


2372 


16273 . 

• 


A 


2389 


362 


92 


RFLFFFSPPPKKGFFPKPFFFFSPRVFP 
PPFFLNPPPKLIF\GPPKKKIFFPAPGG 
KKIFFLKGPPPFFFFFFFFFFFFFFFFF 
P VENT FY 


2373 


16274 


A 


2390 


131 


487 


ATEHEKTEKSS LSFFS IS KRKKKMEKLH 
DIGFSSNFIjG \ RP KAQATKGKTDWTSAK 
LKI CSSRDTI SRMKRQPKBWAKTFANKS 
CHSINKKPEraDKNKKraCECKKFljGGAIAKK 
TNLKPRG 
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SEQID 
NO: of 
nocreOtlue 

sequence 

- 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

* 


SEQ ID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucieonae 

location 

correspond 

ing to first 

* • r" • ■ 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- ■ 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
Hi — vjiuTHmic aciq, r<— rnenyiaiamne, 
G=GIycine, H-Histidine, I=Isoleuctne, 
K=Lysine, LMLeucine, M~Metbionine, 
N B Asparagine, P=Proline, 
V£=oiutamine, K— Argimne, o=oenne, 
T^Tbreonine, V=Valine, 
w— t ryptopnon, tyrosine, • 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2374 


16275 


A 


2391 


408 


145 


RWSIiALSPRLECSGVSSAPCKVPPPGVT 
Fr SCLiSLjFSWWEDKCESr FFAU/ IVr IG 
EGFYILHGFF/RRGPKIRCFISGCPPPV 


2375 


16276 


A 


2392 


1 


203 


LFFFAFSVETGFLHVGQAGLEPPTSGDP 


2376 

< 


16277 


A 


2393 


386 


16 


TPSAGGRHIELSLSTCPSCAQHQGKBHIi 
EGGEGGGAQSLTTAPSS ATSSQDP I S AH 
AVEDKLS IRI^ETD I LKTKS / W PGAVGHT 
v_NP& 1 JjGKKKQAiNHliKPGViUJQP^Xj V JL(~ 
RPRPPKVMGLQA 


2377 


16278 


A 


2394 


3 


396 


QLLERLKQEDRLSTGSQGCSELRSHHCT 
PAWATEP \DSVSK 


2378 


16279 


A 


2395 


273 


416 


FFYSFLIKIRWKKQPGMVAQACNPSTLG 
GQGGRI /TLRTGVRDQPGQHG 


2379 


16280 


A 


2396 


329 


76 


FI P I E S ACSQECLKPNLRQE W / YI FGTL 
KLI FFETESHP VTQ/DWSAGELIi\DPRG 
RGCSELRSCHCTPAWATRAKLCLKHTHK 
IKK 


2380 


16281 


A 


2397 


210 


7 


GGKKMYCRKPGGGGFFAAVWS PKGLFS / 
RI WKEAPI LSPQKKipCKTNNP IKKWAND 
ANRHFSKEGTQQ 


2381 


16282 

r 


A 


2398 


48 


393 


S I LKTRKPPLKKGRGRRKKEKES VRTHV 
FFSYQSNAFPSKPLNNSMTLGRLLSFSF 
FLVHFFFYTDG I LLCCPG WS RTPGLKGS 
SHLSFPKCWDYIHEPPHPAYHS/LFCR 


2382 


16283 


A 


2399 


128 


383 


EEAPKHFPKPNLH/QKKVLVTAWWAAAG 
LIHCS FLNPWBTI TPEKYAQQTNEMHQK 
LQCLQLALVNRKGPILIiHHNVRLHITTH 
AS 


2383 


16284 


A 


2400 


54 


384 


LFTF I LNS VFHT YMCLYFWTLFFSVNPF 
VSMPIPQCLDDSSFIISIiDSEINPNIYS 
QLIFDKKTQRGKNSLPDRWCWENCIFTY 
KRMKWD P YL/ S P YTKI I SNWMKDIjNI K 


2384 


16285 


A 


2401 


393 


1 


HRGENTHQQGGGLS RWRRHSRQRGTSRW 
IRHTRQWGPSRWITPSRQRGSSRWRKRS 
RAGDIEVEKTQQSEGALEVRIRSRQRGT 
SRWRSFSRQRVSFRWRICSRQRCSS/KV 
RRRS KQRGTfa R WRRHSRy R 


2385 


16286 


A 


2402 


1 


330 


RPPPPPHCWDYR/HEPPR\QPTLWVIFK ' 
LS VETRLCYVAQ IGLELLGSS KESSKLD 

FKIPRFFKAGMEIFKQIEGFWSSHPLAT 


2386 


16287 


A 


2403 


3 


407 


ADAWGJjRGTHGP WEQAG lo G 1 S PoNS PJj 
FVCFRQSVALVSQAGEQWRRLGSLQPPP 
PGCGRESC\PSFQGGITPSCFFVFLEKM 
GLHRVGQAGLQLLMSGEPPALCSQRRGI 
TOVSHHARPPSKGFIWHTGAPAM 












3/j 


LGSLQPPSRGFKQFLCLSLPS S \YWFTG 
TPPLVGGGSAGLRGG 


2388 


16289 


A 


2405 


1 


122 


PT RP GRGCGE LRS CHSTLAW / VNS ETTS 
QKKQRKEKKLPLL 


2389 


16290 

• 


A 


2406 


417 

* 


130 


QALRVKHVQLVPSSDFLAKTRAQSVNNIj 
LRHSSSSSSSSSS3/SS3SSSSSSSSS3 

sssssssssssssssss\ssssssssss 

SSSSSSSSSSSSSSSSSSSFLFLFFFFG 
LRGAFLGR FCP I RR 
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SEQ ID 
NO: of 
nucleotide 
sequence 

'i 


SEQID 
NO: of 
. peptide 
sequence 


M , 
eth 
od 

• 


SEQID 
NO: in 

09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucie- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue or 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteJne, D=Aspartic Add, 
jtLr=ijiuiamic ACio, r— jrneoytaianine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K«Lysine, L=Leucine, M~Methionioe, 
N«Asparagine, IHProline, 
V£=vviutamine, K=Argimne, cFranne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
A^Unlcnown, w =Mop coaon, /"possible 
nucleotide deletion, ^possible 
nucleotide insertion 


2390 


16291 


A 


2407 


327 


3 


RQSQSLRYMAfJQPYCCFTCSILIRAEIjN 
EI/ ETKWKKIEKINET/ RKSWFF EKINK 
MGAITPDATERQNI IQGYYEQliYMQNLE 
NI^YMDKFLGRFKPPSIiNQEELDTLNR 


2391 


16292 


A 


2408 


178 


404 


AAYRINSLQEVQHPTKRHSQPRKQSKLR 
EGNAQRNNXRRGPGMVAHACNPSTLGGQ 
GGQV/TLRSGVRDQPGQHG 


2392 


16293 


A 


2409 


94 


386 


TSFALVALGGRSCSEPRSCHCFPAWVS \ 
SETLSQWKRKTPTLNNAKYWGGRFLRFF 
FLTRGGGGRFFFFGGTKTKSLGAGFKKG 
GGGKPGGPPNKGLG 


2393 

■ 


16294 


A 

• ■ 


2410 


1 


384 

- 


PTRQVRIKLFPRIYVHTRKRLKNIYGNT 
ETFITQPFKNHSVWQL IRNNRNAKKQBN 
VAHNEEKNKKQS IBTN\QKYTHNKNLDT 
TKKKKKKKKKKGGGPLKKTI FKARGGEN 
NFFFLGPQKLNSGAGF 


2394 


16295 

» 


A 


2411 


423 


2 


FFFFP P PQKKSLFPP FPFCGPKI FSS PP 
VFLTPPQKIFFCPPKKKKYPPPPRGKIF 
FFLK\PPPPFFFFFFFFFFFFFFFFFFF 
FIFFFFFFLGFLIiSSLLIGVPFTiTiTiNAN 
IMUjLQKKILKDEVQI ISSFPSTXNRNV 
h. 


2395 


16296 


A 


2412 


218 


112 


LKKENNTKCYYG WGTTGTL IHC \ WW \ KL 
IQVLWKTD 


2396 


16297 


A 


2413 


64 


403 


LKNFFFFFFFKKKGPPGAPGGSKPRGIiG 
EPPPPPPQRGGNTGGGPGARPKKNGGGF 
FFFWRGEPP/PPPPAKGGGQDLGPGPFG 
REETNNFPAPAPPGPKNKNLGPRAQKIL 
CF 


2397 


16298 


A 


2414 


217 


409 


VKYLS YSVLLTI I CTVQACYQELRPGAM 
AHTCNPSTLGGHGGRI /TLRSGVRDQPD 
QHG 


2398 


16299 


A 


2415 


416 


0 


ERPPPSSFFPPLAPGQVGQFFYGKGPWP 
EKSKKKNRPWGTKP FPPEGGGPPPPPNS 
PPGGGEKS PLFFFGGKGNPTWSPGPKFL 
KKRVI KGPLHPGGKG YKKTRE FGLKPGP 
PPPKFFLGGG/PGFQNWEKPPQP 


2399 


16300 


A 


2416 


277 


0 


PFVIiLSSSSSSSSSSSSSSSSSSSSSCS 
LLVRYLCKVKHKS S LCAHPGTVPFIiEL I 
HDTDS FAH / VrFADlUj 


2400 


16301 


A 


2417 


205 


486 


RHG YGFGFWKNTCMQHRHS YRRVY I HRH 
I HLTFLA YVHRES PEAMSALLTQ ILLFF 

PRPPGFKRPWC 


2401 

• 


16302 

• 


A 


2418 


364 


1 


KES liGDPVKDTVI LFATRNPMQMGSNYQ 
FFIYLCLTHLLSRSYCWFGCKMLQ/PLW 
KTVWQFLKKLNAE F PYD PAI PL\D I CI P 
VFTASJjFi XAQRWEQPKC^TDEWINQM 
WHMHTIEHYPR 


2402 


16303 


A 


2419 


281 


492 


LFPPSLPASHPKLSTQQPERSHSVTHCG 
/VDRRGSDLGLLQPRPPRIiKPSSRLSLW 
SSWDYRPAPPRPANSC 


2403 


16304 


A 


2420 


63 


465 


RLQRFLLS F \VCLCPAP I GGAYRGRQAS 
LSCGGLRPVRASLLLCLPKQAWAMAGAP 
PPASLPPCSLISDCCASNQRDSVGVGPS 
NPGAG YYLVARRFLSPLEKRS IQVGVTL 
FS RCRLS PLS LTRKGNSLTPCAS . 


2404 


16305 


A 


2421 


61 


482 


QRARITGVSHHAQ/LRFCLFDMRSHSVA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 

■ 

• 


M 

eth 

od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 

* 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
fc-Glutamic Acid, ^-Phenylalanine, 
OGiycine, H=Histidine, I=Isoleucine, 
K= Lysine, JLeucine, M=Metnionine, 
N=Asparagine, P^ProIine, 
Q=Glutaraine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














QAGVQWCDNSSLQPPPSGLKML/ STSAF 
QSL/WDYRCEPQHPIHY 


2405 


16306 


A 


2422 


90 


444 


YCFSSECS EKCRCPGHDLQLSPGLCHH6 
VSGLAKPLLCFwPWIWKPQPLCS PTTSL 
PFLPLPPFSQCPQGPAWLELEGRPSLWK 
QGGLQSLAIKR/REGSRAQWLTPVI PAL 
WEAEAGG 


2406 


16307 


A 


2423 


1 


444 


PGPGPGCGPVATLGSPSRPARTDSPSLP 
PHS QLREAEARNRD I EAHVRQLQERMEL 
LQAEGATGESLMCPLPRRT/WEEVGRLL 


2407 


16308 


A 


2424 

i 


189 


418 


SRRAEPGSFRGCLRVGVPSCTCV\SI»WV 
CVCVCWG/VCVCVCVCVCARA/CLCLCA 
NFSL/CSHVSLCLSLSLSL 


2408 

■ 


16309 

• 


A 


2425 


. 


895 


LDEQCTSE IHRRGEATARPRAPEHPAPP 
ATAVRGRDAASQNLKRRPGSGTDGLRLQ 
GAEP SRLLRTYAGGAVI PRGTPERAQPP 
PPQDPLGRRRWLSRNTWGPWPGTTQPPS 
PQLLRNDWGSCGFMVPEAARGKVFQDSQ 
EGAHI RRETVS KS VCAE PWRHQRARDPA 
PTNF PLKCQKQRGA^ TSSGQHGDRVNIiV 
FFIDDDYSPPSKR/P,KTNEPPQPPVPEP 
ANAGERKMREFNSGPHNPVEETKLICLC 
PSGHASCQVHLWTGAMLLGFQSWRKLPG 
SGLKARILQ 


2409 


16310 


A 

• 

■ 


2426 


494 


154 


SSRVRCQCA\LLGGASQLGCSGVRDPLE 
EAVCPFSDLQPRAGRTTALFKAQMEMQK 
SPVFCVAHAGSCRZiELFLFGHIiGSSItHG 
LWN 


2410 


16311 


A 


2427 


416 


1 


PQRGPLLtALE PGRQGAAP VEDLQPQGPD 
KPPPPPLPQPFRARTWTAAVPRHPPPV 
ACHP PQP LAASKPWP / SVAGGDLLPLPG 
PERPVHAFFIGF I FVHLGLGGVSGRGAV 
APARSGPVPPRPPSSSTSRFSLFFTiHE 


2411 


16312 


A 


2428 


84 


409 


DYKHAPTMPRIQNFVYHSPQQPROENRI 
NFTPKKKKKKKKKKNKKKKKKKKK\KIF 
WGG 


2412 


1&13 


A 


2429 


1 


389 


LRDLSSDRSNPGRFLSTSNSSLY/EKDK 
RNKAYFTK/RPSPVHDIIST 


2413 

* 


16314 

■ 

• 


A 

- 


2430 


456 

■ 


0 


PGWPGRGAP \PSQTGWPGRGAPHIPDDG 
QPGRGAP\ PSQTGWPGRGAP \ PSQTGWP 
GRGAPHLPDNGWPGRGAPY I PDDGQPGR 
GTP \ PSQTGRPGRGAPHI PDDGRPGRD/ 
GSS L PRRGG SRAEALLTSQTG WPGRGAP 

P 


2414 


16315 


A 


2431 


3 


344 


CRERRSCHCTPAWAT \SETPSQKKKKKI 
FFGNGPPGGPQAGLKLRAWGFFQKRGTM 
GPGTKNHPGQRGBPPLLQKKQKXTRPGG 
GGPGAPPPQGGGAGKSFNPGGGMFQGGE 
TP 

X IT 


2415 


16316 


A 


2432 


1 


109 


RPLRRLRQENRLNRGSRGCS EPKIjC/HL 
CTPAWAT 


2416 


16317 


A 


2433 


1 


239 


QSFAVLPRLVSNSWAKVICIi/PSVSQNA 
EIT 


2417 


16318 


A 


2434 


3 


464 


DWLQLEMQGEIVALVHSHHGGLPWLSES 
ARRLQVQSDLPWWLVCRGT IHKFRCVPH 
LTGRRFEHGVTDCYTLFRDAYHLAGIEM 
PD FHRENDWWRNGQNL YLD I LQAPGL YP 
VPLSAAQPGDVIMCCFGSSVPNHAAI/Y 
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SEQID 
NO: of 
nucleotide ( 
sequence 

'i 


SEQID 
NO: of 
peptide 
sequence 

• 

• 


eth 
od 

• 


SEQID 
NO: in 

26 

* 


Predicted 
beginning 

n ■■ a|aA AL|a 

nucieouuc 
location 
correspond 
ing to first 
amino add 
residue of 
peptide 
sequence 


Predict- 
ed end 

ft i aIa 

nucie- 

onoe 

location 

correspon 

ding to 

last amino 

acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A a Aianine ■ 
C=Cysteine, D=Aspartic Acid, 
Hi = \jiuiaroic acio, r— rnenyiaianiney 

fl=(2lvfina llsRiefiHinA I=lcnlott**in* 
vjr— ijiyciiiii> x\— xiiMiu me, i~j»oicucmc > 

K=Lysine, L=Leucine, M^Methionine, 

N=Asparagine, P=Pro!ine, 

Q=Glutflmine, R=»Arginine, S-Serine, 

"^Threonine, V=Valine, 

W^ryptophan,Y=Tyroslne, 

a— un Known, — oiop coaon, /^possioie 

nucleotide deletion, ^possible 

nucteotiae lnseruon 














CGDGELLPHIPK 


2418 


16319 


A 


2435 


3 


427 


EG ART S S ERHP CNKYLLLHS IAHLF INA 
LNLVLKGLSPSPFPALPISFPIFFFSPH 
FLGTPTLEGGRADLPFLQPPQAPG/QPA 
PLNYGPGPYKNPCP/RLPQLKPAOPGHG 
Jjjjl^PPFNVOHJNIWl^ijljVafaijr Ur r KJ\Rl& 
IPLP 


2419 


16320 


A 


2436 


238 


400 


QFRWKRGKAI FFFFFLRQS PSVTHAGMQ 1 
WSNLSSVQPPPPGFKQFLC\PSVPSS 


2420 


16321 

* 


A 


2437 


U 


442 


LGTRPRATDWGVRLELSRACPVLGHPAK 
HPRPQRWCKVNFSYSPEQADELKLQAGE 
IVEMIKEA/ CGNPDMPSVSPGPQRPPKT 
TEDKGWWEGE CQGRRGVFPDNFVLPPPP 
IKDAQPLLLFGIDQEAGPTESGISQIRF 
RRLSC 


2421 


16322 


A 


2438 

- 


411 

- ■ 


80 

■ 


PQAEEGAPTPGSENFNPPPPRGGGAPPP \ 
PQKNFFP PRGVNPGGGGGQKRPPPKKGG 
/SPKKNPGGDKNPPPKKKKNIGBGGGFI 
GAPRGTPKKTPPPRDGYFQVFFFIVSLK 


2422 


16323 

♦ 


A 


2439 


290 


1 


QLNKI KKTPLI*FLP WAnEKvPKIPPNYP 
PGPKKKGYPPS I FFFFFLRQGL/NSVTR 
AGVQWRDIiSSIiQPPPPGrKRr SC^KJPNS 
AFPHASADAWVDP 


2423 


16324 


A" 


2440 


440 


213 


PFSRPLFFFSPPPKKRASPPPFFFCFPR 
t recti tit* re /crpppinJppii'PttpgvygT o 

PPPPKKXFFFFSPPP 


2424 


16325 


A 


2441 


131 


408 


GCVPPEPAFLCFVLEIVSSVAQAGVQWR 
NLSSLQPSPPGLKRFYRPRLPSS/ QDYR 
RAPPIA 


2425 


16326 


A 


2442 


m Am 

102 


351 


QASSSVXiKXiCVCVRAKIiCV / CAXJVCACV 
CVC/VCVCECVCVC 


2426 


16327 


A 


2443 


190 


3 


PQGAREKSHRPGPIGRRLKL/DPPFIiSP 
HVKINPRWXKDIjNVKPTTI KTL&GNIaSN 
TLLDTCPG 


2427 


16328 


A 


2444 


410 


30 


VC VAPPLiC VCXG I JjCVK AHivif (JVC VHT 
HSLCVCR/CQFLCVCVQ/VAVSVCVCRA 
SLCVOT\ALCVC^CR\TVCVCVHGTVCV 

UVVJuJoruflUvLftlwVLVUVvVU^ 

LCVCVAS AQLLGRGFCSSVNI KGA0P 


2428 


16329 


A 


2445 


243 


1 


KVMVQNKGPFSNLFFFGPTINFFTPQFK 
GMQ W CHAG SLQHLP PGLKQFSCLLP 


2429 


16330 


A 


2446 


276 


408 


MLKNCAFWPGTVAHACNPSTLGGQGGRX 
/TLRSGVRDQPDQHG 


2430 


16331 


A 


2447 


24 


405 


UOU VLAr r r rr x JUJ^UJj^irUVsJSJjXiJirVv? 
PAGENQRWEQFDKLQKGGIKMNPLRGGI 
SVPLTKKKNQ/WARGE 


2431 


16332 


A 


2448 


239 


3 


SPLCGNNVYKPSTVEKTNQVEKMPPSKQ 
IWELCVELLiWO /SNRGIAGSGAHACNPG 
TLGGQGWRIMRSGDRDHPGQHGETP 


2432 


16333 


A 


2449 


406 


144 


GCSEPRSRPCTPAWVTSETLS/RKKRKR 
RKKKS CLLRAI LS TS PELTHFTLTTPLF 
SRYNDYPHFRDEKTEARRVYATC5GSHS 
WKRLG 


2433 


16334 


A 


2450 


349 


1 


GGAIFLTPRPKVPLTPITPIiAFNPGHQE 
EIPLPKKKKKIVRAIYDKPTANIILHGQ 
KVEVFPLKTSTRQGCPLSSLLYQHKTRM 
PSL \ PLL FNIVLD VliARAI KQEKEI KG I 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QIR 


2434 


16335 


A 


2451 


382 


iy 


GKGP KHS EAEAS LPRNPGS YNRQAQWPQ 
SSLHLKGTIPDTTSLNTPWKMLSSLKVP 
S WQGAVAHDCNHS TLGG \RGG W I TRS RV 


2435 


16336 


A 


2452 


3 


396 


FKL KLML I YHS EN PRALKNYTKS TLS ML 

VTHiJNTttlf AWMMatTVRT^HiJPTPV''BTrPT APT 

YFWGKKKITLKILPLVDNAPGHPRALME 
M YKE I NV/ VQ I C I LQPMDQGVT STLRSY 
R\KNT PCKDIAAI DGDS SDRT 


2436 


J 6337 

i i 


A 


2453 


115 


411 


KGGNFPPPGGGGEKKIW/ PNGGPPPRGK 

PV CP P PPPfan/"3fJW3fJU'WPP DflD T I?PPP1T 

KKKGLFGG/ PGGAPNSHPKRNP P PWFPE 
GGELTNPPTFFFEGGAF 


2437 


16338 


A 


2454 


3 


114 


HHV/GQAGLELLTSDDPPASVFQSAGIT 
GVSHRIRSVS 


2438 


16339 


A 


2455 


397 


1 


WSWQK\NRRIDQRSN IE S PEMNPRMYGQ 
VI FUKI AlUNTE WE KDoXjcjN KWCWKNWI / 
LKRMKLDHPHLTP \JCQKSKWMKDLIRPE 

AQGAKVNTDKWDNI KLKNFPTRP 


2439 


16340 

• 


A 


2456 


424 


3 

• 


PKKKKI FFPPPGFKI FFFLRAPPFFFFL 

HI CUPT T KTPOPCT.CrTTCrWCPTPTCT. 

CNKSSGV\CX3LHCSLIiAl/CSLIHLTLC 
PFCVIiLVCMCmVCVCVCVCVCDTVCVC 

QR 


2440 


1 CI A 1 

16341 

* 


A 


243/ 


zoo 


o /y 


VRDQPDQHG 


*\A A 1 

2441 


16342 




Z45o 


1U 




RSVDSATREAEAGGLLi\DPGGRGCSEIjQ 
LCHCTPAWV/TSETL 


2442 

• 


16343 


A 


2459 


184 


387 


I VHFQMHKMINVAY I IPQFYSFILEIQS 
HSVTQAAVQWHSHSSLLPLPPGLRQSSH 
/LSLPSSWDY 


2443 


16344 


A 


2460 


no 


1 


knrvsfff/csfetesrsvtqagvqwcn 
lgslqpptp 


2444 

i < 


1 £.1 A K 

16345 


A 


zAol 


JoU 


Z 


LQAPLPG 


2445 


16346 


A 


2462 


382 


29 


NGPGHGGPPVIPGTWGGQGGGFPRF\GS i 
KPGFTWGNPPFLKNHKNYPGGGPPPVIP 
NFLGGKPGNFFYPGGGGFQ/SGPGAVFP 
PPPGORRS T PLPKICKICERKIjTLFPSIiSD 

RLQNCDA 


2446 


16347 


A 


2463 


228 


1 


KKGTLFKPPPPGGRFFFFFFFETQSRSA 
VTQAWQ/WMPSRLTANPTFRTQGrLLP 


2447 


16348 


A 


2464 


395 

•J -S *J 


i 


RLRRENHLNPGGGGCS eprs / HPAWQQS 
ETTSQKTKTKTKQKALASDTVLSPRQSS 
EERFHLSLFHSS FVTPFGIFSFLTTSPH 
P I LPWLAPQLPAI YC YGDS VI KSNFQL 
WLSKELQESLSRLLWTQAFS 


2448 


16349 


A 


2465 


29 


299 


ETPNEAS PKTS WD YRHVLPCLANFFI FF 
VKTOL /HRAGLELLTSSDLLCFPKCWD Y 
RH\DRSTWPLSSVFGSIFLVYYWPLYLI 
ETNMLTTL 


2449 


16350 


A 


2466 


1266 


1473 


YFVPQSTQNHAAVFRVGSLLQEGCGKIS 
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Amino acid sequence (A=AIanine 
C=*Cysteine, D=Aspartic Acid, 
E«<jlutamic Acid* ^Phenylalanine. 
G=Glycine, H=Histidine, l=lso!eucine, 
K^Lysine, L=Leucfne, MNYlethlonine, 
N=Asparagine, P=Proline> 
Q=Glu famine, R=Arginine t S=Serine, 
T=*Threoninc, V=Valinei 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possibte 
nucleotide insertion 














KL YGDLKHLKT \ FDRGMVWNTDLVET LE 
LQNLMLCALQTVNG 






A 

* * 


2467 


49 


356 


VQVIIiLPQLLRRLRQENCLNPGOGGGSE 
PRWSHCCP/PAWVTEQDSISKTKK 




16352 


A 


2468 


115 


3 

ml 


LiS HTKWSAW PGAVA/ STLGGRGRQI TRS 
GVQDQPDQHGE 


?4<>7 


16353 


A 




>j 


404 


FVAPGGGQGTFPGSLQPWPPRGPGASGS 
PSRAPLI FGFFGGKGVSPVGPGGFLLPG 
SRD/SGPPGPPKGWGSG/GLGPAPGP 


2453 


16354 

* 


A 


2470 


404 


218 


FAQWI KWGALS SPKPPFPGFKLFSRPS 
PPSYWDYR/RRPRPPRLFFFFFLRQGIA 
LLPRLE 




f 


A 




JOO 




AGGLLSPGVQGYNKLLSCPCKPAWTTXS 
ETLSQKKKKKKLPFY 


2455 


16356 


A 


2472 


2 


364 


HHTQI IFVFLVETGFHHVGQAGVSIiLTS 
GDPPASWDYGR\GHRTWPYSHIFNNL 


2456 


16357 


A 


2473 


1 


292 


DRGCSEPTSCHCTPVWAT\SETVSQKKK 
KKKKKTPLLRGPE KKNGNPP PRGFFRKN 
PGFKRKKGGAPP PGLFKIGRKKS PVGGH 
RHTFILGGPLPRF 


2457 


If) CO 

16338 


A 


.2474 


l 




PVMLPVKPVEL 


2458 


16359 

• 


A 


2475 


2 

■ 

* 


596 

■ 


MKNABDILTMEVLKSTMKQELEAA\QKK 
HSLWELLRIPNICKRICFLSFVSSSSSS 
S<?C!gfifiSSSSfiSSSfiSSSSTKTTAW/l»P 

PLEAS F YRSTCLMPARALLFAST IPFWG 
tiTLHLOHIiGNNVFIiIjOTLFGAVTLIjANC 
VAPWALNHMSRRLSQMLLMFLLATCLLA 
I IFVPQGEKSSQVEERKCLSIjFSOGLPW 
SHLS 


2459 


16360 


A 


2476 


367 


44 


YQEDITIMN/TYALNIGVPTYLANI/DL 
NREIBSNIIIVEYFNfSLSKMDRYRSSR 
QNIDKETVDLKYS I / H INQMDLTDRYRT 
FHPTATERDSISKKRIKIKIKINKSLT 


2460 


16361 


A 


2477 


12 


362 


HHEPG\GOTCSELRXjCHCTPAWVTELDS 
VSKKKKKKKSQKKK 


2461 


16362 


A 


2478 


401 


1 


FFFFFSETRSHP VA\RLECSGAI SAHCN 
QCI 


2462 


16363 


A 


2479. 


3 


353 


YMCVCVCACVC I CVLPWMCLCVC/CYCV 
CVCWYLCVCLCVSLCGHQHLAVSGKRSQ 
P PSHES FKTSLLLWVFKNLP PLLCAWVP 
QVSQNMPLDGTRTLNNVSLPDEKVHDLL 
LPI/CAA 


2463 


16364 


A . 


2480 


57 


361 


MNESLLSCKRWSIiLAFSHI/CVFCLICS 
TT>ISAL07TVCAC3^CGCRCVCMCLCV 
CI^CVCRCMYGRVCAPMCLCVCVHCVL 
ARMCACGCGCVYKAECPI 




lotto 










YMCACG YI CMHA / CVPVCMCVCTCLCAH « 
VCL/CCMCVCTFVPVCMCVCICVCLCVC 
TPI ITLP FLSQERI S FCTDLRS FRATAK 
RSHEEVKNCVYLQT I WNQ YL 1 1 FYIjQNP 
NFGWAQWLMLVIPAVWE 


2465 


16366 


A 


2482 


2 


380 


IHVLGNCLYFSEPQFPHWCX1RITRSFLE 
G YGENLMRLHMS YIHMCLHICAYICI /C 
/LCVYICVLLCAYICVCVYLCVCICMCI 
FVYLCTHICVFVCVLVCIYLCV/C/VCV 

crcvci 


2466 


16367 


A 


2483 


296 


1 


NGTTIRRMLPIFSRYRPGMPGIiVQCTIQ 
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C=Cysteine, D=Aspartic Acid, 
EMjiutamic Acid, F=Phenylalanfne, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














. SCTANSVPYSHGFIYPPIYLLRDRVWliC 
YSGWSAVAPSWLTITSN\YRLKPSSCLD 
LLGSWNYRQMPPCI 


2467 


16368 


A 


2484 • 

i 


1 


319 


NTVGLCVCV/CLFVCVIiSLCV/CLSLiCV 
/CLCVCWSLCVC/VSLCM/CVSHCVCL 
CVYLWLCVSVCVSLCVSLCVCVYECVIS 
STKRSLGAPSRAEAAKLPRNWAPAKQSH 


2468 


16369 


A 


2485 


3 

• 


334 


NHPI ISIDI/DKHFRKIHQPFVPKAITK 
LGIKGNFLNLLiRGTSVKSTGNINM\NGE 
KPNNLPLRLVIQGDVLFHLLLEVLASAV 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KK 


2469 


16370 

1 


A 


2486 

mm ~ W 


17 

A 9 


409 


CLDNKKPRKEYCQKTHLIKKNPLLSRIC 
KELLKLYDKRMINPIKKWAEALYRCLSK 
GDV/HEMASKHMKRCS lS/h/3 1 RKMQM 
KMTL 


2470 


16371 


A 


2487 


410 


165 


LECNVTIMAHCNLRNIjGSSDPPTSAS\Q 
TAGIHHSLQHFFFTiFFVETGFCYTQAGL 
ELGSSDLPTIASQSVGVTGTGHGTWP 


2471 


16372 


A 


2488 


142 


417 


VEQLLEGYRTKSLYLRSFFLFVCMPLRW 
SL / DS VS PGLTNCHHLjGS LYPPPPGFKQ 
FSCIS\LWAWRRTP 


2472 


16373 

A 9 


A 


2489 


i 

l 


404 


KAGAQMGIKHPQHS SRLRLRRPS PRLPS 
SQKPIiNTHY F PS FS FCAHRHTHTHTHTH 
THTHTHTGV /HLHPHT 


2473 

Aw* 9 mf 


16374 


A 


2490 

mt* w ^ \t 


373 

mr w tm 


146 


YGLERKISFKIL/LFVGKSPSYPRALME 
MCKINWFMHANTTPTLQPIDQEVILGP 
GTVAHACNPGNLRGQSGRNA 


2474 


16375 


A 


2491 


427 


85 


KTPPPP\RGPFFFYFFFFFKKAKNFFPP 
PPTPPIiGKKKNP\PPPEKKISPPPGFPP 
PPFKKGPPKTLFKTPIiKKKKGPGAPQKK 
PFFFKTPPPFFFFFF 


2475 

mm* w 


16376 


A 


2492 


443 


240 


FLFFFFFXXFXXFFFFFXFFXFFFFXFF 
FFFFXXXXXXXXXXXXXXXXFFFFFFFF 
FFXGCLFI YLFI 


2476 


16377 


A 


2493 


3 


135 


I SAHCNR \ LLL PGS SNSTASAS PVAE IT 
GACNNNRQDFSFI ILQI 


2477 


16378 


A 

■ 


2494 


37 


420 


AHVCVCTCVCVCI CLS RRVWCAHSHVC 
AG I FLHL I NQT FTTHL/ CV CKVMALRVT 
PSPCLQGA\QTQLAGQCMCAKVCMNFMY 
IFIEGHICPQTCSOjGEPIRGGLSPLVC 
IC/MCAGFFVSFC 


2478 


16379 


A 


2495 


402 


178 


RQNKTPFGVKKQITGGGGRGRVSYFLRG 
VSQENPLNPGGGGCSEP\RPPGWGTKNP 
LFQKKKKS SKQQEGRGED 


2479 


16380 


A 

» 


2496 


138 


439 


RXFI LGDHHHHPS AVHFHL PRRKLCTHE 
TGAAP P P PALGATVP P SVS VCLTGGG PQ 
I TP PP PALGARDLP S VSVRLTMGGPQ I T 
PPHPDQQPL/SPPSVSVC 


2480 


16381 


A 


2497 


118 


382 


VKSFCASMRACVCECVYMCVSMCVCVCV 
CVSACLI CVCLCQ/CDLINKCV 


2481 


16382 


A 


2498 


164 


439 


VKSFC^MRACVCEC^TOICVSMCVCVCV 
CVSACLICVCLCQ/CDLINKCV 


2482 


16383 


A 


2499 


1 


411 


I STS F YTD T INRTALH FAVG RNHLSAVD 
FLLKHKARVDVADKDGMNALHFATQSNH 
VRIVEYLIQDLHLKDLNQPDEPKESPLH 
LWINNHITWNSLIiSAQHDlDIL\lR3 | 
. SKPPLHVAADRGNVKLVE/LLLKAGCD 
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■ ™ w ■ ■ ■ mm ^ MUM m, MM lra mmw mmWJkm'M m Brnmrnfm 

^Threonine, V=Valine; 
W~Tryptophan, Y-Tyrosine, 
X s TJnknown. *=Sta>D codon. /=nossible 

m m, mmf asamMV » » **y mm VVwVIII # |/VtJUl VI V 

nucleotide deletion, ^possible 
nucleotide insertion 


2483 


16384 


A 


2500 

\J\l 


80 


404 


RTAAAVSTVSFPQDFEGQSPKCTQGVRE 
ALRQIK/RLVPTGSLRH/WPAGSLA/LC 
QPLSDEKDLTQLFMFARNAFTALAMMDY 
PYPTDFLGPLPANPVKGRRRIiPRRRERP 


74.514 


JvjQJ 


A 


■U-J\) I 


715 


1 
i 


AAIOQGSIjACSHSVPPATTPRAYTPVPP 
QLLVRNF\YPKTLEIiRSQLRCARRFPRE 
TGADCRHAGAGRQTK 


2485 


16386 


A 


2502 


423 


214 


WQAQVGGLLEP\GGRGCCELRSCHCTPA 
WVT\GEILSQNNPKRQQNKTKQKMGPGV 
PGSWGSPRAGGLTV 

m* 9 mmw mmw v T W • « v mm* mjmt mmm mm* V 


2486 


16387 


A 


2503 

i 


294 


1 


LHWNQRQRCPPPPPPPLPPPPPLPPPPL 
P LR \ HR I AGAG QTGAKI R WGRWRVGA 
GLGVAAA/DSRESCPDGAAAGGGEGGGD 
SAATAGPGGGGGGGTK 


2487 


16388 


A 


2504 


331 


411 


PFLKTGNKGAPPPTKIFFLFFKKKGG/W 
PGAVAHACNP S TLGGRGGR I TRS GDRER 


2488 


16389 


A 


2505 


407 


195 


GGRG C SE LRS CP CT PAWVT \GBVLSQKK 
PSTCQRRVGAWFIYL 






a 


ZjUO 


i 
i 




ROGS MNKLETERO I KKAPARNPKRERER 
ERE S KKG EDRHTD I CRP /RERERQNYRD 
RQ 


2490 


16391 


A 


2507 


27 


380 

- 


FVCLCPAPRGGAYRGRQASLSCGGLHPV 
RASWIiLCIjPNOAW AMARAPP PAS IJjPCS 
LISDCCASNQRDSVGVGPSEPGVGYNLM 
VRRFLSRSEKRNIRVGVTRFSRCVCHPF 
L*LGKG 


2491 


16392 


A 


2508 


396 


234 


RQENCLNPGGGAS S E PRSRLCTPAWATE 
RDSISN/QNKQTNKSHILRAAFSRPSCY 


2492 


16393 


A 


2509 


1 


421 


AKKIBCPPFPLKKKKKPGRGGPPPPPPP/ 
AKFGRETPLNPGGKGS IKPKLGPPPPPP 
GGPPNLPPKKKKKKAIPK/PLVPSHVKS 
TGFPPSCS/RSALLRAFTQ 


2493 




A 




17/ 


«* 


TGPPD FNFFFFLEMKSHS VAOAG VQWRD 
LGSLQPPSPQFKQFYYWGIiQGA/IiPPCL 
NKICIFSRGGV 


2494 
M-*ty** 




"S — 


2511 


114 


454 


OGPLEKKGAPFKNKFFKTGVLFFGSGNI 
PPWWFKKGFSRTPFE I KKKNP PG I LGGS 
PKKFFFFFFFEMVLLHHPGWSALAQSLL 
TTASDF/LRLKQSSHLSLSS 


24Q5 


I 


A 


2512 




495 


FFLSRGLF IHLES APAI FOCLLFWFLV 

k MWA*wMfc ^» * * * * Miml %^mV Mm ■ 1 A » V»*"A w — Mm MmtM ■ 

FVATGSHYVPQAGLKLLASNEPPASNS P 
KCWVYRHEYPALKCLCFGLCLF/AFLP 


2496 


16397 


A 


2513 


25 


437 


PLLFSPPAGHAMEEDILPPAPSFLFYFF 
F YWLVWVKI IYLYVIiAQAATTNYHRLSG 

mt mm* 9 w mmm w m> m mm mmm 9 mm ▼ 4** m—m m mm mm mm m mm mr-mmm bmmh -^mr 

LNNKHLFLTIIjEPRMP/SIKVPA 


2497 


16398 


A 


2514 


2 


256 


QAP S AEGEMTS YVLLAYLTAQ PAPTSED 
LTSATNIAKWITKHS \NAQGGFYYNQDN 
L PYSLCS GRNESTAFGTNGQDI IHPVHS 
S 


2498 


16399 


A 


2515 


2 


284 


KCQCDELCS YYQS CCTDYTAECKPQVTR 
GDVFTMPEDEYTDYDDGEENNNAIRHEQ 
A\GFTSVI SDLQAQI I WISEQKBAMSFV 
EEMTVLEVTL 


2499 


16400 


A 


2516 


2 


432 


RDCERESSRAAEYTASLKASCWMGDLAE 
SPSLDPQVAMGTPPAGPC3GWSVGTRSQP 
FPRGPSRGPCCVSPPSWTPEPGTGQ/CG 
EGGGRMSPQVPP 
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SEQID 
NO: of 
nucieouuc 
sequence 


SEQH) 
NO: of 

pcpiiuc 

sequence 


etb 

uu 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

IlULIcUUUc 

location 
correspond 
Ing to first 
amino acid" 

icaiuue UI 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
lasi sinino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A»AIanine 
OCysteine, D=Aspartic Add, 

ITs^^lntn mir Arid TTssPtionvlftlaninA 
vyiuiiiiuii. niiUy r~*x iiciijf miHumCy 

G=Glycine, H=Histidine, I=I$oleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Prolinc, 
Q=*G]utamine, R=Arginine, S=Serine, 

T=Thronninp V=ValJn*» 

W=Tryptophan, Y=Tyrosine, 
X=Un known, *=5top codon, /^possible 
nucleotide deletions-possible 
nucleotide insertion 


2500 


16401 




Z51 / 




L 


VRS\PRLIRRQRTPTGEKPYECEECGKS 
FSRRYRLAQHQR\TRAGEKTYBCNEOGR 
GFSERSDLINHYRVHTGERPYKCDECGK 
NFSQNS DLVRHHRAHTGEKP C 


2501 


16402 


TV 


2518 


i no 

398 


1 


£• 1 r vai\ \ovjijKol V fljUc IjAVvjrAJbKAVM 

I SAIBKQKLVYILNRDAAARLTISSPVB 
AHKASALVYHWGVDVGFENPMFACIJEM 
D YEEADNGS TGEAAGNTQQTLTLYELDL 
GLNHWRKYKEPLE EHGNCI 


2502 


16403 

♦ 


A 


2519 


384 


3 


GGGID/ SDASLVIAGVxIjKUEGRXRCEL 
INGIEDESVALTLSLEGEALPLPPHSCV 
AAGPPRLGLPGLLPS I SSAPLGTPAPSP 
RPRRPSSPSAPIRWPSPGSPPPPRCGVS 
VPTQPGPVPVQLLRGVY 


2503 


16404 


A 


2520 


1 


426 


GDRQMITAIiLRKLKQQS RES VEENRPRL 
LKALKELGDFYLELHWDFQSWVPLLSRI 
LPSDACKIYKQGINIRLDTTLIDFTDMK 
CQRGDLS F I FNGDAAPS ES VWLDNEPK 
VYQRLRHEES /QE KNRTQVD CLTP 


2504 


16405 

• ■ 


A 


2521 


2 


425 


AIiPIGRMP IMVRSSR\CVLTGKTPAEFA 
KLNECPLDPGGYFIVKGVEKVILIQEQL 
SKNRIIVEADRKGAVGASVTSSTHEKKS 
RTNMAVKQGRFYLRHNTLS ED I P I VI I S 
KPMGVBSAQEKIQTVSPQEPEWR 


2505 


16406 


A 


2522 


2 


376 


X QS FS WY G YHLCS QG X LS b RVS AS Kr P 
LLFFFFFKKGGPPFMP/QGGKTKGPLKP 
GDPQS PG I KGFPCIiAP PNKGKKGGPPPP 
RGNFLGFKKKKRFPQVGGGGS KP PREGK 
PPPliAPQGGGNRGG 


2506 


16407 


A 


2523 


403 


3 


HAHYEQKKLAASQIEKQRQHMKLGKQ\Q 
QEQIARQQQRLIQQQHKINLLQQQIQVQ 
GHLPPLMIPVFPRDQRQLDAIiAQQGFLL 


2507 


16408 


A 


2524 


2 


369 


NARdi I LRAAE YMAACVYVC I S VFACMC 
r L.vrcvi»v r vv^v jov^jjt— vivoitiei vrtj-ioAjv-. 
/VEQCACLWMCVSLHVCLYTCACLCVSV 
CEAVCLHLESHCRGGSRPFPWGPAFSL 
SSCLIIiAPPSVT 


2508 


16409 


A 


2525 


59 


380 


RNRNFNKNSS FFFFFKKGPWGGGPGGTK 
GPKQRCX3EP\NHWGQKNFPGQTRKPRKP 

QKPGGQKKP PGGAPQKRGKTGGET 


2509 


16410 


A 


2526 


1 


371 


VMPLSRKHPKGFFSNHTLVLHVARSEMD 
KERVFQATRKSSP/CFVPLLPWPT 


2510 


16411 


A 


2527 


445 


3 


RWLGLAL IAL KRELKKGDL \ PEMRWWDS 
S 1 1 PNG\ FDLTEETPKRED YFG I ANLVE 

LRRQTRREAQKELQEKVRLGLMPPPEPK 
WISNLMRVLGTEAVQDPTKVEAHVRAQ 
MAKRQKAHV 


2511 


16412 


A 


2528 


1 


478 


RPTRPKRKLKSHRLQSRQESKKVRVL/T 
NAEDNEMEEETDDGPLLVPRVKVAEDGS 
1 1 1»DEESLTVE VLRTKGP CWEENDP I F 
ERGSTTTYSSFRKNYYSKPWSNKETDMF 
FLAISMVGTDFSMXGQLFPHRARIEIKN 
KFKREVYASAIEDQATSVHT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
nentide 
sequence 

* * 


M 

Cth 

od 


SEQID 
NO: in 
USSN 
09/5154 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

n rn inn arid 
•till 111 LP 4LIU 

residue of 

peptide 

sequence 


Predict, 
ed end 
nucle- ; 
otide 
location 
correspon 

Hino tn 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G-Glycine, H*=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaglne, P=Proline, 

0=Glutamine R=A rcnnint* S=SArinp' 
T-Threonine, V=Va)ioe, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=£top codon, /-possible 
nucleotide deletion, \=possi ble 
nucleotide insertion 


2512 

» 


16413 


A 


2529 


385 


1 


QKKCEDLK\DQDNPIVRPPPTPGSCGHG 
GVVNISKPYVVQLNWRGFYYLYGAWGRD . 
YS PQHPNKGL YWVAPLNTDGRLLE Y YRL 
CNTIiDDFLLY INARE IR I TYVQGSG IAV 














NYNNKYVSMYNTVNV 


2513 


16414 


A 


2530 

• • 


1 

■ 


1228 

• * 


FRATLRPETMFGQTNCWVRPDMKYIGFE 
TVNGD I F I CTQKAARNMS YQGFTKDNGV 
VPVVKELMGEEILGASLSAPLTSYKVIY. 
VLPMLTIKEDKGTGWTSVPSDSPDDIA 
ALRDLKKKQALRAKYG IRDDMVLPFEP V 
P VI E I PGFGNIiS AVTI CDELKIOSONDR 
EKLAEAKEKIYLKGFYBGIMLVDGFKGQ 
KVQDVKKTIQKKMIDAGDALIYM\EPEK 
QVMSRSSDECVVALCDQWYLDYGEENWK 
KOTSOdiKNIiETFCEETRRlTFEATLGWL 
QEHACSRT YGLGTHLPWDEQWLIE S LS D 
STIYMAFYTVAHLLQGGNLHGOAESPLG 
IRPQQMTKEVWG\YVFFKEAPFPKTQIA 
KEKLDQLKQEFEFWYPVDLRVSGKDLVP 
NHLS YYLYNHVAMW PEQR 


2514 


16415 


A 


2531 


335 


2 


KKKAIiFFIHPPPFVMGRPHKNPKIKSLG 
SPTPFSLIFLKKKFCfFVNQDGVRWLYFG 
SLQSLPSRFPPFFCLNLLSSWEYRGLPP 
RPGKYFFPFIFFYFLVEM\GFTVLARMY 


2515 


16416 


A 


2532 


3 


380 


AINSYIRGDDPSSYPEWQSASRSSK\W 
SPLPRALHLTDAK 


2516 


16417 

■ 


A 


2533 

• 


3 

- 


2083 

* 

■ 


SSEGYLRGNMSENEEEEISQQEGSGDYE 
VEE I PFGLEPQS PGFEPQS PEFEPQSPR 
FEPES PGFESRS PGLVPPS PEFAPRSPE 
SDSQS PE FESQS PRYEPQS PG YEPRSPG 
YEPRSPG YESES SRYESQNTELKTQSPE 
FEAQSSKFQEGAEMLLNPEEKSPLNISV 
GVHPLDS FTQGFGEQPTGDLPIGPPFEM 
PTGALLS T PQ FEMLQNP LGLTGALRGPG 
RRGGRARGGQG PRPNI CG I CG KS FGRGS 
TLIOHORIHTGEKPYKCEVCSKAFSOSS 
DLI KHQRTHTGERP YKCPRCGKAFADS S 
YLLRHQRTHSGQKPYKCPHCGKAFGDSS 
YLLRHQRTHSHERPYSCTECGKCYSQNS 
SLRSHQRVHTGQRPFSCGI CGKSFSQRS 
ALI PHARSHAREKP FKCPECGKRFGQS S 
VLAIHARTHLPGRTYSCPDCX3KTFNRSS 
TLIQHQRSHTGERP YRCAVCGKGFCRS S 
TLLQHHRVH\ SGE R P YKCDDCGKAFS \ R 
ASDLIRHQRTH 


2517 

• 


16418 


A 


2534 

- , 


434 


3 


APLHSGKRSPTKCN \ECG\ GAWNRSSLL 
DRHKI IHSEENPNKCEECGKAFKQASRL 
TIHKI IHAGEKPYKYEECGKVFSQSSHL 
TTQKILHSGENLYKCKECGKACNLFSNL 
TNHKRIHAGEKPYKCKECGRAFNISSNL 
NKQECI 


2518 


16419 


A 


2535 


46 


454 


PSTFSSKVMDKQTLCSSQATSOTSRYAA 
ALYRQGS I YPKE MKTCVHTKPYTWHSS 
I FMIAKKWKQPRCPSA\DS WRNRMWSIH 
AMG CYLTME KNEAVMLPRR / WS LENI VL 
SERSH 


2519 


16420 


A 


2536 


13 


476 


ALKTYKYSHKKAFNPKKVYGKGSQKDPF 
PINHCLPTEKLHLCDKIGEGVFGKVFQT 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
nentide ■ 
sequence 


Predict- 
ed end 
nucle* 

otide 
location 
correspon 
dins to 
last amino 
acid 

residue of 
Den tide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E^Olutamic Acid. F^PbenvIalanine. 
G=Giycine, H=Histidine, I=lsoleucJne, 
K=Lysine, L^Leucine, M=Methionlne» 
N=Asparagine, ^Proline, 
0=Glutamtne. R—Areinine. S=Serinc 
T=Threonine, V=*Valine, 
W^Trvotoohan. Y^Tvrosine. 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion. V=DOSslble 
nucleotide insertion 














. IADHTP VAI KI IAI EGPDLVNGSHQKTP 
EEILPEIIISKELSLLSGEVCNRTBGFI 
GLNSVHCVQGSYPPLLLKAWDHYNSTKG 
SANDRP/DFFKDDQLY 






A 

*• 




1159 


9QA 


yvcor y kianpflps ytkinckwvtd t m 
VKPTSVKIjLQEKRRKSL 


2521 


,16422 


A 


2538 


495 


348 


FGW \HAFI VKEPRVEKLCKASARAKPQP 
PAI I AKTFKAGGITGQYAQAAL 


2522 


16423 

• 


A 


2539 


480 


63 


ARSEAWIS IADAYMAS PGDR IAQLLLLP 
YI^VGNSEIKRTGGFGSTDPTGKAAYWA 
SRVSENRPACKAI IQGKQFEGLVDTGAD 

ASEVYQSTVILHCLGPDNRDSTSHYSKT 
R 


2523 


16424 


A 


2540 


2 

• 


509 


NVDADDVRLAIQCRADQSFTSPPPRDFL 
LDIARQKNQTPLPLIKPYAGPRLPPDRY 
CLTAPNYRLKSLIKKGPNQGRLVPRLSV 

SVTSQRFTVQIPPSQSTPVKPVPATTAV 

A 


2524 


16425 


A 


2541 


501 

• 


1 


QWQWEDPGPGPRPREEASSYKCELREQ 

AERFLAKTCS TKDQQ FKKDQNVLS PVNC 

RFVQVSEDSGSLFKKVQRYFFTEVBGWS 
NDTI FKILLDIMLITWVTQLSVHQTPV 






A 






9R7 
Lot 


EKDFNLPPKDLRli / KTSDV / TSTKENE F 
EIDYCLKRELLMGICEMGWEKPSPIQVC 


2526 


16427 


A 


2543 


268 


482 


KKKKAWGLQRGGKNFPAKAELPTHGIQI 
NSCSVRL/VDIKKBKPFSILKVEGQAQA 
RTHLNRAFDI I VLTRGG 


2527 


16428 


A 


2544 


407 


1 


CKKICYLIHYWWEYKMIQPLWKTVWQFL 
\KKLNILYDPAVILYDIYPEELY 


2528 


16429 


A 


2545 


28 


399 


FRHSSFQRSGRGSQLMVHFLSL/ SVMPK 
IGSVAGINYGLVAPPATTGETLDVQM/K 
GEADTENH 


2529 


16430 


A 


2546 


2 


3c* 


FWNVDEVGGEALGRLLWYPWTQRFYE 
S FGDLSTPDAVMGNHKVKAHGKKVLGAF 
SDGLALLDNLKGSFATLSELHCDKLHVD 
PENFRLLGNVLVCVLAHH/ FDKEFAPP 




1 Aril 1 


A 






i 


awTi prp mh ftpitr p v<5Hai? ai? Qaa etc inn? 

urnv>mwiuiu( v am v onniuviaiindAivvA 
ANGSGKQNEGMNVTWTATLALSQAVISS 
ATYTQMQPHSLIQQQQQIHLQQKQWIQ 
QQIAIHHQQQFQHRQSQLLHTATHLQGA 
\QKQKQQQHEWR 




1 AAIO 
10*OZ 


A 




i 
j 




DSFKSADESGQALLAAGQYASOEVREKL 
AVLSEERAALLELWEIiRRQQYEQC3flDLQ 
LFFRDTEQVDNWMSKQEAFLLNEDUGDA 
VDS/IKEILKKHE 


2532 


16433 


A 


2549 

* 


360 


1 

* 


RGEMLWTV\NNRFLKNFVPGKIEPFKSH 
SLYPPCYVHDVSFWIDQKKGFDEIjEFHT 
VGRAVSQDTI IS IQFIiSRFQHPKTQQVS 
IiCYRLTCQTCDKALTQQQVASMRSQXRK . 
EIQQHQY 


2533 


16434 


A 


2550 


2 


403 


WAEBDTELRDLIiVQTLENSGVLNRIKA 
ELRAG VFLALEEQEKVENKTPIiVNES LK 
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SEQID 
NO: of 

nUCIcOllue 

sequence 

• 

» 

* 


SEQID 
NO: of 
peptide 
sequence 

• 


M 
eth 

a/I 
Ou 

• 


SEQID 
NO: In 

UOOW 

09/515,1 ' 
26 


Predicted 
beginning 

nucicunuc 

location 
correspond 

ing iu iirai 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

n iiiil a 

nucie— 

otide 
location 
currespon 
ding to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartic Acid, 

IT" — (~Z In f-Q m i P A i*iH C — P haniilalanina 

Ks— oiumiiiiL. ci li, f— *r neny laiamncj 
G=Glycioe, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 

MsAtnarooiiiA PsPrrtl inn 

ii— rtsjjar tigiue, * ^rruunc, 

Q=Glu (amine, R=ArginineVS=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, y=^Tyrosine, 
A-un Known,, — oiop cuuon, / — possioie 
nucleotide deletion, \=possible 
nucleotide insertion 














KFIiNTKD\LQQLEGRENLARDLGIIEAE 
LDLSDVHSPPKSPEGKTSAQTT 


2534 


16435 


A 


2551 


1 


409 


VPRNPTPLGGPGGP I LRSREGGPPGLPR 
EPPSVLKKREPPTRGGGRPPGPPLPRRV 
RPEKWVNPGGQTFPEPKLRPRG/HHPGG 

/"VP VMD WD virv 


2535 

* 


16436 


A 


2552 


256 
* 


1 


HEMDGTLLGPFPPPGSSKGPSLTGPPIL 
PQS PGP / APH J PS S S PANLKT/ CTP VCP 
SHLPW/CCPLCLPMRLPWSPVPPVSKSS 
PPALY 


2536 


16437 

■ 


A 


2553 


398 


3 


ARQQQQLLQQQHKI /NWLQQQIQVQGQL 
PPLMI PVr PPDQRPIiAAAAQQGFbLPPG 
FSYKAGCSDPY PVQMI PTTMAGAAAATP 
GLGPLQLQQLYAAQLAAMQVSPGGKLPG 
IPQGNLGAAVSPTS IHTDKRCI 


2537 


16438 


A 


2554 


20 


517 


DRPPS TKRRDTPQLRGAPDLS PRGPAPV 
PECPEH/ SPRKKTSACRPPLPLRPSHSS 
PLP/SPQPSHSTPQASCPLPEALSPPAP 
FRSPQSYLRAPSWPVVPPEEHSSFAPDS 
SAFPPATPRTEFPE^WGTDTPPATHRSS 
WPMPRPSPD 


2538 


16439 


A 


2555 


361 


3 


KSSQEALEEALRQRLEELKKLCLREGEL 
PGKLPVEYRLDPGEDPPI VRRRIGP \AF 
KLDEQ KI L PKGEEAELERLERE FAIQ S Q 
ITEADRRLASDPNVSKKLKKQRKTKYIN 
AVKKLQVY 


2539 


16440 


A 


2556 


470 


2 


LKAA.VTAGLEVPSDVSDRAFE \WLSAFP 
Xj\DSP YS IHHPRRIQVSSEKEAAPDAGA 
ERITADSDLAYSSKVLLLSSPGLEELYR 
CCMLFVDDTAE PRET PEHP VKQ I KFLLG 
RKEBE PVLVGGEWS PSLDGLGPQADPQV 

KVb ci A X KL-AUAy 1 vi 1 V 


2540 


16441 


A 


2557 


2 


315 


VVAEE DTELRDLL VQTLENS G VLNRI KA 
ELRAAVFIiALEEQEKVENKTPLVNESLK 

WERSGISGLFFVFNCYRSSF 


2541 


16442 


A 


2558 


372 


3 


CN I S SHFANKKCQDVI VAARNVMTSQ I H 

HAV\ KI I PGFNINVPGLPPPDEDTELEV 

rufucMBrtvwmnKMT .wt .itmtt.tvmj cvct.c 
yjxVolN ry irtci VPxtNljCtijCiJM 1 Jji/vrloXoJjlr 

TCRISEYVKKLMELAYHSLLEAASSSDQ 

CADQLFYSVRCI 


2542 


16443 


A 


2559 


64 


435 


WGDAS CTGRAQLG IAHKS VLPTLTDKFN 
TIPAKTPMS FPKEPDKLTPKFT / WKNRT 

DDTT/IT 

rKl vi\ 


2543 

• 


16444 


A 


2560 


113 


375 


VPGPARDSTQRRAVKNDKKELMS IPGI P 
GPVQ VPGGGLS T \ GMRGFPGSS LGFLTI 
PGGALI PFSPAFFSRVGGDLS PRNTGPG 
OKPG 


2544 


16445 


A 


2561 


431 


51 


RKIYRVYERENFRVEIMFRFSHTSKKVC 
KGNV/DVAKFI KLHRIXSWHVIiNVQCACH 
QKGGIYWFRSIHVELIGYPPPRSSSHIK I 
I GDKVR VKAS VTT P KYKWGS VTHQ S VGV 
VKGNI I WVI KFLMLTFH 


2545 




A 


2562 


1 


455 


P EG I ALEWTE S TG KE REHT FQ P GDNVE 
VCEGBLINIiQGKILSVDGNKITIMHKHB 
DLKDMLBFPAQELRKYFKMGDHVKVIAG 
RFEGDTGIiIVRVEENFVILFSGLAMHGL 
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SEQID 
NO; of 

n nclpotlflfi 

llUVIbullUw 

1 

seauence 

a %*% ■ w • m ww 

t 

'i 


SEQID 
NO: of 
nentfric* 

seauence 


M, 
etb 

uu 

■ 


SEQID 
NO: in 
USSN 

09/515 1 ■ 
26 

• 


Predicted 
beginning 

niiHpfttfrip 

11 UIICUIIUC 

location 
correspond 
ing to first 
amino acid 
residue of 

npntirfp - 

sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to ' 
last amino 
adti 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=Aspartic Acid, 

Kr^fSlutamic Acid 1?=PhpnvlnlnninA 
G=GIvcine. H=Histidine I=Isoleticine 
K=Lysine, L=Leucine, IM=Metbionine, 
N=Asparagine, P=Proline, 
Q=G iutamine, R=»Arginine, S-Serine, 
Threonine, V=Valine, 
W=Trvntonhan Y=TvrntIne. 

X=lfnknown, *=Stop codon, /^possible 
I nucleotide deletion, \= possible 
nucleotide insertion 














KVLPPPLQLCSETAS/SIN/VGOOHEW 




1 AAAV 








i 

* 


KWT)HPIiSTfr , PVPr , fie5QTT^<5KrflT>VQVWi 
rdmjrii: jjl? ^wxi v t v^vjoo x xs ouur vox irVj 

FPSPY\TSSQDCVWMITVPSGHGVRLNL 

SLLQTEPSGDPIAIWDGPQQTAPRI/3VP 

LRSMVKKSVQSSSIQVLLKFHRIAATG6 

IPAIALSAYPNTKCPPPTIHPCI 


2547 


. 16448 


A 


2564 


382 


1 


TQEVB VAVSLDRAPAFQSGCQS BTLSQN 

FTPREQVS SHTRMF I AALFT I AKTWSQP 
/KXPPVIGWIKK\MWHIYTMEYYAAIKK 
DBFMSFAGTWMKLEAMY 


2548 


16449 

• 


A 


2565 


429 


0 


PI^SSCQTSCHQNCSFJCTSLHFPGDVLA 
TPEQVGGS PAQVP I P YLDDDI PLLEVEQ 

RGHL I VQLQELLHHWVLWSAVKSRWVI V 
GLFVS rLILSLVFAIRLR\SASRAPVLL 

Kr 


2549 


16450 

* 


A 


2566 


464 


1 

• 


VIHGVI \NPFVHGDQYKKKFPLK\FYQE 
IYESPFVTETGBYYKQEASNLLQ3SNCS 
UXP^Av JjUiuaxSx/blSxKL.uK^ JjafoUX lit 
VIHECHQRMVADHLQFLHABCHNIIRQE 
KKNDMANMP VXLRAVS KGLLHMIQGHVL 
RALGR I PTS YVRMQS 




lo45I 




2567 




1 TO 

1/5 


VKfrvyirvpcnT T c cu/~"T*T5xrLJT7 T r\ optt o 
XJNJbruvuVCoUJjXo&rlwilrVVi V 1 \o£>i.Lio 

QKKKKKKKKKKNFLPETRERALPGKKKG 
GGG 


2551 


16452 


A 


2568 


215 


411 


IHLIFIHLVFIDYLSVRHSSKILGYISE 
QlRKRNS/WLYWAHTCNPSTIiGGGGGQ. 
ITRSGVQDQPG 


2552 


16453 


A 


2569 


170 


430 


S P WLQVRTNAMAS PLLKFSAEDLLFKAA 
LSQFCVIMLNAKLSVQKYEK\LISAFSD 

?JP 


2553 


16454 


A 


2570 


3 


463 


CRFFGYSTAAAPMTPSSGGSTI»SGITAP 
AVPNI PS P IGVNGFTGLPPQANGQPAAE 
AVFANGMHP YPAQS PTVAD P LQQAYAGV 
QQYAGPAAYPAAYGQISQAFPQPPAMIP 

QMVLPFGNVISSKV 


2554 


16455 


A 


2571 


3 


424 


LKTMKGGTGNGLE IMIiDIQQDEYLPVWG 
ETGTS PTSGAPLHGS RPQP / PARGFLGF 
TVRPG 






JV 




loo/ 




RSAKMMRASHDKPTANI TIH\GKTGRAS 
KQRREERNRQEVKLSLFTDDMFLYLESP 
IVLiAPKLLLLINFSKVLAYKINVQKLLA 
VLYIKSSRESNEEHN 


2556 


16457 


A 


2573 


1 


399 


FAISQDHPALPSRPPSLHHPKPGTLTFH 
PDLPHOATCSPJP IRHORTWPEDAPLAKA 
DTVS PAEHPPAAATKAP /TR/PAPDKPG 
GTSDPQTGPAP\PASPPCSGP\APQPVP 
RKPSRAAPSKVSVTVPRRVPRTFPP 


2557 


16458 


A 


2574 


2 


452 

i 

i 


AKVNEMKSPMRKGHTLLKNKEEKLNQLE 
SSLtWEE AS DEGSLGGS PTKKAVT FDLSD 
MDSLSSESSESFSPPHLDSTPSLTSRKI . 
HGLSHSLRQISSQMSSVLSILDSLNPQS 
PTPLLASMPAQLPP\RDPKSTPTP/SYC 
GRCRGFSLTS 
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SEQID 
NO: of 
nDcieotiae 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

00 


SEQID 
NO: in 

TTCC1M 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

f 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A«AJanine 
OCysteine, D=Aspartic Acid, 
Es=olutamic Acta, r— rnenyialanme, 
OGIycine, H=HLstidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R»Arginine, S«Serine, 

nreonine, V^valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 


2558 

i 


16459 


A 


2575 


2 


369 


TI*V YPAITFILLS I CICYW IVTAVFLTT 
SGVPVYKVI APGGHCI HENQTCDPE I FN 
TPEIAKACPGALCNFAFYGEKSLYHQYI 
PTFHVYNLFVFLWL INFVIALGQCALA\ 
GAFATYYWA 


2559 


16460 


A 


mt mm m* 

2576 


A w\. A** 

406 


1 


RQEGTGPS YIJjLVG I xPRS FWGFS PPLG 
PSGKNP I KTFGGPLQGFFFRVQMGVLLP 
LPGPSGN/CSI KVSALNSSFSPS3VNPG 
EASLPWF/ CFFEMESRSVAQAGVQWQPP 
PPGFKRFSCLSLLGSWGYRRPPLV 


2560 


16461 

i 


A 


2577 

i 


3 


410 


YISPFYITHMRAHTNLPGPF/KLNQRAD 
ALVSAAFADAQTFHSLTHLNAAGLRKRY 
GLSWKQVKEIVKHFSAGEVLHLPHQGAG 
VNPRGLS PNS I WQMDITHI PTFGKLS FV 
HVSVDIYSHFIWATYQTGEATAHVK 


2561 


16462 


A 


2578 


386 


1 


TER IRQRQYQRETEKDKRYTERQRKTBR 
IRES DRDRE RQNQ I ERDRRAT \RERDRE 
KQRESDRETYRERENQIETERDRKRQSD 
RDRETQRBTGRI RYRERERQHQTESDRD 
RERESDRETQRHRECI 


2562 


16463 


A 


2579 


2 


432 

• 


LLYFAKRYGAAFGECCQAADKAACLLPK 
LDELRDEGKAS S AKQRLKCAS LQKFGER 
AFKAWAAARLSQRFPKA* FAEVSKLVTD 
LT*VHTECGHGDI*LECADDRSDLAMYIC 
ENQDSNFSiCLNKCCEKPLLEKFHCIAEV 
END 


2563 


16464 


A 


2580 


3 


426 


NLLNDALAI RE KTLiGKDHPAGAATLNNL 
AVLYGKRGKYKEAKPMSKRALE IRKKVL 
GKDHP D VDKQLNNLAL LCQNQG KTEQGE 
YYYPRALHIYHTKLGPDDPNAANTKNDL 
VACYLKHGTVKQA*SLNKE2flLTSAHEWD 
V 


2564 


16465 


A 


2581 


10 


389 


KLAG * GLWGHTL I TNPLTEPLT YPFLGL 
YL*SI I ITSSICI»IjQTDL*ALIAYSSI I 
HIALVITAIIiIQTP*SFTGAVILI IAHG 
LTSSLLFCLTNSNYERTHSRI I ILFQGL 
QTLRPLIAL* *LLA 


2565 

* 


16466 


A 


2582 


1 


192 


LIPTLAI ITR*GGQPERIiNAGTYFLFYT 
LVGSLPLLIALIYTHNTLGSLNILLLTL 
TAQELSNS * ANNL I * LAYT IAF I VKIPL 
YG uHL * L P KAHVEAP I AGS I VIiAAVLL K 
LGGYG I IRLTLI LNPLTKH I AYP FLGLS 
L*GGQPERIiNAGTYFLFYTLVGSIiPLLI 
ALI YTHNTLGS LNI LLLTLTAQELSNS 


2566 

■ 


16467 


A 


2583 


2 

• 


438 


QAHGPLAGCRLRPRSPSPVLWRRRRRWW 
RQRRKWKTKTATAAAGMYAS WTKACRGL 
EEIiINLTRLNVSYNHIDDLSGLIPr«SI 
KHKLR Y IDLHSNR I DS IHHLL * CMVGLH 

PT .TUT . T T .T?KT^T*)'n MP Vf*RT ."DfTYT? JVGT JTfi 

TFAQL 


2567 


IEEE— 


A 


2584 


3 


415 


GRAILLLEEAIQYLSGIEIDLIDTDRGE 
WDSLTPQARR* KEAGLQMFVQLARFHNI 
VCKEAFGTLAFLTSEIKSLFGHPFIAER 
1 1 SMLOTFLOHLI G PKMGALEVKDFSEF 
DF* PQQHVSDSGAI YLDLGDEENFC 


2568 


16469 


A 


2585 


2 


453 


DAKMYLSYKYATVIRNLREGTCPEAPSD 
ECKPEKRCALSHHQRLKCDEWSD I S VGK 
I KCVSAETTE YCIA* IMNG* ADAMS LDG 



703 



WO 01/64835 PCT/US01/04927 

< • 

■ i - 



SEQD) 
NO: of 
nucleotide 
sequence 

- «v 

'l 


SEQ DO 
NO: of 
Deotide 
sequence 

■i 


M. 
eth 
od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

■■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ins to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A« Ala nine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histid ine, I=lsoleutine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GJu famine, R=Arginine, S=«Serine, 
T=OThreonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X»=XJnknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














GFVY I SGKCGLMPVLAENYNKSDNCEDT 
PEAGYFAVAWKKSASDLTWDNLKGKKS 
CHTAVGRNAC 


2569 


16470 

i 


A 


2586 


3 


413 


MAVESRVTQEEIGKEP*KPIDR*KTCPL 
MLRVFTTNNGRHHRMDEYSR*NVPSSEL 
QIYTWMDATLKELTSLVREVYP*ARKKG 
THFNFAIDFTD VKRPGYRVKE IGSTMSG 
RKGTDDSMTLQSQKFQIGDYLDIAI 


2D /U 


ID**/ 1 

1 


a. 


ZDo/ 


i 
i 


/>o 


T >FVMTiMLVKAGADORAKNODGMNAXiHFA 
TQSNHVRI VEYL I QDLHLKDLNQPDE KG 
RKPFLLAAERGHVEM I EKLTFLNLHTSE 
KD KGGNTALHLAAKHGHS PAVQVLLAQ W 
OD INEMNELN I S S LO IATRNGHAS LVNF 
LLS ENVDLHQKAE PKES P LHLVDINNH I 
TWNSLLSAQHDIDILNQKQQTPLHVAA 
DRGNVELVETLLKAGCDLKAVDKQG* TA 
LAVAS RSNHSL WDML I KAERYYAWREE 
HHES IRDPSTGFTL 


ZD / I 




jv 


ZDOO 






AWS IAPSHPHSKVPPGPRRGKAEGRPGA 
AAQAAEQAEVHPPSSGPSPLPARQPPVW 
QIPPTPSLKTTTRRGAQPQHSRKRLA*S 
RSVSVLFRKM 


2572 


16473 


A 


2589 


a 


438 


AYYGLNWHLGATLSQKKKKKKKKKKNFP 
RGGGP PRNPHFWG IGGRPGTGPPRGEKN 
GLKNQKKEKFLAPR * KKS TGKPLKPPQG 
KGWGGGPKFPKKKRPPRAGGGSLKPKGK 
NREPFLALKSLPGVFLGGFGGNPGMGLK 
NQ 


ZD / J 




A 








VHLVRVKLGL * SRI YKELLOLNNTT * DN 
PI * KWATDLDRHFS TKYTQMVDRLMBKC 
STS 


2574 

• 


16475 


A 


2591 


62 


616 

■ 


EVHQGTE VRDS EVRRRPQARGPLW PAER 
AGRQRWLVPALQ PRRGGLRR * RGAVRQH 
GAHPHGLLLQDQKI PALPGRKQAGSLHA 
PGTEGE PDHGGD P VLDAG I QHHRQQRHP 
TADHLWPGEHRRGEAHVRAAV* PAAGAE 
G AAKE RRAHQANTALQVHRR * LGS FAEL 
RLLRKPGRTSVWPSPM 


2575 


16476 


A 


2592 


345 


438 


HKRWLPVPILYQHLF*VFGHPEVYILIL 
PGF 


2576 


16477 


A 


2593 


479 


313 


QDGLDLLTS * STRFSLPKCWDYRRBPPL 
KL INLQS SGVGLHVQLFP PS FCFDQLL 


2577 


io*t / o 


A 


2594 




419 


LTLHSNTLPPPEMSGIaT* PPATNAYWTF 
LPS AIRLFPEI FFIAVLLSVSLFDETET 
LSDAHSWRLTFKYERDANYHLLMSAQER 
LERLFGLHGGT I P I VPTADFHDS I SGAS 
DTDIAHSGLAYTMERSARQIMRTAMKY 


2578 


16479 


A 


2595 


1 


419 


HKCEGRYRGKT YGG Y WS LCATVNKALDE 
RIPITSASYYATVTLDHVRNILGSDTDV 
SMPLL*ERHRILNETGKFLVDKFGGSFI 
NCVR * RDNI AKKLMHLWES FPS YRDVT 
LFEGN* VAFYTRAQ ILVTDTOTVLEGK 


2579 


16480 


A 


2596 


199 


397 


SPTALNTDDVNP FMGSSSRGHHRQFQTG 
IVSDHPAEAGPIS *DBQELPYAVLHFHK 
VQPLEPKVTD 


2580 


16481 


A 


2597 


154 


3 


MLSLRGFFCLFVCFETGS * YAAQAGLKL 
LGSGSPPISSFQSARITGVSHL 


2581 


16482 


A 


2598 


425 


223 


QITGHGGKCL* SQLLGKFGQENFLNIiGS 
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SEQID 
NO: of 
nucicouuc 
sequence 

* 


SEQID 
NO: of 

sequence 

• 


M 
eth 

Art 


SEQID 
NO: in 
TISSN 

09/515,1 
26 


Predicted 
beginning 

location 

corresnond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

UCUUUb 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid. F=PhenvlalaDine. 
G=Glycine, H-Histidine, I=IsoIeucine, 
K^Lvsine* L=JLeucine. M~Metbionine* 
N»Asparagine, P^ProIine, 
Q=Clutamine, R=Arginine, S=Serlne, 
T=Tbreonine, V=VaIine t 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 

nucleotide deletion. \=nossibIe 
nucleotide insertion 














RSFN* PKLLPGTSGWVTMLNSLGKKKKV 
TAYLNKWMPTP . 


2582 


1 r A O? 

16483 

. ■ 


■n 
Jx 


25yy 


J 


AA 1 


DHS DP IHHTFDHMWR TKBHNEAGWLLLR 
SVDKVMKENDBLGDSISQLQKQILSLKY 
AKIALTERLIS CPV* TE I VLNYT* SL IM 
LLTDLL* KVHAQSHHAHSGQASNCNVMT 
IjXGLQH 


2583 

i 


16484 


A. 


2600 


131 . 


423 


GPAPMVILFHLTGFLIiAFLPLSHLLTSY 
LVPWIliSGTDGHTFRSACLPRWLEAEWI 

FGGVKYQYGGNQEGK* CFFTG* S YVYNG 

c CfJTCV P WF. TP.c; P T 
Ouurv. v trnctxc oni 


2584 


16485 

i 


A 


2601 


222 


447 , 


SG I P S FGL* WEQNRPGRLNAQ I LKDIjG 
VSPGSDYGKLKNGI SSGLDNGVTTSDHD 
VLKMAI VGRKI W I LGDWSG 


2585 


16486 


A 


2602 


2 


457 


SIEVMGDGCLNDEHLEELGGILKAKLEG 
HFKNQELRQVKRQEDNYDQQVEMSLQDE 
DECDVYILTKVSDILHSLFSTYKEQILP 
WFEQLI,PLrVNLIC5SRPWPY*QWGLCI 


2586 


16487 


A 


2603 


1 


297 


DHRQKLYAHTECGKAIjLWKSVI cvhqki 

J HHalMvyddwrX IN. J UXV Af JXiZxVjVgfr <T V ETUI V 

HAVE I PCK*TECEKATC VHGPRGASEFL 
PERPTGMNMAKCESR 


2587 


16488 


A 


2604 


1 


405 


RFDVSDGLELRPKYNGIAHRLTTIWYIiD 
GLRGLYKGVTPI IWSAGLSWGLYFVFYN 
AI KS Y ITEGSSERLKASE YLAS TAEPGP 
MTT ,C T TKTP T .WVT KTC5T >MTjO YD AWNS PH 

* Q YKGMSDTLVKI YKHGGVQGLY 


2588 


16489 


A 


2605 


165 

i 


3 


PEELLLVFFPVLKNTGIRPGAVAHTCNP 
STLGGRITRTGD*DHPGQHGETLSL 


2589 


16490 


A 


2606 


3 


455 


KRYGCFSKRJ^KRSATNVFFCARKGEVL 
GLLGHNGAGKS SSI KVTTGDT * PTAGQV 

VRQHL * VYAAVKGLRKGDAEVAI TRLVD 
ALKLHDQLNSPVKTLSQGLKIKLGFCLS 
ILGKPSCGILA 


2590 

< • 


16491 


A 


2607 


1 


4zy 


HFTS WPDLGVP ITP IGMLKI LQKEKACK 
PQYAGAI VDHCKA* VGRTGTFGD I EAML 
DMMHT*RKVDVNGFESRNRAOSCOKVOT 
l^YVFIYQAIJ,EHYLYXaDTElxEVTCXE 
TPW ! 


2591 


16492 


A 


2608 


166 


435 


KFLS1WYVHFKQNFKKVLKFIKHLVLNY 
FKNIVLGOV*RATPGI PALWEAETGESL 
EPRSSRPTWAMWRNSISTKl^IKNIVL 
RILDPG 


2592 


16493 


A 


2609 


228 


431 

w mm* m 


IFSKICIFNGFNYFPIVGHIGRIYLFIE 
aYJSHCVTQAGVQWC3)YGSljQP*TPGLK* 
SSHLRLPSSWDH | 


2593 


16494 


A 


2610 


439 


175 


RNGGLHLWSQLLRRIiRWDYHFSPGSQGC 
SKS * S YHCTPAWVTQGDSSLKKQKRKI S 
TYTS ILSVLNETVYI I VYYTKKLDTSFK 
RKKL 


2594 


16495 


A 


2611 


1 


439 


LLGS I LSSMQKP RGLVDQETLRKAR* QA 
ARLNKLQEHEKQQKVE FR I RMEKE VSHV 
V*DSGHITT*VQSMNSIERSIIjHEWEV 
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SEQ U> 
NO: of 
nucleotide t 
sequence 

*♦ 


SEQ ID 
NO: of 
peptide 
sequence 

* 


M, 
eth 

00 

■ 


SEQ ID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

Ducieonue 

location 

correspond 

■ng 10 nrsc 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Add, 
iv-ojuiff mic Acia, r— .rnenyutianine, 
G=Glycine, H=Histidine, l=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionioe, 
ri = /\sparagine, jr^rroiine, 
Q=GIutaminei R=Arginine, S=«Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJn known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














DGLTS FY FGEpDDCRYGM I FTKDAAPSD 
EELDSYRRAEEWDPHMADDKRKLTELAQ 
RPDDDA 


2595 


16496 


A 


2612 


1 


439 


VIRKVTGTEGSSSTVLDYTIPSSTGGMP 
VRKSEDQTDTKRTVI KTMED YNNDNTAP 

STQPISSVGKPATVIKNVRTKPSAIVKY 
PEKESEPFAKILKFTMDVSHEVIPHBVK 
SS*YSA 


2596 


16497 

* 


A 

- 


2613 


2 


453 


EKPEKEECNLWTEMWQENVPGSFGGIRL 
YLQELMT I TQKALHSQ PWKMKAQGAIAM 
ASMALQTNSLVPPYIiGMILTALVEGLAG 
RTWALKEELLKAIACVETAR3AEL*KSV 
PNQPSTHEI IQADLKECSKENGPYKI* P 
VXCAADILHA 


2597 


16498 


A 


2614 


2 


441 


LEPALPGRWGGRSAES PPSGS VRKTRQN 
KQKTPGNGDGGSTS * APQPPRKKRARAD 
PTVESEEAFKNIMEVKVKI PEBLKPWLD 
EDWDLVTRQKQLFQLPAKKNVDAIL*EY 
ANCKKSHGNVDNTEYAVNEDVAGIKEYF 
NEMLGT 


2598 

• 


16499 

- 


A 


2615 

• 


1 

■ 


430 


RGDRDLHCTESQSEASTEEGHDSLSVGI 
FEEDSQLEFILDPPKS KPPAWLNGIMTC 
* DFELLNPRRVRFLLE VKDLALTRRQ I L 
LHKGLSDYEKSTTLQELVLKKSSRSGPP 
LS IEDLGLNFQL+ PSSRVYGFTAE ELKP 
SGE 


2599 

• 


16500 


A 


2616 


373 


552 

• 


ICKIKYKLFNSALLFFRQGLALLPRMEC 
SGAIKVHCSLDPLGSSNPL*RKIKESTL 
NLE KSLCTRGI FLCKYEEVPKQFLKICF 
Q I FLETG SRS VS QAE VQWHDQNS LQPRT 
PGLKSFSCLNLPKCWDYRCEPPRRALCS 


2600 


16501 


A 


2617 


498 


157 


QLIGS STHQ AALKS RS IiPPPAGPGTFHF 
HYQGKAWWKGGTS PDSLPSLLGRGVSV 
QLHPRGKEQRGASDT*HKCPVKLWTIGG 

SL 


2601 


16502 


A 


2618 


255 


389 


LSEFYTYEGPSIRPPIGS*GTNLPLPLS 
Y I PRS P S AVDDENLLD3 


2602 


16503 


A 


2619 


3 


207 


QHSSLI I IRAIAIKLGIAPFHF* VP* VA 

ftPTBT TCPT T.T T T#UVT nOTCT TVATCB 

SJSs 1 trlt I o V7JUuuJj 1 w HlUifur J. & 1 X X UXo F 
SLNVSLLLSGT 


2603 


16504 


A 


2620 


3 


207 


QGTPLTSGLIiLYT*QTLTPISMIYQISA 
SLNVSLLLSGT 


2604 


16505 


A 


2621 


45 


447 


VHIRIDCRLNLCVTTASLKHFFSIAHKFH 
LHNGSHLQSQHSCEAQLRRTARFI ICCL 
PYGESGQSWPTLTLERANLNWLTVYFNN 
WKDWEYDS DO YMfiNFDR S RFIMMNI TVW* 

RSIDNLPHI ICSTLINTKNSFL 


2605 


16506 


A 


2622 


3 


142 


GNQATPKTAPATMSTPTILVATAVHAYR 
* * VAEKEHPLKPGGRACS 


2606 


16507 


A 


2623 


2 


453 


HGPGGLLDYIDKERIRDFLNGECMCEVP j 
DGGLVPKSLYRTA*DLENEDLKLWTDTI 
YQS AS VFKGAPHE I L I Q I VDAS TVI TWD 
YHVCKGD I VPNIYHS KRSPLPPNKDYLI 
AHSITFPDGNIWHLNNKVWMIXJPDYIMW 

ITTLITNNEN 
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SEQU> 
NO: of 
nucleotide 
sequence 


SEQJD 
NO: of 
peptide 
sequence 


M 

eth 

od 


S£QQ> 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=Glycine, HNHistidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginint, S=»Senne, 
T^Threonine, V=Valine, 
W=Tryptophan t Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


2607 


16508 


A 


2624 


1 


315 


EEKQAPGFTTGRGKLTPLFCAHAVRFI I 
STSL I YKGAYP * ALKGKEKY * LAVFWVY 
NKKGLNNNNLFLDGVHLCWPEVRKYLA 
GKGLPFS IGLYHLIIiLLFPFF 


2608 


16509 


A 


2625 


232 


478 


LDCGNYFTV*MYIKTSHCIP*IYIILNF 
HLYNFRYNLGDYNGEIVSBVMAQRQPMK 
PTYAIPIIPITNSSQFKHQEAMDVKE 


2609 


16510 


A 


2626 

i 


3 


430 


TSTPNVHMGSTSLPGDSTT I EDAIQSHS 
ESASPSALSSHPNNLSPTGWSQPKTPVP 
AQRBRAPVSGTPDRDKLRPCGQRD*GYY 
WEIEASEVMLATRIVSGSFGTVYQGKWH 
GDVALAILQVVDSTPEQFQGFRNDVAGL 
RI _ 


2610 


16511 


A 


2627 


2 


341 


ALQKHEDTDCPCVW3CPHKCSVQTLLR 
SE * SAHLSERVIAPSTCSFKRYGCVFKG 
TYQQ I KAHEAS FWQHVNLLKEWSNSLK 
KKGSSFDKECVDKNKSIHCWHIQICSSV 
I 


2611 


16512 


A 


2628 


2 


179 


RHTG LWVTSLPAVFPGQVRRTLF ITGLP 
RDARKETVESHFR*AHWLVGDFSPCCIS 
RTGEADPVHHRTPQI^PQEGDCGEPLPVS 
R WVGS FL IHYIQEHCL 


2612 


16513 


A 


2629 


1 


418 


GFSFCHPVPVKWRHRDSPQP*TPGLK*S 
CLGLPKCWDYRHEPLRPANISY 


2613 


16514 

• 


A 


2630 


3 


442 


FTCGTI I IAI PNGVKE FT* LATLHGSNM 
K* SAAVL * ALGYI FLFTVGGLTG I VLSN 
SSLD I VLHDT YYMVAHFHYALS I RAVFA 
I IEGFIH * FPLFSGYTLHQAYA* IHFTI 
IFIGVNLTLFPQHFVGIiFGMPRQYSDYP 
DAYTTR 


2614 


16515 


A 


2631 


2 


454 


AAPMELICWSGGWGLPSVDLDSLAVLTY 
ARF TGAP IiKVHKI I NPW* S PSGTLPALR 
TSHGEVISVPHKI ITHLLiKEKYNADYDL 
SARQGADTLAFMSLLEEKLLPGLVHTFW 
IDTKNYLEVTRKWYAEAMPFSLNFFLPG 
RMQRQ YM * RLP 


2615 


16516 


A 


2632 


131 


410 . 


WMWS S KAPHCFRLP S IGDADTVHQCAMS 
FQKGHSALEGVLHLVFKPDLVYQTLLQM 
P PRKCCIiWPGAVTHACNSMTLGGRSR* F 
TRSGVQDQP 


2616 


16517 


A 


2633 


2 


230 


F F SETS S LLE IQG I ARHGGTWRLRQENN 
LNPGGGGCSELRSCSCAPAWVTVRLDLR 
KKRNRNP*KILKNYLKIFVI 


2617 


16518 


A 


2634 


2 


370 


GTSSSDPAQPGDDKEF IDASRLVYDGIR 
DIRKAVLMIRTPEELDDSDFETEDFDVR 
SRTSVQTEDDQLIAGQSARAIMAQLPQE 
QKAK IAEQ VAS FQEE * S KLDAEVS KWDD 
SGNDIIVLAKQ 


2618 


16519 


A 


2635 


2 


f% mm 

376 


miFrir^rtir%7i t v*bbt i?ftti*T\vrit/iyiTMif boat 

MTDTDALY* RELFD PADKDKMDHSRRGI 
ALVFNHERFFWHLSLPERRCTCSDRDNL 
TRRFSDIiGFEGKCFNDLKAEELLLKIHE 
VSTVIHSDADCFVCVFLSHGERNHIYAY 
DAKI E I QTLTALF 


2619 


16520 


A 


2636 


1 


461 


DMAFL I LTE RK I LG YGQGR I G PNVACP Y 
GLLQPFGDAI+LFTKEPLKPVTSTITLY 
I TAPTLALT I ALIiL* TPLP I PNPLGNLN 
LGLLF I IATSSIAVYS I L* SR * ASNSNY 
AL IGALRAVAQT I S YEVTLT 1 1 LXiSTLL 
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SEQID 
NO: of 
nucleotide 
seauence 

'i 


SEQID 
NO: of 
Denticle 
seauence 


M, 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
* neotide 
sequence 


Predict- 
ed end 
nude- 

mm mm • w 

otide 
location 
correspon 
ding to 
last amino 
acid 

mm mmm %M 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Add, 
E=Glutamlc Acid, F=PhenYlalanlne, 
G=Glycine, H=Htstidine, I=lsoIeucine, 
K-Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S-Serine, 
^Threonine, V=Valine, 
W=TrvDtoDhan. Y=TyrosIne. 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \ppossible 
nucleotide insertion 














IKGSFNLRSLITT 


9*7fl 




A 


LXJD 1 

* 




■2 OA 


ENFKAIiAMIAFGOYROKRPFEDHVKLAN 
D*LNFAKTCVADESAENCDKSLHTLFGD 
KLFPIATLRETYLELADCCAKQ*PDIHE 
CFLQLKYDNPYLPRLVKPNVDVMCTAFH 
DTEETFLK+YLYEIA 










1 
1 


J f j 


TFI YLLFLFS SAYSRGVFGRDAHKSEVA 
HR FKDLGDENFKAIiVL IAFAQ YLQQCPF 
EDHVTCLANEVTEFAKTCVADESPEN*DK 
S L Y,T LFGDKLCTVATLRETYGEMAD CCA 
KDEPDRNECFLO 


2622 


16523 


A 


2639 


2 


375 


KGPCYRLVSTGTQRRHPGAVYLNKHLC* 
CNVGKA* GPHCEKCTLPCTFNEEPLKAIi 
TFFREHGP*VSDPEVATAPTBKEIPSLD 
QETTKLEPGQPQLSPGISTIHLHPQFPV 
VIEKTSPPVPVEI 


2623 


16524 


A 


2640 


215 


478 


KYFLASHTSLFIICYTAHLTCTIABPKQ 
IESHFGKRLDADLVF* KSDDSTWDVT P 
SKPV+TSGEiFSGKCL*HIVEGI IRAVDP 
RKLY 




1 






7rtft 


5 /O 


I LRNTj T KNH FW PG WAHACD PNTLEAM * 
GQITRSRDWDHPDRKGEAPSVLRIHRLA 


2625 


16526 


A 


2642 


424 


2 


KGEPLPPLGGWKT*GPRAFPQAQIPHAG 
FKTRG C P F PL P QGRNKAQ VPNP FCPRER 
FLPGKGSGRVAPLKNGAOLGGPPFYPHP 
FGGP S PR VP LG FG VQTHLGNKP KP P FF P 
KKKKGGTGCSVGNWGPKSSQTLPSPQAS 


2626 


16527 


A 


2643 


190 


3 


ELSTRGFQGPHRRIAGLKQCLVMFPELE 
RSGGLSEH*HIiHLPGSSDSRASAS* IAG 
ITSVCH 


2627 


16528 


A 


2644 


2 


339 


CCEKPLLEKSHCIAEVENDEMPADLPSI* 
AADFVES KD VRRNYAEAKDVF LGMFL YE 
YAIUWPDYSVVLLLRIJaCTY*TTIiEKCC 
AAADPHECYAKE FHEFKPLVEEPQNLI K 


2628 


16529 


A 


2645 


81 


369 


VEVTGQP QNAS FVKRNRWWLLPLIAALA 
SGS FWVFRTPDG * TSWR* IHNHMTQLI 
NNHLRA* YS VYRDI YFLWIAI * * SS * PA 
SVLLFTADYCPE 


2629 


16530 


A 


2646 


1 


348 


DMDMNPLRPQNYRFG*ELKADIDYHFKE 
DNDEDDHQLSLRTVSLWAGAKDD*HIVE 
S EAMNYEG CP I KVTLATLKM S VQATVTL 
GGFE ITP PGDLRMKOGSGP VHIMGIiHLV 
AGEE 


2630 


16531 


A . 


2647 


I 


178 


GYTDTI LDVRSQRVRSLLGLSNSEPNGS 
VETKNL*QILNGGESPKQKGQEIKRYDL 
YLT 


2631 


16532 


A 


2648 


181 


244 


TI KR YKNVMI F YY FNFF*R* GLNSIjAQA 
AAQWCNHGSLQNQPPGLKGSSCLSLPCS 
♦GYTIFYLSIHQLIDIWIVSTFSKLFLH 
VSAYSSIKMS 


2632 


16533 


A 


2649 


2 


369 


KWITFI SLLFLFRS VYGKFBFLIjDAPKN 
DVAHRFIDFGEQHFKALAEIAFDHYFLH 
CPFEDIiVTLVNE * TEFA* T CLADES A*N 
CDKS LHTL FGDKLCTVATLR * TYGEMAD 
CCAKLEPERI 


2633 


16534 


A 


2650 


1 


349 


VTFISIiLFLFSSGYSRGVFRLDAHKSEV 
AHRFKDLGEENFKALVLIAFAHYLQQCP 
FEDHVKLVNEVTEFA* TCVADESAENCD 
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SEQID 
NO: of 
nucieonue 
sequence 

■ ^ * 


SEQID 
NO: of 
pepnae 
sequence 


M 

eth 

_ j 
oa 

• 


SEQID 
NO: in 

T7COKJ 

09/515,1 

xO 


Predicted 

beginning 

nucieonae 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

ft* im ■ a_ 

nucie- 

otide 

location 

correspon 

ding to 

last amino 

acid 

restaue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
iv-^riu iHiriic Aciu, r— rneny (alanine, 
C=GIycine, H=Histidine, I=Iso(eucine, 
*v— x^ysine, i_r- leucine, iyi— lvieuiionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Ajrginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
a— u n Known, ^oiop coaon, /=possiDie 
nucleotide deletion, \=possibte 
nucleotide insertion 














RSLHTLFODTLCTVATLRETYGBTADCC 
AKQD 


2634 


16535 


A 


2651 


21 


403 


CVCCVCVRES RTE S bPGDNEDFNVKClAS 
VKDVRCVHFDCDSQNDPPMEATGPTAQV 
TIYSLINLLHRLKYFETLDLH*IAIFFS 
KVS I LNHPGQ I IADYAP ALDCHTCH IAC 
KYAELKEMIDRRSVK 


2635 

> 


16536 


A 


2652 

i 


1 


355 


ARMS I TDTYGQHLIAGGLMTQEDVSE I K 
SS YYAKS NDHLNNMAH YRPTALNLQAHW 
HGLAQAEAQ ITTWS SGVP FDLLLMVGMM 
TVQVP * EliQMHSHLIiKTHDQSRMENMMY 
GIKLDW 


2636 


16537 

♦ 


A 


2653 


2 


360 

* 


LFBHLG * YKFQNALT iLRYTMKVPQVLTP 
TLVKVS RNL * KVG I KCCKHPEAKRMPCA 
QD YLS WLNQLCVLHEKT PVSDRVAKCC 
TESLVNRRPCFSALBVDBTYVPKEFNAE 
TFTFHAD 


2637 


16538 


A 


2654 


360 


314 


YTCAIVPCKLFWWCC3U5RVSLCCLYWSQ 
TPELK* SSCLSFPKONYGHEHTWPARI 
IFLNLYNKSG*TATRKCPPTKVTPSSHQ 
Q*HCTWPYFYFK t) 


2638 


16539 


A 


2655 


I 


353 


NWTL*TLKKIFSLKDAIKRIKIQSVRWK 
KI FAKHTS DKGVES KYI * RTLINQ* ENN 
S I KNWS KNLNRRFTKDIQ KANKHMLS AT 
SLVH * ES KLK* Q * YHYTS IRVAKMKKTD 
YVTSW 


2639 

F 


16540 


A 


2656 


340 


10 


GREDKSEKCYLKPGRSQPDRGAPKSSPG 
VP*PPAPPALPGPGRSSPFPQGSLAGKD 
LRPRQPSQPGGPGELIFPVKTKKKKIKE 
RKTLLAWTRSNERPEIAQREGGCLRQ 


2640 


16541 


A 


2657 


509 

* 


261 


IPYFKIHCLHSALGVAETEKETAEHLDL 
AGASSRPKDSQFJwSPFylPPPSPDSKKK 
SRGIMKLFX3K*VKQ*TSGMGPVLHVA 


2641 


16542 


A 


2658 


345 


3 


SAHLSHP KCWD YRRE PLHLAR I SFLFKA 
SSPLYGRATFYLSTPPLMDGRWVAS IFW 

T T ■* XTTT n * "7\ T»TT rVTTVT OOTtST O T T?*VTV^ C 

LL* IVJjP*AW V x KXIjSJb 1 F1j£> 1 if *K1 X»S 
LSVAQAAVQWHNHGSLQPQTPGLKRSSH 

T 

li 


2642 

i 


16543 


A 


2659 


271 


56 


VTLYI VMP I PRPCPHPHPYIHTHTHTHT 
KI FLE I YIJjP I IiLKM 


2643 


16544 


A 


2660 


366 


68 


PGQQSKIMPQTERKKGRKEEREEREBKE 

TRIKAS *LNSLYTTVKPQRRLKGGKKHP 
KDSNFKD*RNISPHR 


Zo**4 






2661 


Jjl 




DWATE * DFWSNK IK* QKVQ YS KY 


2645 


16546 


A 


2662 


I 


160 


ERAWLHLS PGDGVCS EPRSHHCTPTWVT 
E* DS VANQ PGWDRVRLCLKPTWVTE 


2646 


16547 


A 


2663 


233 


3 


GRDN WLKT VAKPES LSGKRG PS LIiRKVF 
NPFPVYILFLPQAPKGKWLSPFPPFFFF 
FFFLRGSLAM* PGCSAVAQS 


2647 


16548 


A 




96 


349 


QMPS S VLTRVANAYYSGFFFWFLKQKFC 
FVTQAEVQGRS IS * PGPLVPGFKRFLGL 
TLLSTWNCRRAPPGLVIACGFKMSLLTL 


2648 


16549 


A 


2665 


322 


74 


DTYTLPRLNQVESLNRPITGSEIEAIIN 
RPPTKKSPGPDRFKAKFYQRYKBEIjVSF 
LLKLFQ* IEKGGIGWARWLMPVIPVL 



709 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: of 

II UIHSVUUC 

sequence 


SEQID 
NO: of 

|IC|JUUC 

sequence 


M , 
etfa 

nd 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

UUUCV UUv 

location 
correspond 

ino* in fir^t 

JI1K " w XII 0I> 

amino acid 
residue of 

sequence 


Predict- 
ed end 

HU I4v" 

otide 
location 

wl 1 C9UUII 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanlne 
OCystelne, D=Aspartic Acid, 
R=4"«Iut9mic Acid. F^=Pbenvlalflnine_ 
OGlycine, H-Histidine, I=Isoieucine, 
K=Lysin e, L=Leuci ne, M=Meth ioni ne, 
N=*A sua rapine. P^Proline. 
Q=Glutamine, R=Arginine, S=$erlne, 
T=Threonine, V=Valine, 
W=TrvDtonhan Y=Tvrosine. 

X=Un known, *=S to p codon, /"possible 
nucleotide deletion, ^possible 
nucleotide insertion 




J033U 




zooo 


it 




DLPASAS QSAGI MGVSHHAQP FLHS YLR 
PLI L I KGDQFKYAFNIjFSC 


2650 


16551 


A 


2667 


129 


301 


VRPLAGLGSPLIFFFFFKKSLAFVAQAG 
D 


2651 


■ 16552 


A 


2668 


318 


21 


FS LLVKD FQNDLAVNS SS I FPSTFLFLV 
iw iiur v i\ i_iiN c rii r i u£ xuJisr ouL.L.ruH 
TRIPRLKQSSRLRLLSSWGYKCVPLHPA 
INRFL*CDFSISGY 


2652 


J 6553 

« 

i 




Zooy 


2 


511 


nTYST.riT.T.TQ * QTPT.rtT.Pttr'WnVPP PCDP 

PAPINKCLLSIYHGPGPSLGTEGERGQD 
YIPAI EQLTDQWRRLTWTGCYCAAVABD 

(JIOVW PT J .PftYF AT? KTCAftP I7H 


2653 


16554 


A 


2670 


29 


261 


EFNTLSKCVWIY+LFSLSLSLSLCVCVC 
VC^CVC^CLCVSVCLFYGSPICLHFMHP 
DTLS EES FHFTiQ T I S FCKLLKC 


2654 


16555 


A . 


2671 


5 


224 


PA* PRDP PAIiASESAGITGVKKWLIABL 


2655 


16556 

• 


A 


2672 


126 


321 


WGRRGARARRDSHRALRVSPELALLELR 

P QTM AW T fNDQAT.nflPfJfJW T T * DOE PT?T 

NLANLEKPC 


2656 . 


16557 


A 


2673 


360 


74 


SRLLRKLRQEVGLNLGGRGCSGPRIjHHC 

PETNS LLAWEPDCVCKTNKLATRLE IMA 
QKSCSQRPQYH 


zoD / 


lOJJO 






1 00 




RTRKWI»YYYYYYLt,P WSLALS PPJLECRG 
TILAQCNFHL*S£SRSPASASPVAGIT 


2658 


16559 


A 


2675 


236 


375 


EEVL YSRKDRHVAS YSRK* W * RSGAEAH 

AU.1JTC J. AJVTvXtvVjVJ »1 X X XV0%7 


2659 


16560 


A 


2676 


236 


2 


ATKVS INS IGTLGVKVGGLLKGRRLRPG 
PKQSSCLGLQSSWDHR*VQPHPGNNFTL 
FVETRSMLARXVSNSWPQAIFQ 


2660 


16561 


A 


2677 


164 


389 


LTVQHGljRGIjRKLTVMVESEGQASHLIiH 
V A AOP P S KrTJOGNODMTiTKS S YOPCAJRA 

HACNP S TVGG* GBR I TRSG 


2661 


16562 


A 


2678 


208 


1 


CSVYEGSCFCILRVRRVCVCVCVCVCVC 
VCARSCIYVRSENH*KQLPSSSILT*RF 
* TOSTFPSMKP.U} 


2662 


16563 


A 


2679 


159 


382 


RWL I KNHPTQAR I Q VRGLLDRDCQTQTW 
LWESGVQPLATT PRRHS EMGGWPGAVAH 
ACNPS TLGGRGG * ITPPRPESKLEDCLT 
HTARRRI.G^^nGr'SriWriPPPt^GTOfCWG 

VGRAQWLT PVI PALWVAEAGRSLEVR 


2663 


16564 


A. 


2680 


29 


3*7 


DCQSEQLRKLRWEHHWS QGV* GCSQP* S 
HHWTPAWVTEQDPVQNKQTKNYNTHQRA 
GEVWRBINCLSTEQLTELYSEITQIIiAL 
SVRN* KLL* YVKSS VI SAETQI SGGKIF 


2664 


16565 


A 


2681 


256 


3 


ENIVPVRFLCIHRLFAILIMVQ*IiERKR 
DHIVra^TLETVYLKKWQTRPNAVAH 
ACNPSTLGGQGRRITRSRARDYPGQHGE 
T 


2665 

• 


16566 


A 


2682 


1 


395 

* 


LLIEHILLAMAIJjILTERKII/jCIQLRK 
GPNWGPYGJjLRPFTDAIKLFAREPIjQP 
VTSTITLYMTAPTLAVTMAVLL*TPLRI 
PNPLVNHDLGLLCILATSSLADY* il*s 
R * ASHSNYALIGALRACPH 
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SEQW 
NO: of 
nucleotide 
sequence 

* 


SEQW 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQXD 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A/anlne 
OCysteine, D=Aspartic Acid, 
E=G)utamic Acid, ^Phenylalanine, 
OGlycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P»ProIine, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


2666 

• 


16567 


A 


2683 

• 


3 

• 


349 


VRIDNAAVCLLY*AWAEFIWALANLI IR 
PDLDHPGNLLGNDHIYNVIVTAHAFVI I 
FFIGI PI I YGGFGN*LVPLI IGAPDMAF 
ARI TNIRF*LLPPS LLLLLASAI VKAGA 
ACT 


2667 


16568 


A 


2684 


99 


2 


LTVAHACNPSTLGGQSG* ITRSGVREQP 
GQHG 


2668 


16569 


A 


2685 


325 


4 


SSFPMSFI INS Fill FLY TT X FLI FYW 
VYS PE FKP * VC FCLTECWD YRRE FQCS F 
LPCKRV*VLAPFIKNFFFFFFFLRWNFI 
FVAQAGVQWRHLGSLQPPPPNFK 


2669 


16570 

• • 


A 


2686 


382 


269 


MGFHHVAQADLELLGTND P PALASQSAG 
ITGVSHHAQPKLTF*TFSY*ACVGGSSP 
QFILWQHGLAGNSFCCF 


2670 


16571 

• 


A 


2687 


3 

• 


381 


GHLYALTFFFNSQICLFPRIYKTCKKSF 
WLPFLNLVSLCAI EG YKTKKVPSNV* KG 
I FIHHQTG FI LRMQGWLNI * TS ING I YH 
ISRIEDKNHKTMFIDAEKWFHNI*HPFt 
IENIRPGWAHICN } 


2671 


16572 


A 


2688 


289 


91 


I KYKE LG VACS KTS RNVS Y YC YHHHHHH 
RCHHYHHHS * RLAALCEESGWRKALGST 
SMGRVAFHRH 


2672 


16573 


A 


2689 


3 


211 

+ 


LHHVGQDGLDLLTL * S APIX3LPKCWDYR 
TEP P FP APNE FLLLHSLLS KHYALiGRPH 
TPRERKKASGILR 


2673 


16574 


A 


2690 


2 


197 


DLLTS * S AHFGL P E CWD YRRE P P RPAS Y 
AFI KYFCPALS CFWCTETGP YL I FAYLR 
EESFVRSRT 


2674 


16575 


A 


2691 


253 


356 


AVPVKMAI VKKTRNNRCW * G CGE IGTFL 
HCWWEC 


2675 

• 


16576 

• 


A 


2692 


211 


339 


PGHPLS 1 1 LQWIWGEDTRRGPMHEALH* 
PCX:SELRSCHCTPAWPTE*NSIF*KKKK 
KWLGELKEKQKNMACANDP* PGHPLS 1 1 
LQW I WGEDTRRGPMHE ALHGKS GDGVYA 
VRLLTAY 


2676 


16577 


A 


2693 


1 


175 


RHEGLNLGGRGCGE P S SCHCTPAWVTB * 
DSVS KKKRKGKKKSALIL I FSNGVLFCH 
LG 


2677 


16578 


A 


2694 


234 


13 


KTSREP WS EFLS I KGEQQCS FGLLYVFC 
FQMESHYAAQAGV* WCNLG* LQPPPTGF 
KQFSCLSPLSPSSPASAS 


2678 


16579 


A 


2695 


149 


2 


SQGEDFTKTRALQWRSRRSWAHACNPS 
TIjGGRGWWIT*GQEFKTSIiAN 


2679 


16580 


A 


2696 


2 


194 

■ 


CIGLGWAGACSSRLRQENDMNLGGGAC 
SETRSHHCTPAWVTE* DSISKKKRKRKR 
WGGTRFSN 


2680 


16581 


A 


2697 


338 


357 


YLIRRKKISNSKS* FSWPGWAYACNPS 
TLGGRGRRTTRSAVRNQPGQHGDPKFFL 

vss 


2681 


16582 


A 


2698 


53 


293 


TISIKSKKWKNKCX5EYNKWCMENWLAIC 
RRLKLCPYVLVHSCIAIKNYLRLGNL*R 
KEV*LAHSS IX3CTGNMAEKPHGRR 


2682 


16583 


A 


2699 


2 


335 


LYAAKVPIjQKAC I RTFS YIS FF I FFFFF 
FFGKQSSFGPPGLKARGGTPLIGTPPLG 
VKRNFLPQFSGKPGITGCPPLPQKFWFF 
KKKRGLIF* PGGVPNSDPKGIGPPNPP 


2683 


16584 


A 


2700 


166 


3 

• 


D YRRVP PRIiVKR * GFTW * AGV* WRNHG 
SLQTQTIGLKQSSHLSLATCWDYRYE 
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SEQID 
NO: of 
nucleotide ( 
sequence 

■ 


SEQID 
NO; of 
pepuae 
sequence 


M. 
eth 
oa 

■ 


SEQID 
NO: in 

ITCCM 

09/515,1 

• 

■ 


Predicted 
beginning 
nucieonae 
location 
correspond 
ing to first 
"amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

a 1*1/1 

acta 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
jo- — - Lyiuuiujtc aciu, r*-.r nenyiaiaoine, 
G=Glycine, H=Histidine, I=IsoIeucioe, 

\T — T iicinn ff —I Kinn A/f=A/fi»4li lAninA 

IV — L^yd lllCy Lr*JUcuunc, ir4 = 2rJciIJI0IllDG, 

N~Asparagine, PHProline, 
CMGIutamine, R=Arginine, S-Serine, 
T=Threonine, V=Valine, 

W — Tr-xrntn r\\t a n V=Tvrneir»/» 

yv— ji rypiopnan, i — i/rosine, 
X«Unknown, *=Stop codon,/=possib(e 
nucleotide deletion, V=pqssible 
nijcieouuc insertion 


2684 


16585 


A 


2701 


209 


340 


YRMTVFPARHpGSHTCNPSTLGORQGWM 
T * CHE I ETTLGNMVKP 


2685 


16586 


A 


2702 


78 


341 


EVACNCLLPAIGCFASVTyV*SSVLL*F 

DWSVMA*S*FTEALTSQTIRSSHVSLPS 
SWDY 


2686 


• 16587 


A 


2703 


258 


185 


TMIAPVHSSLGDRARFYLFKQKR*GLAL 
SPRLiEWTGAI I VHKNIiErljCooNFPTSA 
SGVTG I TEMEVERHGW I QETFWK 


2687 

• 


16588 


A • 


2704 


361 


41 


NFGPVYKTNPGPSRPGWGPLGKPPGPQK 
FFPFGVPPSPPFPKWGSVFSPPGGQKDP 
FKGVFWGFP* PRGKKTPTQQPGPSPKVP 
NWUr r JMjUc rxUvKJvKJ\±<Jl\.J> JjKJN 


2688 


16589 


A 


2705 


12 


329 


SCTLQLTTPRLK*SSRPSLPNN*DYRHV 
P PHPA5 rP * r oKM IT w iJoTU i F JjNSIiKr S 
S * YYLVIFFFKME PCS ITQDGVQWCDLS 
SLQPLPPRFKRFSCLSLPSTRP 


2689 


16590 


A . 


2706 


232 


1 


KRKRKS FKTYYKKFFLNS S FKNQCP YAI 
DLGSPQPPPPGFKPFSHLRLR 


2690 


16591 

r 


A 


2707 


135 


1 


CFFL I FFFFSSDTQAGV* WPQPPGI KRS 
SQLSSPRSWDYRRISPR 


2691 


16592 


A 


2708 


1 


157 


LDLLTS * SARLHHPKCWD YRRBPPRPAD 
YKYFLKEVASIjIVKL»YIjFCKLNFE 


2692 


16593 


A 


2709 


88 


344 


MQEHYYFIKRGNKMI IRKYVQLHANKLD 
SLDKIDKFQVKNHQN*L*KK*KSWPGAV 
AHACNPSTLGGRGGW ITKS *DRDHP WIiT 
C 


2693 


16594 


A 


2710 


1 


383 


LKKVTT VLRFLS KFHILVLYFYYTCVHV 
SVSDWWCWFIjY* I rTIJM x XiA V * LiD t r 
KLS IKFLRCIC* CVY* I FIPVFI YCFFL 
YLICLFLSYFLFNIYVMLLFYILCIIYF 
YVIFSSSSIF 


2694 


16595 


A 


2711 


227 


1 


IFFFSNSFLKISLGFFLNFAFF*KPKFF 
NKFFKF* I YLTFFKKTKPK 


2695 


16596 


A 


2712 


1 


136 


i\JN i K.it>ytjWrV ri r*V XJrAlyi^EiAi^AjoljJS 

RGRQRIiQ*RNLGSLQPLPPTFKRSSCIiR 
LLSSWDYRRRPPPIiANFCIF 


2696 


16597 


ji 

A . 


2713 


3 


349 


T T DDfnP CnrtTT DVTVTOnTC T CYZMT^T ."KTI? Y 
Lt J. irKv^UUro 1 V V-LiF liNfc'AloXruNiJJjtviSJ. 

KMYGHAKTCI WMFMTSSFI IDRTRKRQ* 
CS SVREWINKQ* CIQTMELVFGRNFFFF 
FFFFFFFFFFFFFFGGPPPPKIQIFKBI 
NPQ 


2697 


16598 


A 


2714 


lot 

181 


342 


GKEEVKLFLFAHDTIVYP 


2&8 

■ 


16599 


A 


2715 


3 


205 


FRHVAQAGLE LLS AS S L PTLAS RS AG I T . 

b VonLVy i VDoiiKiior iJ/Xdf IS~Lt\U± 

QSCFSWVLYYCLQSAEL 


2699 


16600 


A 


2716 


388 


23 


ASNQSKNSFEKGEKNQQSVMVKTSQQAL 
NKRAFFDMIRSVYQKTYK*RNA* *ERQS 
FSLKVENKKEYPSSLLLFNIVLKVLVNT 
GHKEK* KAHRFDRNI IQFANDMIVYVEN 
PKDSTKRLS 


2700 


16601 


A 


2717 


4 


194 


FSCIiSLLSSWDYRSMPPHPANFliYF**R 
RGFTMLARLVLNSWPHSAGITGVSHHAQ 
PFNYFFI 


2701 


16602 


A 


2718 


1 


212 


LCCPGWWETARLKRSSRLSLPKCWDYRH 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

■ 

• 


M 

eth 

od 

• 


SEQID 
NO: in 

09/515,1 
Zo 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue ox 

peptide 

sequence 

i ■ 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

|a|v4 IT. _|_ ■ ■ MM J-|_ 

last amino 
acid 

residue of 

MAM 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspa rtic Acid, 
Ei=oiutamic Acid, ^Phenylalanine, 
G=Glycine, HNHistidine, l=Isoleuclne, 

IS I f _* Attain— H<f H/T*.+t*:n.r*im* *v 

iv— Lysine, Lf—jjeucine, M— ivietnionine, 
N=Asparagine, P^Proline, 
Q=€lutamine, R=Arginine, S=Serine, 
i** j nreonine, v^vanne, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
- nucleotide deletion, v— possible 
nucleotide insertion 














P .KT. A "RY2 CET.T. # DVT ,MWVE?C PKTT 

YDFSLVYSLYGTSI 


2702 


16603 


A 


2719 


53 


299 


AAVrlVK 1 r r J.TK.VrlJj xP^luIyblioLirP 
FPSPl»HPSI^l^SFL*ia)RVLLCYl?G*S 
AWQS*FTWPQIPGLR*ASCISLVSS 


2703 


16604 


A 


2720 


139 • 


320 


QPTMRHS FS F* KQE IS IFLKVSSSLHHT 

T DT?T t?T7T?T7T T TYDX7CT CU'OHTJ CT^TTlT C*T 

TVAW 


2704 


16605 


A 


2721 


1 


99 


1 1 JaAYSS ITHIG* I IAVLPYNPNITILN 
LTIYM 


2705 


16606 


A 


2722 

i 


215 


361 


JjTF F FF J?" FillKK £>N F VTUAEMUGGN L V * L 
KPRPPGLKHFYYLSLQSTGNY 


2706 


16607 

i 


A 


2723 


335 


24 


AIPLRPEF*THPGQNGKTRFPLKPQNLT 
GVGGNAR Y S Q FP KRLRLKI P LN PE VGP S 
INPNFHPGI PFWAQKENFFPKKKKKKE I 
IMLNFSLICYNLFFLYICY 


2707 


16608 


A 


2724 


27 


314 


WKQLKY P P S DE W INKMWCIHAME YLAIK 
RNKVQLCVT+ RWRWLMDTKKK 


2708 


16609 


A 


2726 


1 


112 


G FHHVS QDGLNLLTL* STHLSLPKCWDY 
RHEPLCQAR 


2709 


16610 


A 


2727 


271 


3 


RSPMLKNQAIKS VI MVYIGTI WRSAGPF 
AIFFIAEYTNI III ITLTTTI FLGTTYD 
ALSHELYTTxFVTKAVLLTSLFL* IRXA 
YPRPR 


2710 


16611 


A 


2728 


121 


I 


ENKYRPGAVAQAY I PNTDGG* GGWIT*G 
REFETSLTNVEK 


2711 


16612 


A 


2729 


2 


290 


NRHFTKENIQMPTRHKKRDPQSSLVIRE 
MQTKTKVliYYFILTRMAKVKKIDNTTC* 
♦GY^KI^SPKLQTTKCPIFEWVNKIjWHS 
QIVEYFSGICS1A 


2712 


16613 


A 


2730 


98 


264 


IiVSSE* * * I YQILFFFPEMECHS VSQW 
VRGSNLC* LHPLPPDFKRFSCFS yweg 


2713 


16614 


A 


2731 


2 


286 


PFYKATVI RTVWYW* KN*QIDQWNRKES 
PEIDIjHK*SLJjIFAKGAKAIQwRiajS 
NKWC * NRW I STC KNE S MLGWAHTCNPS 
TSGGQGGRIA 


2714 


16615 


A 


2732 


323 


181 


RDFVLLVEITMLARLVSNS*PCDLPASA 
AQSAGI TGVSHRARPFLDL 


2715 

> 


16616 


A 


2733 


272 


131 


GRVDRIiNPGGGGCS * LGS CQCTPAWVKE 
TPSQKKKNKIICFYAFLFI 


2716 


16617 


A 


2734 


227 


87 


AASTLTLTSLIPPILTTLVNPNKKNSYP 
HYVKSIVASTFII8LFPTTIFMCLDQEV 
IISN*H*ATMDFT 


2717 


16618 


A 


2736 


25 


415 


FS KYL LDHQ LSAGLLVEQHRGS LCPLSL 
LHSDSSLSFAVSGKVSLAASIRNKLELP 
ET*RALMM*NHGHPLEFGOfl* * PWT*EP 
GS *RC* EDGQPGQARJuiAGNPSTIavGRG 
GWIMRSGVRDQPGQDVET 




10017 


a 
n 






i 


KGFFT.ETKR KFF FTNG FFPLIiGGKGVPF 

PGKRLGPPILGFYPPQRVFPFPKPPFFP 
PGPFKRGQKPPGGGKNF*RGPPGPPKKG 
GAPTGKKKKKEEERKEKEKKTPRNISKP 
LSHPFPNFPFCATVRGFS 


2719 


16620 


A 


2738 


96 


331 


QFTYTKGLFPFNFNRIjCVFFFS * KLVGG 
V* LCCPGWL* TPGLKQS SCLSLLSTWDY 
KGVPPCIiAEHFFFFEKKSPLSL 


2720 


16621 


A 


2739 


379 


14 


LREVFPFFSPPKIKCFSKNSPQKYFPPP 
WFKTPPTPPFFFPPLFGKNFFFPSLFN 
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SEQ ID 
NO: of 
nucleotide 
sequence 

*» 


SEQ ED 
NO: of 
peptide 
sequence 


M, 
eth 

00 

■ 

• 


SEQ ID 
NO: in 

1TCCM ■ 

09/515,1 
26 


Predicted 

beginning 

nucieouue 

location 

correspond 

ing to urst 

amino acid 

residue oi 

peptide 

sequence 


Predict- 
ed end 

Mil it 1 A 

otide 
location 
correspun 
ding to 
1351 ammo 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

XT — f 1 tn fn m if* Arlrt I^Phanvlalonifia 

E/— viiuutiiiiv aciu, F n x Deny luiauine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 

NsAcnorooSnA P— P—r» lino 

Q^GIutaroine, R»Arginine, S= 3 Serbie, 

TT— TTh rpnninp V=Vnfin» 

ji~ j nr uumiHSj w aiine, 

W=Tryptophan, Y=Tyroslne, 

X-Un known, *=Stop codon, /^possible 

nucivuuuv ueicLHiii) v— posmuie 

nucleotide insertion 














FCPPRGFF*RPPLFFFFFFFFFFFFFFF 
FFFFFFFKVFYSLLLLYFLFIIiK*MLTF 
KTSISTQCY 


272 1 


16622 


— s — 


2740 


t no 

198 


38 


♦LTADSTPWAQVILQPQRNPIEHDF 


2722 

- 


16623 


A 


2741 

■ 


290 

■ 


39 


MSKVGPLLGVLGTSFRVLLKGPEIKGDP 

T T.1WDT .T?T?T.T7T? T7T .D TV3\7T ,WB DTUJ Q Rtrt/ 
JjLbATir'ijr c i-ir r f JjKJJIjVIjV IKitVttVoAV V 

QS*LTADSTPWAQVILQPQRNPIEHDF 


2723 


16624 


A 


2742 


1 


399 


lijIRKHbbX^rbiuuFDA* ruTiuxuiia 
VLNRTELEPDQATLLGNDPVHEVIDTAH 

»rnT TDBTITTT1T T T /"'fTTV"**!'* T T T/"*H 

ALGI IFFI VI PI 1 1 GGFGN*LiVP1jI IGA 
PDMAFARINNI S F* LLPTSLLT.T.TASAI 
EEAVPGTG* TVY PSIiA*NYCR 


2724 


16625 


A 


2743 


398 


2 


SPPPLIFFLWGFSYIFPPPKKFFFLINP 
PPKFSPPPFFFKPPPPFFFFFPP*BBKK 
NFCS P PPFSPPPPFFLLPPPLFFFFFFS 














KKKKKKKRARSRSRTSPTRP 


2725 


16626 


A 


2744 


295 


185 


FFFFFFFFFFFFFFFFFFFFFFFFFFLF 
FFLFXFFXF 


2726 


16627 


A 


2745 


374 


4 


LKKHKSTRVP *NVKSGK*NFSPPFKIRP 
WAQ KRAKKKGARE KKAD * B KGE FGKLKK 
KKSFRPGKKV*G* PKGFQRNFEKKPVKS 
QGPVNMGVIPAFEKKPRIPPVAKIPI *G 
AQKKKGRPLAI 


2727 


16628 


A 


2746 


2 


382 


QDATAP I I DEL I S FHDHAL I I I CLI CFL 
ILDALFLTLTAKLTNTNISDAQEIETV* 
TILPAI ILDLIALPSLRILYITDEVNDA 
S LT I KS I GHQWY * T YEYTJJ YGGJjV FN S 1 
ILPPLFLEPRDLGLL 


2728 


16629 


A 


2747 


2 


374 


DWPPX vKRKIwPKAyS lAADRr ILJjr 1 V 
RGLTGIGLANSSLDIVLHDTYYWAHFH 
YVLSLGAVFAIIGGFIH*FPLFSGYTLD 
QTYAKIHFTIIFIGVHLTFFPQHFLGLS 
GMPRRYSDYPDA 


2729 


16630 


A 


2748 


55 


230 


KKKGACLKKPHGGAKEKS WGRQKLFS FK 
GG 


2730 


16631 


A 


2749. 


161 


3 


NGNYTLVKKCFNTKEKIKWVKN* IFFFF 
WRW b r jTIiVAUAVj V U W UiUJjo Jjv r* Li 


273 1 


16632 


A 


2750 


362 


1 


PEKKS FAI KGVKKAGKQCKFCDAPKKGN 
PPKKKNPTGTPKRVLKK*PPREEKKNSP 

FFFI S FFFFFFFYF FFFFFS I VINLFGT 
FIATTLEA 


2732 

• 


16633 


A 


2751 


311 


no 

• 


ITLPRHGGPHVQSQLFKRLRWKDHMSLG 
DRGCSELSSCHCTPVWRQSKTFSKIKYG 

cCINv? 1 UJN_ri>\l V_iNirvjK.r WSWJKft. V ■K^U^ao 

ETSLSNSRTRL* KNKI WSKRDRQGGYL 


2733 


16634 


A 


2752 


223 


16 


KKTPQKKKNTEGKGP VKKS P PQKFLKS T 
PP*IFFYVFCFFFFFLYFIYLLFRIFSF 
FFILFVYYLF ILP 


2734 


16635 


A 


2753 


367 


10 


PAPRGG VYRGRQASIiS CSGLHPLRASWP 
LCSPTQA*AMAGAPPPASLPPCGLiSDC 
CASNERGSVGVALS * SGAGDNLL VCRLL 
S GKCRNHPS S AS LTLAAVD W3 CS Y3 AI L 
APPLKNS 


2735 


16636 


A 


2754 


14 


341 


PGPNF * FGGQNS PGKNQNLSLPT* KGNG 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 

• 


SEQID 
NO: of 
peptide 
sequence 

■ 

• 


M 

eth 

od 


SEQID 
NO: In 

I ICCM 

09/515,1 

26 . 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to lirst 

amino acid 

residue or 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCystelne, D=Aspartic Acid, 
Jtr-uiutamic a ci a, jp=pnenyialanine, 
G=Glycine, H=Histidine, I=Iso!eutine, 
K=Lysine, L=Leucine, M=Metfiionine, 
N=Asparagine, r^rroiine, 
Q=Glutaraine, R^Arginine, S=Serine, 
l— inreonwe, v— valine, 
W=Tryptophan, y^yrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


• 


- 






- 




PGNPKNGGPTVPGTPPETRGSLLRTLQG 
PLRP SGTPPTRRAPGTPVGNPGF* PRDF 
r Awou FIN V KULj Jj JvUuFIN trnWliN f FF 


2736 


16637 


A 


2755 


15 


172 


HS WWECKLVQPL * KTVRRFLKKLKIDPP 
YYPAIPLLGIYSEERKSVKQSRRG 


2737 


16638 


A 


2756 


263 


2 


QKPLPPRNDPVFPKGPERTPRPTWGFIP 
GGVPIajKPFTLIjVSPFFFFFSPSPNLFF 
F*NRVSLCRPGWSAW*S WLTAASTS QA 
QAI 


2738 


16639 


A 


2757 


311 


106 


FVCFLRQRLALLPLECSCVILAHCNLRL 
LCS SDS PG* QSETV SKKKKKKRKKEKET 
GNL * RSLDMKGR 


2739 


16640 

i 


A 


2758 

• 


23 


328 


NITYYT * LFVLYFYVPGHTWFFFXFXF 
FFFFFFFFFFFFFFFFFXFFFFFFXFFF 
FFFXFFFFFFFFFFFFFFXXFFFFFFFF 
FFFFXFFFFFFFFFFFFF 


2740 


16641 


A 


2759 


1 


338 


DR*LFSTNHGYIGTLYLLFGA*AGALGT 
ALS LLMRAELGQPGNLI/3LDHT YNG IDT 
AHAFAI ILFMSLAI IIRSFGN*LVTLI I 
GAP VLAAPRJTD I S W* L *PHS VLLLLA 


2741 


16642 


A 


2760 


120 


2 


INKYAWLGTVAHACNPSTLGGRWTA*GQ 
EFKTSLANMVK 


2742 


16643 


A 


2761 


251 


3 


CRGQ I GT PTLHYS S S S SS KWI KDLNVRV 
KTIKIiSENTGVNIiRSLKLOUUi^ijLUDTKS 
KNSTRKIRK*SLKLETFFLGGTR 


2743 


16644 


A 


2762 


11 


303 


ALVLKRS S CLDLLKC WD YRHE PLHLALS 
EILMYRFCL* I FD I FFYWYSRE I DLI FV 
Y*FFFFFLETGFW FJjIiPRGDIjG* WEPPP 
PGYKGSSCLGIiPR 


A A 

2744 


16645 


A 


2763 


236 


356 


DWJbl rVJf b VEKvjr UnVAQ VUlj v ijL*JoGD 
PPASASQGAGIT 


2745 


16646 


A 


2764 


2 


332 


LTQT PGLK* SACLCLPECWNYRCEPPHP 
AGFLF* RHFDKGVS * LLI YPGTGWS 


2746 


16647 


A 


2765 


297 


19 


KKKSWFFFKKKKCWGGATKFKK*RGFnr 
KFVFYFFFIG*GVFFFFFFLISIIIFFF 
rrc FrFFFFFrVri? irrrrJrrfrr Fur 
FAMKINVFY 


2747 


16648 


A 


2766 


1 


147 


QIIjRRLRWENCVNIjGGGGLoEPRIjHHCT 
PAWVTV* NS VSKKKKNPLLFF 


2748 


16649 


A 


2767 


357 

• 


183 


NWDYRPLSPRPGKFLDF*VNRGFPLLTK 
LVSNF* PQTLPPSTSPKAGISGFNPRAQ 

P 


2749 


16650 


A 


2768 


2 


193 


RWESCLNPGGGGYSEPRSCHCTPSWVTE 

w DSVSXi i J. I^AJSKKQSirCl WblSbNbJjUlt 

YKDTNHF 


2750 


16651 




2769 


333 


173 


TfCATVT "HT T TO * CTUT J~»T DVPUMVODCD 

PRPAQKLLMVIWLGFMSSPKSPLEIi 


2751 


m r r r\ 

16652 


A 


2770 


3 


331 


r^hA 'crci/~*M\ i T tj r> v v\ r vur vxrvil m t w vdu v 
CNiSN CM V 1 f ivK V ivrLKiN I J^JLyJvirxrVifR. 

AVOOLSTELNIL* PYDPAITLLGI YPKE 

LKTS FRTKTCT * MLTASLFVIAKTWKQ P 

RCPSVGE * L SNL * YVQTME CYSVLK 


2752 


16653 


A 


2771 


209 


55 


RPGRPQVLRRVRPQNRLNPGGGGFSEPK 
* GPCPPAWGAQRDFI SKKKKTIT 


2753 


16654 


A 


2772 


213 


57 


RPGRPQbLRRLRLQNRLNPGGGGCSEPR 
♦GPCPPAWVTQRDFISKKKKTIT 


2754 


16655 


"A - 


2773 


148 


1 


KATGRKTWVKSRCWVGTVAHTCNPSTLG 
GRGR* TARAQEFKNSLRTLAK 


2755 


16656 


A 


2774 


2 


364 


WSAVRRDWITALQPERQRETPSQKKKKK 



715 



WO 01/64835 PCT/US01/04927 



SEQ1D 
NO: of 

DUCIcQllOc 

sequence 

• 


SEQID 
NO: of 

sequence 


M, 
eth 
uu 

■ 


SEQID 
NO: in 

09/515,1 
26 

- 

■ 


Predicted 
beginning 

niirlAnHrif* 

UULICU UUC 

location 
correspond 
ing to first 

aminn arid 

residue of 

pC|PHUC 

sequence 


Predict- 
ed end 

niirlo- 

UULIC* 

otide 

location 

correspon 

'"H i no tn 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A~Aianioe 
OCysteine, D=Aspartic Acid, 

FlsOlutnmic Arid F=Phpnvf alanine* 

G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, {^Methionine, 
N=Asparagine, P=*Proline, 

y~ViuUiUi ills, *v r\l gllliuc, u' 'OCilIICf 

T=Threonine, V=Valine, 

VVzrTrvnfnnhon Y=Tvrnainp 

X=Unknown, *=Stopcodon,/=possibfe 
nucleotide deletion, ^possible 

niip|pni , iH<» iitCArfiAn 

lIULlCVllUv IlIoCI UUU 














KKKKKKKKKKJCGGVRHGVKS RKP *TSGR 

GLKKLFKTPGHVNGAGNPKFRNGKSGKP 
PEVHLNGAW 


Thrift 


1DOJ / 






191 




HLRSGVSDOPGRHGKAPSLLKIEKLAGS 
GGTCL*SQLLG 


2757 


• 16658 


A 


2776 


195 


3 


GRVDINTLLALLLI IITF*LPQLIGYIE 
KSTPYECGFDPISPARVPFSIKFFLAAI 
PFLLFDL 


2758 


16659 




2777 


3 


IDC 

285 


LPKC*DYR 


2759 


16660 

i 


A 


2778 


135 


379 


HWSATVKLYAILLGLIiKWB SGTLLCS KD 
VLFFL*G*SSAYSIESVIMQIKAT*VKG 
KARVQLGAKKLKAYWQRKSPGIPAG 


2760 


16661 


A 


2779 


364 


1 


GSFTGAVIIjI IAHGLTSSIiXir diANSNY 
ERTHSRI X ILSQGL PTLLPLIAF* *I*LA 
SLANLPLPPTINLLGELSVLVTTFS*SN. 

TIT T.T HVIT .MTT ATT* 7iT.VQT.VT T?T*PTY*mn Q 

LTHHINNIK 


2761 


16662 

i 


A 


2780 


356 


1 

1 • 


GCLRAHIWPQKGNHEGQVHLFIDKVCRQ 
PMTEDCINBITTQVAQIFLVHFLLRQSL 
TLSEKRKKRKKKRKKRNRIESPEINTHI 
YCQLIFNKGAKKIK*G*NSLFNKTYLDP 
WISTCK 


a mm 

2762 


16663 


A 


2781 


1 


A SI .J 

204 

■ 


AQVGIiQDASSPRIEEIjlTrKUHAIjII IF 
fclCFLVLYALFxVTLTT* LTNTNI SDAQD 
SETDCYSQSYIIi 


2763 

• - 


16664 


A 

• 


2782 


1 


522 


YKCNE CGKVFNQQAHLAQHQRVHTGEKP 

* KCNBCGKTFSQMS KLVYHHRLHSGEKP 

* KCNECGNTFHHNSTLVSHKAIHTGEKL 

IlN.kJNCiVJIaXVVrriWAil X XjnItrLClKJ.il X ACJUl 

YK*EECDKVFGCKSNLETHKKMQFSKTD 
SAFSLQ 


2/64 


\CCJiC 


a. 


2783 


1 


327 


SVSEKEKEQKQNFHLNAQSNCQKLKSP* 
KYLKHPE KTD I LS KASQYNNQLTADQ S I 
LKFKTKTFDQKKGRPKGHGMTYSNS 




lDOOO 




2/54. 


11 K 


IZo 


SSLPKCSD YKQE P P YIACATLKCYQI PN 
FYCWPH I FKRMPY 


2766 


16667 


A 


2785 


208 


3 


RPICPLSLWGVSFLFFFFFIiRDRVLICH 
TGYSAVAQ YCNTAHCYSPRLK* SSCLSL 
PSSWNHNLIPPR 


276/ 

■ 


lOOOO 


■ti. 

* 


2 /5b 


4V 




GVLNDLGIAEFNTCSKSLLLLIFFKGRI 
LLCCPGW I EWOSOliTAS STLGLK* TFH 
LSLFGSWDHR 


2768 


16669 


A 


27R7 


331 


187 


GCSE PRS P P CNAAWAT KGDS VS KE KKKLSf 
KKREEKKH* HNTSGEKINLI 


2769 


16670 


A 


2788 


325 


34 


RSQLRGLPSMSIS* I*NSRLR*LRPRRL 
FIFCRDRVSLCCSGWSQTPGFKQSTHRS 
LPKCWDYRREPSPLAQIRYLGLFFSQGL 
S S AFCYVAMVSG F 


2770 


16671 


A 


2789 


3 


150 


AASTRQLIFHFTSKHHFGFBAAA*YWHF 
VDW*LFLYVSIYWLG 


2771 


16672 


A 


2790 


86 


311 


NI HPLNDI FTRLKNG F KKKE I SLVKNEP 
NKGT* SLI S I K\7LFHT *K* F IRPGVAAH 
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M 
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oa 
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ottae 
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last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 

OCysteine, D=Aspartic Acid, 

ti— Ljiutamic Acia, r— rnenyiaianine, 

r-TlimiiiA HsHktiHiriA I-IeAlaitomA 

K=Lysine, L=Leucine, M= Methionine, 
r^pvtspuragiuc, jt *-~.r roiine, 
Q=Glutamine, R=Arginine, S=Serine, 

TtoTh rAnninA V=Vnllnn 

i — J III Gvll IUC) T TflllllC) 

W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=posstble 
nucicuiiQc insertion 














T*NPSTLGGQGGWIT*SQE 


2772 


16673 


A 


2791 


2 


123 


GGGACS EPRSHLCTP AWBTE * DS I SEKK 

novv* t rtT VTMT 
CtSKJX w iJUx X 1 JNJ. 


2773 

* 


16674 


A 


2792 


302 


162 


PLLRRLRPENCFNLLGGGFN * PRSRNCP 
PAWATKRDS LLKKNLKNF 


2774 

■ 


16675 


A 


2793 

- 


1 


258 


GGCSQLRLCHCTPAWETD * D P VS KKKKK 
KNFTRAKLiLiI FGTLVIjGKv KRGG PFKPR 
I LKL PWETWQNLVFT KKKKI FFLKNRGG 
GE 


21775 


16676 


A 


2794 


199 


3 


VGP I PRHDKPPHSKQLCTYS PAFTQLFS 
SGKILKNIHLWPGTVAHACNPSTLGG*A 
RWITRSGDR 


2776 


16677 


A 


2795 


1 


302 


GGCSEPRSCHCI PAWATEPDS I * KINK* 
INK* K* * KINRNKKGKNHFLSSST* * PQ 
VAS S FLTGQHRYTKL * SSQKVLLDSATL 
QSKSNAEVKRVNRTTD 


2777 


16678 


A 


2796 


228 


326 


NDNGQSGWAHACNPS ILGGQGGWI I *G 
REFKT 


2778 . 


16679 


A 


2797 


270 


1 


KFGISAPFAPSPKV* KRGFF*NFLGNGP 
PVKIPPF*TPL*NFSKPGKNFPPFF*KT 
RFSKI FPNGFFFFFFFFEGVLLCCPGWS 
ALTRSW 


2779 

• 


16680 


A 


2798 


307 


36 


FKNFCCG * EVSLCCPG WSQAPGVK* PS C 
LGFPKCWDYRCELPCLlASCSL*CDRNQT 
risawlwi efhiisp ilpvlplflkkpq 
AGLLCF 


2780 


16681 


A 


2799 


132 


5 


I FEVTV* CTK* HKRNMQPG WGHACNPS 
TLGGQGGW I MRSGV 


2781 


16682 


A 


2800 


32 

+ 


314 


KQHPGNGPNPSGKGPGRASRFLKKKNFF 
F*KTPPKKPHTPCKPSPKGKGPERIiPWH 
FFWQKFGPIiPMGPNPDQRAGFCLRDP *G 
GGKNRSPPTL 


2782 


16683 


A 


2801 


139 


3 


AASTFY IFFETTLI PTLAI I TR*GNQPE 
KJjNAIjI x r Xjr x I JUVCifciU 


2783 


16684 


A 


2802 


238 


3 


AASTSHVISSMYNI YI IIQFKTFPVFFF 
RDKVS LCHTGWSAVAQ S *LTAAS AALTS 
CVQG I LPTQPPE YHTRPLRLFL 


2784 


16685 


A 


2803 


110 


291 


KKIX5 VFGFGFGFKTKSNF IMQAKGQLPN 
LG*LKPLPRGFPQFSGIiSI*SGTWDYKHT 

rAlK 


2785 


16686 


A 


2804 


97 


2 


S YNNQDS WLEKEKTNRS VKQN * AP IPE 
KKKKKNKVGX3LTLPNCKTYYKATI IKTV 
WYWRKKRQIGQ*NRIESPEIDPHKY 


2786 


16687 


A 


2805 


268 


I 


ATFIjL F YYVFRPQ I P YYL* I F I LSI*F* Y 
RHSGPPYVGPPQQYPVQPPGPGPFYPGP 
GPGDFPCRLPIRKWWLVWLMPQLNPTVW 

TVT''T TtTl 
PTI1T 


2787 


16688 


A 


2806 


153 


3 


IPLPKGLLVPLFGVFLKVFFFFFFFF*D 


2788 


16689 


A 


2807 


3 


216 


NAARDHIVKPSP*PLTGAI*SALVMTSGL 
AM*CHSHS I SLLILSLLTSTLTIYQ +RR 
NVTR * TS YQGHHT PTDQKGLRYG 1 1 LFI 
TSEAFFLS * FF*AFYHSSLSPTPQLRGH 
WPPTGLS P LNPLEVPLLNTS ELLAS *VM 
PLSLH I ASNTKPTNQHTNHI PMKAQCNT 
INLIPRPPHTN 


2789 


16690 


A 


2808 


168 


354 


FLERESGF VAQAEGQGQNIiS * WKP F FP I 
LKHFSCLS LLRSWNYGPTSPVPAKVGGF 
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• 

od 
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correspond 

ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=(jiutamic Acid, r^rnenylalaiilne, 
(j=vyiycine> ti— Jrtistiaine, 1— isoteucine, 
K=Lysine, L=Leucine, M=Methkmine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S^Serine, 
P 3 * nreonine, v^vaiine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, v=possiDle 
nucleotide insertion 














WRNKIY 


2790 


16691 


A 


2809 


61 


360 


YVSNS KCSNHRK* S LFFFF F FERBSS Fv 
PQVELHGRDIiG*LQLWLPGFGKFPGIiTP 
LRNGDDGPRPQPPANLGLLVKTGFSPVA 
HliGVNIiGTljGDCPAIiP 


2791 


16692 

* 


A 


2810 


236 


3 


KMFFQMRYTMLKK*QATFGFILALKNVL 
KTFIiAP I FFFLRRNSALVAQAGVQWRDL 
GPLPPLPSGLKRFSCLSLPSC 


2792 

* 


16693 


A 


2811 


140 


210 


NAKITKC* KGYGKRGTLMHCCWQFKMGQ 
LLWKTGYQFLKKLKLELLSNVP I PYL * I 
WKKGNPYALLLAI 


2793 


16694 


A 


2812 


230 


1 


VKLCPVSLKTGVRPLLTSSBVI FILNCK 
FNiaV*IiLPGSLLLLLASAIVEAQAGSG 
*TVYPPLAGNYSHAEPYAIiV 


2794 


16695 

■ 


A 

• 


2813 


3 


345 


HE VRI DVHTRTCFT FGT IIIAI PTGVKV 
FS * LATLHGSNMK* SAAVL*ALRFIFLF 
TVGGLTGIVLSNSS LDIVLHDTYYWAH 
FHYVLS IGTVFAI IGGFIH* FPLFSGYT 
LD 


2795 


16696 


A 


2814 


2 


184 


ARVGFHHI DQAGLKLLTL* SAYFGFPKC 
WDYRHEPPRIiALKFNNIKKKKKKKKKTP 
GEKI 


2796 

* 


16697 


A 


2815 


276 

• 


3 


ARVHRIDHGLM*HQPDGLK*SLCLSLPS 
SWD* * SMQPCPDDYGTRSDQKKPTFKSP 
YVSQTGLEHLGSSDLPASASQRAKITGM 
SHHALV 


2797 


16698 


A 


2816 


3 


94 


HENQYSSLIIIMAIAIKLGIAPFHF*VP 
EV 


2798 


36699 


A 


2817 


93 


1 


GGGGCSEPRLHHCTPVWVTE *NPVSXKK 
TRA 


2799 


16700 


A 


2818 


1 


197 


GTRAQSLLLGRLRQKNQLNLGGRGCSBP 
KSGHLTPAWAT * PDLVSTKSCTLI YLVT 
QALHITLLP 


2800 


16701 


A 


2819 


109 


2 


GVF * G VFLAQGLTIiVAQAEVQ WHDIiGS L 
QPCPPRLKRLV 


2801 


16702 


A 


2820 

• 


2 


347 


ARAPGFAENEVFVFSSCFVWMVPGGKGE 
NAELMQPSSY*BSQHFFPAPPRCSSCV* 
LCSLGPSLLGTLI FOQFTLSBIiPRFRKG 
* FSSSLKKVFCFGDRVLLCEPGWSAWQ 
TQL 


2802 


16703 


A 


2821 


1 


353 


CTRRDVTRESTYGGHHRPPV QKGLP YG I 
ILFITSQVFFFT* FF*TFYHSTLTPTPQ 
Ij*GHWP PTGITPIjNPIjKVPXiIiW 1 o VliLP 
SRVSIT*AHHILI*NNRNQHSSLr,ITI 
IiLGLY 


2803 


16704 


A. 


2822 


261 


400 


VEHSNSWKENFIjGQGTGCHAC^NIiNTLGG 
RGGKI1 WKSGV*DyjjDyti 


2804 


16705 


A 


2823 


273 


97 


IiNTPRMQRIiTQLATVILRFHKDQGFAML 
PRfWSNS * AOATr!HTjOriPKC* DYRREPP 

GQK 


2805 


16706 


A 


2824 


159 


355 


KKFFSIFFFFFFFF*DRISLCHLGWSAV 
VQSWLTAAST 


2806 


16707 


A 


2825 


67. 


338 


LSPEL*LY*IF*KPFS*QLSLTSHSPSI* 
FSALLLLS ISPPTWLHSQSDKNLHLFSF 
LFRDSVtiLCCLGYSAVT*SWFTVASNSW 
AQWLS 


2807 


16708 


A 


2826 


3 


359 


HEQKYSFLHDSQTLFCF*DS I PTPSNMD 
ETQQKSRLELVRI SLLLIEPWLEPERLL 
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sequence 
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peptide 
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- 
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M 
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od 
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nucleotide 

location 

correspond 

ing to first 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, l=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q»Glutamine,R=>Arginine, S=Serine, 
"^Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RSMVANNLVYDTSDSDDYHLLKDLQEGI 
QTLMGRLEYGSRRTGQMLKQTYSKFDTN 
SHNHDAL 


.2808 

* ■ 


16709 


A 


2827 


3 


338 


LERNLDTTFLDPAGGGDPILYQHLF*PP 
GHPEAYILILPRFGIVSHIVSYYFGRLE 
PSGYLRMG*PIISVGLLGFIV*SHHIFS 
VGVDIHARAYS YFGSLVMAI PCTLBVFT 


2809 

i 


16710 


A 


2828 

j 


1 


406 

• 


RHEGEKLRRPTFGPRHRGAGTAKMSASL 
VRATVRAVS KRKLQPTRAALTLTPSAVN 
KIKQLLKDKPEHVGVKVGVRTRGCNGLS 
YTLE YTKTKGD S DEEVI QDG VRVF I EKK 
AQLTLLGTEMDYVED * LS S E FVF 


2810 


16711 


A 


2829 


332 


3 


GI I VMQ AT I ATAL IGYVLPLGQ IS L* GA 
TVIAYILAAI PFIGTDLVQ* I *VGCSVD 
SPTVTGFLAFPFVLAFI IAAVAALLLRF 
LQGTGS PNPLGGASQSDKIAFQFSC 


2811 


16712 


A 


2830 


186 


349 


YSEREVAFFFFGNGFCF*TQAEWNGGNL 
G* LNLLPPR* KE FS C KI FPRTWNYS P 


2812 


16713 


A 


2831 


303 


590 


NILTTLLNKQSKSNQLQNKI IHTVYIKI 
KIFFFFLRQSLSVTQADVQWHCLCSLQP 
PPPGFK* FS CLCLL£ S WD YKHAPLHPAN 
FCIFSRDGISPS 


2813 


16714 


A 


2832 


1 

* 


370 

■ 


EEL I TFRDHAVI IS FLI CFLVLYAL FLT 
LTTKLTNTNI LDAQD IQTV* TILPAI I L 
VL IGL PFliR I LYI TDE VDDPS LTI K*NG 
HQWD * TY* YTDYGGLI FNS Y I LRPLLLE 
PGELRLRDVDN 


2814 


16715 


A 


2833 


207 


1 


QF F I FLRHS FTLFAQAGMQWRDLGSLQP 
SPPGFK* FSYLSTLRGIARKITLAQEFQ 
TSLGNMGGPHPRA 


2815 


16716 


A 


2834 


223 


1 


DNLAHKGKTRVYLKSRNKLGKGGGAGNL 
VSLDS IGSRRDHRRAPPCPANFVFLV*M 
GFPHVGQDGPELLTSCSC 


2816 


16717 


A 


2835 


1 


321 


GTRKPS P * FVTGALS ALLMTS GLTM * LH 
FHS I TLL ILGLLTNTLT I YQ * WRDVARE 
STYQGHHTPPVQKGLRYGI ILFITSBGG 
LFAGFF * AFYHSSLDRTPQLGGQ 


2817 


16718 


A 


2836 


57 


370 


IWMGRVLWKDRLYGVFCRAFNRAVTSRV 
WAEDPWRVPKTLSVDPRKLPPFS*ISVR 

GQIYFT* FFFFFETESHTVAQAGVQWVfG 
DLG*LQPSPPGFKRFCLSLP 


2818 


16719 


A 


2837 


2 


345 


ARAHRQLDEP*L*RRPGERHPSW*SEET 
VERQRTKT+SESSQTGTSITSSRNARRR 
ESEKSLHLETLNKEEDCHSPTFKPSTPD 
HPLKVMPAPSPKENAWVKRSSNPPARSQ 
SS 


2819 

• 


16720 


A 


2838 


15 


376 


AKI E PL YS SLGNKS ETP SQKGKKKKKKK 
KGKGGGE KKKKKKTEGGTRGLTQETQFF 
GNPKGPKHI/3ARN* KHRAKQAKKGNQPR 
EYRFTEEKKGEKLFNPGGGRSQKPKTGQ 
WNSAGGKK 


2820 


16721 


A 


2839 


3 


336 


HELLAS I LLI Y * * CRDVTRER 1 1 PLAHH 
T PP VQKGLRYG 1 1 L FI TS * VCFLAGLF * 
YLYHS SLPSTAQLRGHWPPTGITPLNPL 
EVPLLITSVLLASLCSFI*AHHSLIEH 


2821 


16722 


A 


2840 


338 


3 


KI IFYLFFLRS IAFLAQIEGHWAVFSSF 
KLWPPGFKHFSASTFLVNRVFKGGPRAR 
VNFGFFIKSGFFPIF*VGF*LFFFFFFE 
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nucle- 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
bj=K*iuuimtc Acia, .rnenyfaianine, 

r-rivcino 14— HicttHinn I— 1 miotic! no 

ijr— ijriycine, jci— tiisiiaine, i—iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
n = Asparagme, r-rroiuic, 
Q^GIotamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine. 

W^TruntAnhan VaTvrAfltitA 

tv— *i rypiopnan, j = iyrusiiie, 
X=TJnknowr, *=Stop codon,/=possible 
nucleotide deletion, \-possible 
nucieotiae insertion 














TBSRSVAQAGVQWHDLCSIiQAPPPSSC 


2822 


16723 


A 


2841 


2 


i /in 

14y 


KKVECSNOJVCRTMCILCCFW 


2823 


16724 


A 


2842 


348 


175 


SACKWSITGMHCHA*VIVSVLLV*TGS 

JjJtl V vay VoljiSLiK 1 o UUrlrl i Luxo Voviij J. l\? 

MS 


2824 


16725 


A 


2843 


338 


118 


DPGGGACSEPRLRHCTPAWVTKRDFVSK 
KKKK * NI KC * HAKSGTVL YS FL YLFYS * 
YFLMRWSVHIPGPSTK 


2825 


16726 


A 


2844 


189 


47 


KCvAIjLLSLFLSCSVM*RNACFRFAFCH 
DCKFLEASPAMLPIQSVEL . 


2826 


16727 

* 


A 


2845 


335 


139 


EVSPSWPG*SQTPDLKRSACI>SLSKCWD 
YRPGVSHRARLNGSFLTQLYEMLTYFPI 
IWVTLQVFR 


2827 

• 


16728 


A 


2846 


3 


352 


HEQRLTPEWKKAATALGDVVKVGAVnAD 
KHHSLGGQYGVQGFPTI KI FGSNKNRP * 
DYQGGI TGBAI VDAALS ALRQIjVKDRIjG 
GRSGGYRSGLQGTSYRSSKKDVIELTND 
SFDN 


2828 


16729 

• 


A 


2847 


2 


269 

• 


ARG L F S TNHKD I G TL YGL FGA* AG VLGT 
ALSLLIRAELGQPGHLLGRDDISRIHVK 
RI I PG CGTLD G I S VGRE CACLWTPACS R 
KCARG 


2829 


16730 


A 


2848 


317 


28 


SFLPN* KLAGHGGRPL* FQVLRRVRLEG 
CFSLEG*GCSEPWSCPCPLTWATETDPF 
SREKKKRRRREEKAEKRRENRRVPSLSV 
FLEEYVELTLIL 


2830 


16731 


A 


2849 


326 


107 


FHHVGQDGLDUiPS *SAHLGFPKCWD*R 
CKPPCPASRNFKTLKLYHIFSNNLIE*N 
* KS I TRGTLE TVQLPGN 


2831 


16732 


A 


2850 


224 


3 


SILQAWLECVPSPPGNPIiGLLQSQVDFFF 
♦RC^LAI^PGl^CSC^IMAHCSIiKLLGS 
TDPPTLAS*VAGTPSSC . 


2832 

■ 


16733 

• 


A 


2851 


27 


329 


VRTAAINRPGFLLPCFIGQNFFFY*RFE 
TYRLGV*MILTPH* YLI*S SLSGLDCV I F 
FSFILFFYWFIVYCHVFMVYLFFFLFVI 
LFVLCVFWCFYFLLFV 


2833 


16734 


A 


2852 


351 


60 


NLLSSLQGGRLPRPPGIiSPHPQIFI PHCs 
GVPL*LPGPWKSRPEASLNPRS*PPPGQ 

PRGNPFPPKKNFFFFETVLLCRPGWSAV 

nni?nirvtfMT T if 
AUr VlLKJS-Ni-iJj A 


2834 


16735 


A 


2853 


186 


365 


LTQL1jNLI*YQNQIEYPQPYAQCX3PRSG 
NAy Q X Vsrlbr PrlQs w ^VFAo^rUioLiJr W 1 n\i 
GLKK 


2835 


16736 


A 


2854 


17 


239 


HFTTLQISMSLLTETLARHDSTHLWSQL 
WTTEQDPVSRPGAVASEH 


2836 


16737 


A 


2855 


325 


140 


GFHYVSQDGLDLLTS*SACLAFPKCWDY 
RHKPPdiAAYIF IOSRVFIiAVTSISTHS 
SFQLL 


2837 


16738 


A 


2856 


1 


157 

• 


GTRGFHHVGQAGLE VLTSDNLPVSAS QS 
AG I TGE S H * AHERS DGYANTNS PG 


2838 


16739 


A 


2857 


157 


361 


LLLRIVSLFWRKSSPLGVQAEMYRPDLP 
VTKPPPPD*DKDSCLSLLITWDYRLAPP 
LPANLEF*I*TG 


2839 


16740 


A 


2858 


3 


346 


HEGNHKFKI INVHLR* LNQNL* SGLEGD 
KVIWRNI*TSVTFLFC*MNGXMNPDLFR 
QFTYEVGRIFFFSLKD*VWLCCLGSSTV 
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OCysteine, D=Aspartic Acid, 
E=Giutamic Acid F^Phenvfalanine. 

G=G)ycine, H=Histidine, I=Isoleueine, 
K^Lvsine, L—Leucine. IVI=Methioninc. 
N^Asparagine, P=Proline, 
Q=Glutamine,R=Arginine,S=Serlne, 
T^Threonine. V^Valine. 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, possible 
nucleotide insertion 














DLGSLOPPPLGPKOPSCLSLPSSWEPPC 
PA 


2840 


16741 


A . 


2859 


18 


233 


KNPTILCCKGGGAQA*KPRLLGRLRQKK 
GLT PGGEG CRDLKS PLGI P PWATRPKLI. 
LKKKKKKGGKKKTLF 


2841 


16742 


A 


2860 

■ 


356 

. 


3 


RVNFKI FGKKG FCPGGPGGFBVLTPGDL 
GPT.PFOKGflDS PS KPLKPPFFCFATJfST 
TLNYFQFAFCVLRPTPN*SVTQAGMQ*H 
DHSSLQPQPPSLK*SSQLGFPSSWVYRH 
APPSC 


2842 


16743 


A 


2861 


115 


374 


IVCGKILNV*YTQ*HNKLQNKNKVD*PT 
IKLMLISQSLGILIFIQKRDFEVNYGMI 
LKKRGWPGPVAYVCNSKTLGGQGGQIT 


nail 

2843 


16744 


TV 


2862 






T .TfT3 T MT.T .TT.HQHDHTCPHnPfJV1Y1T.Tlft T 

GPRWY *TYEYTDYGGLIFNS YI LPPLFL 


2844 


16745 


A 


2863 


1 


236 


GTSPFFSFFFLSFFPSFLPFLSFFNPTT 
LLLSI^NIS*KKVLFSFEMESHSVNPOi 
ECSGTISAHCSPAWATGTDSIS 


2845 


16746 


A 


2864 


67 


364 


VLvCVvVLVovo VLvcVLV X XV- 1 x luni 

VCA*YRIRRYIMLPTLVYNSCPEIHDSK 
CLTVWIiHICLSVCMD 


2846 


16747 


A 


2865 


2 . 


155 


ARGLTQENRLNLGGGGCS E LRSHHC I PA 
WATE*DALSQKRKKKKGTYRGI I 


2847 


16748 


A 


2866 


344 


81 


I QPNS FI FLKI F I SRDGALLCC PGWS PN 

L* KHYMYKWHKVHCVFSHYNKI FTGQT K 
TEN 


2848 


16749 


A 


2867 


217 


340 


RVFSDFYMNLKWSLRKRRTWPGLVAHAC 
NPSTLAG*RGQIT 


2849 


16750 


A 


2868 


** 
3 


349 


UPaTCDTTnTJT.TTITHnHBT.TTTTT.TC'V 1 

ilciAi &Jr J. 1 WJ£Jux i r JlxJluLLiJ. J. li jjJ.i_x.Lf 
LLYAIiY LTLTT KLT I T I MSDAHE I HTA* 
TILPAI ILDLIDLPCLRILYVTDEVNDP 
YLTITSIGHQWY*TYEDTDYGGLI FNS Y 
ILT 


2850 


16751 


A 


2oo9 


I 


.352 


riTT? T n WTP ft YPT Q aTT/S T ft T PTRWUF 

S * LATLHGSNMK* S AAAL* ALGF I FXiFT 
VSGLTGIVLRNSSIxDIVLHOTYYVVAHF 
HYVLS IGAVFAI IRGFIH* FPLFSGYTL 

AAA V Aiw A VA^% V A* *i AAWi A** AV A> AJ A A A AJ 

DQTYA 


2851 


16752 


A 


2870 


2 


342 


ARARFRT5TiIiTAFAI*L*LPWIiQEAGAAQ 
TVPLTTLFDHTMl^DHPAHQLAIDTYQB 
FEETYI PKDOKDS FLHDSOTS FCYSDS I 

AT AYjfij X A A Ay A\A^^<A\AV>J A> Hill Aw4 w A> iJI^ 1 A 

PTPSNMGETQQKSNIVBLIjRIFLLLIESR 

L 


2852 


16753 


A 


2871 


254 


2 


YPGYQAI S Q SDMQS YAYRHL* LQPPGVK 

* SS TVS LltS S *DHRP VPPCLANE KNFFF 

■ QtJ A V W *■ 1 A4lTJ AVXaAVAI V XT ATA i i JfW A3 AvA^ A7 AT A7 

* RQGTiAMIiQSRTiVT 1NSWT1T 1T1YSHARPR A 


2853 


16754 


A 


2872 


282 


3 


HTHILHYIYIHHGHIST*HLSPPOTIYS 
LCNIKILI YTP YDH* I IRIPIiHQEHAVH 
RRJiLIHXHIOTSVCVWCVSVCVACVCV 
CVCVWVLV 


2854 


16755 


A 


2873 


278 


3 

■r 


LWPLKNSGPQPWPQMDLGKPQVKTPWY 
YKKGP*KKFFRKF* KWPGFFFFFFIiRQS 
HSIAimGVQVTRDLGSLQPLPPGFKRFSC 
LNQISSC 


2855 


16756 


A 


2874 


2 


348 


A!rA*SL1l^SL1 1 PNATSNLLGLLP YSF 
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SEQID 
NO: of 
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M 
eth 

aH 
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SEQID 
NO: in 

09/515,1 
26 
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beginning 

niicicuiiue 
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correspond 

ing to first 

ammo aciu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

uing to 

last amino 
acid 

residue of 

peptiae 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

IT— fZliitfl mlc Aplrl FT^Phonulo lonin a 

MLr- vi iii ia iii ii- rtuii, r—- rnenyiaiHnine, 
G^GIycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=ProIine, 
i^otuianiine, rv = /*rginine, o — oerine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucieotiue ueieuon, \— possioie 
nucleotide insertion 








■ 






TPTTQIjS INLAMD I PL * AGTMVI G YSS K 

TDITTVT 7VTJT?r.DT3/*l r P'D , T r U'T.TT3TT.TVM TtPCMC 

x KiifLUAxlr JofKo 1 lr l illj J. r 1 JjAP1.L,e»oiY1o 
LLIQPIAMAVRVTDNITAGHLTMPLTGS 

ir± V 


2856 


16757 


A 


2875 


1 


260 


GTRENHLDPAGRGCSKSRSHHCIPAWVT 
D*DSLKKKKKKKPQ 1 1 LGNGGTEGNQK* 
RGNPFRGPWGPSRQKIKELGPGGKHGKS 
Ivr 


2857 

> 


16758 


A 


2876 


176 


1 


EHLFCKKLLGEEKGSLSKEFGKCWVNPR 
*KWNSRPGWAHTCNPSTLGGRGGQITR 
SG 


2858 


16759 


A 


2877 


168 


1 


GVRVFGFSSVPE FSrDTCAGAQWRNIiGS 
LQPLP PGFK* FS CLSLLSS WD YKRTPRA 


2859 


16760 


A 


2878 


67 


333 


WQGIjGRAVARVPRS YE ITWAGNLKGAA* 
FQQLFFFFFFHLLFFFFFFFFFFFFFFF 
FFFFF FFFFFLFS FFFLFFFFFFFFFFF 
FFFFL 


2860 


16761 


A 


2879 


335 


1 


GRKEGRRSKEKBRKKKKERKKERKKTIF 
KKHIPSPLAFPRKAWKLKEIQIKIiVAH* 
KINRQAI RGCP PNKRPHPFQNMI KPGRE 
EGEGGLETSRDGERNLWI FGPSVGPLR 


2861 


16762 


A 


2880 


3 


345 


HERHETASIILLIAILFNNILSGQ*TIT 
NTTNQYSSLII IMAIAIKLGIAPFHF* V 
PEVAQGTPLTSGLLIjLT*QKLiAP IS 1 1 Y 
QIS PSLNVSLLLTLS I LSI I AGS *GGIiN 
QT 


2862 


16763 


A 


2881 


347 


156 


WLIFVIiLVEMGFFHSGQAVLK-LiLASSDP 
ASMTSQSSGITGLRHCAQPR*AFLIALC 
LPPSAKI 


2863 


16764 


A 


2882 


338 


2 


I KKAJj *H£i * LKBKKKo PIKToO o SKAr w 
NLPRSKKKKESSNNYQLL*AHKLQKVKE 

JW2TTTT !?TDWT.DT.T.Qn7Uai7DT,Wrtl>T , PCQ 

KIETL I KKP YHPEKS YGHDE VTAKFSRA 


2864 


16765 


A 


2883 


206 


1 


IRTRPFTPMFIFYMFVYLATRSCPFTIiA 
WDYYACLHAELV 


2865 


16766 


A 


2884 


341 


140 


DGGCREPRSRHCSPAWVTE*DSKQNKTK 
QKQKRYTNISSPQTSPLCYS FSAYTPIiH 

J^lrUrUll xiir OO JJ 


2866 


16767 


A 


2885 


360 


247 


FHR VSQDGLDLLTS * SAHLSLPKCWD YR 
REPPRRPKAL 


2867 


16768 


A 


2886 


229 


351 


KLNNRPG VVAHAFNPSTL* KLNNRPGVV 
GSTP 


2868 


16769 


A 


2887 


359 


2 


FYSSSSSSSELVPCRQDVQVPHYLEGLF 
LRSCFMEIKYDT* KRKIKYTHVHTHTHT 
HTHTHTQRE KERDFPRFTAKW KANLEAG 
SGYATASS I RAVL WTT I TLV 




1 A"*7*7A 
10/ /U 


A 


Oft Oft 


4 




AD53HMWK YRT^PGI TKILLKKSKI * GYHO 
DWI S SRYQDLFHKDMI FKIMS YWYKKRQ 
VDHWDR I ES PETS S QMYEHL I YNKDNIA 
EW*EKDSFLSKWCWDN 


2870 


16771 


k 


2889 


1 


436 


IAILT * YDYTLL* RVGPRGQEFGTRATV 
ITNLLSAIPYIGADWK*G*GGYSLDSP 
TLTRFFTFHFILPFIIASLATLHLLFLH 
♦TLSNIPLRITSHSHQITFHPYYTDQYS 
LRLLPFLLSFTTLLLILPNFLFYPP*TT 
LTFPP 
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NO: of 
nucieouae 
sequence 


SEQID 
NO: of 
pepnae 
sequence 


M. 
eth 

Ail 

oa 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 
nucicuiiuc 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 

ri n ji luni Jtit 

sequence 


Predict- 
ed end 
nucie~ 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A^Alanine 
OCysteine, D=Aspartic Acid, 
Cj — vviu iu ni ic stem, r^-r oenyiaiBmoc, 
G=Glycine, H=Histidine, l=lso(eurine, 
K=Lysine, L=Leucine, M= Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
/v— unknown, "—atop codon, /=posstDie 
nucleotide deletion, ^possible 
nucleotide insertion 


2871 

■ 


16772 


A 


2890 


1 


218 


RKISPLIKLINHSFIDLPTPSNISA* *N 
FQSLLGACLILQITTGLFLAMHYSPDAS 
IAr XAriX XKDVN X tj w 1 X KiIjhANumo 
IFFICLFLHIGRGLYYGSFLYSET*NIG 
1 1 LLLAT IATAFIG YVLP * GS LLGACL I 
LQ ITTGLFLAMHYS P DASTAFS S IAHI T 
RDVNYG 


2872 


16773 


A 


2891 


3 


368 


LSNS * ANNLI * LAYTIAFI VKI PLYGLH 
L*LPKAHVEAPIAGSIVLAAVLLKLGGY 
GIIRLTLILNPliTKHIAYPFLVIiSL*GI 
IITSSICLRQTDLKSLIAYSSISHIALV 
VTAILIQTP 


2873 


16774 


A 


2892 


3 


342 


HENLI *IiAYTIAFI V KI PLYGLHL*LPK 
AHVEAPIAGS I VLAAVLLRLRGYGI IRL 
TL X LNPLTKH XAYP FLELS L* G 1 1 XTSS 
ICLRQxDIjKSXjIAYS S XSHXALVvtAIIj 

I 


2874 


16775 


A 


2893 


2 


361 


ARVCLRQTDLKSLIAYSSISHIALWTA 
ILIQTP*SFTGAVTLIIAHGLTSSLLFC 
LANSNYERTHSRI I I LSQGLQTLLPLIA 
F * *LIjASLANIiALPPTINIiIA3EIjSVLVT 
TFS*SWIT 


2875 


16776 


A 


2894 


229 


3 


YTFVQSFI FSWLFICKVCLL* PAHSMV 
CACECMCVCVCLCVCLSSLKI I PLLECL 
IYLQLILVLSSAIYNSRFE 


2876 


16777 


A 


2895 


3 


391 


G FL ITNNIS PAS P FQTTI PLYLKLTALA 
DTF LGLLTALDLNYLTNKL KI KS PLCTF 
YFSNIXiGFYPS ITHRTI P YLGLLTSQNL 
PLLLLDLT*LEKLLPKTISQHQISTSII . 
TSTQKGX I KJj YrJjo FFr 


2877 


16778 

* 


A 

- 


2896 


381 


2 


GRCRVSGS I IDHRMAPQKWR YTKGGPHQ 
WHNFCFF* KKGVFPYGPKVFLFRAPVFS 

13 CV^ D D TTD Ut? T in 7 # T 1 1> 13 O S T . 1? I7T? 1? I7WI? C\7 

Pr\»Pi*AJKfvis±iN.v w 1 trPtrALit rr eurrr V 

EMGSHOTAQAGLKLLGSSTPPTSASQGA 

l?T 2\/"lT7QTTD ZVODTJ2VQ 


2878 


16779 


A 


2897 


324 


1 


I,YTl!OTKHINPQQimTK*YQRNTKNRRK 
KEQQTKNNI KRHI TESARKNS * *HRNIQ 

KKKKI ENSHAASD* ITSSGGRSRA 


2879 


16780 


A 


2898 


120 


345 


PPAPSS*YTTSLIQDRLFLMMAVIiSSAS 
VLDIMEAISLLIQPIAIiAV 


2880 


16781 


A 


2899 


381 


219 


CVSQDGLDLLTS * STRLSLPKS WDYRCE 
P P R PAG S GHL YS S FKKQTGDTHHNF 


OQOI 

zool 


lO/oZ 




2900 


1 


224 


T . CHPT .TT .VP2XVT .T.7AT . P VT?V PT .TTT T .TYT T 

LPRLVPYSWP*ASSPASASQSSGITGMS 
HYTWPLPQTLNP FLFFVP 


2882 


167o3 


A 


2901 


1 

• 


228 


VVT.TWMD T .TT"TVYI* T TKT.TClTrtMTTTnW 

T * GY* YNRSGSS FNGVYDYELLRSESEH 
P WMI VDNTE YDE I YTRGGIE 


2883 


16784 


A 


2902 


2 


332 


LPPSFSLIiLAPSPSSSLPPSPSSSLPPS 
PSSSLPPSPSSSLHTSHSSTLTPSASCT 
LTITESFTQRLAQSYITILP*ASRTTMQ 
LLKYRTLQTTRNNTHSMTRPKALTKI 


2884 


16785 


A 


2903 


3 


243 

* 


DIGFGTDFSDMTPKAQTKQVDQIiDFIKI 
* NFWPGTVAHTCNP S TLGGRGM * ITKLG 
V* DQPGQRGETTQKLAGHGGVDAV 


2885 


16786 


A 


2904 


179 


2 


CGYFNAYICYSLIiCFLYLSLCNQPFWiaC 
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M 
eth 
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n 11 rl pwkt irl a 
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ed end 
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correspon 

din 17 to 
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residue of 
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sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

F^f^ltitflmic Acid FWPhAnvlalftninA 

(Xllycine, H=Histidine, I=Isoleucine, 
.vsine. ¥^=1 .eiirSnp ' IVf=IVfpthfnnfnf> 

*V l_fJO lilt*, MJ AJVUblUC, 1TI — ITACllllUII 1I1C, 

N=Asparagine, P=Proline, 
0=Glutaniine, R=Arpinine S=Serine_ 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=6top cod on, /-possible 

mif*lAnH/)A df*lf*tinn \s?nficcihlA 

IIUUvvUUv UCKUulli iT^l/USSI l/ICT 

nucleotide insertion 














I FLPLFGLVFFFFFF*NRV* FCCPGWSA 
VSQ 


Zooo 


lo/o7 






ins 

lUo 


z 


SIiIIFIA 


2887 


16788 


A 


2906 


318 


2 


YKESNSKNNEKKKITRQRKRQKKNRPSM 
MVHTCNPSPMAGRDGQI PS I LFNQPHSP 
RI IRLTLILNPLTKHIAYP FLVLS L * G I 


2888 


16789 


A 


2907 


17 


308 


KLAG YGGMCIjWS KLLERPTRMNHIaS PRS 

PrtT .MP D+ <3TTMP T PTOJTP * fiMPVQWl T TY3 

FCFLTPNXjAMP PRLNLSGP ICCQLSLRL 
LGPRLPAASVSGV 


2889 


16790 


A 


2908 

- 


378 

• 


1 


RSLHSTETAAKTHGHQTHPGLSTFSK+T 

DTT) T 2V G T "D T "DWDITNT/T . D A T7WC TTC a C VI? 

SECVLTSSFKE I IADFR* KYSSTN*HP Y 
HLFLLKFVFFCSRDGSLTMLP IRPGLNS 

rCuj IJni. LirrirrV .f ir ivV J-J 


2890 


16791 


A 


2909 


1 


129 


HLFGTNHRDIGTIiCIiLFGA*AGV2jGTAL 
SLLIRAELGQPSTRP 


AAA « 

2891 


16792 


A 


2910 


1 


323 


FGNKVWFCS EG *RAGDQLL IME PLASGF 
KGIFFLNLPQNWE*RVSPKLPGKFWIF* 


2892 


16793 


A 


2911 


367 


136 


ETPGNHLSPGV*GCSEIi*SCLCTPAWAT 
EQDPIS KQKRTKKS KTLLKNTKADLTRW 


2893 


16794 


A 


2912 


258 


140 


FLIFFCLDCFLNIKVLGVPGFFFSLKKK 
GSRNPLTIiAPQIAGATGMSH*VQGYNEL 






A 




AO. 




V^M T Pr*\7T 1 T T. PVQ PT^ QTY? T PT .TYTN1NPT. 

FY * STYPIAFFKTESHS VTQAGVRWHHL 
GSLQPPPPR 


2895 


16796 


A 


2914 


309 


76 


GPI ICVCVCVCVCVCVCWFIFTEYSFH 
YLEVYNISFEVWASLFNRPLLL 


2896 


16797 


A 


2915 


1 


122 


FSVETGFHHVGQPGIi*PPTSGDPPSLAS 
QS AG I TGMSHHA 


2897 


16798 


A 


2916 


167 


1 


GNMCS KESVSGTNINRKPD * * YNPR3UGT 


2898 


16799 


A 


2917 


2 


356 


ARGI1»IjLRIMLTTLTRYP**RDVTREST 1 

YQGHHTPPVQKGLRYGIILFITSEAFFI* 

PRFLPPFLSSSFPPTPSLLKPFPPSSFP 


2899 


16800 


A 


2918 


2 


351 


ARE YTSIiQLI LQMTF IMAFTCTDRTL YE 
LAFEC* I^RTLDI ITR*GNQP*RPNAGS 
YWLFYTIxIXnijPLLlMilYTHxmjGSxxN 
ILLLTLTAQELSKS * ANNLI * LAYTIAF 

TWT 




1 DOUl 








321 


A1J4GITFFFFIjGKGVTjAPRGGGRGGNPG 

lkggpppglgpfsg*slqggga*gpppo 

AGANFGFLIXxRKTRGSPGCPGGFWIPGP 

gdrpagapqggg 


2901 


16802 


A 


2920 


1 


361 


gtstrlgvlllslhhags inplgitlhs 

DKVTLHPx^IKDAl^IxLLFJaLSl^n'LT 
LFSPDL^DPDimiiANPLNTPPHIKPE 
* YPLFAYTILRS VPmOiGGVLAliLLS IL 
ILAIIPIL 


2902 


16803 


A 


2921 


3 


152 


HERLYSVPLRILRRLPDPLSIPQGWRHV 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
Jb— Olutamic Ac id, r— .Phenylalanine, 
G=€lycine, KNHistidine, I^Isoleuclne, 
K=Lysine, L»Leucine, MNMethfonfne, 
N=Asparagjne, P=ProIine, 
Q^GJutamme, K— Argimne, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan,Y=Tyrosine, 
X^Un known, *=Stop codon, /-possible 
nucleotide deletion, ^possible 
nucleotide insertion 










• 




IRESAYKSHLTLPARRGLQDGI ILFITS 

EVWFAGFF*AFYHSSLCFrTOl>*(3iVfP 
PTC I TPLNPPESPLLNTS VUiASQWI T 
* AHTKAI SHYLREEAFKTG . 


2903 


16804 


A 


2922 


410 


1 


RAGYRQKWGSLATV IGQLGLPVERGWYG 
PSGBGGTPQRHCSELVLDDLHVITQNRA 
NHRPRCGGSLLSELKFAPLALQPGRQSK 
TLSQKKNKQTTTKKNTKKQPGSVAHACN 
PSTLGG*GRQITLPSGVRDQPGQH 


2904 


16805 


A 


2923 


2 


145 


ARAARGDVI *S FHVFAQRDIiNliLSSRDL 
PALASQSTGITGMSHQCPG . 


2905 


16806 

» 


A 


2924 


29 


340 


EMNEVERRGRK* LFHDSIQSR* EYRCAP 
QCPANS CVLM * RWGFAMLP KLVS 8DLPA 
LASQSAGITGVSHCAWLVFLPLLSTSPI 
SQRHLYLPRA 


2906 


16807 


A 


2925 


2 


337 


ARVLILPGFGIISHIVTYYSGKKEPFGY 
IGMD*AMISIGFLGFIV*AHHIFTVGID 
VDTRAYFTCAT 1 1 IAI PTGVKVFN*IiAR 
LHGCNMI * SAAVL* SLGFI FVFIVSGQA 


2907 


16808 

* 


A" 


2926 


76 


388 


RYCTPAWATETPSQKKEKKKKCQTFlEI" 
S S F P VC LENGGGGN F I SLF FLFL I ILS L 
FFYFY* FFFI I FVYSYYYFYFCLFSYFF 
FYFFINLIIFLLLNYFFFFI 


2908 


16809 


A 


2927 


48 


374 


KDQBPTDMGSAHFQVFKGWRQVGGANED 
RINSHGGPVMFSGQYYGQLVRFLVY* 1 1 
I FFMLLFFS LFYS YFLFIS I FFVLVFYC 
LFIFLFYFSFFFIFVFFFYYFIVIF 


2909 


16810 


A 


2928 


3 


294 


HEGFHYI SQDGLDLLTS * SARLGLPKCW 
DYRRBPPRPARFPTLRIWLYILKALYTV 
LGCYREYEAYRPWCWTHRMDGGQEVEGR 
REGTGGGHILGF 


2910 

■ 


16811 


A 


2929 


7 

■ 


420 


IRRRDVLLLTLTSLSPTAARICYNGGRR 
GNRLNS LAYLS KELLAAWS LRKPS HGLL 
TPIRCVLYIRGHHF* LLPPSHLLLIASA 
IMGAAGGTG* TV* PTLARNYS QPGVCVN 
LAMVSLHLSGVSS ILGAITFITAIMNME 
APAITQYQTPLFV* YGLITADLLVLSLP 
DLTAGITILIiTDRKLSTTFFDPAGGGDP 
ILY 


2911 


16812 


A 


2930 


11 


253 


GLLHKAPSP*KFFFSPKPFNFFWKFSPI 
FSP P KKKFLSKNPHI VFKF P P FKGKI FT 
FPPPLKFGPPRVFFKAPPPFFFFFFFFF 
FFFFF 


2912 


16813 


A 

♦ 


2931 


70 


341 


DSS F FS * LATLHGSNMK* SAAVL * ALGF 
I FLFTVCGLTDIALPNS SLD IALLDTYY 
WAHFHYVLSI/3AVFAIIRGLIH* FPLF 
SGYTLDQS * AKIKCAI I FIGVYI TFFPH 
HSLGL 




1 A0 1/1 

J0ol4 




">cvn 
2.y5l 


i 




W V<3H PO A <! VNT T WT VHT i AO V55f T T.RA T 

■W UnCVIrtw VliUAAI X VflUnwtOvtUKIU 

NFITTI INI KPPAITQYQTPLFV*SVLI 
TGVLLVLSIPVLSAGITILLTDRNLNTT 
FFDPAGGGDHILYQLLF*LFGHPEAYIL 


2914 


16815 


A 


2933 


2 


364 


RVQKGLRYWI ILSITSEVLLFAGYF*AL 
YHSSLAPTPQLGGHWPPSGITPLNPLEV 
PLLYTYVLIiASRVSFT*AHHRLIQNNRN 
HI IQALLITI LLGLYCTLW3AS *NFEPP 
FTISDGIYG 


2915 


16816 


A 


2934 


2 


367 


PRVRPRVRYLLFGA*AGVLGTALSLLIR 
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nucie- 
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fi ft n n i r\ 

□ing to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteJne, D=Aspartic Acid, 

V — 1 1 1 to m 5 f* A plrl fiWDh Am/In lAnSitA 

£j — oiuiuiniL /\liu. r-~-jr nenyiaianine, 
G=Glycine, H=Histidine, I-lsoleucine, 
K=Lysine, L=Leudne, M=Methionioe, 
N=Asparagine,P=Proline, 
\^=uriuutiiiifie, iv— Arginine, o — serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJn known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucieotiae insertion 


- 












AELGQ P GNL LGNDH I YNV I GTAHAF VI I 

I?!? TtfT T> T T Trtfil?fiM* T JT2DT.T Tfia DTIM& 1? 
rrlvlrlX luur vsJN JjvjrJj 1 1 wurlJHrvr 

PRINNISF* LLPPSLIiLLLASAIVEAGA 

R1X3*TVYPPL 


2916 


16817 


A 


2935 


3 . 


365 


YHIV*PNP*PLTWALSALIOTSGLTM*F 
HFHS I TLVII/3LLTNTLT I YQ * WRDVSR 
*bi lyUnnirrVyKyjjKiwiriljr 11 W UK 
FFFAGVF * AL YHS S LAPTPHLGGHWP PT 
GITPLNPliE , 


2917 


16818 


A 


2936 


3 


144 


DSHGRHVING*TCVTICFIRQLIGHPTS 
KHHFGFEAAA* YWHKKKKK 


2918 


16819 

i 


A 


2937 


3 


422 


QRLLATNHHDIGTLYLSE\3A* PGALGTD 
LSLLIRAELGQPRDLLGNDHIYNEI*TG 
HALGI LFF I GLP I I IEGFGN* LHALIIG 
APdMALPR I NNI S F * LLPP YLLLLLASA 
IVEAGAGTG+TVYPPLAGNYSHPGASVE 


2919 


16820 


A 


2938 


3 


382 


RTRGLFSTNH ID I LTL YLL FGA* AGVLG 
AALSLLIRAELGQPCNIiLCNDHIYNVIV 
TAHAFVIIFFIVIPIII*CFGN*LLPLI 
IGAPDMAFPRINNISL*LLPTSLLLLIiA 
SAILEA*SGTG*TV , 


2920 


16821 


A 


2939 


225 


3 


NISWRMKQIVPKGEPYNGVPVFVSLWFH 
KMFI F ETGSHSVYQAGVQ*RHXiGSiiHPH 
PPGIKQFYLSLPSSWDY 


2921 


16822 


A 


2940 


1 


247 


PTRPPVIYSTIFAGTLITALSSH*FFT* 
VGLE INMLAFI PVLTKKINPRSTEAAI K 
YFLTQATASIIXiLIAIIjFNNIKKKKK 


2922 


16823 


A 


2941 


3 


369 


TRDST YIX3HHTP P VHTGLRYWR I LFITS 
DAFFFAGFF*AFYHSSLAPTPRLGGHWP 
PTG ITPI*N PL EV PJU Jj YTb V JjIlASG V b VT 
♦AHHSLVEDDRYQI IQALLITILLGLYF 
TLLQASEDFE 


2923 


16824 


A 


2942 


3 


401 


LTVCCVIVLRLKTLFFFFNKPFLTQKGY 
FNTPEEGFFKKPNRRVGPPSPMTDPTML 
1 JNJU X KAja. V fc»KAP PK 1 LiMuuti A JM Mirror 
VTPRAPFPLTLRLNPMLQQGIDliLTLKA 
SGGSPASGNFLKELGLRSINF 


2924 


16825 


A 


2943 


189 


348 


RNARGVTVTDKEKRLRLGMVAHACNSSTL 


2925 


16826 


A 


2944 


2 

4 


349 


ANSNYERTHSRI I ILSQGLQTIiLPLIAF 
* *LVASLANIALPPTINLLGELSVLVTT 
FS*SNITXXLTGLIILVTALYSLYIFTT 
SQWGSLTHHINNIKPS FTRENTLMFILL 
SPIL 


2926 


16827 


A 


2945 


621 


929 


GLooGTGCCP I ljCDliPKirW&I^Kto VUGo r 
SSTAHLCPRGWRSGRCFLPPLSAS*VDS 
AMSL IQAAKNLMl^WQT VKAS YVASTK 
YQKSQGMAS LNLPAVS WKM 




lOoZo 


j\ 
*» 








SAG ITGMSHCARPLVATS IHKI DNRYI D 
DRS*NINIGTIFINLLIFYLSIYIjSIYL 
SIYLS IMYSFTVAQAGVQWRDYGSLQPE 
SPGPK*SSCISLSSSRNHSHTPPHGWVD 
PKIP 


2928 


16829 


A 


2947 


2 


357 


htyd i rkprr* t1acals aiilttsglam 
tix:fhyitllilclltntltiyq*wrde 
trestyqghhtppvqkglrygi ilfits 
eayffagff * afyhs slaptpq1x3ghwp 
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peptide 

sequence 


Predict- 
ed end 
nude- 
onae 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
£=Glutamic Acid, F=PhenylaJanlne> 
ijr— ijriycme, h— Misname, l=lsoleucjne, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glotamine, R-Arginine, S=Serihe» 
T=Threonine, V=Valine, 
W= 1 ryptopn an , Y — Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PTGITP 


2929 

• 


16830 

- 


A 


2948 

i 


2 


357 


PRVR YS TDHSD IGTLYLLFGA* AGVLGT 
AISLLIRAELGHPGNLLGNDHIYNVIGT 
AHAFVIIFFIVIPIIIGGCGN*LGPLII 
GAP DMAFPR I NJJ I S F * LLPAS VLLLLAS 
AIVKAG 


2930 


• 16831 


A 


2949 


2 


217 


KNLKIVQYGDMCLWSQLLRELRWEDRLS 
SGG * GCSELCSCHCTPAWTTQQDCLKKK 
KNFWPVGEVKGRHMlj 


2931 


16832 


A 


2950 


2 


354 


AI PMTTARLTIHEAYLI IIiERTTTTTKD 
VKNPRRIAAAITASCLGGGLEDAISCQY . 
GIATKDRPTGLGT PEVLLGALPGAGGTQ 
RLPKMWVPAALDMRLTGRSIRAHRALK 
M*LVD 


2932 


16833 


A 


2951 


3 


150 


LFSCSPTFSSDPLTTPLIiILTT*LLPLT 
IMASQRHLSSEP LSRKKKKKK 


2933 


16834 


A 


2952 


283 


344 


EKKKKKKKKKKKKKK^^ 
KKKKKKKKKKKKKKEK 


2934 


16835 


A 


2953 


3 


378 


D AWADAWNQTPLFN* SVLITAVLLLLSL 
RVLAAG I T I LLTDRNLNTTLFD PAGGGH 
PILYQHLF*CFGHPEVYILILPGLGIIY 
RIVTDYYGTKEPFGYIGMV*AMRSIGFL 
RFIVRAHHIVTVG 


2935 


16836 




2954 


3 


387 


ILYQHLY* FFGHLEGY ILILPGFGI ISH 
MVTYYSGKEBP FG YIGMG* AMI S IGFLG 
FIGRAHHIFTVGIDVDTRAYFTYATI II 
AI PTGVKVFS * LATLHGSNMK* SAAVL* 
ALGF I FLFTVCGLTGI 


2936 

• 


16837 

■ 


A 


2955 

• 


2 


432 


PRVRPRVRKHIAYPFLVLSL*GIIITSS~ 
ICLRQTDLKSL LAYS S I SHIALWTAI L 
IQTP * S FTGAVIL I IAHGLTS S LL FCLA 
NSNYERTHSRI I ILSQGLQTLLPLKKKK 
KXKKRKKKKKKKGGGIiLKESLGGPNLTS 
EGK 


2937 


16838 


A 


2956 


12 


391 


SYFISSSKFHLSPPWLSSPDEATSKKKK 


KKKKKKKKKKKKJUUGiCiGCK^ 
KGGGFK* KRI GGS FKKGGGE KKI FFFKG 
GKKKKTGGFFEKKFFLGGEKY*DNIiPKK 
NKTVGEKKNFLGGG 


2938 


16839 


A 


2957 


1 


352 


PTRPYFPVDAGEAQHHPRTCRRPLRALW 
SSHHERWKVTLCTHCSLGVFFLYCCTYY 
IFVLFIP*SSCGLTLIFITCIILFGSIS 
FFLFFTIVFSI I IVTTFKFRIiIiYSIIFla 
SYLLC 


4% A pfe A 

2939 


16840 


A 


2958 


1 


243 


V¥T ntlT VTT1I/OTBTT VTtlOTTnilTfTirKTTI'O 

nlprlnpkeieilnrsiicnkikaiiks 
LIjSKKSSGHNDVTAEFY*TFKEELITIL 
LKFYLTPKKKKKKKKKKKKKKGGPF 


2940 


16841 


A 


2959 


.373 


3 


FSSLKKRVTPPPPPKTGFSLEGLHLLKK 

KKKPPPPPPI*HPPPKIIiQPPPPPFFFF 
FFFPFFFFFFFFFFFFFFFFFFFFTDMR 
GFIVRKFRTRG 


2941 


16842 


A 


2960 


2 


136 


PRVRSTLPISYKWNNRALMKAHIiIMK* F 
TEY P KTTDEL YLHDNT 


2942 




A 


2962 


1 

k 


358 


HTPPVRKGL* YGI ILFITSKVFFVPRFI 
*AVYHSSLCPTLQLGGLWPATGMTWLNP 
L*VPLCTTFVLLA* *VSIT*AHHNLIDL 
NRCHVI YALV I T I LIiRLY FTLL PAS EYF 
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peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=Aspartic Acid, 
F^OIutamic Acid. F=PhpnvlflIflnlnf> 

G=GWcine. H=Histidine 1— Lsoleudne. 
K=Lysine, L=Leucine, IVNMetbionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T»Threonine, V»Vaiine, 
\V~Trvntonhan Y^Tvrrisine 

X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














EAPLTIS 


2943 

» 

* 


1 CO A A 

16844 


A 




Z 


J 1 J 


P AVTJ T VP PPP * PT»Trf?ATjR ATiTiMTVOTiAM 

*VHWDCITLLILGLLTNTLAIYQ*WRDV 
S RES TYQGHHTP PVQKGLRYG 1 1 LF I TS 
EDVFFAGIV*AFYHYSLAPTPQLGGHWP 
PTG I T PLNPLEV 


2944 


16845 


A 


2964 


A fit 

403 

• 


Ol 


Lit: c tri-tajSCiC ±jtrf\3Coi\C rorf or AJTr r 

SPKALIFWGNFSPFFPPPKIRFLPKIPP . 

WVFFSPPFWEKLFSSPPPLNFGPPRVLF 

KGPPLFFFFFFFFFFFFFFFFRFEGBCW 


2945 


16846 


A 

* 


2965 


2 


360 


L I PNLAM VTR* GNHPQRLNAGTYLLFDT 
LDGSLAI^IGLTYTRNTLGSLNVILLTI 

YGLHL*LPKAHREAP1aGSIVLAAVLSK 
LGGSGKN 


2946 


16847 


A 


2966 


321 


2 


STGMHFPHINMAINPPPPPRPPPPIiFPP 
NPKTKPNPTQ * KGGFPPLGPF * KKLS PF 

FFCVCFSFQRVHNIYKNTHQQQ | 


2947 


16848 


A 


2967 


200 


1 


RRTYTSHLLACLRQG^iAFSPRLECX3GKI 
RAHCSLQLYGSSDPPT*APQTAGTKQHN 

UK XAUL^NAUn 


2948 


16849 


A 


2968 . 


2 


354 


LLTASSSEIAPLQSSLGDRARLCLIK*K 
EGVLNS I *SGNQGKS YANVYRLLYLDP I 
PKIYAEAYTP*NVNSTNLETKSPKTIQK 
FPEDREFKNDFFEKTKNGDRETGAFHPF 


2949 


16850 


A 


2969 


99 


400 


ALGIHFIFWSGL1X3IGLANSSLHIVLH 
DTYYVVAHFHYVLSLGAAFAI IGGFIH* ! 
IPLYSCYTTJX3TYAKIHFTIIFIGVNLT 
FFPQHFLGLSGMPRRY 


2950 


16851 


A 


2970 


2 


inn 

377 


FI VKI PLYGLHL* LPKAHVEAP IAGS IV 
LAAVLLKLGGYGI IRLTLILNPLTKHIA 

VDTn\VT*5ST.*f3T T t TQ^TfT.Tf ICSnCKTrtf KTT 
X rf iiVJJOu Ul XX A Al-XJlvI\JVEVI\J\J\-I\J\. 

KKKKKKKKRGGPF j 


2951 

• 


16852 


A 


2971 


1 


407 


GTRS YTHIi YR VFLEL I KMS VYDIiNHTVI 
MVT SGHVRLAFYG I VHLTL I LNLLTDH I 
LYPFLVLSLSGVI IIS S IFLRRTDLESL 
T AY^S T QWTAT.VVTAXI1TOTP* S FTGAD 

ILI I VHGLTSSLLGCLANSNYER 






A 


zy fZ 


1U 




QRS VA t YFKGMASA* * RMFS S KXKKKKFC 
KKKKKKKKKKKKKKTAITKKTTTAKWKN 
♦RTEKIRHrRLPLISSRDAKCVDFLYT 


2953 

* 


16854 


A 


2973 

• 


12 


400 


LiNCRTP S LYLiQRAGELLS VENPH I WCQK 

SITCFLAISISGLFAFFLLNSFNYHFII 
VP*NFSLSIYLCIiHSIiFYVGFFFSFDMM 
IFSILT»TLFFCLT»FHL 


2954 


16855 


A 


2974 


125 


3 


RPRRPAAQVGVQ * KNLSSLQPVPPGFKL 
FFPLSLLSSWAS 


2955 


16856 


A 


2975 


2 


332 


RECTNYPAQPPPPPLEAEEGFQPLPPTD 
TG*T^CLPPLGGGQHRAl^IiGATWHLEN 
ACALVIiADLGTGPKPSGPFSGQACGASG 
RLPLLSGP I LPPLGQGHTLS AWGHHA 


2956 


16857 


A 


2976 


7 


329 


SRDILVMTELX5EFF*GHV*P*DRSRRAD 
TPRLGGHWPPTG I TPLNPL * VPLLNTS V 
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peptide 
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Amino acid sequence (A»Atanine 
OCysteine, D»Aspartic Acid, 
£/=uiutamtc Acta* r— Phenylalanine* 
G=Glycine, H=Histidine, I=Isoleucine, 
jv— l^ysine, Lr- leucine, ivi— jytetnionine. 
N=Asparagine, P=Proline, 
Q=*GIutamine, R=Arginine, S^Scrine, 
T=Threonine, V=Valine, 
Vv— iryptopnan, x— tyrosine, 
X=Un known, *=Stop codoo, /^possible 
nucleotide aeieuon, v— possioie 
nucleotide insertion 










• 




. LLASGGS IT*THHRLIEYIRHQI I HALL 


2957 


16858 


A 


2977 


2 


339 


AHHI FTARIDVDTRAYFTSATI I IAI PT 
GDKAFS * LATLHGSNMK * SAAVL * ALGF 
IFLFTVRGLTGIVIADSSLDIVLHDTYY 
VGAHPHYGLS IGAVFAI IGGFIH*CPLF 


2958 


16859 


A 


2978 


394 

» ■ 


0 


QGCSEL* SCHCTPAWVTE * FPVSKKKKQ 
RK 


2959 


16860 


A 


2979 


149 


383 


PLCFPSQHWGKSSNCLSSLKYFPSSGLP 
LRC KAELDDVKQ KAD KELD WAHACNP S 
TLVGQGGRITLRSGV * DQPGQH 


2960 

• 


16861 


A 


2980 


2 


378 


ARVS IGFLGLI VGAHDMFTGRMDLYTRT 
YCTFAT I L I AI HTGVP VF I * LATLHGCN 
MK* SAAVL* ALRFIFISTVCGLTGIVLP 
N*SLDIVLHYMYYAVAHLHYVLS IGAVF 
AILGRFIHRLPVF 


2961 


16862 


A 


2982 


2 


409 


PAVAEAYLKPVVDDSKGS^GkPDtbci 
IRECCQRNFGWNRTRTDESLPPVLKQLD 
AQQTQLRIDSFFTLAQQBKDUAKRIKSQ 
RLNRAVTCMLRIEIEAASSBIEAVSVD* 
QKELEL I DKAVACHLE YMYETDP 


2962 


16863 

■ 


A 


2983 

• 


3 

■ 


453 


HASAHASAHASGQRKGAAPAEKKCGAEA 
QHEGLELRVENLQAVQTDFSSDPLQKW 
CFNHDNTLLATGGTDGYVRVWKVPNLEK 
VLESKAHDGEIEDLVLVPEGML* IVCPD 
HYS P VG INDFMWLL * LNWGNRL YFPH I S 
VYLMFHFRTF 


2963 


16864 


A 


2984 


3 


417 


LILPGFGI ISHI VTYYSGKKEGFGYICM 
V*AMISIGFLGLIVRAHHIFTVGIDVYS 
RAYFTS ATI I IAI PTGVKVFS * LSTLHG 
ttNMK*SAAVLi*AIjKr It JUb I VSGliTv? ± V 
LANSSLDIALHDTYYV*THFHYVLSI 


2964 


16865 


A 


2985 


273 


408 


MKIFIF*VSPPLFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFLAVFTFI 


2965 


16866 


A 


2986 


176 


335 


TFQPSELWRTALLPRLECSGLI IAHCSL 
ELLGSSSPLASAS * IAGTTGSILRY 


2966 


16867 


A 


2987 


65 


398 


XKEFKIGRKAAEKMTRNINNAFGPGTAN 
EC! vQWWFKICFCKGNKSIjEDBEC 
EDDNDQWRAI I EADPLTTTREVAE ELSV 
NHSMAVQHL K * VG KVKKLNKWVPHELS 


2967 


16868 


A 


2988 


2 


158 


PGWSLTPDLR*STCLSLPKCWDYRHKRP 
SPTSRQTFDHAPNKNSSHSPPIYM 


2968 


16869 


A 


2989 


435 


23 


GWPPNPQKSFYFPQRLKIWGGGGRKRP 
PPKKKGFSQKTPEGF* KPPPKRRKKIFH 
DPGKKGPPKGI FKRGPPLFFFFYFFFFF 
FFFFFFFI ITIECl^SKFPnRHHNKKFP 
PNPPS PRFWPQHLNTSLPNPKNKEP 


2969 


16870 


A 


2990 


410 


3 


GGRGH F FSGAFF I KFPWNKKG I SQPLCF 

PVKKTGAPRGKNGISPFFCFPPLGSFNR 
SGFFWALLGSPPLFFFFFF*DRVSLHHP 
GWNAVAQS* LTKTSTS WVQVILI* 


2970 


16871 


A 


2991 


371 


132 


KQSLP FKVKGFFFPPEVENAI YPDSHTA 
FYS W * KRS T FS KKKKKKKKKERKRKKEE 
RKEGRKEGRKERKKEGKKBIENA 


2971 


16872 


A 


2992 


336 


1 


CPRWSRTAGLK* S TCFGLPKCWDNRHES 
PCPASSVLND*QLSTVQRAT*WQTKSLV 
LKAPPARGALPVSRAVSPPPSSHHFAYF 
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correspond 
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nentide 
sequence 


Predict- 
ed end 
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location 

ding to 
last amino 
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nentide 
sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 

TT^flliitnm if Arid R=Ph»nvlalanin» 

1j VJIU lil Ullk. AUUf A * II Kll VIM Sail III 69 

G=G Ivcine. H==Histidinp I=T«:nlpiirinp 
K=Lysine, L=Leucine, M=Metbionine f 
N^AsDftrflpine. P—Proline. 
Q=G lutamine, R=Arginine, S=Serine, 
T^Threonine, V^Valine, 
W=Trvntonhfln Y=Tvrosine_ . 

X=Un known, *=Stop codon,/=possible 
nucleotide deletion, t^oossible 
nucleotide insertion 














LHMI *QVSCLPIiR* SLALVPQAGVQ*RD 


2972 


16873 


A 


2993 


317 


440 


HLGRLRQEHHLSPG* LRKFFFFFGRDGV 
LMLS MLVFNS WTQVMLLPQP PQVLGLR 




1 £.Q*7A 

I0o74 


A 






Jol 


L CP ALV I LLTAKFTNFR YRVKNVCT FHV 
ESN 


2974 


16875 


A 


2995 


1 


165 


GFHC I S RDGLDLLTS * YARLGLSNCWD Y 
RHEPPSPAPLFISYSICLFLSKLIQFL 


2975 

• 


« r am/ 

16876 


A 


2996 


405 


2 


KKIGr P I rSKMr cLi h KJb'PlkAbsj Kr G 
DPKWS PHPRP I FFFLLKRGVL YVWREGF 
KFHPP* FPPPGPPKRWE*RVKPPHPPPF 

PTS ASQGAE IAGVSHRAQPHAS 


2976 


16877 


A 


2997 


2 


415 


CLFTGGGLTGIGSAGSSLDIVXiHDTYYV 
VAHFH I JJLiS iUAbr AJ. XGCjIj iii* If PJur o 
GYALDQTYARIHFT IIFIGVNLTFLPQH 
FLGLSGMPRRCSD* PDA YTT * S MLS SVG 
SVITLTAVI VLI IMIREAFDSKRTVL 


2977 


16878 


A 


2998 


349 


370 


HHFY* S ITArRPFQnMEIjKNrFFrFLF r 
F FFFFF F FVFLF I FLFLFYFLFL YE I YL 
I IiFFI»* HHFYSAV „ 


2978 


16879 


A 


2999 


36 


139 


LCHCTPAWETFS KE KKKKE KRKKKKKKK 
KKKKKKKPGGGS * KTALSHDCATVLQPG 


2979 

> 


16880 


A 


3000 


104 


373 


S ADREXXS KTDNLLGH * TNVNKCKVPR V 
I QS VFS SHSGLKLE INNRKMKE KS LNT * 
KINNISYWVQCSLYNLKSNSYTKSSDNT 


2980 


16881 


A 


3001 


1 


396 


LDCS KISS YLQKSS SHVLFFS FS FFFFF 
GGGGGFAIiENPYPPAGLGPQKKKTLSPP 
WLVGGPPFPEKAPP*GGHFLGKKPQKNP 
GGQNPKRPSGEGNPFFQPPGGEKKQIGP 

PlorvivilVirr rai? PlCc.r'fc'oP 


2981 


16882 


A 


3002 


412 


104 


FFFFFLPVRQTFYPQPFSVFFPLFPFKF 
FFFP *AFNFFWGFFP I FSPPKFGFFSKI 
SRLVFFSLPFWEKFFFFLPPFFFAPLRF 
FFKGPPXFFFFFFFFFXXY 


2982 

* * 


16883 


A 


3003 


400 


46 


iifirrrr JiUjrPPPArT^rrPPr PJj^Wrr 
FPPGP FFFLGGFP P FFPPP K* VFFPKI P 
PGFFFPPPF*KKFFFSPPPFFLPPPGFF 

PWPP P P 1? PPP1?P PI? P T? KT? 17 T7 PP PPT .fW 

VQVEAY 


2V83 


looS4 


X 

A 


J 004 


411 




YQDT.PPT.PPQQPPQVPPWPPTTQTiPFT.TP 

VYKGFFFFPFFPFELFGPPGLSLGSKPP 
PVFLGGDPAFFSIPHPRVGSLPPPPNWP 
FIGPSFW* ALFPAPPLFFFFFLSS FFFF 


2984 


16885 


A 


3005 


3 


200 


D AWG * L FS TNH KD I GTL YLLFGA* AGVL 
GTALSLLI RAELGQPGNLLGNDHIYNVI 
VTAHAFVII 


2985 


16886 


A 


3006 


259 


146 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFGLLNDIUj 


2986 


16887 


A 


3007 


118 

* 


414 


QNQPPQNKATHTVK I EKKEKPETKTVAK 
EHNKAKTAEKSEE*TKKEVKGGKQEKVN 
HTAAKVKEVQKTPSKPKEKEDNKKAAVS 
KHEQKDQYAFLRYMI 


2987 


16888 


A 


3008 


427 


111 


FFFFSPVGNFSPPQQFPFFFPPFPPKIF 
FFPPPL*FFWGGFPPFPPPPKKVFFPKS 
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SEQID 
NO: of 

n ukicvuuc ( 

sequence 

*i 


SEQID 
NO: of 

sequence 


M. 
eth 

uu 

• 


SEQID 
NO: In 

USSN 

09/515,1 
26 

■ 


Predicted 
beginning 

mi rlpfitirlp 

location 

rnrrp<znnnH 

ing to first 
amino acid 
residue of 

• npntiftp 

|FCp UU C 

sequence 


Predict- 
ed end 

otide 

Inrntinn 

correspon 
ding to 
last amino 

nrirl 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

f&=dufomic Acid F=Phpnvlnlflninf* 

G=Glycine, H=Histidine, I=Iso!eucine, 
K=I .vsine. Leucine M=1VlethinninA. 

N=A$ para gin e, P=ProIine, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 

X=Un known, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PPGFFFPPP*GKKFIFPPPP*FCPPPGF 


2988 


16889 


A . 


3009 


3 


302 


SLASCLS YLVCVI FLGQPKPT I * LQNST 

PHTnC* WPTERYVICTCTO I P I AUiPKIfE It 
rniu\, v*tr i&xv* v wiyic <inuur xvA&iv 

QPRCPSAGEWINKMLYACTIEYWLAIKR 
YEILIYATV*MYLEKI 


2989 


• 16890 


A 


3010 


1 


409 


RLHDATFPI I EEL I TFHDHALVI TFL I W 
LLVLCALFLTLTTKLTNTNI SHAQE IET 

W+T T T .H & T TT . 2\ T.MVT .D GT .P T T .V T TTTnTTO 
V w 1 XXlnAX XXuVLuTlV l_»ir O-UK XXi I X JLUJ&UIM 

DPSLT I KSIGHQWY * TYE YTDYGGLMFN 
SYILAPLFLEPGDLRLLDVDNRW 


2990 


16891 


A 


3011 


157 


2 


GRVDLKIQKLARCGGACLQSQHTQQNHL 
NPGEKGCSES *LPPCPPD* VTKQ 


2991 

» 


16892 


A 


3012 


2 


423 


ARAARAHIVTYYSGKKEPFGYIGMG*AM 
ISIGFLGFIGRAHHIFT* * IDGHTRAYF 
TSATI 1 I AI PTGVKVFI * liATJjHCJSNMK. 
*SAAVL*ALGFIFLFTECGLTGIVIJVNS 
SLYI VLHDTYYVVAHFHYVLS IGAVFAI 


2992 


16893 

■ 


A 


3013 

• 


2 

- 


140 

• 


ARANI LLLTLTAQELWDPRANNLI *LAY 
TLAF I VKKPLYGLHL* LPKAHVETPMDG 
PILLAAKLLKLGGSGIIRLTLILNPWTO 
HIAYPLLGLSL * G 1 1 ITRS I CLRQTELK 
WLIAYS*ISHIALVVTDILIQTP*SKHF 
TTNSHCPRTMGPQSQQLNMTSLHTSFYS 


2993 


16894 

• 


A 


3014 


2 


420 


PVLAAGI S ILLTDRDLLTTLFDPGGGGD 
PILYQHLF* FFGHPDDYILILPGFGI I S 
HXVTD YYGRKE PSGYIGMV* AMTPVGFL . 
GFIE*AHHIFTVGinAHTRAYLTSVSIL 


2994 


16895 


A 


3015 


6 


292 


AHHIFTV*IDVYTRAYFTSATIVIAIPT 
GVKVFS * LATLHGSNMK*SAAVL* ALGF 
I*LFTVGGLNGIV* SY*LLDIELHDTY* 
FVMGCPRKVYF 


2995 


16896 


A 


3016 


3 


422 


TPI I IGGFGN* LFPLI ISAPDMEFPRIS 

Si X S3 Xl " XlXi f zr a X XiXiXlu/IOaX V luw X w X U 

TVYPPLAGNYSHPGASVDLTIFSLHLTG 
VSSIIiGAINFITPI INIRPPAITQYQTP 
LCA* CDLMTAVLLLLSIiPDIiAAGITILL 


2996 


16897 




3017 


21 


a r\f\ 


TD Tf DrtUUT .TOIT? C T .HT .af2f2P Q T T /ZTX. T KTV T 
1 Jrr IruKAliXrir oljxiXiHuvs^o xiiwixiN z x 

rTLINIRPPAITQYQTPLFV*SVLITAV 

LHLLSLPGLTAGVTIIXADQNLNTTFFD 

GFGIISHIVTYYCGEKEPFGYIGMC*AM 
ISIGFLGIIV*AHHIF 


2997 

■ 


16898 


A 


3018 


98 


402 


LRSQHS KS FQ I SGKPSQEEWPQ I S PDS T 
DYI INT*HFNAQMLKNI Y*EQPHGLHDF 

KKASSSSSKKDPPGGGPT 


2998 


16899 


A 


3019 


399 


229 


PPPPGGGGPQGPPPPRGGFLPKSPGGVF 
YPPPRGGKIFSPPPGFFGPPRGFF*GAP 
P 


2999 


16900 


A 


3020 


2 


401 

4 


S DA Vti * ALGF I YL VLEGGLTG I VIADS F 
LDLELHDTYYVGAHFHYVLS IGAVFAI I 
GGFVD^FPIJSGYTLDQTYAHNHFTIIL 
IGANVTFLPQHFLGLSGMPRRYSDYPDA 
YTT*NILSSVGSFITLAAALL 


3000 


16901 


A 


3021 


413 


67 


PPPPGKI FFKKTPKKKIFPPPQF* I FFP 
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SEQID 
NO: of 
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sequence 

- 


SEQID 
NO: of 
pcpuue 
sequence 

. 


M 
eth 

00 


SEQID 
NO: in 

09/5154 
26 


Predicted 
beginning 

n ti plootifla 
uuumuuc 

location 
correspond 
ing to first 

ominA AciH 
ailllllv nviU 

residue of 

peptide 

sequence 


Predict- 
ed end 

iiuuc* 

otide 

location 

correspon 

f\ Snn 4n 
Ulllg W 

last amino 
acid 

residue of 

r\o nil ii a 

pepxiue 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

XT- — ntn m IP Arlfl fr=Phont/lalaninA 
Ef—VflU Lalll It. ntlU) A 1 Ucll yJalaUIDC) 

G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine> L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 

Q — 1 1 1 in m i n A RaAroinino C— QArino 

oiuiaiiiuiC) *\ /*rgin ijjCf o "Ocrine, 
T^Threonine, V^Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *~Stop cod on, /=possible 

n iiftlAAii/la flalA^i/in \**nAnBtKlA 

nuucouae aeienony v— possiDie 
nucleotide insertion 


• 




• 




• 




PLPPKKFFFSPNP*FFLGGFSPFFPPQK 
KIFFPKIPPNFFFSPPLKKKIFFFPPP* 
IFPPPRFFLKPPPPFFFFFFFFFFFFFF 

PPP 


3001 


16902 


A 


3022 


2 . 


332 


LTLS I LS I IAGS * GGLNQTQLRKILAYS 
SITHIG* IIAVLPYNPNITILNLTIYII 
LTTTAFLLLNLNS STTTLLLSRT*NKLT 


3002 

■ 


16903 


A. 


3023 


1 


412 


RGPPFFFFFCVFFFFFFFFFFFFNKGCR 
HSKRFFFFFKXKKKSSRPTRDRV*FYPK 
LyWKZ> Pup Lib r faFUvxrcvjlrr FIjony vvjr i>N 

EVLWF KKNLEL FRGS PARKKKKKKKHL 

E . 


3003 


16904 


A 


3024 


279 


1 


LGRNTELWKSGKGMDILKTNCGKLANEP 
rKQ ritvJjQjIviOiSAFKAVjoVir P 5 KAPPA* 
TPGPSSAGSWP* P PGTGRAPRG PAPS AP 
GARSPGRPG ! 


3004 


16905 


A 


3025 


151 


401 


KKPLGGPNLTGEGKKKFFSLKGGKKKPP 
GKFLKKTFFLGGEKMGKTPPKKLKP *GK 


3005 


16906 


A 


3026 


416 


140 


YLSPLKKFFTPPPIj^MFLPPNPLKNIFF 
PPQLKIFWGGWAQNSPPPKKGFFSKNPK 
RVFXPPPIRKRYNFPPHGKILAPPKNL* 
SAPPPIFF 


3006 


« ^ *\ f\ mm 

16907 


A 


A r\mm.mm 

3027 


f% mm 

379 


31 


PPPRRAG VF FFKNPKPKP PP PREGGRF * 
PFPPLKF* FFPKPQNFLGGGGAIPTPPP 
KRGLGQIPTERFNLSPPTQKRINFPPPG 
KGGPP P PLLKPP P PP PFFFFFFFFFFSL 
NSFI 


3007 


16908 


A 


3028 


28 

* 


420 


MQQTTMAHI FI»CNKFANCAHVPRT* SET 
KPMSTPLQFD* TYKGFJKSCKYAEHERTW 

NQFFKQTTHL * NHTRDNQ I C FS KIGLEY 
YYRITTRQHLLKLRTVCIL 


3008 


16909 


A 


3029 


1 


401 


IjGNNGEAVSEKRKEKSQKBKSHNVWGF 

PGWGEFPGPAPRGGGNGRQKQPGRGNFG 
V*REKGGSMGGPRGA*NPGPKNPHPWPP 
QGPEITGGTTRPHPSGIFKKT 


3009 


16910 


A 


3030 


285 


397 


MF I KGDGLNKLRPGAAA YACWPS TLGG * 


3010 


16911 


A 


3031 


265 


2 


KKESSCIKATNSNSLFFLF*SVFFFFVF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 

PP PPT? PPPP* UPTT C 11T ,T .UVTDT /J ITT URD 

PRV 


3011 


16912 


A 


3032 


410 


49 


GFS P PP P * KFF FS P KPLNFWGGGGPNFP 
PPKKRFFPKNPPGVFYSPPKKKKKFFSP 

DD*NT.21DDKTPT.'R'<2DDDPT?PPE , PPPPPF 

FFLSNVSNGLTNMYI LPCKDPSCPTTFP 
ILGSLISL 


3012 


16913 


A 


3033 


177 


2 


VTPPFFFFFETRSHSVARTGV* *SDRCS 
LQPHPPSIK*SSHFSIiPSSWDYRSMPPH 
AS 


3013 


16914 


A 


3034 


75 


395 


MS YKHXXKKKKKKKKKKGGAP * KKPWGG 
PKLTRTX3KKKFFSLKGAKKKPTWKEWKK 
TLI LGGRKMGTTP PKKLKPLRKKKI FKG 
♦RGKKHPKSLPVEKFASRGRIKK 


3014 


16915 


A 


3035 


400 


83 


KKEG PARWS PVT P PLLEGP VGRSPQTR 
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NO: of 
nucleotide 
sequence 
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NO: of 
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sequence 


M, 
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od 
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SEQ ID 
NO: in 

» ICO VT 

USSN 

09/515,1 

26 

• 
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Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide - 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
dingto 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
OCysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalanlne, 
G=Glycine, H=Histldine, I=!sofencine, 
K=Lysine, L=Leucine, M=Methk>nine, 
N-Asparagme, r=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














NF* PTPPTHRKS LPS * KSKIDWRGI PLY 
PPP PRVKPKKS FNPGNrRPC * TKIFPCP 
SFWAPKTHPPFQKKKKKRVYK 


3015 


16916 


A 


3036 

• 


3 


196 


DSMPQT*NKSFAKARKKKKKKKKKKKKK 
KKKKKKKRGGPPLKKTLGGPKLPGQKKK 
MFFFFRGG 


3016 


• 16917 


A 


3037 


1 


411 


FCYDVCVESGCADYS I VI IMKKKKEK* K 
KKKKKKGGGPLKKNPGGAQNYPGVEKKI 
FSLKGGLKKTPRGNFEKKPYFGGGKNGA 
PPPOKNKPLGEKKKFKRKKGGKKLQFPW 
GKKISLPGFYLKKIYPPGRGFFNFS 


3017 


16918 


A 


3038 


3 


429 


NFFFKKPRGGNFFPPPKKGFFSPPSPLK 
FFFFPPPFFFFGGGGPHFPPPPKRFFFQ 
KPPRGFFFPPL*EKNFFFPPPVFFAPPP 
VFFLTPPPP 


3018 


16919 


A 


3040 


1 


111 


IGLSGMPRRFSDYPDAYAT*NILSSVGS 
FMYLQQDNK 


3019 

• 

* 


16920 

■ 

■ 


A . 


3041 


76 


967 


QLLKGGVSGVCPLLMFRCVRSFFLLVGS 
WS S LAS GVKPQTFAVS VTVLKAARLEL F 
I PPRGL WS LASGVKLQTFAVSVTAHK5 
S VDPKNSGAQLAS PSGSRTRAAGGAACQ 
SRCRVPALLSPWWDGTGRRGAGGGARR 
GGSGRTGAHGVGGRLRHGGLHVPS PAPW 
KGS * GLARNRAQRR WAGTAGGP S TP SAA 
AGPGAKS LTALCE QGWPAAPSAGPTKP T 
PTRNS S WPASVARS PGSRSCLSLHTSLQ 
AEGVGSSLGQPSKGLPQCSGGAEGLLKC 
RQSGS PGRGGTES ERGL+GLPQCSGGAE 
GLLKCRQSGS PGRGGTESERGL 


3020 . 


16921 


A 


3042 


39 


141 


LSIRGLNI I IKRQRL* DWIKQQDSTLCC 
P*EIH 


3021 


16922 


A 


3043 


2 


405 


LFSTNHIYIGTLYLLFSTRAGVLGTALS 
LLI PAELGQPGNLLGNDHI YNAI VTAHA 
FVI IFFI ERPII IGGYGN*LGPLI IOAP 
DMAVPRINNITF* LLPPSLTiLLLASAIB 
EAGAGTG* TVYPPLAGNYSHPG 


3022 


16923 


A 


3044 


3 


134 


HLNPGGRGCSEPRLHHCTPSWATE*DSV 
S KKKKS RKGWTGLF I 


3023 


16924 

• 


A 

■ 


3045 

■ 


44 

■ 


187 

• 


DPRVRQ YQTPLFV* SGL I FAGLLLLSLP 
TLGAG I T I LLTDRNLHTAVFDPDGGGD P 
ILYQHLF* FFGHPEVYILILPGFGI ICH 
IVTYYSGKKEPFGYIGMV*AMISIGFLG 
FIAGAHHIFTGGIDVDTRAYFTSATIII 
AIPTGVKVFS*LRLIRPNLC3lLTSPrSP 
NPRGRHHYTTNRPQ PPHRRV 


3024 


16925 


A 


3046 


434 


40 


. GAPPP P PGRFFFFLNPREDTFPP PPQKG 
GFSPP PPPKFFFSPPALFFFGGGWPNSP 
PPQKNFFF* KPPEFFFFPPFLKKKI FFP 
PPLFFPP PQI FF*TPP PLFFFFFFFFFF 
FFFFFFFRGCKINFIVRGF 


3025 


16926 


A 


3047 


1 


399 


LFTGGGLTGrVLTNSSLDIVLHDTY*W 
PHFHYGLS IGAGFAI IGGSIH* FPLFSG 
YTLDQTYAKIHFTI I FIGVKLTFFPQHF 
LGLSGMPRRYSDYPDAYTT*NILS SVGS 
LISLTAAILIIFMS *EAFASK 


3026 


16927 


A 


3048 


116 


367 


GASMILSSMIFLECIVGGFPVFSVYLFK 
LQILRQSSTMCFVLFCFFEIRSCSVTQA 
GV* RRGHGSL* PQPPGLSHPSSRDHGHV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

- 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucioe, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3027 


16928 


A 


3049 


6 


345 


SQLLRRLRKENCIiNIjGGGGYSHQR* hqc 
PLAWATE * DYLKKKKKNFSLFLKTGGFY 
PFLKKAPLGTTLS PKNNLPLS PLYKKRT 
GPWNWERKNFGKKKGGVIGLQGGKTIPK 


3028 

- _ . 


16929 


X 


3050 


261 


2 


NKKSP PVNLWWKMGFFPKFAKRWLS WK 
GGGGPPFFSWRRS FTFVAQAGMQWRNLS 
SLQPRLPDLR+SACLGPPDC*DYRREPQ 
YP 


3029 

* 


16930 


A. 


3051 


160 


2 


ICVDEQAGVQWRYLGSLQAPPPGLATLS 
CLSLMSSWECRQPPPLG* FFVCPR 


3030 


16931 


X' 


3052 


89 


3 


PLTSGLLLLT * QKLAP I S I I YQ I S PSLN 


3031 

■ 


16932 

• 


A 


3053 


344 

■ 


2 

• 


HVEAP I AGS I VLAAVLLKLGG YGI I RLT 
LIIjNPLTKHIAYPFLVLSL*GI I ITSS I 
CLRQTDLKSLIAYSSISHIALWTAILI 
QTP*SFTGAVILIIAHGLTSSLLFCLAN 
SN 


3032 


16933 


A 


3054 


266 


2 


FQPPISAYTKISPSLNVSLLLTLSILSI 
I AGS * GGLNQT QLRKI LAYS S I THVG * I 
I AVLP YNPNI TILNLT I Y 1 1 LTTTAFLL 
LNLN 


3033 


16934 


A 


30J55 


3 


32 


KYNS L I MP TMIAT I TliNLYFYLS PLL Y 
**SSCPP 


3034 


16935 


A 


3056 


3 


33 


KNNS L 1 1 PTI IAT I TLLNLYF YLS PLL Y 
**SSSPPS 


3035 


16936 


A 


3057 


57 


445 


ANVWAPHGPAKLTNKDNYHI WKS KRLKI 
ANMTIKKLNEVIGLTLPDFKTYVELVQ* 
RC2NAID*RKHKQPVKC3SPBA*PHSYSQL 
I FHWGAKANHGRKDSLFYK* CW *NWTIS 
S QKLNLHTDLTNFTKIN 


3036 


16937 


A 


3058 


311 

i 


1 


R VGLLLKLNKIS WPP PP FYGPS * EKBPL 
CFSQIGLFLTRTMVLNNLHSPPVKTRPY 
NKIAPFRE LFF FFKDRVS I CLPG WSAW 
*SQLTATSTSQASSDPGRV 


3037 


16938 


A 


3059 


138 


411 


WERPWKAQEAVFW I * VSAFWAPP PLMEK 
Q I P PDLEQHYRNVPGVNRNQPFVS FFLR 
WSLTVAQAGVWWRDLGSL * PLLPGFKRF 
LCLSLLS 


3038 


16939 


A 


3060 


1 


189 


FCRVGQAGLKLLTS SDPPAS AS * SAE I T 
GVSQRAWSKITILKSSSFSYPPNSCKMC 
FWLICLW 


3039 


16940 


A 


3061 


3 


406 


DAWADAWVLILPRFGRTSHIVTYYSGKK 
EPFAYIGMV*AMISMGFLGFIV*AHHIF 
TVG I D VNTRAYFTS AT I I IAI PTGVKGF 
S * LATLHGSNMK* SAAVL *ALRF I FLLK 
KKKKRGAVLKVPWGGPSLPGOG 


3040 

• 


16941 


A 


3062 

• 


451 


82 


P PPTNYFSP PPAFLPGGGGPPRPPPPKK 
WSPPTPPPWIMPPPKKKKKFFPPPRGW 
GPPPKIF*KPPPFFFF*KKNPPFSPPGE 
NRGVPFSTKPPPPWGKKNFAAPGAPPPP 
PFFFPFPRGGG 


3041 


16942 


A 


3063 


430 


1 


FFPPKQLIFWGGGGPKSPPPKKKFFPKK | 
PPGVFFSPPKKKKKFFFPPPLNIiAPPKI 
FFKSPPPLSFLFFPFFFFFFFFFFFFFF 
FFFFPLS VQTLLKRTRAPPQPPLLD*EK 
APAPRVP * TGBGMPAVNVAFAPPPFYKE 
RPS 


3042 


16943 


A 


3064 


1 


409 


PTRPRESTYQGHHTPPVQKGLRYGIILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
iv— tsiuiafnic aciu, £=jrnenyiaianine, 
o^vMycine, n— xiisiiaine, i— »isoieuciDe, 
K=Lysine, L=Leucine, M=>Methionine, 
n^Asparagine, r^irroiine, 
Q=GIutamine, R=Arginlne, S=Serine t 
T=*Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V^possibfe 
nucleotide insertion 














HWPPTGITPLNPLEVPLLTT 


3043 


16944 


A 


3065 


1 


137 


HTFNFSIYQKATVIKTVWYWYNNRHTVE 

PTIP TMI5VT VP *T T DOT O 


3044 

■ 


16945 


A 


3066 


4 

■ 


426 


KLEN * KMVLKE I KEDLNKQTD I LPS * LQ 
RLITVRMSIIiPICLIYKPSAIPIQIPA*P 
h+ IKI I IKCMRKGK*TRIABTIFFFFFL 
SQSFILSPKLDHRGGITANCrrPPWAIKG 
KLLLKKKKKPPKTKPEKFFIQKATGAEG 
uVn 


3045 


16946 

■ 


A 


3067 


411 


187 


RNL PNV P PR P TH FVLLVKTG VS QVGQGG 
GKLLASKNPPSPAPPKSWDYRGEPPRPA 
PRKFFF*LNKFKIYGGPKN 


3046 


16947 


A 

• 


3068 


193 


471 


QCTCI KVHSGQKTGSTPLVI GBLQI KIT 
LGCY YTPTLMAQI KKTDRTKCW*GYGAI 
GMLILCWRECKIVQSL*KRVWQFII*L» 
IYLAIKLNI 


3047 


16948 


A 


3069 


323 


478 


FFXFXFLFFFILXYFYYF* KLFYLYLFX 
YIFKIYFIYNFIYIILFYIIFIIF 


3048 


16949 

■ 


A 


3070 


1 


378 


GTRRFFFHS I TLL I LGLLSNTLTI YQ * W 
RDVTRE STYQGHHTPPCQKGLLNGI ILF 
ITSEVFFFAGFF * AFYHSS IAPTPQLGG 
HWP PTGI TPLNPLE IPLLNTCVLLASGV 
SIT*AHHSLIENNR 


3049 


16950 . 


A 


3071 


201 


2 


TTPIQLFLKHYHT*NFNYNFFFEIGSCS 
IAQAKVQWCDLCLIjQPQTIiGLKHSSHLS 
LQSSWDYRHA 


3050 

- 


16951 


A 


3072 

■ 


261 


1 


EKAMGGGPFKVKKSPGEGPTTKGWPLKG 
PLEGGQRGLTGPFKSNGRLFFFFFGSNE . 
VSLCCPG*SRTPDFK*SACLSLPKCWDY 
RHG 


3051 


16952 


A 


3073 


333 


52 


EI FKKKKKGGGRFFNKRVFPRPRVSNGR 
PRAQFFLBTFFFFPERGFFFFFFFBTES 
1 5 V AuAbV w WW I iMbuUr fKr KiCc oaJjo 
LPSSWNYRCK 


3052 


16953 


A 


3074 


7 


214 


SQLQENRLN PGGGGCGE PRS CHCTP VWA 
TE*DSVSKKKKKKKKRGGWVPP3LGGGP 
KKKNPFFNQEGGL 


3053 


16954 


A 


3075 


180 


419 


QKHIiS I LHYLF KRD VF Fr rl»KGS FWSQ 
VGGQGHNLG * LKAP P PRLTHFS CLTLRE 


3054 


16955 


A . 


3076 


371 


208 


QKLSGHGGSRL * S * P F* EAKAG * QLVSG 
GRGCSQL*SHQCTPAWVTE*RLVCKR 


3055 


16956 


A 


3077 


349 


•> 
3 


PP FQKGQGDAPP YKKVQRGNPPP PLKGR 
PSRGPPKKCKVFKAPVFIiFPRFSPPPFF 
PPPLFFFFFF*DRVLFCCPGWSAVAQLT 
ATS 


3056 


16957 


A 


A AAA 

3078 


124 


m 

1 


I SS ET F AVKS S r W PGVVVHACNr S x DGK 
* GKE I TRSRDRDH 


3057 


16958 


A 


3079 


205 


1 


CLVQN I WACVSHWRYIMSCKGC* RLGIjM 
CVCVCVCVCVCVCVYFCCCCCCCYC*DE 
LSLCHSDCSAVA 


3058 


16959 


A 


3080 


3 

• 


413 


GHHTQPVQKGLLYGI IIjFIT*EGFFFSG 
FF*AFYHSSIAPTPQIjGGHWPPTGITPL 
NPL*VPIiLNTSVLIiASGVSIT*AHHRLI 

* nnrnq 1 1 q all i t 1 lljgl yftllqas e 
yfetpftisdgiygstffvatgfhr 


3059 


16960 


A 


3081 


1 


321 


NSLNPGDGGCS * PRLHYCTPAWGTELDS 
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nucle- 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
It,— Glutamic Acid, Jr— Phenylalanine, 
G=GIycine, KNHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
iv^Asparagine, r^rroline, 
Q=Glutamine, R=Arginine, S^nneT 
"^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \- possible 
nucleotide insertion 














ISKEKKYPYHyYHPSMKMTIYNYRQQOT 
NHIYIMLVEHSQTQENTCFMISGNFFFN 
LP IVLGEGEKNQHS ISFKLFFNF 


3060 


16961 


A 


3082 

• 


117 


484 


VLKYFIDS EVNAVLFSISCS FVTDPVFL 
FFFFGKGVSFCPPAGIKGGGFGFLEPLA 
SGFKRI FFPNPLEKWE*RARPPPRGKFW 
NFFFFFFLKRELFFAPRWEGRGKILVYL 
KGPPWGYPHF 


3061 


16962 


A 


3083 


381 


227 


GI SRDGF YEGjGQAGIjE LLTSS DP PALDS 
*SVGITGVSHRTRPLLLRLNVQF 


3062 


16963 


A 

• 


3084 


2 


391 


SHAYHIG*PSR*PVTGALSDLLMTSGLA 
M* IDFHSITLLILCLLTNTIiSIYQ*WRD 
VTRESTYHGHHTP P VPKGLRYG I ILFIT 
SEVFFFARFF*AFYHSSLAPTPQLRGHW 
PPTC I TPRNTLDVPLLNT 


3063 


16964 


A 


3085 


2 


248 


I MRSGDRDHPG* HGETPSLLKYKRLAGH 
GGMRLWS QLLGKGGTADSHHHVLtilLET 
FYSLRERRHLTSVPTLGMNYWAQDIR 


3064 


16965 


A 


3086 


276 


243 


EKWPD * S RAACP VLCRGNGQYS KGRCLC 
FSGWKGTECDVPTTQCIDPQCGGRGI C I 
MGS CACNSGYKGE S QEEAPRY I PEKE 


3065 


16966 


A 


3087 


3 


130 


GFYHVGQAGLELLTL*SACLSLPKCWDY 
RREPPRPAHTPPHS 


3066 


16967 


A 


3088 


277 


2 

» 


SSSSVFCLLVWTSSSSSSSAARIiPPLTG 
FLPKWAX IEEFTKNNSLI I PTI I ATI TL 
LNLYFYLRLIYSTS ITLLPISNNVKIK* 
QFEHTKP 


3067 1 


16968 


X 

• 


3089 


404 

E 


60 


FSHGKMRFFSPPSPKKIFFSPQSFYFLG 
GGGAKMPPPKKRFFFKKTPRGFYFPP* K 
KKNFFFPPPVKFGPPRGFLKSPPPFFFF 
FFFFFFF FFFFFFS PTFFTVFHLMLKSD 
ND 


3068 


16969 


A 




405 


186 


INKKPEAFTNTVDQMVXjTNSHRTFYPTA 
TACSLSGAHRTFSRMDHV*DHKTSLNKF 
KTEI TLSTLSNHKLEP 


3069 


16970 


A 


3091 


371 


1 


SPPLVQKGLRYGI ILFITSEVFFFAGFF 
*AFYHSSLAPTPQLGGHWPPTGXTPINP 
LEVPLLNTS VliIiASGVSIT * AHHSLIEN 


3070 


16971 


A 


3092 


263 


3 


NKRSPPVNLWWKMGSFFKFAKRVKISWK 
GGGGFFFFSWRRSFTFVAQAGMQWRNLS 
SEiUPRLPDIiR* S ACIX5PPDC * DYRREPQ 
YP 


3071 


16972 


A 


3093 


1 

- 


392 


FFFADFKKMFILINHFKMELTTYFELKR 
NEATASENC*DAVKAVI*3GKFIVIjSTYI 
RKEERPRINNIS FQ I KHWTKKN * T*GKQ 
KICKKKKKKVTGPEIPKFLIVKSGKPPKV 


3072 


16973 


A 


3094 


3 


367 


EM* IEITMRYHYNTARRLKF*KTDNIKC 
* *GHGTSGTLI YHWOE * KMVOPLWKTV* 
QLiL 


3073 


16974 


A 


3095 


3 


257 


HEVSQDGLNIjLTS *SARLGLPKCWDYRR 
EPPCLAWL I LPDDCVI FQKLKLLHHNLL 
NLSCIDVLMGIYSLSNFSQSNFPFFFFF 


3074 


16975 


A 


3096 


189 


414 


KGLVMVTS*CKMFFCLSIFFFFFERGFC 
FFAQAGVQGHNLSSLEPLPPQLKQFFCL 
TLPRS * KYRPAP PCPANFY 


3075 


16976 


A 


3097 


1 


299 


ENYRPIS FMWZ'DAKHiNKHiANQIQQC-S 
KRITHRDQVGFLPGMQGQFYI * KS I KSI 
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SEQID 
NO: of 
peptide 
sequence 
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M. 
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od 
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SEQID 
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09/515,1 
26 
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beginning 

■ k 1 *k 4k *1 4k 

nucleotide 
location 
correspond 
ing to first 
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residue oi 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
Hr-tviutamic Acid, Phenylalanine, 
G=GJycine T H=Histidine, l=IsoleucJne, 
K=Juysine, i*— leucine, M=Metnionine, 
N=Asparagine, P=Proline, 
Q=Giutaraine, R=Arginipe, S=Serine, 
i — J nreonine, v— valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














NAIHHISRLKKjiNnM^ 
S * FMKKNKKLAFQET 


3076 


16977 


A . 


3098 . 

• 


362 


126 


FIHTHTHTHTHTHTHIYI YI YI IASPQW 
NAIRGRMKCINKP * KDMEBLKCI LPSER 
SQSAKATYLLYDSNYITLBKAKL 


3077 


16978 


A 


3099 


3 


423 


RHEHAYHIVKPTP* PLTGALSALLMTSG 
LAM * FHFHS I TLL I LGLLTNTLTIYQ * W 
RDVTRESTYQGRHT P P VQKGLRYGI I LP 
ITS KVFFFAGFF* DFYHSSLAPTPQLRG 
HWAPTG I T PLNP LE VPLLYTCVLLASGV 


3078 


16979 

• 


A 


3100 


392 


83 


LRNTNCGHGAPFKNS PFFPQFLVKNS PA 

k) m »m»i4»k.^ IPff ^ TJ »kv^ k«j W m i|k\fkTB«Wff» km «tkr\M «m_ ftfBNf Ma 

YFQNKEKDVGKP PFFFRAPDGAP FFKKK 
KK+GLALSSRLEYGGMISLELLGSSDTF 
ASAS R VARTTG Q CHHAG PT 


3079 


16980 


A 


3101 


81 


247 


GGWGGPPLKPPLF*KKTGKNFWPPFFKG 
KEKPPPNPRGGGKKGGPKPPPVIFPPF 


3080 


16981 


A 


3102 


3 


388 


HEKKEPF* YIGMG*AMIS IGFLGFIV*A 
HHI FTVG I YVDTRAYFTSAS 1 1 IAI PTG 
GKVFS * LATLHGNNM K* S AAVL * AIX3 F I 
FIFTBSGLTGIVLSNSSLDIVLHDTCYL 
VAHLHYVLS IGAVFAI 


3081 


16982 


A 


3103 


3 


387 


HERHEELSNS *ANNLI *LAYTIAFIVKI 
PLYGLHL*LPKAHVEAPIAGSIVLAAVL 
LKLGG YG I IRLTL I LNPLTKHIAYPFLV 
LSL*GI I ITSS ICLRQTDLKSLIAYSSI 

+ & s ■W W till *«Y ^k>« m* /kfkik* 

SHIALWTAILIQTP 


3082 


16983 


A 


3104 


3 


381 

- 


HEQSHAYHI VKPSP * PLTGALLALLMTS 
GLAMGFYFHFITI>LITiGI»LTYTIiTIYQ* 
WRDVSREST YQGQHTPPVQKGLRYGI IL 
F IT * EAFFFARYF * AFYDFRI/3PTPQLR 
GHWPPTGITPLNSL 


3083 


16984 


A 


3105 


412 


Ul 


FLGHPLFKKKMGGKKRGLPKMGV'HPPA 
PKGKPPPLKKKKKKPGGGGAPLYSPFSG 
GEGKKNPIiTPEGGGPKNPNSPPPPPPGG 
KKKNPPPLSPKKKKKK 


3084 


16985 


A 


3106 


298 


98 


FFFFFFF FFFFFFF FFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
F*L* 1 I Y* AICAINNIRGAGAELYRSRD 
FVCY 


3085 


16986 


A 


3107 

■ 


2 


417 


ATHVGLQDVTSPMI EQLITFHDHALMNI 
LI ICFLDLGALFLTLATKLTKTNM* YAQ 
EI*TD*TILPAI ILNLMALPSLRMLYIT 
DEGDDPSLTIKS IGHQWY*TYEYTDYGG 
LIFNSYIRPPLFLEPGELRLLDVDNR 


3086 


16987 


A 


3108 

* 


8 


430 


VGLVLFLSELPLNGGILTFFHQGIYSPF 
PGGRTV7ALMVGSWGLVMASTDLLGPLCH 
AFTPATQLLLNLAVAS PL* PAALRIGCH 
SKTTNALTHFLPRGTPTPPRP ILVTIET 

MOT T TDDTUDTWDT a.ZW*'B"T2MJin'.T T.tTT C 

V 


3087 


16988 


A 


3109 


3 


453 

* 


PRAI KF YRDW PGHERKRIAWKGDP CHMV 
LIKDEKGLMCQKKKKKTPFFWAPKIPLV 
F PPAQKNQGS * PNPPGGGGNP PLRPGP * 
RKKPPALMGPPPSSPGGINPKSFFYLNP 
GPAH* PRGEN * LGPVFENLFLPLLKKI P 
LWGELRCSQT 


3088 


16989 


A 


3110 


398 


2 


SLFQKNPNPLVG* KKKGKALPQGPPPLF 
P PLGGAGPGGSQGRGWGPPRP PWGNPFF | 
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mi 
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pepnue 

sequence 


Amino acid sequence (A B AIanine j 
OCysteine, D=Aspartic Acid, 
su vriuusiiiiu rt.nu, x'nenyiaiaoine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 

N[=A<m a merino P=Pi*ftlino 

Q^GIutamine, R»Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban,Y==Tyroslne, 
X=Un known, *=Stop codon, /^possible 
nucieoiiac oeieuon, \— possmie 
nucleotide insertion 














P* NP KI WPAKGGAP FSP PS EKG * AGGF F 
LPRGGGFPLTQIPPPPPHLGGKSKPPFQ 
KKKKKNRQGTSWMMQGWGPQ 


3089 


16990 


A 


3111 


3 


371 


SSDPPTSAFQGSGTTDMCHHHHAQLIFN 

J? pve loot- 1 WiyAly v(JWnUH(&aLtL»a(J;l lr 

GLR*SSGATVLNWAPALGPRRPDPTRME 
SLVLMKPRGSL1RSACPDCFLVFWFSFF 
HEAEGCASEC 


3090 


16991 

> 


A 


3112 

t 


233 


449 


FALFS * LATLHGSNMK*SAAGL* ALGFI 
BT.FTAGGLTG I VLANS SLD IGLHDTYYV 
cAHrHYGL»S IGAE FAI XGGFIH* FALFS 
GYTLDQTYPKIHFT 1 1 FIGGN ITFLPQH 
FLGLSGMPRRYSDYPDAYTT*NILSSGG 
S FNALNAGS IT 


3091 


16992 


A 


3113 


458 


2 


RGPPPPPPPKFFFF*TPGKNSPPPPPBG 
VFFPPS PPPNFFFS PPPLFFFGGVPP IS 
PPPPKSFSPQTPPRFFFPPPPLKKNFFS 
PPPLFLPPPPFFFYPPPPFFFFFFFFFF 
FFFFFFFFFFCGDLEGELPGTGMLACVI 
LLRANRKARTRG 


3092 


16993 


A 


3114 


1 


418 


LNTTFFYPDGGGDPNIjYQHLY* f*ghpd 
pdihi lprpgirshidtdysgkkepyay 
VGMGWAMTSIGFLWLMVRARPLFTVGVG 
VDAQAYSS FAS ITIAI PTGAEVFS *LCP 
LPLSGMK* TGAAVWALGIiRF I FTCSGR 


3093 

- 


16994 


A 


3115 

■ 


1 


425 


PRINNISF*LLPPSLLLLLAYAIVEAGA 
GTG*TAz PFLAGNYSHFGA^GlvLTIFSL 
HLAGGSS ILGAINF I TTI INI KPPAITQ 
DQTPLFV* SVLITEDLLFLSLPGIAAGI 
TILLTDLNLSTTFFDPAGGGDPILYQHL 

p#pi?nnRryynnTT v/mr o 
Jf w v c u tf. AfcaslsU V X. Jj x Qnbr 


3094 


16995 


A 


3116 


2 


383 

- 


GI^CTNHKDMGAI*YLLLGARAGVI*SXAL 
AFVIIFFIVIPTVLGGPGN*LGPLIIGA 
EEAGAGTG*TVYPPL 


3095 

» 


16996 


A 


3117 


1 


259 


PTRPALVVTAILIQTP*SFTGAVILIIA 
HGLTSSLLFCLANSNYERTHSRIIILSQ 
GLQTLLPLIAF* * LLASLANLPAPTPTP 
HO 


3096 


16997 


A 


3118 


3 


174 


LI RGGRGCHELRS RHCTPAWATRARTLS 
QKK* KTTNPKKKLCLI FFGGKKKKLKKG 


3097 


16998 


A 


3119 


155 


1 


PD FFNKS MDKKKTARGWEDS S S S FCFFK 
p nPvr.T.rr'DfiMcii vane* t.thaq 


3098 

• 


16999 


A 


3120 

■ 


420 

• 


2 


P P PKFFFP PTPPFLGEGGAKTPP PKKI F 
FLKKPPGVFFFPPLKKKKFFFSPP*FLA 

rtrtS~Lc c lSJ\.tretri: r £ r i: r sEEi: rr r v ert: 

FFFFFFFFFLVFIVLFQVKVHFLKKCFN 
I QFPXiAS DNS * PS M I HE KFYCESN IEF 


3099 


17000 


A 


3121 


48 


387 


PJ)PVLQKKEKKKKKKPKNQKKKKKKGGP 
F* KI P * GAKI KPGKEKKNFS PKRGAKKK 
NPGNFEKKTNFGGGKKWGKPPPKN*RFK 
GKKKFLKGKGGKKTQ I PWGLKI FFNGFD 


3100 


17001 


A 


3122 


237 


2 


PFPWLPPFPLKASSSP*SL* FLLGGWP 
NLPSSPNKGSFPKFPSWFLFRPP*GKNF 
YLALPR* PWPPQGFL*TAPP 


3101 


17002 


A 


3123 


448 


130 


PPRFFPEFFYSGPPKPPFFKTPVFLGVK 
PGVFFFSPYQKKPTNFGPKMGAL*RIPL 
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ing to first 
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residue of 
peptide 
sequence 


Predict- 
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nucle- 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=A|anine 

OCysteine, D=AspartIc Acid, 

Lr- vviuiamic aciu, i^rnenyiaianine, 

G=Glycine, H=Histidine, I=Uoleucine, 

jv— j-#ysine, M-f— jucuiJJie, jvi— irieinioiiinBj 

N=Asparagine, P=Proline, 

Q=GIutamine, R~Arginine, S=Serine, 

^Threonine, V=Valine, 

W=Tryptophan, Y=Tyrosine, 

X= Unknown, *=Stop codon, impossible 

Mll#tlAA^l#ln JaIa^! AH \aan.A*allklA 

nucleotide deletion, \ppossiOie 
nucleotide insertion 














FETP I WVFP I KGFHKKKPPVLN* PPTRK 
PPPDKILKKKKKDDCISRLARN 


3102 

• 


17003 

• 


A . 


3124 

• 


453 

- 


3 


SSAREGGGVPPPPPKIIFFPPPPIFSWG 
GGGTKSPPPEREVFPKNHGGCFFSPP*K 
RGKYFSPPPRMGPPPGVFFKGPPPNFFF 
FFFFFFFFFFIFFFFFFFFFFTKKKNWF 
FFKAFRMSPKPVPPFFFFCNYRWFRPR 
VRPRVR.PRV 


3103 

• 


17004 


A 

■ 


3125 

- 


443 


2 


YFP PFGR VI WGDSLGAGVLNPPR PHKGT 
PFF P K3CI FS I RPGWGGGPPIjPP PQRQMW 
GGPPPPLFGLFRPQGKIPFPPKKKIKPP 
GGVCVGVGVCV* KKKKNPFPPFLWAGLK 
EPCFFFtri? FFENri V CbuCJ^WbArcARSQ 
LNCNLHL 


3104 


17005 


A 


3126 


3 


186 


PVI YS TI FAGTL I TALSSH + FFT* VGLE 

GGGL 


3105 


17006 


A 


3127 


171 


38 


KKKKLFFPPREKWGPPKNFLKRAPPFFF 
FFFFFFFFFFFFFWSERSS*VA 


3106 


17007 


A 


3128 


16 


189 


ILGEVI WV* *FF* FIKKKKKKKKKKKKK 
KKKKKRGAPFKKTPGGPQITPGBKKKIF 
SL 


3107 


17008 


A 


3129 


401 


85 


LVNFFSPQEKRGFFPPPPPKNFFFSPRG 
FFFLGGGGPIFPPPKKSFFSKNPPGVFF 
SPP*KKKIFFFPPPLFWAPPRFFFKGPP 
PFFFFFFFFFFFFFFFFFSQF 


3108 

4 


17009 


A 


3130 


2 


312 


ANNL I * LAYT IAF I VKI PLYGLHL* LPK 
AHVEAP IAGS I VIiAAVLLKIjGG YG I IRL 
TLILNPLTKHIAYPFLVLSL*GGGVF 


3109 


17010 


A 


3131 


449 


3 


FFFFFFFLGGPPEIFFFFFCPPKKPKPP 
LGGGKKKPPFF*NFPQKPLGVLGAPPPP 
LCF F I KKKKGGGKKFFS PPLF * KGGPFK " 
KFFFFP PPPKGPP PFLKNIiRGWVFFKPP 
PKKKALSFFKKKKKXKKKKKKKKRAAAR 
DLELADAW 


3110 


17011 


A 


3132 


95 


448 


VINRE*KCVC*MKKKKKKKKKKKKKKKK 


3111 


17012 


A 


3133 


60 


442 


LGGFFFFGGKKGFCLWCPRWGAKAGIPV 
NGTPPRGV*RNFLAQPP * EGGITGPPPL 
PQ* FGFLRENGVPIiRGPGGFEPPIIAjEP 
PPLPPQKGGKNGRNPPPPLKGFLVLFKK 

G 1? o o ii V f o n IvJlrClxLJir 


3112 


17013 


A 


3134 


236 


45 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFPOAAD*VILFRYFQ 
rtiAi. AMov 


3113 


17014 


A 


3135 


441 

• 


88 


KQQTP PGLI FF * KAPRRE I FL P PP IMVF 
FSPPSPFKFFFFLSPFJFFGGVLPFFPP 
PKKGFFFKNPRRVFFCPPIiKKKNFFFLP 
PFFFGPPRVFFYPPPPIFFFFFFFFFFF 
FFFFFF 


3114 


17015 


A 


3136 


2 


328 


TMLSPKPQQLNQQNCSPEHYEPQIjKTQR 
TWR*KKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKRGGGQKKKMVGGEKKKPG*KIFF 
FFIKKVKKKTALGDKKKTQFWGGS 


3115 


17016 


A 


3137 


281 


2 


KATKSGTP IPSQGQQSLAWSWAGIGSAQ 
PPALLHS *PIGKIFKNCMPVGRKSPQLP 
RNTS WQLGAVAH P SNSSTLGGRGGRI TR 
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SGVRDRTRG 


3116 


17017 


A 


3138 


448 


3 


FFFLPPS PFFPY* KRGFSLFGRWFNFP 
PPVFPPPWPSQIFGFQALIFLPPPPPLF 
FSFPVGFFQTALFPFNGFFPGFFHGLFP 
LLRFCPRK*VFWGGFFFFFFFFLRDKVS 
LCHPGWNAVAQS E FTTALT S KAQAPTRP 
PTRPPTRP 


3117 

i 


17018 


A 


3139 

i 


2 


436 


DR*LFSTNHKEIGTLYLLFGA*AGVLST 
AliSLLILAELGQPGNLLGNDHIYNVIVT 
AHALGKI FFIAI P I I IGGFGN* LAPLI I 
GAPEMAIPRItTOIS*GLLPPSILLLLAS 
AIEE AGAGTG* TDY P PLAGNYCHPGASG 
DLIIF 


3118 

■ 


17019 


A 


3140 


362 


2 

■ 


KPRRGKFFPPREGGGGFPPPPPKNFFFP 
KGGKFLGGGGGKNSPPQKKGFFQKNPGG 
VFFPPPKKKKNIFFPPGKMGAPPGFF*R 
GPPPFFFFFFFFFFFFFFFFFGQSGQVK 
LKSPKCKL 


3119 


17020 


A 


3141 


440 


102 


PTPPPCCKFSFKRPPKKTLFFPTTNLVF 
FSPIPP *NFFFS PQALIFVGFLAPIFPP 
QKKFFFSKFPPLFF^TPPLIKKFFVPPP 
PFILSPLKIFYKPPPPIFFFFFFFFFSF 


3120 


17021 


A 


3142 


1 


79 

i 


FKLDYFSI IFIPVALFVTWS I IEFSL* Y 
INSDPNINQF*KLDYFSI IFI PVALFVT 
WSIIEFSL 


3121 


17022 


A 


3143 


3 


441 


FFFPPPLKKKKFFPPPPNIGPPPKSL*K 
PPPPPFFFFF 


3122 


17023 


A 


3144 


223 


3 


LPYWKLPYLKH***LQDTNQESRG*HFL 
RPRP FKNQMKS GTVAHACN P S TLGGRGG 
RITRSGV*DHPGQHGEI 


3123 


17024 


A 


3145 


65 


414 


KKKKKKKKKKKKKKKKKKKKKKKKRGGP 
PKKKTRGGPQNPPPKKiCKFPPQKGGKKK 
PPLGF*KKTPPLGGEKIPPPPPKKNTPP 
KKKKKF*GGGGQTPPPPPPGKKFSPPRN 
KKKK 


3124 


17025 


A 


3146 


410 


31 


RANQKAFRGKPLCDLAVGKNLSSRTQIA 
LTI * KW INWTILKLRTSGH* KTP I KTIK 
RYPlJSGBKISDBEIiYIiQY*KVL* IGKKK 
PDN PVE KWAND L KRS FMKGN I LTVFKGM 
QR*LGSMAHICSIiS 


3125 


17026 


A 


3147 


182 


241 


SHPSHHSTINITNKGLL*TPLPI PNPLV 
NLNLGLLF I LATS S LAVYS I L * SGGASN 
SNYAL I GALRAVAQTI S YEVTLAI ILLS 


3126 


17027 


A 


3148 


254 


1 


KTKKGLKIKDPLTRF*ISVC* ITKSI*F 
F iujLPrr PLLiiUsnlPLKiIir FFFFLFET 
EFRSCCPGWS*mV*SLLTATSTSQIQAI 


3127 


17028 


A 


3149 


499 


2 


NTPPAAGGGCFFFFFFGEKNFPPPHPTP 

PPPKKVFF* KIPRGFFFPPPKKKK1FFF 
SPGFFGAPPGFFLSGPPSFFFFFFFFFF 
FFFFFFFFFFFFFFFFLINFLLSLQGFF 
L VS KB L FL FGLTVKF TRG FRG FGGQ 


3128 


17029 


A 


3150 . 


256 


146 


KNAKVTQVG PE FNKG PG * HTHTHTHSHT 
HTHTHTLQ 


3129 


17030 


A 


3151 


333 


1 


TISCLCTRGEHPLSPRJRAGPYTGSPLHC 
CVDWDKVFS S WKDLTD W PLGDLD I ETF 
TDGSS F I LRGVCRAG YAAVTLD SAVEVL 
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- S VSAETS A* KAEL IALTRAL wLGKEQK 


3130 


17031 


A 


3152 


43 


379 


NKTSLFGWDYIWEWGAPEPETPPKRAAG 
ALHSLAQPFSVALPPCFFDPCAPSLSPG 
♦HALRPLPLPGLASEIQTAPSWHVPPKS 
IiSPAPQPCPIPTLVPVGYKTPP 


3131 


17032 


A 


3153 


398 


12 


NTTPGGGKFFLKKTREEKFFPPKKKRGF 
FPPPPPKNFFFPQGGNFFGGGGGQISPP 
QKKGFFQKNPRGVFFTPPKKKKI FFS P P 
GKMGAPPGFF*RGPPPFFFFFFFFFFFF 
F FFF FFLKKS WRPIiAI 


3132 


17033 

I 


A 


3154 


371 


105 


SPSPQVNFIKGPKPPPPK*IL*RAPNPP 
LPKKKFSNPPTWGPGPQPPPSXFSKFAR 
FPFLPPPFPFKKEPPKKKIFFPTKEGTV 
I*KNPPFSGFQSPDSIK 


3133 


17034 


A 


3155 


3 


371 


DVGADPILHTSTGL*LAMQY * PEA*TAF 
SSIAHIT*DVyYG*VIRYLHANGA*IFF 
ICLLLHIGRGLYYRPFLYSKT*KIGLIL 
LLATITTAFIGYVIiP*GPI* F*GATVMT 
NLLSAIPYIGT 


3134 


17035 


A 

i 


3156 


1 


398 


IAIPTGVKVFS * LATLHGSNMK* SAAVL 
♦ALRYIFLFTEGGLTGIVLPNSSLNIVL 
HDPYYWAHFHYVLSIGAGFAIIGGFIH 
* FPLFSGYTLDQT YAI IHFTI FIGVNLT 
FFPQHFLGLSGMPRRYSDYP 


3135 


17036 

• 


A 


3157 


401 

• 


46 


PPGGRNFFKKTPGKKI FSTKKKKGFFPP 
IiPPKNFFFSPGGFFFGGGGGPNFPPPKK 
GFFSKNPRGVFFSPP* KKKI FFFPPG* I 
WAPPRVFIjKGPPPFFFFFFFFFFFFFFG 
KKSVRA 


3136 


17037 


A 


3158 


404 


18 


FFSPPPPPPGGGVFPPNPNKNPFPPPPP 
PFFLGGGPPPPPPPPP*FPPPPTPPNVF 
FFSPPKKKNFFPPPPGPPPPPPKPPPPP 
PPPFFFFFFFFFFFFFFFFFFFLFFFEP 
MEKGPRPGDIGSKPPS 


3137 


17038 


A 


3159 


3 


192 


S L V IAGCPR* NLS S TIjNLPTE PSKS PCK 
PWGGKK 


3138 


17039 


A 


3160 


205 


80 


VQRDNFGFLQPS PSGVKLFFCIiSLPNKW 
DFRCGPPNPG* FFS 


3139 


17040 


A 


3161 


3 


384 


IirVPTIILLPLT*LSKKHII*IINTTTHS 
LIISIIPLIiFFNQINNNIiFSCSPTFSSD 
HLSHPILKECKKKKKKKKKKKKKKKGGGA 
FKRTPGGAAHWGGGGR2TFF PKGGEKKN 
RPGVFWKQTFFWGGK 


3140 


17041 


A 


3162 


348 


70 


GPPPKKRVFSKTPKVVliNKPPQKKKKFI 
FPPPVNLGP PKNFLKGPPPFFFFFFFFF 
FFFFF YFFWMGCx/R* CS WRHS S PPRLiSG , 
TPKCSPSVT 


11/11 

J141 




n 


31 0.5 


z 




SEDHI I *TANVLYTHA* ALS ITLIATSL 
TSAYSTRI ILLTLTGQPRFPTLTNIETK 
KKKGGPFNRYPLGAQVYGGGQNEKFFLI 
GREII 


3142 


17043 


A 


3164 

■ 


1 


221 


PTRPRDCSELRSCHCTPAWATEGDSISR 
KKKKLSTRTAF*YTEAINSLIYSLNIGT 
FKTYFPKIKTYDRHFDF 


3143 


17044 


A 


3165 


26 


383 


IPFYQ*SLI*YTRKKKKKKKKKKKKKKK 
KKKKRGG 
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3144 


17045 


A 


3166 


251 


381 


GNLCAQ WARWLTP WPALWEAEAD * SRG 
QE IGAI LANTVKPHLj 


3145 


17046 


A 


3167 


391 


115 


LFKKISPHAGIWGFFSPLTP*NFFFSLE 
PFIFGRGLAPIFPPPK*RFLSKNPPWF 
IPPPLMGKPFPPPPPVRLGPPIYSFKGA 
PPFFFFFF 


3146 


17047 


A 


3168 


94 


389 


SPGILGQKGQIGPIGNHVPGGLAAPVTP 
* FRFKPRLP * GFG PKAS PPLALKPERAQ 
VGGTPPPGPRGPNGQPPPFKENQPGLGF 
RFRGKLAEKRGFHL 


3147 


17048 

i 


A 


3169 

t 


3 


363 


WATALQ PGQQ S ETQSQKKKKKKKKKKRG 
GPPFFFFFKKKI FFSPPGAKNKRGKFF * 
KPPKKKKKIFFFPPPPLKKKKKKKKKKK 
KKKKKKMGGAFLKKPRGAPPPPGKRKLI 
FFFLKGV 


3148 


17049 


A 


3170 


2 


171 


KEPLGYIRMV*AMISIGFLGGIV*AHHI 
FTVG I DVDTRAYFTSATI 1 I VI PTGVKV 
FS*LATLHGSNMK*SAAVL*ALGFIFLF 
TVGGLTGIVLTNSSLDIVLHDTYYWAH 
FHYVLLIGG* FYLRYHNHCYPHRRQS I 


3149 


17050 


A 


3171 


390 


27 


QSLTVKS P YP WI L ]j,KTKGHH * VMNAGL 
TRYQSLLCENPHIRSEVCITIiNPPPLLP 
VSESPVKHSCVQVIiDSVYSSGPNL*DHP 
♦TSVDWELYVDGISFANPCKVSLKKMTS 
PAPVTPRS 


3150 


17051 


A 


3172 


127 


310 


KNPGGAKrioRGGERKNFFIiKRGGKKKHL 
GIFGKKTFFWGGKKWAKPPK*WRFFSPF 
SP*KFFFSLKALIFFGGFCPFFSPPKKS 
FFS KNSQVFFFS P P FKE KI FPFPPP*NF 
GPPRVFLKGPPPFFFI/3VFFFFFFF 


3151 


17052 


A 


3173 


376 

+ 


3 


FFFFFLRRQSIiAVTLAGV* RCNLSSLQS 
PPPGIK 


3152 


17053 


A 


3174 


365 


14 


RENFFSPEGGAPKNKPPPPPPPPGGKKK 
IFFQKKKKKKI FYPWKNFPPPPKKKKKN 
P* KPGPLKSQGFFFFKKNQNLPWGPPQI 
SFPKKKKKKKAKTVQERKYNSNTQLVSA 
ETQLL 


3153 

• < 


17054 

■ 


A 

• 


3175 


1 

• 


377 

■ 


VPLHSSLGGKARIiHLRKKKKKNPGFLKN 
FGPLALLGMGVGNI KGLKGQKGKNPAFG 
AHTGGGCFSLRGRNTPFPKRAEGIICYN 
S PH* KET* KALE PRG YKGL * QALALPNL 
KSGKMEHI LRGAP 


3154 


17055 


A 


3176 


352 


125 


GHEVLDS SDLPAS ASQSAG ITGVSHHAL 
♦EILLSMFETTWACDLLFQNISFIK8SI 
PCFIGLDFIMPHRYCRFFF 


3155 


17056 


A 


3177 


139 


366 


TAHTS*GY*VKNYINLSFCFFFFFLERN 
HSALQPGGQGHNRSSLQPWFPGLKQFSC 
LSLPRSWDNGLVPQHLVNFF 


3156 


17057 


A 


3178 


206 


1 


KGTLFKKDFFFKN F PKKVFLGVPKNSLF 
* KILTLPP VIiNP YP FFFFFPFETVSLCH 
PGWNAWPRLEP 


3157 


17058 


A 


3179 


330 


1 


I V5TLETC Y I AYNEEE KDTF ITLRI YVI 
GGNGKFLGI* IKQHIKKIIHHDQVGFIP 
GMQGWFNICRSINVIHHISRIKGKSRAQ 
WLTPVIPALWEGDAGGSPEVRSSRPA 


3158 


17059 


A 


3180 


3 


399 


HAS AP LQS S 1X3 YRARPCFKKKKKKKKGG 
GGGGFFFPKGGFGPLPKKGFFSGKGGLG 
KWGLGGAGKTPGIKKPLGKGPPKKRGGK 
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KGOKGPHF*KKKKKKGGKKLKKGPF*KP 
FKKPLLGRGVQP *NPHLLOG 


3159 


17060 


A . 


3181 


2 


204 


CTTACPFW*NKEL1jMPKKKKKKKKKKKK 
KKNFFFFWGGY 


3160 


17061 


A 


3182 


71 


377 


PKRGGQPKQKKIWGPPPPGAPPQKGMGF 
FNPRGFKKQITFFPPPPPPPEKKPPFF* 


3161 


17062 


A 


3183 


151 


2 


FFFLEELVPLLLKLFQKIEEEGCSFNSF 
NEAFI ILIPKPFR* TTKKENF 


3162 


17063 

* 


A . 


3184 


152 


387 


YF^GFI YWACICTS^LFFWfe'LmFrJ 
FVTQVEVQGPNFT* LNPPLFGLKKFFCL 

rrrr ^ YlTHilYTXTD DT T)T3T3/~>X7 TT?^m?V 

TJjUIGWNWKFJux'r'FQVXJfL.Jf 


3163 

• 


17064 


A 


3185 


22 


336 


YE KCTAIiLQMVS S F I WMEREGTHQYS FY 
RKDFSLASKVNIVS YYLSP IVE* FFFFL 
RGS P FAPQAGGQGPNLGSWKPLPPGLMP 

nc r~*T rnr Tir* r**TJTKi r*T) n nnnownT 
FSCJjTJjPGGWNCRPPPt'Gi?VN 


3164 


17065 


A . 


3186 


344 . 


1 


WVLKKIFFYPGRGGPPFIPPPLGGQGGP 

TnMiMjno*nnonTinntnii/io+vtwvT/tsrtrt 
I PWARGr * PPRGPP PKNQr w JUUUUU(u\* 

GGP P PGFPPPGGPRGGVP FFLGGGGPRK 
XT 


3165 


17066 

■ 


A 


3187 


296 


1 


np kkiltlpkktkvykcegenqvp 1 1 fq 
alftigktwkqpky 


3166 


17067 


A 


3188 


2134 


1 


GVAAHACNAS I LGGQGGRI I *GWBFB TS 
LANMVKPC 


3167 


17068 


A 


3189 


I 


159 


LQDHPG * HGE P PS I VKIQKLARHGSLRL 


3168 


17069 


A 


3190 


119 


340 


CIKKNRLVSARGKNNKRK* IYKP*VDIF 
FKEDIQMAGKQMKRDLISLIIREIQIKT 
TITCYLIHARMGTITRD 


3169 


17070 


A 


3l9i 


75 


I 


Lio v viSS V W iro 1 V/ulivJNr'o luub^uo 


3170 


17071 


A 


3192 


343 


1 


I FI LGGGP CCS P VF FQFFGGGGGGFFLP 
QNFFPPRGKIFRPFFF*KKKLKRPNWGF 
F*NFFNPPLGFFNLFFFFFKKKPKNFFF 
LGGFFFFFFFFFFFFIAGGDSLALSPRL 


3171 


17072 


A 


3193 


105 

• 


368 


KFKDPPFPPPFFWPPPKQLPPPPGKIGA 
PHF*TPKGPPPPKKKKC*KKKILKGGRG 
KKKKKTPPKRPQKIWGPSKKKNPWGGGK 
TPPL 


3172 


17073 


A • 


3194 


1 


365 

» 


rCRDEVJjLir C JrDWo Fx iAjJjJvy r ortLiUbr 
KCWSYRC* PPYPAQGVFLK*HLTKSLSC 

LCHLLSTLIPFTHSLFFLFLGWFLRQHR . 
SVTQAGVQWR 


q IT? 


1 /\J/ t r 


A 




i 




GNQ PKRLNAGTYLLFYTLEGS VPLLIAL 
MYTHKTL»WSLNILLLTLTGQELSNC*AN 
NLI*IAYTRALIVMIPLYGLHL*IiPKAH' 
VEAP IAGS I * LAAVIiLKLGGYGI IRLTL 
ILNPLTNT 


3174 


17075 


A 


3196 


137 


3 


KGQPRFPPISLKRGSQGKKGFFFFFF+Y 
RIPLCHPGWCPWQSW 


3175 


17076 

- ■ 

• 


A 


3197 


1 


266 


EGRGCSEIi* SCHCTPA* VTVRYPVSKKK 
KKKKKKGGGKGKKKGGEKNTLFGPKKGK 
LRGPQKRGKKIGPEKKVGNNLKKGIFFR 
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EKTF 


3176 


17077 


A 


3198 


in 


446 


YWS EHYAMTQVLEGFS YSLQDHF YFCFR 
SIRRI I FYSLIKPS IND*GERELEPITT 
SQALQIAGRAGRFSSRFKEG 


117*7 
ill I 


1 /U/o 


7\ 








OPftPFGKTPFFLKT PNT.TPGRCtTT GT/KPP 
LFKRVKPENCLSLRG*GCN*PI 


3178 


17079 


A 




2 


330 


PPFPKKTTQKKPKKKKRGGIiNWGVKTPPP 
r T*n UTMn t q winner, itkwt. T.inrifK' d hci 

NPLF*FRGF*KKKKAPFKKKKNPPK 


3179 

• 


17080 


A 


3201 

t 


2 


385 


FP PFLGGPQ I PKFLKFFLRAN * NLFGLF 

KKIFFKTFFQKPLFFF*KKKFFFCFPPF 
FFLSRGFFVFFSPKKPFFFFFFFFFFL 


3180 


17081 


A 


3202 


Off 

355 


2 


rt rrrbiillloLoVAUAuVUWniijbbb'A 
LPPRFTP 


3181 


17082 


A 


3203 


156 


1 


LKS LLWEAKVGGLLEVRS S RPA+ ATWRD 
FISTKN*KISQVWWYILVWTTWE 


3182 


17083 


A 


3204 


352 


175 


QPRGRPAPAHPP * CPLRLALPC* CPCPA 
CCPPWAEASPSGVQASPARAPACPARAIi 
LNE 


3183 


17084 


A 


3205 


280 


373 


QRGTRIFSDIiQT*KKKNKSPFKILLLID 
NALGHS RVLTERYKDI FRPANTTSFLRP 

JL/TT rADT T T5TI7AD 


3184 


17085 


A 


3206 


1 


367 


EIESIQIDGHTKNKFLGIKLTYLTKEVK 
DLCKKNYKTLLKE I IDDTEKYDMLMD * N 
IflVKMAILPKAIYRFDTISTKIiLMSFFM 
ELEKIF*NLY*KA*MAETTISKKNKAGG 

1 X IiIuJFaJjX X 


3185 


17086 


A 


3207 


368 


12 


FAQKKIGCKKKKKKKKKKKKKKKAPPQN* 
RAPQKPLKPPPRVFIjIPPPIiGSPPPPAF 

LLA 


3186 


17087 

• 


A 


3208 

• 


2 

• 


356 


KYLFSS I PEX3KEKMKGIANLFNEI ISBN 

IIVKLSKVKDKBRILICrARKBCFITYKR 
TP IRPGMVAYACNPSTLGG * GGWIMRSG 


3187 


17088 


A 


3209 


359 


170 


INIKQIiPRCGGTCLWSQLSRRLRRQDCY 
YKINKFL 


J ioo 




a 








I DDQ L I EDKDVKT PB * RNNGLFNKWCWE 
N 


7 ISO 
J 1 07 


1 TflOA 

i /uyu 


JA 


JZl 1 


X 1 o 


AAA H 


IDTMTVNI ILPLRSLNI VITNPYNI *HH 
QHDLDYTYPDTTGQLVNCAYFINLLQP 


3190 


17091 


A 


3212 


340 


355 


LGLQG*REGNFGSIiQTSPPGFKRFSGLS 
LLSSWDYRV 


3191 


17092 


A 


3213 


254 


40 


W PGTMAHACNLS TLGG * GG C I TRPGDRD 
HPG*HNNNN 


3192 


17093 


A 


3214 


198 


25 


YNEKEKPVKMLKI IANSLGAVAHAYNPS 
TLGGQGGRIMK£GDRDHPG*HADAWVET 
VL 


3193 


17094 


A 


3215 


376 


3 


PPGVF*RGPPGFRTLPPKSSSSSPPPKI 
LGGPPFLAFFGGAPPQKKPPSSSSSASS 
PPSSSSRAPLKKGGPFNPAGGFPLFGGP 
PRGGPPFSSSSSSSSSSSSSSSSGRRSR 
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SEQID 
NO: of 

nitrlpftHrlf* 

IJULICULIUC 

't 


SEQW 
NO: of 

|f vpUUC 


eth 

uu 

• 


SEQID 
NO: in 

09/515 1 
26 


Predicted 
beginning 

location 

correspond 
■no to first 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

n it4*l a_ 
IjUCIc- 

ntiHp 
uiiutr 

location 

VVI 1 WUv 1 J 

ding to 
last amino 
add 

residue of 
sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartic Acid, 

ft'; — f* 1 1 1 fan m i O Af*irf IT— Plianulnlantna 

r/— vjriuiainii. aciu, r = rnenjiaianin6 1 

K=Lysine, L=Leucine, M=Metbionine, 
N^Asnfl rflpine. P^Prolirte 

Q=Glutaraine, R=Arginine, S=SeririeT 
T=Threonine, V=Valine, 
W=nTryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, A= possible 

niiflpntiHp HAtotion \sni\««ihl» 

UUVIvVUUC UCICUVJIf l pvjjIUlC 

nucleotide insertion 














. SRTSRTRGRTRG 


3194 


17095 


A 


3216 


393 


260 


VSQDGLDLLTS * STRLGLPKCWDYRREP 


3195 


17096 

• 


A 


3217 


3 


387 


HASASSGSRAPPFFFFFNKKTGSKBLET 
TRGTVLKTCLFFKXKKKVLGGPKPPLGN 
PKGVGKPLKGFPGPNPWPPP*KGPGCHF 
* KNGF* DPGPKKSHI PSWVGGKGSSLLP 
urlwuroJSV X r oeuKn. 


3196 


17097 

i 


A 


3218 


77- 


406 


RMARPELGLPGNLLSNDHIYNGIVTAHA 
FVIIFLIVIPIINGGFGN*LIPLIIGAP 

rNM5\17T5T3 TVTVTTOT3 + T T DT5PT T T T T KOIkTtr 

EAGAGTG* TVYPPLAGNYSHPGASVK 


3197 


17098 


A 


3219 


394 


2 


KRRYFPDGLNFFWG PGI LKI FVKKKVSS 
LKKKKKKNFP PVFLQGWGGNKNFKGGGL 
KFSKPNLI *FFFPQKEAR*KVFFFFFRS 

nrttr>^T»T DT5OT3013BT nMTfiT fflmnuniiTn 

PQKDPIjRE r Fr FFLRQ V SLCHPDWSAVA 
RSQLTSAS ISRAQGAGRV 


3198 


17099 


A 


3220 


376 . 


157 


KFFFS P * KFFFS PKPLKFGGGVG PI I PP 
f KKKF FF KNPQG VFKKF Py KK&KI PF QP 
PVNFGPPRDFLKGPPPFFFFFFFF 


3199 


17100 


A 


3221 


394 


1 


VPPPQKFKTPGPPPPPREFFFF*KKKGF 
PPLGGFLNPAPKNPPPGPPKKVGFPGGP 
PPP PGG Jr FFS PPXiS F * NPGBRVFFGFKJ 
PKRKFLLKWGKGGKFFPKKPIFPPPPKK 
KKKKKKRAAARDLELADAW 


5200 


17101 


A 


3222 


207 


402 


SILM*LCCLFPLPGVTPIDGAPHRSYRE~ 
CY1*VLIjTOVMVG WVDKDLAPG I ADSIiRH 

rJWMKlsK.K.1 


3201 


17102 


A 


3223 


309 


3 


YPFFHL IDIAIH POTCFTKF YKATVTQT 
Anon X y J-K X J.DU w WV3f 1 albiSl f rnl XfiH 
VI CDKHDKNKQWGKD SLFNKWC WEN* LA 
ICRKLKLDPFPTPYAKI 


3202 


17103 


A 


3224 


3 


382 


LDRERPPFFFWGARHMDIPQLVNLSINK 
GHWANFNFLGYKKKGGWEKKKKKXKKKK 

KuljrF w ISJS^£r fcrvjtalrJx^ Fir vjoJxK X r f I Mvj 

GKKKPPGGFLEKPPPLGGAHLGNPPPQK 
YTPPGKKKNLKRET 




1 / lU'f 


A 

r\ 






L 


GSSDS PASAS * AAG I 


3204 


17105 


A 


3226 

• 


349 

• 


2 

• 


AGVPPGNPPLWGGEGGGSPRGGGLKPGF 

JryKoWJrXTI? r iVJVj£>yjr A Irirx^ouxririjxirlrr'n 

GGGAGGSPLPQGQRFQ*TKIGPFPSPRG 
KKKKPPFPKKKKKKKKRKEKSEMPGFMV 


3205 


17106 


A 


3227 


3 


239 


LNKVGRGCS EPRSRHCT PAWATE * DS IS 
EKKKKKPESRGILKVKGIjTQALFYLALW 
LYYLHPTAKQQIWGFFIYFSKP 


3206 


17107 


A 


3228 


205 


I 


IGLKIQNSCPLKDSLKKIKRQATDWRKY 
LONTSDKASVFI I YKEHLOLSNYKAVDP 
IK*WAKEMNKLH 


3207 


17108 


A 


3229 


174 


1 


VQMIiEDKS FEETP * FSSETLVLKI phvq 
PGAVAHTCNPSTLGGRGGRITRSGDRDH 
PG 


3208 


37109 


A 


3230 


2 


340 


KNHSVYLLCVFSIPFPTFCFF*F*VFNT 
F*FKLN* IN*FTYIDRVLLCHPGWSAVA 
+S*LTAALNSWAQAVPCLSIiLLAHHHAM 
LIIFKKTAYIIHIWYVWYDTTYPFKVYN 
S 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M, 
etb 
od 

• 


SEQID 
NO: in 

* if Cut 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
C=Cysf eine, D=Aspartic Acid, 
E=Glutaroic Acid, F-Phenylalanine, 
OGIycine, H=Histidine, I=lsoleucioe, 
K^Lysine, I>Leucine, M=Methionine, 
N s Asparagine, P^Proline, 
Q=Glutamine, K=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W—l rypt o p nan, Y»Tyrosi ne, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \° possible 
nucleotide insertion 


3209 


17110 


A 


3231 


1 


313 


KKSTPYQRGFPPISPARVPFSIKFFLEA 
ITFLLFDLEDALLLPLP*ALRFIFLFTV ; 
GGLTGVVLANSSLDIVLHDTYYVAPHFH j 
YVLS IGAVLAI IGGFIH* FP 


3210 


17111 


A 


3232 


207 


355 


BLS P FRLKKTL Y * IxGMVAHACNPS TLGA 
RGGQIA*AQBFKTSLGNMAKP 


3211 


• 17112 


A 


3233 


3 


355 


TTQQ*LIKLTCKQTIAIHNTKGRT*ALI 
LI SIiI I FIATTNLLGLLPYSLTPTTQLC 
INIiAMAIPL*ADAEVIGFRSKIKNAI*AH 
LLPQGTPTPLIPILVTMBTINLLIEPIA 
LARRL 


3212 


17113 


A 


3234 


2 


355 


KYLINNRLITNQQ*LIKLTSKQMITIHN 
TKGRT*SLILISLIIFIATTNHLGLIiPY 
S FTPTTQLS INLAMDI PL * SGAMVIOFR 
SKI KNAIiAHFLPQGTPTPIiI P I LAI I ET 
ISLLI 


3213 


17114 


A 


3235 


43 . 


370 


QG CVKG WVLEE Q VRRGW I LD S S EGKTDL 
KQRGS PGS WPEHVGGWSGVMG * SEAWTG 
QARWLTP I ITALWEAEVGGS LRPGVQWH 
NLGSLQPLPPRFKRFSYFSLPSSWD 


3214 


17115 


A 


3236 


15 


356 


LIQPSLKLMISIHNTKGRT*SVILLSLI 
IFIATTNLLGLRPYLFTPTTQLSINLSM 
VIHL * AGAMVIGFRSNI KNALAHLLPQG 
TPTPLMPILVIIETIRPLILPIALAVRL 
TA 


3215 

• 


17116 


A 


3237 


1 


376 


GTRTNTLTI YQ * WGDGTRBSTYQGHHTP 
PVQKGLRYGI ILFITSEVFFFAGFF* AF 
YHS S LAPTPQL* GHWPPTG ITPLNPLEV 
PLLNT CVLLASG VS IT*AHHSLI ENNRD 
QIIQALLITIVLG 


3216 


17117 


A 


3238 


1 


358 


GTRNG * YTNA* ALS I TVIGAS LTSAQGS 
RIILLTLTGQPRFPTLTNINENNPTLLN 
P IKRLAAGSRFAGFLITNNI SPGCPFQT 
TIPLYLKITDLGVTFLGLLTGLDLNYLT 
NKLIIKA 


3217 


17118 


A ' 


3239 


258 


390 


RQGLLMLAGLVLNSWPLQSSHLGFPKCW 
D YGREPPCLGN* L IL 


3218 


17119 


A 


3240 


2 

* 


372 


ARARFHHVSRDGLDLLT* *NTHLRLPKC 
WDYRRBPLRPGKTFFLKKKKKNS IFLFF 
REGFKEKSILGIKFFRPTGGVLILTGNH 
GWGCKTGTBLLVPSRFPGLAFKICGLWA 
HDTPHRVRNWL 


3219 

• 


17120 

■ 


A 


3241 


3 


283 

■ 


HERL WGG W KTGAAGLGRTS SRPTASLTQ 
T*TTMTH* SRTTGCSTGSGRTWTRSRHS 
CAWGMAGLYRVAVASRGPRGMM* PTPPH 
WLCwAwKvP 


3220 


17121 


A 


3242 


170 


3 


I KSQAGLVGFLGP FS FQDSLNLVFVFG F . 
FCF *DRVS FCSPGWSAWQSE FTAALV 


3221 


17122 


A 


3243 


2 


385 


* LSTLHGSNMK* TAAPLLTLGFI FLFTA 
GGLTGLELTNSWLDIVLHDTYYVGPDFH 
YVLS IGAVFAI IGGLLH* YPLFSGYTLY 
RTYAQIHFAIIFIGEK 


3222 


17123 


A 


3244 


1 


106 


GTR YVGQAHLKCLTS S DSPASTS QS AG I 
TGVSHSA* PASTS QSAGITGVSHSA 


3223 


17124 


A 

■ 


3245 


2 

0 


360 


ARANTLT I YQ * WSDVSLENTYQGHHTPP 
VQKGLRYRI ILF ITSEDYFFSGFL*AF* 
HSSLSPTPQL*GHWPPTGITPLNPI,KMP 
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SEQID 
NO: of 
nucicvuue 
sequence 

• 


SEQID 
NO: of . 

peptide 
sequence 


M 
etb 

Oa 

• 


SEQID 
NO: in 

09/515,1 

AO 


Predicted 

beginning 

nucieoiiue 

location 

corresponfi 

ing to first 

amino aUQ 

residue of 
peptide 

* 


Predict- 
ed end 
nucie- 
otide 

lUCallUll 

correspon 
uing w 

last amino 
acid 

rcaiiiUv uj 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
n— vjiutarnic aciu, r deny la la nine, 
G=Glycine, H=Histidine, I»lsoleucine, 
*v=i>yoifie, i^Licucine) ivi s i>iecnionine, 
N=Asparagine, P^ProIine, 
v^-v>iuuiiiiiiie» iv— Arginine, o = aerine > 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, V=possibIe 
nucleotide insertion 














IIQALLI 


3224 


17125 


A 


3246 


17 


160 


GG*GCSCSEL*SCHCSPAWVTEiQb^DSK 
KKPAI LASCLKHLNPLSSH 


3225 


17126 


A 


3247 


236 


2 


WAHI YMTPLSPPSFLKPVQ* KNFYI YST 
YSIiDN *NPSS PKAKRAPKKS YTLF YLHL 
CVCVCECV*VCVCVCVCVCVWV 


3226 


17127 


A 


3248 


2 


2220 

» 


FFGGGRPSPPQGYFLLNNHSSPSPPVKL 
NPGPA* rYPP.TKGKNFPPPQR* PSPPKN 
IKTPPPSFFFSS 


3227 


17128 


A 


3249 

i 


2 


372 


AYTISFIGKISFYGLH* *LPKAHVEAP I 
VGSIViyVftKTiLKLGGFGIIHLTLIFNPIj 
TKHIAYPFIiGLSL*GISlTSSMWLRQTD 
LKS L I AYSS I SH I ALVVTAI 1j IQTP * S F 
TGAFFLIIDHG 


3228 


17129 


A 


3250 


293 


56 


BGS PKVI FNKS PPHHLFLFLFFFFI FFF 
♦FFFFFFFFFFFFFFFFFFFFFFFFLYL 
LAMFYLS FFFKQDNQRYRQYS I K 


3229 


17130 


A 


3251 


169 


370 


IrKMTELRGAP AS KP RGQEPPPHYPCHHH 
HHHHHHFL*VTKGQGPHHWPSPTRDPGW 
L*SPS*EDQRR 


3230 

• 


17131 


A 


3252 


22 


156 


GERIGLGLGGQGCSEP*LCHCTIiAWVGD 
TVRPCLKKKKKKKGPPF 


3231 


17132 


A 


3253 


3 


400 


QNQTP LLD* GGL ITAVLLLLYliAVLTGG 
ITILXADRSLDSTLFYPAGGGDPILYQH 
LF* FFGHPEVYILILPGFGI VSHIETNY 
WGGKEPFGFVGMV*AMIAMGFLGFIG*A 
HH I FTVGVD VHTRA * FTSAT 


3232 


17133 


A 


3254 


373 


31 


REVGPPTP*KIFFFPKGLNFWGGGGPKF 
PPPKKKGFFKKS PVGVFPP PGG * KSGPG 
PGFKKPPQRGKNISFPAGGKIGPPRGTL 
KRAPPFFFFFFFFFFFFLLWVYVQVERP 
TL 


3233 


17134 


A 


3255 


1 


379 


LNLIQRQ*R* V* KFL*LPPQT* KKKKKK 
KKKKKGGGAFKKNLGGAKFNGGRKKKIF 
FLKGGVKKKKAGGGFKKRGKGKKCYLGI 
FBKKPFrQGGKNWENPPKKI KGLiREKKK 
F *GE KGE KKPEKAG 


3234 

* 


17135 


A 


3256 


42 


376 


FC 1 1 S XiVHHC I YNDLiS FERKKN I FVPGQ 
INSISSLA.*EAHCKNKSLLHAVKKKKKK 
KKKKKKKKKKKKKKKKEKKRGEKKKKKK 
KEGRSSLKKEK 


3235 


17136 


A 


3257 


353 


67 


CYPLSPLKFFFSPRSLKFWKGVGPIISP 
PKKKVPSQMSQEAGFPSPNVLKRRPGPN 
FKTTP * KEKKNPFPPPVKFGPPKESLKR 


3236 


17137 


A 


3258 


2 


129 


FHR I S QDGLDLLS S * SARLGLPKCWDYR 
CEPPRPAKNKILLS 




1*71 IO 
1 /l JO 


A 




205 


/4 1 < t 

* 


ITAHMYPRLRQSHTI YVLRVHHP * VPSA 
IEGPVSV*ALLHSSTIWAGNLLLIRFH 
PLAEKS PLIQTVTLCLGANTTLVAGGWA 
LTQNDS KRXAAFSTSRGLGLI I VTIGVN 
QPHLALLHICTHAFVKAI LFMCSGS I XH 
NLSKEQDIRKIGGLLITIP 


3238 


17139 


A 


3260 


3 


393 


SFNLSTLITTQEHL*LIiIiPS*PIiAII*F 
ISTLAETNRTPFDLAEGEAELVSGFNIE 
YAAGPFALFLIAEYTNIIIINTLTTTIL 
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SEQID 
NO: of 
nucieoiiae 
sequence 

"i 


SEQID 
NO: of 
pepuae 
sequence 


eth 

00 

• 


SEQID 
NO: in 

09/515,1 

ZD 


Predicted 
beginning 
nucieoiiae 
location 
coit esponu 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
locfltton 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
Jtf— oiutamic acio, Phenylalanine, 
G=Glycine, H=Histidine, Hsoleucine, 
K^Lysine, Lr-I^eucine, IVi^Metnionine, 
N=Asparagine, P*=Proline, 
Q=Glutamine, R=Arginioe, S=Seiine, 
T=Threonine, V=Valine, 
W=Tryptophao,Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LGTTYDALSPIjlLYTTTFVTKTL^ 

Ii* irtayprfrydqlihl 


3239 


17140 


A . 


3261 

• 


2 

* 


400 


ISDLSEK* FKRLWKLIMEAPEl^lkAOd 
KEIQKMTQEVKGE I FKE* IA* RKKKSKF 
QETLDTL I EMQS ALE S FSNR I KQ VBKRN 
SELKDKIFELTQSNKDKGKRIRKYEQSL 
* VAWDYIQ*PNLGI IQIPEBE 


3240 


17141 


A 


3262 


450 


129 


NNLAFN*V*EFCVliAIKLLKVKRYPIEW 
EKMFANHISDKGLVSGIYKELFRLSNKQ 
AI DLTF * KWAAGHGGS PL 


3241 


M A* 4 A A 

17142 

* 

* 


A 


3263 


238 


3 


KEKKIGLKKCLQGSHF3 IHTAWS 1 1 1 YM 
FSPLTI ISRKRMOQPGIVAHTCNPST*0 
G * GRW ITRSGVRDQTGQHGKTP 


3242 


17143 


A 


3264 


350 


3 


S PTLLGS KD PNLLGFRFPLWKKGKI IRA 
PLSLGLN*RFFSEWLIP*KPPKNWPGG 
TFLWCFLKRGFPFLSQKKKKKNPG^VA 
HACNPSTLGGRGGRSQGQELETSLANTV 
KTR 


3243 


« Am A A m 

17144 


A 


3265 


AT A 

69 


200 


RLECSGVISAHCIILNLPGSGDSPASAS * 
LAG I TVMVKLPVI AK 


3244 


17145 


A 


3266 


223 


408 


GG FP FFP PGGGEGGWFGELEPLP PGLRK 
FFCLGPPRRGD*GPRSSSPGSFWVFKKN 
GVSP 


3245 


17146 


A 


3267 


185 


420 


DQGLWGFI I YFYRQSLA*VHWNNPSSL* 
PRTPGLKHT PVPSLL I S WDYGRTPPHLT 
NFCIFFDRGSFFF*DRVSATHA 


3246 


17147 


A 


3268 


3 


392 


TGCHS I PQAGVQWHNHGLLQPQPPGLR * 
SSCLSLPSSWHY 




17148 


A 


3269 


1 


398 


KFSCISSKHQKLKLTPKPPKPPPKKSPL 
VLPIGKKIRETFWGAFKSPPPNQPKGAQ 
TLPLKI WDKMGGGGGLALW* KAPPGNF 
KGPPGGKPMEQP *LGPGPPLKWKGGLPH 
QKGGFS KAPGEKKKGGEGRL 


3248 


17149 


A 


3270 


422 


183 


ETEWSLFKVI ITEKSPNLEKDTNIQVQ 
ESYRTPSRFILMKTTSRHLII ILPKVNN 
TERIL* MQQDRGNNIQWS SGCS A 


3249 


17150 


A 


3271 


3 


35 


KNNSL 1 1 PT I IAT I TLLNLYFYLS PLLY 
**SSSPPS 


3250 


17151 


A 


3272 


155 [ 


1 


KDFFFFFIiRQSFTLVAOAAAEWCDLCSL 
QPKPPGFK* FSYNSLSSSWDYIG 


3251 


17152 


A 


3273 


413 


71 


PPSTGLFLTEEYB VS FPPFPL * KFFFPP 
SGLFFGGGVPPFFPPPKKGFFPKYPRLV 
FKGPLLGGGGLPPPPP* ILPPLGSFLPA 
PPLFFFFLFFL 


3252 


17153 


A 


3274 


336 


45 


DRVLS CS PAWRAMARSHDFG* LQLP PPR 
VKVFSOiSNPSSWDPRHVPPRKGNFVFL 
VKI\^PWNIX2GQGCRERRIjCPC I PAWGT . 
DKDSVSKKKKKS 




I / I j4 


n 


3275 




ioo 


c^ttt K"m rr . irvra tv t .t .r» w d"ntdmt pep t 
o v r r r viirur r XtaMJjjj^r criNr'nli£*vJo(ai 

ILVHCNLHFPGSKDSSASPS*VAGT 


3254 


17155 


A 


3276 


334 


75 


EWTRRVERRRRTH I I YYNFF* EFMNRFF 
FFI FFDGRFSCVFFFFFLFYFFFFFFFF 
FFFFFFFFFFFFFWFSARSFI YFLFPRH 
VT 


3255 


17156 


A 


3277 


144 


17 


KAPPLFFFFFFFFFFFFFFFFW* FRG*T 
HWNGDACMCNLTKS 


3254 


17157 


A 


3278 


2 


109 


YHIVKFSP*PLTGALSALLMTSGLAM*F 
HFHSITL 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

i 

oo 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- - 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue ot 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Asparttc Acid, 
iL=vpiuiamic Acid, r— Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I>Leucine, M-Metfafonine, 
N=Asparagine, P^Proline, 
Q=Giutamine, R^Argiriine, S=Serine, 
i— inreonine, v— valine, 
WaTryptophan, y=Tyrosine, 
A— Unknown, =010 p codon, impossible 
nucieotiue ueieiion, v^possioie 
nucleotide insertion 


3257 


17158 


A 


3279 


404 

i 


65 


NLVSSPKRS VFFISPSQKI FFFFFNWKB 
SK1SLPKEGGGPLLLLIRQRTPIFFFFF 

17V U W L* W li* U" W l* 1 L' I* 1 L 1 U' L' L' U L 1 W L' t ' U U 0 1? 

FFFFFFFFFFFFL *TACXF* TFKFTKTK 
R 


3258 


17159 


A 


3280 


226 


383 


QI*KNSMDGCCLaPFWFETGSHSHPGW 


3259 


17160 


A 


3281 


91 


26 


LFFFFFFFFFFFFFFFWSERSS 


3260 

• * 


17161 


A. 


3282 

t 


3 


321 


HTVIYYFGKNEPFGYIGVA*AMISVGSl, 
GFMA*AHHIFTGGIDVYSPAYIiTCATII 
VAI PTGVKVFI * LATLIK3SGMKRSAAVL 
* TLGFI FLFTARGLTGI VLSNS 


3261 


17162 


A 


3283 


393 


112 


PPLRVFFPPFPLKNFFFSSRRFFFWGGV 
APFFPPPKKGFFSKIPPGFFFPPPLKKK 
I FFSS PP * FWAP PGFFLKGPP PFFFFFF 
FFFFFFFFF 


3262 

- 


17163 


A 


3284 


360 


47 

■ 


P PLHRWVPPPPP * KFFFS PQTKFFGKVF 
SQNPPPPKKSFFPKNPPGVFYSPPKKKK 
KFFLPPPI FFPPPRFFFTTPPPFFFFFF 
FFFFFFFFFFLPLFFWSVSF 


3263 


17164 


A 


3285 


386 


157 


FFSPNPLI FWGGGGP.KFPPPKKRFFSKN 
PPGVFFSPPI*KKKKFFFPPP*NLAPPRD 
FLKGP PPFFFFFFFFFFFFF 


3264 


17165 


A 


3286 


3 


385 


DAWVPCKLYLGVFFCRFVKISAIIiILKI 
L I FNI ENKS I * YLLKKKKKKKKKKKKKK 
KKKKKKKKKECKKKKKKKKKKPTPKKKKK 
KDSR 


3265 


17166 


A 


3288 


81 


270 


1 1 KLEKKKKKKKKKKKKKKKKKS * KGNE 
KTRGPKKKVHRDVTKKI LCYKI CPFNKG 
ISLIGT 


3266 


17167 


A 


3289 


3 


145 


LGRLRQEimiiNLGG*GCREPKSHHRILA 
WATEQDSISKKKRKIGGPV 


3267 


17168 


A 


3290 


404 


47 


GPGGKIFFKKNPBEKIFSTQKKKGFFPP 
FPPKNFFFPPGGFFFWGGGGPI FPPPKK 
G F FPK I PRG VFFTP P KKKKI YFFP PRE I 
WAPPGI F* KGPPLFFFFFFFFFFFFFLV 
LNDILLA 


3268 


17169 


A 


3291 


106 


362 


KKKKKKKKKIOaaaCKKKKKKKI^^ 
KKKKKGKGGGKKJCKPRGtAQI IRGKKK 1 1 
FFFKKVFK* KGGGDL*KKKSFCGENNLE 
NTHKE 


3269 


17170 


A 


3292 


3 


226 


KPLVNLNIjGLLFIIiATSSIiAVYS il*sg 
KKKKKKKKKKKKKGGGVL 


3270 


17171 


A 


3293 


337 


2 


F FNRELLKS KNKE TE KNHD YHKE R I WHF ' 
DRPYKFYMLFFCFGVLIFFLYLPFFIYF 
FF FErFFFFFFFFFFFFFFFLN S LNI * F 
FS I NTHKCGNRGQGGGAIGPRLSNGRV 


ill I 


1*71 

1/17Z 








*y i 
zi 


KIPPPPLISPPPKENHQCPPPPHIFAPP 
SHPFFGPPHFFFFFFFLFFFLWRTGSRY 
IAQAGLELLGSSYPPAS 


3272 


17173 


A 


3295 


367 


101 


FFFSFF*KKKNFFPPPPLWPPPPNFL*N 
P PP PFFFFF FFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFLVNSLCRRYRG* SLLY 
YAWL 


3273 


17174 


A 


3296 


178 


2 


KGPPFFFFFFFFFFPDSITLFIITKNEK 
KTRCTS IGE * I TWYTHTGE YYSVMKRNE 
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peptide 
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Amino acid sequence (A-AJanine 
C=Cysteine, D=Aspartlc Acid, 
Hr^HjriuiiiiDic Aciu t j , ~jrnenyiai80ine y 

f^=l*lvrinp H=Hicf iriine Yc T en lot i /•! n a 

K=Lysine, L=Leucine, M=Methionine, 
N«Asparagine, PHProline, 

Z'"*' Info m 5 r» «» PsAroinino faj6a»Jiia 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucieouoe insertion 














LS 


3274 


17175 

i 


A 


3297 


147 


1 


RGPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFF FFFFFFFFTW * FALT FLV*NF* AI 
ITF 


3275 


1 7 1 76 


A 


3298 


3 


347 


PFGEGETNGRr ULU1 KAAL.VS KKKKJUCK 
KKKKKKKKKKKKKKKKKKKRGOALKKKN 
RGGQKKTGBKKKKFFFKKGBKKKTPGNF 
* KKKKFLGGDNliAQTPPKKKNLWGKKKN 
FWG 


3276 


17177 

i 


A 


3299 


405 


2 


WRPKPENSPSPGGPNPPGQKN*TPFPPK 
TPKLGRGCGGAPQFTLPVRARGENPPNP. 
RFLGFN* PKFPFPPFAL*GGGNNETFFQ 
KKKKGPRFWAGVLPTGGAEGBSIPHLSP 
ASGSSRHPWCFMLCRPSRLCLL 


3277 


17178 

• 


A 

• 

■ 


3300 


125 


376 

• 

• 


DQPVQNSETSSLQKKKKKKKLFPGGGGP 
PI*FPHFLGGWGGGGTRPPG*RKRGKSGV 
PQPPPPGGRVPPPPHPPKKWGNKGGPPP 

PGKSFFFFFFFWRDBVSLFCTGWSQTQT 
PGLKQSIHLSLPRWLDYGREPPHIiAIHF 
FPTRPRTRG 


3278 


17179 


A 


3301 


348 


1 


PQRLKI LGGGGPPNSPPPKKGFFPKTPR 
GF*NPPPKGKKKNFPPPRKIGPPQGFF* 
RAPPLFFFFFFGPWFFFFFFFFLFLENG 
VSLYCPGRSRTPGLKLSSRLLASLRAGI 
TGVS . 


3279 


17180 


A 


3302 


2 


354 


GRVGGRVGS PTTLTT*NPHSHEKTPSCS 

KKKKKKKRKRKKKKKKT*GGGAYIKKKG 
GGSPRTEEENQYCFLLGGKKNILRGSCL 
KKTPH 


T 












3280 


17181 


A 


3303 


362 


26 ■ 


SSQAAHPTKKKKKKKKKKKKKKKKGGAL 
KKNFGGGKNKGGEKKKIFFFKGGGKKNP 
GGNFKKKNF FGGG KXGBNP PKKK *RFRK 
KKKFLRGKGGKKS 


3281 


17182 


A 


3304 


371 


3 


NQKKPPPPVFFFFFFKKKKNPPPATPPP 
GRP* FFPPQKKFFPEGGPPPFFKKGAFF 
PPPKKNfc'riiFrr PCSfcrVrr r *i\KKNJ.KVF 
PP PKKKKKKKKKKKKKKKKKPKKQKKGR 
PARDLEPHAS 


3282 


17183 


A 


3305 


2 


196 


DR* LFSTNHKDIGTliYLIiFGA* AGVLGT 
ALSLLIRAELGQPGNLLGNDHIYNVIVT 

AniUf V lie 


3283 


17184 


A 


3306 


4 


339 


LFSTNHKDIGTLYLLFGA*AGVLSTALS 

FVI I FFIVRP 1 1 IGGPGN* LNPLI IGAP 
DRAFPR I NNI S F * LLPRS I LL VRAS AIA 


3284 


17185 


A 


3307 


2 


367 


KPPAITQYQTPLFV*SGLITGGLLLLFL , 
PDLAAG I TI LLTDRNLNTTLFD PAGGGD 
PTIiYOHLP*CFGHPEC3YIIiT LPGFGI IS 

HIGTYYSGKKEPLGYIGMV*AMRSIGFL 
GFIB*AHHIL 


3285 


17186 


A 


3308 


243 


370 


KGHRPGMVTPVCWPSTLGGQGGWIRRSG 
I*DQPGQQGVTPSIi 


3286 


17187 


A 


3309 


2 


363 


L VPLI I GAPDMVFPRINNI GF * LLPPCL 
LLLIAYAI MEAGAGTG+TAYPPLAGNYS 
HPGAS VDLTI FSLHLAGVSS I LGAINFM 
TTI INI * P PAI TQYQTPLFV* SVLITAG 
LLLLCLPG 
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nucle- 
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peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
b=Olutamic Acid, r— Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L»=Leucine, M»Metbionine, 
N=Asparagine, P=ProIine, t 
Q=GIutamine, R=Arginine, S=Serine, 
T= 1 nreonine, V=Valwe, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3287 


17188 


A 


3310 


* 

1 


364 


LYITGPTIjALTIAIiIjL*TPLPIPNPIjVN 
LNLGLLFILATSSLAGYSIL* SGGASNS 
N xAliIGALiKAVPy 1 XSYls V 1T*AI ILLST 
LLIRGSFNLSTLITTQKHL*LLLPS*PL 
AII*FISTL 


/V A jrS A 

3288 


17189 


A 


3311 


215 


2 


GMAKL KARFL P KKGGKRS LS PLLWGSGF 
PTSGTGBTFFF FF * DRVLLCQ PGWSAVA 
QS * LTTAST FWAQAI 


328? 


17190 


A 


3312 


153 


3 


MNKCICMCVCMCVCFRSHKHTVCAWVCV 
Y *V* TYTVCVCVCVCVCVCVC 


3290 


17191 


A 


3313 


149 


1 


KTPPGFFPGFKGPF*FKOPPLFFFFFFF 
ETEDGVLLCRPGWSAVSRDRA 


3291 


17192 

* 


A 


3314 


282 


69 


KIGKTAPFFSWPRRVPFLKKKKKKK*GL 
TLSSRLEYGGMISIiELLGSSDTLASASR 
VARI IGQCPHAWLT 


3292 

• 


17193 


A 


3315 


479 


60 


MMQHS I KHAL VE I TGWVR WLRP VI PAL W 
EAEVGEFNSISKTNNRKRSNIVKMSILF 
NLIYRINAPFFKFLIALLNKNSKSKSIW 
NLKS* *STQ*SSRNNVGGITISDF*THQ 
QATELRQEFLHTGLVHRSTVSRFRFHA 


3293 

• 


17194 


A 

• 


3316 


129 , 


3 


S FHQKVWPDAVAHAGSPSTLGG * GGRI T 
GSGDQDHPGQHGE 


3294 


17195 


A 


3317 


3 


156 


EFHRVSQDGLDRLTS *STLLGLPKCWDY 
RCEPPRPASFPSFLTVRISPFL 


3295 


17196 


A 


3318 


36 


340 


TGLVIAEHIjLFFFFKKGFWFCPPGGRAG 
PC3GNLMEFPVSGGKGIFCPGPPKSGE * R 
APPPSRGKFWFFKKKGGLPFWPGGV*TS 
DPRGTPPPLGPKGGEYR 


3296 


17197 


A 


3319 


326 


2 


KAFFLNFKPHPG FA* GLKFKPNSLLP I * 
SFIRKT*AFLICL*IQLNRADGKIPIiKP 
GVIPFKKCFPNFTPPtiFFKKNCSSTPNL 
FFFFPFFFFGDGISLCRPGWSAVA 


3297 


17198 


A 


3320 


Hi 


312 


OAGVQWCDLSSPXPPPSGLT*FY*CSLP 
RTWEYRCPPSRPCNFSIF 


3298 


17199 


A 


3321 


363 


51 


RWGPRLPVAWLPSVSRRAEECS PGRGAH 
GQGGGGGGQAR * REGPDEWTGGAP VGMG 
S PSLNRGAGRGPPQPGPSS KSRRGRVRC 
TPHSCLIGCGLSFFICKMG ! 


3299 


17200 

» 


A 


3322 


344 


187 


LRQETCFNPGGKNFG*QKLPPCPPAWAT 
RGWVSKKKKKRKICRKEKIRICVLT 


3300 


17201 


A 


3323 


3 


313 


TRRERERERERERERERERBRERERERE 
REREFFFFFF*KKETI*S PPPGGKKRGGGG 
FHSPPFWGEKDFLPPRPPKGGERGGRPP 
PPVFFFFFKKEGAPPGGGV 


3301 


17202 


A 


3324 

- 


365 

- 


1 

■ 


SLGWWRVTPPPFPIiGPRGGSPPRPDKRG 
S PGRVKFGG VKPFPKVKP * PP PKRLDP S 
GA* FFFSPKKICQGPPPRKKKKKEAELS 
ILCTKFPLQEEEVMQVPPPPCTCSSEPG 
XNC-ICKiulC 


3302 


17203 


A 


3325 

• 


1 


269 


ARALSLSLSLSLSLSLSLEFFFFFLAPT 
QILGGGLFYPPKGGAHLSNRLGGGRVFF 
GPPGKKTAP liGGGRVI KKKTPPGTPL* L 
SPPPGGPFFFQGGQKKLSPPPAGLTOGP 
P P WGGKKAP PPKFGSGPKKKKKTLERER 
ERERERERERELV 


3303 


17204 


A 


3326 


1 


316 


ARGERERERERERERERERERERERFFV 
S P PGP DTLQDRGGVS LFASDPTHE I YLP 
CPLRGPLSERDPEELHGVCFFSEPSHRW 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A°Alanlne 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycme, H=Histidine, I-Isofeuane, 
K=Lysine, L^Leucine, M=Meth*onioe, 
N=Asparagine, P=Pro!ine, 
Q=Glutaraine, R-Arginine, S^Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stopcodon, /^possible 
nucleotide deletion, V=pojsible 
nucleotide insertion 














FPEEGVGDICQC*HRGECGSP 


3304 


17205 


A 


3327 


2 


369 


ARGDVTREST YQGHHTPP V<5^iifeVG 1 1 
LPITSEVFPFAGIF*AFYHSSLSPTPQL 
RGHWPPTG I TPLNTLEVPLLNTS VLLAS 
GV* IT* AHHSLIENNRNQI IRALLITI I 
LGLYFTLLQA 


3305 


' 17206 


A 


3328 


1 


259 


GTRKKHSTI LI I RE IDIKFMPRFHISP I 
RLVKI*ALANIRC*QQOGKVGNLLHC*F 
SKLVQSTGRAIWHALVLSLARSRALSLS 
LH • 


3306 


17207 


A 


3329 


2 


352 


ARGTLLLLCLLFFALTIYPRWCDVTRKS 
TYQGHHTPFVQKEPIjVLGGKLFITS*VI 
FFAGFF+AFYHSSLTPTPLLGGHWPPTS 
ISLFNPLKKPLLNTSGLLASGVS it*ah 
HSLLQ 


3307 


17208 


A 


3330 


3 


361 


HEEPLGY IRMA* AM I S IGFLGL IVRAHH 
IFTGGIDVDSRTYLTSATIIIAMPTGGK 
VFR*IATLHGSNMK*SGAGL*ALGFMFL 
FTVGGLTGI VLTNS * LDIGLHDTYYWA 
HFHYVLS 


3308 


17209 


A 


3331 

• 


1 


340 


GTSGDTRAYFTSAT 1 1 1 AI PTGVQVFS * 
LATLHGSNMK* SAAEL* ALRFI FLFTVS 
GLTGIVLANSSLDIELHDTYYVVAHFHY 
VLS IGAEFAI IRGS I H* FPLL * GYTLDQ 
T 


3309 


17210 


A 


3332 


2 


352 


ARGDVDTRAYFTSGTI I IAI PTGVKGFS 
* LATLHGSNMK* S AAVL* ALG F I FLFTV 
RGLTGIALANSSLDIVLHDTYYGGAHFH 
YVLS IGAVFAI IGGFIH* FPLFSGYTLD 
QTYAK 


3310 


17211 


A 


3333 


85 


370 


QV3HRVRPCFERBRERDRYRHTQRECER 
AKEIYSGFFLSASEMESCSLAQAGGCKE 
LRMCHCTPAWVTQGDCVS *NKK* HWEKK 
GLWFLGVNIST 


3311 


17212 


A 


3334 


362 


3 


QEVKVSMNRDHTHS LQRGEQKRNS VSKK 
KKKSSQEKLKRVHTQTHT* IFI AVLFIV 
AKNGKQFKFPSTCEWINNM*YIHTMEYY 
SATKNQL IHVTT * MNLNNI YTKFLKARQ 
KMITYSC 


3312 


17213 


A . 

* 


3335 


3 


351 


HEGTTLHHLFLLETGSNNPLG ITFHSDK 
KTFHPYYT I KRALGLLLFLLSLMTLTLL 
* PDLLGDPDNYTLAHPLNTPAH I KPE * Y 
FLFAYT I LRSGPNKLGGGPATiTiTiS ILIL 
AIIP 


3313 


17214 


A 


3336 


17 


350 

• 


F I FTLTE TNRTPFDLAEGESE LFSGLKI 
KYAAGPLALLFITEYTNI I I IDTLSTTI 
VLGTTYDALS PEL YAT YFVTKTLLVTSL . 
FL* IRTAYPRLRYEQLIHLL*KNFLPL 


3314 


17215 


A 


3337 


2 


349 


ARDSL I S P I LRLUPRPNXjRQNAFHYH IH 
RGKSNFLPTTLSRPIRNAPTLLGLPRCI 
HHMKHPIICRLIH*FPLFSGYTLDQTYA 
KMHFTIIFIGVNLTFFPQHFLGLSGMPR 
RYSDYPDAYTT*NILSYVGSFISLTAAI 
L 1 1 FMM* EAFASKRKVLI VEEPS INLE* 
LYGCPPPYHTFE 


3315 


17216 


A 


3338 


2 


386 


ARAI LNAMAFLILTERKI LGY I QLHRGP 
NWGP YGLLQ PFADAI KLFTKE PLKPAT 
STITLYISGPTLALNIGLLL*TPLPIPN 
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ding to 
last amino 
acid 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysme, L=Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
Q=Glutamine, R-Argimne, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophao, V^Tyrosine, 
X=Un known, *=Stop codon, /=possibte 
nucleotide deletion, ^possible 
nucleotide insertion 














PLRNLNLGLLFILATSSLADYSIL*SG* 
ASNSNYALIGALRAW 


3316 


17217 


A 


3339 


175 


16 


ILGDLFPPAWLHPFLFLLLLFSRPSLAV 
TBAAVQWRNLGLLQPLPPGY*SSWC 


33.17 

■ 


17218 

- 


A 


3340 


1 


369 

■ 


GTSNLSLLFIVATCGLAAN*ML* *GGKI 
N* I YTLRVALAAI IHTILYB ITLSI ILL 
*TLLISGSFNLSTLITTQEHL*LLLPS * 
PLAI I * F I STLAETNRTPFDLAEGESE L 
F*GLNIEYSAR 


3318 


17219 

• 


A 


3341 

i 


1 


363 


GTRGGT I LPAMVLMLIDVPS LR ILYITN 
KVNEPSLTIKSNGHQWY*TYQYTDYGGL 
ILNSYILPPLFLQPGDLRILDVDNRWL 
PIETPIRirNT*QDGLHS*AGPTLGLKT 
DAIAGRVNQ 


3319 


17220 


A 


3342 


181 


315 


PKSPPTPPPPTXXXXXPGGNPPGPTL*G 
PFYWGGERGPPF*TGGP 


3320 


17221 


A 


3343 


3 


183 


HEVSQSCRELLTSGDPPASASQSAGISG 
MHHRAWP VS * TFH I FNI YCG PGTVLKP Y 
LNL 


3321 


17222 


A 


3344 


83 


354 


VLGARCPCAGVSRVSMEPWFLWQQACLL 
P WWRCCLSTLYANRADRDVP * TSRTGPV 
MVAHACS PSTIX3GQGGWVTRSGV* DQPG 
QDGETP 


3322 


17223 


A 


3345 


124 


379 


GQGICFMWSLGLSHRLAHCRYTVGVNN 
LLCSSSLPFLLPLLPLPPPEGPGWDDIN 
1 1 IF 1 1 FRDKVLLCCLGWSAVT * SQLTT 
A 


3323 

■ • 


17224 


A 


3346 


3 


368 


YEP I P * PLTGALS ALLMTCGtAM * XiUM~ 






S ITLLILGLLSNTLTIYQ* WRDVTREST j 
YHGHHTPPVQKGLRNG I ILL ITS EAFFF 
AGVF*AIYHSSLAPTPQLGGHWPPTGMT 
PLNPVQGPLL 


3324 


17225 


A 


3347 


1 


35? 


GTSAGDVNYG* 1 1 RYLHADGAKI FLRCR 
FLHSGRGLYYGSYVYSKT*NIAINLLIA 
TIATAFIGYVLA*GQILF*GATVSTNLL 
SDIPYIGTDLSQRI * GG YSVDRP TLTR F 
FTFHF 


3325 


17226 


A 


3348 


2 


360 


ARAEISPLHSNLGNESETPSQKKKKKKP 
PGGGGKNWNPYPSGTPPPNPPKGGE * RE 
EPPPLTKRVF*KKKVFFEGPRVKKNLC3G 
KGKKKRGFPAR* KNPVFKAKGKGRNFGP 
QAFFFKK 


3326 


17227 


A 


3349 


226 


366 


PQSFCSVASWPAGDLME I * AWQKFE YKP 
GKVPMDPEGCLLPIiAKKKEJilKHKKKEEE 
EEEEEEEEEEEEEEEEEEEEEGEEEELV 


3327 


17228 


A 


3350 

• 


2 

■ 


374 


ARGGGYSPHRATLTRTVTFHSILPFIIA 
GLAAIHLLFLHETGAYTPL* ITSHSEKT 
TFHPYYTMKDALGVLLFPLSLTTLTLCS 
PNLLGE PHD YTLTNPIjNTOPHI KPE *DC 
LFAYTNMRSGPN 


3328 


17229 


A 


3351 


1 


266 


LGLVEHFLTLTT KLTS SNI SDAQE I QSV 
♦NILTAI ILNLMVLPSLRMLYITNBVFD 
PS LTI KS IGHQWY * AYE YTDFGGL I FNC 
YILP 


3329 


17230 


A 


3352 


207 


362 


ILFLLKQKIYVLFGFSN*GSSIPELAHS 
DAYQTKE I CS SGKVYKYLQCWBKW 


3330 


17231 


A 


3353 


178 


365 


KHGIDKGLRYRIYTVQQEDNSVKR*LKD 
FNQHLS KDNIRSADKHMKRCPTSLVINI 
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sequence 


Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 
E=Glutamtc Acid, ^Phenylalanine, 
G-Glycme, H-nistidine, I^lsoleucine, 
K=Lysine, D=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q^GIutaniine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopnan> Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














MPIKPP 


3331 


17232 


A 


3354 


359 


3 


LQVRSLMPIjGQFHPP *NFFCSFFPKPKI 
PFFFLFFLKGVFVFFPGWRARGQFWVPG 
PPPPKFKKPFFPTLLRNWGPRDPPPSRG 
KFFFFFFFWYF* *RLRFTMLTRLVFNS* 
PQVISC 


3332 


• 17233 


A 


3355 


2 


354 


AREPSP*PLTGALSALLMTCGLSM*IHL 
HSITLLILGLLTNTLTIYO*WRDVTPBS 
TFQGHHTPPVQKGLGYGI ILFITSEVPF 
FAGF L * AFYHS S liAPT PHLGGD WP PTGV 
TPLNAL 


3333 


17234 

• 


A 

■ 


3356 


1 


364 

* 


GTREM I CCS ALS PR1HLS FHRRWPDWEC 
IS KL I TRHRTTRHVLRWS SLP LCP INRS 
WICHHRRLHSLISPILRLHPKPNLRQNP 
FHYHIHGRK*SAAVL*ALGFIFLFTVGG 
LTGIVLANSSLDIGLHDTYYVGAHFHYV 
LS IGAGFAI IGGFIH* FPLFSGYTLNQT 
YAKIHFTI I FMGVNLTFFPQHFLGLSGM 
PRRYSDYPDAYTT 


3334 


17235 

■ 


A 


3357 


2 


376 


ARGGQPEDYILILPSFGIIFHIVTYYSG 
KKEPFGYICMV*AMISIGFLGFIA*AHP 
IFS VG I DVNTRAY FTSAT I X IAIPTGVK 
VFS W1ATLHGSNMK* SAAAL * ALRF I FL 
FTVSGLTGIVLCN 


3335 


17236 

■ 


A 

• 


3358 


357 


139 


AQPWFFFLETVFCHAAQAGLKSIGSSDL 
PTSASQSVGTTGMSHHAWPERSC*QNTF 
TYYVQS IYSDD IRKSMF 


3336 


17237 


A 


3359 


277 


1 


SCQKDI S IGFIiF VPGFRVKGVAQBHLVG 
VRMYVLVPTVGGDGKERBR *RGIRHTPP 
PSERVRQRERERGRDRDRBKERASQRKP 
EMDRETSC 


3337 


17238 


A 


3360 


261 


2 


TI CS KCAWLC FS KTLLTNTCAS RSDLAH 
GP* CPNLCSGI FSSAWTDISRIiIiFFCL 
FVLRQSLTVAQAGVQWCNLCSPQPLPPG 

sc . 


3338 


17239 


A 


3361 


199 


354 


HLPYWPMFMATFNFFFFFETEFHFFPQA 
GGQWGDFG+LKPPPARLKQFSCLN 


3339 


17240 


A 


3362 


3 


364 


HEDRDTPAYFTCANI I IAI PTGVKVFN* 
LATLHGSNMK* SAAW*ALRYI FLFTVS 
GLTGIVLSYSSLDIVLHDTDYVV7VHFHY 
VLS I RAVFAI IGGFIH* FPLFSGCTLNQ 
TYAEIHFT 


3340 


17241 


A 

• 


3363 


3 


360 


HEETYIPKDQKYSFLHDSQTSFCFSNSI 
PTPSNMEETQQKS VSGCRLP * AGMGBTC 
GQGSRAAQPLPSLPLQNLKLLRISLLLI 
EAWLEPERVLKSMFANNLGYDTSDSDDY 
HLLKDLS 


3341 


17242 


A 


3364 


352 


27 


VYLS S QRNSAAS VNRMTWTDRS LSP YT 

r/?DfCTDDT^DT.17tTCVCQr T7WM' VWCD^CfNG 

iA3 Wrixt f DrCLtr ri r VJfo uJf viir V CiiOoo^o 
IAQAGVQWCDHSSL* P*TPGLK* *TCSC 
LP*YAGTTGTHHYTGPLILLNLFN 


3342 


17243 


A 


3365 


347 


99 


GGPPLPRGFSPPPGARPLAPPWGPPQRS 
PPPPPPKTAPQKKKKAPS I PAGQWLVBQ 
NPRPQP FKKP PGVGF* NP P PPLNMGF 


3343 


17244 


A 


3366 


3 


319 


HEECI±AHTLCFFNLMS*RTTVTL«HRMFS 
IVTAI * YIiLSSQYVIGYYIYVCIHTHTH 
MYIHTHRHRTFICVCVCVCIYIYTSIC 


3344 


17245 


A 


3367 


3 


474 


YELLIiLNGPILNARAFLILTERKIDGYI 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

etb 

od 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino add 
residue oi 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
fast amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Atanine 
OCysteine, D=Aspartic Acid, 
n^Hjiuiflmic Acid, t— Phenylalanine, 
OGIycine, H=Hlstidine, I=Isoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R«Arginine, S=Serine, 
i— i nreonine, v^vanne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nudeotide deletion, \=possib!e 
nucleotide insertion 


• 




• 




- 


• 


Q Ldi KG PNA V £> F I GJj JLjQ ADA! KLFTKE 
PLKPATSTITLYITAPTLALTI DLLL*T 
RL P I PNPLDNLNLGLL F I LATSSLP VYS 
IL* SG* A*NSNYALIGALRAVAQTI * * E 
V if L IXLLann KWLKN 


3345 


17246 


A 


3368 


2 


348 


ARGLQDATFP 1 1 EELITFHDHALI I IVL 
ICFLGLYALFLTLTTKLTNTNI SHAQE I 
ETV* TI LPAI I LVLI ALPSLRMLYITDE 

VNDPSLTIKS ighqwy*tyqytdyggli 
ENS 


3346 


17247 


A 


3369 

i 


64 


363 


KKWGFVPQQKKGGGRI S VNGTP I FKGKG 
NFPP*PPGEGGTKGPGPKPK* IWIFKKR 
GGLIFWPKQFPTPGPKGTHPPGPPKGGK 
*RGGP?SPGGFNPKRG 


3347 


17248 


A 


3370 


34 


355 


AKGKEY I ET VAE KKKKKKKAP PPI * KAP 
*NPGKKPPLALKFPPPSQKAPPPALFCP 
* KGP PPRVF PKKKKKF PPFGRKGF FFTG 
GKPP 


3348 


17249 


A 


3371 


I 


374 


FLHS LHLSLRLLYLP * KFSMF I ILLKAY 
YTLCVCVCVCVCVCVCV* PLNKIHINHF 
TCSTVRGRSNPIFFPPIiLN*RPFYTT 


3349 


17250 

. 


A 


3372 

- 


429 

■ 


3 

• 


FFFFIiKKQKISRVFPRGPEFFGQRKPFW 
AKGPPRAPQKGGEKGKNPS PPPPL I FFK 
KGEKPQGGPL FKKGLGFFS P P KVFFKKG 
VFKSGPGGP*NFGFRGFSPPSPPKKGGK 
KRKPQS PGFFFFFFFFE I LG CSGRSRSR 
T 


3350 


17251 


A 


3373 


2 


404 


LGTDLSLI IRADLGQTGDLLGNDHIYNA 
rvTAHAFEIIFFIEIPI I IRGFGN*LIP 
LIIGAPDMAFPRINNISF*LLPPSILLIi 
LASAIVEAGAGTSGTDYPPLSGNYSHPG 
ACVDLTILSLHLAGVSSILGAI 


3351 


17252 


A 


3374 


2 


417 

* 

* 


AFV * IAHI TRD VYYG * 1 1 R YLHAKGAR I 
FFMCLFLHIGRGLYYGTrLYSOT*NIG I 
ILLLAT I ATAFI G YVLL *GQ I SF*GATE 

TTKTT.T O ft T DVTffTlT Tn*T */TVVC\rr>0 TJ 

J. lJNJjJjoA.L.r I IvjI JJJjJ.y w J. w ViVjIo VUblr 

TLTRFFTFHFILPFIIAGPNPLHLTT 


3352 


17253 


A 


3375 


85 


1 


FW PGAVAYACYPSTLGG *GGW IMRSGVR 


3353 

i 


17254 


A 


3376 


396 


1 


GGV FS PL PPKKFFFFPKGLI FLGGGCP I 
FPPPKKRVFS KNPPGVF INP P * KKKKI F 

FFFFFQTLFLGGCGYNTKLR 


3354 


17255 


A 


3377 


3 


118 


LLP PSLLLT tT iAYAI LVAGAGTG » TDYPP 
LTGNY*KTKA 


3355 


17256 


A 


3378 


301 


82 


ERKGPPQRGGKKGPKAQKPEKKKPKKRP 
PPPQN* Fr SNPPKJUCKGGKDPPSKKRGG 
GKGVPPPRKKKKKKKKli 






'"X 




*> 


ISO 


GLSLL IRAELGQPGNLLCNDHI YNVIVT 
AHAFVIIF 


3357 


17258 


A 


3380 


21 


411 


VFCTNH * D IG TL YLL BXSA * AG VLRS ALS 
LLI RAELGQPGLLLGNDHI YNVIVTGHA 
FVMNLFI VIP 1 1 VGGSGN* LGPLEIGAP 
DMAYPR INNIG F * LLPPSLLLLLAYAIV 
EAGAGTG*TVYPPLAGNY 


3358 


17259 

• 


A 


3381 


2 


421 


GRVGGRVGSTNHRD IGTLYIiLFGA*TGV 
LGTALS LL I RAELGQPGNLLGNDHMYNV 
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SEQID 
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sequence 
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SEQ ID 
NO: of 

pcpilflc 

sequence 


M , 
eth 

All 
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SEQID 
NO: in 

09/515,1 
26 

■ 


Predicted 
beginning 

■1 UCICUUUC 

location 
correspond 

■110 fircf 

■ng w it rot 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

IT^r^ltitflmip Arid FWPhonvlalanin* 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 

NTs A inflrfltrinfe. PssPrnlin^ 
J" AoJ^ni ugiuiy) t xi UIIIIC, 

Q=Glu famine, R=Arginine, SfSerine, 
Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucicouue inseriiuD 














IVTAHALAKIFFILVPIIIGGFGNGLVP 
LIIGAPDMAFPRISNISL*IiLPPSVLLL 
LESVIAEAEAGTGRTVxTPLAGTYSNPG 


3359 

• 


17260 


A 


3382 


3 


185 


VIYSTIFAGTLITALSSH*FFT*VGLEI 

vrvrr ni7TT3tTT TVITTINTDDCTT? JV HT WTTT.TA 
rlMLtAv JL ±r VAj X AA J. Ci JrK2> 1 JS/iAJL IvXrlilU 

ATAS I ILLIAILFNNILSGQ* TITNTTN 
QYSSLI I IMAIAI KLGIAPFHF*VPEVT 

P 


3360 


17261 


A 


3383 


5 


398 


FFGHPEVYILILPGFGIISHIGAYYSGK 
KEPFGYLGMA*AMISIGFLOFIA*AHHI 
FTVG I D VHTRAY FTS ATI I IAI PTGVKV 
r N w Lux X IxHVioWrl R, w o AAAJu "AwrXrluf 
TVGGLTG I VLANS S LDI VL 


3361 


17262 


A 


3384 


1 


250 


LGNTGETIjSLQKNV* KLARCX3GTGLRSQ 
LjLjRRJjK* EHGJjT FUGRCiCb BPK3 CHSS P 
AWATGQDLS QQQQQKY PLR I F S AI I ICS 


3362 


17263 


A 


3385 


456 


0 


IFFFSQVETGIHHIGQAGLQLLTSGDAP 
CVGLQKSWDYRPEPLHLAWLIHSIiLSNF 
YYC*FGFCKIiSVFII*INVFCNKTCNSSM 
r uFSYo *NPCAQ * KN J£G PLLNKW P CK X T 
DAWADAWVAPVF 


3363 


17264 


A 


3386 


1 


225 


P SRNFLVL VDKL I LKF I * KYRGPRTAKT 
TLKKKKKVVRLTLLIFKSYYKTIVITIG 
WYGFQDRQVD*WNRIESLE 


3364 


17265 

- 


A 


3387 

. 


3 

• 


400 


T T TT /"»T T rTTVTTT DTVTJ*UflT>J>fflOOO'PVrtP 

LiLiX J_iO L L 1 N TLr 1 1 P r WUUA I Kis cj TlQG 
HHTP PVQKGLRYGI ILFITSQVFFFAGF 
r^AFiHSSIiSPTPQijGGnWPPTQITPJjri 
PLEAPLLNTS VLLASGVSIT * AHPGL I E 


3365 


17266 


A 


3388 


24 


407 


IASGRPFFFFFFFPKGKFFISRGGGPFS 
P PKI * KNWGKGKS FFPPPQREPKKRGVP 
KNPSLSFYPFGGKPPLGPPPKLVPPSS* 
PGKKAFFFF*TPPPGFLTPLGGGLSPLK 


3366 


17267 


A 


3389 


148 


413 


LNLNNNNNGKbMLLNTYSCT*LEEKKKK 
KKKKKKIOCKKKKKKKKKKKAPGGGFF 


3367 


17268 


A 


3390 


157 


14 


R VS AC CQ VWW * MP WP ATREAEARESLE 
PGRHRLRS CHCDRDPVSKS 


3368 


17269 


A 


3391 


2 


398 


LLIRAELGQPGNLLGNDHIYNVIVTAHA 
FVINFFIVIPIIIGGFGN*LVPLIIGAP 
DMAFPRINNI SF * LLP PSLLLLVAYATV 
EAGAGTG * TVYPPLAGNYSH 


3369 


17270 


TV 

A 


3392 


2 


395 


AELGHPGYL I GNDHVYNVI VTAHAFVI I 
FFIVIPII IGGFGN* LVPLI IGAPDMAF 
PRINNISW»LLPPSLI.T.T.T.ACAIVBAGA 
RTG*TVYPPLAGNYSHPGA 


3370 


17271 


A 


3393 


396 


54 


GYPPNLTFVTSSLTSVPSPSHPPSHVCS 
PTPAIPQTPRSPPSESTPPTBVSLTTPT 
LSLGAISSIiASAGPQGASPASPGFWGTS 
PP* PYS *NRPQTAACGVFSQLFFIRPEV 
P 


3371 


17272 


A 


3394 


417 


60 

* 


KGWVFQRI LW PGPGGS PRYSRP FGGPRG 
LXPLGSGV*TPPGPHWETPFFFNNOPIS 
GQGGGGPLIPLLGGVKPEKIFYPGNQSF 
H* PKI PPHPP PLGAKQNFPPKKKKPQKN 
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NO: of 
nucleotide 
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SEQID 
NO: of 
peptide 
sequence 

• 


M 
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od 
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SEQID 
NO: in 
USSN 

26 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to itrst 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otiae 
location . 
corrcspon 
ding to 
last amino 

a CIO 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, r— Phenylalanine, 

I - *— Plv/iina "PI — Viet irltnn. I— T*..^ - 

i*— Glycine, Ja=tiisciome, l— iso leucine, 
K=Lysine, LpLeucine, M=Methionine, 
o^Asparagine, i^iToiine, 
Q=Glutamine, R»Arginine, S=Serine, 
T=Ttareonine, V=Valine, 
w— j rypiopnan, »— tyrosine, ■ 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucieouae insertion 














TKGFWV | 


3372 


17273 


A 


3395 


417 


3 


FFFFFFFQHHFGFEAAA*YWHFVDVV*L 
FLYVSIY**GSYSFSINS 


3373 


17274 


A 


3396 


229 


1 


RS FWTKTMGFSCYKDFF* TNGNKDTTYQ 
N JjW D i AaA V LiKCj Jvc 1 ALW V P INKJjJSKb t> 
ISNLISQVKEGQAWWLMPVI 


3374 


17275 


A 


3397 


344 


165 


ELKS PQPRKRNGCFFLAVDPDEGF*NLN 
FCGNGQGLNAPGGYPCE I PMGFVPNAYG 
KTF 


3375 


17276 


A 


3398 

* 


1 


409 


HKDIGTLYLLFGARTGILGTALSLLIRA 
ELGQ PGNLLGNDH I YNG I GTTQALVR I L 
FIVIPI ITGGFGN*LIPLI IGAPKMAVP 
R INN I S F* L* PP SLLLLVAYG I VEAGAR 
TG*TGYPPLAGNYSHPGACVDLTM 


3376 


17277 


" A" 


3399 


3 


396 


KDXdTLVtSFtS * AGVliGAALfelXIRAiC 
LGQPRNLLRNDHI YNVIVTAHAFVI I FF 
IVIPIIIGGFGN*LLTLIIGAPDMALPR 
I NNI S F * LLP P SLLLLLAYG I VEAGART 
G*TAYPPLSGNYSHPGASV 


3377 


17278 


A 


3400 


49 


350 


KSNSHTEVQAGQEQtCSFFRSKHKAGRAG 
SDPAPYVLAGCLCR^JSSFLFFFFSLFF 
FFFFKRDGV*QGYSSGSNTPGVKQSSCL 
GLPKYWDYRREPLHPA 


3378 


17279 


A 


3401 


1 


398 


KPPAITQYQTPLFV*SVLITAVLLVI*SL 
PGLGAGITILLTDRSLNATFFDPAGGGD 
PILYQHLF*FFGHPEAYILILPGFGIIS 
HIETYYSGKKEPFGYIGMG*AMISIGLL 
GFIA*AHHIFTG* IDVDTRS 


3379. 


17280 


A 


3402 


388 

* 


42 


TPLFNPPGAKCVNPFFPKKLKKKNPSRA 
MGGPPLFPPLWGGKPKGFP*NRKSGAPV 
THFAPPAP PPGGKRE P PP PKKKKKKKRK 
EEKESNRAVQ I KKRGNKKGRLP VLDPAE 
ESQ 


3380 


17281 

• 


A 


3403 


413 


2 


SPHPLKNFFLPPKPPNSGGGLAPFVPPQ 
K*GSLPKIPRGRYKSPPFKEKTNALPPR 
GKPGPPRASS KRPPLFFFFFRGRVPLCH 
PG * S TVQCVQAQCTLE LLGS SNF LTS S S 
P VTWGHRQGP PRP INFFI FFRGRVP 


3381 

i 


17282 


A 


3404 

• 


433 


274 


LGWAHACNP STL EG * GGRTAWS PGVQD 
QPGQHS ETP I STNFFWYFRI S YFHI 


3382 


17283 


A 


3405 


131 


427 


GPPPQPFFFWGKRGGPNPFFFFFSEKRir* 

VF*QFFLRRRGPQKFFKPPFFCLQKTSL 

FSEKKKKKGrGPPIjFPQKKKGCGGGPHY 

KKEGFTLNSFLFQGDGVSSVAQAEVQWR 

NISSLQALPPGFT 


3383 


17284 


A 


3406 


413 


137 


SPPPPPQGGIFFFFKPR*ETFSPPPNLG 
FFS PPSPLKFFFFPKGFI FLGGGGPKFP 
PPKKRFFFQNSPGGFFFPPF* KKNFFFL 
P PVI LGP PRVFF *G PPPPKK 


3384 


17285 


A 


3407 


319 


540 


IVFKDQCEESFTTLNVDIRNHQNLLDSL 
EQHGKGDLL * GANAYHCE KCNKKVRAVQ 
LCLILITFNGFRKNRFFW 


3385 


17286 


A 


3408 


2 


403 


SLHLGGGS CI VGAI YF ITT I IN I KPAGI 
TQYQTPLFV* SGLITGILLLLSLPVLAA 
GITILLTDRNLFTTFLNPGGGGDPILYQ 
HLL*LLSRPEVYILILPGFGIISHIGTY 
YSGKKEAFVYIGW*AIISIGC 


3386 


17287 


A 


3409 


2 


142 


KKENYRP VS LNRDAKS LNQ I QLH I * KI I 
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SEQ ID 
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etb 

00 
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SEQ II> 
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TTQSM 
iflQ/S1*L1 

26 


Predicted 
beginning 
nucieouue 

) fioat inn 

correspond t 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 

nucic- 

ntirie* 

location 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanfne 
C=Cysteine, D=Aspartic Acid, 

vxiuiaiiiit. HLiu, t— jroenyiaiflnine, 
G=Glvcine. Hr=HistidinA. fclgnlafWim 
K=Lysine, L=Leucine, ^-Methionine, 
N^Asparagine, P^Proline, 
Q=Glutamine, R^Arginine7S==Serine, 
T^Threonlne, V=VaIine, 

tt"-j ■ ypiu|ru»u, « — .tyrosine, 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














- HCNQVGFI PGMQGWFNIL 


3387 


1 T>OD 

1/Zoo 


TV 

A 


3410 


407 


AO 

48 


D"DC2fl V T T7P VITNDP V V T WDTWl YtrfX M?T3 U 

rfc*vioJSJ.rc x flHJriSjXlxXr FlUUKJGHVr 
FPLKNFFFPPRGKI FGGGGCQNAPPPKK 
GFFPKI PQGVF* S PPKKKKKNFFPPGKI 

Hrtryuvr LtrsXsir tr tr c c c r cr C rrccrrrr 

FFFLGVR 


3388 


. 17289 


A 


3411 


3 


402 


SYRLE*QLEDKAAPPQDFTQQLiRWPKKQ 
riKlYl V W rUnJ^MrlKUIJ VISAo 25 a Lu\SUAK£ 
L*DAHQQQKLALQEFIiKFNELMAKLYSH 
NHKVWDKEEEMEVAMHKADMMWQEI*RS 
KKLTKRMLFTQMRRITPVIPS 


3389 


17290 


A 


3412 


1 


224 


LLFQLIJIRI4RQENHLNLEGRSCSELKMH 
HWTP P WGTE* UCJj * IjiUUvQSAiIjPNNT X 
KKLTGSRTIEDICTEYNK 


3390 


17291 


A 


3413 


202 


1 


IVMSWPGWTHA* IPSTLGGQGGWITRS 
GVRDPLASAFQSAGITSVSHCSQLIFVT 
VFYLSPLSHLY. 


3391 


17292 


A 


3414 


407 


309 


PSFKAS KDRLT LLLGANAAuDLKFTPM F 
IYHflENPRALKNYTKSTLPVPVLCIRNI 
KAWMIAHLFTAWIjKEHFKP IVETHCSGK 
KISFKILliLINKAPGHRRAQMERHKMNV 
VFMIiANTTPILKLMDQGVF*TFILCLSI 


3392 

■ 


17293 


A 


3415 


404 


46 


LKKPPFQKFLKI LKKKKKGGGGGPPP*T 
PPFGGGRGGVPPRAGVSKNPVPQOETLF 
FFKNPKFPGVGGPAP* FPLLRGVNKKNF 
FNLGGGGFKKRKFS PPPPPRGKKFFFPK 
KKKKKKE 


3393 


17294 


A 


3416 


23 


404 


SVLITGALLLLSMPVLGAGITILLTDRN 
LKTTFFDPAGGGDP HjYQHLL* I FGHPE 
DNILILPGFGIGSHIVTYYSGKKEPFGY 
IGMA*AMKPIGFLGDIAGAHHVFTG*ID 
VNTRSYFTSDTINMG 


3394 

• 


17295 

- 


A 


3417 


38 


469 


SGTTHASALFDPAGG*DFILYQHI*F* IF 
GDPEVYILILPGFRI I SHI VTDYSGKKE 
PraYIGMA*AMRSIGFLGFIG*AHHIFT 
V* IDVDTRAYFTS AT IIIAI PTGVKVFT 
*LATLHGSNMK*SAAVL*ALRFIFLFTV 

OVMJ 1 


3395 


17296 


A 


3418 


1 


419 


FSTNHKD IGTLYLLFGA* AGVLGTGLS L 
LIRAELGQPGNLLGNDHIYNGIGTAHAF 

MAFPRINNISF*LLPPSLLLLLASAIGE 
AGAGTG * TVYPPLGGNYSHPGAS VDLT 


3396 


17297 


A . 


3419 


397 


107 


KNSAPPFSPIFLP*GPKFAFFPPGKKVG 
KKIF* KFPAPGPQGVYLREGGGS PSPPQ 

TDMMDKTTf Ifa Ii17\7T.PPPDT?i^\V ».'/ 'KT agTTW 
1 fcTlflxrJNIUUinir \f Xitrtrircrv V c wN zrfUMUx 

KKAVAFSS PNRR 


3397 


17298 


A 


3420 


3 


253 


RGCREMRLCHCS S AWAIAGVS EEKKKKK 
KNWO*CLIjGICSY14DLWLPGLFKWIjSOI 
GRIVELQLLELKQYLRGITKQTLKIUI 


3398 


17299 

■ ■ 


A 


3421 


1 


393 


RTRT I KTPAITQ YQTPLFVGCAL ITAAL 
LLI^LPVLAAGITIIJjTNRYLNTTFFDP 
VGGGDP IL YQHLF * LFGYPGD YI VI LPG 
FGIISHMVAYYSGKKEPPGYIRMD*AMI 
SIDFLRIIV*AHHIFTVEN 


3399 


17300 


A 


3422 


142 


377 


LSKNCYKPSYVGFVCQCKFFNQLSKFLB 
VKLLYC31VKLC* * EG*MAQPL* KQFVLV 
ALKLNI VLPYNKAISLPGIYSTD* KTYT 
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SEQID 
NO: of 
nucleotide 
sequence 
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SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 
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SEQID 
NO: in 
VSSN 
09/515,1 
26 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

location . 
correspon 
dingto — 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A- Ala nine 
C=Cysteine, D=Aspar«c Acid, 
E=Glutamlc Acid, ^Phenylalanine, 
OGIycine, H=Histidioe, Wsoleucine, 
K=Lysine, l/=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=<5lutamine, R«Argininc, S=Serine, 
T^Threonine, V^Valine, 
W^ryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














DTQNLHTTVySSSLIIIKN*KQQRCPSV 
GE* INKCFI YTTEYYSLTKRNKLSSHMK 
A 


3400 

• 


17301 


A 


3423 

■ 


2 


435 


L tLPGFG I 1 SH I VTYYSGKKE P FGYlGM 
G*AMISIGFLGFIV*AHHIFTVGIDVYT 
RAYPTSATI I IAI PTGVKVFR * LATLHG 
SNMK* SAAVL* ALRFIFLFTVGGLTGIV 
LANS * VD I VLHDTYYVGAHLHYVLS IRA . 
VFAI 


3401 


17302 

■ 


A 


3424 


2 


436 


LNTTFFDPAGGGDPILYQHLF*FFGHPE 
VYI LILPGFGI ISHI VTYYSGKKEPFGY 
IGMG*AMISIGFLGFIV*AHHIFTVGID 
VDTRAYFTSAT 1 1 IAI PTGVKVFS * LAT 
LHGSNMK* SAAVL* ALRFI FLFTVGGLT 
GIVLS 


3402 


17303 


A 


3425 


1 

• 


193 


PTR P PTR PACS EL RS RHCTPAWVTE * DS 
VSKKKKKKKFSPARGAHVCTPTFLGGQR 
GKKFCFAP 


3403 


17304 


A 


3426 


3 


446 


HKKFLFP KS F FNRVFPPTKKKNPRGPG F 
FFFFLKKIFFFPPGGI*SGGFGSLQLFP 
PRVKKFFSPPPPEKGGFKAPPPGPGNFF 
FFLKKRGFSFFGGGFLK*NPGFPPPPPF 
KKWGVPKGGAPPPPPFF 


3404 


17305 


A 


3427 ; 


202 


1 

♦ 


FS PGRMLGNT I KLP YTHS F I LMNSQNNL 
EECYKVGG I PLSNFKTNYKATAIKTMWYW 
HMNRYMD*WNR 


3405 


17306 


A 


3428 


159 


1 


NPPHEKNFCS FFQMES HSADQAGVQWCD 
LSSLQPSLRGFKRFS * VSLVSSWDH 


3406 


17307 


A 


3429 


22 

i 


298 


ESYASTAKATERDMSDRMLSASGMPRRY 
SDYPDAYTT*NILSSVGSFISLAEGILI 
IFMI*EAFASKRKVLIVEEPSINLE*LY 
GCPRLRGR 


3407 


17308 


A 


3430 


267 


26 


GITFFFFFWLEI * LYYWLFLANKLYFHE 
QIWFGLVFFFGLLLFDFSFYSDYFPLLL 
CLGS FSNF *CL IHCFFS VI I VLLF 


3408 


17309 


A 


3431 


274 


374 


KKG* 1SCS ISPPQHTVRRCJLTLSPRLEC 
SGVISAHCRLCLPGSSDPPASAS 


3409 


17310 


A 


3432 


177 


47 


FYFFFFFFFFFFFFFFFSLLFFFFFFFF 
FFFFFFFFFFFFQINRSFIASFKYHK*V 
LIFKKKKN | 


3410 


17311 


A 


3433 


379 


83 


FKRHFTKEDEiRMVTEHIKSCSSSLATRA 
MQIETTMAHCYTFMRLAKIKKYQYTKCC 
*R*RVTGTLTYCWWECKWrQPFWKTIVR 
RFLKQYIYHVIGYV 


3411 


17312 


A 


3434 


214 


3 


AASTFFFFTRSTYYVCIFLKSKRMINK* 
NYIKLKSFCTTTTKINKMKRQPTDWEKM 
FANHINHHEGLILK 


3412 


17313 


A 


3435 


1 


353 

• 


GGQGQPGQ*SETLSQ*KVSWAW*LQLHR 

t3md/"\tywt t cDrTirvi^GPDn^rwTDaMa 
KWKy JJ JN i-iLilb tAjJjy*- 3 blif^ w Ci XirAWA 

RVRLCQNRSVNEKSLHDICNDKK*QNI* 
1 1 S VP EG KQRMKGLENLFNE I LDENYPS 
LARDL 


3413 


17314 


A 


3436 


29 


394 


VFMEVDFFFFPFFFFFFFGKRNFFFPPR 
GGGGGPF* I FGPPPPRGKGNFPPPPPGK 
GGKKGPPPPPGFFFFF* KKGGSPPGGGG 
FLTPPLWDPPPWPPQRGGFGV*TPWPGP 
GSDFLNFFFF 


3414 


17315 


A 


3437 


108 


2 


P VAVAHS CNPNTSGGRGGWIS *GHEFQT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

00 


SEQID 
NO: in 

I [COM 

26 

■ 


Predicted 
beginning 
nucieouue 

location 
correspond 
ing to first 
amiDO acta 
residue of 
peptide 

* as* ■ i an/* a 

sequence 


Predict- 
.ed end 
nucie- 
Qiiue 
location 
correspon 

inn i/% 

aiug to 

last amino 
acid 

rcsiuue oi 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
£/=vjriuiamic acio, r—rnenyiaianine, 
v>— oijcine, jt— nisuaine, i— isoieucine, 
K=Lysine, L=Leucine, M=Methtonine, 
N=Asparagioe, P=Proline, 
v^vviuiaiiiiiic, i\ = Arginine, o^oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
a— un Known, — oiop couon, /^possioie 
nucleotide deletion, \=possible 
nucleotide insertion 














SLANTVK 


3415 


17316 


A 


- t Ate* ■ 

3438 


215 


3 


LTEAGVQWRSHGSLWPQTPGLK*SSCLS 

T.DC T WT? V Q A VHW T P 
JUrOlniilQiiXXUllK - 


3416 


17317 

\ 


A 


3439 


1 


186 


QTSSFSFRRSVITNFFKLKDDV*TH*KE 
KTSTRP 


3417 


17318 


A 


3440 


290 


-> 
3 


SIKLNPPPAPPPGGPN*DSFLPPPPKKX 
GGLAQQSPKVL 


3418 


17319 


A 


3441 


163 


2 


* I*DKVLLCHPGBSATVHSSLGDRA 


3419 


17320 


A 


3442 


131 


1 


RPRRPliT I KS IGHQWY*TYEYTDYGGLI 


3420 


17321 


A 


3443 


162 


2 


PGAAHSTGHLPLRFHLRLGAVTHACaiPR 


3421 


17322 


A 


3444 


2 


130 


WEKIFANHVSDKGLIYRIYNBLQQLYK* 
KTNGPIKKRPKSTRP 


3422 


17323 


A 


3445 


2 


117 


IiALIX3IiKPGLK*ATCIjGLPRYWDYRHQP 
phpacfskgx 


3423 


17324 


A 


3446 


350 


101 


KAFSSSI^SSRKKGDSDAGFCIHFKTVP 
FGFFWTLELKKGIPFFLNFFFFPFLRVL 
KGSKNPF*KRGPFFFFFFFFDNFYIWN 


3424 


17325 


A 


3447 


106 


1 


LLQPNGQLGPGTVAHGCNPSTI J GG*GER 
ITTSGVR 


3425 

• ■ 


17326 

• 


A 

- 


3448 


28 


333 


GQQEQNS ISXKKKKKKKKKTPFKKGKRK 
IPGGKKKPLGKSRAPPPLGGGEKNPPPG 
KRAPEKI FKKKPRGiTOETFKRMEKGFLK 
GQ* PGNP * KVKPLWGGKK 


3426 


17327 


A 


3449 


81 


339 


pivss cpgvagalkqamtlefkvy qhhv 
VANCRALSEALTEIjGYKIGTGRDTDGVQ 
QALLVVGLRLAS * P I VSSCPGVAGALKQ 
AMTLE F KVYQHHVVANCRALS EALTELG 

X JxJ. VJ 1 VjKJJ J. LAj V AJjLi V VVj JUtCu/ww " v 

QPGCGPGSAAAAAMM 


3427 


17328 


A 


3450 


166 


3Z2 


onpcTnn tttidtcdavt dttsx/c^t j^tt . T 
KJJro 1 1 VJjl 1 iJK 1 oKy RJ. ir 1 JJ VoyLoNJj J. 

DIYKTL*PTIGKYTFFSSTYRSLR 


3428 


17329 


A 


345 1 


257 


354 


KFLSFFRDRVLLCCPGWSAW* S * FTAA 

T GCU 
boon 


3429 


17330 


A 


3452 


286 


3 


TD PFP I KRAKERE MLQI TKXRS IRENF I 
TDQQKTKEL*EEQLYAHRVDNLDEMDKF 
LEGYK*PSMTHGKIENLCRPITSKDIES 

VlrUNljlilUirtr 


3430 


17331 


A 


3453 


358 


353 


R* * KLLMMPC* I BTNKT*NTCTVT KEEK 
GQPGTVAHACNPSTLGG*GGRIT 


3431 


17332 


A 


3454 


108 


353 


THIiGGNP VRLYAWLLLHYVAS YRI FS VS . 
HCKDQTYLPILL*NFLMEIi3TVAHTYNP 

<? T T iRRTinPR T T* VTJP PFT^T .BNMVTrP 
OliJwvyw""* ^ V y D r u 1 DJJnilt'J V A,^ 


3432 


17333 


A 


3455 


342 


50 


GP VTPN I P P VP PLGGQKELFPPKKKKF * 
KFWGSPPPLFFKKPKNPRETPPESCPKP 
KKTPIPVKV*GKKTP*TFSKKKKTPPIP 
ELFLLKKKKKLIN 


3433 


17334 


A 


3456 


343 


2 

* 


KAP FFFFFKKFWS Q KKRGLFFF FFKNFF 
IPGFPGPTFFFWGFFFFKKGPPVF*LKK 
FFGGPIiPPLKDPPGVFPFSPF* FWDFPP 
PPPFFFFFFFFFETVSLCRPGWSAVSRD 
C 
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NO: of 

nucieoiiuc 

sequence 

• 
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SEQID 
NO: of 

sequence 

• 


M 
eth 

OH 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

ft n Iaa fl a 

HUklCwUlIC 

location 
correspond 
ing to first 

d 111 111 U a LIU 

residue of 

nanf iff a 

sequence 


Predict- 
ed end 

IIUul> 

otide 

location 

correspon 

Ulllg LU 

last amino 

Hf*tft 

residue of 
peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

H^V*ltl*iiiin*» /Vt-ivij J 1 — ruBD YliilnlllilC) 

G=Glycine, H=Histidine, 1-Isoleucine, 
K^Lysine, LHLeucine, M^Methionine, 
N^Asparagine, P^Proline, 
^ \jiuuiiiiiiic, iv— rvrginincj ocrine* 
^Threonine, V=Valine, 

WszTrvntAnhnn YsTvrftcinfi - 

X=Un known, *=Stop codon,/=possib!e 
nucleotide deletion, V=possible 

n nr»l <*n+5 H a inc^rrinit 
HUvlCUllUC IHSvI HUH 


3434 


17335 


A 


3457 


2 


259 


LTKPTESGTEDKKGRDSKBSEGKTERTE 

SLRSQNGGENS V* S CPS TS S TAALNTAA 

AANPLALNPQVSLSSLSCCPISIKGSFV 
p 


3435 


17336 


A 


3458 


186 


334 


AGGNKKMEKRNM*NKEIP*WFKKNWPGA 
GAHACN PTTLGGRGRR I MRSG 


3436 


17337 


A 


3459 


207 


I 


Y FKPTLKINDKG YIiNETIDKFDPE?D INR 
TSNIiTL*EHALFPSTCGIFMRRDYIEGH 


3437 


17338 


A 


3460 

i 


268 


3 


PLNPGPKQSSPPGLPKPWPFRGEPLPLG 
PWFFFVKNRGRP*AFFGTNPLAPPLPFL 
♦DRVLFCHPGWSWVQSQLTVQPPPLRL 

KQrS 


3438 


17339 

• 


A 


3461 


350 

• 


2 


AETAHWINNAFGPGSANKCTVRWWFK*Ii 

CKGVKRLEDKAI/3GWPS KVAINQLRAI I 

wm^DT vnT*T3inr7vTn?T KfTPoeTVTJTfMrr ttd 
KvxAirlf JKJr A KK V AKJtS JaN JL Js FixA V X UHXiAK 

I G KVKKLHKWVPHELHENKRNRFEVWS S 

T.TTP 


3439 

• 


17340 

■ 


A 


3462 

• 


312 

- 


1 


HIVTYYSGKKEPFGYIGMV*AMISIGFIj 
Qjr 1 V w Anrl J. trl V vj x U V U 1 KAX JT l o V F 1 J. 
IAIPPGVKVFS*IiA^IiHGSNMK*SAAVL 

* T?T ,T .D VfiTJT .TO TVT .21 
"HlAvr X r 1 >l ig V ov7Xj X £\,JL VXxrk 


3440 


17341 


A 


3463 


181 


3 


FLPLIAAFSGLLLFSGFLGVFLFCFFET 
ELPRLECCX3TISAHCSL*LLGS3D3CAS 

ZtSTW 
jl n 


3441 


17342 


A 


3464 


176 


263 


VGHDAHNPS I LGGPGGWIT * GQE FKVDA 
A 


3442 


17343 




34& 


239 


336 


tePGEVA'ifSCNPS , t , tGGRS6Wll*dQlFE" 
TSLT 


1AA1 

3443 


IT} A A 

17344 


j\ 


34O0 


oo 


3ZO 


OTQT.T.WTT * *PT.W* VTtfTPT.PPPPT.DPTT. 

TPPFSPPPPPPPPKKKKRKRK 


3444 


17345 


A 


3467 


4 


345 


EEERRRERKKKRRKKEEKKKEBREREEK 
ERKKKKKKKE KKKRRGRQE KRKGTG* IE 


3445 


17346 


A 


3468 


265 


335 

■ 


QMNPDRHLQF * DKVLLCHPG WGTVARS * 
LTATSRSSCLSPPSGWDCRRVPPCLASF 


3446 


17347 


A 


3469 


115 


342 


INKQINKIKDPTSYNILLSQSLI*FKAIj 
FKQINHLYNIKVQGEKPSTD 


1AATI 

j44/ 


1 /34o 




34/0 




3ZO 


p TvpTTrpnKwpnpp'n ptqp pp v t prht, y 

ITDBVNDPSLTIKGIGHQWD*TYEYTDY 
GGLTFNSYILPPLFIiEPSDLRLLDVDNR 


3448 


17349 


A 


3471 


3 


329 


RIGARDESSLAVTHKPRKIHRKIiTLWVF 
VNDSLVMPRPDKNHQWAFNKNCFPLVDV 
GIDPYLVYHLRPHQAQGI I FLYECVMGM 
RMNGRCX3AI LAD * NGLGKTLQC I SL 


3449 


17350 




3472 


1 CA 

154 


z 


PTvTT .T . VP Y Q Vttl K V YYk T WTPM W T VH T A^T T V 

KIQKQPRCPLMDKSIR*L*YIH 


3450 


17351 


A 


3473 


132 


340 


ALKTENITTLCLDF * L IETVR* *MCVFK 
KKKKXKKKKKKKKKKXKKKKD P P KKKKK 
IK 


3451 


17352 


A 


3474 


3 


345 


VI YS S I FAGTL I TALS SH* FFT * VGLE I 
NMLAFI PVLTKKINPRSTEAD I KYFLTQ 
ATAS IFIJj17VILFNNIIjSGL*TITNTTN 
QYSSLII IMAIAI KLGI APFRRRGREFQ 
IY 


3452 


17353 




3475 


1 


254 


FHHVGQAG FTP DL»R * STRLGLPKCWDYR 
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SEQID 
NO: of 
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sequence 
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od 
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SEQID 
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¥ tOOTVl 
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09/515,1 

Zo 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G-Glycine, H=Histidine, I=*Isoleucine, 
K B Lysine, i^=keucine, M^Metbionine, 
N^Asparagine, P=Proline, 
Q=(jlutamine, K-Arginfne, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, /-possible 
nucleotide deletion, ^possible 
nucleotide insertion 










• 




REAIiCIjVHIIi^xIjPMIGP 
VFVFLQLHQAFIQSLLTLFYFISFPCNS 


1 A f> 

3453 


17354 


A - 


3476 . 


351 


186 


S V JjS ADN i fc.MGr HHVGQflgiiEIiLSSSDP 
LAP AS * S AG I RGVRHNTW P VLT I TLN 


3454 


17355 


A 


3477 


330 


3 


QEKVFE IPLTRDNVS EILRCFLMAYGVE 
PALCDRLRTQPFQAQPHQQKAAVLAFLV 
HELNGSTLIINEMDKTLESMSRYRKYEW 
NVKERLRRLKTARA* RLGRSQVKK 


3455 


17356 


A 


3478 


373 


248 


NRLNPGGGGWSEPRSQHSTPTWVTEYDS 
VSKK* INKNKNWK 


3456 


17357 

* 


A 


3479 


3 


92 


EX3QGCSEP*SRYCSPAWVTB*DCVSKNF 

F' 


3457 

4 


1735? 

■ 


A 


3480 

■ 


380 


2 

• 


VEVPLLNTSVFFASGVSIT*AHHSLIEN 
NRNQ I I QALL I TNLGGL YFTLLQAS EYL 
ESPPTISASI YGSTFFVATGFHGLHVI I 
GSTFLTMCFI RQ I I FHFTS KHHFGLEAA 
A*CWHFVDWCIFL 


3458 


17359 


A 


34S1 


225 


1 


ERAERWEQVMIALFSPNYIRLSFPISP 
TPWRHKFRFHLKG Y I WPGTVAHTCNPS 
TLGSQGGQITRSGD*DHPG 


3459 


17360 

* 


A 


3482 


332 


1 


IKYFLTQATASI ILLIAILFNNILSGQ* 
TITMTTNQYSSLI I IMAIAIN*GIAPFH 
F*VPEVTQGTPLTS 


3460 


17361 

• 


A 


3483 

■ 


277 

• 


1 


LSMSSFSISSIPSASINS*ALLFSSAES 
NSLECTFSSSFSPNSSESFSSSFPSCSC 
AS*FSNSQSLIKASSSTEASPLNSSASC 
FLPLPLVG 


3461 


17362 


A 


3484 


404 


209 


ALS VQ PLWRPRQENGVNPGGKASR* PRF 
PPCGPAWATKGELVPKKKKKKISFLGHD 
QSLELLLQR 


3462 


17363 


A 


3485 


31 


347 


FCILERKAFLYW* CWKFFFFFFFFFLE 
KBFFFLPLLVFFWCYYHLKGASLP*VFL 
FFYLHCYSSLFTLFFIFLQSLTPHFFLV 
♦TFFYYDPLMFFFLTLFLELV 


3463 


17364 


A 


3486 


327 


161 


QKKVRDPYEQL*ANKFKN*EEMPRFLEK 
HKLPKLTQEEIKNQ 


3464 


17365 


A 


3487 


311 


3 


PRRGRFQLTQNFALPFNLGNKTKIPFKK 
KKKKTEPIiCPCTCVCAAFVCLCFHLCT* 
PCDCAHDCDNICVCVNICVYVCVVTHLH 
TGLCSCVDCDHICVGTCC 


3465 


17366 


A 


3488 


245 


1 


RPRRLLAHjLIIITF*IJPQLNGYIEKST i 
PYECGFDPISPARVPLSIKFFLVAITFL 
IiFDLE IALLLPLP * ALQATKLPLIV 


3466 


17367 


A 

■ 


3489 


2 


400 

• 


AAPILGMLGLRQRGKYKRQRQLLCLWSA 

ILLFLYFLNKLL* ltkkkkkkkkkkkkk 

KKKKKKKKKKKKGGPRGGNPQILKGKIG 
KPPGGFFKGGGGPKKI FFFKNRGALFFK 
KKPPQKKNIjGPGPPGLKNFIiKEKKKA 






A 


3490 


J / 


jOD 


PDMYQCGLNHFI YLI I YLRWS FALVAQA 
GV*WHHLGSLHPLPPEPKCFSC 


3468 


17369 


A 


3491 


25 


384 


IRGTF* FFWGGLGGLS KNNPL * KKNFS P 
PKLGVLPPPLFFPGGGFFFFFPGAPPGF 
KKKKPP PKKFFF * KKGAPPFFFFFFFFF 
FF 


3469 


17370 


A 


3492 


170 


41 


AQPDQHGETTFLLKIQKLTRHGGICL* S 
QLIGGLSFITQKITC 


3470 


17371 


A | 3493 


1 


240 


VLS PAD KTNVKAAW GKVGAHAGE YGAEA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 

* T ■ mT • "*§ ' mt • mm. mm _ — .mm * 

K=Lysme, L=Leucme, M=Methionine, 
N»Asparagine, PHftroline, 
Q=Glutamine, R=Argmine, S=Serine, 
"^Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=po$sib!e 
nucleotide insertion 














LERMFLS FPTTPS * FLDFFFFLEKJGFH 
HVGQS SLGFLTSRDP S AYNSLLGS 


3471 


17372 


A 


3494 


359 


204 


TQWLMP VI LAPWEAE VGR* PEVRSLRPA 
LNSPFFKRLWIMMYIMNI I ILS 


3472 


17373 


A 


3495 


1 


178 


DRLSPGV*GSSEIi*LCHCTPGWATEPDH 
VSRKJKKKRKKVGLGKNKAKQKHPRGYYR 
AEL 


3473 

i 

• 


17374 


A 


3496 

i 


361 


1 


LGPKLLKKPFWIS IGIKGVAGLPGFPPT 
PPQ*G *NFGLKRGGFFFYFFEAKGWFNT 
PGGKQRGP FRSLEP F PFRVKLFFPPNLP 
NNWGHKGAPPKPPFFFFFLRWS FALVAQ 
AGVQWCDL 


3474 


17375 

♦ 


A 


3497 


276 


79 


AEVYKI KCELGAVAHACNP STLGGQGGR 
ITRS * VSASFTKKYWQLflLMKEUJJB 


3475 

m 


17376 

■ 


A 


3498 

* 


3 


476 

■ 


FGTSMSI I TLI TPRLARGNNNLFSCFPT 
FTFDHLATPLLILTT*LLPLTSMASQRH 
LFREPLSRKKLYIjSILNSLQSSLIITFT 
GTELI I FYIFFETTLIPTLAI ITR*GNQ 
PERLNAGTYLLFYTLVGSLPLLrALIYT 
HNTLGSLNILLLTVSAQE 


3476 


17377 


A 


3499 

* 


581 


2 


AHACNliNTLiGGRGRWIT*GQEFDTSLAN 
K 


3477 


17378 


A 


3500 


380 


135 


RHQSGQIGKTPSLVKIPKLTPPKGGVL'* 
SHLTKRARQEKGLNPKSKGFN* PKLRPC 
P PTI VAKKQNPVS KKKKQTRQE PS A 


3478 


17379 


A 


3501 


1 


386 


GSR*GNHPERLNAGAYFI*FYTL*GSIjPL 
LIRL I YTDNTLCS LNI LLLTLTGQELSN 
S*AHNLI*LAYTIAFIVKIPLYRLHL*L 
P KAHAQAP IAG S I VLAAVLLKL CGYG 1 1 
RLTL I LNPLKKHI DYP 


3479 


17380 


A 


3502 


2 


389 


EKQL * KDKQVYRATHRIiLVLGAGESGKS 
TIAKHMRILHVNGFNGEGGEEDPHAARS 
YIDGEKATKVQDILNNLKEGIETIGAVM 
SNLVPPVELANPENQFIVDYILSAMNVP 
DFDFPSEFYEHAKALWE 


3480 


17381 


A 


3503 


2 


379 


PFGYIGMV*AMISIGFU3FIG*AHHIFT 
VG I D VYTRAY FTCAT 1 1 IAI PTGVKVFS 
* LATLHGSNMK* SAAEI»*ALGFI FLFTV 
SGLTGIGLTNSSLDIRLHDTYYWAHFH 
YVLS IGAVFAI IGG 


3481 


17382 


A 


3504 


281 

• 


& 


FKMRFG WGHSQTVSENYKPLLKE IRENP 
NK*KNIPCSWIGRISIVKMPTLPKVIYR 
FNAFP I KLPSPFFTL 


3482 


17383 


A 


3505 


139 


263 


ETGSHYVPQAGIi* LLASSDPPATASQNI 
G I THVS YYVQPL * PIFWEAVAGGSLEAR 
S 


3483 


17384 


A 


3506 


2 


427 


IiKTSMTMISTTLLT* LRPAWH * AQYQTP 
FFV*SALITAGLLFLFLPVLAAGITVLL 
TDRNLNTTFFDPAGGGDPILYQHLF* FF 
GHPEGYILILPGFGI ISHIGTYYSGKKE 
PFGYIGMV*AMISTGFLGVIG*AHHVFT 
GG 


3484 


17385 




3507 


343 


3 


GWHSRSSGGASSPSSPSHRSRLRVSSGS 
LGRL * WS KS S KTS VS CG WARDTEKPPL 
SKIEAS S SDSCES VMAKANAVFFTSTTG 
EKPASSASLSLSVWASGSSPPGAANSSS 
S 


3485 


17386 


A 


3508 


3 


363 


GPGDkA^RlAtENtt) IT* kWGMRT&tfC 
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SEQID 
NO: of 
nucleotide 
sequence 

'i 


SEQ ID 
NO: of 
peptide 
sequence 


M, 
eth 
od 


SEQID 
NO: In 

1TOCJWT 

USSN 

09/515,1 

26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino aciu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
co r res pon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

Ti* . ■ |,,i n .,, » ■ A ^1*1 TA^VkL ft • 

E^GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, K=Arginfne, S=Senne, 
•^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /"possible 
nucleotide deletion, possible 
nucleotide insertion 














AKQLLSKQPF *S SRTGDLKLELQIELVR 
ET KRKC * RVL * I DRALTSHL Y S LLQTQD 
ALGDAS ADL IQKTS * &QEE FGYYAKTQK 
LLCRNGET 


3486 

• 


17387 


A 


3509 

• 


3 


434 


GPGDHVARGISGKKYDIGQKWGM3TYKC 
TKELLSQRPGLGS PTVDLBL* LQIELLR 
ETKRKYE S VLHLGRALTAHL Y S LLQTQH 
ALGDAFADLRQKSPQLQEEFGCYAQTQK 
LLCKNGETLLGAANLFVSS I YTLGTKTM 
EDTL 


3487 


17388 

i 


A 


3510 


1 


401 


GTRKNSFLHDSQTS FCFSDS I PTPSNMB 
ETQHKSNLELLR I S LLLIB PWLEPVRFL 
KSMFANNLVYDTSDSDDYHLLKDIi*BGI 
P TLMGRL EDGNRRTGQ I LKQTYS KFDTN 
SHNHDAL LKNYGLL YCFRKDMN 


3488 


17389 


A 


3511 


117 


2 


FKFIFSEMESHSVA*AGVQWRNLdSLQfc> 
LPLWFKQFSC 


3489 


17390 


A 


3512 

► 


491 


90 


KKVKIVLAGFFFKPPPPKN*KKKGLK*R 
KGVGPRGYTVNPHPCGYKPFGGPGVQNP 
PGQKGETPFFQKKNKKPGGGGGALIPPP 
KGGGRGVPI*SPEKFGLKFPPGPPP*GKK 
KNRFQKKKKKKRQN * PQKSQC 


3490 


17391 


A 


3513 


2 


31 


GKGAPTTSLINVAGTKIIAKWEDNKLP 
GAI CS LT CGGAH I GT PMARDERVNLLS F 
TGSTQGGKQVGLMVHERFGRRLVEDGGN 
NAI I AFEDAHLNL YVPSTLFAAVGTAGP 
KCTTARRQLIHES IHDEA*NHFPH 


3491 


17392 


A 

• 


3514 


2 


390 


KEFHKTHPtiENITLAPDPEGPDGLPPVA 
YNPWMD I RQREDGQTLNIREP YGPI PVD 
F*RKIRQSYFASGA+LDTQDGRLLSALD 
DLQLANS TI IAFTSDHGWALGEHGEWA* 
YNNFDDDTHDPL I F YDT 


3492 


17393 


A 


3515 


3 

> 


410 


RGTGNNWAQGHYTKGAELVDSVLDWRK 
ES*SCDCLQGFQLTHSLGGGTGFGMGTL 
LITKIREEYPNRIMNTFRVMPLPKV*DT 
EGEPYNATLLVHQLVGNTDETYSIDNEA 
LYDICFRTLKLTTPTYGDRHHLVS * 


3493 


17394 


A 


3516 


1 


396 


GEDAANI YARGHYT I GMEN I DLGLDRI R 
KLADPCTGLQGFLVFHS FGGGTGSGFTS 
LLMERISCDYGKKNKLEVSIYPAPC2VCT 
AVFEP YNYI LSTHTTLEHSD * AFMVDNE 
AIYDICRINLHIBRSTYTNL 


3494 


17395 


A 


3517 


2 


399 


EKIG YNPDTVAFVP I SGWNGDNMLEPKA 
NMPWFKGWKVTRKDGKASGTTLLEALEC 
ILPPTRSTDKPLRLPLQDVYKNGGIGTV 
P AGRVB TGVLKPGME VT FAP VNGTTEV* 
S AERHHE P FREALPGDNVGF 


3495 


17396 


A 


3518 


1 


402 


GEDAANNYARGHYTIGTBI IDLLLDRIR 
IvUnlJU ** A w JLiWvjur It V r ±io r vovXlo 
LLMERLS ID YGTKS KLEFS I YPAPQVST 
ACSEPYNS ILTTHTTLEHSDCAFMVHNB 
AIYDI *RINLDIERSTYTNLNR 


3496 


17397 


A 


3519 


1 


408 


GS CIX3QKL YNSTTKTAT* WSSNHTKKNP 
FSKFPKLQNVWTHPESHRDWTAPTRLYW 
I CGHRP YVKLPNQWAGS C I IGT I KP S FF 
LLPIKTGEliLGFPVYASHAKRS IAIGNW 
KDNEWP P ERI IQYYE PATWAQDGL 


3497 


17398 ( A 


3520 


3 


162 


ETVLIjCCPGWMTLGLRQSSCLSLPKCS 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 

• 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: In 

JTCC2VT 

26 


Predicted 

beginning 

nucieotlae 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otiae 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£>-oiutamtc Acid, f — rnenylalamne, 
o— Glycine, Jti— iiisiiQine, i— isoieuctne, 
K=Lysine, I/=Leucine, M=Methiomne, 
N=Asparagine, P=Proline, 
i^ = v>iuiflniine ) jv^Arginine, c^oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, v=possioie 
nucleotide insertion 
















3498 


17399 


A 


3521 


184 


354 


ETGSSliLP * GS SELPVSHPYFFNTHSLS 

JjbnIHininlHlNbolaWlnLFLx IJNMo 

CQRLVAVYRQKGS FKDQCTDI 


3499 


17400 


A 


3522 


2 . 


313 


ARLRSQIiLRRLRQENCLTPGIQACRELL 
KsVt I AoNlJirLvw^v^- 1 ^ VWlji, JuiooLiU 
DKARDSVSTKTKREGRGEK 


3500 


17401 


A 


3523 


13 


127 


LGLQARTTSMPSS *GYRRAPPPCPALLY 

12*T7T?'TV~ , E , tTLnr/"«^TVf~'T DT T TOTOT) 

r w V Jj, l\»J« Hn VCiUA^iiKljli i o 1 


3501 


17402 


A 


3524 

i 


148 


1 


VCMFLEES CS VAQAGVQWYNHSSLQP * T 
PGLKRSF*LSLPSSRDHRLMT 


3502 


17403 

i 


A 


3525 


1 


98 


VNPGGEGCS EPRSHHCTTAWVTE 5 DS I S 
KINK 


3503 


17404 


A 


3526. 


138 


3 


AASTHDLG*LQPPPPGLTPPSCLRLLSS 
WDYRPVPLRPANFLYF 


3504 

*- 


17405 


A 


3527 


10 


206 


PTTSLISFTS YS PCLKCSNVYEQVNRMR 
VLFYKS KQRLGMVAHAYDPS TliGGRGS K 
IT*GQGRRG 


3505 


17406 


A 


3528 


345 


3 


VTELI Q SNKAKL XKKS KNKNEQS LQK I * 
D YAKQPNLRI SGVL5EEEKS KSLENI FE 
KI I E ENFPGLTRDLE IQ I QKAQRPGKFI 
AKRLSPRHTVIRLSKVKTKGRILRAERQ 

mm 

K 


3506 


17407 


A 


3529 


3 


90 


AVL * ALGF I FL FTVGGLTG I VLANS VDA 
A 


3507 


17408 


A * 


3530 


3 


90 


AVL*ALGFIFLFTVGGLTGIVIiANSVDA 
A 


3508 


17409 


A 


3531 


354 


3 


AVI NSIiPTKS PGPDGFTAKS YQR YKEEIi 
VLFLLKPFQRIEKEGILPNSFYESSVIL 
IPKPHRGTPKESFRPISLMNIiNAKILNK 
RIAS *NHQHIKKVIHHGQVGFIPRTQGW 
FNKQ 


3509 


17410 


A 


3532 


2 


357 


FTRPRFSFTFFVLIRKYALSCGKISIQH 
S Y * KKKSRGKIX3GL *G INFCFFERKFHS 
VTQVGGQKHDPG*LKFPPPGFRGFSRLS 
LPS SWNHKRGTPRUVNFCTPNKNGGRRG | 
RKFQIY 


3510 


17411 


A 


3533 


76 


365 


KKKKKKKKKKKKKKKKKGGAGPEI PKLK 
MGKSEKPPGVFLRGVGEKIXFFFLKKGG 
PFF* KKKPLKNKNWAREPSFKKIFKKKK 
KKAHFFFGFKI XiKK 


3511 


17412 


A 


3534 


7 


353 


FFGFGGVFLVFLFFFF* KYPFFFPPPPL 
FFFP FFKWAPRR P PPP FFS S FFKNFFFF 
FP P P PP FFFFF PQKNFPIjFFS PliFFFFS 
PKFFFFFFPLTKKKKKKNKKKKTKTPKI 
HPQTQRK 


3512 


17413 


A 


3535 


354 


41 


AGGVPPGIPPPWGGRAGGSPKGRGFGPP 

TREGEGGKSP*PWKPRVQPT*NAPPPSP 
PGEGAKPPFQKKKKSMFHKR 


3513 


17414 


A 


3536 


174 


364 


TFNSLAFTFFFFF* RETGSHFFLEAGVQ 
GNHS S LNP * P PGLRGSS CLTLP KC * D YR 
REPLPPA 


3514 


17415 


A 


3537 


38 

* 


368 


PGRIjRLQEAVNHCTLEPGQQS V* KKKKK 
KRGGAFFKAKLTLAGWKGEKKKKTGVFF 
*TLKIRPKEIRNQGGSCX}RKKAELGKLQ 
QT Y P VLNFKATF YGEQGD YY* TFFKP 
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SEQID 
NO: of 

DUClcullUc 

sequence 

"i 


SEQID 
NO: of 
pcpuue 
sequence 


M. 
eth 

Ou 


SEQID 
NO: in 

I ISSN 

09/515,1 

26 

■ 


Predicted 
beginning 

n 1 1 <* 1 aa ti ff a 

location 
correspond 

into in fifrf 

■ tig III III 91 

amino acid 
residue of 
peptide 

& Ct| ucl ICC 


Predict- 
ed end 

mi f |<v— 
II u Lie* 

otide 
location 

LUI 1 COJJUIJ 

ding to 
last amino 
acid 

rcsiuuc vi 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 

l?:=f^| nfamip AciH EV=PhAnvlftlnninA- 
Ei^vjiuuiuin. s%biviy x: ~x llvlljr lUlouillw, 

G=GIycine, H=Histidine, I=Isoleurine, 
K=Lysine, L=Leucine, M=Methionine, 

N=Acmt rfl<rinfL P=PrnlSn^ 

Q=Glutamine, R=Arginine, S=Serine, 
T^Threonlne, V=Valine, 
W=Tryptophan, Ys^Tyrosine, 

Ysl Tnknmvn ^z^lnn m/Iaii Asn/tMihlA 

nucleotide deletion, \»possibk 
nucleotide insertion 


3515 


17416 


A 


3538 


107 


349 


. SVTFPFFFFFLKQTGSHFW*AGGQGGN 
FD*LEPKPPWLRESSLLTLPSSLDYGPP 
PTRP9NFY3TFFE1CPGFC!IiVPOIVSFP. • 


3516 


17417 


A 


3539 


274 


3 


PRRWS LPLTQMGPLPSRMGPKTKLCPKK 
KKKKI PPRKILE PDVFTGQFYLMFKE I T 
SIFLKLFQQCEEEGTFSNSF*BARIALK 
TK*DKN 


3517 

■ 


17418 

• 

* 


A 


3540 


1 


374. 


AL-AJj V KJNUrll X iVVbr 1 lALAaEtLir Cl £• 1 

PIII*G*DN*LGPLVIGAHWALTLINY 

loll iJ rra VL>ljijLiHV^AX r&AKAu luL 1 

AYPPLAGDySHRAAYVNLTILSLHLARV 
SSILWDTDSITT 


3518 


17419 

1 


A 


3541 


427 


76 


RGSLSPGV*K* PGGQNQTFPKAPKLRPG 
APRCFPAGKGS I KPTPGSPPGGRRGTPC 

MFWAS KTKNGLNKTLGPNPGPPLGGLRS 
PFWVG 


3519 


17420 


A 


3542 


234 


no 

88 


PPPPPPDPPPPPPPPPPPCT.DPPPPPE'P 

erevt t tt rtrb et et t if e out ret ret e 
FFFFFFFFFFFFFW* * ILYFV 


3520 


17421 


A 


3543 


30 


419 


TFSFFPFLFFFGFXjVLGGEKQGKWDPKP 
PGAEKPPTFSPPKI*KKGAPPPPRVNFK 

KF*KKGPLPFSPGGKNLKSFWGIWSIGN 

ot rjTPPpawao^Trvpnirt?vppvwppT?P'f' 
fliW V i Fb'KWJb'U-rfcKi't't? r W»lr XJsSi e r if lUjr 

PKPGGKKTQI FFFLWEKK 


3521 


17422 


A 


3544 


159 


411 


VSRDISPVRLSPSSCFH*PFIiHISSSVE 
GRFS S FAQFLLS CLNFCTFPLLFLLLRF 
*DRVSLCHPGWSAVA*SQLTATSTFWVK 


3522 


17423 


A 


3545 


2 


.351 


r-if yi tirnCTHTTje* TV Of n/VrtMOVTUVT VT T 

CJbAnT&G VE5 * Ab isPCGQJNo I I Hx LiKlili 
CMEELS FPPYLL I YS IS YLY*HTIMVI Y 
FILVL*SIIXNLIYIiFVTQIFPALASGS 
SFNWLLCLFNTTLAVHVCVCVCVCVCVR 

VUiw 


3523 

• 


17424 


A 


3546 


397 

- 


1 

- 


KPPPPCSYG KKKKI FLLWFRTLPTKPP 
GAGRKIFPRGPGFFLKNPKAL*TTFAPP 

FLCSP*GGIGVSLFFFFFSETESCSVTQ 

AG VOS CDIjC5SLO PLP PGLKR 


3524 


17425 


A 


3547 


76 


354 


FL FNFMG KI FKNFRGG VFFFFFWGG YN I 
FLVFLKIFFFFFGGFFLPFFFFLKKFFF 
FTiGPF FGKTS OKI FF FFVN* I FI1I1XXGA 

PPIiFFFFFL 


3525 


17426 


A 


3548 


197 


1 


INKMKQQRMD*EKIFAKHVSHKGLISQI 
CK* LIHLNSEKTNS PVRKWAEDLNRHFS 
KRDTKMVNM 


3326 


17427 




354* 


173 


•JAO 

3uy 


GWSAVA* SLLTAASTRP 


3527 


17428 


A 


3550 


3 


196 


GFHRVSQDGLDLLTS*SACVSFPKCWDY 
SHEPPRPAHKTSLKKNKMRVRI IIMMTI 
TYPAPWC 


3528 


17429 


A 


3551 


3 


360 


RAVE I FC YDVCVESGCAD I QUaliLCPTS 
IKVCFSSLSLDIVTVIPDIYICIYIYII 
FFFFFKQAP^* CQCIiNDKE*YCSLWPTP 
PGFKKLSCVSLPSIWDLRCAPSHPPNFC 
VFGKNGV 


3529 


17430 


A 


3552 


326 


1 


KDNHRILKIiMLAKSGKVRHYVFDFGKQI 
SKMSEKNIGVKPPIGKEKTKNPKKKPSR 
TEDPKICTYVL* PDKKKWRRFLNT*GIQ 
LG*KE IKLSLFADDMI VYLBNP I V 
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SEQ 10 
NO: of 
nucleotide 
sequence 

• 


SEQIP 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to tirst 
amino acid 
residue of 


Predict- 
ed end 
nude- • 
otide 

location . 
correspon 
ding to 
last amino 


Amino acid sequence (A=Alanioe 
OCysteine, D=Aspartic Acid, 
£r=Oiutamic Acia, t— Phenylalanine, 
G=Glycine, U=Histidine, I=Isoleucine, 
K^Lysine, LF=Leucine, MHVlethionine, 
jn«= Aspa ragi n e, jn 3 j*ro ii ne, 
Q=G!utamine, R=Arginine, S^Serine, 
T=Threonfne, V^Vallne, 










peptide 
sequence 


acid 

residue of 

peptide 

sequence 


W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


3530 

• 


17431 


A 


3553 


186 


1 


RPRRLPI KLPMTFFTELEKTT FKFI *NQ 
KRVR IAKS I L>5QKNEVGG ITP PDFKL* Y 
KDTVTN 


3531 


17432 


A 


3554 


3 


366 


KTSLGNMVKP YNF FV * TFY I * * KKBIKK 
NSIGVL+PGTEQSKHEFKKILPSKRNKI 
ASKRIKYL*INLT*EGQCMYTENYKTLL 
KEI IBDLNKYKDI PCS * I IRI^LVMMAV 
LTKLIYRFN 


3532 


17433 


A 


3555 


239 


1 


GRGKMLPVS LtNI TRVTFAP CSdS I ILV 
LIALPSLRILYITDEVNDPSLTIKSIGH 
Q WY*TYE YTD YGGIi I FNSY II*PP 


3533 


17434 


A 


3556 


3 


123 


QA3LE V* T S SDLPTS VSQSAG I TGGSHR 
ARPANIINSVP 


3534 


17435 


A 


3557 


237 


2 


KGFSPPTQKFGFLGVFPPQKKKTKKNPR 
PPKGLTPVS*KGMI/TEPLEPFFFFFLRQ 
SRSVAQAGVQWGNLGS LQALP P 


3535 


17436 


A 


3558 


87 


379 


KKKKKKKFTLQKPVGFLNPPNGEAKSQI 
INELLLK I ATKKKKNLKLRIi KKKGKDJUF 
RGNYNPLFKKIKDDPTNGKNFPCS*MGK 
ISFEKMTLLPKAI , 


3536 


17437 

■ 


A 


3559 


390 


181 


KKKKKNPGGGGPPFFPTFRGG*GGGPPQ 
PGGGGAPKv* ISPPPPPPGGKKSPPQKK 
KKKKKKKGPFLPHL 


3537 


17438 


A 


3560 


2 


101 


VIVTAHAFVI IFFI VI PI I IGGFGN*I*V 
PVDAA 


3538 


17439 


A 


3561 


2 


126 


INTLLALLL I I I TF * LPQLNG Y I EKSTP 
YECGFDPISRRRG 


3539 


17440 

• 


A 


3562 

• 


16 

+ 


349 


FAFQKITLAMEKATEAQTGGAKKACQGT 

fttfttTUTfCIT wffwfl"f lOTIfllTOMTODBBUf illTJCJ 

NNIMKFLKKKCI PTTI FWISFFS FLFFF 
FETKPHFFNWPEGKGPNLG* IiKPIiPLGL 
KQFCLTLPRKGNYRHVLPPPVNGFFFF 


3540 


17441 


A 


3563 


6 


130 


TLRRILEDHFSLRS *GCGEPCIiRHCMPA 
WLTE*DPVSKNIY 


3541 


17442 




3564 


3 


152 


G FHCVRQDLDLLTS * S AHLSLPKCWD YR 
RKPPHPAGKGTYFYIDYSTIR 


3542 


17443 


A 


3565 


241 

- 


3 


NLVSHVI SMRKVINLVEQNVTSSNRSNH 
TWLSFIKLYFI 1 1 1 S I VFGEQVVFGYVA 
l^rSUDF*Dr\5VFI lui Vi XAJra 


3543 

* 


17444 


A 


3566 


71 


256 


S VFLLTVFLYPLATS FFFLNRVS LCCPG 
WL*TPGLKQFSCLSLPKRWNYRHEPLTP 
GSNrr 


3544 


17445 


A 


3567 


1 


392 


AGAGTSGTD YPPLS GNYSHPGACVDLTI 
FSLHLflGVASMLGAISVMTTVMNIEPPA 
ITQYQTPLFV*SGRNTAGr,T,T,T,CVPVIA 
AGITILLTDRNLNTTFFDPAGGGDPILY 


3545 


17446 


A 


3568 


334 


409 


LGTVAHACN?STLGG*GGHITRSGV 


3546 


17447 


A 


3569 


3 


467 


PKNPPGGFFSPL*EKKYILPPPP* IWPP 

Dr3Pl?+ira DPP PF , T7TTl?FVP , F , lS , T7FTt , PTT7nT 
x^or i* iv-rur trtrr c rrrirrrrrrr riryi 

YFITF 


3547 


17448 


A 


3570 


3 


420 


FFDPAGGGDPILYQHLF * FFGHPEEYML 
ILPGFGILSHIATYYSGKKEPFGYIGMV 
*AMISIGFLGIXV*AHHIFTG*IDVDTR 
AYFTftSATI I IAI PTGVEVFS *LATLHGS 
NMK* SAAVLS ALG F I FLFTRRGLTS IV 


354$ 


17449 


A 


3571 


1 


418 


NLINHSFIDLPTPSNI FA* *NFGSLLGA 
CLILQITTGLFLAMHY* PDASTAFSS IG 
HITRDVHYG* IIRYLHAKGASIFFICLF 
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SEQID 
NO: of 

■ A. * _■ 

nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M. 
eth 
od 

• 

■ 


SEQID 
NO: in 
JUSSN 
09/515,1 
20 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 
£/=<jriutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, f^Isoleudne, 
K=iiysine, u= Leucine, M—lYletnionine, 
N=Asparagine, P=Proline, 
^Glutomine, R=Arginine, S=Serine, 
^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /"possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LHIGRGLYYGJ^FLY*ET*NIGIILLLAT 
IATAFIGYVI»PGGQISF*GATVITNLL 


3549 


17450 

• 


A 


3572 . 


3 


414 


SSGIiAM * FHPHS ITIiLIIXJLLTNTLTI Y 
Q* WRDVTRBSTY QGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSLAPTPQ 
LGGHWP PTGI TPLKTPLE VPLLNTSVIiLA 
SGVSIT*AHHSLIENNRNQI IQALL 


3550 


17451 

* 


A 


3573 


2 


423 


GTLYLLFGA*AGVLGTALILLIRAELGQ 
PGNLVGNDHI YNGI VTAHAFVI I FFIVI 
PIIJGGFGN*LVPLIIGAPEMAFPRINN 
ISL*LLPPSLLLLLASArVEA0AC3TG*T 
VYP PLAGNYSHPGAYVDLT I FCLHLAOV 


3551 


17453 


A 


3574 


411 


117 


DGGPLLQLLRRARQENCFNPRGGGCS* P 
RLCPCPPAWGAKLNSLSGKKKKKKRHVE 
KYAQHDYI INVLCWCVHIKWLSLYMYS 
ITKS IDKXCKKAS 


3552 


17453 


A 


3575 


2 


394 


ALANMWEQIRSIiHQYAVHRI I SLFSLLS 
KKHDRVLEQATQYLRGSLNTNDVPLPDY 
AQDL TV I BEL I PMMLE I INS * LTNSLHH 
IPNLVYALLY*RDLFEQFRTHPSFQDIM 
QNIDLVSVNEDIYYDSFLS 


3553 


17454 


A 


3576 


262 


418 


GWRE WTEL I I FY I F FETTL I PTLAI 1 TR 
* GNQ PERLNAGT YFLFYTLVGSLP 


3554 


17455 


A 


3577 


232 

■ 


443 


PSVQTFPCCPLSEEGPWLLCVAMTPSPG 
VPVPTERALYSMECAFHPLFSLTSGACR 
LD YRTP DNR * VQL P 


3555 


17456 


A 


3578 


48 


410 


GGLLLHRAGCWCGHADRRADQPDSSDPH 
SLHAACLPPPGAARAAGAPSPA*TPAAA 
CPAPVPAPCSEHRC* PPGLSQRPCLPGL 
P DAR P PGAAAHGGWS LPTRLVLHDRSHP 
EEAPRRHED 


3556 


17457 


A 


3579 


422 


186 


VYSKFL*SQDIJUlIiRQENRIiNPGSRGCR 
EPRS*HCIPAWVTQQDSLSISSSTSHBI 
CGSHCSPGYLTVHICDQPVFSL 


3557 

• 


17458 

■ 


A 


3580 

- 


442 

• 


3 


GSAIS SRCTQESTYQGPHTP PVKKGLRY 
GIILFITSEVFFLAGFIi*AFSHSTIiAPT 
SQLGGHWPPTGITPLNPLEVPLLNTSVL 
IiASGVS X T * AHHSL I ENNRNQ I IQALLI 
TILLGLYFTLLQASE*LESPFTISDGMY 
CGRSRG 


3558 


17459 


A 


3581 


428 


1 

k 

\ ~- 


1 KU i tfMAS S IXiM * FHFTPITiili lLGitii 1 
TPLT I YQ * WRDVSRESTYQGHLTPPVQK 
GLRYGI I LFITSEVFFFAGFF * AFYHSS 
LAPTPQLGGHWPPTGITPLNPLEVPLIiN 
TSALLASGVS IT * AHHSLIENNRCIAAA 
LE 


3559 


17460 


A 


3582 


160 


415 


RCLSHCLXt-PMLS ILFIPSQLLSLPFHV 
KWIFPFFFLNKKFNCVPQAGGQGKNLI* 
liHPPPRGliKKFSCLRITSS *NYGREPOP 

line £ ri\\juiu\4 wwiukitai^M iifc »»t 

s 


3560 


17461 


A 


3583 


399 


2 


STHLGLPKCWDYRCEPPRPALLAIVKLL j 
CKTIHVRINNVCCLPLLTGAASVIGLSA 
SLLFSVS PGTSLQLEAR*TFPWPHMSST 
CVWRFI FTSLGRAQPQHSDLQTCPRRDV 
DLAEPGLLFNGQMLVPPGRS 


3561 


17462 


A 


3584 


215 


1 


KSPGLLNAFSRG£QIYQASRAMGA*LNQ 
RFWPGTVAHACNPSTLGG*GARIMRSGD 
GDRNS KDGVS PCGRV 
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SEQ1D 
NO: of 
nucleotide 
sequence 

■ 


SEQIB 
NO: of 
peptide 
sequence 

- 


M 

eth 

od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalamne, 
G^GIycine, H=Histidine, IHboleudne, 
K=Lysme, L=Leucine, MHMethionine, 
N=*Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


3562 


17463 


A 


3585 


138 


19 


LNQRFW PGTVAHACNPRPLGG *GAR I MR 
SGDVYRNSKDEV 


3563 


17464 


A 


3586 

• 


262 


3 


GRPEVFYKPPPRGEFFFFFFFLFFFFFF 
FFFFFFVI*FFFFFFFFFFFFFFFFFF * Q 
VRLYLLEKYBLFFFFLFYPQEISLCCPG 
WS 


3564 

• 


17465 


A 


3587 


381 


1 


KKFTPPPPRKMGPPPGFF*RAPPFFFFL 
GGGGGPLFFLGGPPPPGGGFFFSPPPPV 
FF*KSPPLFSFF*KKRGPPFFKRGGSPK 
TPPCFFFKKKNPPKKKKKKKKKKKKKKK 
GRPSS KSDAWVRTRG 


3565 


17466 

* 


A 


3588 


198 


1 


PKS PPRPLARGKEPNFFSKKKKKKS I *N 
LNTNS WLDMVAH I CNPS TSGGRGG * PHL 
RSGVRDQPGQ 


3566 


17467 


A 


3589 


373 


1 


GIFFPPPKRGGFPPPPPKKFFFFPGGFI 
FGGGGGPFWPPPRKGFFLKKKKGVF*SP 
P* KGKKFFFFPGGGVGP PRGFLKGG PP I 
IFFFFFFFFFFFFFFFFFPATIFLITYK 
DLTCHPGTHFTH 


3567 


17468 

• 


A 


3590 

• 


506 

4 


98 

• 


HTSRGPPPPPKKKIPPTRPFFFCCFWW 
FF PPKKKFFFFCS PRS F IHS P PKKKKKN 
FFFPKGGGPPLS P FLTP PPPL FFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF*FFFFF*LFVSLFS I 
FVLLIFFLFYNAITTI 


3568 


17469 


A 


3591 


407 


1 


LVRGQITNKKNLFNFLGEELREWRWFF 
LPQQRIiFFFKL* LQ* KEI YLFI * KMTCS 
RGGAGFKSSLYFFFRSPPQWLFFFFFF 
IFFYFF*FFIFFFFFFFFFFFFCNDFMI 
YLYISPPLPILEKKKGGRSRSRS 


3569 


17470 


A 


3592 


380 


68 


FRRGVGPQWPPPQKKGFSQKPPGGFKRP 
PLKGKKITFPPPGKFGPPKGPFKRPPPF 
FFLEGKTLFLGPDP* *NLGWGG*RGHGK 
♦GPANPRVPFVFFFFEEYSY 


3570 

• 


17471 


A 


3593 


394 


45 


ALMFSTGQEGRQEKHPPPHQKKKASRGE 
EPRGGVRKNYPPPKKKSFPKKNPPGVTP 
PKKKKKAKLPPRPL*GGPQRRPQKTAPP 
PliNFYPPKEPTKVLS IFFSFFFFFSAND 
VELY 


3571 


17472 


A 


3594 

• 


402 


62 


AM I VLLHS SLGNKAR PCLFKKKWAKDLN 
KHFS KEDSQMANKYMKRCLI S LVIRNMQ 
MKITVRCHFILTRMAKIKKMVNS KCRKE 
VQKI PTLKHCWWECKIV* ILWTNLAARV 
G 


3572 


17473 


A 


3595 


220 


425 


FI SG FIiNLQEERE I AFFFKVN IRIX3AW 
HTCNPSTLGGRGRRIT*SPGV*DQPGQH 
GEPCLYLNLQKN 


3573 


17474 


A 


3594 


13 


412 


AEIAPLRSIHRARNCLKKKGRKGRKNPG 
GDGGNPQ IGPAFFGGGNGKTPGQKPHQP 
QTWGQGRHLKGGGGGPKRKGVPPKKKTP 
T I WGGTPIAPGVE PNWWGHTGNPGGADP 
KPPGREK*SGGAQITGPHTGP 


3574 


17475 


A 


3597 


2 


293 


CQLSP*ELGTPRERERERERERERERER 
ERERLP PRDLREKRAQDVFATHPNYEFV 
CVTRHYRGCWYTPLTTAHS IDAHKMCV 
CVKKRPLYKKKkK 


3575 


17476 


A 


3598 


243 


9 


ILLRQEKQCWSKGTRFQLQDEQVLNT*D 
RT WEVMWPGAVTHACNP STLGGRGGRXM 
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SEQ 10 
NO: of 
nucleotide 

i 

4 tXI UCIICC 

■ 


SEQ IB 
NO: of 
pepouc 

SClg U CI ICC 

i 


M, 
etb 

OU 

- 


SEQID 
NO: in 

09/515 1 
26 


Predicted 
beginning 

n ii i* Icvntf t% o 
II uctcuuuc 

iv^n tivu 

correspond 
ing to first 
amino acid 

I Colli uc Ul 

peptide 
sequence 


Predict- 
ed end 

IJ U Llc~ 

VUUv 

location 
correspon 
ding to 

1 a c+ n m inn 

add 

residue of 

pcpiiuu 

sequence 


Amino acid sequence (A=Alanine . 
C=Cysteine, D^Aspartic Add, 

ITsf^l ii f-ft m ir Afiri EV=Ph»nvlalon tnp 

Er^WIUUatlllw rtvlu y JT~JT I1CII VlalalllllC^ 

Cr == fZ\\c\i\&. H = H isf iri in** IssIfialcueiiiC- 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, PHProline, 
Q=GIutamine, R^Arginine, S=Serine, 

TsXhrpnninp V=VuIiiia 
11 ui cuiiiiiC) r > aiiuc t 

W=Tryptopban, Y=Tyrosine f 

X=Un known, *=Stop codon,/=possible 

miflAntirl a HaIp^iam Y=nneeiHlA 

nucleotide insertion 






-— 








RSRDRDS CRI HQGGSKPKSPR 


3576 


1 fylT7 
174/7 

* 


7k 




i 


5 Ay 


PGG PQRGPGKKGGKNF * S FGKPF* KGGG 
PPQFCPPPPRGGFFFFFPGGPPGBKKGG 
VKGPPPPFFFFWPPPPPNLFFPPGA 


3577 


17478 


A 


3600 


206 


1 


LRALGIMERAAHTEVRLVFIQILGSPL* 

VT T?*P fVH.QP<3PPT .17r , QflTT7A7iPP'4T.r>T . 

LSASNSPASASQ 


3578 


17479 


A 


3601 


322 


99 


KRTGAFGFFFPPLFNKKKKVFLGPEGPK 
I VKKKKKKEKKEKQNKNRKQTKiaC* KTH 


3579 


17480 - 


A 


3602 


410 


25 


LFFFKKKTGGGGGGPPEFFHPFGOVNQa 
VIRVGELSPPCP PGGTRFFFKKTKNYPG 
G*GGPFFPPPSGGGLGPKKTLSPRGGAS 
L* PNFGPSLPPKGKGGP PFSKKKKKKKK 

rtGTi TIDING *OT TJTJ/"»TOV 


3580 


17481 


A 


3603 


27 


349 


RSSIQCGGILLFYFLYFMVFNDIFWL* 
GFSAITSILISMLFSCI * YYFFKMYCLL 
eric KCnn I ETC xlir W M1\1 JmAUxIxN w J. if c K 
DAYLGYFFLFLFFFFFFGGKLYL 


3581 


17482 


A 


3604 

• 


88 

- 


435 


PAISTCLQNEQYKWLTTITSVEFCSLNDL 
TELKTMV* KLHDKCTSFS S RF I Q LEER I 

STMVRKQDKLRVKDAGI KHRNQNKGFTA 


3582 


17483 


A 


3605 


331 


428 


YGHYGQI EATFLPAI TTEKETLKTRKLT 
* SS INLLiK*T*NVRPGTVGHACNLSTLG 


3583 


17484 


A 




3 


294 


GX3*GCSEL*SCHCTPSSLG*NTVFJKKER 
KKGKREREKERKRE I IPGFGPCAHHKGA 
VTLGNLFPRVQVSQLVQRRKI I ITVIPD 
RLGEIVLVSQSL 


3584 


17485 


A 


3607 


263 


426 


RKIiAGTFPFLWKSGPFQTILAAAQRMGG 


3585 


17486 


A 


3608 


213 


439 


LSWACAAEWTVTQPANFPLDYILPFFLL 
IRES S KF * RHMATQLETAASRPG WVHA 


3586 


17487 


A 


3609 


2 


441 


MTDLESFPIiE SELSANSP VQCSGLCKPV 

HLSNSVLPRLLSGQRTETQKHFYTAVLS 
VCT.S T YLSRDRVIiIiCYPGWSAVM* FYSL 
*PQIHGLKQSSCLSLPKC*DYQHGISPP 
WCN 


3587 


17488 


A . 


3610 


102 


354 

• 


EPGAWADRCWLGRVLLTHLLWP CCLYFF 
YFIIFILNSGDRVSLCCPRWSCLK*SSC 
LSLPECWDYRCQPLHPACAVFIENSKR 


ocoo 
J5o8 


17489 


TV 


3611 


462 


7*71 
ill 


ws 




17490 


A 


3612 


*T\JJ 


319 


RNSHDGLELLTL* SAHIjGLPKCWDYRCE 
PPRPVLC 


3590 


17491 


A 


3613 


49 


476 


PRNSITLLILGLLTNTLTLYQGWFDVTR 
ESTYQGHHTPPVQKGLRYGI ilfitsev 
FFFAGLF * S FYHSRIiAPTPQLGGHWPPT 
GITPLNPLEVPlXNTSVLLASGVSiT*A 
IMS L I ENNRNQLMQ ALL IT I LLGIjYFTL 
LQ 


3591 


17492 


A 


3614 


223 


467 


TPS * PLAI I * FI STLABTNRTPFDLAER 
ESELVSGFNIEYAAGPFALFFIAEYTNI 



770 



ft 
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SEQ ID 
NO: of 
nucieotiae 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 

TIC CM 

09/515,1 
26 


Predicted 

beginning 

Ducieoiiae 

location 

correspond 

ing to first 

amino acta 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanlne 
C=Cystefne, D=Aspartic Acid, 
tLr- ijjutamic aciu, r==rnenyiaiantne, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M° Methionine, 
N^Asparagine, P^Proiine, 
v^vriuiamine, xv— /vrginine, o=5»enne» 
T=Threonlne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon,/=possib)e 
nucleotide deletion, ^possible 
nucieotiae insertion 


- 


• 










MI INTLTTTI FIXSTTYTJALSPELYTTYF 
VTKTIiLLTCLFL* IRTAYPRFRYDQLIH 
LL*BNFLPLTIiALJjI*YVSIPITISSIP 

n 
tr 


3592 


17493 


A 


3615 


2 


405 


ARGLQEAjGAVQNDPVSRLPDHAMLQGHR 
AHQMAI VTYQE FGETYI PKDQKY* FLHE 
SQTYFCFLNS I PTPSSMEETQHRSSLEL 
LRISLLLIESWLEPVRFLRSMNANNLVY 
DTSDRDD YHT JjKDLEEG IQTLM 


3593 


17494 


A 


361<J 

* 


328 


63 


PSKF*KPCLISPPPLGPGPPN*TPGFLiT" 
KPKFPPRGGAHP*SPLLFRVRPGDPLNP 
RGQGFP* PKLPPGPPTRATKPNFVSQKK 
KKKB 


3594 


17495 


A 


3617 

■ 


1 

■ 


379 


GTSVG I PTGGQE FS * FAALHGSNVK* S A 
AAXi* ALR I MI LFTVRGLTGIGLANSS LD 
I VLHDTD YVEAHFHYVLS IGAVFAI IGG 
FIH* FPLFIiGYTLDQTYAKSHFTI I FIG 
VSVTLFPQHFLGLC 


3595 


17496 


A 


3618 


494 


17 


GSANPRNPRRSGSKTRETSPRNVMSLMS 
PMSPTLPMRPTSPGTPMHLTNSMGPMQP 
PNSSRPVNPASSATEJVSPASPARPRKPS 
CFPAPQ * * S P PL* S PEPRHGLS CLS WVK 
KECETWHWSMHI VEDRHS KPYHGPGMV 
AHACNPSTLGG 


3596 


17497 


A 


3619 


350 


480 


LGS WD YVCKP PHLAVLIiIj* RQS FVLXiPR 
LVTNS*AQVIHLPWP 


3597 


17498 


A 


3620 


2 


400 


AAAEPHGCYAKVLDEFKPLGEEP*NLIK 
QNCELFEQLGEYKFQNALLGRYTKKVPQ 
VSTPTLVEVS*NLGKVGSKCCKHPEAKR 
MP*AEDYI*SWLNQLCVMHBKTPVSDRV 
TKLLTES LVNRRPCFSALE VK 


3598 


17499 


A 


3621 


3 


476 


LGYSVHPPQPLAQVLSPPLPGASRAGWL 
LRMRGPP S PHP PRTPAGLQAPHAALVPA 
RAS PSTPPCXLRE WAPALAS PERGSHSA 
VGG*RAPQMPPKWEPRQGRCQEQARALR 
TASMLSPLSFTHPVTLSLLNFPTSLPPR 
NSPSWPWSVR 


3599 


17500 


A 

• 


3622 


40 

• 


390 

- 


MKGALTQPGRAGGLGSQSQMDMASTPKP 
SPDDQGCS IHIiWGFPFLVSXiLPPSLSPS 
P PS PPRHS VGTHCAP * VLPRP LGALRVP 
HSS VPKPHAVP WQ T T XR X S RGAN VuS Jj 
DFRGL 


*> /Art 

3600 


17501 




3623 


52 


475 


Avar JrUjJjr XaUxr ii&l iMliLio V UlUijF^JjA 

KSPK*NQI ICDLFVPFHliA*CF*DLFML 

EHVSVLHFMAE* FFFFFF*KGVSliFSPG 

WRAGAQFWLTKTPPHSPARLKQFSCLTL 
oncwMVOT? ik DDC'D2iWT?i?irf?criin?^ii? , rTT. 

irVjro W IN I KJCfvf xro lr AW r C V r OljAJu»r 

P 




1 /jUa 


"S — 




IT* 


*rDO 


S I IGKIRLWKMRPWPGAVAOACDPGTLR 
G*GKRITRSGVRDRPG 


3602 


17503 


A 


3625 


2 


138 


LRRGNRLNPGGGGCGEP * SRHCTPASST 
E * DSVS KNQKTKNS QWH 


3603 


17504 


A 


3626 


3 


337 


LSLRPPALPPCPPRRLPTPGQGHPGSAG 
FPGRPTL*RLPPLPDPPPSLGLSPGDDP 
GLAREELKSPCPWGPHPLCHLPFP* CMA 


3604 


17505 


A 


3627 


2 


393 


LGTDLSLL IRFELGQPGNLLGNDHI YNV 
IVTAHAFVIIFFIVIPIIIGGFGN*LFP 
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SEQ ID 
NO: of 

nucicuEiuc ( 
sequence 


SEQID 
NO: of 

sequence 


M. 
eth 

OU 

* ■ ■ — » 


SEQID 
NO: la 

boon 

09/515,1 
26 

• 


Predicted 
beginning 

DUGicOUQc 

location 
correspond 

■no ♦ n fipet 

ing w iirsi 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 

currcspun 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
Hj v> i u lu in i l nviU| f— jr Denyiaisnine, 
G=G!ycine, U=Hbttdine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, ^Proline, 
Q=Glutamine, R-Argioioe, S^Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possib(e 
nu cleotide deletion, \=possi ble 
nucleotide insertion 














LIIGAPDMALPRINNISL*IjLPPSIi: 
I AS AI VEAGAGTG * TGYPPLAGNYS1 
ALVDLT I F YLHLARVCS I 


LLL 


i 


3605 


17506 

* 


A 


3628 


*> 
2 


379 


QDATYPIIEELITFHEHALIIICLlCFL 
ALDALFLTLTTKLTNTOI SDAQE IETV+ 
TMLPAI ILIIiIALPCLRILYITDBGNDP 
SLTIK* IGHQWY+TYEOTDYGGIiIFNSY 
ILPPLFLKPGDLRL 


3606 


17507 


A 


3629 


1 


400 


PTRPPTRPPTRPMTNTLNS IHPPLP] 
KKKKKKKKKKKKKKKKKKKKKTAAO 
* KKKNRGAKTHRGS KKLNFFFKKRK 
PPVKIEKKTFFLGGKKKNKPPQKNNi 
KKKNYFEGEGESLLLL 


R*K 
3GA 

HLK 


► 


3607 


17508 


A 

• 


3630 • 


416 


1 


SQLGYSGVRDPIiEEATCIiFSDIiKIjHAGR 
TTPLFQAVRQGHL I LQRFLQP FA* LCPA 
PRGGVYRGRQASLS CGGLHPVRASWPLC 
LPTQFSAMEGAPPLALLPRSSS ISDYCA 
SNEQGSVGIGPSEPCTGYNLLVCHLL 


3608 


17509 


A 


3631 


275 


83 


RWGSCYWIAGLQLLGSSDSPALASQSA 
GITGVSHCTQPKOTI *KISHLTVAGPAA 
PKTPQTPR 


3609 


17510 


A 


3632 


186 


3 


FFFFFFLFFFFFFFFFFFFFFFFLFFFF 
FFFPFFPFFFLVCQVPLLCNL*HLVYLF 
YMVL 


3610 


17511 


A 


3633 


3 


235 


GIISFLLIR**YARADANTAAIRAILYN i 
RIGD I GF I LALA* F I IiHSNS *DPQQ IAI 
LPHPSDSKKYLYLSHIQLKKA 


3611 


17512 


A 


3634 


1 


428 


RTLKTTLFNPDGG*DPILYQHLF*FFGH 
PEVYI I ILPRFR* I SHIVTYYS GKNE PF 
GYISMV*AMISLGCLGVIA+AHHIFTV* 
IDVDTRAYLTFATIMIAI PTAVRALS *L 
APliHRSN I K* SAAKL* ALRLI FLFTVGG 
QT 


3612 


17513 


A 


3635 


2 


129 


PVIYSTIFAGTLITALSSH*FFT*VGI*E 
INMLAFI PVLGFWD 


3613 


17514 


A 


3636 


2 


422 


DR* LFSTNHKDIGTLYLLYGA*AVVLGT 
ALNLLIRAELGQPGNLLGNDHIYNVIVT 
AHAFGI ILFIVIPI I IGGFGN*LDPLI I ; 
GAPDMAF PRIKN 1 3 C » LLPPSLLI.T.T AS 
AI MEAGARTG *TAY P PLAGNYSHPGAS V 


3614 


17515 


A 


3637 

• 


14 

• 


479 


QNCKINNFPMYFLPHQSEFPRKSCKQQN 
TICFRLSNQOTmKTTENHALLWIRKWQ 
T* KQQCKSNSGYTPSTPSRAKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKPGGGV 
LKKL 


3615 


17516 


A 


3638 


2 


449 


1 1 1 1 CFVGLCALVLTLTTKLTNANILDA 
QEVETG * A I LP AI MLVT I ALPC LRILYI 
TDEVNDPSLTIK* IGHQWY*AYEYTOYG 
GLMFNA YI L PP LFLKPGDLRLLDGDNRV 
VLPNBAP IRIVMTSQDVMHS *AVRTLSL 
RTDAIPGRL 


3616 


17517 


A 

• 


3639 


347 


98 


HFFLGGGKGGGFSPFPPQKFFFFLRGFF 
FGGGGGPNFPPPKKGFFSKKPQGVF*TP 
PKKKKKFFFFPRGFWGPPGI FFKGPPP 


3617 


17518 


""A ■' 


3640 


2 


426 


DR *LFSTNHKD I VTLYLLFGA* AGGLCT 
ALSLLIRAELGQPGNLLGNDHIYNVIVT 
AHAFVIIFFIGIPIIIGGCGN*LVPLII 
GAPDMAFPRINNI S F * LLP P SLLLLLAF 
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SEQW 
NO: of 
nucleotide 
sequence 


SEQW 
NO: of 
peptide 
sequence 

• 

■ 


M 

eth 

od 

■ 


SEQID 
NO: in 

T1CCKT 

USolN 
09/515,1 


Predicted 
beginning 
nucleotide 
location 

mm m*d h n 1 O ■ fc «4 

correspona 
ing to first 
amino acid 
residue ot 
peptide 
sequence 

< 


Predict- 
ed end 
nude- ■ 
otide 
location 
correspon 
ding to 
last ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysieine, D=Aspartic Acid, 
Jcr=ijiutamic Acid, r=rnenylalanme, 
G=Glycine, H=Histidine, l=lsoleucine, 

l« I* « (MIMA V m T A hB AS n A ^ A lL * ^ T , 

k.— Lysine, jl— Leucine, ivi— ivietn jon ine, 
N s Asparagine, P=Proline, 
Q=GIutamine, R^Arginine, S=Serine, 
1— lhreonlne, V— valine, 
W-Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














AI VEAGAGTG * TVYPPLAGNYS HPGACA 
D 


3618 


17519 


A 


3641 


94 


294 


GRGGGGVAGGRGAM WS KGRQRGRNGP VR 
GREGGRKSHFRKEG*KEGRKEPPPPPKK 
KSRGEKNQREG 


3619 


17520 


A 


3642 


1 


405 


RTRGFDLBRFVLFDVAQQMVESGYVCEG 
DHKTMANAI IDRVSLIKRKREQRQLVRE 
EQBKKKQEESSLKQQVBQSSASQTGIKQ 
LPFADTGIPTASTTSASVSTQVEPEEPE 
AD*HEQLQYQQPSLSVLSDGTVD 


3620 


17521 


A 


3643 


34 


405 


HLLGRKQNKAI FLEAEKKKKKKKKKKKG 
GPLKKNPGGAKNNRGGEKKIFSP*GGVK 
KTPLGI PEKKPY FGGGKKffANPPKKIKT 
LKBKKKF*RGKGGKKPENPWGEKDESSS 
PK 


3621 


17522 


A 


3644 


430 


81 


EFAQLSPPQIKGFPSPPPNFLPPSLPGF 
FSPFSP*KFFFSLKGFIFVGGFFPFFSP 
PKKSFFSKNPQLVFISPPFKKKIFI/FPP 
PLNFGPPRVFFKRPPPFFFFFFF 


3622 


17523 


A 


3645 


399 


2 


AFFFTKKKKEFFPPFPFKNFFFFSRVFF 
FTjGGFFHFFPPQKK^FFLKIPRGFFLTP 
PKKKKIFFFFPR* FLAPPGFFL*APPPF 
FFFFFFFFFFFFFFFFFFSPCSWSCHQV 
FPHASNRIHNPSNSYPLQQY 


3623 


17524 


A 


3646 


49 


326 


KKKKKKKKKKRGGPFKKTPGGAQKNPGG 
EKKFFFL*GGHKKPPRGI FKKKPFFGGG 
KNWTTPPQKNKAFGGKKKFLGGKGEKPP 
PKPRVKKN 


3624 


17525 




3647 


327 


94 


QKKKKKFFPPPPPKNFFFSPRGFFFGGG 
GGPFSPPQKKGFFSKNPGGVFLPPPKKK 
KI FFFPPGGFGAPPGFF*RGPP 


3625 


17526 


A 


3648 


2 


301 


TSSVASTFIMSLFPTTIFMCLDQEGIIW 
N* H * ATTQTTQL YIjS FKLDYFS I IF I P V 
ALFVTWPIIEVSL*YINSDPNINQFFKY 
LLIFLITILILVTGC 


3626 

♦ • 


17527 


A 


3649 


2 


406 


STNHKD IGTL YLLFGA* AG IIjGTALNLL 
IRAELGQPGNLVGNDHIYNVTVTAHAPG 
I IFFIVI PI I IGGFGN*LVPLI IGAPDM 
EFARINNTSF^LLPPSLLLLLGCATVEA 
GAGTG * TVYPPLAGNYSHPGACV 


3627 


17528 


A 


3650 

• 


3 


418 


HAYADAWVHFHS ITLLILGLLTNTLTIY 
Q*WRDVSRESTYQGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSLAPTPQ 
LGGHWPPTGITPLNTLEVPLLNTSVLLA 
SGVSIT*AHHSLIENNRNQIIQAIiLI 


3628 


17529 


A 


3651 


526 


2 


WGGGEKGRAGGAGEETPHQESQRQGCAL 
PQEPPLYPPCPAIHPPPLPLGGLLLFPS 
LPPLP*AAWFPHPGQPGRNLNGESPTTR 

nwTjT TijO. r>TJT Of HT T3tIDO«TV > »T> r P'DT CDDTP 

r>l\KJu WSafirXjir 1 irltif Wr i*i y Jrl irij&KKXG 

CPPCPREVPTPSPPWAEPPPLPGPPSEI 
AHLTTVRRG AALANVNHEN I S DTDENKL 
NAFVTA 


3629 


17530 


A 


3652 


464 


297 


RNSFHRVSQDDIiDIjIjTL*S PCLGLPKCW j 
DYRCEPLLPASYIDVS 


3630 


17531 


A 


3653 


162 


467 


nskkqqfvllsbitsyvywlhkrdldsh 
tiimgdfntplstldr3arqkvnkdiqe 
ln^alhqadlidiyrtlhpksteytffs 
aphctynflfqsyfssqf 
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seq n> 

NO: of 

n ii <• tontirl A 
n uc icuiiuc 

sequence 


SEQ ID 
NO: of 

sequence 


M. 
eth 

uu 


SEQ ID 
NO: in 

(09/515,1 
26 

• 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 
amino acid 

peptide 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

\ait Amino 

acid 

r*»«ifliif* of 

peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acfd. F^Phenvlalanine. 
OGIycine, H=Histidine, I=Iso leucine, 
K=Lysine, LHLeutine, M-Methionine, 
N=Asparagine, P^Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine. V=Valine 
W=Tryptophan, Y=Tyrosine, 

X~Un known. *= = Stoo corion /=no«sihle 

nucleotide deletion, ^possible 
nucleotide insertion 


3631 


17532 


A 


3654 

• 

+ 


372 


1 


LMMLSIFSCVHLAFVCFLWRRBM3GVSL 
GCPDYSGAAAVHRCNHTALQPQTPOLK* 
SSCLCKRHLQLRPQVYTTMLGYLPLFLF 
LRQSCSVAQAGVQWHDHSSLQP* TPGLK 
RSNSPTLV*DLD 


3632 


17533 


A 


3655 


444 


341 


VGQDGLLDLLTS + SACLGLPKCWDYRRE 
PPRLAY 


3633 


17534 


A 


3656 


466 


315 


VGHTGLELPTSS ELPA* AS BSAG I TX3VS 
HCTQQSGCAPCYSTMI PTKP YTC 


3634 


17535 


A ■ 


3657 


40 


276 


EFDTAHAFVIIFFIVIPIIIGGFGN*LY 
PLI IGAPDMAFPRINNI SF * LLP PSLLL 
LLASAIVEKKKNEKKKKTGAPLV 


3635 


17530 


a 

ft 


30 JO 


331 


4/U 


PTTT.PVTSA^TjPTiKTTiAWPCiMVAHACNP 
STLGG+GGRIT 


3636 


17537 


A 


3659 


40 


182 


G IRKENHLNLGGRGR I EPGSHHCTPAWA j 


3637 


17538 


A 


3660 


160 


461 


INGKDTDTPGMVAHAYNTSTLGGGGGGI . 


3638 


17539 


A 


3661 


145 


437 


YFSLSLCLLELMNS VI LFF PL VLERGFY 

PPPT.VFM(V5PNFR*T»WPPPPf»Iifv*OFF("*T. 

TLPKSWNNRPPPPRGGNFLDF*KKRGFN 
YVGRVDLKLRTSG 


3ojy 


1754U 


7V 


3O0i 






T.tiS FKTP FFPFFPFGKGGP PS LP PGGKBG 
GLNFI *TLGPQGKRNSPPQPPGEPG*RG 
PTPHPGLFF 


3640 


1 KA 1 

17541 


n 






DO 


PI^OPLRK*LYLFRDGVLLC!HPDWSAMM 
QSRLMATSAEF 


3641 


17542 


A 


3664 


42 


193 


FQLLTRLRQENHLNAGGTSCSEPRSHHC 
TPA*VTETOSOKOKVTCVLGFSL 


3642 


17543 


A 


3665 


216 


2 


KSKGKCSFLHSRVNRGNSKELLFLRDGV 
WRYCPAWS * TPGIjKQS PHFNI PKCWDYR 
REEFHHPGLWYBFE 


3643 


17544 


A 


3666 


43 


267 


GRLWSAMTPGKLKTLCKIDWPALteVGWP 
LEGSLDRS LVS KVWHKVTYKPRNPDOFP 
YRDT*LELVLDPPPPTHSG 


3644 


17545 


A 


3667 


212 


2 


SQLLGRAEARGWLEPRSLRKINKTSQVW 
WYTPVLPSSCDYRHTPPRPANFCIFSRD 
GVSPC*PGWSRTPD 


3645 


17546 


A 


3668 


104 


334 


TVl^PPCFLLHIDFCiynn.DFFSNFFVll^ 

YF*NYRRVQKMYEEFPFFLFFFLIFSFS 

FLFFLSTFNFFLCPLCDLWYH 


J040 


1 /54/ 


n 


sooy 




zou 


DGLDLLTL* STHLSLPKGWDYRHKPPCP 
ANMPSFKN 


3647 


17548 


A 


3670 


111 


342 


NGVSFFPPKKGGIWANGNFRFPPKPGG* 
PJjFFLA*PPQKPGMAGPPPPPQPVGIAGP 
PPHARBILEFFFFFFKKKGV 


1 £. AO 

3648 


17549 


TV 

ri 


Soli 


377 


3 


F JVT TrtrtF T H * FPVFPGYAIAOTYA1CTHF 
TIIFIGVNLAFFPQHFLGLSGMRRRYSD 
YPDAFTT*NILSSVGSFI SLTAVILI IF 
MI * EAFAS KRKVL I VEAPS INLE* LYGC 
PPPSHTLEDPVY 


3649 


17550 


A 


3672 


39 


348 


AKIAPLNSSLDERMRVSKKKKKGHGLTP 
KPFLKKKLGKNFPLSQLEGVGKKNQKPF 
G FKKGLELAP FNG KGNGENPNTWRWG * T 
MSFFFFF*DSHPFIQAGVQWRDLGSLQP 
RPPRFKRF 


3650 


17551 


A 


3673 


327 


3 


SKDPVFFKVFKKNPPl'PNIGVFPFALPL 
KRGQFQPLFNPKGFFFFFFKPVQLGRGK 
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SEQID 
NO: of 
nucicoiiae 
sequence 

• 

• 


SEQBD 
NO: of 
pep no e 
sequence 


M 

eth 

oa 


SEQID 
NO: in 

09/515,1 


Predicted 
beginning 

n ti /* 1 q/\ f"5 rl a 

nucicvnae 
location 

rnrrocnAtiil 

ing to first 

AtninA ftrirl 

residue of 

peptide 

sequence 

- 


Predict- 
ed end 
nucje- 
otide 

Inr q finn 
lUCUUQD 

correspon 

Hi no tt\ 
Ulllg IU 

last amino 
acid 

residue of 

pepuue 
sequence 


Amino acid sequence (A=AJanine 
OCysteine, INAspartic Acid, 

Hj — Kjrl It in W IC ACIO, P—rnenyialBntaBf 

G-GIycine, H=Histidine, I=Iso leucine, 
iv— jujroiiic, Lr-ixuunC) ivi— ivie in ion ine, 
N=Asparagine, P=Proline, 

t"fc=f^llifamif1 & RsA romln* o — c*.;»» 
wiuuuumc, rv /\rginiuc, i>~ ocrinc, 

T=Threonine, V=»Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucieoucie oeieiion, v— possiDie 
nucleotide insertion 














ISPOPPPKKRPWG+TKflPPPPP*DQTTPP 
IQAGVQWRDLGSLQPRPPRFKRF 


3651 


17552 


A 


3674 


379 


267 


EPRLS P CT PAWATE * DS I S KQNKTKKPK 
LALNKKKMF 


3652 


17553 


A 


3675 


2 . 


270 


SSWDYRHAPPGLAKIGGLHVFCLPCCGS 
LL VCWS SHCSLAHQBLLXtBNKHTDVWMG 
6WMDR 


3653 


H Iff A 

17554 


A 


3676 


3 


54 


I TRSGDRDHPG * HL 


3654 


17555 


A 


3677 

i 


291 


30 


YSLVLLIFFFYCFSISYFISLMNPCHNL 
YYFLLLDFGLVCS YFPS ILWHNEIYFIG 
SMFFIFSVI V* I FFCWEMSCNI YKLHQE 
Mr 


3655 


17556 


A 


3678 . 


1 


298 


RDFSASASQVAGTTGMHRHAQLI FVFRV 
CVCVCVCVCERBRERERDGVLLCRPDCC 
SVAHT * VQGRDLG S LQPPP PGFKRFFCL 
KLPSSWGHRCWDAP 


3656 

■ 


17557 

- 


A 


3679 


20 


376 

* 


VP P P HLANFLNF I LL FVDTGS YYVAWAG 

Birr t n^iur o t\ euro * o t r/"* ttvt 70 ttt ti\ t t 

FKLLAPCNLSASNS * S VGITGVSHHaIjL 
LLALKCNQLFQFVI FJELLHAVCIYDFIM 
VRFSLQCFPQFYDFPFCFILFVSSKAQF 

PET DT Ti V 


3657 


17558 


A 




110 


334 


NTTDYFKLP* ILSGTR*DMYKQPL*HLN 
TVREYTSKYGWRPGVVAHACNPSTLGGQ 
GER I TRS RDQDQ P CQHGET 


3658 


17559 


A 


3681 


326 


511 


LlTQMLQMKI SVI C CLLRNI FWPGaLAHA 
CNPSTLGS*GGRIARSRDLENSPSLLGS 
QPSS 


3659 


17560 


A 


3682 


448 


248 


AKDLNTFFTKE * TPLANKHMKKGS ARFT 
YRNVQI QTT*RCLGTPTRVTGV* MRNS A 
RHR*GCGGLV 


3660 


17561 


A 


3683 


486 


176 


PAPINP PQLRSLVQRSGIRFC * AGQVGF j 
KLLVbSUP PTiiASQSAGITGVSHUTWP w 
NYS *NSGMPTHPRQASS WRLREQWYHVS 
LLP CAY YKNGRQNS LL I 


3661 


17562 


A 


3684 


238 


27 


NKHILCYVGQEMGMGWGSHSHGWKPHFK 
S FQVCPSAVAHAQTPSTIjGG*GGQITLV 


3662 


17563 


A 


3685 


1 


299 


L ITQTCHVGCLKIKCI *TSKKKKKKKKK 
KKKKKARGPLKKKGPLKTPEKPPGGPFK 

AflrrWoljr'i'e'Kr 17 cwwurtf f ir VJ? LtJSJSJV 

KKNSPLWGGGGFFFR 


3663 


17564 


A 


3686 


ACSA 

494 


91 


DT/3VT3PDnnnT&T.T<VT.CD11U&U&DDr2TD 

iriJOAivr rvjijJJXAul JUJOlClrlinWAtrxrvXlr 

PPLGGPGGGVP*VTPFKPTPFRPGEPPL 
PLKNQT FFGVPGP PPYFPPLGGIiGPEI P 
FTPGGPGS IKLAPPPSPPPGAPKRNFFP 


JVAn 


17565 


A 

* * 


3687 


17 


249 


YS RFGRHFGHDPGL Y FO I LRRLRWBNHL ! 
NPGGRG CS KL* SKHCT PAWATE * DPVS * 
KKNVTTIAKRWKQLTCPLADG 


3665 


17566 


A 


3688 


296 


461 


FWLWVSSVLKHMNWPGVVTHAC^OTIiG 
GQGE* SA* ARV*DQPG*HGETCSAKEK 


3666 


17567 


A 


3689 


149 


422 


ASGRQGPDRGCWGGQAPRHPARGAKPAA 
DMGMSGQGYTKQAAP *GDP3QRAGVRLP 
REPTSRKaSGGSGGHSRASGGAVPGQRV 
AQPSPAR 


3667 


17568 


A 


3690 


1 


415 


GDPAGGGDP I LYQHLF* FFGHPEVYILI 
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SEQ n> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M. 
eth 
od 


SEQID 
NO: in 

1 tC CTVT 

VSSN - 

09/515,1 

26 

■ 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
"■amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, FNPhenyialanrae, 
G=Glycine, H=Histidine, I=Isoleudne, 
K=Lysine> L= Leucine, MNMethionine, 
N=*Asparagine, P=»Proline, 
Q^Glutamtne, R^Arginine, S=Serine, 
■^Threonine, V-Valine, 
W=Tryptophan f Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, possible 
nucleotide insertion 








• 






LPGFGI ISHIVTYYSGKKBPFGYIGMV* 
AMISIGFLRFiE*AHHIFTVGIDVDTRA 
YFTS AT 1 1 IAI PTGVKVFS * LATLHGSN 
MK*SAAVL*AL*FIFLFTVSGI/TGIV 


3668 


17569 


A 


3691 


1 


393 


GTRKY S FLHDSQTS FCLSNS I PTPSNME 
ETQQKSmjELLRISLLLIESWLBPVRFL 
KSMFANNLVYDTSDSDDYHLLKDL * EG I 
QTLMGRLEDGTRRTGHILKQTYSKFDTO 
S HNHDAL VKNYGLL YCFRK 


3669 

- 


17570 

< 


A 

• 


3692 

■ 


402 


2 

■ 


FLTQRWGAPLP I P P FWEAP FGPSP* APG 
FKPPLPPRGNPPFS * KTQI YPALGARPV 
IPPSLGGGGGESFLPPGGGFP*PRLFPC 
PPPRGAKPNFFFQKKKKKCLFCKAPQAQ 
HRSACLHECLPLGNLLKLCPR 


3670 


17571 


A 


3693 . 


3 


434 


I FYI F FGTTL I PTLAI ITR* GNQPERLN 
AGTYFLFYTLGGS LTLLZALI YTHNTLG 
SLNILLLTLTGQELSNS*ANNLI*LAYT 
IAFIVKIPLYGLHL*LPKAHGEAPIAGS 
IVLAAVLLKLGGYGI I RLTL I LNPLTKH 
IAYP 


3671 


17572 


A 


3694 


452 


208 


INKSARHGSMNL * SQLLRRLREVCLS PG 
VQG CI E PWSHHCTPAWETE * DKTLiSQKQ 
TNKKTCQHLYDHRPSALEATFRLSH 


3672 


17573 


A 


3695 


464 


146 


LKQTRATKGNLFLP*KPKLTPPGGFRPP 
FQPFWKPRPLGPPKSGVLDPPGPQGKTP 
FVLKKPNLLGGVI PTLFFPLSGGDORKI 
PSPRGGGGAKKRKP PRQKKKKK 


3673 


17574 

■ 


A 


3696 

• 


2 

• 


465 


VTRLRERPALLVSSTSWTEDEDFSILIiA 
ALEKFEQLTLDGHNIiPSL VCVI TGKG PL 
REYYS RL I HQKHFQH I QVCTP WLEAED Y 
PLLLGSAE PGVCLHTS SSGLDLPMKWD . 
MFGCRLPVCAVNFKCLHELVKHDENGLV 
FEDS * ELAAHLEKL 


3674 


17575 


A 


3697 


173 


51 


TFISWFGEQWFGCMEKFFSGDF*DFG 
APITQG*SFLSI 


3675 


17576 


A 


3698 


380 


26 


KQRTNFFLFFFKGTGAFWFFFPPFFNKK 
KKVFLGPGGPKIVKKKKKKEKKENQNKN 
RKQTKKK* KTNNKQNKKRETD I FGVEDE 
TWRQRWGLGGGRVANSSAEMGKGNSTTR 
PSWNS 


3676 


17577 


A 


3699 


11 


414 


RLWTAI Xj IQTP * S FTGAVI L 1 1 AHGLT 
SSLLFCLANSQQKKKKKK 


3677 


17578 


A 


3700 


1 


401 

4 


SSSLTLSNNNKSFLNQFVTFNEMWILYD 
NW* * PAQWLDQEAPKHFPKPNLDQKKVM 
FAVWWSAAGLIHYCFLTPGETITSEKXA 
QQFKEMNRKLQPLQSAMVNRKGPILHDN 
A* SHITQLKLQKLtf * LG YEVL 


3678 


17579 


A 


3701 


395 

• 


140 


TPPKKKKKI FPPPPKI FPPPKKYKKPPP 

FFFFFSIYIDFNLCICDNL*LQKTLVFL 
E 


3679 


17580 


A 


3702 


407 


371 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFLWYKDI I I KGKNKQNV* CS 


3680 


17581 


A 


3703 


36 

* 


409 


LLKCEGI WKKEGKKKKKRREKKGGPFFR 
VLLGGPTFRGGAHKKSLLFRGVFFNLGG . 
FFFEGPFFLGGALIiGPPPPPIFTPLGKK 
KI FKGKRGPPPLI FLFF* IRGPPPPPSP 
PSTKPPPPPPPTP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

OQ 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

uucieonuc 

location 

correspond 

ing to first 

amino acid 

ICS 111 UC 01 

peptide 

sequence 


Predict- 
ed end 

HUM A 

nucle- 
otide 
location 
correspon 
ding to 

111 ni- jfc mitt Jfc 

lasi amino 
acid 

resioue oi 

peptide 

sequence 


Amino acid sequence <A=Alanine 
OCysteine, D-Aspartic Acid, 
Ur^uiuiaroic Acta, r— rnenyiaiamne, 
G=Glycine, BHHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R«Arginine, S=Serine, 
i*- inreonine, v— valine. 
W=Tryptophan, Y=Tyrosine, 
a— un Known, "—atop cooon,/=possiDie 
nucleotide deletion, V=possibIe 
nucleotide insertion 


3681 


17582 


A 


3704 


56 


415 


QAPWGKKKKKGKEKKKKKKKKKKKKKRG 
GPLKKNPGGGPTMGGGBKKYFSPWGGEK 

AGGKKKI * RGGGGKTPLIGGGKKNGGP P 
RTSPR 


3682 


17583 


A 


3705 


108 


1 • 


NLAK* IQ**IQTIMHHDQVSFIPGIQGW 

T UTT tTQTIkl 

LjNXXiKSXN 


3683 


17584 


A 


3706 

i 


425 


52 


PFPH* * *L*KVDPIPLPPQNTLHPSNNI 
TCSPLSPKKFFFSLQPFFFFGEIFPIFP 
PPJCKNFFS*NPNLVFISPPLKKKIFFSP 
PPPNFAPPKNLFKSPPPFFFFFFFFFFF 
FSGKNTLYWVDK 


3684 


17585 


A 


3707 


117 


421 


VLLVIQAEYLLPEVFGTRSVSDFGFFRF 
WNICI IIiSG*ASIiI * KSBI *NAIiKNTSS 
FKKKF I FILE PRSHS VTQAGVQ WQDHGS 
LQPQTSGLKLFSHLSLP 


3685 


17586 


A 


3708 


308 


3 


RSPPWFFFGFFFFVLFFFFFFVFQLFFS 
KYNITLYIKKKTETFTETNDNGNATCQN 
LWDTAKAI PREKFIAISTY I * KVGKHQ I 


3686 


17587 


A 


3709 


94 


420 


KKKKKKKKKGGG AF KKKFGGAKT *RGKE 
KKNFFLKGGGKKKKVGNFGKKPYFGGGK 
KTIEKTPKKN*SLKGKKKFLKGKGGKTPP 
KPWA*KKWSS 


3687 


17588 


A 


AMI f\ 

3710 


413 


184 


LGECLS LRS KKLWSGPASGGVCL* FQQV 
RRRKWEDHLS PG I RG * RALS SHRS X PAW 
X abQDjt V SKub VWvjJjJrvaVjo W 


3688 


17589 


A 


3711 


3 


104 


GGCSELRSCHCTPAWRQS * TSSQRKKKK 

YWMV 
JNJ\KfIX 


3689 


17590 


A 


3712 


2 


393 


GLFS TNHKDIGTL YLLFGA*AGVLGTAL. 
SLLIRAELGQPGSLLGNDHIYNVIVTAH 

PD^FPRINNISL*LI^PSIiLIiLLASAI 
VEAGAGTG *TVYP PLAGN 


3690 


17591 


A 


3713 


424 


1 


FFF FFFIiRRSLTWQAGVQWRNLGSL * A 
RPRV 


3691 


17592 


A 


3714 


2 


130 . 


GRVGGRVG*NLKTiYSQRLKKiCKXKKKKK 


3692 


17593 


A 


3715 


3 


197 


FHHVGQIX?LNIiLTL * S TRLGFPKC*DYR 
CEPPRPAAFFFFFLKiaCKTlAM'KKFKTL 
DLTTRERS . 


3693 


17594 


A 


3716 

• 


3 


497 


RS P PSGFAWPGPAG P WEAGAAAAGQPLG 
AKPHSPCrTGGCRSAGTPP* PAGPAGPPP 
HAPAAAERKPRGAEQSQRRVQETGPPPT 

FGAGCPAVT*GTYPYRI AGQVS SRAPHP 
AESRGCTAGLS IGPDGRQ 


3694 

• 


17595 


A 


3717 


40 


379 


AITSPRSPPAVHGSCIARTHPNSPQCLP 
LSGTLLLPSSSLSL^HPATAAGE * TWDA 
EKKKKKKKKKKKKICKKKKK^ 
SPGGALKKKNPGGPNPPGGGKNISSPPR 
G 


3695 


17596 


A 


3718 


445 


284 


KKKK* METRS C YVGQSGLELLAS S DPAT 
LASQSVGITGVSHQAQPVNKLPRVQ 


36% 


17597 


A 


3719 


15 


383 


GYFLEASCLIaHPIiLHPMYQNCGHVACIA 
AKENG KFMQ AACGQLNS PALS LKNGRLS 
AAYGGSDCVYLFPMSKMVLGTQ*ALNKY 
TLNGPGAVAHACjVPSTLGG*GGRNLRSG 
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SEQID 
NO: of 

nucicuuuc 

"i 

- 


SEQID 
NO: of 


M, 
eth 

Ou 

• 


SEQID 
NO: in 

09/515 1 
26 


Predicted 
beginning 

II UCf CvUUC 

location 

correspond 
ingto first 
amino acid 
residue of 
peptide 

conn An m 


Predict- 
ed end 

n ii/*Ia_ 
uu tic - 

ntide 

location 
correspon 
ding to 
last amino 
acid 

■*A«iHiiA e\f 
rcsiuue ui 

peptide 
sequence 


Amino acid sequence (A=A la nine | 
C=Cysteine, D=Aspartic Acid, 

E^flliitflmlr Acid F=PhAnvlalanfn* 
G=Clvcine. H=Histidine. T=Tfto leucine. 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine,P=Proline, 
Q^Glutamine, R«Arginine» S^Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

/V.— UUKUUVrll, OlU|I iVUVD) f^possiuw 

nucleotide deletion, V=possibk> 
nucleotide insertion 














VGDQPGQHGET 


3697 


17598 


TV 


3720 






IvD rr Vjij x n/wAy X W •*> trJur V fi> VJu J. X 

AVIiLLSLPALAAGITILLTDRNLNTTF 
FDPAGGGDP ILYQHLF* FFGHPEAYILI 
LPGCAII IHIVTYYSGKKEPFGYIGRV* 
AMISIGYLGFIVRAHHIPTVGIDRDTRA 


3698 


17599 


A 


3721 


3 


298 


EAAIKYFLTQATASI XLLIAILPNNILS 
GQ*TITNTTNQYSSLiriMAIAIKLGIA 

PISIIYQISPSLNV 


3699 


17600 

1 


A 


3722 


86 


505 


AGAVTYIVTYYSL* KKDSFWIH5VWC*A 
MRVNWES *GLSV* AHHI FTVG IDVDTRA 
YFTSAT I I XAIPTGVKVFS *LATIiHGSN 

NSSLDIVLHDTYYWAHFHYVLSIGAVL 


3700 


17601 


A 


3723 


432 


48 


CFLSCPSSLSFLFFFLLSFHPFFFPTLL 
ISF*VFLF 


3701 


17602 

+ 


A 


3724 


2 


406 


HH I FTBR I DVDTRAYFTSAT I I I AI PTG 
VKVFS*LATLHGSNMK*SAAAL*ALGII 

VAHLHYVLS IGAAFAI IGGFIH* FPLFS 
S YTLDQTYAI IHFTMI FIGVNIiT 


3702 


17603 


A 


3725 


174 


158 


KFFGGGFPHFFPPQKKGFFPKNPPRGFF 
TPPLLGKNFFFP PPVNliGPPRGFF*GP 


3703 


1 7604 


A 


3726 


2 


392 


LFSTNHNDMGTLYLLFGA*AGGLGTAIiS 
LiLIRASijGQPGSJaXjGNDHMYWVI VTGHA 
FGIIFFIVIPIIIGGFGNRLGPLIIGAP 
DMARPRINNISC*VLPPSILLLIASAIA 
EGGAGTG* TAYP PLAGNY 


3704 


1 7605 


A 


3727 


48 


An a 

474 


KKTPAGPPPPPPREKKNPPPEGGQKKPP 
PAH*KKPPPPRGGQQPPPPPPQKNPPPE 
KQKTWGAEGGTPPPKPPRK3CNGSQQIAD 
PPSNAAKNRHPTQEPPPQSQCIPPPPTP 

FF 


3705 

• 


17606 


A 


3728 


387 


53 


PPPKKGVLPSPPNFYTPPK*GVFSPt.PP 
♦KFFFSPKGLIFWGGGGPNFPPPKKKFF 
SKKPRGGFFFPP* KKKILISPPREKLGP 
trt\± r uX\vjjt it Jr r rrjcrrrrrrrr 


3706 


17607 


A 


3729 


1 

■ 


397 


PTRPLQPLSPBFKRFSCRTSPDVAW*VS 
LPSPPPSGRS FHVFIi*VESFCVWIKKRF 
TMT APT.VT.DJ3 * PSnPPAWASOSAQT'MA 

SIiRASLLYI Y*CFS IQLGLLPEFLIQYV 
QLLIVYPRIHHLYLSVPHLH 


3707 


17608 


A 


3730 


407 


0 


TLiXHTKKKKia<KKKKRGGLLKKTLGGPK 
♦NGGRKKKI FFLKGGKKKQTGE ILKKKL 
I LGGEKKGKNPPKKKKPLGEKKNFRGEK 
GKKNLKMFGVIKLGTSSKKIIF 


3708 


17609 


A 


3731 


3 


200 


GG C S ELRS CRCTLAWVTE * DSVSLKKKK 
KKKGPFKVLRGKGPSGNKREGAKKKI QG 
KPCPFWESF 


3709 


17610 


A 


3732 


3 

• 


380 


TNHKDIGTLFLLFGA*AGVIiGTALSLIiI 
PAELGQPGNIiLGNDHIYNGIVTAHAFVI 
IFFIVIPIIIGGFGN*LVPLliaAPDMA 
FPRINNISF*IiLPPSIiLLLLASAIVEAG 
AGTG*TSYPPLAGN 
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SEQID 
NO: of 
nucicoiiae 
sequence 


SEQID 
NO: of 
pcpuue 
sequence 


M 
eth 

00 


SEQID 
NO: in 

09/515,1 


Predicted 

beginning 

nucieonae 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucie- . 
otide 
location 
correspon 
ding to 
. last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C^Cysteine, l>=Aspartic Acid, 
Hr=\jiuiamic aciu, r—Fneny (alanine, 
G=Glycine, H=Histidine, I=Tsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=GIutamine, R^Arginine, S=Serine, 
1»I nreomne. v«valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkoown, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3710 


1 T*t 1 1 

17oJl 


A 


3733 


6 


394 


AMIS I GFIjGVIG *AHHI FT V * IDVNARA 
YCTSAT 1 1 IAI PTGGKVFS * LATLHGSN 
MK* SAAVL*ALRFI FLFTVSGLTGI VLA 
N5 S LuJ I VXiHDTi YVRAHFHYVI-SIGAGF 
AI IGGFIH*FPL,FSGYT 


1*71 1 
Jill- 


170Iz 


■ y 
J*. 


3734 


27 


238 


Gi PGS SHASAH IG * I XAVLPHYuNT TIL 
NLTIDI ILTTTAFT.T.T.NLNSSTTTLLLS 
RT*NKKKKKRGGPF 


3712 


17613 


A 


3735 


402 


249 


ASAHASAPLFV*SVLITAVLLLLSLPVL 
AAG I T I LLTDRNLNTTFFDP 


3713 


17614 


A 


3736 

! 


100 


430 


VTWPKSDS YBEVKI KLB PRQTDLRALGF 
CLS FTGKKKK* KKKIOCKktU^KKKkkjuu} 
KRGG 


3714 


17615 


A 


3737 


158 


306 


FLSSVFSLVQGVLFSSLCLDRNLPDMMQ 
LWS E I FNK* L YLQSLS I LGVY 


3715 


17616 


A 


3738 


279 


3 


PGHCFYNFNLNFYFQIFFQNVLANSGSL 
LFLYQTLILSFNIFYILYLI IQISIVFQ 
SVIL* FI FSNS YSWWYGFLVCLMTFEGK 
KQTCAHV 


3716 


17617 


A 


3739 


268 


388 


HFFFFEMEADS VAKAGV* WYDVGLQQPP 
PPGVKRYS*LSL M 


3717 


17618 


A 


3740 


395 


« MM 

153 


GWLMRFKERCGLQNIKV*GEAASADAEA 
AAS * PEDLAKXADEGS YT * QWI FYVGET 
AFYWKKMPSRTFIAREKSMPGFKL 




17019 


A 


0*7/1 l 

3741 


271 


386 


TCPPPLKLWVYELGLU*RKNMRQGAVAH 
ACNPSTLKG*GGR I SRSGDR 


3719 


17620 


A 


3742 


406 


190 


QLLKRLRQENP WR WGG* GCS KPRSPPRP 
QAWVTERDFVSKS KTFCTNQPXQNGERKV 
FFWFFFLRKPKVQFEI 


3720 


17621 




3743 


272 


408 


MGLQACANSPL* YVDIAIPCNNKGAHSV 

»*r.tr.T»4irT *.nnrfT M\tgf\i iifi 

G LiW WMIiAJ<EVLRMRGT 


3721 


17622 


A 


3744 


211 


407 


KGKRNILFGPVKNI FCDTQFTSCVNFFR 
EEVSIiDCPGWSQTPALK*SSCLGLPTCW 

D*RREQPRI* 


3722 




A 


3745 


405 


189 


CVSQDGLDLLTS * S TRLGLPKC WD YRKC 
WTNQLFVNGQKS LNS FVCYSFMVLDTTG 
FKEIVMLWHLVPLF 


3723 

+ 


17624 


A 


3746 


120 


362 


LKA * E I S WLNGT I QTHQEKPE VQNQVI L 
EENIVrLGPODRI FE CQAVTAELE PEKK 
AIVAGCSDSHLYPSTLGGQDGWIT 


3724 


17625 


A 


3747 


401 


233 


NSGGRGCSEPRSLPCTPAWVTDRDPISK 
NKNKTNHQ*QKKCKR*TGKVIFOTRSIL 


3725 


17626 


A 


3748 


353 


109 


LGLANFCI FSRDGVFPCWPGWSRTPDLR 
* S AHLS L P KCWE YRA* ATMPGLLMCFL I 
KALDFQGCVGRWGGGS WTDI KLCQM 


3726 


17627 


A 


3749 


412 


221 


HGLKECLVYPKWCQNSNISLQGIHFSEV 
AM*QRAHNHGIHWFYHDSFPRKIWINWA 
RRGGSCL 


3727 

J i ill 


17628 


A 




i 

i • 


j j j 


REPLRPAENTVYVHFSSWSFTNHSGKIN 
* TLDQWKEQLPGWL * I LRGVFFFNPS I W 
NS * QKKAS I YVFPLCWKRDI FLYHF PVS 
GTFRGF 


3728 


17629 


A 


3751 


423 


125 


QVCEKVLISLITEMQIKTTVRYHLTPVK 
MAFRQKTDDNKCWRGYGEKGTPVHCWWK 
CKWQPL * RTAWRFFK* RKIBLP YDPAT 
PLLGI*PKERKSGQV 


3729 


17630 


A 


3752 


3 


171 


FQPIAFLYANSEQSGKEESTSIYSSYEE 
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SEQ n> 
NO: of 

nucleotide 

1 

sentience 

• 


SEQ 10 
NO: of 

m 


M, 
eth 

■ 


SEQ 10 
NO: In 
USSN 
09/515.1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

H fill ■ • m 

residue of 

peptide 

seauence 


Predict- 
ed end 
nude- 

mm %m w 

otide 
location 
correspon 
'dine to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, 0=Aspartic Acid, 
E=Glutamic Acid. F— Phenylalanine. 

m ■ ■ ■■■■ mwm m mm M mmmmmmmm mm A*. IlvM T HlMMHiiVl 

G=Glvcine, H= Histidine. I s Isoleucine. 
K=Lysine, I^Leucine, M=Methlonine, 
N=Asparagine> PHProline, 
0=Glutamine, R=Areinine. S=$erlne. 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=St0D codon. / cz DossibIe 
nucleotide deletion, ^possible 
nucleotide insertion 














. YLGINLTKEMKDL*KENYKTLMEKIELD 




I /Oj 1 


A 


3753 


7 

dm 


247 


FHHVS * DGLNLLTS * SAACLGLPKCWDY 
RHE PRRPALTRFLFVFLYLQRLDHLIRI 
NSPFIMSVNGMFTFWLKLFNLIHA 


3731 


17632 


A 


3754 


3 


197 


GLELLTL* SACLGLPKCWDYRHDPLCPA 
Q YLS FLYAINKLFS STYLE I HNRLVCCI 
VTLLIYKP 


3732 

* 


17633 


A 


3755 


2 


103 


VSQDGLHLIiTL * STLLS LPKCWD YS CE P 
PRPAW 


3733 


17634 




3756 


103 


2 


FRPGMVAHACNLSTLGG*GGGITRSGVQ 
DWPGQ 


3734 


17635 


A 


3757 


394 


244' 


ASTHPANFFCRNGVLLCCPGWSQMPGLK 
ne e * t iQT.PKf^rjYRORPPffftAfi 


3735 


17636 


A 


3758 


341 


2 


AEIAPLHSSLGMKRDSVSKKKBLKKLNK 
QKPNNPLKK*AKDMNRHFAKGIQGANEH 

AI IKKSTOISCW* ARREKG1LIYTWWEC. 
K 


3736 


17637 


A 


3759 


2 


322 


PPRPTRPLNNTINQMETICMYTKCYLII 

FKRIKI IQS AFFKHNG IKLENDKRKI TG 
KHPNTWKLRKHTSKPMSQEMFF 


3737 


17638 


A 


3760 


,444 


422 


VRLPHRPS PPPDPRVRFHHVGQACLKLL 
T<5S DP PTSASOSAQI TVPS IiO*QP * S VR 

LP 


3738 


i7o3y 


j% 


3/01 


zoo 


i 


rif3«vpr /3pvt p PT»RPPnpvciPnAr.apop 
GPGSKIF* IFFGPQFKNRVFFKGRNLVF 
FPPLFSLLPFFFFF*DGVSLCHPGWSLE 
CSGM 


3739 


17640 


A 


3762 


70 


381 


HSITSYI*AFIWGFFLFCFVFFLIKEME 
SHYVAOADHELPASWAPPTSASOYAGIS 
GGSHPTQHYFI FYWI FYFTVFOGGGFLR 
RSLAPS FFAQAGAQWRDLGS 


3740 


17641 


A 


3763 


229 


390 


D I P YS FCFFFFFWKGS FVFVPQAGGKGR 
KIjG*LKPPPPGLRGFSCLTLPGSWEN 


3741 


17642 


A 


3764 


119 


1 


QTS CRPGAVAHTYNPSTLGG* gvdhlrs 

gvrnqpdqhge 


3742 


17643 


A 


3765 


400 


200 


ITAYCSLDLLQPHFLFFSRDKVSLCCPV 
WSRTPELK*SPCLSLPECWD*S*ATASG 
LKHFIKSLPL 


3743 


17644 


A 


3766 


376 


44 


tqpiflgpgfqappgpw3*pnflf*ekk 
lvffpkkpktapagoggpkfplprkpkq 
kkgphpgsgnfokprpkfpprp ptwgkk 

l\i\WC A4J« WW^A14 4 V4» 4V47 4\4 * 47 IVfc ■» ^ f *W4%4\ 

P^FFSQKKKKKKRERDVKITDLNTST 


1*7 AA 

■ 


I 1 Oh J 

■ 




J /Of 




•a 


dhfffkgppsgkgefplflktfpsplgg 
f<3*islfpppinfppppkkrifpgplsf 
fffkkkktypagkgllffpfpkkpgggf 

KRWPGPPKKFFFP PPGFWACS PPFFFFF 
FFGAKVSLVAQAGARSRSRTS 


3745 


17646 


A 


3768 


407 


177 


FHHVS QDGLDLLTS * SARLSLPKC *DYR 
HDPPRLARCIKNFCRGWRCXJILRKSPLS 
LPPSAGELRAPFLDFI*EDAQ 


3746 


17647 


A 


3769 


292 


391 


RPGWAHTCNPS TLGG* GGQ ITRSGVQD 
QTGQY 


3747 


17648 


A 


3770 


395 


198 

-t 


LDMGFHHVGQAGPELLTSSDLPASASQS 
AAI TGMSHRAQP Y * VI SMCCKJCYKHF IR 
IPFNLHNNV 


3748 


17649 


A 


3771 


1 


241 


RTRGRTRGRTRGLLSAHTGRYQKHPRIR 
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SEQIB 
NO: of 
nucleotide 
sequence 


SEQ1D 
NO: of 
peptide 
sequence 

- 


M 

eth 

od 


SEQ1D 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucieouae 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- . 
otide 

location . 
correspon 
ding to 
last amino 
add 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
C=Cysteine, D=Aspartic Acid, 
Hi— Criutamic Acta, r==rbenylalanine, 
G=Glycine, H-Histidine, I-Isoleucine, 
K=Lysine, I>Leucine, M-Metbionine, 
N=Asparagine, P=Proline, 
Vf^vjriutamine, K— Arginine, d=&enne, 
T=*Threonlne, V=VaIine, 
W=Tryptophan, y=Tyrosine, 
a= unknown, =otop codon, /=possioie 
nucleotide deletion, \=possible 
nucleotide insertion 




- 






• 




TPQIFSTGTTKAEAGELLEPGGGACSEL 
RS CHCS PARATRVKLHVKKFKKI K* KHP 

S ELRS CHCS PARATRVKLHVKKFKKI K 


3749 


17650 


A 


3772 


112 


406 


FFFFFFFFFFFFFFQGEGVPPI*KPPPP 
K* FMFPKKKLPP P P PPALKLFF FSPPF P 
FFFFF 


3750 


17651 


A 


3773 


24 


197 


I FS VEMGFCHVGQCDIiDLLGSSNPPAS V 
S*SAGVIGVSPHAWPLVKLFVKRNKLSY 
V 


3751 


17652 


A 


3774 

! 


85 


201 


VNFFHLI * FVNKR I iCLE KKKKKKKKKKK 
KKKKINWKKKKK 


3752 


17653 


A 


3775 


1 


382 


WENHLSPGSRGSSEP*SHYCTPA*VTE* 
DSVSKKKKPNLQEAEKRLTLLLIKKAIF 
KNRGS PCFSI KWAGFKKI KQGWFFVATR 
MGDRSRGTPNLGBKFLKSFLSPS IKIQR 
FYYLGLKNSPSRNIY 


3753 


17654 


A 


3776 


1 


181 


IKMATVTKATYRFCVIPSKLPMLLFTKL 
ETKILKLI WNLERA* IP KAILS * KNKMG 
LSNS 


3754 


17655 


A 


3777 

• 


70 


385 


LNKMQTKTFDSDTH^ICFTLHFINTKL 
QFI FLLAI YLLFVNFPFI FINFFEEPGF 
CFDDLSSCFLFY*CIP*FYYFFSAFFRIi 
HLFFFYRFLSLLTLDLFSFLI 


3755 


17656 


A 


3778 


2 


406 


KDDRNTEHQVHPNASDQEDEAFD PYENF 
FEDSDSPTKSSSTEPSPHIHPVDIQMTI 
FHCADNFARQ Y I LAKLAKKKKKKKKKKK 
KKKKKDSSSPGGGPKKGAPPPPPRGEKN 
FFr FGPPLIFSGGGF*NGGGGGP 


3756 


17657 


A 


3779 


403 


176 


FLFP ILFFFLRQSFTLVTQTGGQQRDLG 
SMQPPPPRFK* FNEGKGKRKLWGGA 


3757 


17658 


A 


3780 


3 


248 


GLNQTQIiRKILAYSS ITHIG* IIAVLPY 
NPNITIIJJLTIYI ILTTTACKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKGGGGA 


3758 


17659 


A 


3781 


3 


330 


VDKLKTGVQDHPGKfiS ^P&LPKKKKKK 
* KI LPKRGGVYLNPHFFERLRGENNLTP 
GGEGQRETES P PCLPAGGKNKI FV* KI Q 
fLKAKvivjr'r KGNKJTr FKCaovjiut 1 rr 


3759 


17660 


A 


3782 


399 


187 


FPPPAKGDFLPSHLFGLPPGFS PPPVFK 
PRPRNLILGPP *KKFYLPP PRP * TWFL* 
NGPPPFFFFFFFLDF 


3760 


17661 


A 


3783 


392 


11 


KKLKVNJjPDHPAVPPIjGI r PKEMaAGI * 
RAI CTLMFI AALLTI EK* WKQPNWLLVD 
K 


3761 


17662 


A 


3784 


324 


80 


PGPFFFFPPPAKRGFFPPPPFLVPPGFF 
PPPLF * T PPPDFFFLAPLKKFFFS PPPA 

LNFFFFKGPPPFFr FFFFrFFFFFF 


3762 


17663 


A 


3785 


354 


24 


PPNKI FFSPPPQKKYFPYKTPPPTFFFL 

F I VKYRKCKRGTR I ILHTPTLRQLVLCC 
LSDVLLGCCIF I YVTMCDLCAPKLG 


3763 


17664 


A 


3786 


3 


369 


S CRGL VWVYGE LLLHP CRGNQ I S RVMQ K 
KKKKKFGASPPREGAPPGEPP*RGP*TQ 
KEPPPLKK*APPPGPGGGRPPPMGLGGG 
PAPPPFFWPKAFFKKGPFSGQEFGFPLK 
GLFFWKFPF 


3764 


17665 


A 


3787 


1 


125 


FRHRFS RDGLDLL IF* SARLGLP KCWD Y 
RREPPRPAHSIIX 
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SEQID 
NO: of 

DUCIcOliac 

sequence 


SEQID 
NO: of 
pep Dae 
sequence 


eth 
oa 

• 


SEQID 
NO: in 

09/515,1 

AO 


Predicted 
beginning 
nucicoiiw? 
location 

curresp ins u 
ing to first 
amino acid 
residue ui 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 

lOvnuOD 

correspon 
ding to 

last UUllllO 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
u^Hjriuiamic aciu, t~ rnenyiaianine, 
G-Glycine, H=Histidine, I=l3oleucine, 
iv=ijjsine, jj— i^eucme, ivi^ivieuiion 1 ne, 
N=Asparagine, P^Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
i*— j nreuniue, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3765 


17666 


> 


1TOO 

3788 


All 

•4Z7 


Z51 


KKKQAKNRCS LRMKYERNE FTSS PC 


3766 


17667 


A . 


3789 


250 


2 


GFPP VEKIKKNPFP I P* KKPdtftffrFPNP 
DRVLLCHSGRSAVAQSQLTATSTFRR 


3767 


17668 


A 


3790 


412 


3 


GSTPITTPFWGVGGALPPGGGVPPPPAP 
PGGTPFFFKKKKKKKFQNFPRFRCSRYN 
GGLGR KNFFTJjEAKGS INPNFS PAPPPG 
GQNKTPFPKKKKKSR*MTLQSFPRPGTA 
PGAAM* PVSPPEPCGPPAPALSTK 


3768 


17669 


A 


3791 


10 


187 


GCSEL*LIHCTPAWVHSETIiSQKKKGRO , 

KGGFKKNGIPCFGKNKGGKRFGKSFGHF 

STG 


3769 


17670 


A 


3792 


1 


331 


FRVPLGRLSRGEHQQQ*QQQPPPPPPPP 
GPLRPIAGPSRKGSFKIRLSRLFRTKSC 
NGGSGGGDGTGKRP SGELAASAASLTDM 
GGSAGRELDAGR* DRPGAGRQTS FSW 


3770 


17671 

< 

* 


A 


3793 


1 


431 


FRKGS ARCRAP KP AGADGVHWGHHRAS P 
QTAQAPPVTCSARGSWMDFSHASGPQDR 
DLHS PS P VCPRGWAPGTHS PTGASGA* G 
RTP VGGGK* VAGVQQRIHPGIiAPG*LVL 
PGIAVHITKKPGSKGTITPGTVAHACNP 

NTJjG 


3771 


17672 


A 


3794 


342 


48 


KRENMPGVAYNPSTIiGGCNPS I VGGFCN 
RVCNP SALGGCNPS FLGGFCNHVCNPS P 
LGG*GGRISRSAAQDHPRQHGEKMSPQK 
IQK.I SQACFAKVG 


3772 


17673 


A 


3795 


2 


189 


LNPGGRGCSEPRSC3ICTQAWA'rE*DSVS 
LGGPNL 


3773 


17674 


A 


3796 


1 


401 


LNPGCKGCSEPRSCHCrPVWVSB*DPIS 


3774 


17675 


A 


3797 


279 


3 


ILAKVYPMVNSFLEKIHYFLLVELCCFL 
QHVLYYLTFKTTETGSRSVAQAGVQWHD 
RSSL*PLPHGLKRSFFXjSLPRSWDYMHT 
PPCLATE 


3775 


17676 


A 


3798 


229 


394 


VFVKSLLCQPGMVAHACNPSTLGG*GGR 

TTMeon*TMTiv*'giintn70'inniiTTnrv t^/v 
1TM5GD* DHPeKHGEi VbTiKTlUUUCBvS 


3776 


17677 


A 


3799 


380 


206 


AGHM*S*LIjRRLRQEDPLSPGVQGCSEP 
*S*PCPPAQVTVRISLKKKRLHERDKSS 
AS 


3777 


17678 


A 


3800 


450 


3 


I YIiliRGRHS Lin YSRP S FYFSNYRGACS Y 
MRPPOQATIiLGRLNQGI TREGPAI AHS P 
AVRPRLSLAHAHVRRLCLPRPGAAPEFP 
AFFLPCLLVAQAGVRWRDIjGSLQPPPPR 

T?VT3I?Or , T CT.DCCMnnDDDDDT OG*C7Vt?T 

B fUCr s Ujo Jj rod vv UUxcKF F FliOb'riwij 
VEMGPRRV 


3778 


17679 


A 


3801 


301 


10 


KKQPPKKKKRCFPPLFPPPKGFKGSPPF 

FFFERGSPSVTQAGVLWHD3jGSLQP*TP 
GFKHMPSCPVIiLC 


3779 


17680 


A 


3802 


420 


51 


SASSPPASFFFFPPPKKKIFSHPFFFFF 
PPFFFPPPFF*PPPPFFFFFPPKKKKIF 
PPPPQKIFFFLKPPPPFFFFFFFFFFFF 
FFFOGFVNCQFSVLI 


3780 


17681 


A 


3803 


253 


73 


QTTPSKRYSYVLNCCNN* ITTCKRI KIJ3 
PFLTLDIiKIN*KS FADLNVRAKTIDTIK 
KVKG 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 

¥ IC CM 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict, 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 

C=Cysteine, D=Aspartic Acid, 

Hi— Glutamic Acta, r— Phenylalanine, 

G=Glycine, EHHistidine, I=Isoleucine, 

K=Lysine, ^Leucine, M-Metbionine, 

N»Asparagine, P=»Proline, 

q— fjriutaraine, K— Arglnlne, s=Serme, 

•^Threonine, V=Valine, 

W=Tryptophan, y=Tyrosine, 

A— Unknown, -otop codon, /^possible 

nucleotide deletion, V-possible 

nucleotide insertion 


3781 


17682 


A 


3804 


385 


55 


AGTRPVFAAIAAAAP APRLLP LAPHGS A 
ATGFPPPPCPHGEGLRGAPAASSSPPLA 
HRSQTPNWVDCGES P I PDPQFPLiL *NGP 
PALCLPQGLETVNVESLALCFTSHGN 


3782 


17683 


A 


3805 


191 


2 


QKKLKALNFLPRCVCVCVCVCVCVCVNV 
TF*ACKI CTCSDFPGSPRKKYIjEFGTKI 
INIVDFN 


3783 


17684 


A 


3806 

i 


1 


367 


KLRQVI YLEDLPT 1 1 DEGS YTKQWI FNV 
N*TAFYWKKMPLKTSVTREEKSMPGFKA 
SKDRLTVFLGANVVSDFKLKPMPIYNSE 
NPKAIQNDAKSTLPMLYKRNNKACIAAH 
LFTAWFTEYF 


3784 


17685 


A 


3807 


3 


145 


LGPGGGGCSEPRS CHCTPAWATRGKLCIi 
QEKKKKR* RLDPEAEVAVSRGRAIAIiQP 
GQQEGNSVSRKKKKKGRDKVSIiCCPAGF 


3785 


17686 


A 


3808 


362 


86 


NPPFICSPPFFPPLWGPKQWPPWCEG* 
TPPGPPCEIPLPPKIOCSKSPPGVWGPPF 
FPPPWGGGAPKFFLPPKWGFPQTQICPL 
PFGLGEKN 


3786 


17687 


A 


•3809 


245 


377 


FGVMPS FFETQS RS YTQAGVQWHDLGSIi 
QPLPPGFK*FSCLGI^P 


3787 


17688 


A 


3810 


3 


268 


KRNFLNL I KRRYEKP I ANFVFNS KI VNT 
FHLRSSTS*GYPLLPLLLNILKVLPNEI 
RQQKE I KGRKNKGKKTIS VYS I MLSANT 
EKPK 




3788 


17689 


A 


3811 


2 


388 


SAENCLNPGGRGCGDIiRWCHCTPAWVTE 
RDCVSKTNKKETEKYMRN* TGNMLTI ID 
AGRRYMRCHQTIFTFFFFWRGSIiIFLAP 
PPSQGEGI FLPHP PBKWKKKVRI TTPTY 
FCFLRTDGFLPCGPGGF 


3789 


17690 


A 


3812 


292 


136 


PPTVIARKGKPRPKLSSEPWQFSAEQIiG 
IiPSFLCFFVCVRCCCLLLFIJiIiFCRDEV 
SICCPGWS*TPGLKQSSHLSFPNCWDCR 
CEPLCRAI F* FFCLCTLLLFAFVSAAVIj 


3790 


17691 


A 


3813 


359 

• 


52 


KKGRF FFFE KNP FGDLVPKKKKKKTP P P 
PNPKKKKRGAPP FFFFFFFFRH* VPLCC 
LGW FRT PGLKGS S CVGLPKPWDYRHKPL 
PtiTKDDi YVXGIiP Fv S UQ 


3791 

i 


17692 


A 


3814 


382 


1 


GGPGMEAPLGYHI KTPFPLKPQKNNPGG 
GAKPWPPPPGG * AGKMGLPPRQRFPLT 
QKSPPPPPLGGKREPPPQKKKKKRVTAM 
GDP*NSECR*GCHLRVQKEMRLPGLRNS 
PPHYIPNRNTDMCSC 


3792 


17693 


A 


3815 


472 


340 


TGFHRVS QLKjLDIjLiNS * SAGJjGIjP KCWD 
YRRGLFI 1 1 FNMGEWS 


3793 


17694 


A 


3816 


247 


378 


PPPPGGGXGXNLGLLKPLPPG*REIPPP 
PPRGGGNPGGAPPPPG 


3794 


« mm m+*\. 

17695 


A 


3817 


25 


465 


TPPLGG * GKRFPRGPGFSPPPPPKGNPP 

PLPPGPKGSH*PKFPPPPPPGETKRKPP 
PPKKKKKKRKKEKKKEI * AYKAY I CFDC 
PLTGPGDLKSDKTLAFPSVIGLPRPCTC 
LSPR 


3795 


17696 


A 


3818 


475 


147 


PPKG*RHPPGPPRFRRGPGNPPPHRFFF 
VLRTPGYSFPF*KKRGPPPTFFFTLPGP 
PGFLKGPPSPGGGGQKI PFPRGGGGPLK 
* IPPFPPPPGKKKDFFFLTKKKKKKP 


3796 


17697 


A 


3819 


461 


57 


TNQKKAAPPPIxLKKKFFFFFKKHTGGAP 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M, 
eth 
od 


SEQID 
NO: in 

nccw 

09/515,1 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue or 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutaraic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 

I/ - — T .vsinp ¥ .purine IMsslVf AfrhiAn (no 

N a Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serlne, 
T=Tnreonine, V— Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














PGEKKKRGGGEKKKRGVYRKGPRAPSRG 
CAPPLYYRGGLFFFFFFFFFFFFFFFFF 


FFFFFFFF FFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFHVKNI LFVF* DYIVI IADL 
IQSLRHKDFQELLRR 


4% mm t*±Mm 

3797 


17698 


A 


3820 


1 


418 


HAYHI * KPSP* PLAOALSTLRMTSALAM 
*PHFHSITLLILQLLTTTLTIYQ*WCDG 
TPESTYQGHHTPPVQKGLRYGIILPITS 
QAFFFAGFF * AFYHS S LAPTPQLGGHWP 
STGITPLNPLEVPLIOTSVLLASGVSI 


3798 

• 


17699 


A . 

• 


3821 


385 


1 


LQFPPFSPLLFGVQRPFFSSFPSCPWWG 
PVPSKLLPLLQI LGRG FYPQKSLLVSLG 
FPFFFFLI I KXHRNEVLLCCPGWSRTPE 
LK* SSHLGTPKCSDYRCEPPHPPDPWDI 
CSYVPFLQSTHFGLNY 


3799 


17700 


A 


3822 


72 


424 


KKKKKXKKKFPPRKKGAPPAPPLFGKAG 
GEKLFPPRGL I FKKPKKAPGEKKKNLFK 
KXKKRGGP*KKKKLKAGGGEKKIFFKGA 
QKKNPGAGFKKGGRGKNROAPQKKGGEG 

sss 


3800 


17701 


A 


3823 


154 


2 


GHHTQHTE PQRLNNS YR YAELKHLSNT K j 
NNP I * K W AKDMNRHFS KED I QM 


3801 


17702 


A 


3824 


422 


212 


MGSSNPSPTASWVDEPTGICQHT*FV^1~ 
MGFCHVSKAGLGLIiGLRDPPPSASQSAG 
VAGVSHCTRLWIIL . 


3802 


17703 


A 

• 


3825 


2 


417 


KRVNTRGFPPPPFFKTPPPKIFMGPQKK 
KNFSPPPPIKIFFF*KAPPSP 


3803 


17704 


A 


3826 


423 


65 


ACGRG KY FKSGGRVKPFCAVD FPPSGKK 

^ 9k. m rmM~Wm k mm M Mm a m^m^mmmm^mmm, m m* Mm* M*m, f>m J» A^^A «h «k * 

ANQPTPKNPRKKQKPSGRGMSPMGKPPG 
CPCPMPKAS PGPVS FNKGGVAPATKDFC 
PIGKTA* IPSKKKKKKSDPG*VKKGKPN 
LYSIYNX 


3804 


17705 


A 


3827 


285 


136 


FFFLFFXFLFFFFFFFFFFFFFFFFFFF 
FFFFFFFFSGA*KIFIIATIHY 


3805 


17706 


A 


3828 


425 


316 


QDGLDLLTS * SARLS LP KS WDYRHEPPH 
PAQTFFKQ 


3806 


17707 


A 


3829 


3 


169 


LSNS * ANNLI * LAYTIAFI VKI P L YGLH 
L*LPKAHVEAPIAGSIVLAAVLLKLGG 


3807 


17708 


A 

- 


3830 

• 


6 

• 


452 


IGFCQGFAKDFCS KKFQRNVRVS PRGGE 
GRKFLLFKGGGFQKFPPLPPGWGAKTPF 
LKPPPKKKKKKERKEK3CEKEKKRNPGPP 
VSPSSWSPVP*APSEQRSGPQLPPSRGD 

^ ■%* mymf m »fc mm mmm. Mm. m**mm mm**, M^ M\ *mm jjMfc -» mmm mm, m mm 

* WRLPS PPRPFRDCVRRSATRGNSPPC 
RGSQGP 


3808 

• 


17709 

• 


A 

• 


3831 


45 

. ■ 


387 

• 


NFGVKVFS * LATLHG SNMK* S AAVL* AL 
RY I FVFTVGKKKKNKKKKKKKKKKRGGT 
PPGGEKRKFFSPKKKKKKGFLKNPQPKK . 
NFFWGKPQKKKGGKTPFWGGGKKAPKKT 

JT 


3809 


17710 

■ 


A 


3832 


463 


288 


DGLDLLTS *SVRLGLPKCWDYRHEPPRP 
APCILMATFFQFEVLFSEFLBTIVKSVL 
LH 


3810 


17711 


k " 


3833 


294 


90 


FFFLFFFFFFLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFF*AGIYFI 
TCAPPPAYPRS 


3811 


17712 


A 


3834 


427 

t 


37 


SYHFGRIiRRADHEVRRIi*NRKGRPGAVT 
HACNP ITLGG *GGR I MRSED * DH 


3812 


17713 


A 


3835 


499 


53 


AKLMYFQSRENVAKRSYRRNQKASMVHP 
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SEQ ID 
NO: of 

UUCIcUllUC 

sequence 

■ 


SEQID 
NO: of 

|lc|IUUc 

sequence 


M 
eth 

nA 
Ou 

• 


SEQ ID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

IIULICU Hue 

location 
correspond 

lllg tv III 91 

amino acid 
residue of 
peptide 

scijucncc 

1 - 


Predict- 
ed end 

II UvIC 

otide 
location 

LUI 1 couvu 

ding to 
last amino 
acid 

t*ACfHno iff 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 

Fl==flliitnrnir Arid F^Ph^nvlilonina 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=JUucine, M=Metbtonine, 

Q=Giutaraine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

/v~w iih.hu wii } — oiup couOD, /*— possioie 

nucleotide deletion, \=possibie 
nucleotide insertion 




. 










NQEPAVI AGKJGT I ALEVLNQVK*LSS YH 
CRLLVGMYHGCLLVSSVTP FCNI FSGLK 

V li 


3813 


17714 


X" 


3836 


304 


3 


RLKI TKTNNNQTE INS * YI KDLNARAET 

TKNIYKLDYTKILDFCISKDTINKVKR* 
PLE WEKI FANHI SDKG 


not A 

3814 


17715 


A 


3837 


475 


135 

* 


MrrrST.T?OOAOMT7XJDT7TD13'DTJ^2*r<^ir!r2CDr H 

PLCAJJUr v-JrAJbT'IVrlr'r XJrr 1 lr Jjvjo^lsulsOr'l? 

PGIQTPPGYPGEPPFFFTIiKKLPRRGAV 
FFFPPPLGGLGPKNVFTPGAGASLEPGW 
APSPPPGGPP*NPFPKKKKKKRLKIGWH 
F 


3815 


17716 


A 


3838 


415 


245 


ETGFHHVGQDGLDLLTS*SACLSLPKCW 
JJrlKKlSF JPMf UFQVo r IKAuf FfTOKAJrJjo 


3816 


17717 


A 


3839 


460 


413 


WDYIKLTGFCTAKETRVKRQPTEWDKTS 
AKYISDKGLKTRLHKEQ* KMTNLI *NWV 
K*LGRYSH 


3817 


17718 


A 


3840 


2 


470 


GGDAPHFPDGVAARRRGS S LLRWGGCQA 
QGLLTSQTGWPGRDAPHLPDGVTAGPRR 
S PHLRR WAAGQTRS SLPR CDGGREEALL 
TS * VGWRPGRuAPHJ^PDWAARQRGSSHP 
RRWAARQRRSSLPRRGSGRAEAAISNSA 
SLQRVSKVSVSCRPGG 


3818 


17719 

_ 


A 


3841 


54 


332 


QFGFHHVDQAAPELLTSDHLPSSASQSA 

r* T TYH TG UDRDDGT IMlO'CT'TJCT.TVT *T5TTi 7V 

urJ. lo VoHKi\KJroi-i/Vtic5r Irloi-iAij ir ItrA 

HSLIHWLTHSLTHSL3FPCSFSSPPSHT 

ltd JjrUvf Jjw V 


3819 


17720 


A 


3842 


208 


468 


. ENPTPPPLWCFALGVL*EDGFSIjYSLSA 
HVCEIVFLYCPVLESARPLMFLWNLAHV 
CYYEWHLFSVSLEFHTVS 




l / /Zi 


• 


J 64 J 


WO 

i 


o 

z 


PSSSSfiSSSSPPAAFFIiOePGVFPSPPF' 
LKPAPGFFFWGPKKKNFFSPPRGVKFFF 
FKGAPPLFFFFFFFFFF*DGVIjLCHPGW 

D/vuUlO 


3821 


17722 


A 


3844 


370 


253 


VGLLEPRGQGCSELRSHHSTPAWATE*D 
PILKQNETEQK 


3822 


17723 


A 


3845 


369 


73 


LRQDNGMNPGGRACS E PRSHHCTPAWAT 
E *DS I S KKRKSQNSVFTES YFASEHCYS 

CRCSPIIjPLALQCDH 


3823 


17724 


A" 


3846 


198 


388 


FRKSINVRYHVNRlNLKNH*ltPl£>AQK 
NLF*KPS 


3824 


17725 


A 


3847 


391 


20 


GFPPPRVF*HGPANFFLGPPKKKFILPP 
PR**TWFLLKGPPLFFFFFFF*DGVSLC 
RPGWSAVS KN* QTNKKS 1 1 LP I RGKLNL 

PI7IICMT.T V/^FTVTtT.T.TFVNPTfVlf PQ'PW'P 

IDHIFSRQLMA 






A 




999 


9 


KRRNP FHKAI AAIDS SDGPR* SKLKNFW 
RGFS XADAI KI FRDS WEVKI S I LTGFWK 
KWILTLMDDFEGFNRTRGR 


3826 


17727 


A 


3849 


373 

• 


2 


SSPQKYWGGPGFFPPPGFKTPPPPFFFW 
APTKKKIFFGAGRENFFFLRGPPPPIIF 
F F FFFFLFFFFFFFFFFFFFFFK* YNS F 
I * D I KTHVFIiLHS VG FRVLVYKTRLYFT 
LYHSFLKNELG 


3827 


17728 


A 


3850 


394 


2 


EACS VTQAGVQWRNLGS LQAPRRQS FCL 
RGSLLSPQH*EDRT*HIGGTQQTSMNE* 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQ ID 
NO: of 
peptide 
sequence 

4 - 


M, 
eth 
oa 

• 


SEQID 
NO: in 

09/515,1 
26 

• 


Predicted 
beginning 
nucieouoe 
location 
correspond 
ing co ursx 
amino acid 
restoue or 
■ nentide - 

sequence 


Predict- 
ed end 
nucie- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
C=Cystelne, D^Aspartic Acid, 
tLr- vviuiamic Aciu, tf— rnenyiaianittOf 
G=Glycine, H=Histidine, ]=Isoteudne, 
K=Lysine, L=Leucine, M-Methkmine, 
m— Asparagme, j^rToiine, 
Q=Glutamine, R=Arginine, S=Sertae, 

T^ThrAnnina \T — \Ja lino 

I— i Oreo nine, v — vanne, 
W=TrvDto d h an . Y=Tvrosine. 
X=Un known, *=Stop codOn, ^possible 
nucleotide deletion, \ppossible 
nucleotide insertion 


• 


• 








■ 


IAFCVCLVGFFCPFVFLFFETASQSPA* 


3828 


17729 


A 


3851 


,3 


164 


NHLNLKGRGCSDLRLCHHTIiAWATEGDS 
I SKTKTVQ * RKLQNNI PHEHTCKNL 


3829 


17730 


A 


3852 


35 


390 


















- 






• 






PPPRVNFGFLRKNGVPPGGPGGPQPPDIi 

GTGPPPPPEGSPPPGPQPSKTKRTAPGQ 
nv/vni? 


3830 


17731 


A 


3853 


6 


437 


PNK*KTPGLPPPPFFYFPPPPPPFLSFIi 
KPFSFPPPPPPPLFFFLPPP 


3831 


17732 

• 


A 


3854 


1 


236 


PTRPLRWLRQBNRLNPEGQGCGBPRSCY 
YTPAWATE*DSISKKKKGGGRIiKGPTLT 
SGGGQGLFFLkGPQKSTSGAAF 


3832 


17733 


A 


3855 


52 


351 


TVGIMEPQTTDVYVKLHKCTDFLLPDYI 














KKKKKKKRTKQKKKKREKKKKRGGOAPQ 
*LTLSPPPPPLYYFFF 


3833 


17734 


A 


3856 


72 


420 

- 


NT YG KKKKKKG KKKKKKKRGGRE KE KL W 
EAGGGGTTRGL* KENSGGNMGKPRIKKK 
GGKARG APLGVSSTQLL *RGEGE KCLS P 

EKK 


3834 


17735 


A 


3857 

• 


410 


107 


SSSS SPLARYFLGVAPGFPTjPTIiFKTGP 
RI FFLGP P* KTFSIjPTPGR* t* s p* kgp 
PFFFFCRDRVLLW* PGWSBMCGLKQSSC 
LCLQKCWRYRYKPPQLA 


3835 


1 7736 


A 


3858 


419 


239 


TTRS QIjXjL FjU VEMG rHHAGQAGXjKLiliTli 
R* SAjCJjP KCWDYRRESLYLADA* GFYNL 


3836 


17737 . 


A 


3859 


262 


i 


TAAQEFNFGGPIKKI FRCQPGEVKIiDPF 


■ 


- 










GWS*TPEFK*STCLGLPK*WDYRHEPPN 
pas 


3837 


177jo 


TV 






i n 


MTP P Pfl AORNVRNPT .PPTfKTf PP«UTIPT»P 

PPPRKAQTEEWIiYPGPK* FFFFPPPPPG 
GKNTKKKPPPQKKKKKKSL 


JO JO 




n 


JOOl 






TF «S POR* (3GPP PP P P HC3GGT.R A P P P P PG 
LFFVFWGKRGVSP 


3839 


17740 


A 


3862 


3 


482 


YITDEVNDPSLTIKSIGHQWY+TYEYTD 
YGGL I FNS YILP PLFLEPGDVRFL 


3840 


17741 


A 


3863 


1 


129 


LANSNYERTHSRI I XIiSQGIiQTLLPIjIA 

W* *T.T.2V QTt&NTAinm 
D " iJliAi9lirUiijnlV&V7 


3841 


17742 


A 


3864 


289 


372 


NHLVKKW AKDLNRHFS * ED I Q I AHRHKR 


3842 


17743 


A 


3865 


r\ S f 

266 


31 


y-rpc t DO DDTlWC^7T?l?DlfirD CDAVDVDD 

ivJ.t-r 1 r t> rfyiuw vj? f fjvvKr yyj\KJvrr 
PATjLFSAYFFFEAESSSVTQAGWQ*CK 

T Y^CPAPPPPfSPTTPPCr'T-'C'rT^VS 




A. 1 i*fH 






i 
i 




NTSDFCI KPRAYNESEHHWDMVRRHL*G 
KEDNLTLDI SKLKEQI FETS KAQLNLVS 
ETFAMVKAVDSLTNLNPVTWVKTIGNST 
IAKFVLXTjGCLASLLVVYRYIQQLRRDS 

dqregammtmavls krkgg 


3844 


17745 


A 


.3867 


137 


2 


APLIKKHECLRAVAPACNFSTIiGGRGGB 

itr*gv*dqpdqhgetp 


3845 


17746 


A 


3868 


1 


108 


enrlkpggrrcseprscyctpawaterd 

SMS*INNK 


3846 


17747 


A 


3869 


198 


302 


llninhnyggv*gcsel*shhctlawaa 
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SEQID 
NO: of 
n ucicoiiue 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

* 


M 

eth 

00 

• 


SEQID 
NO: in 

1IC CM 

UoolN 

09/515,1 
26 


Predicted 

beginning 

nucieouoe 

location 

correspond 

mg to iirsi 

amino acid 

residue of 

pepuoe 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acta 

residue of 

peptide 

sequence 


Amino acid sequence (A~Atanine 
OCysteine, D=Aspartic Acid, 
Ur- oiuiamic Acta, f— .Phenylalanine, 
G=Glycine, H=Histidine, I-lsoleucine, 
K^Lysine, LHLeucine, M=Methionine, 
PF=Asparagine, Jr^rroune, 
Q=G!utaroine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
w— j rypropnan, jr— lyrosme, 
X=Unknown, *«Stop codon,/=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 














ASIjHFKKKKKKRKKKSVKSQ^ 

KAILNYIREKKLFLKWK*LLNINHNYGG 

VLEKKKP*SYTYNLKKSQNLPQLL 


3847 


17748 ■ 


A 


•5 OTA 

3870 


237 


419 


ixtIUoKQ(jIaRAJ.CIPMFIAASIjMIDKG* 
RQPKCPWREE * INKLWYKHLMAYHSALK 
RNEIL 


3848 


17749 


A 


3871 


239 


232 


KAFLHTRVKRG I Q PG AVAHACNP SALGG 
RGGR I TRSGDRDDJPA* HG FS WGKCLGI E 
LVGHMFNFLRNCQTVFQSGYI I YILPMP 
LDS 


3849 


17750 


A 


3872 


417 


219 


CHVAQADLKLLSSSDPPASASQSTGIAG 
ISHRAWPRGTNPIHKGSTLMT*LPPQRP 
HLLTLSPRGF 


3850 

• 


17751 


A 


3873 


45 

• 


414 

■ 


NILFFFFFWKKSNLVPQGEGQGQSLG*W 

m V V* -w jMm m» ■ ^ jTT. 1 — | — _ ■ — 1 J~_ 1 _~LL 1- ~K W W _~M. _~I_L "L _ W ^m. 

NPPPRGLRGFFNLTPPRTGNKGGDPPPQ 
ENFGF*RKGGVTQGGGGGSKTPALGGPP 
GLTPQRGK3NNGQD PRPPAKKFGLGKKKK 
NRQLPFLGFFR 


3851 


17752 


A 


3874 


278 


2 


LISIYVSIIYLYHSGFQLSKISPPQGQL 
AYLS IYLSVCHFLI YBIGVITMPASKID 
K*DLIKIKSFCTAKETIIRANR*PTEWE 
KIFAIYSS 


3852 


17753 


A 


3875 


233 


379 


QCD YFNELLiRMBI LGPGAVAHACNPNTL 
GGRSG * I TRSGV* DQPGQHGE 


3853 


17754 


A 


*m mm m* 

3876 


6 


247 


MIjITSAS YPEDLAKI IDEGG YNKSQIFN 
VDETALN*KKTPSKTFTARDKSMPGFKA 
S KDRLxLI#IiGANFKIbKPMNDXi I L I 


3854 


17755 


A 


3877 


411 


3 


AKI P P LTS S QGHRTRVCQ KKRERE RE KE 
KEEGRKERERERKKGRKKERKEERKGS * 
VRMLFLDKVDFRSQKI ITDKEGHYIMIR 
E * MHQ + D 1 1 LNMYI SNNRALKYMKQKLT 

EIjIGRVGRPGRSGTVPGRRWGVQ 


3855 


17756 


A 


3878 


174 


375 


SFQQ*TAELEIKNTTSFTLTPPKIiKYLG 
INLTKYVQDLYK 


3856 


« ******* 

\T151 


A 


3879 


1 


106 


FHRVSRDGIjNIjLTS * S ARLGIiPKCWDYR 
REPPSPA 


3857 

> 


17758 


A 


3880 


383 


24 


FALNNFPQGGAFSGPLTLS pqvglffap 

gppppgggfscpiaqsffrifpqksgpp 

ALSFS PLFKKNPVVFFIjRTQPLLPLFPQ 

prvfffppfffff*drvslccpg*sava 

QYGSLQP 


3858 


17759 


A 


1 DO * 

3881 


400 


71 


R I RQENRLN PGGGCYSEPRSRPRTTAWv 

tg*dsisknnnknk*nlsfiyfyyyflr 
wsltlspkl 


3859 


17760 


— * — 
A 


3882 


282 


•> 
3 


I FNJTOG x I Vb V i IYE VQEIF* YKHAMCK 
NHIVENGVSIPSSIYSLCYKQSNYILLV 
1 JjKi ACj WAy Wlj JL rVI PAIjWEABAldUoRS 

QEMETILA 


■joOU 


1T7I»1 
J / /Ol 






l 


1 /j 


T .VKNPPIf T »AFHf3fZWn T T .PPT.RnRnrT 
u l r^iMC c lutficnuu v I u i utuuiAUAULija 

PGG*GCSE FKSCMCTPAWVTQQDPVSKK 

FF 


3861 


17762 

• 


A 


3884 


415 


233 


SQDGLDLLTP* SARLGIjPKCWHYRHEPP 
C PAANFFL KAR Y F WLMVAR Y FWLW CNY S 
ALPS 


3862 


17763 


A 


3885 


237 


367 


GPVUGPGMVARTCNPRJ. IjGGRGRWIT*G ; 
REFETOLMJMAKPCIi 


386*3 


17764 


A 


3886 


359 


108 


tgoggggppppplgrlkkkngvnpegkn 
snkpklaps pppggkrkppskkkkkkkg 
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SEQID 
NO: of 
nucleotide , 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M, 
eth 
od 

• 


SEQID 
NO: in 
.USSN 

AA/C1C t 

09/515,1 
26 

• 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 

A' % 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A»Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F^Pheny (alanine, 
G=Glyeine, H-Histidine, l n Isoleucine, 
{(-Lysine, {^Leucine, M-Methionine, 
N=Asparagine, P=»rrolme, 
Q=Glutamine, R^Arginine, S=Serlne, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y«Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














G*HLPMLPRL<3SNSWAHDMIIiPPYPPK 


3864 


17765 


A 


3887 


245 


377 


QFI KNI SWPGAVAYACNPTTI*RG*GGWI 
KRS RV* DHPDMVKPHL 


3865 

■ 


17766 


A 


3888 


406 


2 


SLNI FFFFFFFFFFFFFFFFFKPGRVFP 
LKNFFFFP FKGCR I PF P P IFPLWLKVFF 
GPTPL*NPRG* KEGFCPPPFFFFPFPFF 
♦GRVLLCRPGWNAVAQSRLTATSAHAS 


3866 


17767 


A 


3889 


3 


114 


EFHCVSQDGLDLPTS * SAHLGLPKCWDY 
RRKPPCPA 


3867 

■ 


17768 

* 


A 


3890 


395 


1 


MVLIS* PCDPPTSASQSAETTGISHRAR 
QFPMLIiSQ*LVCHSVTBVFLIFLLKLFY 
IYDILLYI IYNYILLI IYYLFLFLLFFE 
TGSCSVIQTGVSSTITAHYSLKSHPSGL 
KRSSHLSLPSSQDYRCAPT 


3868 


17769 


A 


3891 


2 


189 


GRVGERGGTLL* SQLLRR* ENHLNLGGK 
GCS E PRLHHCTPS WATEQDS I SKKKRS F 
IKALHM 


3869 


17770 


A 


3892 


410 


258 


GC*GCSEI*SRPCTPAWATE*EPVAQKE 
KRKKKRKEKKKHVLATLAISSLICL 


3870 


17771 


A 


3893 


244 


412 


GIEEACGNFRVIVLKSSLALEGEEPSGR 
DSS * EASGGRARWLTPVI PALWEAEAGG 


3871 


17772 


A 


3894 


208 


2 


RISPPITPFPPWVTKKTKG*PGFFFQKN 
* KS F F F FLR W S LAL VPQ AGAQ * LDLGS L 
LPPPPGYKPFSC 


3872 

- 


17773 


A 


3895 


407 


18 


PLLGHPPKKAGAPIiIFNLPRRPK*KPEF 
PPPPLSPTQPPPPPPPPICPAWLPPPGV 
PSSFKG*MGGPP*RPEV*GPINPFWAPS 
LPVWPQNPIPFPKKKKWGEIHCMVCPLS 
FNETF3 FFKMESLRDGK 


3873 


17774 


A 


3896 


82 


401 


KGFLKNSPKGEGLFLFSFPFFFLFLATW 
LWPRLKCSGAI IARCSL*IiLGSSNFSDP 
PT STFGE AGI TAAGHAQFPP PDS PPPKT 
RAHHVTQPGLKLPS *SSPPASA 


3874 


17775 


A 


3897 


263 


396 


VWWPGGIX3*Y*TPPU£K3QGGRITRSGD 
QDHPG * RGKPVSTKNT 


3875 

■ 


17776 

i 


A 


3898 

■ 


2 


386 


KTGIiELLGLSNPPASASQSAGIIGMSHC 
AWPNLFLYWNHHLYPHKK*VWPGTVAHA 
YNPSTLGG*GRRIA*AQEPKTSIi 


3876 


Mill 


A 

■ 


3899 


140 


33 


IFFFHL*GEIGFTMNMLDIGGGFTGTEF 
QLEBVNFS VNDINKI IAFFVT*GVDV* S 
RIFTMAKTKPYNISLKLCS*KTF 


3877 


17778 


A 


3900 


377 


2 


TPPKENGGLPKI KVPYFRGNKFKKFPLC 
FSKVPPISTMWVLKNQFQPPIPFLKKNF 
FKTPKKGVSHYNFP PQPKS * GVFFFS KK 
KNFYP FF WEEQKKQRGPNAFLFFLKKKK 
RAAARPTRPPTRP 


3878 


17779 


A 


3901 


2 


123 


RVSQDGLNLLTS * S TCLGLLKCWD YREE 
PPCPALRCVFYG 


3879 


17780 


A 


A /V A A 

3902 


188 


3 


LRCTWCPGTDGWVSHLAHL* FKKGRPGA 
VAHACSPS ILRGQGSWISRSGVQNQLGQ 
HVETP 


3880 


17781 


A 


3903 


422 


238 


ILHVQIiPSSWDSKRKPFFFFL*R*GFTM 
LPKLLLNS*GSSDPLASACQRLGITGMS 
HYTWP 


3881 


17782 


A 


3904 


414 


1 


STHLGLPKCWDYRHEPLHPASMSYL**I 
FFIYLFTHWSVSLTARMQFAEDRDLIFL 
LHCYISSDSNSTSWL*YILVFVHSPWI> 
VTTIALILGFYVFVFVFLFPFETGSRSV 
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SEQU) 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: In 

T1CPM 

USSN 
09/515,1 

26 . 


Predicted 

i 

beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
OCysteine, D=>Aspartic Acid, 
E=Giutamic Acid, F^Pheny (alanine, 
Cr=Olycine 7 H-Histidine, Hfeoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N»Asparagine, P=*ProIine, 
Q=Giutamine, R-Argtmne, S=Serine, 
T=Tbreonine, V=Vallne, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














AQAGVQW*GLSSLQPSIPPGFK*FSC | 


3S82 


17783 


A 


3905 

■ 


396 


236 


DRVS LLS PRLECNGTIS AHCNLCLLGS S 
NS PGL* NRGHQ I FS IEGQ I FLQAMSLSQ 
LLCSAL WQKQP YTMYKGMEMAVCL 


3883 


17784 


A 


3906 


377 


57 


RGLVFFFQNPQA* KNLGRGRVF FPSGAQ 
KNPQKGGWAFFGRPPPRFGGRPRGPGAR 
CPLNLTPGGP KGKALGPP PWG WGS KFQK 
GVPLQWVGGNSGLFFSKKKHKL 


3884 


17785 


A 


3907 

• 

i 


341 


3 


TRFFFFSPLGKKGFFSKGFKLGTPRVFP 
LP P FLNPGPG I YFWGPI KKKKTFP PPGV 
KFGSFKRAPPFFFFLMFIAALFMWKTW 
KQPRRPSVSE**NKLSYL*TTEYFFSIK 


3885 


17786 

* 


A 


3908 


3 


162 


MMRKAI RGHL * NN PALE KLL PH I RGNVG 
FVFTKEDMAPFQKPALLQHHSQYI 


3886 


17787 


A 


3909 


1 

• 


421 


PTRPPTRTRWGFTMLVRLLLN* KHSQAG 
AVAHACNPSTLGGQGRQITRTGVQEQPD 
QHGETP S RTRGRTRG 


3887 

- 


17788 

■ 


A 


3910 


2 


424 


PRVRTS PGLALS GLTGLKFS KNQ KKKGK 
KPQTTKTXJKKKKS PKKKKTPKKKKGGAP 
LKKKNFNARGGENFFFF*GPKKKNPGAR 
FKKGGGGKTRGTPKiqCAVGKTPFFRGGE 
KKKESPSQFYKKIKKKVFGGKKIFP 


3888 


17789 


A 


3911 


3 


236 

• 


GFCHVGQADLELLATDDLPAS APQSAG I 
SGLSHCAWPV* PHVMSSLKSLLATRPSL 
HR VKLQ KNGLRKTRE L YMKNS 


3889 


17790 


A 


3912 


304 


2 * 


VAKKGKSKPGGLKKNPPKIGLGYPQGGP 
QNQ I WEKFFFLTP KNRWGAPL * GGGGPP 
SLPKKRGPPPPGWSNPPPPFFFFFFLRQ 

n /mt fnt Am/it fAT.mnT nf 

SCSVTQTGVQWRDLEL 


3890 


17791 


A 


3913 


91 

■ 


344 


HAAPVCGRRTLKVHPHAPSWPPS3WAHP 
PPTWPEPSGTTWAPEGTWTLGPPQELS 
LRRGVSYHCPPP*PLIWGRKWPPPLPCL 


3891 


17792 


A 


3914 


328 


402 


KSWGGWPXAVAHVCHPSTLGG*GGR 


3892 


17793 


A 


3915 


16 


179 


GGGCS ELR * RHCTSAWVTRARLS QKKE K 
KKKG PRKGEVATG SVWTRGEL KS WAL 


3893 


17794 


A 


3916 


229 


46 


AGPELLASSDPLTSASQSSRITGIPFLT 
GLYSKDHIIETANISYTNA*AL*ITLRT 
RGSTR 


3894 


17795 


A 


3917 


42 


383 


AFPCFVIGRVISKGKECTLYFFLFLFFF 
FFFEKKFFFFSQGGKKGGEFGSMEPFPS 
GLKETPLPHLLEEGKFWAPPPPPVYFFV 
F* *KRDFTLVARWSNRLPSREKGLDYR 
P 


3895 


17796 


A 


3918 


402 


286 


PPPPF*TPPPPKIFGAKKKKKNSPPPPG 
ESFSF+KRPPP 


3896 


17797 


A 


3919 


349 


380 


KKSPFPQ*RNFGSLQPPFPGLKPSSPLS 
FPSSWVPR*TPPNPANLFFFKKMRISHV 
AQ 




17798 


A 


3920 


28 


327 


KTx JjLNQATX XQAKAG rWGHPPPLSPRE 
NKQNQKKKXKKKKIQCKKKKKKKKKKKKK 
KKKKKKKKKNGGGGP * KKKNFSGGGGEK 
FFFFWGAKKNFGGGGF 


3898 


17799 


A 


3921 


414 


1 


NFPKVSGPKLMSPS PPHFFFPKKSFPPP 
PLFLEKKNGRFSPPFPEKKV*F* KKPPF 
FFSPKKKPRFPPKVLKVGGKKRVLKNLG 
PGLLS KKBRGTP P P P PLFFFFFF * DGVS 
LCLPGWSAWRSQLTVTSASQVQVQE 


3899 


17800 


A 


3922 


1 


169 


LRS CHCTPS WATE * DS VS KKKKKKTLKN 
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SEQID 
NO: of 

n mw A 

nu ci co uue 
sequence 
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SEQID 
NO: of 
pcpuue 
sequence 


M 
eth 

Ou 

• 


SEQID 
NO: in 

09/515,1 

• 


Predicted 
beginning 

nu Lieu uue 

location 

corn»puoii 

ing to first 
amino acid 
residue of 
peptide 

ft a V 1 Alt J* A 

sequence 


Predict- 
ed end 
nucie- 
otfde 

IvCaIIOIJ 

correspon 
ding to 
last amino 
acid , 
resiaue oi 
peptide 

PA Al • • A n A A 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 

I? — flti^o m If* A /*irl ITrsPh an«/la lanina 

Hr- vaiuiaiiiii. /\liu, t— r ncDyiaianine, 
G=Glycine, H=Histtdine, I=Iso leucine, 

I/" — 1 .vein a T .PiMMnA ft/f • — lV^^>t-l» u\nlnA 

N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V«Valine, 
WKTryptophan, Y=Tyroslne, 
a— un Known, — ocop couon, / = possiDie 
nucleotide deletion, \=possible 
ntivieoirae insertion 














. LSPQKLNRAGKIFLRVFPLEKKSQTQLL 
*KRAGKIFLRVFPLEKKGQTQLL 


3900 


17801 


A . 


3923 

> 


263 


74 


LLSVWRIPCAQKKERKKRYECECFLIVF 
v rUlMJuariois z lor lU'JLiJwVi.VL.vuvL.v 
CVCVW* VKNFRKS * * SSDRS KTKTI KKH 

EVSLPKE 


3901 


17802 


A 


3924 


3 


129 


TQFHHIGQAQLKLLTL*SAOW3LPKWWD 

VDLXTTD DUD ROUTT* 


3902 


17803 


A . 


3925 


281 


10 

- 


NLP.LPGATDYPASGGGGCNEMRSCHCTP 
GNFNMVYILDKITK* FLYFKQLILSFLM 


3903 


17804 • 


A 


3926 


110 


1 


QLGAVAHACNPST* GGQQQAEHLRSGIQ 
DQPGKHGK 


3904 


17805 


A 


3927 - 


180 


365 


GAI ITHGTLKHLDSSEPPTSDS * VAGTS 


3905 


17806 


A 


3928 


92 


2 


VRPGAVAHACNPS TLGGGBG* ITRSGDR 
DH 


3906 


17807 


A 


3929 


2 


146 


KLNLGGGS CNELRLHCCTPAWATE * DPA 
biUvKJtisr KJ. laftJf 1 l_y A VMJb 


3907 


17808 


A 


3930 


221 


3 


KERAIWGTELNMVSKIFPI *LKQRKNL* 
PP^IKGxTiNLLRP+IXSPGTVAHACNPST 
LGSRGGRI TRSGDQDH 


AAA 

3908 


17809 


A 


3931 


• 

1 


102 


GGRGCSEPSSHNCTPA* VTKRDS XSKNI 
NNKIVK 


3909 


17810 


A 


3932 


351 


69 


GQGGGFRFIQPPSSRGKRFFPPPPPPIW 

IFFFPPPGVPPPWPPQNFGFQGGAPPPP 
PPFFFFFFF 


3910 

* 


17811 . 


A 


3933 


401 


94 


RGPPPLNPPLWEKNHAGPPKVGG*KPPP 
PTKKKPIiFS*KTKIFPPGGPPPLIPPPR 

TKKKTPPPKKKKKKKKVS 


3911 


17812 


A 


3934 


184 


2 


LKE FFPS RNSRS VLHTSNF * HLF I *LI I 
YFFEIESYSVTLAGVQWSDLSSLQPRPP 


3912 


17813 


A 


3935 


2 


393 


KILIKVEIEGTYRKITNAICEKPTANIV 
PNRKKLKAFPLRNGTR*GCPVSPVLFNT 
TLEVIARVI YSEK* IKGTKIQISIATME 
NSKKFPQKTENKTTICSSNSTTGCISKR 

TTPMH T QlTRHTiHQPVYPMT 
jnJCii*ux loiuUiunor v i\*ctx 


3913 


17814 


A . 


3936 


291 


2 


FPPPPQTRGPKSSSRAHSPPGPGPGSTT 
GAR* PFRPLLLVRASLRPPLGLPTPTGP 
TGGMQRPEA*RTPATDKPQNSRRGP*GA 
WGRPGCRCRGGA 


3914 


17815 


A 


3937 


382 


1 


QW\L/\y\L Wb VyNEjjliJjr CljtUJnPPFblxLiP 
PSSCPGPHLGTPWTPAFPSHPTSTIiKAH 
PIGSGPSSPLLLTSPRMPPEIQTTATG* 
PASSLTPTVCSQHGNRFFKKGVRAGRSG 
SHLES QHFEGPEAGG 


3915 


17816 


A 


3938 


398 


2 

•• 


" PPPLLLNKIGAKKKAFFCPS PFGPRKI P 
CGQKTKKSVFPFRVP*DKSTFDCFLCPI 
VYVKKQIV?LAP^ECVFSGRVAQGSKKSP 
LCLSLPFFFFEMESHPVTQAGVQWRDFR 
SPQPPPPGPKPFSCLGIiPSS 


3916 


17817 


A 


3939 


240 


385 


KLLFLFLFIYFF*DGVSLCCPGWSAWQ 
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SEQID 
NO: of 

flUUCUllUC 

• 


SEQID 
NO: of 

pcpuue 

9C\J UCIILB 

• 

■ 


M 

eih 

oo 


SEQID 
NO: in 

26 


Predicted 
beginning 

n n j* L^Attrl A 
HUtlCULlUC 

Inmtinn 

correspond 

■ HE IV III vl 

amino acid 

I vOIUUv Ul 

peptide 

t ■ 


Predict- 
ed end 

IIUC1C- 

UUUv 

location 
ding to 

latt nminn 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartlc Acid, 
£r"vriu uiiiiit. Ai.m 9 r— rueny laianine, 

l"3=f"3lvf inf* Tf=T¥ictf rl inc* I=l6nlnii4»ifiP 

K=Lysine, L=L€ucine, M=Methionine, 

NsAcnflfApine. P^PVnlinA 

Q=Glutamine, R-Arginine, S=Serine, 
T=Threoninp V= Valine 

W=Tryptophan, Y=Tyrosine, 

X=lln known *— Ston cod An. /=nnc«iihlp 
nucleotide deletion, V=possible 
nucleotide insertion 














TLLTEPSAIPGAGGTPVSPI 


3917 


17818 


A 


3940 


230 


381 


LISSSYREYLAKDLRFSSKQS*WIjCSKE 
FRGLGAVAHVCNPS TLGGQGRW 


* 


1 /5l7 - 


» 




i 
i 


JU/ 


QPHRNLL 




noon 


n 




*» 




HT iT ,TQ * Q IVPT^nT.P TfPWT) YW P TCP PP P &T . P 

AFS 


3920 


17821 


A 


3943 


163 


404 


CRHKAQWLPQIiPGHLLLSEKTIiKYSSPK 
QWRNLGSLQPLAAGSKRFSCLSLL 


3921 


17822 


A 


3944 

! 


3 


200 


GLDIJjTS*SAHLGLPKCWDYRREPPRLA 
YFTF I KKKKKDGCELLNLPPNS CMDTAH 
VHPAHLTGQ 


3922 


17823 


A 


3945 


423 


2 


LF F FFSEV* SHSVAQAGVQW YNLGS LHS 


3923 


17824 


A 


3946 


209 


413 


TTCI EQCNKTRRLGAVAYACNPSTLRG* 
GRWITRSGVRDRPDQHGETT1QITNISTT 


3924 


17825 


A 


3947 


425 


3 


SPSQGLRDKTPISLGQSTWGKGQMWAQR 
QQT*TSWPDGSEES3RSPSTFEQQISER 
SSSAKGQTNSSSGSI^SVSPDWETPPSR 
SQQTPHTGELQLASGRCSSGMNLPEEGT 
DSNIiCCSATHAS 


3925 


17826 


A 


3948 . 


299 


3 


ASQTTFREGC FLFFS FF I S VCLF VCLFE 
MEFHSCRPGWMECNGDRARLHLKTTITT 
TIEIVKDDTNKWKHIH*PXPNIKMSILP 
KVIDILFNAIPIKI 


3926 


17827 


A 


3949 

• 


331 


374 


NAPKSEKftSKAF*AILEENKEKLPLDIN 
PV^PHGHIFKCVAPPHCLGWDLSAVPA 
SHALGFPQPQNSAWHVARSKSAASCLGT 
AVTSAGSGVLGTPNKMPQKVRERARLFW 
R 


3927 


17828 


A 


^ #-\ f s\ 

3950 


439 


69 


TNQKVWVPPKEKGIjr KKKQIiKPRVGGPA 
FHSPPLEGQGS*FP*GQRFKPRPPPWVK j 
PLFSKNPKNYPGGGAPPFFPPSPGG*GG 

PFPKKKKKKSH 


3928 


17829 


A 


3951 


422 


3 


CFPALGIGPPHPPBCRVG*KGGTPPPGLF 
FGKFLKEKNFGKNPGPGQGGPFFPPKKQ 
RGPFNNGRPGKVKKKGAPGPGVPKKKPP 

QKRKNFFFFFFLRWSFTNEAQAGVQWR 


3929 


17830 


A 


3952 


223 


3 


ALFFWGRGTLPRVKGLiNPKGGGGKFFFF 
FFF*ETESHSVAQAGLQWHDLG*LQPPP 
SRFKPFSCLRLPGS SLS 


3930 

* 


■» ion i 

17831 


A 


3953 


1 


too 

388 


liWjvb IrtjvjK 1 LonUKW UHL. IrV Wiu HKiJ 

S VSKKKGMSNNFSLI LRSWNNKGQGGTY 
FQQKFIYPCFLKIKGHSYINCKIFKK** 
NLSIiHVHVTDVNVRI ILOKLDCOIHWVD 
FNFELNILQCQGLGGCK 


3931 


17832 


A 


3954 


1 


396 


KLRPKDQDTWPTQQSAIWEFVPPSVYF 
VKSISEItiFFFFLKKNFFLGPQLKGRGQ 
ILVKGNPCPGGKKKFPAPPPQRGGKKGG 
PPPPQFIFVFLKKTGV*KIGPGGV*TPA 
JjGIAPPFFPKGGE *RGGPPP 


3932 


17833 


A 


3955 


2 


199 


P GGGGCGE P KTRGCT PAWAT ERDS VS KI 
NK* KLKLKNNS WLYAKS IASS PTGR* EB 
WGCFTNEHI 
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SEQ ID 
NO: of 
nucleotide 

i 

sequence 

'i 


SEQID 
NO: of 
peptide 
sequence 


M, 
eth 
od 

■ 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino add 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A~A!anine 
OCysteine, D=Aspartic Acid, 
HXSlutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoteurine, 
K-Lysine, U=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon. / = D0ssible 
nucleotide deletion, \=possible 
nucleotide insertion 


3933 


17834 


A 


3956 


393 


2 


. KEFPLFPPGAPPWPLSGPFSVIiEKPKCG 
GGWPG INTPPGGGGFPQGSLWAGDFP* 
PL FS PQ I PFFKKQKFF FFLPRGGLWAPF 
FPPFGGPLFSQNF*KTGF*GGFFFPFFG 
DGVSLCYPGWSAVAPHAS 


3934 


17835 

• 


A 


3957 


278 


390 


LVYI FKIRTCW PGAVAHACNPQTLQQ * G 
GRVTRSQDQ 


3935 


17836 


A 


3958 


385 


311 


GPKNPPASASQEGETTEV*HHAWLIFFL 
FFVELWSSHVAQAGtiKLPASSDQS*TLY 

I . . 1 


3936 


17837 


A . 


3959 


410 


119 


SPPPRFKTPPPBFIFGAPKKKFFSPPPR 
PLNLILLKGPPLFFFFLPKAFKPHINFV 
FPPQKKKNV* INLYFPPPKKENBSS *GK 
HFALHCNPSCISK 


3937 


17838 


A 


3960 


77 


400 


TALPLKPKKKKKKKKKKKKRGGPLKKKK 
FKAPGE EKKNFFKG APKKKFRGRV* KTG 
ERKKPGGNKKKTFGKKPPFPRGBKKKKP 
QA 


3938 


17839 


A 


3961 


2 


194 


CL*SQLLGRLRWEDHLSSGN*GGSEP*S 
PHCTPAWVTB *DPVSRKRKKYPVVNQTL 
RGFLSPAH 




1 7R40 








409 


NSKTS SOEKKRRVIRLSLRSRVGDDTKV 
NTRRWG VFGWHLKNLPTTLVLKKMEKI L 
ETYFMTPDLHDPHTTSYFNGCQLQAIKV 
GGTITDLIVF* IVSKPNAAGLSLNENVN 
AKFNDLI 


3940 


17841 


A 


3963 


382 


96 


LS SQLLREAGERNE* TGWLQQPLQMQIH 
KNPC I E PRSRYCT PAWATEQDPVS KNKQ 
TNKQTKNSLHL 


3941 


17842 


A 


3965 


3 


169 


DAWELRSHRCPPAW * QS ET PS QKKKKRG 
KKKI FRFYLFFNKRKKFLNGGNLGVI P 


3942 


17843 


A 


3966 


1 


121 


AGFHRVSSDGLDliLTS * SARLGLPKCWD 
YWCE PPHLAENS 


3943 


17844 


A 


3967 


200 


2 


PPFRIYWGEVLTPLFLRVKPKFFKPWGF 
PFCLFFFFFFEMESRSVAQA*MQWRDLD 
SLHPPPPAFK 


3944 


17845 


A 


3968 


471 


440 


ME LRF YHVGQTGLKLLTS * APPVMASQ3 
AGIIjGVSHSTW*EAPNSIiAITFLSSHLS 
PRQP*THLGC 


3945 


17846 


A 


3969 


214 


2 


EGQPECQLQAGDWVSLWNLVCSGLEVRA 
WREAPRLEWSFCLFFVSEM* SHSVI * DR 
VQWHNIGSU2PLPP 


3946 


17847 


A 

* 


3970 


53 


398 


GDIiPKKQQQQKPIiNFCFQLLLYARLFGP 
IFLV*HIATYNNKHLLECVGKVMINCQA 
DI LN KME I S KVGAQKKKKKKKKKKKRGG 
PLKKKKFI PRGGEENFFFLGAPKKRTGG 
GFK 


3947 


17848 


A 


3971 


396 

mJ J \J 


160 ^ 


KRERIKEKEDNKEEKDKEKKASGKKERK 
KEKE I RDR ES KEE S S NLLLQ EELSLCCP 
S PGHPPNS SL *DP YFATGNSLRC 


3948 


17849 


A 


3972 


416 


3 


FPFPRVPEGVPGGEGVFPPGFFGKSGVP 
FPRPRGGGAPI * KPPPFPGGPPNFFF * K 
KTGPPFSPPQGAPPAKRAPPPFFGGGGG 
KIFTPLKKGGLFFFGPPKKGVFFGPKKT 
LFFFQKKKKKKKGRPPRSHASAHA3 


3949 


17850 


A 


3973 


232 


381 


YKTFSLI F* LFFFRDRVSFCHPDRSAW ' 
LS*FTVASISWAQGIIiLAQPLE 


3950 


17851 


A 


3974 


396 


141 


CFFPPPPKRGFFPTPIFGGPPGFFFPPF 
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SEQ ID 
NO: of 
oucieotiae 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 

■ 


SEQID 
NO: in 

09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

resioue or 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
locaiion 
correspon 
ding to 

IamA A*W*|**A. 

• IBS I Amino 

acid 

resiaue oi < 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartic Acid, 
iir-tsiuiamic aciq, r— ir nenyl alanine, 
G=Glycine, H=Histidine, I=Iso(eucine, 
jvFUL/ysine, j.*-— Leucine, lvi— ivieinionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i— i nreonine, v— valine, 
W=Tryptopban, y=Tyrosine, 
a— uoKnown, — ocop couon, /^possioie 
nucleotide deletion, V= possible 
nucleotide insertion 














FLKGP P P FFFFFFFFFFFFF FFFFFEDG 


3951 


17852 


A 


3975 


286 


422 


RKQLIS PHSQDNSKQKEQS WRHHATCLQ 
GITLPVPKLYYKATVTKTAWYWDQK 


3952 


17853 


A 


3976 


3 


116 


GFHRVGQNQFDLLTS*SACLGLPKCWDY 

OUT7DDOOT1D 

Krtii e ir KJr AX 


3953 


17854 


A 


3977 


223 


399 


EGPRTQS FFDTEVHS VTQDGVRRRNLGS 
LLPPPPGIK*FFYLSLPSNWDYRRLPPR 
PVW 


3954 


17855 


A 


3978 

i 


227 


422 


S FPS PC I FFS LVDLG VSDYCLGWSQTLD 
LKGSSLFSFPKCWDYRQ*ATTPOLPVYF 
FPFPVFTFF 


3955 


17856 


A 


3979 

■ 


519 


313 


KKNLLNPGDGGCRE IKSSHCAPAWVTEK 

DCXKiUUCKKTSvCVCVCV 

SSLGLFPEPVGLN 


3956 


17857 


A 


3980 


217 


a m a 

414 


I YI FOGDGVS IiCCPGWLKQS FCLS LP KC 
*DYRHVPPRPTSFHLLTNIGVLQFLKLQ 
ATLS FTLFLL 


3957 


17858 


A 


3981 


2 


409 


KLTFAV*DY*EKLLkSLASLTSQAAFLF 
PEHPYTVISNPCSPA^YIFTLSSGTTFH 
QFYAKS * CVTYE * KKKNLLS PPTGDKPN 
*GRSQTKKIQTKY*SEICFFVHLFETGS 
HHVAQAGLELLSSRDPPTSASQST 


3958 


17859 


A 


3982 


234 


416 


S VLPSTPLLNKL* KSKKKKKKKKKKKKK 
KKKKKKKKKKKKICKKK5RGGAPKKK 


3959 


17860 


X 


3983 


414 


291 


GGRACSE PRSCPCTPAWATERDSV*KNK 
QPNKQKNKKDLL 


3960, 


17861 


A 


3984 


192 


A ■% ^ 

415 


rNANKLDNLDE^RFLETQNIjLRIJQNEE 
TBYQNRPVTREB I E * VIKNLLTEKKNPG 
PDGFTGEFYQTFKEKLIP 


3961 


* *^A**Av 

17862 


A 


3985 


403 


145 


AFFFFFF P PGEKGD FS PT VLFGGP PGFS 
PPPVFKTRPRNFFLGPPKKKYSFPPPGP 
KNWFP*KGPPPFFFFFFFFFFWKLGIEV 
IS 


3962 

• 


17863 


A 


3986 


255 


425 


GGRFKGSNFTSAGLQGFXXXXGPPKFIS 
RPGV* QRGEWKNPGVPNLFALSTS PFGR 

w 


3963 


17864 


A 


3987 


247 


2 


EEAVRVSNIiYTKLDEEQ*WKKQLNYVE 
RJjKK* b Jjr LifJULi i KJv V fLiKX w 1 F * VKJNJj 
NSFWPDAVAHACNPSTLGGRGGWIT 


3964 


17865 

* 


A 


3988 


1 

• 


394 


HHHTWI I FKFFCKDGVSLCCLSWARTPG 
LKRSSCFRLTKCWDYRHEQLCPAFKHFF 
LY*HNFMSTEKMQE * HKNS * YPS PTVPK 
Gvl FY a JL S i J. CJp I A LiLLSAl XX ItF VHT Y 
IVSLNHVCYSNEAPLSLNI 


3965 


17866 


A 


3989 


208 


402 


ETDRRIaKSGGSLSKTLI KCMTCFWFRD 
GVSLCCPGWIQTPGIiK*SSCISLPSSWD 

VT?WMDPf*PR 
I nlllvlr w^<f* 


3966 


17867 


A 


3990 


2 


408 


THRP YHATPTYLPF YTNNLI KGGKKKKK 
KGGKKKKKGGPPFKKKHFSPPGGEKNFF 
FYGPQKKKSRGRV* KRGEGKNPGAPQNK 
PLGKNPLFPGGEKKKKPRFAS * PPPPPF 
S FGKKRAP PTKYPGAGIjHP 


3967 


17868 


A 


3991 


81 


406 


KGGLFLLPRGKGKGKIWVKGNLGLKNKK 
NFPA* PSEEGEKKGGPHQAGKILKFGGK 
RG LNKGGKTGQKL KPGG I PP PGP * KVGE 
KKEKPPAPGQKLFFKKRKGQKKNF 
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SEQID 
NO: of 
nucicuuuc 
sequence 


SEQID 
NO: of 

sequence 


M 
eth 

nrl 

oa 

• 


SEQID 
NO: in 

ITCC7V 

09/515,1 
26 

• 


Predicted 
beginning 

n it i» 1 Art ti A 

II u iicvuuc 

location 
correspond 
ing to first 

nmtnn ftciri 

allllllV Ul»IU 

residue of 
peptide 


Predict- 
ed end 

IIUI.IC 

otide 
location 
correspon 
Hinc to 

UIIIK IV 

last amino 
acid 

rpciflfie of 

peptide 
sequence 


Amino acid sequence (A a Alanine 
OCysteine, D=Aspartic Acid, j 
{^Glutamic Acid F^Phpnvfalonin* 

G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Metbiomoe, 
N=Asparagine, P^Prolioe, 
0=Glutamine. R^Arpinine. SsSerine 1 

T=Threonine, V=Valine, 
W^Tryptophan, Y=TyrosCne, 

X=Un known *=Ston mrfnn /snnjteihlp/ 

nucleotide deletion, V=possibte 
nucleotide insertion 




1 / OO-/ 

■ 


A 




jyo ■ 




YSSPPPTKKTDDSDASRGPFFFFPPPEK 
RGFFPNDFFWVPPGFFPPPGF*TPPPVF 
FFWAP * KKI FFP PPRE * KFFFLKGPPPF 
FFFFFFFFFEHLQVLLIC 


3969 


17870 


A 


. 3993 


367 

• 


193 


KLWE KKKI LWKKG * GE I KS CHCTPAWAT 
RVKLCLTKKKKKKKKKGGBGE I PKKKS F 
SFLGKOKKKRGPAFF 


3970 


17871 


A 


3994 


188 


2 


KKANTLPDFKTYYKAIVLKTLWYWRKDK 
HRSMKQN* ESNEVELLYHNIHRNKLKMD 


3971 


17872 

■ 


A 


3995 


2 


147 


QENHLNPGG*GCSELRS*HYIPAWATE* 
DS VSKKNYI * *TSHI FSGDP 


3972 


17873 


A 


3996 


23 


329 


RNSFIRSAFSNVLNKVVLFLTSEKKKKK 
KKKKKKKKGGRKREYLTRRGAGENNKYF 

GGKKKFPPFEEEGKKKPG 


3973 


17874 


A * 


3997 


164 


392 


KPLGRAGLVPCLCS S FTGFLM* KTEPCS 
VVQAGVQRGHLCSLQPQSPMFKRFSLLS 
LPSSWDYRVLLRHP3WSAW 


3974 


17875 


A 


AAA 

3998 


396 


3 


r r r r r P PGoKWG I b D b F Qbr r SG V KN p S 
PPPPPVNWGLRVWPP PQKNFFFFF* KKV 
FSFFWGGPPNPPFFFSLLLGGPPPPPIF 

CPI?VTT>'DCT.CT7I?T?T?l?T7'ffMWGnCVan3Vr , "\f ! 
obCiJVXIrrour Cf J7 c X c JSPUSO V>o VAyrtcv 

R* RHPGPLQPPPPWLKQF 


3975 

• 


17876 


A 


3999 


383 


2 


VPSPEKVGIKKIQPLDPFF*FLGDCQRK 
KKSPS *RPPPPPBGKKLN* IRIWGPPFN 

CV PTTTTW* TTP* ilTTf P FT?T?PPTffWRT *ftT?'P 
oAJLArr " Xvr rir Ivlr r r ~ irlrlNAJEAJJL v«7 IT 

SRAHSFPNQRALFWFFFFEMBSHS ITQA 


3976 


17877 


A 


4000 


213 


1 


RGRQN KT L FLTKPNQTKPNQTKPNQTKP 
TTFL*SVI E IRVKKKYKTTRKTTKNPTT 
SLTRSIjGRYKSPTRP 


3977 


17878 


A 


4001 


412 


129 


GSLSSPRLECSGTIPAHCSLNLPGSSSP 

r*T32VPPVARPF*MnOBinMPPr r riPVNFFF 

WGGGGI LVETRS C * VAQADLELLS FSDP 
PASASKVLEL 


3978 


17879 


A 


4002 


411 


293 


RDTRCLPP* LDNFVFLVEMGFHHVGQAG 
LBLLTSSEBIA 


3979 


i noon 

17880 


A 


4003 


46 


J92 


rincwirw it t? t?t jtz va a p pp xxzk vtwt^'DWOTC 
Uu" £ r r r f r iuiv.vjiir r r ruAAuu i rr f jx 

KKTKGGGPRGPPLBPPPLGGPGPPEPGG 

GDLKKP WPPQGTPPL * KKPKMGGGGGGP 

P*TPGTGGGGEKNPLTPGGGPKNGPPPP 

PPG 


3980 


17881 


A 


4004 


360 


I 


KCIjGFQFFFRFWNICI I FISQLS IASQV 
QKSEIQNAPMSIFFEHHVSFKQFQILEH 
FGFWIFILGMLSTYKTLILLKLN*RFNA 
MPINIPWI FMEIDJCLrLKFTWKYKEPR 
RVKAILRN 


3981 


178R2 


A 


4005 


152 

A J** 


408 


NKF VSHFYWMGNVLSRYPQFT* QFS FCH 
VRLPSVF*YLV*TRNKILIFFAHKAFFIiD 
AKYVCVCVCVCVCVCVCVYFGIGEGNLP 
S 


3982 


17883 


A 


4006 


411 


67 


YLPPPSSPTPGVFFFFPPPKKEFFPNPF 
ILGTPRFFPFPPF*KPPPKFFFLGPKKK 
K* FS PPRGKKI FFFKRAPP FFFFFFFFF 
FFFFLLGGSLSFMYFHIKTYK 


3983 


17884 


A 


4007 


217 


2 


PLFFFFKHLKAYYSDVIHISFAILRYLXj 
F * DRVSLCHSGWSAVARS WLPVAGQNTF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: In 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
£>=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argimne, S=Senne, 
Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














I TRQEAP YLLAI PRV 


3984 


17885 


A 


4008 


235 


406 


GRHRVSSNP IS I * RRRGLS LLPKLECSG 
VSIAHCSFDLPG*SDPSASASRLLGRLR 
Q 


3985 

■ 

* * 


17886 


A 

• 


4009 

■ 

i 


833 


237 

it 


SLDNRARPCL VS FI FKKGRS TAERCGAP 
GTTSPPFPRLSERGHPI I PS *RGWERKS 
QLCSHLQSSRDPSGLGRAGCGGGGDLRE 
SPGTFPGNGGSDLVPVPTAPIPIBPQIC 
QATCAVMKLSFDEEYRRAMNELGECPRS 
SGKPS SS PRTGGGSSAAKGTQAGSSELG 
LGIGARRGTDSLBSHSTSHEWTVTLGPL 
SP 


3986 


17887 


A 


4010 


276 


2 


VBVKGNNS F S EGS ITL I PNPDKQPKETT 
DQPLMSTYAKIESPGKNPSHDG*LIFDK 
GAKI IQ*VKNS I FNKWCWDNWDFHVKTM 
NPHFTLI 


3987 


17888 


A 


4011 


188 


27 


RBKFQI YTIKNDXGS ITTDTTER * KI IR 
DYYEHLYAHKLENLEEMHKLIGIPP 


3988 


17889 


A 


4012 


1 . 


202 


CLHQKVIISN+L*ATTQTTQLSLSFKLD 
YFSII IIPVALWTWAI IKFSL*YINSP 
PKKKEAEKDLN 


3989 


17890 


A 


4013 


1 


403 


CIiDQEVIISN*H*ATTQTTQLSLSFKLD 
YFSIIFIPVALFVTWSIIEFSL*YINSP 
PKKKKKKKKKKKKKKRGGAPLKNSRGGP 
NFWGGGKNI FFFFGGGDKKPPRAFWKKP 
LFLGGGNLGPPPPQKFTPWGKK 


3990 


17891 


A 


4014 


67 


439 


TEL 1 1 FYIFFETTLIPTLAI I TR*GNQP 
ERLNAGTYFL FYTL VG S L PLL IAL I YTH 
NTLGSLNILLLTLTAQELSNS * ANNLI * 
LAYT IAFMRKKTKKKKGGGLLKDPWGGQ 
ILAGREKIKFFP 


3991 


17892 


A 


4015 


3 


260 


LI VPTIILLPLT* LSKKHI 1*1 INTTTHS 
LIISIIPLLFFNQINNNLFSCSPTFSSD 
HLSHP I LLKKKKKKKKKKKKKKKKGGGA 
F 


3992 


17893 


A 


4016 


258 


3 


LKKIFSPPTINLFFPPFPLKKFFFPLSL 
* FFLGVFSP FFPPPKKGFFPKI PPGVFF 
SPPFKKKKFFFPPPFYFAPPRVFFKGPP 


3993 

« 


17894 


A 


4017 


287 


2 


KKQKNFIFFFKNLSFFGRGAFFPPLFPP 
ILRGKGGDFFYPRGLKPALGNQ* NPFSR 
F* I FFFFFFFLRRSLAPVTQARVQWHDP 
GSLQPSPSGFK 


3994 


17895 


A 


4018 


228 


487 

* 


SYDVKQEKPNGI *ASKYWAYVFHSFFLS 
FFFFFFKKKNFFFCPQGGGQGPNLSLRE 
PSPPGA^KKLFGLNLSKSWE*QNCPPPPI 
IF 


3995 


17896 


A 


4019 


35 


228 


EELNHLNQGGEGCSEQRSHHCTPAWATE 
*DSSQKKNKFWYIHPCNRIVTTTNTPMT 
BSPKHYAE 


3996 


17897 


A 


4020 


2 


402 

- 


ARGNLNTTFF0PAGGGDP ILYQHLF* FF 
GHPEVYILILPGFGIISHIGTYYCGKKE 
PFGYIGMG*AMISIGFLGFIV*AHHIFT 
VGI DVDTRAYFTSATI 1 1 AI PTGV* VFS 
♦LATLHGSNMK* SAAVL * ALG 


3997 


17898 


A 


4021 


1 


420 


TRGATELIILDI LVETTLI PTLAI ITR* 
GGQP+RLNAGTYFLFYTLVGSLSLLMQL 
I YTHKTLG* VNI LLLTLTAQEL* NS * AN 
NLI * IAYTIAFI GKI PFYGLQL* LPKAE 
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SEQ ID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 

- 


M 

eth 

od 

• 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cystoeine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyl alanine, 
G=Glycine, H~Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptopban, Y^Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














VEAP IAGS IGLAAGLLKLSG YG I XRLTL 


3998 


17899 


A 


4022 


409 


3 


LEMASCLIICNNNNPFFYQIVTCDEKWI 
LYENW * LPAQWLH * E BASKHS PRPNLHE 
KKIMVPVW*SADGL IHYSFLNPCETI TS 
E*YAQQINVMH*KLQGLQLTFINRKGPI 
FLHNNTKLHVAQPMLQKLNELA 


3999 

1 


17900 


A 


4023 


464 


374 


HLPPWEDKARRHLPDARTFI LAFLASRN 
VRNKFFSLQATQFLVLYAS STKI TKPLK 
TUQKKRLKKS EQS LRNM WTP I KR TNI C I 
VGI IEGKEGEKGAERIFEEIVPQTF*NL 
VKDIK*KVQ 


4000 


17901 


A 


4024 


58 


323 


RHKLLCEHVKKCS AVLI IREMQTKTTLK 
YHFLLVTIAKI FNL IHF*RSVGETLI CQ 
WAGRNAKS * P FWRR I SQ YLTKLYVCVCM 
YTHS 


4001 


17902 


A 


4025 


480 


150 


RNSLLRRLRQEDGMSFGCQGCSEL*LHH 
CTPAWVTEQDPVS KKKKKKKAYPDP I PK 
KTDKSNGKKGPDFSRKNPPKNS PPPP WQ 
RSTAFRGRAAKSCPTH 


4002 


17903 


A 


4026 


204 


1 


KKKFGALFFF!FMQGIiTLLPRLEYR*WCD 
HGSL* P*PPGSSDPP ( TSVSQIAGTMGVH 
HEFLRSGWLSK 


4003 


17904 


A 


4027 


116 


472 


MPPLPKPVSGPRPHSCSPLGGPQLSTPL 
TGPRPASLSLPISEGICGCITAIIjWAVW 
YLVLQMLGLiSLFMA* RGAGS S CSAHKKW 
WDGLASPSOTFFKNRTKGRARWLTPVI 
PALWEAE 


4004 


17905 


A 


4028 


355 


444 


VSYKKK* SRPGAVAHACNPSTLGG*GGR 
IT 


4005 


17906 


A 


4029 


410 


181 


ASIILLIAVFFNNILSGQ*TITNTTNQY 
SSLI I IMAIAIKLGIAPFHF*VPEVTQG 
T PLTSGLIiLLT * QKIiAPCF 


4006 


17907 


A 


4030 

■ 


49 


474 


PXRXXFCHIVTYYSGKKEPFGYIGMV*G 
MISIGFIiGFI V* AHH I FTVG I DVDTRAY 
FTSAT 1 1 1 AI PTGVKVFS * LATLHGSNM 
K* SAAVL* ALGFI FLFTVGGLTG IVLXN 
SSLD I VLHDTYYVXAHFBYVLS IGAVFA 
II 


4007 


17908 


A 


4031 


274 


427 


FFF* EMESHSVTQAGLLGRNHSLLVPQT 
PGLKRSSLLSLLSNQYYRSGPHT 


4008 


17909 


A 


4032 


205 


393 


TWGGFKNPSPGEG* IKKMHDFNEMFLQN 
CWKILFFGGGFFTGQIFPRAAPPINKNI 
LVWGGGK 


4009 


17910 


A 


4033 


2 

■ 

■ 


386 

• 


SEPLSRKKLYLS ILI SLQISLI ITFTAT 
ELI I FYI FFETTLI PTTaAI ITR * GNQPE 
RLWAGTYFLFYTLVGSLPLLIAL I YTHN 
TLGS LNI LLLTLTAQE LKKKKKKKKKKK 
KNIGGPPLRAPSGAPN 


4010 


17911 


A 


4034 


3 


169 


GLFLAMHYS PDASTAFSS IAHITRDVNY 
G*IIRYLHANGASIFFICLFLHIGRGL 


4011 


17912 


A 


4035 


3 


317 


HASADAWAFIGYVLP*GQISF*GATVIT 
NLLSAI PYIGTDLVQ* I *GGYSVDSPTL 
TRFFTFHFI LP FI IAALAALHLLFLHET 
GSNNPLGITSQKKKKKGGPF 


4012 


17913 


A 


4036 


446 


194 


KKKNQPGGPPPPPAFPVGGLIKKRFPPT 
PGKKFKFR *KFFPPKGGFFVRDPPPKPG 
CWEREKKKKPLLNTDKWV1LEGSEAGLF 


4013 


17914 


A 


4037 


213 


1 


GMAKVKARFLPQKRGQRVLNPPWWGKGF 
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Predict- 
ed end 
nucle- 
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Inrnrfnn 
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ding-to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

Rsf^liitnmir Afiri IT=Ph<invlnlnnin* 
E/^vj luiaiiiiv. nviU) r— * iiviJjrHtuuiluO, 

G=Glycine, H=Histidine, I=lsoleacine, 

N=Asparflgine, J*=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonlne, V=VaIine, 

X=Un known, *=Stop codon, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PNSGTKETPFFFF*DRVLLCQPGWSAVA 


4014 


17915 


A 


4038 


307 


36 


RKGFFFSKCGEFKKLPGPPLGOWYLKLI 
FOQKKFFHPKFPGFPG* IFLPPPKIFFI 

WAQANS 


4015 


, 17916 


A 


4039 


2 


372 


NDTSLTVKS IGHQ WE*TYE YTDYRGLVF 
NOTILPPLFLEPGDLRLLDVDNRVILPI 
BAPIRIINTSQDVIjHS*AVPTLC*KTDA 

XlruriCuMUl Iff L\9 Xks.tr a vi XuyLsKlLwi 

HRRSMLIVIiELI 


4016 


17917 . 

i 


A 


4040 


107 


279 


LLLLFFFFWRKSLALAPRWBCRGKI *GH 
CKLRLPSPCHSPA*ASPBAGTTGWTYGC 
F 


4017 

i 


17918 


A 


4041 


3 


351 


GSH Y I AQAGLKNPGS SHLFASAAHSAGI 
TGVSHHAWLKI FFKES VSD YS PHLNSRK 
NAVYVS F FFFWKKS FWVPQAGG KGLNL 
GSLKFSPToRLKQFSCLTLPRS *EY3IAP 

T3T5T5TT 
PPFV 


4018 


17919 


A 


4042 


1 


385 


RPRRPD IE* QERRTQE VLQAVAKKVKB E 
SQLPGTGGPKNVLQPVPRAKAERPKKQA 
EASGLKKBTDWLKVDAQEAKTEPFTQG 
KGGGQTTPESFEKAPQVTESI*SSELVT 


4019 


17920 


A 


4043 


3 


282 


TAILIQTP * SFTGAVILIIAHGLTSSLL 
FCLANSNYERTHSRIIILSQGLQTLLPL 
IAF* * LLAS LANIiALP PTINLLGELSVL 
VTTFSVDAA 


4020 


17921 


A 


4044 


2 


407 


GTRLE IMSRNHGI FPFTLE I FKDNEFEE 
PYREALPTIjKIjRDSIiYTOXBPBSHVSG* 
ESLEESCFATPTSKIDEVLKYYLIRDG* 
VSDDSVKQYTSRDHLAKHFQVPDFKFEG 
KDHKBVI LH * RD L VCGVMDE R SR 


4021 

■ 


17922 


A 


4045 


4 

• 


380 

• 


ENWLTI IRQAWHEADRNLNTTFFDPAGG 

/-ir>rjTr VTttJT C*DT J^TTT3I?r r VTT TT TIOC/^ T 

viiJF x Jb I OnJjr w r JLAwi JrKCa I x Jb x Urbrtjl 
ISHIVTYYSGKKEPPGYIGMG*AMISIG 
FLGFIV+AHHIFTVGIDVDTRAYFTSAT 

T T T ATPTVJVKVTTC. # T.ATT.Wn°.WMTr* QJVP 
x X XnJ, rivjVRV CO Xirll lAlvjOlirui" onl 

IIIAIPTGVKVFS 


4022 


17923 


A 


4046 


2 


94 


ADRNLNTTFFYPAGGGDPILYQHLF*FF 
GHPEGYI L I LPGFG 1 1 SH IATYYSG KKE 
PFGYIGMV*AMISIGFLGLIV*AHHIFT 
VG I D VDTRAYFTS AT INIAI PTGVKVFS 
* LATxjHGSNMK*S I PTPILI FRPP 


4023 


17924 


A 


4047 


376 


138 


LLARQRQENGVNPGGGACGE PRS RHCTP 
LLALPSEFFHFRILIVLQPHHSQ 


4024 


17925 


A 


4048 


145 


226 


EP IQACS SRRVCE P I QACSS RRVCVCVT 

RAHPGMLFPACVCV* E ? IQACS SRYVCV 
RAHPGMLFPACV * AHPGMIiFPACVCVCD 
PIQACSSRLCVCV*ERAHPRLLF*ACVC 
VCESPSRHALPGVCVCVRAHPGMIiFPVC 
VC^PSRHAIiPGVCVSPSRHALPGVCVC 
V 


4025 


17926 


A 


4049 


2 


414 

■ 


ADRNLNTTFFDPA*GVDPIIiYQHI«L* FF 
GHPEVYILILPGLGI ISHIVTYYSGKKB 
PFGYIGMV*AMISIGFLRVIV*AHHIFT 
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Amino acid sequence (A= Alanine 
C=Cysteine, JD=Aspartic Acid, 

IT^f^llltliniic* A fir) VsPhanvlalaninA 

kj— vjiuiuiiiiL. Atiu ? jt — 'j aenyiHiHninc, 
G=Glycine, H=Histidine, I=Isoleucine, 

iv jli jr o i uoj i^u;ui.iucj itj— lTicin luiiincj 

N=Asparagine, PMProline, 
Q=Glutamine, R=Arginine, S=Serine, 

* — i Hi cuiuiic, y — valine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

iiunvuuuv ucicuuii, x^puasiuie 

nucleotide insertion 








■ 






VGIDVDTRAYFTSATI I IAI PTGVKVFS 


4026 


17927 


A 


4050 


472 


228 


LTRDGGACPFSKLFRRVRQKNPLVLGE * 
SCSEPKLCPCPPAKKTKDYPVS KKKKRF 
IWI INRTKIPRKRKKNKAYHKVKYF 


4027. 


17928 


A 


4051 


238 


3 


RRHGSFAPLPPGLPLCISYIWLFITILC . 
NILYLYFETGSRYVAQVGLKLLCSSEPP 


4028 

■ * 


17929 


A 


4052 


225 


172 


IARYPFHIQSCTNMPSPNKS ITSKEEDN 
GPGAVAHACNPSTIjGGQGGQITKSGVIUD 
HPG*HGLV 


4029 

• 


17930 

+ 


A 


4053 

■ 


471 

. ■ 


30 

• 


GPGGIKISGALFPtiNKVKRVPWFPRPP 
LI FPQKGVFNQNGGDPGGGFW PGETPGR 

VPPPPGVFFVKSPFLSLPFPPPPV*KTA 
VKTGFPYRPCGVPPRGLVFPKKKKSVLN 
RPDLSMGPQPE*SLPLPGEPSVSHAGCL 

PQAWRG 


4030 


17931 


A 


4054 


425 


41 


NWGSGTLCPQKLGVFPPPPPGEGGVPGA 
PPPARLFFFFF*KKGFSPVGRGGFKFLP 
PKNPPPLPPQKVGIPEGGPRAGPIFSPP 
QLFFFFLTWSLTPSPRLBCSGAILAHCT 
PAWBTEQDS ISKQTkk 


4031 


17932 


A 


4055 


312 


1 


IKNARAHLLPQGSPPPLIPIWIIETIS 
LLIQPIALTVRLAANITAGHLIMHLIGS 
AALAISTriLP*TRIMFTIIIERTILEI 
AVALIQAYVFTLWS LYLHA 


4032 


17933 


A 


4056 


648 


123 


DPDP VPSARGCVPS GAPGRGAQLKKAWD 
GALA* LP PCLCAQELPS FS LGTGGPAVG 
LSWSERDGRLGEVS INAEFTVAAFLKG 
TQSRRGINGMPLPQREPSSRYCLGLGGE 
SSAQGTSGLTGQRV*QNSDLPELASGDD 
KGQRSSQGGAVSQSSPRGRQIPPSSPPA 
ALFNFL 


4033 


17934 


A 


4057 


135 


464 


QHNSRLMQQAKKGVTVLARVIDLDYEDE 

X SLXjXiHNCAsKCi-Ei * AW JVTG Jo VXasCXaL VX*P 

CPVINVNGKLQRHNPGRTTNGPDTSGMK 
VWVTPAGTKPQPAEVLAEGKENKBCG 


4034 


17935 


A 


4058 


3 


575 


LRSRPLHIILSCGLLVTPRSLSPPPPPQ 
RLRLCRPSRGAAEFFFSLRTKLHFATLP 
JjK X Euro i^Iij JN Xj IHGC* lb* Xrixlvjvj 
LFDVAAKNIIHEEVEKYDKQYRGKELLG 
FATYKTFEIIVHQYIQ*LVEPALSMLQK 
AMETIQQAFINVANKHFGEFFNLNQTVQ 
STIEDIKVKHTAQAEYMIQLQVR 






TV 

A 


4059 


Jo9 


J 


TPPPPPFFGVFPEREKIFFFLKGALPLF 
SFRVTPFFGLKRGFFLSPTPPFFFPPFF 
WPPPFFFFFFFFFPLRRSFALVTQAGVQ 
WRDLG*LOPLPPGFKR 


4036 


17937 


A 


4060 


397 


1 


KRRGNFFFKKGISLFI*TPPRGPLWVFF 
F* KAPKGFCAKFFPFFYPPLPKENGKTH 
RQKTPFPHFCQIFKNGFWVLHAFSLGDF 
PARVWGPPFKRPFKGI FKEFPPYGSPPP 
KGPKKKKRTAARDLELADAW 


4037 


17938 


A 


4061 


38 


410 


AWISI * ERP FTLVRVFKNFLAI SGLLFS 
YFRINFLYVQIHIHTFFRKLFVLSYVFR 
FTDVKFFFYILPFFPPHFLFGWICSYAA 
L*FFIYFCFLPFFFNGPTSP* IGLFLLI 
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Amino acid sequence (A= Alanine 
OCysteine, D=AspartJc Acid, 

fi=G!utamic Acid. F=Phpnvhilnnrru» 

Mm* VIIIM»»Ulv VI j i'^x llCUTUIIItllUJvt 

G=G\vcine* H=Histidine. I=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Th reon i ne, V—Valine* 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFKKPTFGFGG 


*»UjO 




A 




*t\Jj 


0 1 


NPGHPPFLKKKKNAPGGWGNPWFSPPWK 
GAL* PXKKNPPKKKKKSPPPKGGKKKNP 
FPPKKKKXKKKA*GDNFDMGTGN 






a 
j\ 




• 


71Q 


R P <? F PFFPP p Tf KG PP*?T FFPPT PflFPF 

PPPFFFTPPPFFFFFPPKKKKYFFPPPG 
KJCFFFF*PPPPSLFFFFFFFFFVPFFFF 
FFFFFFFFFFFFF*G3jTSPHKQIGIiVLA 1 

PT.T.a T J? If TTWa «> T PVP VC <J P Q KVlTIfir KTf 

KKKKKKKKKKK 


4040 


17941 . 


A 


4064 


95 


241 


SPCRSPHRWVNS*ANNLI*LAYTIAFIV ! 

Tf T PT .VflT.WT.*T.PT*AW\rRAP T AO.QTVT.A A 
xvi jr ju xvyunju w ur aaii v iuut jljutsjl vuaa 

VLLKLGGYGI IRLTLILNPLTKHIAYPF 
LVLSL 


4041 


17942 


A . 


4065 


28 


288 


YFDI FVEARS P YVFQVGLE LbGSSNP PA 
SASQSVGITGGSHCAQP*VTILFTTCTL 

TTTrvT/^i* T *fTTf T?T.TITMTT3TT VTCTyWTP^ 
JHvriJtj^ JL ^V-.i.JXT JUXJtlMXIr JILiVXdJJwX JL r\jr 

FFL 


4042 


17943 


A 


4066 


338 


134 


PGGGCSBPRSGHYSHCTPAWQ * SETPS K 
KNKJCKXTTYFSVKNWGIRVFIEIILNSF 

* 

i 1 VKJr 


4043 


17944 


A 


4067 


26 


365 


WVSLMTSALAMRVDFYFITLLILGLLTS 
TLTIYQ*WRDVTRESTYQGHHTPPVQKG 

APTPQIiGGHWPPTGITPLNPLEVPLLNT 
S 


A A/14 


1 "70/1 < ^ 




Aft<Q 




1*7 A 


DLLKFL* SKGYDFESETETETI AKLVKY 
MYDNRE S QDTS FTTL VER VIQQ L VLSH I 
FKDNYANINAEKLSGN 






A 




7 1 A 




CGPSTLGG*GG* ITRSGVRDQPGQHGET 
PSLGPGAV 


4046 


17947 


A 


4070 


202 


I 


DSAIALQPRQ *EQNSVSKICKI YRTIDLR 
PPGFKRFSHAS 


4047 


17948 


A 


4071 


412 


143 


FLGAGV* PRLGPKG*TPFLLKNQKLIGH 
GGOALYSQYFGGWGRRNSFNPGGKGFTN 
ONFPPSLOTWGKKGVS PSKlTJCICJrRRKTR 

KWGGE 


4048 


17949 


X 


4072 


2 


324 


RRGNFCMF*SDRVSSCCPSWS*SPGLKR 
SS CI*SLPKCATO iTIFEPL YLARLVU4RCY 
S THNT Y I MYCOMORLFOGHR YLSVSTSS 

mm* 4b^4*l ♦ mm mm, a « A it •^g*^*^^ ^^i»lw| v MW w A mm* mm 9 

ATRPYIiDNFFFFCKNKVMPCCQG 


404.Q 


170^(1 
1 /jjU 


A 

* 


ACYTX 






RRGRMLPRT PGRPGNDAOOPOKHSIiVD * 
LFI YLFYFLWTGSHS VAQAGGQWRNHGS 
LQP* PPSLKGSSRQSAGRVKGVSHCAWL 
LLDFLSFGKPYTYKKVDKVVSFEA 


4050 


17951 


A 


4074 


422 


116 


EIKQEKNPGPFFFFPPPAKRGPFPTPLI 
WGPPGFPPSPFIjKRRQGGFFLGPL*KGN 
PPVFPPPKVFFFWAGPPFFFSFKKKFFF 
LVPQSTFIHFIFHICYLL 


4051 


17952 


A " 


4075 


3 


294 


VFCHVGQAGLELLDSSNLPSVASQSAGI 
TDTSHLT* PDLSFCCKTKFD 


4052 


17953 


A 


4076 


60 


389 

• 


PSAQLGSTKHIHMAVQP*PPPSPELLHL 
PKLMLCPHSAHTPRPPPNPGTVCTYSPR 
SSPAFPQEGPSSGEGLGGLVSLFSSPPN 
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Amino acid sequence (A= Ala nine 
C=Cysteine, D=Aspartic Acid, 
J?*— oiutamic Ac id, r— .Phenylalanine, 
0=0 ly cine, ji— rtistiaine, i— jso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=SerineV 

T" TkraAnlnn V/ V <•» 1 1 mm n. 

i— inreontne, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














HVH PAPTS PESGRGRGGRVPREGAYV 


4053 


17954 


A 


4077 


93 


385 


KPLFAPGVLFGGPGEPKREKKGGAQGGP 
KKFSPPPPPFSFPPPKTLWVAPLFQETQ 
GLRLCPPFFF*RGKGALF*KRGGGRFKP 
PPPPPGIJX3PPTP 


4054 


17955 


A 


4078 


2 


365 


RKVEGRVS * DEDLKLTELLRYYMLNI EA 
AKDLL YRRT KAL I D YDNS NKALDKARF K 
S KDLKSDCAHPRDC *RA* AP YFLFAKNE 
LWPTGEQRHFLQDVPLIVQRTLTLRTP 
DKTSLPLSL 


4055 


17956 


A 


4079 

i 


6 


298 


PLAIMGGFFTLAETNRTPFDLABGESEL 
VSGFNIEYAGGPFALFFIAEYTNIII IN 
TLTTT I FLGTTYDAP PRP * LS P SL FYYE 
PPSPYPTPWSTST 


4056 


17957 


A 


4080 


414 


154 


PI PATREGKAGNSIjNPEGEGCNKRRSRH 
CTPAWQEGKTPSQKNKKVLLIALKV* SI 
FHLMES *KYLKPGFSLDLITVTPTYFDL 
FRC 


4057 


17958 _ 


A 


4081 


347 


1 


VLKPRPGNIIFSPKKKKNLPPPPGRNYF 
FFFPPPPFLFFFFF^FPFFFFFFFFFFF 
FFFFFFFF*RGGLLIj)FRLKIQIjPPTPHP 
YLSNTLRGWNPGEGSCWRSEGGRGEDCL 
QDH 


4058 


17959 

• 


A 


4082 

• 


1 

- 


354 


STFIISLFPTTIFMCLDQEVIISN*H*A 
TTQTTQLSLSFKLDYFS I IKKKKKKKKK 

kkkkkkkkrggoglkkkkfngggggkkf 
pfwggkkknwgggvkkpgggkkrggkkk 
kgggek 


4059 


17960 


A 


4083 


384 

F 


85 


fffpkeffgcprfffsppgfltppplqv 
lefpikkklsppppplknfffytpppff 
fffpffffffpf ff ffpfff ffffffff 

Fr FFJjJjM* AKKFI S WR 


4060 


17961 


A 


4084 


1 


122 


fhrfgrdgldllts* sarlglpkcwdyr 
repprpaflpqp 


4061 

* 


17962 


A 


4085 


223 


401 


NGKLVS VFFGFCFCFFEM * SRC VAQARV 
QWRIFSSLHFLPPGFERISCLSLLSRWD 
YRC 


4062 


17963 


A 


4086 


160 


408 


ALPPRRGGGGAPS FNFCLL PPG P * GKKG 
GYGDAPSCRSGLGQHGETAAPLKAQKSF 
GHGGRGYQVRRRVGPPGPGGRRQKLNBG 
APPPPRLGGKAAP 


4063 


17964 


A 


4087 


261 


81 


YDIDGRI FGQKIiESSHYSPGAVAHACNP 
SPLGG*GGQITRSGVQAIHPSWPPEVWG 
LQA 


4064 


17965 


A 


4088 


428 


140 


QPLKNPPLGGRAPPFIiRGKKLPPPPpQG 
E PP P F* KKKKKKRGGGGPP FF P PPPRGE 
PKKSL*PQKJ3GGKKKKKSPPPPPQKKKK 
NPPPKKKKKKKK 


4U€>5 




•TV 


ylAQO 

4Uo7 


i 
I 




PPT .P Ci F.NC!T .NPOOGfiP QRIiP ^JHVi^TP AW 

VTEQETLSQKKNFFN*KTFRKAKSTGNF 
HMPNTS IQQLPI 


4066 


17967 


A 


4090 


154 


410 


ITGCS 1 1 KYFYRAMREREKS E VG YFPSS 
S INTPLRHRLSDGVLDSRITSLSTHSPA 
PFIKSKNRPGTVAHACNPSTLGG*GGRL 
TK 


4067 


17968 


A 


4091 


428 


67 


LPHPPPPPPPLSSSSSSSSSSSSSSFPP 
PPPLLPGQGRTLTRPLPAPAGQLGSRES 
CGAPPSSLGASLHVA*NADGRQ*RAALS 
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peptide 
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Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
n^i_»iuiamic Acia» r— Phenylalanine, 
vr— Glycine, n—nuuame, l— isoieucine, 
K=Lysine, L=Leucinc, M=Methionine, 
N-Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
w— i rypiopnan, i ^ i yrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, Y=*pos3iDie 
nucleotide insertion 














TPHGPLTGS PAAWRGRREQDLLLLLCS 


4068 


17969 


A 


4092 

• 


431 


3 


APTSNKGSTI P I LQ IRKLRATPCSSIGP 

PnOTT CT 4 OTJT T YV"OUT ITPOPffLl fc» 

ofo Jl JUlr VJUjaJj* o Vid^FGQWIjNr SFGHF 
FPRPKCSNRPRYTLPVPKLPSSGLPPRP 
SPSQQVTAFPPLTSRKEPSSPSTPASDA 
FPFPPAPVS ARRQEVLCES IPSVTMHB 


4069 


■ 17970 


A 


4093 


151 


1 


SIMPMWMDAVAHACNPCTLGG*GGWIIK 
SGVRDP PDQHGETSS P PKTQKL 


4070 


17971 

■ * 


A 


4094 


35 

* 


363 


LQSKIYCTFFFFLERGFFFSPRVGRPGA 
HPGLGAKTTGVKGNPPP* PPKGGDPGGV 
PPPPGCFFWFFLGKGGFFLLPGGG*NFG 
KKKKPLVWPPKGGELKGGTPGPPPY 


4071 


17972 


A 


4095 


376 


1 


LGKS PPFLGAP FFIiKKGSLIX3PLLGtKGF 
TPP*LGGPHPLEKGPLLGQWNPNQKVWK 
KRGF PQKGGENPP FFFFFFFFRDGASLC 
CPGWPQTPGLKQSSNLSr.PSSWDYRCLS 
KSLFKENTETLAS 


4072 


17973 


A 


4096 

• 


65 


446 


PFFLFFLNKGPLFTPKRQGKPRNFPT*T 
PCPRGKKNPPPPPP *GGEKRGGPPPPGK 
FFFFFGERGFPGGGVLFLDLGTPPLGPL 
KGGEKRGRPPGPGPKKPFL*KRKKNSPP 
TGGERGRGGKKHCLP 


4073 


17974 

• 


A 

■ 

• 


4097 




387 


IKEADEYVLTRIACVFVGDMEPLSSPHR 
LHTRIGSSPSTTCCTPACPCS3W*HGAS 
VI PSQTAYEDWFITL YNVL YTS LPVLLM 
GLLDQVGASHKQGHFWTDENALEKSQAN 
VSMHSSGVYFFL I KP YYNSWRHQGSWL 
CFTiFCNKERGF 


4074 


17975 


A 


4098 


80 


422 


II*l^YTCIILCFVLQS*ILWFVLQNYI 
LHVLYTCI I LCLHVLYTCI IYMCYIHV* 
FCILALFAGLS*DSPYHESL**SFFMEA 
DISLCRCVITDQLNFICSLLQH*ELRMF 


4075 


17976 


A 


4099 


74 


402 


I YLLSTHLHYQFS WIS ITFDISFFFFFF 
LKKESLFFPQRGGGGGNDGLLKPPPPGE 
SPFPPPIFRGGGKKGPPPPPWVIFFFF* 
KKGGSPIYPGGG*IFGPGDPPPPPP 


4076 


17977 


A 


4100 


3 


335 


DAWASAWVTEYDSVS KXKKKKRGGGGKK 
KQFPKKIGPRGSLFKPRGGEKPLSIiKTP 
PPPFF* F*KPPFKKGGAEPWEPFLKPPF 
CIjIUjRGGAPKI I r FPNPPGAPKCiAAr 
RGGGK* *GPPGGLLL 


4077 


17978 


A 


4101 


406 


1 

■ 


RFPKPFPPKPKMSKWALFK*KGFFPPRE 
LLIGPPPPPPKKKKFFPTLFFAKVLPPR 
Fr *NrNQ VSKKRPTFPLKVGQ* PPPMUV 
QKKPTRGFPALFKKKKRSSLVIREIKRK 
PTMRYHLTPIRMAVFHKSKNNRC 


4078 


17979 


A 


4102 


2 


183 


VNPGGGACSEPR* CLCTPAWVTERDSVS 
SFAF 


4079 


17980 


A 


4103 


119 


391 


dnlqcfffffwkrgllflprgggfkstg 
pfsfgvpgtppppppgeggltpkppppg 
* ilfflekgdfpllarw*nrqkkknpp 
psppkgg 


4080. 


17981 


A 


4104 

• 


413 


64 


gfkppppkdppppppqkggfmgwrpppr 
pfaplfspppprggggifp*gggggakk 
fhiwgpfg*kkpppggg*nsggqknppg 
gggknppqkkkkkkqpttaniwflclei 
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Amino acid sequence (A=Alanine 
C=Cystetne, D=Aspartic Acid, 
Hr=Criutamic AC id, m>— rnenyiaianine, 

f2— nivrina f¥=tt icfi#tin«* l=lc/il«»iif»2n» 
\jr— Vyljl-lllB, m— JllsUUlne, J - uOlcUCinc, 

K= Lysine, LHLeucine, M=Methionine, 
N-Asparagine, P^ProIine, 
v/^ijiUTamine, iv— -Arginine, s — oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /^possible ! 
nucleotide deletion, yppossiDie 
nucleotide insertion 
















4081 


17982 


A 


4105 


2 


196 


rgrvgkvlislfntdllwllcilIcfhk 

TQ*LCNG*C3ISRMKCVCVCVCARVCVCV 
CVCERERD | 


4082 


4 M Aft A 

17983 

• * 


A 


41 06 


1 


335 


FJj VET* FHFVGIiAGIjEIjIjTJjGDPPR* ST 
LGDPKCWDYRCEPLDPASVSFFLCLKLF 
lr ViiiilUji? r r FKFPIj i SKJj 1 KGKDNflXN 
YFPPPNPGPLPHWRADKS I IGGNRRLL 


4083 


17984 


A . 


4107 


1 


148 

■ 


GERGCS ELRSCHCTATWAAE + DS I YKKK 
KKKBPPKFFLAGFSGGGKKTP 


4084 


17985 


A 


4108 


331 


41 


GLNFLAQKFSGRFS PPPPGRVPLGPKPF 
FF&GKPPILEGFPGP*KVFFSPILPPKB 
VLTFFSPPGEKIGPPFSGGACSGFPPKG 
AKKKKKKRPWSN 


4085 


17986 


A 


4109 


137 


3 


TKKERNINRGPGAVAHTCNPS TLGG * SG 
RITRSGDRDHPGQHGE 


4086 

« 


17987 


A 


4110 


3 


269 


GFHRVSQDGLYLLTS *5ACI*RI»PKCWDY 
RHEPPHPGFLGFFCFFFFFRDSVQPGQQ 
SENPSHKKKKDBGPFWKGAGKIFLFKPN 
3 SAL 


4087 


17988 


A 


4111 


196 


330 


■ GAIARGRSNCDCI *QSVGHVIj*NDPTLN 
CILDQDTLYGS IMINEYYERASDAWALL 
PDEEQLPEGDQTVT A* * A* DMSGGQQHR 
VSLARAVYSGADVYIiLDDP 


4088 


17989 


A 


4112 


426 


241 


LLKRVRHEILFTIRGKGFSEPRSPPCTP 
AWATERISVSHKEKKRKVLVRP*VDLSN 

/»»» n^rr 

SSLSVL 


4089 


17990 


A 


4113 


394 


1 


GPGVFSPSGCLYHARAQYFWPHKKKKYF 
PHPGQKNSVFLKGRPLPGFGFLLIFFFF 
LVEMGSCC I AQAGLELPTSNNPLTSASQ 
RAE I KDVSKRSQ * FFVFL FBTGS CSVTQ 
A* VQWCNHSSPRT PGLKQF 


4090 


17991 


A 


4114 


328 


2 


KKRAHPIWPDTGIEVPNSPSPNIRFLGA 
PESFFFKRNLPQGDFLEKPNPCYFFFQG 
VTTP PFPLFPPPPKI FFFFF * DGVSLCR 
PGWSAVARSQLTATTHLPGSSDSP 


4091 


17992 


A 


4115 


220 


3 


FKKGEPGDPPPVWFGIiGWVWVWVGFFFF 
ERHS FTLVTQVGVQWCNFSSLQP PPRGL 
KSFSHLSLPSS*NYRH 


4092 


17993 


A 


4116 


342 


2 


PINFLYLSFNPNQV* RSPNFSKILHFCF 

kilsffytlaflpfssqlirqsffrvyl 
inticflqdlyylyilnlyliiiflffhl 
asrlldfclffvtetvllchpgwstvaq 
s 


4093 


17994 


A 


4117 


124 


352 


TSGFT * TRMKS KI PT* RS F 1 1 DLLEAKM 

kekflkaaretqlityrgtsiqmpvdfs 
wrkteakaqwnlmfevlkekncetrslh 

PATMSFRNEGKTKMFSDERKLRDSVTIjP 
LKD 


4094 


17995 


A 


4118 


11 


387 


KTG KLLATSAPGDG I MVMVETFCLWS F 
FLSVSKKKKKNPLPQIKNPLPKPCWVFK 
RAPTPNH*KAFPGPPKTPP*KQPKKQIP 
CRPPKKGPFCI*PNPGINPSPIiFFEKKN 
KGKAPPFKNSKQK 


4095 


17996 


A 


4119 


2 


407 


NTFQDQSGSSSNREPLLRCRDARRDLEL 
AIGGVLRAEQQI KDNLREVKAQ IHSCI S 
RHLECLRS REVWLYEQVDLI YQLKEETL 
QQQAQQLYSLLGQFN*LTHQLEGTQNKD | 
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Amino acid sequence (A=Alanme 
OCysteine, D=Aspartic Aeid, 

XT'— — /^l 1 ■ i m if* Ant/1 TP— T>K AnvUlnM Sw%j% 

ci^iviiiianjic AciUy r— ^ nenytaianine, 

Cl—flhtrrini* ll=HictiHin» I-IeAlanmn« 
^iycin&) xi^niaiiuiiiBf 1 — 1SOI6UCIQC* 

K^Lysine, D=Leucine, M=Methionine, 

]V7— A;no i*q oinA PnPwilina 

ii = /Tt5paragincj r^rruimC) 
Q=Glatamine, R=Arginine, S=Serine» 

W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 
















4096 


17997 


A 


4120 


157 


2 


LTGSHS VT* AGVQWCKHGSLXjPRPLGS K j 
RSSSLNFLYSWDHRCGPPHSANF 


4097 


17998 


A 


4121 

i 


<> 
3 


168 


XiUjKbOKJWlJIKJiij<aHVVl,l^WS w 

NQSTCLGLPRCWDYRRBPPWIAIiNMP 


4098 


1 7999 

< 


A 


4122 


2 


376 


ETGFHCVSQDGLCIjLTS * STRLGIiPKCW 
DYKMRE P PRP ADG F IN I RDFNPLLRTS T 
CCCHVKKDI FAS P FTMIVS FLRPPQPCR 
TDP IQNTLPLVI MSPVALLGCDSFSAFP 

TTT HfKTT DUO C*T\f1TIV 

VJjMK JjP r boixn It 


4099 


1 8000 


A 


4123 


11 


.221 


GTQLLLRIiRGEDLLNLGRGGCTBPRLHH 
CAPFWETEPDPV* EKKQAAKAKNSLLTA 
ALLKTQVKKDSVAW 


4100 


18001 


A 


4124 


2 


125 


AI I KKTKSNRWW * GCR* RGMLVHCF WKC 
KLVQPL * KAAWRL 


4101 


1 8002 


A 


4125 


425 


152 


TPVI LALWEAKAGRSLE PRS * RSAWATW 
QNPISTKKYKK+GGRITSAWGG*GCNGIi . 
*WCRSTPASATETPSQRKKKK*EKNLFT 
LILKVTK 


4102 


18003 


A 


4126 

> 


3 


99 


CQAGLELLTL * SDSLRLPKCWDYRCEPP 
RPA 


4103 


1 8004 


A 


4127 


396 

• 


58 

- 


ENCNPLSWSGGIISGLIiLVSGYSNKPPS 1 
FLVCLLPVIGPPENTAGPSLVPGTKWHD 
HSTL * TSC2TPGP SLVPGTKWHDHSAL*T 
SQTPGPKLSSRLSLPSSWDYRHKTPYPA 

H 


4104 

■ ■ 


18005 


A 


4128 


51 


549 


LGQSYLLLLRKCFSPNFQVGDLDISYIN j 
IEGITATTSPESRGCTLWPQSSKHTLPT . 
ET S P S Vi PJjS envegtap p * ahqs fms p 
PSWGGSPLNLFFGGGGFBKEQSPLKKKS 
r 1 Jj I Jr iAjf tro cuJIi wlr * ro ir JjUr trlrVjr w 
SP PKTG I PSGDELGFF* TT PG PDKKK 


4105 


18006 


A 


4129 


97 


362 


RK* SACLSLPKRWDYRHEPPRPACSC 


4106 


18007 


A 


4130 


376 


279 


DQDDLDLLTS * S THLGI S KCWD YRHE P P 
HRAS 


4107 


18008 


A 


4131 


133 


344 


1 1 FFF F * NFVLQAGGQGGNIjGS LKPPP P 
GLKQFLGITFGRSWDHGPTS FTRANFCI 


4108 


18009 


A 


4132. 


164 


3 


MNEMS YES SLLDHLLKRQ * NQCCLGSS E 
PSPPRLKQFSHLSLPSSWNYRKTSK 


4109 


18010 


A 


4133 


355 


2 


uKKQNRKAGNFKTnSASPPPPPrCEKSSS 
RATBOSWMENDFDEMREEGFRRSNYPBL 
REDIQTKGKEVANFEKNLEECITRIPNT 

pvr^T itct JwiiTT.vpiraoiTT opphdct do*p 
am .i lis isi irictjjjvi i\jm&ijKisi^L.iCjijKo w i^ 

DQLEE 


4110 


18011 


A 


4134 


193 


357 


HHCNPSLCQNDLFWHLVLS PPTGVQWYN 

\TT1QT S\Ti-tr DTV2T P*CCPT,GT.1>nQ*l7VDT7 


4111 


18012 


A 


41 35 


357 


104 


LRRL* YENHLNPEGGGCSE PNLLRCTPA 
WATEODS ISKI I I I I *CNFLH*ONMOKI 
KQNPKTNSSKILTEIKISLKGNLRELRE 


4112 


18013 


A 


4136 


179 


381 


DTS LH 1 1 QNRLS I S KKKL * TL YFI Y*DR 
VPLCCPGWSAWQS*FTAALTSLKRSSC 
LSLLSNWDCRC 


4113 


18014 


A 


4137 


89 


318 

# 


CLE I SCTKIGQDL YTENYKTLLGKVCKC 
PNK*NTTFMDQAEVS ILKI S FLPRLIYR 
LRIIPVRIPAGIFVEILKFG 


4114 


18015 


A 


4138 


315 


60 


KKLPAEIHAPLAEEATD*ABLKPAGETH 
AEVQAPTEQTPAAEAATTIAEASVKVQP 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
H^Giutamic Acid, Phenylalanine, 
u-i»iycine, tt— Jtiistidine, I=Iso!euctne» 
K~Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
Q=Glutamme» R=Arginine, S=Serine, 
l— JDreonme, v— valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *«Stop codon,/=possible 
nucleotide oeieuon, v=possiDle 
nucleotide insertion 














PPAEEAS LAKFPAE I QP PADQERGSED I 


4115 

» 


18016 


A 


4139 


377 


51 


GFTSPFPED*RIPGLFSCIGWTLQSFHF 
RNIGSVPFSPPGPFFFPQEMRVLLCCPG 

j wsqtlglkpssglslsnswvykcaplvp 

GMP I F * LS CFLAIGFLEFVI Y IGY 


4116 


18017 


A 


4140 


413 


175 


NFGIFOUDGVLLCCPGWS*TPRLKPSSC 
LPKCWDYRREPRTVPGLPFLFLNLRKDG 
NSVLE I YLEYKI TQ I I VQRLLVL 


4117 

■ 


18018 


A 


4141 

i 


1 


394 


EIFVTKRWSGLGVVAHASNPSTLGG+GG 
QITRSGV* EQPDQHGSPERRGLKERRGL 
MVMKATFLGLRHHLEHIVGHRELRDLHN 
NSKKKISSHPLSTPHVPGTVLKPCRLIIi 
CFKHHQKFVRKPPNSPSYR . 


41 IS 


18019 


A 


4142 


22 


376 


L VNKGKTI FFFLKRGWLP PKGKGRGI N 
TVNGSLNFRGQGNPLA* P PKKPGTKGGG 
HNPGEI * *FFGKKEA*QCGPGGSGIPGP 
RRPSGLTLQKGGTYGREPSPIiPGPTKBN 
PRLTKT 


4119 


18020 


A 


4143 


161 


1 


PIPAKYEHFCFFVSLF*DRVSLCHPG*S 
AW* SQLTTASTFQGSSDPLTS ATP 


4120 


18021 


A 


4144 


18 


159 


KHI..DPGGGGCRE PR$£HCT PAWVTTAKL 
HLKK* K* VELKS PS VIHT 


4121 


18022 


A 


4145 


383 


109 

■ 


LFFYKIQNYTGHGGPCLLTPFLQRVKQK 
NCFNPGGGGCR* PKLVFCPPTWGKKQGF 
VFQKRLKKPKPKPKPKPNWRGLFIiPPFS 
KRPKPKI 


4122 


18023 


A 


4146 


374 


164 


QCQLLRSLRWEDCLSWGG*GCSEP*SCP 
CIPAWVTARSCLQINKSLPAEIVIPSLY 
NQIARG 


4123 


18024 


A 


4147 


25 

i 


375 


RKKKALFFCQGGSQPPPSNIiMDPPPPGE 
KKISWLNIiFPKKEKKKF*KKKKFF*KVK 
KNPFLKPKGRPPPGEKKKGKKGKKPTPQ 
KGKGFLFAPPPKKPIKKSPHFKIPKKQK ■ 
NFWQR 


4124 


18025 


A 


4148 


376 


202 


hynskvfpg^pkrsflfflkalsfffap 

APIPFWIiHSKIPFFFF*DRVSIiCHPGWK 1 


4125 


18026 


A 


4149 


.3 


144 


IiFCPDWTPTPGLKQSSHLGLPQCWDYRH 
* VTVPGijP ISrVFFFFPP 


4126 


18027 


A 


4150 


3 


188 


QLQSQLLSSLRPEDHMNLGVEGCGKLSL 
HHCTLA* MTE*DS IS* KKILKKKCF*KK 
HSFLY 


4127 


18028 


A 


4151 


83 


358 


GWARWLTPGI PALWEAKAI KS LEDLNA* 
A* QTYAPS * EESLHP IMNTLIRDPVWT 
SFRVTVDTSTIARHWLFCWDLIiQASIGH . 


4128 


18029 


A 


4152 


162 


1 


TVWYWHKNKLSNQWNKI*FLELDPITYK 
HJjVYUKTGI 5 NH W VKDAL VNTQCRAG 


4.190 


i Kfrcn 

\q\jjM 










PPAWGTK*DS VSKKKRKKKS CEFVHRIL 
KLEEAYHSLGSNTAQRLFVSCFHKKILC 
LSISIPDPYLQFKMWPVPPSKPLYQIDW 
VGLL 


4130 


18031 


A 


4154 . 


117 


2 


QIFFFFFFFKMESHSIAQAGVQWRHLGS 
L*PLPPRFKR 


4131 


18032 


A 


4155 


369 


217 


FFSPPPFFKPPPPPKKI FPP* KKKNPPP 
PKKKKI FFFFFFFFFFFFFFQTP 


4132 1 


18033 


A 


4156 


214 


283 


ASPSEGQGLSSGP*GLIN*SDKPSVTHS 
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Amino acid sequence (A»Alantae 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny)alanine, 
G=Glycine> H=Htstidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methk>nine, 
N=Asparagine, P=Prolme, 
Q=G lutamine, R=Argiriiue, S^Serine, 
•^Threonine, V=Valine, 
W=Tryptophan, Y^OTyroslne, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \= possible 
nucleotide insertion 




• 






• 




PKPHCETA*GGTSPLSSSGS3PQAPKSP- 

• W*± A9m m^ • *fm wMt Arm ^a* mm mtwm mm ^mi ■» ^_ ^^m. ^ mm — - 

HHGPDSPLSDSIPMVISPPEPPPLRAK6 
CPQGPPGASWWGTT 


4133 


18034 


A 


4157 


319 


712 


QDFVSKQHSYCLKNLKSSSLIQVAMNRA 
QVCL I S S S KSGERHL YL I KVSRDKI SDS 
NDQESANCDAKGNNFICIMLYKIWLN*K 
RLMK* HWTAI * LKI YFSNKMKFSNLNLS 
SVINSFDVKKERIiNANTIiN 


4134 


18035 


A 


4158 


183 


369 


LKRKQSQRGEVNFLKVTQPECGKAGIIiF 
IYYFFBMESHSVGQTGVQWCNLG*IjQPL 
PPESSD 


4135 


18036 


A 


4159 


190 


336 


DPISIKNFWPGTVYFLIBTGFLHVG*AG 
LELPTSGDLSTLASQSAGITO 


4136 


18037 


A . 


4160 


10 


390 


QGILLPCFSMSSEERGRIYSHTFSFFFF 
FWKGSPINPQGGSQAKKPR*REPSPSGL 
TPQRPGNGGPPPPPGQNFFFKKKGG*PG 
GPGRA*TPGPRGTPPLGPPKAGNKRGGP 
RGRAKNFLKNQNGPT 


4137 


18038 

< 


A 


4161 


53 


376 


YLFAFFFFS KREAWQGGQPGMEGAKKKF 
MAP* PSKGGGGEFSGLTPPNQWNPRAQP 
PPRGELIIiXLEKKRPAPNAQPGPENLGP 
REPSPLAPQKGGNKSENPWPPPP 


4138 


18039 


A 


4162 


1 


356 


GVFILVSFNFWAFFLSFFFLGKQILVLM 
PQPESQGQNLAPQNPPPPG*RGFPALTP 
PRTGNKGMAPQAPQFFGFLKKGGFSHGG 
QGGFKTPTTjGNLPPLAPQGLGNNGLGPW 
PPPFFF 


4139 


18040 


A 


4163 


345 


26 


APGFKPALGPPGGPFFSKKKSKNLPGLR 
GSTIiQAVGPTFIiGG*GKRSLKPPGG*AS 
RGPG WG P S P P PGG PN * I PVS EKKEREGK 
KKKGREKKKETFSSNLLIFLSF 


4140 


18041 


A 

•• 


4164 


18 


284 


TIiQGTSGIFEGNFFFFIjGTGISIiYCPGW 
RAWGGDHGS LQL* PPGSS9 PASTPLGRS 
WDYRHVPPGWASFFFFKTLPKGVTQAGL 
ELWG 


4141 


18042 


A 


4165 


377 


30 


FSGKK3CFFQIX3WGCGPPPLIPPLWGV* P 
GQPPRVGGFNPPLPPWKNPVFFKKPKQP 
GVGGASPYSPFLGKFRPRI PFTLGPEGS 
G*PNFPPFPPPWAPKKNFSPKKKKKFYN 
CLW 


4142 


18043 


A 


4166 


305 


1 


ESRQVLTSRLTLLSIHEFNYLLNNSTYK 
H ITS LP IN I RQ YGQDLKTLKKETKDLNK 
♦GNIPCLGIGRINIVKMSLLSKLIYKFK 
AMPVKI PGE LFLRNQQA 


4143 


.18044 


A 

* 


4167 


394 


78 


EGKKQGRKKGRREGS * EGRREREXTR*K 
AEKEGMKEGGREG KKEGRKEGRRKGRKE 
GRKEEGQKGGRTDGGWWVIiRSIiDTSQLI 
S FASGP KWREAS S LVWASLCL 


AtAA 

4144 


18045 


J\ 


A 1 f O 

4168 


1 • 


At 1 

431 


RVCCV*VERVCVLVCMSVCWCVCCCARQ 
CVSVCHSRIESSRPSSGPVAPSKCQTHL 
YLPKLSAPPAAGLWDLPSEATASLSGSL 
CQPSCPRTRSFATLVSSMYHRPVTGELQ 
SIT 


4145 


18046 


A 


4169 


3 


423 


YNAREIEAAAGRDHATALQPGYRVRPCL 
KKKVLKLGQAWWLA*F*NFFFETGSHSV ! 
PRLECSGVISACCSLNLPGIV 


4146 


18047 


A 


4170 


118 


402 


QICTCTPKPKMKVKKITMLSNNMRYLYI 
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nucleotide deletion, ^possible 
nucleotide insertion 














HLTQDI *DLDIDL*NTADMNIjKDLNK*R 
E IHCLRI * RLNI I KTS I FLKLPYRVHAI 
PFKISARSFVB 


4147 


18048 


A 


4171 


404 


285 


RGCX3EPW*HHCTPAWVTE*DLVSKNKMK 
QNDNLSHLSLA 


4148 


18049 


A 


4172 


274 . 


431 

• 


ESLVGQRNWVAKYF YRI TAPGRAQRLMP 
AIPAPWEAEVGGSQGLBIETLIj*NNSSR 
PGAEAHACNPSTLGGRGGRITRSRD*DP 
PC3QH 


4149 


18050 


A 


4173 

! 


24 


366 


FLCLFHTCVLAFPVQQLCSVGWGCKVEN 
ADREEKQPCDSQAREEPRLCRRGFNRAP 
NINTRHEKLFFVCLF*TGSHSVTQAGVQ 
GCGHSSPQPQLPGPKRSSHLSLRSSWDT 
GM 


4150 


18051 


A 


4174 


397 

- 


1 

• 


TEPVLANETWGPWDRGPLGLGGLGPBNT 
DNSHAVGADQLLKEPPFPLSTRSIMEPM 
SLNAWLDS PHREMQAGTPLSLCGDTYET 
QVM* SWG IRGILPQQPWKMGASLSLTAS 
LSLSFAHTLKHKHTHTHTLY 


4151 

* 


18052 


A 


4175 


90 


387 


KGRWLFFPQGGERGHNG* TAPPQTGGKE 
NPPPPPPKDRGKKAI^PPPGGIWYLKKN 

^*an V% <W aa aaj an *\ an aa m ^ aa ate aa aa am aa «K MM 

GFFPIGQGGPEPPPPRGPPPPAPPKGGE 
KRGGPPHRPGKKLLK 


4152 


18053 


A 


4176 . 


398 


280 


RRLKWEDPLNPRV*GCSKP*SHRCTPAW 
VTE * DP VSNFF 


4153 


18054 


A 


4177 


253 


395 


LFNFFFLVKMGSRYVAOAGLKPLA*GNP 
PASAS PRAGITGGSHHTQP . . 


4154 


18055 


A 


4178 


366 


48 


P WAS PG I SLS FS FLRTKPT VKVR* YRVG 
PQRQPCPS RWAPPACPFSMFPPAWVS PP 
GS AASRP PRHAGSQVGATAS PP PAQGLG 
APAWTAGLGEKQKLAARMRGT 


4155 


18056 


A 


4179 


44 


413 

• 


GDGVNSAFFFFFFIiEKKVWFI PPGGGPK 
PEFX3FRAPPPPGVKKIFGFTPIiRTX3E*R 
PLPFPPGKGWFFKKNGGSPMWPGGV*PS 
DPKGAPP PGPPKGG2 * RGEPP WVGWNFA 
LLKERKPLFKE 


4156 


18057 


A 


4180 


396 


235 


HEVSLCCPDWS*TPGLKYSSCLGLPNCW 
DHRCRNHGPGKTQLEQGHRQVAGIjSR . 


4157 

i 
§ 


18058 

• 


A 


4181 


85 


408 


<• JM l II a— aa "aw a— fftA FNV « fl Vf* an a— |Bf aa «-> fja. ■ If 1 *» VS 

YTCS LR I TQDHFF F FFFFLGKKI FFFPP ; 
GWGAGGQILTSPKTPPGVKGIPWLNPPR 

GGGPGGPPPPPTLFLGF*KKRGFPGGGQ 
KPRPKGDPPPLPPKGGGLKGGPTP 


4158 


18059 


A 


4182 


1 


149 


NHLNL*GRGCSBPRliHHCTPAWVTE*DS 
VLKKKSYIHLGQNPTVRKVIiL 


4159 


18060 


A 


4183 


310 


3 


ATAPSQF IRIN* RNHOiYI EVQSEAASA 
NLETAVNNPENIX3KIMDEGG*MKKQIWK 
EMTSRI F I VEKSMPGFKTSKAOALILGA 
NIAVNFRLKPMLIYHLEN 


4160 


18061 


A 


A 1 D A 

4184 


1 


359 


irTKirij VJjUKUKir ir r r r c i/aoulLRJ\rJt\r V 
FLYFGGGWTPPKKKKNSLGWGGGGIjKKG 
GGGVFFKPLPFLGGGPFLKNSGGVDKKK 
RAPPFFTKTPL*KKPPPPFPPKIFKARG 
FW*KMGEPFFFCPPPPSFLKKGPPPKKE 
GV*KKPPPPPFLTPPPPNQGNFFFFWGG 
STPPQNI KKPLGFFSSQTPKKKKRAADR 
DLE1ADAW 


4161 


18062 


A 


4185 


1 


362 


HFTKYLFISC*K* *LLCWMODGCKKGIP 
MNTNVI QE KATSLYGSLKHKEGEGPKAG 
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IN^Asparagine, r^rrohne, 
Q=Glutamine, R=Argtaine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /=possible 
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EFNAS KGWLDNFRKS FGFKNVKITGEAA 
SANQKAADKFPDAIKKI IEEKGYLPBQI 
FNAGESAL 


4162 


18063 


A 


4186 

• 


366 


47 


AaLFLVDPPGRGGVKGPPPPPGPFFVFF 
QRRGVPPLSPGGFFPRPPNSPPPGPPKO 
GGSGCX*PRGPA*THGvrIiKKHPPNCX*AG 
KKKGFFLKKKKKKPVFWFSVK 


4163 


18064 


A 


4187 


394 


112 . 


KNTENIi KI S WAWWHVSIWS APNEAETGG 
STEGTGCSKL* SCHCTAWVTVRETLS PR 
KFFKKRANYLTFRI *KVCSIiYHRKTKNS 
FKKHTQGRKF 


4164 


18065 


A 


4188 


230 


372 


KPMFLPFNPAILLLGIYLKEYKSF*HKD 
TCMRMFI TALFT ITKTWNQ 


4165 


18066 


A 


4189 


149 


2 


NRSLRFMRFKEGSHLPP I * VQGKAASAD 
VEAAAS YPEGLGKT INKGGST 


4166 


18067 


A 


4190 


2 


159 


MHQPWLLRRPKQENCLNPGGGGCSEPRS 
HHCI PA*AIERDSLSKKKKBK3GAF 


4167 


18068 


A 


4191 


180 


415 


LGFLQLLLFVGRSMFIFALELSHGWLGS 
SLSF* PVCVKQSSHLSLSSS WDHKHMSP 
YPANFCIFCRDKTLPCCLGWAQ 


4168 


18069 


A 


4192 


424 


41 


PPLLRAPPKGLL* PKNSFPPGAPP*TpF" 
PPKKI KI FKFTPGGGGAPLFLLPRQVKA 
EGS LP PRVFGP P PFF FPPLPS PLGAKPN 
PFFFFKKKKKKEWLHKSSLFYSAHFVLC 
LKYSIiSFFYQSVKTK 


4169 


18070 


A 


4193 


105 


424 


ELKRLVIKLIRGIPEKGKAQCKEIQKLA 
QEVKGE I FMEIGSLKKKQ * KI QETLDTL 
LKMQNALESLSNRIEQVEBRNSELEDKV 
FKLTQSNKDPSQIKKKILYNV 


4170 


18071 


A 


4194 


3 


240 


L CLQ S QLLGRLRQEN PLNLR I *GCNEP* 
LHHCTPAQVTERDPVSKENKGSFIPMKI 
GKLVSLTLNRMCNCKNSEKENV 


4171 


18072 


A 

1 


4195 


385 


1 


KM I I LTKKME I KHKDEKELQKTE VDLKf 
ENNTTRARLTNNI KKKPPESEKTNTKNH 
TKKSL*DQDTKKETRPQKKKNPFKSR*V 
RHWRNI LNLESRNRGKKNPSGVQKRRDS 
VPTLITQTQETWWWCF 


4172 


18073 


A 


4196 


3 


272 


LFYLIMALKHKSSDVGNSNIYAKEKLES 
ASFKERI * KSYVEVAKIYGKNEYCIYEI 
VKKEK* IMHSI YRVQYYLQFQTSLEVLE 
RIPHG 


4173 


18074 


A 


4197 


236 


379 


GGRFKGSNFTSPGliQGNSFFMGPPKLNS 
RAGV* QRREGKNPGVPQFNR 


4174 


18075 


A 


4198 


375 


1 


NFKIPAPPPQNKGPLRVGTPPFIiIFSPP 
GSPPKSFPKAKWARRDRPF*NPGETLQE 
G*VFFSGRFFFFFFFRDRVLLCHPGWSA 
VAQSRLTTSSDSRVHTILLVGRGCSEPR 
S HHCNPAW V TK VK 


4175 


18U70 






1 


27U 


E PRSRLCTPAWATEQDS ISKKERHGliLV 
FLKGF I HY I RLL INL F AARGRKR VLiFAL 
GGREMW 


4176 


18077 


A 


4200 


388 


1 


PPGIINLFDPGL* PPPPWVMGPPPS+KL 
ISPQKKKNKVPPPVPPINIFSGPPPPTL 
VFWVLFFVSLKRQLIL* *KSGVSAASLV 
RHDGAPRLTSSAGKGHAEPTFQPRPTHH 
RNACSFVLIHIQSYKVF 


417? 


18078 


A 


4201 


29* 


158 


FKI VINFQFLTPI FAPSEGGRGG*GRGE 
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a.— u n kd own , — oiop co a on, /— possi Die 
nucleotide deletion, ^possible 
nucleotide insertion 














LRSCHCTPAWVTK * EPI * KRKI KGKKRT 


4178 


J 8079 


A 


4202 


128 


358 


KLMIPPSPGQPPRPGGRWGGHPPARPAA 
GGWPSMSLRTGHDDNGGVWB 


4179 


18080 


A 


4203 


406 


1 


FF FFFIiRQSFTLVAQAGVQ * RNLGSQQP 


4180 

* 


18081 


A 


4204 


322 


2 


KPRPRKTPGGPFP I *GGGGAP P PRGGAP 
KGGSPPPPPFPPPGGKKGGKKKNNPPGG 
*KPWGGFFFFPPPF*TPPKKRGLFLKKK 


4181 


18082 


A 


4205 


,3 


127 


GFHRVSQNGLYLLTL*STRLHLPKC*DY 
RSBPLCLACFLFL 


4182 


18083 


A 


4206 


1 


216 


GFRRVGRDGLDLLTS *SACLGLPKCWDC 
GR* PPRPAHLGGNSNAKEPGLPACPLLS 
GHMHRAVWVRWATATL 


4183 


18084 


A 


4207 


427 


0 


LFFTTNFFFWARVFFLPPFF*NPPPGFF 
FIiPHKKKKNFPPPPGFFFFFFYAPPPFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
PWLKTWSC 


4184 
• 


18085 


A 


4208 


2 


404 


PRVRLGIWCKERLF^pYFFFFIiERGFCF 
LAQGGGPGGNFG * RAP PPPGE KKI PGP P 
PRGGGEKGPREPGRKIFGFLKKKGGGPF 
GPGGVGP PKKGP P P PKPPKGWG YGGKPP 
AQKRGGFFFGKKRAPFFNWVNG 


4185 


18086 


A 


4209 


40 


412 


PLFCLSEKPRYFE YAI YLMLS LKFLES C 
IMSVNSSAVSASVYY*5f*KIEREPLTSW 
TI£KLDPEVFHQKFAFT*RAKAILRKKN 
KFEGLTLSGFKtHYKTTI ITTVwY*HKD 
TQIDQWNRIESS 


4186 


18087 


A 


4210 


219 


392 


HFFFLNYNFLGKGVSFCPQAEGQGRNLG 
♦LHPPPPGLKQFFC^TriRNWNHRIiVPP 
PP . 


4187 


18088 


A 


4211 


409 


2 


LERKTVFFS PPFI FFFAPVFFLS PFFYT 
PPPLYIFCPPKKKKIFPPPPGKIFFFFK 
GPPPIFFFFFFFFFFFFFFWS*FFIMYQ 

TtfOMTOVMT Till H\OtTT UT OOtMTVTT T VMM 

I VFNI SYNLIKVDSPILYIjFKDKKILiLCT 
ISSDAW 


4188 

i 


18089 


A 


4212 


24 


399 


ADAFSTTNLHGLGPDLFTPTTQIiYINLG 
MAJ LL * AGAVN I G YR S KI KNALAH FLPQ 
tfi PTPIiIPILiVI IETINIjIjIJjATAJjVVH 
LTAI I TAGHLLMHL IGS ASLAG S T I SLP 
STLTVFTrLILLT 


4189 


18090 


A 


4213 


411 


197 


S WLTAP S FKRFS CLS LLSS * D YRLP VPH 
PDNFCIGSRDGARM V 5 IS* HHD P PAS AS 
OSVGFTAPKVIjGIjOT 


4190 


18091 


A 


4214 


218 


126 


KEPEPLLFRSSWSVGAIDLKNVMRRVLI 
-Ko L. 1 P(jVVAHTiJNFSTjj^RVjVjQ 1 TKoG 
DRDHPG * HKSALGS CAS CFIYIiWDIjIjIiH 
IVFE 


4191 


18092 


A 


4215 


1 


332 


MAPSLCLSNKKRGFIGPGFAGAPKHPGR 
GQGHPREKIiAFFKKGPPKPGLKS FLVLK 
TPPPKPFFFLSPKGGALKTEEEP*TKKK 
GGAEKKPPPGEKRPLGPFCPKKPNWG 


4192 


18093 


A 


4216 


1 


185 


KLYLSI LISLQISLI ITFTATELI IFYI 














FFETTL I PTLAI ITR* GNQP AN FLYIjLV 
SFHEK 


4193 


18094 


A 


42l7 


1 


388 


LRFCWETLFPLPYNPFSFHFSFLFFSFS 
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FFFFLGGKFCFCPQPGGGGPNIiNIiRDPP 

GGF* KKGGLNHGGGGGFKPPPQKKPPPL 
ASQRGRNKGGAP PLLKF 


4194 


18095 


A 


.4218 


3 


195 


VIYSTIFAGTLITALSSH*FFT*VQLEI 
NMLAFIPVLTKKINPRSTEAAIKYFLTQ 
AIAbriPA 


4195 


18096 


A 


4219 


261 


1 


EPRRRERKERKEKKKGATPSRICMKRER 
BERKS * ERRERGEEGERGBKEERERGER 
EREIRSEKERERKRERPKN*MRIjKGRTRR 
TRG 


4196 


18097 


A 


4220 


5 


375 


DEMLHLKFTYILN* TLKDTI I PKVNEKL 
YN*DFIjNSKvQGtPPPPRPSRPPSSPSP 
DPPPGPPPAGAGRRARRPGRRGS PPGGP 
PAPPPAPPPRAGRAAGGRGRPPARPGGR 
GPAARPAGGGG 


4197 


18098 


A 


4221 


258 


267 


GGGALKKKIYCGGGGGNFF^*SPPt>Syfe» 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
IYPIFILIF * • 


4198 


18099 


A 


4222 


127 


330 


KKKKKKKKKKKKKKKGGGPFKKKKFLPR 
GGGKNFFF *GGQKKKLGGGVKKKGGGKK 
PGAKKKKKRGEK 


4199 


18100 


A 


4223 


1 


377 


RRFHLRRENLQEKQTWMGIFFFFGVAGP 
PGPGGKS FS PPS FFF * GKNTFFTP PPGG 
GGQNLVGENSNSNPPPPRDPPPSGPSEK 
I *KKTPPPFLVGPNTKNKKHFGGGGSPK 
KKTLLPPPPLDPF 


4200 

■ 


18101 


A 


4224 


223 


431 


IRKTGPLGFGGQNNPLPKV*PPPGLTFF 
QKGPPPPPFPPFQKPRFIiKFYPKREGPP 
PPKVGLTPPPGLL 


4201 


18102 


A . 


4225 


173 


2 


RCKLHCLFPTLAI ITR*GNQPERLNAGT 
YFIjFYTLVGSLPLLIAIIYTHNTLGPTR 
P 


4202 


18103 


A 


4226 




377 


FLTWFI CISLFGRLMGI *FFVw ffcel 
FFHAFFSLFIiKTDIIFSIDIiLEIIIYAH 
VCHICGNLIFSLYFVFYFNGSLKYIKC* 

QFSLKILIGF 


4203 


18104 


A 


4227 


1 


412 


KNSKVKNATDLLKNASESPHSRIDEAEE 
RI S ELEDRLFENTKSEETK* KRI KK* SM 
PTDLENS LKRANLRVI GLKEKVGKEL* V 

LSRENPKTTSRHLAIKLSKVKDKER 


4204 


18105 


A . 


4228 


373 


163 


IGVFLGGAPFFFFFFFFFFPQGPKRGGG 
KGGASGKTG*GGGGIiKLGGGKQGWPPFF 
FLRTWPTFTRAPGL 


4205 

• 


18106 

• 


A 


4229 


392 


3 

- 


PPC*NRAPGFNFGGPFKKIYSSPPPRGK 
EIGSLKGP P PFFFSAPPGGPKFS KGGGPL 
FFGAPDF*GGFGGKNYFWGOGFP* PHSP 
PPPFSWGPKRRAP FSKKKKKKRRRGWGE 
GEGQGSNHLLKHGYFQI 


4206 

* 


18107 


A 


4230 


211 


400 


YAI/I WL YC I LNNYALHS VFFWCCFCFLI 
FKFIYVFLNLIjFLFAF * IiVLCIiLKVQNH 
SGSLSIVSYFPLLEACRIFPLFPLF*NL 
+MICFNMVVLYFKQLCSALSVFLVLFLF 
FNF* IYLCI FKFIVFVCFLRRNPTLAQA 
GVQWHDLGA 


4207 


18108 


A 4231 


69 


386 


KRI FFFLGPRGGGKGKMWVNGNPPPKGK 



809 



WO 01/64835 PCT/US01/04927 

« 



SEQID 
NO: of 

mi o Iaa^I ft A 

nucicoiiuc 
sequence 

• 


SEQ ID 
NO: of 

n Anftf) a 

pepuoe . 
sequence 

■ 


M 
eth 

00 

. 


SEQID 

NO: in 
lie cm 

09/515,1 
26 


Predicted 

beginning 

nucieociue 

location 

correspond 

Ing CO Urol 

amino acid 

icMUUC OX 

peptide 
sequence 


Predict- 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
Er-uiutamicACia, r— phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L^Leucine, M=Methionine, 
ii— Asparagine, r=rroiine, 
Q^GIutamine, R=Arginine, S=Serine, 

W=^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucieoDoe insertion 














GNFSA* PPQRGGPKGPQPKNKVIPGFKK 
KRGFYGGGQPPNIiGKELAGPPKGGEKRG 
KTHPGGKFYAFLGGGLFSKKRN 


4208 


18109 


A 


4232 


394 


287 


FKEKKKPPPPPLFSPPFF*KRGGGDPLF 


4209 


18110 


A 


4233 


442 


112 


LLGRLRWEDGLSPQGRSCSEP*SCHCTP 
AWATQ*DPVSOTNNKNPNPQNACSSLNP 
SLPWTAAVWRTLGEHCNGREALPARAKN 
ATKAPVLTTHjHICQTVRTISCALLPH 


4210 

i 


! 18111 


A 


4234 

i • 


415 


58 


SLQKGHFPPFQELDPQNPGRSGWSPVLP 
TTGL FFSLDF I R FT * FMAGVL VG YKHCW 
EWTAALVLLLFLRDRVSLCFPGLIHTIiG 
LKRSSCLSI*LSSWSYRHTIPHPAST*FF 

Q 


4211 


18112 


A 


4235 


406 


168 


HGETPSFLKIQKLAGRGGTLL*SQLLGR 
WRQGNHLNWGGRSSVYIYTEKHTVHTQQ 
SVTTPFMSMSQTLAYSPIiSLKSS 


4212 


18113 


A 


4236 


247 


397 


TVCFS VYI YTELLPPQVK+ FPCLTLPSS 
WD YRS VPPLPAYFCI FSRDGVS 


4213 


18114 


A 


4237 


36 


405 


RLYCFIKGLNVKNK*TFRIVFFFFLETNF 
PFGPQGGGEGANSGF^PBPLALGVKKGPR 
PPPPGGGE*GAGPPGPGNFGLLKKKGVP 
RGGGGGPKPPTQRFPPGRPPQKKKNFFG 
PALGKKKVFFF 


4214 


18115 


A 


4238 


407 


3 


KKKKKLTRPGGGGPFSPLPKRVRQKKGG 
NPGGGFSKKKKSPPPPPPRGKKKKPFSK 
KKKRFXtSGNNTGKNRLGKDPQS* PAGNA 

« - ■ - 

RKKPGWERAIiARGRAPGMTDRKPQHSGL 
DGSQQSGTGQEPGNSGPVPKVH 


4215 


18116 


A 


4239 


144 

i 


405 


PT VKXiVLLYLVLLKVAVS KNLR* *GGLT 
V*QQRQILWKRILSNTEFLYLRQSKMMH 
LVL * PTVNTLKSLKSGQARWIiTPVIPAL 

WEA . 


4216 


18117 


A 


4240 


369 


40 


PLLQGTP ASWPRWGRAGDRPRKEG * LHP 
RE TR VP P ALHHQ P AP AGVS IjT S PGP P FW 
LSLORLRGHSSSDSLPAVCQYSGSWREE 
KAAAEAPALTPAt CTCTI CSVLTWL 


4217 . 


18U8 


A 


4241 


397 


1 


ISKSLFFPIPFWKKKKKGFSPPIiNFLGG 
VFFFFFLKI FE I FFLKRFFFP I FLGAPG 
FKI FKKYFPFPRGFKTPPKKKI FLNPFF 
KKKGGGGGALFFFFFFETGSRSVTQAGV 
KWHDHGSLQP * PPRI*K*SSDP 


4218 


18119 


A ■ 


4242 


230 


393 


ADRPGS PLPS APETGSHS IAQTGVQ * HN 
YGS tr* Phi FGXjKjL£j bCLSLiFiN C WD xR 


4219 


18120 


A 


4243 


434 


104 


KQPTFQSTTGGSPSAGRPPPPRAKKKKP 
PFF * KKKKKSGNIGGPLFPPPPGGEBKK 
NLFPRKGEGSHKTDSSPPH*RGGKKKNL 
S PKKKKKKKAWCWD YRRKPLCPATYF 


A.1*)fi 
HZZU 


J 51Z1 


a 

A V 


A1AA 

4Z44 


I 




LNKKFNKETGNITKNQTE ILELKNSMSE 
IKNTVKSFNNRLNQAEERISELEDRSFE 
MTQAD KKRKKGEE S LQDVLYTMK*TNIL 
WSFQKEKRKEKGLQNIIiNE I VAENFPSL 


4221 


18122 
.. _ 


A 


4245 


12 


424 

* 


IYYKTLNCT* IF* FGGALACFHFSYPFY 
IHYVGFVPSPVYLFSHVIISVWTSGYLL 
YSLGCWP ILSIiFI FLLKLFHPGYLTI QV 
GIYVWILFVCLFLSNSFINI * FTYHAIP 
IPIYIPLKVYNSMFYSIFTTK 
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Amino acid sequence (A=Alanlne 
C=Cysteine, D=Aspartic Acid, 

1? — fZlii+o m IP AptH TTr-PKon vl<i la n ino 

j!/— uiuuiuiiL. /iiiu, * — rneuyiaiumae, 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, l>Leucine, M^Methionine, 

ri^Aopzir agiuC) r qiujc, 

Q=Clutamine, R=Arginlne, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,A 3 possible 
nucieonae aeieuon, x^possiote 
nucleotide insertion 


4222 


18123 


A 


4246 


215 


425 


LSIQISQIRNLIYLFINIMGKILRAKRK 
♦GTS DAWNQTI FLRPRTVAHSCNPS TL 
GGRGGRITKSGDRE 


4223 


18124 


A 


4247 


413 


2 


WEVESFSLGKAFMWGCATENPGPPHPKF 
LLFIMKAPPPWRVGFQDCFNPKDGRFI* 

PKTPPCPKAK«AJ\J*JN3VoJSJUU^ 

WPPTPLPPPGNGGTRKFPSSPPPPPPPT 
TPKITPPGGERQL*PQRAEGAGDTK 


4224 

• 

• 


18125 

■ 

1 


A 

• 
■ 


4248 

• 


61 


281 

• 


ITWKDFDSFSVLRVSKAFHLALPAPIGN 

CIKCHIKMGIYPEEKFKNIHKiIiYLEID 

VCGH * TWKKI FFLFFFLRQRFTLFAQAG 

GQWIaDIiSljLiQPPPPGrrUlroGLorliS 

DFDSFSVLRVSKAFHLALPAFIGNCIKC 

HIKMGIYPEEKFKNIHKYLYIiEIDVCGH 

IKliGRKr r r Lr r r 


4225 


18126 

• 


A 

■ 


4249 

• 


23 


13 


RVRP FAS PGGRE VTVCGGLGREGQERRL 
GSADGDGQSLAVGAGTLATVGGLELLNS 


4226 


18127 


A 


4250 


182 


342 


KGFFFLPPGGGGGGEF* FNEPPPPRVKG 
IFPPPPPGKGEKPPPPPFPGYIFVF 


4227 


18128 


A 


4251 


2 


219 

* 


T1CT D I^Mflin" T\TDtinT' , T'T T UCHT fnDT DDt? 1 

DCLS S GGRGSQTHGSEL * S YHC I PAWAT 
DGDPVSKNTKFFFFFF 


4228 


18129 


A 


4252 


223 


324 


LGAVAHICNPSTIiGGRGRRITR*GVRDQ 
PGQHGE 


4229 


18130 


A 


4253 


165 ; 


336 


Eo DAQGnu V AAK V bnvix&iUJALili V r KoJj 
CKLAMKPLGEGPPDPK*ADSSS WPPSVQ 


4230 


18131 


A 


A M AW A\ 

4254 


<* 
3 


233 


GITGPATKSAP*WS*VSGPHLGAFDSLA 
MLLAQDWRALLESAKFFPIiL 


4231 


18132 


A 


4255 


394 


44 


PVNFFPFKKGGGFPPWPGGV*NPAPINP 

GGGKNFFGKKKGAPTGGPPFFFFEMEFH 
CRPD 


4232 


18133 

* 




4256 


1 o 


Z 


VW3PFVPPQPPTPO<VlVAKTC r NrP<lTTtVCl 

QGARFTRS GVRN * HGQRGES PR 


4233 


18134 


A 


4257 


404 


190 


AQL I RGWRQ* DHLS LGGGGCNE PS S CHC 
*TVSPPKSHPBLQFS 


4234 


18135 


A 


4258 


288 


407 


ENCRPRAVAHACNP S * EKCRPRAVAHAC 
NPSAFGGRGEQI PRSGEQEQPVQHG 


4235 


18136 


A 


4259 


406 


163 


GWGVQTHPGHHGETLFFLKKKK* VGGGG 
LNPLLPPWGQKGVF FSKKKKKKERD 


4236 


18137 


A 


4260 


101 


276 


LTS I VPSLWEAEGAKLRGLGRPACRESS 
LCHCTLPWVTG*NLVSKKKKKRGGALGF 
FF 


4237 


18138 


A 


4261 


2 


414 


WVAAT PN LVFVGRGN CQ D WFTG P FLTRG 
GWGGVLFPPVKRVIKGLVAPFSGPIPIR 
KNHQKVGGFLEPNKTNFKB*NFVENKNT 
RRGIKKNVPEKKKPQKGIPLPPQIFNBN 
QTWGDFDFFFFAKKKKI INS FLGWA 


4238 


18139 


A 


4262 


337 


382 

■ 


RDAR FLHCS LV * L I T * LQKE FFFVGQAG 
VQWHYFGFLQPLPSGFNQFFCLSLLSRW 
HYRG P PP SLGKFL VGWLVG WLVFETKFW 
F CW PGWRAI TGFWVPATYT FRVQAI FL 
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ing to nrst 
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peptide 

sequence 


Predict- 
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nucle- 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
jt/— oiutamic Acia, r—rnenyiaiamne, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L^Leucine, M-Methionine, 
i^i^Asparagine, r^jriQiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T^hreonioe, V=Valine, 
vt = i rypiopnan, i— tyrosine, 
X<=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


4239 


18140 


A 


4263 


406 


184 


QLLGRLRQENCHLGGEGCS * LRS PPCI S 
AWATKLDPVSE KP KTKNQKPTGLGGLNQ 
GPASLLKKIRKLGKKRPG 


4240 


18141 


A 


4264 


233 


440 


LPVHHG WGS LRKLT I LGKGEANPSFFTW 
GyQGKAXQKGQKQPGAGAHACNSSPLGG 
QGR* ITRSGVQDQ 


4241 


18142 


A 


4265 


2 


230 


GARL* SHLL VRLRKENHLNL KG KNCREL 
KWPHCTPAWGTKKDSAAKNKNQPIjPPPK 
GVKSPPQREHpLIVTCMFVQ 


4242 


18143 


A 


4266 


1 


349 


HKTNIHFKLVMVSNVFTPFPFGKGFSFG 
APGGRAGPIjPGLVETP PGRE kaffgpnp 
PRRGE * RAMAP CPGKI LVF * KKTGPPRG 
LNFWPGGPPPFGLQRGGNNGGNPWPGPE 
FFWF 


4243 


18144 


A 


4267 


2 


375 


RSEQPAGSASSGNEGLSTRASGCGGCTG 
SPNSASPPALCS ISRRALAAFPRGRARD 
LQPAMPEPPTHSVGSCAAKPPR+APPPA 
PRRPVPSTTQGLRSAGAQRGTGRHLHLQ 
PQCRIHWVKPAG 


4244 


18145 


A 


4268 


377 

< 


3 


TPAWMTERDCIWRRI^TSAPGGSWPSGPV 
PSPGAQ* RP PSQGI/3 JjWWAAAAAPRC* T 
APGPRPPPHGPGSPQGASPPTRPPRCRP 
HPRAGSAGPTGATPPGSTQGQRRRHSHQ 
LPGHPGHRVALG 


4245 


18146 


A 


4269 


1 


294 


LEDWGGRGARAHYDGFSLEPESDHYRIiR 
LGQ YHGDAGDS LS WHNDKP F S TVDRDRD 
S YSGNWAL YQRGGCWTHACAQ * ILDAVY 
ATGPKYQVHCEALH 


4246; 


18147 


A 


4270 


2 


218 


TGRI I ILSQGLQTLLPLIAF+ *LLASLA 
NLALP PTINLLGBLS VLVTTFS * SNITL 
hQKKKKKKKKKKKIFF 


4247 


18148 


A 


4271 


32 


443 


LHSDVDQXiAGLVF PGCPWPLAS PARRAP 
AGPWPRRAAAPPS * DAPAPRLAVSAGS P 
AWPPPST*GLPAPAAWASPASAS PTSA 
RSRS * AS PTARRCRPGPGTAAPTSASRQ 
CQWRP*PCQRSPPSGTCSSAAAAPT 


4248 


18149 


A 


4272 


434 


57 


HLSFPPASAAPKPLPQAASLWSSPPSIV 
LAQVPPMNTDPCPQPSA*PASELSPEMP 
PARPQAPPENWSHP*G03AQTVAPLVPG 
ATPAHRPAS PS P VT L FVLTAGGHC F YL Y 
TYQIFLKFHYIKR 


4249 


18150 




4273 


288 


424 


GLSLS VAQAWQWCDLVSL* PPPPRVKQ 
FSCIjSPPRIWDY*HPPP ' 


4250 

* 


18151 


A 


4274 


1 

* 


429 


NTRGAAGPPQMPHPPRASAFPENPCGRK 
N* GQVSGP * ASGS S PVKWAGPAGTWRKG 
GLGWGPTRVRGRGPRRRPGAS SGYARQQ 
GPGHPGFPPSPRRRLSVPACALCPGQTS 
GTRAGIAVWPQIGIiCIiKAQGARS CPRDH 
SSD 


4251 


18152 


A 


4275 


253 


3 


PHQFNAVIYFPSPVSRGCPNNALYSPFV 
IWLFCFVKTGVSVTQAGV*WHDHSSLQ 
P*TAGLKRFSSLTILSSWDYKSTPPCI 


4252 


18153 


A 


4276 


406 


3 


PMVSQGGVCAK*QARAXPREGI FRRSAD 
TQWRBVQQKYNMGLPVDFDQYNELHLP 
AVTLKTFLRELREPLLTFDLYPHWGFL 
NIDESQRVPATLQVLQTLPEESYQVljRF 
QT AFLVQ I S AHSDQN KMTNTC I 


4253 


18154 


A 


4277 


373 


3 


DGVSLCPPGWSAVAQSWIjTAVL*AIiGFI 
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Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
EM?Iutamic Acid, ^Phenylalanine, 
(^Glycine, H=Histidine, Msoleucine, 
K=Lysine, LHLeucine, {^Methionine, 
N=Asparagine, P^ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon,/»possible 
nucleotide deletion, ^possible 
nucleotide insertion 














FLFPVGGLTGIVLANSSLDIVLHDTYYV 
VAPFPYVLS IGAVFAI IGGPIH* FPLFS 
GYTLGQTYAKIHFP I IFIGVNLSFFPQH 
NLGLSGMRRCI 


4254 


18155 


A 


. 4278 


71 


176 


AGGRGCNESRSHHCT P AWATE * DSVKE K 
KKGLFFW 


4255 


18156 


A 


4279 


2 


379 


SPFVPLQSSLGNKSETPSHGRKKKRRKR 
KRTDENQP*KHPLRARPGPGAPRAQGGA 
QGSGGG RRAG P CR KLVHRGH TGNPKRE P 
GLIPQGEGRSLGIHPSNGCHKPCPSRGR 
PPPRPRQGRGRMQN 


4256 

• 


18157 


A 


4280 


2 


338 


CDMPRRKLDPLSGRNTLGFVCVVWAQTS 
GLK*SSCLCLPKCWDYRHEPLRPVWVYF 
KLWHPSPWKPGFYFSVFFPFYLPLCTTA 
SLFCSCLPPCCSRVTLVGSESSCLWPAG 


4257 


18158 


A 


4281 


407 


1 


EERGVATGHTAERGRADNEEERGEVANK 
RGGRE VQ P EARRMTTTATGTTAATRGAR 
T * TAATTATAP VTRTAAP ATTAS STRTL 
RTPRRARTAS VLWARCRS RWS AT WRAS R j 
TCDCRTAS HS R PTATRS RTARMA 


4258 


18159 


A 


4282 


367 


225 


PCDSACLGLPKGWDYRREPPRPAGVLFL 
*ASFS*QSKEVYVCKLTHA 


4259 


18160 


A 


4283 


452 

• 


1 


NTCGLQSVCRGTQRGGPKRCPCHPRERW 
DLGDEQESSKEGRPGREGVQGQGLLGAT 
PGAQGSH*GL*AEAAMSSGHALGPGQVP 
LTPLSPSPPNPHRRPRAGRASRQSREST 
EAQRAVP S QGAAPG WETD WG S SHQWQP C 
QAQQEGRTGR 


4260 


18161 


A 


4284 


436 


22 


CHVTGTQP I KVS WAKDSRE I RSGGKYQ I 
SYLENSAHLTVLKVDKGDSGQYTCYAVW 
EVGKDSCTAQLNIKERLIPPSFTKRLSE 
TVEETQGNSFKLEGRVPGSQPITVAWYK 
NNIEIQPTYN*EITFKNNCIAAARRI 


4261 


18162 


A 


4285 


301 


403 


I*SIS*PCDLPALVSQSAGITGMSHHARP 
RVKSLI 


4262 


18163 


A 


4286 


380 


2 


AQVLYSSREQERRQDLPEQVIQAEGE*V 
KASACQLTFEDEEAMESGPAALDKDFQC 
TRKHH FAEVQG S PRCM S S R YLVDGCPKT 
FAEAQNVCS * CCEANLVS IHAFTFILRI 
QWCTSTVNQAQVCI 


4263 

- 


18164 


A 


4287 


453 

• 


3 

< 


YIYEGSIMEEPRKPVPKGPLGLHCPGKF 
QG*RNSYNHHAVRVGTRCAPEGVKDLTS 
SLQSVITKPEQNIQELMKHFKKEKSEAE 
NHIRTLKAESLEBKNMAKIHRGQLEKLK 
SQCDRLTEELTQNENENKKLKLKYQCLK 
DQLEEREDV 


4264 

• 


18165 


A 


4288 


52 


400 


LDLYFFYRQCLALSPRLQRSSAILAHCN 
LKLLGSGDPPTPASQSKEITSMRYHI*P 
Nil * NSMC * NuVrDU THRIGEN I WES YI * 

*einiqnacripktqqqkthfqnkqglh 
snln 


4265 


18166 


A 


4289 


2. 


263 


IHGAHLWSQLPGRLKWEDRRSPGSRGCS 
EL*SHHCTPVWVTEGDPVSKKRINICKQ 
NLNEKTNLTVDTPRHSFTHKIKSLNHFQ 
MFL 


4266 


18167 


A 


4290 


3 


142 


YliGVHRLSIDGLDLLT**SARLGLPKCW 
DYWRDPPRPANFYFILFK 


4267 


18168 


A 


4291 


324 


19 


EMSKRHEQKFHRRELQMANKHVKRTSLA | 
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rr-i*iuiamic acio, r— rnenyialanine, 
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K=Lysine, L=Leucine, M-Methtonine, 
ii— Asparagine, jr — rroiine, 
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T^Threonine, V=Valine, 
w— iryptopnan, i^iyrosine, 
X^Unknown, *=&op codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














IRBMKI ETTMKYYHTP I QMA* VRNNDTS 
QVQWHTLQVAS P LVAE VGGWNEARNLRL 
U I VKM VPMN5HCT PAWVT 


4268 


18169 


A 


4292 


2 


204 


IHFCRDRDSLCCPGWPRTAELKQSALLG 
LPKCWDYRH* ATKPGLQSVFLLGLKI FL 
ILLVRGVDRYL 


4269 


18170 

- 


A 

• 


4293 


427 

- 


3 


RRKTVTVPRTGERGI I SNEDNGtiC KAVG 

mm* v« *m\ &m v> mm ^v&w Wtmm ti w *m.mrmm»m. itff r* 9± ■ ~m_ m 

NI PGS PLCNVSKLNIMKKAPGGHAARFC 
IWTESAFRKLDELYGTWRRAASLKSSYN 
LPMHKM INPDLS R I LKS PE IQRALRAPR 
KKI HNRVLKKNP VLRPL * R I KLTS RHKR 
R 


4270 


18171 

i 


A 


4294 


24 . 


372 


F I C PLQDYVI CS DVTSCWKKI IjATTGR 
RLRDDS VQFS FF FFGKGGSLIAPRVKEQ 
GGDLG*WNPRPPRLREFPGLALPRCWNN 
GLAP P P PLIIiVFLEEORGFPLAGKMGIiNL 

mi mt *m mm- 

LHSR 


4271 


18172 


A 


4295 


1 


332 


IHSSYFFPQS *FFGTINTSDKPLRILIK 
EKKREGTN*HIMHEPWKFGIDPEDIIK7 
KKKYYKQLCTHECFDNIiEEMVHFLKKKTI 
HLI*N£*FE*LYKYYRN 


4272 


18173 


A 


4296 


407 


30 


WSVIYGEN*ESLGWGAHPSHVANIRVTG 
LRYLFSYGRHALDMLESSQDNMRSWWVS 
QMS SE I DVDNLGH I SLCNAVQ * I RNLNP 
GLKTS KI ELKFKELHKSKDKAGSEVTKK 
EF I EVYHELYAVG 


4273 

* ■ 


18174 


A 


4297 


350 


1 


YKTWMNKEKW * YVG YN I E QE QKLAL KT 
TVLDEWYTLPDGRI I KVGGERFEAPEAL 
FQPRLINVEGVGVDELLYSTIQAADVGT 

mw*iMfVW#v m*mw m> wmw ^m> ammi *my >K m*rmm mmt m wm^^m mm mt**. m mm ^m>m» 

RSBFYKHIVLSGGSTMYPGLPSRMEREL 
KQHV 


4274 


18175 


A 


4298 


623 

• 

• 


1 

* 


S RRGCAATCDGS I TAWPQRQAAQGKPS V 
HSKLEAEAKPTPGDHAASESTGFSCLGP 
GGVHKTAHAHARNMPGDSNTGSGQSPAG 
RRWEARGSAPRHHPTQPDWTHPDALRQA 
MARNPASSF*SF*RCCTAASPPVPTPPS 
PVLMRGPVPGGGGGQGKIRPLQEEAPPP 
SSPWLSR*PQAGTPSSPAVSSLYHGGL 

o Jr^ 1 \3t\.\£U KWvaK 


4275 


18176 


A 


4299 


469 


3 


PVRNCLGARFRVSGRAAHHAL* QSASTL 
QGDPRTKRQAISA* SSAFDVQDVSHVTL 
PFYPKRAQSKDLIKEAILDNDFMKNLEL 
SQIQE I VDCMYPVE YGKDSRI VKEVDVG 
SCVYVMEIX5KVGVTKQGVKLCPVGPGKV 
x BEIiAIIiYICRRPCI . 


4276 


18177 


A 


4300 


506 


321 


KPLSLLKIQKIAGHGGSCL^IiQIiriRRVR 
QENCLNLGGEGCSGIiPKFWDYRCEPPCP 
DCWPS 


1 


lolVo 




4301 

• 


AC A 

454 


1 


KTN*GERPRSSVYSCSSWRI*SSSSLGRS 
+NSVGSLNI ILATLTAQELSNS * ANNLI 
*IAYTIAFI VKI PLYGLHIi* LHKAHVEV 
PIAGSIVLAPVLLKLGGYGI IRLTLILN 
PLTKH I AYPIiY 


4278 

• 


18179 


A 


4302 


3 


463 


AVSLPLPLPRFGRLPPLRVGWCPLRCH 
ILRTQRLEGAPSR+QTGAKQAGVWBEUR 
SRLSLGPELGRIRPYSGDPGQVTDSSGA 
LASSLERGCYPAALIjVGEGEGEMYIjGPY 
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NO: of 

U UUvUUUG 
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• 
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SEQID 
NO: of 
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nucle- 
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dine? to ■ *» - 

Mill t K\J 

last amino 
acid 

residue of 

oeotide 

sequence 


Amino acid sequence (A^ Alanine 
C=Cysteine, D=Aspartic Add, 

¥T~/1 In fa m ir" A rift rTsPliArivlalaviinA 

(^Glycine, H=Histidine, l=Isoleuane, 

K^l^vsine. I/=l J eucinf» M=1WpfhtnninPi 
N=Aso a rapine. P=Proline. 

* i tM.&mF*** C> y Villi W« 

O^GIutamine* R^Arpinine S=Serine 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion* V=Dossible 
nucleotide insertion 










» 




. HPT*DGLKRLSLAKPPGGETRLTRSGHT 
FWGEGIPPSAVCL 






A 




my 


o 
z 


FF F FFS ETE S CS VTOAG V* WHDTV3S T.Y 


4280 


18181 


A 


4304 


194 


3 


FFFFFSETESCSVTQAGV* WHDLGSLY | 




1 ftl RO 


Jn. 


*ouo 


jI f 




WPRTA PPT .TnTTTrsTf5T.v f rcnnT .2124 Q Zt Q 

QRGGMTGVSHCNRKEFVLMNI I VMSSGL 
TFL PFS PDE V* GS * GLSFLPS PLAUTTG 
LRN 


■ 


1 




4JU7 


"inc. 
276 


3 


SSLAVYS IL * SG* ASNSNYAL IGALRAV 
AQTISYEVTLAIIIjIjSTLIISGSFKLST 
LITTIiV 


4283 


18184 


A 


4308 


234 


^ AO 

398 


KKr r rr PPLifiGGGPXTTl WS P PIiFG* RE 
SPAPPPPRGGIKGLAPPPNIiFFPLDKR 


4284 


18185 


A 


4309 


403 


100 


SGGQKAVGPPWAAPPGYKQNTNKLGPLP 

yrb 1 £»Q»ViAr WP iy<jrA.t I\7.UPPijJ?U w Li£f 

PRNPTPP*WPPPPPPFNTRSKGPHTPSP 
TPLCG PPPKNPRLFFFF 


4285 


18186 


A 


4310 


399 


85 


WKPIPLAPEWRGVIRFRALPPCRPLHCP 
TLKLTAPSVIHQRT*VHWGFLVGVSLSV 

1ST Trvi T3c» nPLKHATrpoci hiivt »t hot or t 
r, L K K rJ S CS r l\j V E (_. S b r Kb A LiKb bE Li IjG 

VSDPS I SAFQRAGI AGVSHHA 


4286 


18187 


A 


4311 


230 


2 


WTBEDTRRCLVLFS FFLS FFLFCFVLRQ 
SSHLSPPQRS*DYRHTPQCI 


4287 


<01 oo 

16 Ilia 




43 12 


I 


inn 

420 


Nib WG VGBiiS h 1 V I V INMlili P Y V WLj PKO 
KFFFP QNSRFPS APPHSS P PGLRSDF SE 
SGGLFFHLEVLWGLPLPPPPPPASTHVR 

PQLSCITjQDCIRSKGDILIFTFFTLCLS 


4288 


18189 


A 


4313 


403 


207 


ES S EG * LNPLAHLAMRYKG C P FKDVREK 
SEFILKSIQVRKSILNTRMSLLQLDFTC 
RYSRCVCFY 


4289 


18190 


A 


4314 


469 


328 


TSEGGGFNELRLCHCTPVWLTK*DSVSI 
QntJT'DinrKrr.vTACT \r\ronx> 

o UK1 KJVlVM Lii\.J. y o Li V VKL f 


4290 


18191 


A 


4315 


1 


212 


NTLPGVEVLKRLRWEDSLNP* GQGCRBP 
* S CHCTTAWATEKD P VS KKKKKKKNPGG 
FRPHSLSQPFGPPK 


42V 1 


18192 


TV 
A 


4316 


03 


ceo 
559 


SSATPKPIPS*AACPPPLK7UAABVRAL 
P AAGR PRFS EACLT PQ PN P CVL W P CLWD 

HSPTSCPTGSS*APHSCAFLPVPRLGGG 
MTTVHPTVHLPIHSSLTFAWE*PSPPAC 


4292 


18193 


A 


4317 


135 


405 


PAPPSPHPKPDSVSCVIPPS*PAPPSPir 
DCTnQVcrvTDPQWva^pcnT.piPMTT.r.w 

r (Uruo v d U V Xr r on Viujr Sujurfini uufi 

PRGPCPPEVPPTSPFVG*AWWPHPPARR . 
ASGRMDGRTDGRGRHLDLRSTHS FLTP 






A 


411 R 


413 


1 


LYLVGRA* SFHRS FPGOERIHPGGKPYD 
CKECGETFISLVS IGRHMLTHRGGVPYK 
CKVaSKAFDYRSLFRIHERSHPGEKPYE 
CKQCGKAFSCSSYIRIHERTHTGDEPYB 
CKQCGKAFSCCKYIR INERTHTGGV 


4294 


18195 


A 


4319 


479 


342 


GVGVDNDG I LVLGATNI P WVLDS AIRRR 
* DNTVHI LFLTADVT3R 


4295 


18196 


A 


4320 


400 


26 


HSEAVTTVRSHHS P VGHIHS IiNAP PTVA 
LTP*YNKPLNIFV*ARAQKIQGQFLTHT 
I PHILLLVI TRDTSRACVCVCVCVCVCV 
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M 
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OG 
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peptide 
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Predict- 
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correspon 
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residue oi 

peptide 

sequence 


Amino acid sequence (A=AJanlne 
C=Cysteine, XMAspartic Acid, 
j>-uiiuramic aciu, r—i'tienyiaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
iv— Lysine, ij— leucine, ivi^ivietnionine, 
N°Asparagine, P^Proline, 
Q=Glutamine, R^Arginine, &=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
. A = un Known, ^oiop coaon, /^^possiDJe 
nucleotide deletion, ^possible 
nucleotide insertion 














CVYAYIWVCMCVYIYVCVYIYICNYTFI 


4296 

• 


18197 


A 


4321 


370 


1 


SDRGQGKAGQATRLGSRQTGQDEDKGTE 
KSIPSWDKGPVNNEFGKSVNVSSNLVTQ 
E PS PEETSTKRS I KQNSNPVKKEKSCKC 

NdUiKAt £i x l^oivLi X Kriy K X Xl 1 \SJS Kir i U 

NECE KAFSRNV 


4297 


18198 


A 


4322 


342 


82 


GLGLTTNWIiLTGKSAGRGCAPSSGGLWL 
AVGDS VYYHRETEGGKPS PQRPCC * P S * 
AGLAPPSPPPPGCPEAPGSCCLHYRCLL 
TAH 


4298 


18199 


A 


4323 


284 


3 


ITKKFPRVIGGGCFSPFFMGLSKKNGLF 
PEGALSISLIPGPAIPPGGAQVISSAKE 
KKKVLSRPG WAHACNPSTLGG * GGRI K 
RSRVPDQPV 


4299 


18200 


A 


4324 


365 -. 


3 


QVEVYWENHP FRLE ES I YQ * CRLEGAT 
S VAGEQ IS E YNISMRASDGGS PPLSTET 
HITLHVIDINDNPPTFPHLSYSAYIPBN 
NPRGAS IFS VTAQDPDSNNKARITYAIiT 
EDTLQGVY 


4300 


18201 


A 


4325 


253 


3 


PSFIiSRD*SHIHKRIipCSGMISADYNLH 
LPGS SYPPTSPSQ VAGTAGPSMSIiQEGR 
ASHDITSSSRSIGAKELLRPADHPQCI 


4301 


18202 


A 


4326 . 


307 


95 


FLFLLKDKI LLS PRIiECSGTI IAHCSLK 
LLDS S YPPAVAS * VAGTSGMCHYTWLRW 
KNHLSPGIQGCSEI 


4302 


18203 


A 


4327 


371 

■ 


2 


WAPNHI S PTPRSGSTTRIWAPTPPSNSP 
RPCQDPGHRTDPWYPEEEFLLTNPDPPR 
APASWSFPFQEKRLHFPALPCP*HLDSS 
LGPTTLFSFSFPPTSIRPSQTNHSKGTP 
PPLtSYAQHDCX 


4303 

a 


18204 


A 


4328 


409 


1 


Rr.r.RARQPELEMAALIFFTLKYKHVERE 
Q KYHQLQDE YFT S AVVLTL I IiAAIiFGLV 
YLLI FPQSVWLLIoLVLCICFLVACVLY 
LHITRVQCFPGCLTIQIRTVLCIFIWL 
IYSVAQGCWG*LPWAWSSKPNLY 


4304 


18205 


A 


4329 


262 


2 


LHGAYLVLDIIGAQNRECPRYSFKWRVGQ 

HLDLLGSGD PPT S AS * I *GTTGAYHHTQ 
HV 


4305 


18206 


A 


4330 


1 


399 


NTF*FLKGMGLVNHVFTEDNLKKLYVSN 

t y^Tr , i7*ni>va , 7 w TV»Mty 
LblOHXKzAl I\£lN V 


4306 

• 


18207 


A 


4331 


408 


3 

■ 


SSVGIHVRTHTGEKPYECKHCGKAFSCH 
QYFQKHVRSHSGVKPYECTECGKAYSCS 
ASLQAHVRTHAGA* I YKYSGHV 


4307 


18208 

• 


A 


4332 


110 


426 


GLS PGTWSDMTGGPAVTAFPCTLTGTHA 
GHIIIIX3XDIJVKLPLHTLiRSRIiSIIIiOD 
PVLFSGTIR*APPPLRPTQPQAGSVPSD 
LEHKEEGVGGDQDPRGVSCSC 


4308 


18209 


A 


4333 


133 


1 


EPCCPSAIoAAPEVLGPEKYDKSCDMWSL 
GVFMYIL^VPSPPPLY . | 


4309 


18210 


A 


4334 


165 


1 


1 1 LTFFKKNERKWKAVQYW* KNR* VDQ 
WNRIASPEIDPHE*SQLISDKGAKECI 


4310 


18211 


A 

► 


4335 


42 


440 


SARRAGDPARGAPSRNNASLPDPRELTG , 
RPAGGLQ*GNCPAPPTPQLPSPSVSRPV 
SGRGPPPPPS FGDPRANRPQNPKGAKW 
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correspond 
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nucle- 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glu tamic Acid, F=Phenylalanine, 
(^Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L^JLeucine, M=Metbionine, 
N»Asparagine, PHProline, 
Q=Glutamine, R^Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=*possible 
nucleotide insertion 














LTVQMGEPGE IRRAACPQLSWAMTFPGS 
LPEPKRTGPQIQKSPPSRRGG 


4311 


18212 


A 


4336 


1 


160 


NTCGGGAQL* SQLLRRLMWEDLNSGSQG 
CSELRSHHCTAA*ETKGDPLSINBY 


4312 


18213 

< 


A 


4337 


236 


1 


QRSLLC Y I ME I RTVAVR I VAI KGVESEV 
YLAMSEEGTVYAKKECNEYGIF*ELVLE 
SHYYTYAAAVLRPVCCIES KVS 


4313 


18214 

* 


A 


4338 


408 


3 


CQSSVSKKERTNGAQNPGAAKQGNNELR 
DSTEQFQEYYRQRLRYHQHLEQKEQQRH 
I * QQMLLEGG VNQE DGRDQQ QNRNEQFL 
NRSIQKLGELNIGMDGLGNEVSALIQQC 
NGS KGNGSNGL * VNS FDTP PHV 


4314 


18215 


A 


4339 


363 


1 


RSQSSLPKSFKRKISWSATKGVPAGTS 
DT*GVQPGWQ*RWGASTATTQKKPSISI 
ATESLKSLIPDIKPLAGQEAWDLHADD 
S R I S ED E TERNGDD KTHD KGLKI CRTW 
RARYSINEV 


4315 


18216 


A 


4340 


1 


353 


DVFLDTLARPLRHSCNFFLCYLL*DFFK 
FIPQFFYWMFYFDYIFISRRNFYFLLLT 
HFFTLPLFLVIRIQCLHLFLFFSFFFFF 
FGKGNPFLPPGWRARAQFWVNGSPPLRV 
NALLP 


4316 


18217 


A 


4341 


376 


3 


LELREGGFLPH IADEVREERS PALLDDR 
AGRCQGQPRIQVCLTPKSMLFLAFHM*T 
CEHCLEQCFSTSL* PIEIRTLH*D*GCV 
CWCAYVCICVCVHMCMCAYICICVARMA 
KRPLESIHTSCT 


4317 


18218 


A 


4342 


158 


2 


LAFFFFCETQSCCQWHDLSSLQPLPQRF 
R*FSCl»SLPSSWDKKHTSPHPTCI 


4318 


18219 


A 


4343 


145 


2 


IPGEQWFDYMNKFFSGDF*DFGASLTQ 
AVHTVPNV*SVIPCHPPRV 


4319 

• 


18220 


A 


4344 


1 


434 

• 


RSLI FRATAYEYRYIDQADLKLPASKRS 
SCLGLPKGWDYRHEPPCLARFIFWQIFI 
AYKLR*KHYKSRQRSLPSRGSC*CACDT 
FVSDPSHWGTMAQ I LGGWITLGQVQRSC 
LADQPliLPLLPPTLAPCLP *NDVLLFFF 
TQSQ 


4320 


18221 


A 


4345 


4 


475 


KHSCRSLEHLDRPLPPAIiQBTCPVRAEP 
XiLLVRINASGGLI LRMGAINRCLKHPLA 
RDT P VCT >T AVLGEQHSGKS FLLNHIiLQG 
LPGLVRAGRGRREAGS KDGGS CLGEAG S 
GIPVCGDKEPTN*CSPFSPLQESGEGGR 
PRGGEASLQGCRWRANG 


4321 


18222 


A . 


4346 


313 


442 


DSTLNFFFFF*TESYFVAQAGVQGHNLS 
SLKPPPPGFKRFSCL 


4322 


18223 


A 


4347 


437 


338 


AEVQEFYGDYIAVNPHLFSLMILGCCQG 
RNWDPAQLS RTTQGLTALLLS LKKCPM I 
RYQLS S EAAKRLAECVKQVITKE YELFE 
FRRTEVP PLLL I LDRCDDAI T P LLiNQWT 
YQAMVHELLGINNNRIDLSRVPGCMRPL 
♦RIHPGTLERSIRIiLFMPSSSWTMA 


4323 


18224 


A 


4348 


409 


118 

J 


GLHDASFVAAI PSTQWYFFFEETASE F 
DFFERIiHTSRVARVCKNDVGGEKLLQKK 
WTTFLKAQLLCTQPGQLPFNVIRHAVLL 
PADS PTAPHI YAVFTS QW* AAGPSGRWS 
TSSAAAFRRPRHSCRL 


4324 


18225 


A 


4349 


2 


418 


YTLPDFPHHPHRLSHCRLIjQSRTAETYQ 
SPGACTPISQGHPGQRSSPCSSSSNPAC 



817 



WO 01/64835 PCT/US01/04927 

« 



SEQID 
NO: of 
nucleotide 
sequence 

■ 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

* 


SEQID 
NO: Id 
USSN 
09/515,1 
26 


Predicted 

beginning 
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1 Am A^ Am ^m m^m^. 5 A** 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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E=GIutamic Acid, ^Phenylalanine, 
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K=Lysine, L=Leucine, M=Methionine, 
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Q=Glutamine, R=Arginine, S=Serlne, 
T-i nreonme, V=valine, 
W=Tryptophan, Y=0ryrosine, 
X= Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 


■ 












*SSSSCGSTGFRCPGSAHPGCCSYVL*M 
GR+RSSQAPRASPIAQGPRSAPAGRAGH 
PALPS PLP *GAP PPPSAAS S PGL* SPA 


4325 


18226 




4350 


401 


276 


DPLNVGGGGYS * PRLCHCS PT WVTEGDL 
VS KTNTHTNKKHL 


4326 


18227 


A 


4351 


446 


277 


AQDFVL I KDGEVTGGICFRMFPSQGFTE 
IVFCAVT*NEQVKVSAAPRPQPQSTAHS 


4327 


18228 


A 


4352 


408 


3 


PKSSLFTSRPNGKSWSTKRPAISLAYSN 
ENAQGSWNGDQDAGPLLVENPRDYSTEL 
SVTIAVGASLLFLNVLAFAALYSRKDTR 
RQ*PLCQLSPHRGIGASELGHAPQ*ELP 
ALLLGP IRHECEAGPPHDTLPV 


4328 


18229 

i 


A 


4353 


62 


280 


VRIGCLTSVPEASSCGLPTTPGCC*PIjG 
TLPCRSRSRTRS FRSRSRTQSRSRPKTY 
SPGRRRRSRSRSPTPP 


4329 


18230 


A 


4354 


364 


3 


PVGEEGKSIxVPKSPVEEKGKSPVSQSPV 
EEKAQYRVPKSPVEEAKSKAEVGKGDQK 
BE * EKE VKQAPKEDKVEKKEEKPKHVPE 
KKKAES PVKE EAVAEWT I AKSVKVQLE 
KETNEECI 


4330 

• 


18231 

- 

• 


A 


4355 


3 

- 

• 


558 

» 


LfiPTLLWRRGKVPK^NS PIPTSLRRTLP 
RASPGPSRGPLFMRQAGSTPMTLPRSTP 
SMSPML*MVWPPTAVPVP*KPLMWAPJ>A 
PLVLLVTI LTB IQEPATPAPLTQF* KPT 
S LMVS R PVC P WQG P RTTRS TLCATT I A 
PSHATLRPGLSTTTS PLWQTLS LLLEGQ 
ASLPKG*NTSITLPSVS 


4331 


18232 


A 


4357 


258 


1 


U3FHDPGPAGWSQPHGGRSAQAQ* P PQP 
PTVPRTSLADVSDAPLPPCGHSRPQHHP 
FRLPHTS FSTPGLS AVCL I VHPAARVSR 

*mm mm 

CI 


4332 


18233 


A 


4358 


97 


343 


DRDLGDE YGWKQVHGDVFRPSSHPL I FS 
SL I GSGCQ I FAVSL I VI I VAM I EHLYTE 
♦VPTLNCIMSLLFLNSFNHWNLIiSQS 


4333 


18234 


A 


4359 


182 


2 


KDYFGSSVENGGEEELKQEDQLGGYSVI 
*ARDDEDEDDDDDDDDEDEEEDDEDBDD 
DDCI 


4334 


18235 


A 


4360 


88 


2 


RGSRTDTG*RGSDRGTG*RGSHTGTR*R 
GSRTDTG* RGSRTDTGGRGSRTGTGGRG 
SRTGTGGV 


4335 


18236 

- 


A 


4361 


2 


250 


E LRDE G KAS SAKQRLKCAG LQ KFGKRAF 
KAWAVARLSQRFPKAEFAEVS KL VTDLT 
KVHTE CCHGDLFGCDEESAGLAKYIM 


4336 


18237 


A 


A A* -M. 

4362 


169 


A\ AW V*" 

456 


EQLLCAGQGQLSCEPLHLPSCPVGPQTP 
AHRPGRMAPGPPSFLSPGSPCGLGLAIP 
CSCQPWVAGVPDHHLFPSPMGRAGLSPS 
ERSAGEGVIPWP 


4337 


18238 


A 


4363 


1 


419 


PEFKLQKLKRSQNSAFLDIGDENEIQLS 

VYCTMEVEGEKLQTDQAEASRPQWGTQG 
DFTTTHPRPVVKVKLFTESTGVLALEDK 
ELGRVILYPTSNSSKSAELHRMWPKN 


4338 


18239 


A 


4364 


375 


448 


. PGAVAHACNPSTLGGRSGR I MRAG 


4339 


18240 


A 


4365 


1 

* 


444 


D FLTNH YLHFLR IAGSQLTGLGTAVQLY 
SAYEENNRTFLLAAVKRNHNQYVNPSGV 
ATFFES I KEILLRQSGVKVESVDHDSCV 
HGPCQNGRSCLRKLAVSSVLKSRESLPV 
1 1 VANEPLQPFLCKCLPGYSDSWCEIDI 
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nucleotide deletion, ^possible 
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DEWLPFPW 


4340 

■ 


18241 


A 


4366 

t 


52 


402 


CP QE VW FHS FCHCTSAFMKDLYFLCLVR 
FGDLKEDKVTRHDGASSDGHLAHI FRHA 
AKBLFNEDVEEVTYRALRCGAVSTACLC 
LWLAR VHGVP VW I CACGGSMDGVYTSQN 
QKDFF 


4341 


, 18242 


A 


4367 


119 


247 


WVLLS FQLHGVAVIjGIALI AMQEBIQAE 
MALRTFGHLLRYGEP 


4342 


1 8243 

i 

1 


A 

• 


4368 

• 


2 


430 


GGTWDGQPGFHGDSLSKAPGMNSLEQG 
MVGLNIGDVSSSAVKTVGSWSSVALTG 
VLSGNGGTNVNM P VS KPTS WAAIASKPA 
KPQPIMKTKSGLPPSPIKHNMDIGTWDN 
KQPVPKAPVPQ0AP3TQACPQSQQVGQT 
LP 


4343 


18244 


A 


4369 


1 


406 


ETS TP EGEAGP I QRLDIPVENPVESKNI 
FLGAPLIICHVIDKRSPLYDISATDLAN 
QDLEVIVILEGLGETTGISTRARTFYIA 
EE IQWGHRFVS I ETEEQRPDSVDYSKFG 
KTDLVTTSRCNARELDEKPSILI 


4344 


18245 

• 


A 


4370 


2 


284 


GTGTLCDLTALIiSARYDGVRTCILPCWK 
TI S T I P VALL I HYHQNQAS CGKRAI I LE 
TRQHRLFCAD PKEQWVKDAKPHLDRQAA 
ALTRNGGTFE 


4345 


18246 


A 


4371 


3 


184 


EDYNILLSIMKGTTRPINKEIEALNDTV 
DQLELTDIiCRTIiTPTIAEYSFFSSAPKV 
LIQP 


4346 . 


18247 


A 


4372 


44 


304 


GLPRS IGCQGLPRLPEWPTWVGGCHRGL 
IMPSVPTQVSERPLMFLLDTPGVLAPRI 
ES VE TGLKLALCGE PGIjGLGLG PGPLP P 


4347 


18248 


A 


4373 


1 


422 


LHLFNPAASAPSRSLFSGPILDPLSRAL 
GVGPGICGLASSPGISEGWDQIRSWTHP 
PDPDHLSGFCRSQVYMHPSLSPPSTMIL 
SGGTALKP P YSAFPGMQPLEMVKPQSGS 
PYQPMSGNQALVYEGQLSQAAGLSASQM 


4348 


18249 


A 


4374 


2 


423 


NSGTHTPGLLEDLS KNGRLPE IKLPVNG 
CSDLEDSFTILQSKDLKQEPLDDPTCID 
TSETSLSNQNKLFSDINLNDQEWQELID 
ELANTVPEDDIQDLFNEDFEEKKEPEFS " 
QPATETPLSQESASVKSDPSHS PFAHVS 


4349 


18250 


A 


4375 


292 


429 


LKIS VFPS S TE I SIiQQRLE YIARAILTA 
KSSTGIAS ISADGESLRE 


4350 

• 


18251 


A 

• 


4376 


544 


32 

■ 


ALHACACLGLSPEACFLHLLQLTGTSPG 
PVDGQGLDNQGFRXjLKAILGVSPAPEPV 
HGAS EHNADTGVS VARACVL PTS PTLAR 
TAAAKDESPPLAGItTMGPSRBPGGSCLP 
LPSGLLSSPTRLALPRRPPSVSVWQDAG 
SSGSMPTEPLVQCGHRFYGTLVSWAVMV 
LK 


4351 


18252 


A 


4377 


378 


232 


PAKRAEEELLLHDTRCWLNGGAMPEARH 
PRTGASALHVAAAKG Y I EVMR 


4352 


18253 


A 


4378 


291 


1 


QMLSVDVASRYRAPSTYVLNSLKEGMDG 
LHGESCSS FLLGPSVAMNMQTAGLEMGI 
CDGHFRQNVGCGYVLKPDFLRDIQSSLH 
PEKPISPFKGQSL 


4353 


18254 


A 


4379 


171 


458 


RGPLFLTLHIVNSLQGNNFRREYIVTQG 
PLPGTKDDFWKMVWEQNVHNIVMVTQCV 
EKGRVSKQLS WHQS FAFCHLDRILPHHQ 
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SEQID 
NO: of 
nucleotide 

#1 A Al • AIM A A. 

sequence 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 

■ 

- 


SEQID 
NO: in 

TTCCM 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino 8uo 

IColUUc Ul 

peptide 
sequence 


Predict- 
ed end 
nude- ' 
otiue 
location 
correspon 

/I inn 

oing to 

last n m Inn 
1451 allllUU 

acid 

residue of 

pcpnuc 

sequence 


Amino acid sequence (A s Alanine 

OCysteine, D=Aspartic Add, 

ej—uiu ramie Acid, Jr— Phenylalanine, 

o=\»iycine, o— xiisnaine, i— iso leucine, 

K= Lysine, b=Leucine, M=Methionine, 

N-Asparagine, PHProline, 

v^oiuiarnincj iv— /\rgiri lne, oenne, 

T=Threnninp V=Vnlin» 
i x ii i cuii niC| r~*TuiiuC) 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nilClcOIlOc UclcllUII) V-p05SIDie 

nucleotide insertion 














SETPQNNVQIM 


4354 


18255 


A 


4380 


214 


410 


LDHLPTHPSPTGE I E VKWFEAVQTGLPM 
CILGAFFGPIRLGAQSLQVLDSBLIPWA 


4355 


18256 


A 


4381 


2 


74 


IGDSGVGKSNLLSRLTRNBFNLGL 


4356 


18257 


A 


4382 


290. 


119 

* 


IATVSLNIVKMPRL.PKAIYRFNAIPIKI 
PMTFFTEIDKTILKCMWNHKI I PGNSPA 
H 


4357 

* 

■ 


1825S 


A 


4383 


2 


423 


LBRVCWIKDIWAVTGENMEVMKSI IQK 
YQHKRISLVEAGVTRHRSIPNGLKALAE 
DQI NS KL»S KPEWI I HDAVRP FVEEG VL 
LKWTAAKEHGAAGAIRPLVSTWSPSA 
DGCLOJYSIiERARHKASEMPQAF 


4358 


18259 


A 


4384 


356 


1 


WYPCCLLASLI CVFRGLFYLSLCLVLRV 
PVPLCLQFPGSLLGSLLLFSLLRQGLAL 
WPRLE CS S A I RAH CS LELGSNDS PAS AS 
QSAGITGMSQCAWPLSLLSVSPRVPGLS 
SVPLIF 


4359 


18260 


A 


4385 


1 


125 


SGSKKTLRSSLEEPTILDIEKFHRESFF 
YTHIi INFSGKRYC , 


4360 


18261 


A 


4386 


53 


215 


WQLYWTTWCQKISRI^QGLAHVPRMECSS 
MIIAHCNLKLLGSSDPPASASQIDPS 


4361 


18262 

■ 


A 


4387 


2 

• 


390 

• 


PSLAARVLARGYGNFSVFVLWLRKNDRM 
HGECAPNVSVAVSTSHTTI IGGGIRGGG 
GGGYGSGGSSYGSGGGSYGSGGGDGGGR 
GSYGSGGGNYGSGCAGSGHGS YCSGSS S 
GGYRRGSGGFGSCNSGG 


4362 

■ 


18263 


A 


4388 


2 

■ 


441 


IKTRISVIHKEEFGLTPIEGAIEDMKNK 
TLQIiAFAINQE PS D AKMLQMVL KGSEGA 
TVNQGPLEVAQVFIjABI PADPKIiYRHHN 
KLRLCFNEPIMRCGEAVEKNKRIiITVDQ 
MEi QQEIiKKNYNKL KENIiRPM I ERKI PE 
LVKPIF 


4363 


18264 


A 


4390 


242 


24 


KLGNFLGGCVKESPFVPQGGMQWPDFGS 
LQHPPSRFKQFSCPTLPGNWNFRNAPLR 
Mslvr r roPvr JUvlSKax 


4364 


18265 


A 


4391 


1 


244 , 


VDQQMRQNIjFTTGAHHLQQANI QFRTD I 
DAIMLKRFTRYELGCQGEAWEECGW 


4365 


18266 


A 


4392 


2 


146 


LDIJjSQPCRAVYIFAKKNDIPFELRI^D"" 

L> X XVVjKo LAsV tail F JxoU CAjJJK 


4366 


18267 


A 


4393 


204 


489 


GAASEHPIPKGESHGTSKASSLCADAGP 
AGHFASVYNPIAWTVTTIVTLTVGFPGV 
RVTDEAGHPVPSQVSGIQRCLQGMKAPP 

V I v 1 VouExvlxb 


4367 


18268 


A 


4394 


2 


171 


LSDGFQPSRSIIFASWSAGDFGSVGATE 
NiiifiVr r 1 1 r oniu^ijVolAjv.Lra vnr L 


4368 


18269 


A 


4395 


2 


197 


WLELLKFYTLDCALEEYVICVRIQDILT 
T? FNfWNWPKRR T AT EGEMT CCT S FVEYF T 

CATHYLIQC 


4369 

ft 


18270 


A 


4396 


2 


285 


TVDFPFQPYKCQQEYMTKVLECLQQKVN 
GILE5PTGTGKTLCLLCTTLAWREHLRD 
GI S ARKI AERAQGELF PDRALS S WGNAA 
AAAGDPIGPS 


4370 


18271 


A 


4397 


366 


461 


TSNFS CSLVLiEEEAS D YLELDT I KNLVK 
KYSQ 


4371 


18272 


A 


4399 


157 


3 


DVLRNNFARSAVLYLFI DRVTLWKAPRL 
ECSGAI IAHCSLKLLGSNDPPTS 
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SEQID 
NO: of 

nUCIcOIIGc 

sequence 

— » 


SEQID 
NO: of 
pepnae 
sequence 


M 

etb 

oa 


SEQID [ 
NO: in 

09/515,1 
26 


Predicted 
beginning 

IlUCIcvllOc 

location 
correspond 
ing to first 
amino acid 

■Aft! i^»aa at 

resiaue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

pepiiuc 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

G=G.ycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=0Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serlne, 

X=Thr*vtninA V=Vq linp 
i — j nr cum DC, v~vi*iiiiv, 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, /^possible 

nucleotide insertion 


4372 

- 


18273 


A 


4400 


2 


343 


DIDFKYHFDFFSVNFNEELVALYGGSLQ 

TCOTKPVRR PT KT T T .TCT iYKGDR PA PTfc; V A 

IMJA JvC V nDulix J. -LUIVLJ -L I\\J\JCt * /Mr «VO VA 

I IGHSMGGLVARALLTLKNFKHDLINLL 
I TOATPHVAPVMPLDRF ITGSTGWLKDH 
G 


4373 


18274 


A 


4401 


214 


430 


EU3NILNYWRWEAEAELSNRWPNGQAV 
HGE L S P VTDFRE S C CRQ YE MGE CTRGG F 
CNFMHLR P I SQNLQRQ 


4374 


18275 


A 


4402 


3 


227 


LTQVSPQMTGHAGLNTAQAGGMAKVSEL 
KHFQYFLPNRGLSLRVMRKMGPRGPLLF 
FLLGSPRATHAWKRQISF 


4375 


18276 


A 


4403 

■ 


4 MW A 

158 


A * A 

313 


NRDEF CHVAQAGJuEIiLiGSSS PPAATSQS 
ACITGVSYHVRPTLSNKFGCSSti 


4376 

■ 


18277 


A 


4404 


3 


^ A ft 

628 


HCIREGGQD V PSNKDVTS LDWNS BGTiili 
ATGSYTKSFARIWTKDGNLASTLGQHKGP 
I FAL KWNKKGNF ILSAGVDKTTI IWDAH 
TGEAKQQ FP FHS APALDVDWQSNNTFAS 
CSTDMC IHVCKLGQDRP I KTFQGHTNE V 
NAIKWDPTGNIiLASCSDDMTLKIWSMKQ 

T"MkT/T ftrfNT rtTV tTKTtTt? TVPT WiTC IMV« TV UVT 

DNCVHDJLiQAHNr\JEI jTi 1 K.W S rlvj Plj JL II W 
PNANLMLASASL 


4377 


18278 




4405 


35 


180 


MCVDYHYFFCLIjCy pnnnnp qts avrtpt 

QTNGSNVPFKPRGREFSFGK 


4378 

• 


18279 


A 

• 


4406 . 

• 


1 

■ 


438 


DFQRPRDDHGDVDWEKLVELIiTDCSNLQ 
DQTYILYILYVIKGPSWDTNLSGQNGVT 

TT^KTT T PPT VPT/tV^T TkT/ - ^G*T»W*'T TQVTOOT TJ 

VQDUjIjGE JbYGKTGijNQEWGIji P x ISS JjF 
RKKVDVIiAEACTDLLS DQKQLTVGtiSDE 
PREKI IYAPLPPKELTKLI YEASGQDI I 
HjVJbly 


4379 


18280 


A 


4407 


1 


428 

■ 


TESVNAYFKGADPTKCIVKITGDMTMSF 
PSGIIKVFTSNPTPAVLCFRVKNISRLE 
QILPNAQLVFSDPSQCDSNTKDFWMNMQ 

JVXTTTtTVT TrtTT.CT?01JD7y &CWMVTYT7T WftV 

SLNGIQSTPLNLATYWKCSASTTDLRVD 

VIC 


4380 


18281 


A 


4408 


174 


383 


KNSWPGAVAHACNPSTLGGKGRQITRSL 
YRDHPVQHGEIPLYLKLTALAVTFLGLL 

TOT .TTTIT.YT iTTNTiT 


4381 


18282 


A 


4409 


1 


288 


LMAEKDSLDPSFTHAMQLLTAGKGASRV 
PLGRPAI PGMSGPGFVPLASRPLGTEAV 
GSSGLILIPGSLDSTFPPQPGVRGRISR 
LQ WRPEGDFAI P 


i A TOO 

4382 


1o2o3 


A 


A A 1 A 

4410 




4ZZ 


NIFCTIMTSEDFLDA 


4383 


18284 




A A t 1 

4411 


412 


238 


HSAEITGLNHHAQPLTTNFNKYCWISLQ 
VH 


4384 


18285 


A 


4412 


1 


405 


VTSSDKSLKVTJLiDAEDKVFNEIRNEHFS 
NVFGFLSQKARNLQAQYDRRRGMDIRQM 
KNFVSQELKGLKQEHRLLSLHIGACESI 
MKKKTKQDFQELI KTEHALLEGFN IRES 
TS YIEEHIDRQVSP IESLRLMCL 


4385 


18286 


A 


4414 


2 


149 


GFHHVGQAGLEVLTSDNLPVSASQSAGI | 
TGVSHCAQLRSDGFKKRSSPA 


4386 


18287 


A 


4415 


1 


366 


DICHRKLGVECPSHAS INRLWQWSSI 
TASLRFEGPLNVDLIEFQTNLVPYPRIH 
FPMTAFAP IVSADKAYHEQFSVSDITTA 
CFESSNQIiVKCDPRLGKYMACCLLYRGD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


eth 

• 

od 

• 


SEQ ID 
NO: in 
,USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AlanIne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
OGlycine, H=Histidme, I-lsoleuctne, 
K-Lysine, JU=Leucine, M^Methionlne, 
N=Asparagine, P=ProlIne, 
Q=Glutaroine, R=Arginine7S=Serine, 
T=Tn reon i n e, V— Vali d e, 
W=Tryptophan, Y^Tyrosine, 
X 13 Unknown, *=Stop cod on, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














WPKEVNAAI, 


4387 

* 


18288 


A 


4416 


1 


164 


RRRCEMSTWFADTLLIVFISVCTALLAE 
GITWVLVYRTDPFGTLAGSEGPGGRY 


4388 


18289 


A 


4417 

< 


2 


141 


OIAI PCNNKGAHS VGIiMWWMLARBVLRM 
RGTISREHPWDLTSSVMG 


4389 


18290 ' 


A 


4418 


136 


1 


EQTKAFGQLHKVLGMDPLPSKMPKKPKN 
ENPVDYTVQI PPRDPMQ 


4390 


18291 


A 


4419 


1 


280 


GNYCKYYGYRGPSCEDGRLRVLKPBWFR 
GRDVLYLGCNVGHLTLS IACKRGPSRMV 
GLDIDSRIilHSARQNIRHYLSEELRLLP 
QTLEGDPGA 


4391 


18292 

i 


A 


4420 


3 


446 

■ 


TVSTSQVRQNYHRDSQAAINRQISLELY 
ASYVYLSMPYYFDRDDVALKNFAKYFLH 
QSHEDRDHADKLMKLQNQRGGRSLLQDM 
RKPD CDD WES ALNAMEWALHLD KNVNQA 
LLELHKLTTDKNDPDLSDYIETHYLNEQ 
VTAIKELG 


4392 


18293 


A 


4421 


6 


182 


DPSFCSEPRSCHCTPTWRTEQDSISEKK 
KKRNI LKNWAKNY I TLB SEKERKS KQRS 
RR 


4393 


18294 


A 


4422 


1 


456 


GPISYVWMPKKRQALVEFEDVLGACNA 
VNYAADNQI Y I AGHPAFVNYSTSQKI S R 
LGDSDDSRSVNSVLLFTILNPIYSITTD 
VLYTICNPCX3PVQRIVIFRKNGVQAMVE 
FDSVQSAQRAKASIiNGADIYSGCXTLKI 
EYAKPTRLNGSK 


4394 


18295 

* 


A 

• 


4423 

- 


2 


257 


FHHVGQAGLKLL ISVDP PALDSQSAE I T 
G VSHCAQP VLC X LNQLCAKAVLS PDSHP 
YPPKSGRHBGQVREQKMSPCSYIYPKRT 
V 


4395 


18296 


A 


4424 


106 


259 


CCCFKYFFSNVKI CFYRDLKPENTLLNE 
DMHIQITDFGTAKVLSPESKQGV 


43^ 


18297 


A 


4425 


3 


344 


MDE I EKYQEVEEDQDPSCPRLSRELLDE 
KEPEVLQDSLDRCYSTPSGYLELPDLGQ 
PYSSAVYSLEEQYLGLALDVDSEYLTVK 
VI S LHLVFQ I G V I FL FQVPLTDP RDVI A 
AGRTYGRI 


4397 


18298 


A 


4426 


2 


241 


HHVWL I CFVF FVC VETGFCHVAQVGLER 
LGS SDLPTS VSLS AG ITDVNHWAQSKPS 
PSKGTGMNRP IGLRVGKG VWSRL 


4398 


18299 


A 


4427 


2 

• 


438 


TRLIGPNCPGVMNPGKCKMGIMPGHIHK 
KGRIGI VSRSGTLT YEAVHQTTQVGLGQ 
S LCVG IGGDP FNGTDFlDCIiB IFLNDSA 
TEGI I L IAE IGGNAQENAAE FL KQHNS S 
PNS KPVES FIAALTS STGRKMGHAGQLL 
AEEKV 


4399 


18300 


A 


4428 


34 


427 


AGMLPAVGSADEEEDPAEEDCPELVPME 
TTHS EEEEKSGLGAKI PVTI ITG YLSAG 
KTTLLNYI LTEQHS KR VAVI LNEFGEGS 
ALE KS LAVS PGGELYEERLEL INACLCC 
SVKGNGLIAIENLMRQKGE 


4400 


18301 


A 


4429 


330 


8 


YSDRISHWIDMCPSKYIFRGPVHPPHIS 
TFLHTAHNLVARGNYENMSPHDBEKGAY 
QRKE KC CDD PMAGAAR I HFLC FFEMGFR 
SVSQAGVPWPHITSMQPQPPRIK 


4401 


18302 


A 


4430 


446 


222 


LTSSDPPASAPQSAGITGVSHRPQPIWG 
VSIQLYFSGCPDTPGQGRLTREHCDGQD 
GEQGGPCS ESVAQFCHDWS 
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SEQID 
NO: of 
nucleotide 
sequence 

- 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

* 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
{^Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Wso leucine, 
K«Lysine, L=Leucine, M=Methk>nioe, 
N»Asparagine, P=Proline, 
-Q=Glutamine, R«Arginine, S=Serine, 
T=Threonlne, V=Valine, 
W=Tryptopban, Y^yrosine, 
X= Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


4402 


18303 


A 


4431 


3 


243 


AREDSADFVSFFPDFVWTLRDFSLDLBA 
DGQPLTPGTSQKDNNFILPRLC IRKFTS 
LIRTS RS YQS PDTVS VASSRGAR 


4403 


18304 


A 


4432 


433 


0 


PPPSPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPSPSSPSSSS 


4404 


18305 


A 


4433 


25 


185 


ARWVACSE LRLRCCTPAWATE PEP VFKK 
KKKVQQQGG W PGGLTP VS YRFGRLT 


4405 


18306 


A 


4434 

• 

i 


2 


329 


VRD I KEKLC YVAVD FEQEMATVE FSSSL 
EKSYELPDGQVITIGNERFRCPEAtilQP 
S PLOMES CGMHETTLNS IMRCDVDZRKD 
LYANTVLTGGTTMYPGIADRMHNEI 


4406 


18307 


A 


4435 


2 


374 


WVTFISLIiFL PS SADSRGGFRRDAHTSE 
VAHRLKDLPEENFKALLLIAFAHYLHQC 
P FEDHVELLNEVTE FAKT CVADE SAENC 
DQSLHTLFGDTLCTVATLRETYGDMADC 
CAQQEPERNECF 


4407 


18308 


A 


4436 


2 


352 


KVRR I DSRLAELDRE IRNPQI SCQLI YS 
PSLFLGAFFLFFFFGFFFPPPYliLAPKN 
QFNPPPLCTGGPVPRPKPPIPPWEPPRP 
APPLFSPQWDLGFQI^PFLWVINGGGPS 
GGSRA 


4408 


18309 

• 


A 


4437 


2 


369 


QSCETLFHSWKDVEVCSSALSCLSQVSV 
HLQGIiESLFILPGMEVEQRDSQMALVES 
LBYVRGE IS KAMADFTTWKTHIiLTSDSQ 
GGNQMLDEGFRBDFSEQMEIAIRAILCA 
IQNIiEERKNE 


4409 


18310 


A 


4438 


2 


231 

* 


SDTSRPQLPCPAARERFPDGPLSLRPLP 
FFLSQVYNEQIHDLLEPKGPLAIREDPD 
KGWVQGLS FHQVWDWARVG 


4410 


18311 


A 


4439 


1 


365 


MAHAME EVKKCLGPDMMDDI CHEQFLEL 
SYLNGVPEPSRGRGVPVRGRRAAPPPPP 
VPRGRGVGP PRWALVRGTPVRGAI TRGA 
TVTRGVPPPPTVRGAPAPRALTAVMQRI 
PLSPPPATK 


4411 

i 


18312 


A 


4440 


92 


510 


IoAALPDGGQELHIPRSRAGPAGEIiSWAV 
VGTGALSSLSILFQMARCRPWPPLRPNS 
PLQGW I FNWP WLVAI PASL FS G FLS DH 
LINQG YRAI IVRKLMOGMGLGLS S VFAL 
CLGHTS S FCES WFAS AS IGLQTFNHR 


4412 


18313 


A 


4441 


2 


266 


FP PHL PELG KWECP WHQCDECS SAA.VSF 
CEFS PHS FCKDHE KGALVPSALEGRLCC 
SEHDPMAPVSPE YWSKI KCKWESQDHGE 
EVKE 


4413 


18314 


A 


4442 


2 


279 


KGAEtiVFLPSNTRS LMYPIiDQGVVRTFK 
AHYAGYSMERIVSAMEENPGREKIMKKL 
LKLSSPKE I PAGEKLCPNDAHDFTELDD 
RVIHENHE 


4414 


18315 


A 


4443 


2 


371 


KWVTF IS LL FLFS ATDSRGVSRLDAHTS 
EVAHRFKDLGEENFKALEAIAFAQYLQP 
WPFEHKVELGNDVTESAKTCGADESADN 
CD KSLHTL FGIDTLCTVATLRE TYGEMAD 
CCSQPEPCEKG . . . 


4415 


18316 


A 


4444 


2 


372 


WVTF I SLLYLFSSG YS RGVFRRDAHKSE 
VAHRFKDLGQENFKALVLIAFAQYLQQG 
S FEDHVKLMNEVTEFAETCVADES SDDC 
DKSLHTLFGDTLCTVATLRETYGEMADC 
CAQQEPERN3C 
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SEQID 
NO: of 
nucleotide 
sequence 

'i 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 

rTCGM 

09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acjd 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
Hi— vviutamic ACio, r = rnenytalanine, 
OGIycine, H=Histidine, I=Isoteucine, 
K=Lysine, L=Leucine, MNVfethkmine, 
N=Asparagine, P=Proline, 
(<j=Vjiutamine, K— Argimne, 2>— Serine, 
Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Mop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


4416 


18317 


A 


4445 


1 


283 


FRKNAEDILTMEOLRATMKHELEAAQKK 
HSLWELLRIPNICKRIWXLSFVRSA6VC 
WKb I JfJJi^ VCLfcri. ISeVjCRTAUlFACSrij 
WKLHSIGAPA 


A All 

4417 


18318 


A 


AAAC 

, 4446 


173 


2 


IUUUnv. V It Jf PEQxcivj Vif HAljVt^CVCVCVC 
VCVCIQVNFRNFCFISW1NLSVQVSLLD 

V 
X 


4418 


18319 


A 


4447 


104 


221 


SCHLHYSLGDKSETLSKNHHYHHHHHNH 
HHHPAGYFKMK 


4419 


18320 

• 


A 


4448 


241 


1 


WVSWDKAPSWVRGLGSGSRGGVWGGGSP 
PHR VTD VS CS FL P DGRHVWBMEAKTDRD 
LCKPVSAFGAAGATVGPQQLGGRC 


4420 


18321 


A 


4449 


229 


349 


WKRSHSFGLDWAHACNPSTFGGQGGRI 
MRSGVQDQPGQY 


4421 


18322 


A 


4450 


76 


189 


TESKPAATTRSSGGGGGGGGKRGGKKDD 
SHWWSRFQKV 


4422 


18323 


A 


4451 


40 


359 


SGEPLFPFFFSLARLIPGVCFVEMHTRV 
RLLIGLKIWEKKVPQVSTPTLVBVSRNL 
RKVGSKCCKHPEAKRMPCAEHYTiSVVLN 
QLCVLHEKTPVSDRVTKCCTES 


4423 


18324 


A 


4452 


2 


371 


KFQNALLVRYTKKVPQGSTPTIVKVSRN \ 
LGKVG I KCCKHSBAKGMPCAEDYLS WL 
NQ VCVLHEKTPAS DRVTKCCT E S LVNRR 
PCFSALEDYETYVPKECHAETFTFHAD I 
CTL YENERL PQ 


4424 


18325 


A 


4453 


118 


369 


GRYILLLKTKKQTANNNIKTPQYLSNMS 
KKFRHSE FF FFYLKWSLALQRCNLGSLQ 
PTPPRFNQLSCLSLPNSWDYRCVPPNLV 


4425 


18326 


A 


4454 


364 


3 


GHLSLQRLLLPFVWLCPAPRGGAYRGRQ 
ASLSCGGLHP VRAS RIiLCLPKQACTMAO 
APPPASLLPCSLISDCCASNQRDSVGVG 
PS EPCAGYNLLVCRFLS PSE KRS PSAGV 
MRFSRCRL 


4426 

■ 


18327 


A 


4455 


1 

* 


379 


AAFTE CCQAADRAACLLPKLDELRDEGK 
ASSARQRLKCASLQKFGERAFKANAVAR 
LSQRFP KAE FAEVAKLVTDLTKVHTECC 
HGDLLE CADDRS DLAKYICENQDS I SSK 


4427 


18328 


A 


4456 


1 


362 


LRPARSLVFPWFAPGGSGLRGLKLIjEAK 
CQGDGVS YEETT I PRPS AYHNLFGLPL I 
SRRDAEWLTSRELDSLALNQSTGLPTL 
lliVKul ALJaPriU-djlrlijtsUrKKi VrWUj 

DDLRSWEA 


4428 


18329 


A 


4457 


250 


330 


EWWTHLWLNEGFASWI EYVCVDHCFPE 


4429 


18330 


A 


4459 


96 


319 


YLTI PFSQAMENCKDARLTTS IGAYNFN 
HRLLEMILNKPGLTYKPVCNQVSALSLL 
S FIiFFMPLFI»SYCQ VS IH 


4430 


18331 


A 


4460 


50 


339 


NSQTLKLSNVQCLKTRALFFFPKGIYPQ 

TlTT*T.DDPI>T , Qr3PnPVVP'l7MT.T.PirftK'r2D<^ 
X M lJlrJrv.lr 1 Ol7XrU.tr I VK.ViMiaiJr' J\\jJ\Ajrrk». 

LKKI SGNRKTLKP LFDKTLKFLVPMEKV 
PKSPLNVPVKNI 


4431 


18332 


A 


4461 


2 


357 

± 


G YDGVKRWTRDGD I FN KE LLL I P I HLEV 
HWSLISVDVRRRTITYFDSQRTLNRRCP 
KHIAKYLQAEAVKKDRLDFHQGWKGYFK 
MYCKHLALSQPFSFTQQDMPKLRRQIYK 
ELCHCKF 


4432 


18333 


A 


4462 


182 


325 


IRIiSSW LCDCASCQVYEFRVKESS I IAP 
APAEDVDTPPRKKKRKHRL 



824 



WO 01/64835 PCT/US01/04927 

m T 



SEQID 
NO: of 

n ii«*lc*At iff tt 
U U CICVI II u c 

sequence 

- * 

■ 


SEQID 
NO: of 

sequence 

- 


M 
eth 

uu 


SEQID 
NO: in 

T7SSN 

09/515,1 

26 


Predicted 
beginning 

niirlfntirlp 

tlUVICvllUC 

location 
correspond 
ing to first 

Amino Actrt 

residue of 

peptide 

sequence 


Predict- 
ed end 

mimics* - 
ii uni* 

otide 

location 

correspon 

ditto 

U1IIK III 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 

V=Cllainm\c Arid F=Phpnvlnlanin(» 

G=Glycine, H=Histidine, I=Isoleucine, 
K-JLysine, L=Leucine, M=Methionine, . 
N=Asparagine, P=Proline, 

0=f* In famine RcsArcpiniiiP' fiaQAr!n» 

vf^VJIU Mlllllllv, II /»l glUIUCf O IIIC^ 

T^Threonine, V=Valine, 
W=*Tryptophan f Y^sTyroslne, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucieouae insertion 


4433 


18334 


A 


4463 


1 


338 


SRHSPGPTPQPDCSLRTGQRSVQVSDTS 
SCSQLSSSSGSSSSSVAPAAGTWVLQAS 
QCSLTKACRQPPIVFLPKLVYDMVESTD 
SSGLPKAASLLPSPSVMWASSFRPLLSK 


A A1A 

44.54 




A 

• A 


44o4 


L 


1A1 

J 43 


TTr\PT?T/^2iVT VTVfftrT.VTMDVnKTVFDZiriri 
XXUc C UjAI J-iJNJJlJj V Jjl\.J.l'lf vypjyi,KAljv 

RTRFKAFVAIGD YNGHVGLG VKCS KE VA 
TAIRGAI ILAKLS I VPVRRGYWGNKIGK 

ys 


4435 

i 


.18336 


A 


4465 


I 


371 


INSTLKMSFVGENSGVKMGSEDWEKDEP 
QCCLESI FPALAS SLSS AS AEKGS P X IjIi 
GVSKGBFCLYCDKDKGQSHPSLQLKKEK 

T WT7T TV T^/"\^T7 C? TV T5T5 T3I7T T7VT3 Ti/*kT If* O UUMT 

ESAAHPGWPIC 


4436 


18337 


A 


4466 


246 


12 


LCFTPIjPSYILFFFETKSHSASPKIiECS 
GVI TATSEVI LLGGGGCSELRS CY CTPT 
WVTKLDS ITKKRRKRKRKDL Y 


4437 


18338 


A 


A A f**J 

4467 


2 

• 


355 


WVT F I SLLFLFSS AYSRGVFRRDAHTSE 
VAHRFKDLGEENFKALALIAFAQYLQQC 
PFEDHVKLANEVTEFAKTCVADESAENC 
DKSLHTLFGDKLCTVATLRETYGEMADC 
CAKQEP ' 


4438 

• 


18339 

- 


A 


A A V* O 

4468 


2 


361 

• 


EDHVKIjVNE VTE FGKTCVADES aencd k 

SLHTLFGDKLCS VAT IRETYGEMADCCA 
trtrn , m? , Dwi7r , T?T j^uwTiFkKroKn dot \rotiPTr 

DEMCTAFHDTDETFLKKYL YE IARRHPY 

PV2XPPT.T.T. 
IT InruULHJ 


4439 


1 8340 


A 


4469 


1 


373 


SSQPSSPSESVSGTVSVSPSSLPSSPCL 

oUO VV^JjLiVV^ V oiJOfir irOlJr' V^,JUo VfJjV- V 

SPRPHPSVSHAHPHRASALVNCPWGPAP 
TSAPVSFYKPQSPALKTGQASPHPHPHP 
FPTPSPMQSQKK 


444VJ 


ioj4i 


n 


AAH(\ 
44 /U 


4J0 


j 1 1 


WSAWQSQLTAAVTSQA 


4441 


18342 


A 


4471 


345 


201 


AWMTAHLFMAWFTEYFKSTAQKKKFVFK 
TLLL I DNFDPLWDS KEGKL 


AAA*) 
444 vi 


1 Q1A1 
1834,3 




44 /Z 


1 


\AA 1 

144 


ITGLNHHTWPFSVILTYTYY 


444j 


1 CJ/l/t 

1oj44 




44 /J 


j 




VWUT TfSRWPFPPriH^ PQr'DHT.TiQ C! Q 
RBnUMO AVJOHJCr J XT Ar UtlO JroVvtrCXLUJOOO 

NYHSLSCFLGGFFSETGSAPSPRLECSG 
VIAAHCSLKLLGSTSPLTSASWVAGITG 


4444 

1 f 1 w 


18345 


A 


4474 


288 


1 


TSAVNRIS IKSQADQEATEQKLRDYSHQ 
GIOIliAWOKPEYKPGKVTMDPKDCILOL 

AKKKEE EEEEEE E EEE EBEEE EEEEEED 
OEEEEEEGEEEE 


4445 


18346 


A 


4475 


3 


124 


PRIiECSGMIMAHCSIiDIiLDSSDPPTLAS 
OVAGTTKKISLH 


4446 


18347 


A 
• 


4476 


196 


2 


SCYEHPGRIVYGFLFLRQDLAVTQAGVP 
WHDHGSLQPLPPELKRFSGTSHTiTtSSQO 
YRRAQPRR 


4447 


18348 


A 


4477 


3 


162 


LLERNKE LEGS LQQMYSTNE EQ VQE I EV 
RALHVLAHTPRGPLGLCVPIiCNLE 


4448 


18349 


A 


4478 


167 


2 


ACX3ISPFSLSSSCFGHVRHASLPLAFHH 
DOGFPEASPAMLPVQPSQI»NliFI KYPV 


4449 


18330 


A 


4479 


195 


26 


WVSMLDIHDNGPHEVAVICFVCFLSQSL 
ALLPRLECSGVISAHCNLQPGRQSQTPS 


4450 


18351 


A 


4480 


175 


1 


NRSATKCDWGEKRTSQELNLFFWFFVGL 



825 



WO 01/64835 PCT/US01/04927 

i 



SEQ ID 
NO: of 

niir Ipotirte 

II UtlbvLIUV 

sequence 

■ ■ 


SEQID 
NO: of 

sequence 


M 
eth 

uu 

. 


SEQ ID 
NO: in 

09/515,1 
26 

* 


Predicted 
beginning 

location 
corresnond 
ing to first 
amino acid 
residue of 
peptide 
sea uence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

seauence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenvlfllflnine. 
G=Glycine, H=Histidine, I=Isoleudne, 
K=0Lvsine. Lr= Leucine. M=Methionine. 
N=A$paragine, P=ProIine, 
Q=Glutamine,R=Arginine, S^Serlne, 
T—Threonine. V^Valinei 
W«Tryptophan, Y=Tyrosine, 
X—Un known. *=StoD codon./=DOSsible 
nucleotide deletion, V*possible 
nucleotide insertion 

mm mm mmm ^rmj mi • mt m ••Wwl X* Bfl 




• 






■ 




FEMESHSVAQAGVQWRNPGSIiKPLPPRF 
KR 


4451 


18352 


A 

» 


4481 


122 


357 


YHAPLYQWYNLKREHKRFRCSLLIIFLG 
GERKLQSIQLWNISHYIQIRRYVLBIPS 
GLGAVAHACNP S TLGGQGEW I T 


*t*T JX 


1 OJJJ 


A 


44K2 


137 


9 


FYSYLKILEBMGPHYVPQAGIiELLGSRD 

m» mm m^aw mm mmmm VmovW* mim^ m> • ♦ ^» w ^ m^mmmmWmmrmmmmmmr^gj Mr 

PPDSAYQSTE I IGMSHH 


4453 


18354 


A 


4483 


347 


208 


GQAGLE LLTS S L PAS AS Q SAG I TGAGHH 
TRPR X YFLKKYLAFVIX5S 


4454 


18355 


A 


4484 


411 


268 


RLNRRGGGCS E PRS CRCT PAWATEQNS I 
SKKNQQKTTKLPLCCRLILL 


4455 


18356 . 


A 


4485 


2 


102 


PGLLKMQQLAGHGGMHLCSQLLRRLRCF 
LHSYP * 


4456 


18357 


A 


4486 


1 


113 


HSAHSTHAGHAGHTSLPKCWDYRSBPPR 
LAKFLNTRY 


4457 


18358 


A 


4487 


163 


353 


QVAATLNNLAVLYGKRGKYKEABPLCKR 
ALE I REKVQKKGAWFFE I PVLCVCGFF 

E i? 17 iUwr 


4458 


18359 


A 


4488 


408 


307 


RGCSEPRSHDCSAAWLTBRDSVSKKNKN 


4459 


18360 


A 


4489 


337 


179 


YS INFALI L I FL FYLGVL VWAAI INYHR 
LHGLNNKHLLIMIWMVGSPRLSCG 














GQEIETILANM 


4461 


18362 


A 


4491 


180 


365 


ENNtKTtVALDMSFYFltFLFYPiLFYF 
ILFYFLRWSFALVAQAGVQWRYLGSLQP 
LPPRFK 


4462 


18363 


A 


4492 


269 


385 


GNLQNLE ECVFYKI QHPFM I KTLNKLVID 
RTYLNIIECAIY 


4463 


18364 . 


A 


4493 


103 


336 


QLLVS L I NLFLQAF KLQ I DCGHTCMHFW 
RQNLCLLPPTLPTPRRNLTLSPSLECSG 
TIWAHRKIiRLPGSHHSPASATR 


A AAA 


J oJUJ 


A 


4404 






YIKIENFCLGWAHTCNPSTLGGRG 


4465 


18366 


A 


4495 


47 


224 


ETATKTQRRWETALVPQHPHRPIAPVTW | 
VKTTGNS TIANFli I LVYLS FLFLVYS C 
IQQ 




i oJO/ 


A 






190 


ITLNPGGRGC5ELRSCHCTPAWATERDS 

mm m mm m m> \m*\mf mtS>*mJ ^m+mj mm*mmm+\*+Jr mm +7 nir«««« 

VSKKKKKFPPPPF 




J OJOO 


A 


4407 




119 


LPELEFEAAVSHDCASELQLGHQSKTIiF 
QKKINRI INNNNNSHIAISLLGIYPKEF 
KAGTQRDICTPMITASLFTIVKRQKQPK 

• m^mmr m> mm • mm m m mm mt m mmmr Mi m* m> mm w • ™^ ~ m^ • ™~ 

CLATNELVYRM 


4468 


18369 


A 


4498 


1 


152 


LSLPGSQTWWRALIVPATWEAEVGGSPE 
PRRSRLQVAMITPLHSSCGRRG 


4469 


18370 


A 


4499 


92 


9 
*» 


KRPGWTHACNPSTLGGRGGPITKSWR 
DQ 


4470 




A 


4500 


116 


1 1 


AG IAGMCLHAQL I FLY FLVETRFCHLGQ 
AGLELLTLWS 


4471 


18372 


A 


4501 


99 


1 


QGT P VHRWWKCKLVQPLWRT VWS FLRKV 
KIDNS 


4472 


18373 


A 


4502 


140- 


288 


RQFHYVAQAGLELLGFSNLPALTSQSAG 
ITGASHHNQPTS RFLKEKKIG 


4473 


18374 


A 


4503 


330 


181 


FFFFLFFFFFFFFFFFFXXFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFF 


4474 


18375 


A 


4504 


171 


2 


VETVSKESSLRICLQDIRYFFFETKPHS 
lAQAAVQWHYQGSIiQPQPLRLKQSSRLS 


4475 


18376 


A 


4505 


345 


74 ■ 


S VERTCHS PKPLMIiFLPLVFQARDDILN 
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SEQED 
NO: of 
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SEQID 
NO: of 

Ft A Pi r-t /I A 

pep no e 
sequence 


M 
etb 

Ou 

- 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 
nucicuiiuc 

location 
correspond 

■ no fifst 

Ing IQ IlrSl 

amino acid 

I Colli UC Ul 

sequence 

i ■ 


Predict- 
ed end 

nil /*| a_ 

otide 

location 

currespun 

ding to 

I act o m Inn 
. Ia3i StllllllU 

acid 

residue of 

n en tide 

sequence 


Amino acid sequence (A=Alanine 
0=Cystelne, D=Aspartic Acid, 
jc^— oiuunnic Aciu, c*— j^nenyiaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
ii^ A3p«raginc, r^rroiine, 

Q=GlutaminerR-Arginine, S=Serine, 

T^ThrpnnlnA V=ValinA 
W=Trvntonhfln. Y=Tvrosiru» 

X=Unknown, *=Stop codon, /^possible 
nucleotide deletion. \=rH>mihl£ 

nucleotide insertion 


■ 


- 










GSHPVSFEKACBFGGFGAOIOFGPHVEH 
KHKPGFLRA.CKPYQTAEPASALTIjDFPA 
SRTVQK 


4476 


1 8377 


A 


4506 


268 


102 


KNIRELW PGAVAHACNPSTPGGRGRW IT 
RSGDWLNLRTCBHVILQQKGKFRFLIS 


4477 


18378 


A 


4507 


1 


67 


GLPKCWDYRREPPHPARNCKYF 




i oj iy 


a 








ANFPVF FE EMGFHR VADTGIiEVTiGSSDr. 
PILASKSAGITGISYSVRPQTNTE 


4479 


18380 


A. 


4509 


174 


3 


NEP KWI Y 1 1 JPGDPLFFLLLLFFETESH 
YVTQAGVQWHDIiGSIiQPLPPRLQRFSCV 


4480 


loool 




4510 


84 






448] 


18382 


A 


4511 


127 


2 


LLIKIHCWPGWAHACNPSTLGVRGRWI 

TO Cf^T JfMVl DT?AVf 
1 Kou V UJjy ira\l X Vj 


4482 


18383 


A 


4512 


109 


345 


EIFLIQGHEGSCLCFLPGSFMCFVATGF 
HARANNLFCFGSMPSHGITGLNF 


4483 


18384 


A 


4513 


290 


13 


RDLRPTPQKGPKNHTAPNPLQRAQTHVK 

GRWGEKXKKNKNGPKDKKGKKKKKKKE 
RKIQHSRF 


4484 


18385 


A 


4514 


234 


362 


TV^nirr CVmTMI?iyoT?V17\7IiV"'TPT TTV3DT3T 

KYRLNGDFLFLRLKL 


4485 


18386 


A 


4515 


190 


330 


L.JaALiUo£iUt' KKW5 r AJjI AUAo VQWRULifa 

SPQPLPPVFKRFSSLHPFE 


A A 0£ 

4450 


183o7 


A 


4516 


2 




. V* VL V \- VL V L. V C V C ± C C C r Lifa JEUS.WVVF irirXj 

CPPGGPKTLGPKESPPLAPKNRGNSGLT 
PPPPPTFFF 


A A 0^ 

4487 


18388 


A 


itn 

4517 


Ait 

94 


339 


on! iAiJjKr r L»r r r rrerer JrrviVr err r 
FPNKKKLKPPQIFFPPQKRPPIFFFPPP 






a 






• 


Ii YIiGPGAVAHACNP S TLGGRGGO TTRSG 
VQ 


4489 


18390 


A 


4519 


14 


121 


LIFFGRNRI SLCCPVWSQTPGIiKPSS WL 
SLPKCSD 


A/i on 








OAT 


L 


SLALPPRLECNGT I MAHCSLHLPGSSNP 
PASASQVAGTTS 


4491 


18392 


A 


4521 


169 


1 


ANLTQVGSYYSLSLHTLIQVTFFKNIRA 


4492 


18393 


A 


4522 


2 


82 


GFILCVCVCVCVCVCVCVCVLYFIFF 


44>3 




M 


45213 


3UU 


3 


T.RT ,DDC PT .UQriT Pf3/3nif DT.T3T. T 17DDPUD 

KKFIPPNFWAPPLISPRPFWVRFLGPPT 
FIKKKFSPPPKKQK 






a 


*I5Z4 


3*rO 




KPRFrinKRPONPP PfiRT .KPfiNPTiMPOGPl 

GSPEPNFPPWPPPRGAKKKTRPQKKKKK 
KKGRKE S RLN YGME KRMKE KE KYFLS AL 
N 


4495 


18396 


A 


4525 


313 


161 


RGHPKNLKSKEVNIRREAPGHPRRNKKK 
TPNLKKKKKKKKKENKKQKQKP 


4496 


18397 - 


A 


4526 


118 


3 


FFFFLRGS FALVAQAWQWRYLS S LQAL 
PPGFKRFSCP 


4497 


18398 


A 


4527 


153 


2 


KDAINKMKKWPGWAHACNPSTLGGQGG 
WITRSGAQDQPSHYGETPSLPK 


4498 


18399 


A 


4528 


258 


2 


GGQKRGLPGFIKKP PFRKI FLKNLGEGP 
GKNSLSKFFFFFFFETBSHCLTQAEVQW 
LNLSSQQNKPPRFKQFLCLNHPSSWDYR 
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SEQID 
NO: of 
nucleotide 
sequence 
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SEQID 
NO: of 
peptide 
sequence 


eth 
od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue ot 

peptide 

sequence 


Predict- 
ed end 
nucte- 
ouue 
location 
correspon 
"ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=olutamic Acid, F=Pnenylalanine, 
o=i»iycine, H— Hist id me, 1= 1 so leu cine, 
K=Lysine, L=Leucine, M=>Methfonine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginioe, S=Serine, 
1— inreonlne, V— Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, ^possible 
nucleotide insertion 




• 










R 


4499 


18400 


A 


4529 

* 


296 


*»r\ 

39 


SIR 1 1 N I Q I LLGF FE TGCHS VAQAHLKL 
LGSNDPPASASESTEITEMQHCTRLSIQ 
IFVAFPYSyiKL5TKKEPILIKT.TJ.YNC 
K 


4500 

• 


18401 




4530 


292 

• 


37 


SIRI INIQILLGFCETGCHSVAQAHLKL 
LGSNDP PAS ASBSTE ITEMQQCTRIiS I L 
IFVAFPYSNNKLSTKKESILIKLLLYNR 

K 


4501 


18402 


A 


4531 


3 


138 


GLWNGS CLCVCVCI FMGMGACLVCICTC 
LYCCVPVNTCLCMDGR 


4502 

1 


18403 


A 


4532 


380 . 


2 


GGPPKKRESAKAPGGRNPQGGGREKTPP 
PKKKRPPQKGGGKKTPPGGGGGPPRGGG 
KKKPPPKREKECHRGGRGGGGPPWTPK 
EGPPFFFFFFFFFFFFSLPFFFFFCFFI 
FILFFPNIVINGFY 


4503 


18404 


A 


4533 


3 


191 


LSKITHAS I PVPVSKKKKKKKKEKKKKK 
KKKKKKJOCKKXKKKKGGGGFKKTLGGHN 
FSRGLE 


4504 


18405 


A 


4534 


370 


48 


GPAPEVSSNGGKGGGCPKGLGKPLGSQF 
NGVKGYPS P KSS PL FQAPLGGQGFPKPR 
GNGGTPPFFPKKKKKKCALGDGTPTSGC 
PRRS WNHPE TQDGOGGWGSRVPR 


4505 


18406 


A 


4535 


147 


1 


KGPPLFFFFFFFFFFFFFFFFFFFFFFF 
FFFFSFYFYI IYLFFLSFLRL 


4506 


18407 


A 


4536 


198 


1 


NPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFLSFIF I FLYFNF I 
SSNIVFIGFY 


4507 

■ 


18408 


A 

■ 


4537 

a 


2 

• 


356 

- 


QNRTTGNS KTQ S AS P P PKECS AS PATEQ 
SWMENDFDELiREEGFTRSNYSELREDIQ 
TKGKEVENFEKHLHECITTITNTEKCLQ 
ELTEL KTTARELRD E CRS LT SRRDQLAE 
RVSAIiE 


4508 


18409 


A 


4538 


347 


220 


KCSGTI I AQCNIKLLGSSNPPVSASQSA 
GITGMSHHVQPKYC 


4509 


18410 


A 


4539 


308 


,2 


FSISCPHDTPAAASQSAGITGVSHCARP 
IFSFICFFFIClMCIKYLIiRTGDSVMYF 
HGLGCFGLCCYIiESYIRKLSHMKKLSIL 


4510 


18411 


A 


4540 


2 


194 


I KQ WSTALS PRLE dSGTI TAHCSLKLP 
HYVDVDAA 


4511 


18412 




A <A 1 ' 

4541 


101 


2 


Pl.CtlS£4lJliH 1 iyuiuKuJUJr loAl Jr ftAToAl 

KAKID 




1QA 1 ? 


7V 


454/ , 


J 


it a 




4513 


18414 


A 


4543 


3 


183 


KIQILFFVSS KTRLRPSAWHPRFVFCFL 
VDAA 




IO*rI3 


n 




1 




RKKKKKEKXKEKKKKKOGRKKRKRERKE 
EEAKKKTKNKDK 


4515 


18416 


A 


4545 


104 


2 


QKKKKKKKKKKKKKKKKKKIYDFFLQIN 
CMCSHK 


4516 


18417 


A 


4546 


5 


108 


FKRLSCRSFPSSWDYRREPPRPANFCIL 
SRDGIS 


4517 


18418 


A 


4547 


131 


1 


RPRRPQVGLQDATS P I X EELI TFHDHAL 
iriFLICFLVLYALF 


4518 


18419 


A 


4548 


156 


3 


KGGQGMS CLS RS PSALLKFECRVI SS YW 
AWWRAPWPATROAEAGESLEP 
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M 
eth 
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SEQD) 
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nucicuiiue 

location 

correspond 

ing to first 

amino acid 

resiouc oi 

peptide 

sequence 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

loci fi m inn 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

IT— T^l it framfa* AriH P^Phanvlolanina 

Hi — wiuuuuiv ^vtiu, jr— x nenj iditinincf 
G=Glycine, H=Histidine, l=lso!eucine, 
K=Lystne, D=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

T=Thrpnninp V=Vo lino 

W=Tryptophan, Y^Tyrosine, 

X=Un known, *=Stop codon, /-possible 

IIUCIKQIIOC UolcUOI], \ — possiuJc - 

nucleotide insertion 




1 QAOfl 






i 




KKKKKKKKKKKKKKKKKKKTFFGGGPLW 

QLKTWGKKAFFPRKGGGPLNPPQGPYQG 
A 


4520 


18421 


A 


4550 


165 


335 


PHYLPRRSI IHRPFKSHSPEPGAVAHAC 
i^<?TTinnRnnwTTCinnRFT?'r<?riTMMFTrc! 

HrOi J-IV3V3lvV7V? nil DOyO r CIO UimiCiJUS 

L 


4321 




A 


4331 


123 


3 


no unp t rr t.t .w q wn yciwt .pt .wt . zvwpp 
IFSRDGVLSSW. 


4322 






4332 

i 


3 


in 
131 


FGYVGMDVSYDPVRL 


4323 


18424 


A 


4553 


Of) 

87 


ULA 

364 


lUAOVd VVf A I/UwnnOuuiliia X CiOU Jr 9 

AAGTASAAVPSLEAQTPAAFRPTPPGLA 
PSSPPAPGPPAPTGKVCGRHFLRSESSA 

QftDT .PfcT.T. 


4524 


18425 


A 


4554 


114 


0 


LTWAPHSLIETIRNQIIQALLITILLGL j 


4525 


18426 


A 


4555 


91 


3 


TTTTYIAIATACVCVCVCACVCPCVCVC 

M . 


4526 


18427 


A 


4556 


239 


364 


CIKITFSRPGAWH^GNPSTLGGQGGRI 


4527 


18428 


A 


4557 


250 


1 


S TEHVKQLRNE FMKI SVIL I FLE ILLY I 
rs.VxlljiVLu.liJl llJxlljljlJUXlwljrvXux^Kirva 
A VAHACNPSTLGGRGGRI PRSGDRDHS 


4328 


18429 


A 


A«0 

4558 


2 




ULAj-J-iUlilj 1 W wo Aulio Jjir Av, n JJ X 1C9JS Jr 0 Jj 
F 


4529 


1 8430 


A 


4559 


65 


394 


DPVSKKKrGCKKKIKNEKEKICFFGKKK^ 
KKKLRADL TLKPIHCK^LKKREWKKRE K 
GODPF^KNKPQVGKTGNNLNIXJQNGLGK 
GPSDLKKOTPBTKLNLVQKKFQI FF 


4530 


18431 


A 


4560 


93 


2 


GIISAHOJIiCLPGSEDSPASASQVAGIT 
GO 


4531 


18432 


A 

* 


4561 


125 


4 


GAY I LYLNKKS KS I RPGMVAHACNLS TL 
GGRGGRI TRSGD 


4532 


18433 


A 


4562 


3 


84 


R S CHCTP AWATERE Tl^ PKKKKKKNF 


4333 


1 OA'i A 

18434 


A 


4563 


1 A €. 

145 


•* 
3 


P PIAS IGFFFFFFETQSRS 


4534 

* 


18435 


A 


4564 


125 


388 


IPJKPQILYSPQSSEimQHljHISVJuMGVT 
GKRGKFMDILDVTERCRFSLKKCTGIQB 

nT.MPf WWftT PPT.WT TPT JJTVTfMAVPQTfT.T 

GRFS 


4335 


18436 


A 


4365 


CO 

5y 


323 


V & I? VfiWT ^7 Q PTA7 T 1TVDT »Wn T PPVTir DT .UF 

VLISSFDEGEiWELEFFFFFFFKKKSPF 
UTQASFOWPNLSSLKPQPPRLKOFFCLG 

LLIT 


4536 


18437 


A 


4566 


326 


3 


PEKE FRRL VI KLI REAPEKGEAQCKE XQ 

DI LLVMQNALESLSNRIEQVEERNSELE 
DKVFDLTQSNKDKAKRIRKYEQS 


4537 


18438 


A 


4567 


333 


209 


EQAFDPYGTLSQGI FSPKFHEPFHKLMA 
RPAVALPFQVNLK 


4538 


18439 


A 


4568 


125 


2 


HNQSMSRS I DCSDP I LGHCNLCIiPGSSD 
SPASASQVAGTTG 


4539 


18440 


A 


4569 


212 


2 


PIC^GKKCYGQNv^lVIKa^MWKHNHRL 
GVVvTIAOIPNSLVGQGGRFMRSGVIjEQH 
EQHAETPSLLKTLK 


4540 


18441 _ 


A 


4570 j 142 


14 


GRVDRSNPGRFLSTSNSSLYERPREIRP 
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NO: of 

n it 1*1 Pftti ft P 
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sequence 

1 
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SEQ ID 
NO: of 


eth 

OU 

■ 


SEQ ID 
NO: In 

09/515 1 
26 
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Predicted 
beginning 

II ULICUUUC 
lUVHUUII 

correspond 
ing to first 
amino acid 
residue of 

- pC|JllUc 

sequence 


Predict- 
ed end 

n 1 1 o 1 o 

vUUC 

location 
correspon 
ding to 
last amino 

UCltl 

residue of 
peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCystelne, D=Aspartic Acid, 

Er=\jriuiaii]ic Aciu, r—.r Deny la la nine, 

v=wjcine, n— xiisiiaine, i = Jsoieuctne, 
K=Lysine, L^Leucine, M=Metbionine, 
N«Asparagine, P=Pro1ine, 
Q=GJutamine, R=Arginine T S=Serlne, 
•^Threonine, V=Valine, 

w— i rypiopiian, I — tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucicoiiuc insenion 














TSQAPSPVNHIIST 


4541 


18442 


A 


4571 

• 


2 


419 


ARDS FLHDSQTSFCFSDS I PTPSNMEET 
QQKSNLELLR I SLLLIBSRLE PVRYLR5 

MT77\T\TT\TT 1ITTn r POT\OrYrwTTT T VT^r PPfi T l"\rp 

W Jp! AJMJXJli VrUU 1 oUb JJU YHJjujIUJ JjJ&JbG 1 QT 
IWGRLEDGSRRTGQILKQTYSKCDTNSH 
NHDALLTNYGLLYCFRKDMDKVETPLR 


4542 


. 18443 


A 


4572 


3 


194 


HEGRLQSQLLRRLRQKNCLNTGGGGCSE 
PRSHHS FPAWVTEHGFISQISLKPLFI S 
HPEPFLL 


4543 


18444 

* 


A . 


4573 


2 


286 


CRDGLTYNDFLILPGYlDFTAbrtVbLTS 
ALTKRITLKTPLVYSPMDTVTEAVMAIA 
MALTGG IGF IHHNCTPEFHANEVRKVNK 
YEQGFITDPW 


4544 


18445 


A 


4574 


44 


325 


RQWLQEEAYITKEQKYSFLHDSQTSLCF 
SDSIPTSSNMEETQQKSNLELLHISLLL 
IESRLEPVRFLRSTFTNNLVYDTSDSDD 
YHLLQDLEEG 


4545 


18446 


A 


4575 


374 . 


33 


LRLARRSGASLCX5SRSGGPPTCANICPR 
YAPLIPPCRTSIiliEAQQLVMWGPDFRV 
RLVTS SERRFRYQP PPPGFNRFPCLSLP 
SNWDYRHAPPRIiANFNSLFFFFFLVETG 
Y 


4546 


18447 


A 


4576 


400 


225 


PI FFWFFLFVLFLVETRLCHWQAGLE 
IjLGSSNLPISASQSAKITGMSHPAWPRA 

EF - - • 


A CAn 

4547 


1 O A A O 

18448 


A 


4577 


2 


372 


ARETI FCFSNS I PAPWNMDETRQKSNIjQ 
IiLRISLIilliLYRTRLTLPCITQLHILQT 
YKVNARCS HFHS S S TCGPLYSCTLYVSL 

rrv^T nMVT /TJOTT T VTVXT0 7VT MOT T tPT/"\tFKT 

LNQ VS P VFNAH 


4548 


18449 


A 


4578 


349 


174 


GCG E PRS HHCT P AWATRAKRCLKKKLKL 

r\j TnjxnTYTPQTmn&T y\vrc»exjT TwnQHTKTriv\ 
ULKIiVVKb&lttiQPijyrvJr bnJjrTSTlT'JTQ 

EN 


4549 


18450 


A 


4579 


3 


124 


HRT.TjNPEGEGCSEPRSCHCTLAWVTRAR 
LRLKKKKKKKKN 




15451 


A 


a con 


lol 


1 


nSS x x JfS PCSRIjIjMATVYNIiHLi PoSSNS 
PAS AS Q VAGNAGARHYAW L I FVLLVETG 


4551 


18452 


A 


4581 


227 


3 


HRKRNRCV S RTALLFS LGDG VS PCLKI I 
III KVLFWLGE VAHACSPS PLGSRGGR I 
TRSGVRDQPGQHGETPSC 




10/10 


_ ... 

A 


A COO 


OCT 

257 


2 


KVKyJjUAr bi^PGSGSrSKILGAJjKXMG 
PPRAGPVGSPRYSRPLGGPGRQIPRSGD 
PGPPGLPRGNPVPLKKNPKRSLLILPAR 
A 




1 OAKA 

18454 




45oi 




94 


GP 


4554 


18455 


A 


4584 


345 


50 


lYIYICIFI^RNRVSLHCPGWSKTPGLK 

QDRIPKNDPSCTPS PAQMSTNL IGWASS 
YLQSYSPAPYNNKN 


4555 


18456 


A 


4585 


338 


19 


FQLSRRFRPKNGGNPGSGACS EPRLGPC 
PSAWVTKPJ5SVSKKKKEYHTLSGLSTTG 
I YFSQFWMLGVQGASRFGVWGGAVS WS I 
DGAVLLRPHMAEGARQLSGLOF 


4556 


18457 


A 


4586 


2 


224 


ARGVGRAGLELQTSGDPPASASRGAGIT 
GVSHRVQLLGDRAKFSLKKKKRGI ILPG 
EFYSPLKKRFTLNTPDNP 



830 



WO 01/64835 



PCT/US01/04927 



SEQID 
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sequence 
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SEQ ID 
NO: of 

sequence 

• 
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M 
eth 

OU 


SEQID 
NO: in 

IJCCNF 
Uoon 

09/515,1 
26 
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beginning 

VI t ■ J||a|| flHA 

location 
correspond 

lag lo iirsi 

amino acid 

icoluUc Ml 

peptide 
sequence 


Predict- 
ed end 
nude* 

otide 

location 

correspun 

ding to 

I as I a III 111 U 

acid 

residue of 

peptide 
sequence 


Amino acid sequence (A=Alanfne 
OCysteine, D=Aspartic Acid, 

YT— - fl ln+a m i/* Ai*trl G^PtiAntilnlonina 

n^— o i u la 1 11 it- /\ciu, nenyisiiinine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 

Q=Clutamine, R=Arginine, S=Serine, 

TT— Th r&n n i n P V— Vi lino 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, /^possible 

n iif*lp«t#irlf> H**lft«nr» 

nucicouuc ucicuun, v^pussiuie 
nucleotide insertion 


433/ 


loUo 


A 
*v 


*tJOf 




106 


P PT5TWHT YNPSTLOnnAGR T R <? T^5Vnn 
KPGQHRQGLSLGVEDKPGQHSETPSVQS 
NP 


433o 


Jo459 




4D66 






GVSHHTQPYF 


4559 


184oU 




4389 


1 1 A 
110 




T.QNinVTtfnT.VNDTJT.T .T VT*\7VP ITT.lf CC YT»K' 
UofM^IMN^U X INZrlt jUJU X X V« V V XtAJjAOO X Ut\. 

QAEVQGSKLGSIiQALPHGPTPLSGLNLP 


4560 


18461 


A 


4590 


2 


127 


APJGCSEPRSRHCTPAWATEQVSKKKKKG 
REKTFGQFFFWVI 


4561 


18462 


A 

• 


4591 

! 


211 


2' 


ARGS LEAGS STPAWATINQSTNKKTDKT 
AljXnxUvMroKl x ivAKyKoPlrvjr l^ooXVUX 
IjTLFIiGAKVSCLV 


4562 


18463 


A 


4592 


3 


120 


HbKlKlNMPXKJlwAiUJl^K^ 

KINGWPGAVF 


4563 


18464 


A 


4593 


388 


227 


GHVGQDGLELLTSDAPPSLASQSAGITG 
VS HRAQPKWLLFQHRS PCL PEAAGK 


4564 


18465 


A 


4594 


1 


348 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 

OT DBD151JDDOOCGDOGOni3Mnnil1'TCTnk 

c 


4565 


18466 


A 


4595 


2 


138 


ARAARGFRHVGQAGLELLTSGD P PASAS 
RSAGITGVSHHTQPTNF 


a m aT 

4566 


18467 


A 


4596 


315 


2 

■ 


ALKPNYS STSMN I KARLPQ YNINNLNLA 
PLYSSTSNQINI IVI I LINKENNHTLFP 
CLVSNSRLKQSSHLGLPKCWDYRRELLC 
ixftx^UxUr r XoxoAFAiyixLiisL. 


4567 


18468 


A 


4597 


1 


146 


GTSVDQGSLELLTSGDPPASASQSAEIT 
GV3HRTQPPLT I IQLYIjGYC 


J m*A% 

4568 


18469 


A 


4598 


293 

* 


3 

4 


GGTP WPRKKIjGQRKFPHRP pggsgnpd f 
S IiLFFFFLRQSFALVAQAGVQWCDIiGS L 

GSFFGREASASC 


4569 


t QA*7f\ 

lo47U 


TV 


4599 


1 1A 

174 


*> 


PPPRFKRFSCLTRIiSSWDYRHAPPHPAS 
C 


4570 


1847 1 


A 


46UU 




ZjZ 


XirUr r oxlC» iAs 1 Oil o Mil w A 1JNXU1IUJ£>U X \J IV 

PJCjDASGLERAS YPLAAEFKVGGRGCSS Q 
PLTLCGYP 


4571 


18472 


A 


4601 


233 


366 


I H YL PD AVAHACNP S TLRGCGGR I TRS R 
npnvpnnwf5PTPFTiT.1T 

UIvU' X rvyilOu x ire \ IIjJn. 


4572 


18473 


A 


4602 


182 


325 


LFVFCLFLRQCXSVAQAGVQWHDHCSLY | 
pnppQT.irnQciPT jTiT.PQQKmY 


4573 


18474 


A 


4603 


202 


2 


GGSS SGLLR PCAVLGLKQSS YLS LVRRW 
nHMQWT.lTRQTHT dCLiPKGDJDYPP RPPHT ,ZV 

CILKFSINLV 


4574 


18475 


A 


4604 


317 


2 


FRLGPQVFFS P PGAGFFFCGP WJGPGP F 
SRSLQIiLTFGAKFFFCLSLPES FGAPLR 
GPPSGFMAHCFFFIiVLLFFFFFFRDSVL 
LCCPGWSAVAIHRRDPTTARA 


4575 


18476 


A 


4605 


233 


71 


FFVIiLVE I EDQIjNE KKQEGKI RKKKRMK 
RNKQSLQKIWDYVERPNLAGHSGSRL 


4576 


18477 


A 


4606 


114 

■ 


1 


HPRPARVSSCCPl^SAMVWSRIiAATSAS 
RVQAILQSRA 


4577 


18478 


A 


4607 


348 


3 


LKFQNFRGWHGARCPSFFGGLGRGIPF 
TPEVEVLVNLGPAPALPFWPPSQTLFSQ 
NQKKNQKKKQERKREKERKEERKKKKER 
KKKEKBRERKEGRKKGQEKKRKITAKSL 
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SEQID 
NO: of 
nucleotide ( 

ScvJUcllCC 
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SEQ ID 
NO: of 
peptide 
seq uen co 


eth 
od 
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SEQID 
NO: in 
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no/cir f 

26 


1 Predicted 
beginning 
nucieotiae 
location 
correspond 
ing to iirsi 
amino acid 

i*0ciif it a nf 
ICSJUUC Ul 

peptide 

can it A«i oa 
Sc(|UcIlCc 


Predict- 
ed end 
nucie- 
onae 
location 
correspon 
ding to 

loot dmlnA 

iasi a in 1 no 
acid 

residue oi 

peptide 

can ii Ann a 

acqueoce 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
£=ulutamic Acid, F=PhenyIalanine, 

Glycine, h— Mist id me, l~lso leu cine, 
K=Lysine, LHLeucine, M=Methionine, 
ix^Asparagine, x^rroiine, 
Q=Glutamine, R=Arginine, S=Serlne, 
i—i nreonine, v s vanne, 
W=Tryptophan, Y«=Tyrosine, 
a— un Known, — oiop cotion, /^possioie 
nucleotide deletion, \~possible 
nucieonoe insertion 














LG 


HO / Q 








I 1** 


i 


GI I IAHCNLKLLGPSDPPTSVEKTKKLV 


4579 


18480 


A 


4609 


171 


334 


DPHSSLNIRFSRSVAQAGVQWCNIiNSIiQ 


4580 


18481 


A 


4610 


2 


164 


ARGSRGHIWSAFCHVGQAQLELLTSRPA 
LtASQS AG ITG I SHRAQPFLNTIiYSNS 


4581 


18482 


A 


4611 


2 


164 


ARE PRGHI WS S F CHVGQAGLE LLT&kPS - 
LASQSAGITGISHRAQPPLSTLDSMS 


4582 


18483 


A 


A 4f mj **% 

4612 


180 


**** A 

334 


CCLMGSSyLLPRKPNTXiRTAGFLQQRKS 
LIRPGMVVHACWPSTLGGQGOWI 


4583 


1 8484 


mm. 

A 


4613 


189 


a 

1 


SGFSRWENGLKCKGQKGGQS LVLLP I R 
VFFFFF FEMKSHS FAQARVQWGNLGSLQ 
PLPLGFK 


4584 


18485 


A 


4614 


216 


358 


KSTPRLGTVAHTCNPSTLGGQGKRITRS 
GDRDHSGLHKGT PRLSKNI 


4585 

■ 


18486 


A 


4615 


179 


393 


■M « A AK AaHb W| AM Ate A A ^te A A te te »AiA| A^ ^te^ ^A A A^A ^te AA 

EKQEEEGS WGWAKTEDGVALMLSBAIGS . 
CW I KAFLTDAETFTFHAD ICTLSE KERQ 
I KKQTALVELVKHKP 


4586 


18487 


A 


4616 


2 


126 


ARVQRYLFLFFTLDGPLPDPSMIRGSVP 
NQMMPRITPQSGK 


4587 


18488 


A 


4617 


338 


181 


RVGEAGLEUiALSDIjPASDSQSAGITDX 
SHHTR PG S P FLEHHL YPR YR I SGK 


4588 


18489 


A 


4618 


3 


197 


HEKRPEVQNSTVEFIASSDYMIiRPPLPL 
PFCFFLFFFKIKMGVSPCCPGSCQTPSL 

• AA, Ate Atey^ te tete v 

KQSSCLNV 


4589 


18490 


A 


4619 


1 

> 


339 


GTRTFGSGNGPTKPDtiLQELRVATERGL 
VIVNCTHCIiQGAVTTDYAAGMAMAGAGV 
ISGFDMTSEAALAKLSYVLGQPGLSIiDV 
RKEIiLTKDLRGEMT P PS VEERR PSLQGN 
T 


4590 


18491 


A 

-r 


4620 


3 


306 

■ 

■ 


LEPGDGGCSELKSCHCTPAWVTERDS IS 
KKKKKKKKGGPLKGTLGGPKFNGGWQRK 

B #\ f P?f ^* ™* ~ ■ te te Vte Jtea M aa ate te» ^ 

IFSQKGGQPCKPNLGILGRNLNFGGEKNW 
NKFLTKI KVLREKKNF 


4591 


18492 


mm 

A 


4621 


397 


29 


NTKEKILSARKNNQVFPPHTQKKNFFPK 
KRKNFREGIPPNFPPPKKSPPQKNHPPQ 
KKKPPKEKKKTIARPPKKRAPPGAFKKK 
PPPFFFIiSSIiCISYSFIYFFSLINLFFF 
FFFFHDSTVY 


/i coo 


18493 


A 


4622 


2 


235 


VSLjCHPGGTI sthynfclpgs RD PPTSA 
SAYALL I FVF FVETGFPLVAQAGPKLQG 


4593 


18494 


A 


4623 


209 


324 


I LRVL W SGTVAHACNP STLGARGRQXMR 


4594 


18495 


A. 


4624 


52 


287 


NPFSFLFFSFLS FFLFFFFLEKGPRRGA 
KAjQIuQ Kr HP J\JU1KKjSJSKGGGKKNK TQKP 
FSPLKGPRGPPRGKGQTTPKGP 


4595 


18496 




4625 


102 


1 


THSFPVSISLSLSHTHTHTHTHTHTEIY 
RVREIR 


4596 


18497 


A 


4626 


125 


1 


DRHVPQHTANFFWLIEMGFLHVGQAGL 
KLLTSGNPLVPLV 


4597 


18498 


A 


4627 


3 


167 


ERLRQQNCTN PRS EGCSEPGS PHCTPAA 
WATEQDSVSKKEKEKRKKEKKKKIW 


4598 


18499 


A 


4628 


135 


3 


AASTLS RGTITAQGNLCLPVS SNS PASA 
S R VAG I TGARHDTWL 


4599 


18500 


A 


4629 


320 


210 | W FLG 1 1 FRRVENAES VNIDLTYD IQSFT 
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DTGKTPSS 


4600 


18501 


A 


4630 


98 


326 


SSPTLAFQFQYFFSCATKBAVKLSFPKS 
LLGTGSRSVAQPGEQWHNHGSLQPQPAQ 
LRLSSCLTLVSGWDHRCAPP 


4601 

■ 


18502 


A 


4631 


275 


2 


RVKLLLFYMRQSCSVIQVESSSTVNGSIj 
QPQPPGPRQSSHVSLLSRWDHKRMPPCI* 
ANFFLFFWRAKGLAILPVHSNLVNTBSP 
LISIKSN 


4602 


18503 


A. 


4632 


180 


1 


KGLTLYFYFF,I FKMRRGLAMliADWSQTP 
E LNQ FS CLGLP KCWDFRHE P PHP VKE F K 
PFHR 


4603 


18504. 


A 


4633 


234 


5 


KPLRDKPLRSFSNLPKKNLNFFWPFFPQ 
KIWGFFFFFFETESHCVTQAGVQWLDLS 
SLQPPSPGLRQFLCLSLSSS 


4604 


18505 


A * 


4634 


171 


2 


GrRVDi LTPWQHFEITjFS FFS FF W&3LS V 
TRLECSGVTMAYCSLDLLGSGNPPTSAS 


4605 


18506 


A 


4635 


324 


211 


LRNII FMPVTMNPGMYTGGCMCVCVCVC 
VCVCLCVCVC 


4606 


18507 


A 


4636 


95 


1 


MRQSINLSPRLECSGTIPAHCNIjCLPGS 
SNS 


4607 


18508 


A 


4637 


234 


1 


KKGHDTLTEKAMPVTLS srmls invckc 
IYKVWKNSPNYSSLTHHINNIKPSFTRE 
NTLMFIHLSPILLLStOTDIIT 


4608 


18509 


A 


4638 


2 


243 


WGKVNVDEVGGEALGRliLVVYPWTHMYF 
EFFGAFYISFCLFFLIFPHLFFVSSIHI 
VSS FYLLFYNFP 1 1 FFTLFCHFFL 


4609 


18510 


A 


4639 


118 


2 


GRVDLQCSGAISAHCKLHLQRSRHSPAS 
ASQVGAHATT 


4610 


18511 


A 


4640 


50 


168 


^INKMHLITFLKKKKKKKKKKKKKKKK 
KKKKKKKKKKN 


4611 

• 


18512 

■ 


A 


4641 


315 


87 


GIFLRDRNIiALLPRIiECSGTI IAQCNUS 
LLS PKDSPAS PS PVARTTGVQPRPDTLT 
LFELIYIRYKGMIFSQGDC 


4612 


18513 


A 


4642 


383 


2 


KFKKKKFAGGKAPARLSPPLWGPKGGGP 
PRAGVIiSPPGPPGETPFFIiKNPKFPGHG 
GGPRKSPFFGGLKGENSI/TREKGGPINP 
NLAPPAFPPKGQNPGPFPKKKKKKKNRS 
S PAWNWDWIGTNAGK 


A A* 4 ft 

4613 


18514 


A 


a a* a *± 

4643 


3 


205 


LKRFFCFSLGGQGCSEPRSCQCI pawet 
QRDSISKKKKKKKTRPGGLTLEPPPFGG 
VKPKGPLGAGI 


4614 


18515 


A 


4644 


149 


1 


FFP I KADLVFFFFFLRRNLADSQGAVQ W 
HDLGSLQAPPPGFMPLSCLSL 


4615 


18516 


A 


4645 


474 


344 


AIHEHYLPESSASGDCDEDLQAACTILV 
NALMDFHQRLAAHTV 


4616 


18517 


A 


4646 


165 


357 


LCN/UiSHLIiPQCTPTPLISILCIlDTIS 
LLIQPITHAVRLSANITACHLLMHLILL 
STLSICTIi 


AC n 

4617 


1 Of 1 o 

18518 


"™s — 


4647 


2 


339 


RVNCVSQKKKKNQPKPKPNQTKPNQTRK 
GSPGFPLLNLKVISPEKSLNPNILKTAP 
KVTFRNWLTPE I PP FWKMEEKGS FDPRI 


4618 


18519 


A 


4648 


313 


187 


LTSGDP PASASQSAGI TGVSHHALP PL V 
QFPSHSLTPTPLFC 


4619 


18520 


A 


4649 


236 


324 


GQVWWPGMMAHACNPSTLGGRSRWIVRS 
R 


4620 

* 


18521 


A 


4650 


225 


313 


GQVMWPGMMAHACN PS TLGGRGRW I VRS 
R 
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46ZI 


\o522 


A 


4651 


240 


76 


■ r t. VrlAil in JL a Xro P UAouoSBHHrlTjETil 
PSGS PEHTHT YTHTT FLASRSAEPQL P 


4622 


18523 


A 


4652 


273 


379 


KQLAAQCVRPLSPSQAIAS PVTCTYMPR 
WPEVTEE 


4623 


18524 


A 


4653 


369 


189 


GTVKQAGAQCRNLGSLQPTPPPGFKQFS 
CKSLYSIMICYSKDCTMSTTLGIRANNR 
IiHL 


4624 


18525 


A 


4654 


168 


346 


DRI CRASS YTIHLMIjYFVFLIFIjFYWYI 
YFLF FVLFHFT I F 1 1 LFYS PI I LLLLL Y 
FYN 


4625 

• 


18526 

* 


A 

• 


4655 

* 


56 


354 

• 


WKRTFPFFFFLETGSGFASRPCTLLLPL 
YSLGPNPRVLIRVFLFLSS3RCYYYYLF 
LQLSLliFLFFYIMYLLFIRLFFFVLSGL 
LLLLSSYYRFIIFFF 


4626 


18527 


A 


4656 


50 


186 


SAC LG LP KCWD YRHEALR P AMYLLTGS S 
YVAQAGI4NLLGSADPPE 


4627 

• 


18528 


A 


4657 


131 


366 


DHFVASSESVKEEGLFFVFVFFFFIFIF . 
FFILYXCFFYFKFIFIILLYIIIIFIFI 
FYFFYLYYLILI FFLI FFI FIG 


4628 


18529 


A 


4658 


290 


155 


LEYI VEKGFHQDDKAS PELLTSRYLSDS 
ASQSAGI TGHRAWPTKG 


4629 


18530 


A 


4659 


163 


372 


LRLFALFAFLFMFFFYFLFLLYVFCFVF 
FLFFFFTFFLYYFFLYFFLYFLCSLVFL 
FFICFIFFWFFVFF 


4630 


18531 


A 


4660 


227 


378 


FI FFSNCTS TAIiVHPItFYLSRLVLSKNR 
NKFI GRARWLTP VFPALWEAEA 


4631 


18532 


A 


4661 


3 

• 


141 


RSRHCTPAWQQSEILSQKERKKERKKKK 
ERKKEKERKKERKKKKKK 


4632 


18533 


A 


4662 


346 


3 


TFKFLGGVCPFFSPPKKRVFSKNSPGGF 
FLPPFLGENFFFAPPVNFGPPGGFFRGG 
PPFFFFFFFFFFFFFFFFFFFFFFFFSQ 
DSFDNLGI FVIPYKIQDDFSSSMSNATS 
ILIKIAL 


4033 


1 OCJ A 

18534 


A 


4663 


302 


3 


GDINRRSHS PFVWSKPFAPCFMSSIiDVF 
YKSSDFFYFFFETEFHSCCPGGISAYCN 
LCLPGSSNSPASASQVAGITGVRHHTQL 
IFASWETFKITYKN 


4634 


18535 


A 


4664 


2 


76 


RLAPTPQLGGHWPPTGITPLNPLKV 


4635 


18536 


A 


4665 


1 


210 


SPTRSPKPTPPYYQTTLAKPFTKKKKKK 
KKKKKKKKKKXKKRGGAPIiRKS PGGPKI 
NRGKKKKFFFLKGG 


4636 


18537 


A 


4666 


2 


356 


GSRPGGGKRTARRWTRPWRLRGACIiGIXj 
MAFAFHHQVI YKI KSFLRKKKKKKKKKK 














KKKKKKiOOCKKKKKKKKKKKKKKKKPRG 
GGFTjKKLWGGKFFMGGGKIFFFF IRGGK 
KKMWGG 


4637 

• 


18538 


A 


4667 


1 


323 

- 


RTRGIFCYDVCVESGCADIQLLLLCIiKK 
KKKKKKKKKKKKKKKGAPLKKNPGGAQN 

FGGGPNGAPPPQKNKTPKEKKKF 


4638 


18539 

■ 


A 


4668 


167 


446 


ELNKGLCKPKKKKKKGGGRSQFKPTRPK 
KKKKKKKRKKKKKKKKKKKKKKKKKKKK 
KS 


4639 


18540 


A 


4669 


321 


52 


GVFSPFS PKKFFFS PKGIiNFWGGFAP IF 
SPPKKRFFSKNPQGGFKYPPLRGKILIW 
GPRENWGPPGGFLKGAPLFFFFFFFFFF 
FFFFWG 


4640 


18541 


X 


4670 j 


82 


359 


NAYGI ILIKGNEDTKTVKWKSSCQLANT 
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Amino acid sequence (A«Alanine 
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jy— \jiuiamic /\ciu, r— jrnenyiaianine, 
G=Glyclne,H=Histidine, Msoleucine, 
K=Lysine, Lr s Leucine y M=Methionine t 

Q=G!utamine, R=Arginine, S^Serine, 

^Threonine, V=Valine, 
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LDCNOVII IHNNLESLGIiSNPPMLASQV 
ARTTGLCY 


4641 

■ 


18542 


A 


4671 


249 


60 


LLIFIYYFYFFLFFFLFFFFFFFFFFFF 
LFVQVHLPVHLPCYBLSPLYKCVGVLVK 
CPFKYT 


4642 


18543 


TV 

A 


4672 


2 


1tA 

319 


AAASTIjVrVFS FFWVIjGF F FFF FFFFWG 
PPKKILKKKGGGGGGGGGGPRGGPRGGG . 
KRNPGGQKKGIGGANGGQKKERNFQGGG 
PPRGEKRGGGGGPGNPKGGGLK 


4643 


t O B A A 

18544 

* 


A 


4673 


3 


451 


HESFNPVI ISQLERGEDPWVLDRFfGAKK 
SQGLWSDYSDNLKYBHTTACTQQDSLSC 
P WECETKGESQNTDLS PKPL I SEQTy I L 
GKTPLGRI DQENNETKQS FCLS PNS VDH 
REVQVLSQSMPLTPHQAVF SGERPYMCV 
ECGKCFGRS 


4644 


18545 


A 


4674 


446 


189 


SrsiiSTLVNVCQLISLSDPASGFVDHVC 
CIFTFYS INFALILI FLFYLG VLVWAAI 
INYHRLHGLNTKHLL I MI WMVGS PRLS C 
G 


4645 


i ob a z 

18546 


A 


4675 


26 


170 


HLi WFL V I VAH WAS S RQ IIS HW KNCTRH 
DCPVCLPLKNASDKRNQQSK 


4646 


18547 


A 


4676 

■ 


3 


378 

> 


HEDEFDKMGNQHQALLEAMEQQS ISLAK i 
AG WC I LPARTS 1 1 AVAHS FAGLYLI FN 

fill fftT 7X9 ULMJUflMUYU 1 T TJlTVHflTnTV 

TVS VVFFFFSNF IFLIIiSLFNx FCI S I Y 
ILFLI IFIFIFISSLIVFFCLISFRFLL 


4647 


18548 


A 


4677 


18 


242 


PIGLALSPRIiECSGVI IAHLLGSNNPPT 
SGSRVAGTTG IHHHAQPHCFNC I FPMS S 
EVECLFICLIiLIHILSRA 


4648 


18549 


A 


4678 


288 


2 


P KYRWVPE KPTTAS ENAE I P S ERQKEG I 
KbTIRISSRKKKPDSPPKVLEPETKQEK 
PEKEEEKTNVGRTLRRSPRISRPPAKVA 
QIRDPKAHKKK 


4649 


18550 


A 


4679 


340 


208 


SVGQAGLELLTSGDPPATASQSAGITGL 
SQCYPPDSVPETFLPL 


4650 


1 Of ft 

18551 


A 


4680 


4 


379 


EENTG I NLHD LG PGNG FLDMAP KAQ ATT 
AKKKKKKKKKI KI KS FLFLGG LFKKGKK 
TPPKGBKNFGNFKFWGGDLTPKI FGDPG 

PHPLESPSPFSVF 


• 




7A 
n 


/I £01 


1 


350 


KEHFEAFQS VAQCTYNQTVQLD I TAFLK 
TVKKNKHKFYPAFIHILARLMNAHPEFR 
MAMKDGELVIWDSVHPCYTVFHEQTETF 
SSLW 


4652 


18553 


A 


4682 


365 

• 


45 


SPKWFVASYDPYAGPGRPTRPDNSRDET 

CCUCVCT7C C O O Cn C t/DDnTIC 1 C?T"\T3f JV XTCT% 

bona Va Ci hdbboDb KicbSTobDKEANCiP 
SHRASPEPEPPPTNKWQLDNWLNKVN 


4653 


18554 


A 


4683 


238 


372 


SATPLISSFSFLFFFFFFFREGVWLCHP 
GWSAVSRDCTTAAVWPG 


4654 


18555 


A 


4684 


3 


462 


KTKESREAVEKEFEPLLNWMKDRALKDK 
IEKAWSQRLTESPCALVASQYGWSGNM 
ER I MKAQAYQTGKD I S TNYYAS Q KKTFE 
INPRHPLIRDMLRRIKEDEDDKTVLDLA 
VGLFETATLRSGYLLPDTKAYGDRIERM 
LRLSLNIDPDARG 


4655 


18556 


A 


4686 


410 


0 


FFFFFFFFFFFLLGKGGFGIIKFL 
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A mS t— S 

4656 


18557 


A 


4687 


151 


2 


IFFFFFDRVQWHKIjBSIjQPkPPRFyQFS 
CLSLPSSWEYRHAPPRVRPRV 


4657 


18558 


A 


4688 


342 


183 


FPLFLRWSFALVPQAGVQWCDLGSLQPL 
S PGFKRFSWAEVGGLHE PRSQRLQ 


4658 

• 


18559 


A 


4689 

■ 


126 


404 


NCQVWPLRFTGHSVFSVHPQWFILFYCF 
ALYFI FLEMGS YS VTQAGVQWHSIpGSIiQ 
PPQWSYVLTFLWHPSSWGYFVQNHLTPC 
FDGKVPWV 


4659 


18560 


A 


4690 


2 


128 


GESLFYI VS PRDVWAKERDQDDHIDWL 
LEKKKYEVPL ILLL 


4660 

- 


18561 

i 

i 


A 


4691 


407 


2 


YFFFPLPRMKIFFFPHQKGFFFGGCPII 
SPPPNEVFFS PKEGGFFFSPPKKRVFFH 
NPPEGFFPPPPFFMSPPPPFFFFFFSLF 
FFFFFFFFFFFFFFFFFFFFFFFFPSFR 
I ILIiNQKKNLSACVELRSCCTFM 


4661 


18562 


A 


4692 


1 


176 


SRRLNPGGRGC S E PRSHHCTSTWS TERD 
SVSKKKKKRFFALKFLGKANVKPTIGGQ 
KN 


4662 


1S563 


A 


4693 


1 


403 


SKAEIGRRLGLLRQTVSQVVNAKEKFLK 
EVKSATPMNTRMIRKRNSL IADMEKVLV 
VWIKDQTSRNI PLSQSLIQNKALTLFNS 
MKAERG VEAAEE KFEASRG WFMRFKERS 
HFHNIKAQGEAASADVEAAASY 


4663 


a fTh dm a* 9 a\ 

18564 


A 


4694 


mkm M 

343 


425 


HCFLS LQLNS r hiQIiCXN i TNE KliQQLF 


4664 


18565 


A . 


4695 


395. 


2 


KNKGLF FFF FLRWS FTLVTQASNVTTAV 

mirrrmTT ttrrkTY TtT n/*t^OikTfinimii Antrim A 

CNVT I IAHCNIiHLPGS SNS PAS AS R VAG 
ITGSCHQG 


4665 


18566 


A 


4696 


429 


0 


VTKWCYITKKKKKKKKKKKKKKK 
KKARG 


A a+* a< a** 

4666 


18567 


A 


4697 


37J 


217 


I P VFKQSSCIiS FPRSWDYRHMPPCVTQK 


4667 


1 8568 


A 


4698 


415 


59 


LQQQCECFISELTKKKKKKKKKKKKKKK 
KRGGGLE KKKKKGFFAP YPPH 


4668 


1 8569 


A 


4699 


431 


0 


LiKKX r ir Pvj^jK V JbbPbPPPPPPPlrl^ Fr Ptf 
P 


4669 


18570 


A 


4700 


371 


245 


TLGDPPASASQSAGITGVSHHAQPLFGI 
r 1 JtlUTl X 1UCK1KT 


4670 

• 


18571 


a 


4701 


410 


0 


KQCVJFTQKKKXKXKKKKXKKXKKKKXKK 
DRG 


4671 


18572 


A 


4702 


340 


402 


PFAGGGYRLGAAPEEESAYVA 


4672 


18573 


A . 


4703 


188 


2 


TSLPKCWEYRHHPPCPAHTLLI FLFFI F 
VDMGS CHVAQVALELDGS S DP PTSASQS 
AGIIGM 


4673 


1 8574 


A 


4704 


423 


34 


IT TFf.PT/yfT/Tf fyt/Tf Tf WT/T/ffT/T/T/TfT/Wt/T/AT/tXT 

LKKIOGCKKKKKKKKKKKKKKKKKKQKDL 
KKMVGGGNYKNVGGAQILGYKKLCFSFF 
YRVF 


4674 

• 

• 


18575 


A 


4705 


2 


.297 


PRVREREKEMASMKEEFTRLKEALEKSE 
ARRKELEEKMVSLLQEKNDLQLQVQAEQ 

ERLEDEEEMNRSLP 


4675 


18576 


A 


4706 


2 


179 


QENGMNPGGRACCEPRLRYCTSSWATER 
DSVSKKNEKKRSASLRRFDILFGALTGR 
KAS 


4676 


18577 


A 


4707 


432 


578 


PRIVCFQELRKPSYAEICQRTSKEPPSS 
PLQPSSSSSSSS 


4677 


18578 


A 


4708 


419 


35 


GSFFFFFLKDQPKKGRKKEGCPPPPPKK 
IIYPPPATKWGGGGGKKPPPPKKRGPPK 
KNRGRKKKPPKKKKKKFPPPPKKRAPQK 
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G=Glycine, H=Histidine, l=lsoleucine, 
KMLvsine. L^Leucinf* !V1=iVfpthtnninp 

N=Asparagine, P^Proline, 

0=f»lu tannine. Rc~A roinine SsSerinp 

^£ WIUMIIIIIIIlfj »> 1 g till II tj - Utl lllvj 
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W=Tryptophan, Y^Tyrosine, 
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■ 






KKKKPPPPIFFYLYFFKNFI FFFFFFFY 
FFPF FFFKFI FNIiKKF 


40 /o 


loD/7 






1A7 




NFFFFFF FFFXFXF FFFFXXXKFFIiF FF 
♦FFPLKLKIFF 


4679 

• 


18580 


A 


4710 


3 


229 


HASAHASDQPERIiNAGTYFLPYTLVGSL 
PLLIALIYTHNTLGSLNILLLTLTAQEL 
S KKKKKGGAVLKNP WGAQS 


A /OA 

4680 


18581 


A 


Alt 1 

471 1 


1 


1 CO 

158 


DTDDDTPDDCOCT?T.DCtTWr , TD&IJVT'nDT. 

CLKKKKKKFPNLSVGFKILGILKG 


4681 

* 


18582 


A 


4712 


163 


404 


KKKKKKICKKKKKKKKKKKKKKKKKKKKP 
KKGGGAQNRAGGKRRPSRGGGEKQN 


4682 


18583 


A 


4713 


3 


378 


FEEPVYIKSRQKRKESNPPKLVSSQPHG 














KKPGGAKNKPGG E KKNFFLKGGG KKNPW 
GPPKKKTFFGGGKIGAKPPKKKKSLKKK 
KKFLRGKGGKKT 


4683 


18584 


A 


4714 


2 


208 


VSNPAVSVPHLPFSVYKSSPMASMTFSK 
KKKPWGAQNKPG 


4684 


18585 


A 


4715 


290 


56 


oAfPPJLr rJ_tFlrr.br J_jr _Lr r xYt JrifrKQl 
LEKKGGP PPF FLFFFFI FFPFPFYRMGC 

UK W Vie XJ-iXUoJ. rxtKIS,yi\Jr vl_i\.V 


4685 


18586 


A 


4716 


50 


424 


GGFKIKFLPTQFLFLLFSPSMFLISKSP 
AYLWQSS PRQVPMWFGKTNFLVS IEGK 

LAPLRRSIiALLPRLECSGAILTHCNPeii 
PGSSNSHAWA 


4686 


18587 


A 


4717 


161 


2 


GRPGPADFR VRPQLLQRFL P I YLFTEME 






SCSVTQAGVQWCNLGSLQPIiPPGLQ 


4687 


18588 


A 


4718 


2 


115 


VYTTAMAI EEYPEAWGSGVEVLVS VLVG 
UAna V t»o i Kir 


4688 


18589 


A 


4719 


1 


416 


GNQGGGYGGGYDNYGGGNYGSGNPSSSS 

CCGCGCGCQGGCCQCCCCGQQCCCCQQG 

SSSSSSSS33SSSSS3SSS 


4689 


18590 


A 


4720 


1 

- 


454 

• 


QPDSGISSIRSPKTIXiPAAAQDVYYRDE 

TOMX7QTQWT .T .T T.nn O^rPMT? T D DP PDT.T?r! 

GWKTHYIVGYNLPSYEYLYNLGDQYALK 
MRFVDHVFDEQ VI DS LTVKI ILPEGAKN 
TEinS P YE TSPJVPDELHYTVTJYTFGRPV 
iVAYKKNLVEQ 










j 


A /J 


SAGITDVSHRTQLDVPFSFSLS PLPHQC 






» 


/f 777 




7rt1 
ZU1 


Q KKKKGKWGLER I GWGGSTTNNGAKNQR 
GEKFGKKGGF 






a 
f» 




z 




LKPGGGGCS E P RS CHCSTPAWROS ETPS 
Q KKKKGKWGLE 1 1 CWGGS PTNKRATNS P 
VEKF 


4693 


18594 


A 


4724 


1 


395 


QDATS P I IEELI TFHDHALI 1 1 FLICFL 
DLHALSLTXtPSALT 


4694 


18595 


A 


4725 


388 


3 


SFSPLEENLGNTIQDIGRGKDFTSKTPK 
AMATKAKID 


4695 


18596 


A 


4726 


223 


44 


WLFFFXPLFFFFFWGGIIFFLLKKLYIF 
FFFMSKI FFFFFFFFFFFFFFFFFFFFF 
FFFF 


4696 


18597 


A 


4727 


41 


374 


KKKKKGKK3CKKGGGGGGNPKKKKGKMEK 
P PGGNKRGEGGKKKNFLKKKGGGLFKGK 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 


SEQH) 
NO: of 
Deotide 
sequence 


M, 
eth 
od 

, .. — 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
E=GIutamic Add, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=lioIeucine, 
K=Lysine,L=Leucine, M=Methjonine, 
N-Asparagine, P^ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophao, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














ICPPKKKNGGRICPWKKKNFKKKKKKAPFF 
LGPKIFKKKKKRGLKKKLGFFSQSVGP 


4697 


18598 


A 


4728 


323 

mw mmwrnS 


69 


FFFLRQSLGSASQAGAQGHNLGSLQPLP 
AGFQLFLQPPPPGSKQFGRLNLLGNKDL 
WGLPVLRAHPQAPPLAFIL 


4698 


18599 

• 


A 


4729 


3 


197 


CLALLHLSRQRQYALPLLGIiPGCPHIHT 
HSHTHAHDHTHYTSPHAHRHAHTHTHMH 
TLPYTTHCI 


4699 


18600 


A 


4730 


309 


443 


VLTLSPRLECSSMNIAHYSLKIiLGSSDH 
PTSASQSGGIIGMSYHA 


4700 


18601 


A 


4731 


3 


172 


DAWAFFNGPGVSLCCTGWSRTPQLKRST 
CLSLPKCWDYRPB PLS PAPKTLFSYVLL 


4701 


18602 

« 


A 


4732 


98 


232 


QTVFLDI WLLALCSPGVLQTVKRFIiEFR 
CIFSLGIVTFICLYRN 


4702 


18603 


A 


4733 


238 


1 


LGPPPPSPLGPPPFLVIFRGFSFPKTPL 
FFSPFFFGPPPFFFFFFFFFFIRRSFAL 
VTQAGVQWRDLGSLQPLPPGFKR 




100U4 




I/O** 


101 




LI oghrwi yfsolrkvrgrqacdstll 
ghlpkwrrmfpflsftvagleptshyr 
mfvdwlvdohhwryotgkwvocgkaeg 
tmpgar 


A If) A 




A 




J J 7 


ft 

o 


NP POKKKKKIFFPPKKKTPPKKI KKNPP 

pftiffffffffffffffffffffffff 
ffyfqhvwfcfei ihvifflievlfiaf 

HVGDS IADAWSTKNSNRTK 


4705 


18606 


A 


4736 


297 


442 


tqgffgggprkigfffrglknggggnpk 
lrflkpgqlvgrgktkknqg 


4706 


18607 


A 


4737 


430 


0 


CVIjGSKDKKKKKKKKKKKKKKKKKKKKK 
KAKKKDS 


4707 


18608 


A 


4738 


137 


1 


INYYFFLKQGFPLLPRIiECSGAI iahcs 
LEFPGSSDPRPSASQVA 


4708 


18609 


A 


4739 


188 


3 


ISPKALNPRREVGPIFPPPKKKVPPQNP 
PGGFFTPPHKEKNFSLPPPVNLGPPKDP 

m ^mr^mTmm A» mm m m m^ Jmm mm mm mm **VMW4 • mtrnmrn* mm 

LKRPP 


4709 

■ 


18610 


A 


4740 


188 

■ 


2 


TSLPKCWDYRHEPPCPAHTLLIFLFFIF 
VDMGS CHVAOVALELLGSSDPPTSASOS 

▼ mwm mmmmwm^my ^mmm*m • m\ 4^ T * *U4iM WW WW 4 *» «4> mmwm mm^m^m^mm* 

AGIIGM 


4710 


18611 

• 


A 

■ 


4741 

■ 


410 

■ 


1 


RRPGGGFCPPPPPKIFFSPPPPPFFVGV 
LPQTPPPPKKFFFLNPPPGFFFPPPKKK 
KIFFFSPPFFSPPPIFFFPPPPPFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
GiCDRVSLCCSDWSRTPGPKPSSLIi 


4711 

• 


18612 


A 


4742 


225 . 


444 


ENLKLKSFPCSHFHWLRWFNLTIGALSR 
S FDEAALLFFQNDTYWPGTTVAHA.C3TPST 
LGGRGGR I TKSRDRNHS 


4712 


18613 


A 


4743 


2 


452 


VSNPAVSVPHLPFSVYKSSPMASMTFSK 














KKKNR 


4713 


18614 


A 


4744 


3 


144 


LAAVLLKLGGYG 1 1 RLTLILNPLTKHIA 
YP I LLLS LNPDI ITGFSS 


4714 


18615 


A 


4745 


206 


2 


IQNPQTERVNSTS KLKI SSKDTINRAKK 
QPMDWEVI CKKHI FGKSLISRMYKELQL 
NNKKQIVPPTRP 


4715 


18616 


A 


4746 


523 


178 

r 

* 


RHRRFFMDIEREQVKEQQRQKEQKKDIE 
KI KDKREQECYAAEQRI LRMNFHEDPYS 
GEKLSBIXiAQLQLQEIKGTREKQQREKE 
YLRYVEALRAQIQEKMQLYNITLPPLC 


4716 


18617 


A 


4747 


3 


518 


ENRLNLEGGGCGEPRSRHCTPTWAT 
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SEQID 
NO: of 

sequence 

- - •» 


SEQ 10 
NO: of 
pepifuc 
sequence 


M 

etb 

oo 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

nucicvijuc 

location 
correspond 
ing to first 
amino acid 

fAfiiHllA ftf 
I C51UUC Ul 

peptide 
sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

1*1 c4* am inn 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A<=Alanine 
C^Cysteine, D=Aspartic Acid, 

1T^?||itnmil> A IT — X>V^ nmllnlonina 

iji^vviu ui iiiic /\ciu, r — r iiefiyifliHnific, 

G=Glycine, H=Histtdine, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R»Arginine, S=Serine, 

TsThrjvmini* VaValin* 
i i hi cum iic, r^Toiinc, 

W=Tryptophan, Y^yrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


4/1 / 


1 QAl ft 


a 


A1AQ 

4/4o 


AQt 


145 


GVSHRTRPHSHFYFYFLYSSSFFFFFFL 
AQRGNRAPRGKKTPFGLATLNI KSNGGV 
FWFLFLGFGFLKREPTFALGIikDRGGNL 
V 


An i o 

4718 ■ 


18619 


A 


4749 


a 
O 


362 


WrUjt J r45j i Jar IjooJLi V FJN I r rt o r Jjfcrr Per 
FFFFFWGKGGPPRGGGAKFYPLEFPPPG 
VPPPPKGGKKGGPPPPTNFLFFKKEGGS . 
PFGPGGVKTPGPKGFPPPSPQRGGKKGQ 
WAQKPP 


4719 


18620 

i 


A 


4750 

t 


1 O f\ 

380 


3 


S LLLAFALLCL P mjQEAGAVQTVPLSRL 
FDHAMLQAHRAHHLAI DTYQEFEETYI P 
KDQKYSFVHDSQTSFCFSDSIPTPSNME 
BPQHKSNLELLRISLLLIBSWLEPVRFL 
RSMFANNLVYDPS 


4720 


18621 


A 


4751 


1 


345 


LFYIFKKTWSLFLCFLWMMCVFIFFFFL 
LFVFFIiCLGGILVSFI PPGGBKREKKKR 
RGGEKKKKKKNWFKKKKKNPWGGGGGL 
KKQRARGGKKKKKKGGGGSGGGKKKQSP 
LRG 


4721 


18622 


A 


4752 


1 


407 

■ 


QEFEETYI PKDQKYSFLHDSQTSFCFSD 
SIPTPSNMEETQQKSNLELLRISLLLIE 
S WLEPVRFLRSMFANNLVYDTSDSDDYH 
XjijIUJIiArAsXuJI JuMGVKVAPG V 1NPG\L PI* 
ASRAGGEKYCCPLFSNKARDQEN 


4722 

r 


18623 


A 


4753 


357 


1 


PLFFFKRNLFFFFFFTKKKBKKAPVFFF 
FPPPSREWGGFFFRVGGVNTQFLHPFFF 

FFFFFFFFFFFFFFFFFFLFYCFIYFSR 
DRGLTLL 


4/Z3 




A 


AHZA 

4754 


1 


207 


JjtjJ_iAva V iLJit. vAiKljJNAU(jKtjL.i»Js.FKi>JrlD 

WTPAWAMEQDSMS KKTKNEYQRRQLIHL 
QSSTSGFLYVHEQ 


4724 


1 8625 


A 


4755 


2 


336 


HEERERERERERERERERERERERERES 

RAQFRARKNSQAPHPRVCETLWTEEGGS 

T?VDT.Ti7impnnr t Po aiinvtrr^nrciJDTy^M 
r x fJu l JtSJxUr AULJSKMrt V JLAj V oiUrJtAaW 

GL CKKMNP E CERATHS LHGDAD I SG VL 


4/ZP 




n 


4/DO 




AAtL 


GAISAHCNLHLPGSSD 


4726 


18627 


A 

• 


4757 


3 

• 


390 

- 


QTSFCFSDSGPTPFNMEETQQKSNLELL 

TSDSDDYHLLKDLEEGIQTLMGRLEDGS 
RRTGQILKQTYSKFDTNSHNHDALLKNY 
GLLYCFRKDMDKVETFL 


4727 


18628 


A 


4758 


234 


387 


LAKI FLS FSSATGWIENRPQSPATGRTP 
VFVS P 1 P PPLPPP FP FAFVT 


4728 


18629 


A 


4759 


396 


0 


I YTHTHTHTHTHTY I YMYVFMI I 


4729 


18630 


A 


4760 


176 


384 


MVI PLFNKDTQHI ILFWGYVLIFFFLKR 
NFVFVAQAGGQGRYFGSLKPPPPRLKPF 
SCLSLLGTWDYRR 


4730 


18631 


A 


4761 


419 


208 


IYPPPPPWEPKGGSFLKKKKKKIOCPGPG 
VPPFIPPPLGGQAGGFPNSKIQIPPAPQ 
GKPLFFQKNQNYLA 


4731 


18632 


A 


4762 


158 


2 


GNQIiNPPPPSGSAFCFLFFLRRSIALVT 
QARVQV7HDLGSLQPSPSGFKRFSC 


4732 


18633 


A 


4763 


489 


398 


AGFELLTS S DP PAS AS QSVG I TGMSHCT 
QP 
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SEQID 
NO: of 

DUCIcUlIUC 

sequence 


SEQ ID 
NO: of 
peptide 
sequence 


m; 

eth 

Ou 

• 


SEQID 
NO: in 

09/515,1 
26 

• 


Predicted 
beginning 

mm it a1 jVA^ifl A 

nucieonue 
location 
correspond 
ing to first 
amino acid 

■■AaSjIiiA at 

restaue oi 

peptide 

sequence 


Predict- 
ed end 
nucie- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino add sequence (A=Alanlne 
OCysteine, D=Aspartic Add, 
Hi^ijiuiamic acio, r = rneny (alanine, 
G=Giycine, H=Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N«Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
j— inreontne, v— valine, 
W=Tryptophan, Y=Tyrqsiue, 
A=un Known, w =Mop cod on, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


■ 




A 


4/04 


42U 


1 

- ■ 


• rr ff rijuriXf PF(^J\(jOV.r PPPFIiKlUfr 
FSPRGLFFLGGGGPNFPPPKKRVFFKXP 
rtttvjr r trfFisv jjjucuirbiruc ui PPUmsKN 
FFFPPPGVLGPPKGFFKGGPPFFFFFFF 


4734 


18635 


A 


4765 


167 


3 


IYFFFFLKKIFFYKKKINKLKKKKKKKK 


4735 


18636 


A 


4766 


424 


222 


FQNIHKKNFPSPPLLKTGPAPLFKKPPK 
KKKKYFFPPPEKLGPPREFLKGPPPFFF 


4736 


18637 


A 


4767 


254 


423 


VIKSWARRNKNKWDWPGAVALSCNPSTL 
UuPv^AfYPLiKb^VyUnPtjynGE 


4737 


1863? 


A 


4768 


363 


0 


KTTTS QARGPTS VI PAAQETEAGDS SNP 

orvrcvc o c o o o o r» 
faGiGblSSSoSSSP 


4738 


18639 


A - 


4769 


51 


169 


^YTi4TkPRTflWKi353a30mQaQOT^" 


4739 


18640 


A 


4770 


188. 


3 


ISPKALNPRREVGPIFPPPKKKVPPQNP 
PGGFFTPPQKEKNFTLPPPVNLGPPRDP 

LKRPP 


4740 


18641 


A 


4771 


1 


131 


KLDKENALDRAEQAEADKKAAAk^kQLE 
DBLGSLQKKKKGGPP 


4741 


1 O 4f A^ 

18642 


A 


4772 


363 


A f\ 

452 


RLE CSGMI IXHCSLNLPGS S DS PTSAS Q 
LA 


4742 


18643 


A 


4773 


121 


1 


RPRRPVAQAGVQWHNLSSLQPLPPGFKQ 
FFHLSLLSSWDY 


4743 


18644 


. A 


A * 1 A 

4714 


31 


A** 

226 


MIS SQLS 1 1 PTPTQEQGLKKKKKKKKKK 
KKKKKKKKKKKI^KKKKKGGGLLKKILG 
GAKFLGGEKKNFFFP 


A1 A A 

4744 


18645 


A 


4775 


161 


■i 

1 


KPPPPFFFFFFFFFFFFFPFFFFFFFFF 
rr r r r Frcr rF Fr r FFLr r FFLKKSWRP 
WGLKSAPAL 


4745 


1bo4o 




4776 


394 


67 


oPPKOfcWjVAf'PPPPKNr i: r PRAVFFIjG 
GGGGPMPPPQKRGFPPKTPGGVPKPPPK 
GGKFFFPPPGEGGPPPEFFKGPPPPFFF 

ffffffpfffffffffcffffppff I 


4746 


18647 


A 


4777 


2 


409 


GVC I FSGHKHNKAPLPFSVLYS I Q Y INS 
KKT 


4747 


18648 


A 


4778 


456 


0 


PFYKKKKKKKKKKKKKKKKKKKKSSS 


4748 


18649 


A 


4779 


438 


0 


LLKIDKKKKKKKKKKKKKKKKKKRTSKR 
GG 


4749 


18650 


A 


4780 


58 


330 


sp fyqlkkkkkkkkkkkkkkkkkkkkkk 

KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKPQQKGGEKKKNPGEL 
GKKNCGGGGVLVGKPHKEKKRGGGKKKKT 


4750 


18651 


A 


4781 


435 


73 


FFPPPPRGKFFFKKPPGGKLFSPPELGV 
FFPPFPLKIFFFPSTPFFFWGGFPPFSP 

PPFFLPPPGFPFSPPPPFFFPFFPPPPP 
FFFFFFFSP 


4751 


18652 


A 


4782 


420 


109 


LFKKFFSPTEREGVFPPQPPKNFFFSPN 
PLFFGGGGAQNFPPQKKNFFPKNPPGFF 
FSPPKKKKKFFSPPGKILAPPGYFFKGP 
p PFFFFFFFPFFFFFFFFFF 


4752 


18653 


A 


4783 


230 


2 

■ 


LGLWGALYSPFLGGLGGRKFLLPWRPR 
VPLTPISPPPSRLGGPPKKKKSQAWWHA 
PWPTTREAEAGGSLEPRRX 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

tng to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

* * * 

location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E>=Glutaraic Acid, ^Phenylalanine, 
(^Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Metbiomne, 
N=As para gin e, P= Proline, 
Q=Glutamine, R^Arginine, S=Serine f 
T=Tbreonlne, V=Valine, 
W=Tryptopban, ys^Tyrosine, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


4753 


18654 


A 


4784 


271 


440 


VLYSKRDNNKS FKNNTYTGIGPGAVAHA 
CGPSTLGGRGGQIMRSGVRDKPYRHGET 


4754 

« 


18655 


A 


4785 


2 . 


266 


KKYGI 1 1 FCSFFLSLFLKDQGLAPLPkti 
ECSWFAAHRGLGLLGWSDPPKSAFLVA 
G I TGMHHHAWL I FKVQFSNLMGPWS YTW 
SAVD 


4755 


18656 


A 


4786 


216 


435 


HRSGDSSLKKKKKKKKKKKKKKKKKKKK 
KKKKKINEFWKGPFLWGGKRRKSGGG 


4756 

i 


18657 


A 


4787 


249 


437 


MNNEWPLKTETSRPDWAHACNPSTLG 
GRGGQ I TRLGVQDQ PCQHGEAPSLPENR 
KYLAWSV 


4757 


18658 


A 


4788 


107 


368 


AHGSLDFLPGGPKKSHQWFPKPKDGKPP 
PFSKKPLKKKTPGKKGGGHPTPGARSKK 
GGYPRGGDPGGPFFFKKPGQKKKTPFLL 
RRD 


4758 


18659 

* 


A 


4789 


1 

• 


225 


PTRPF ITNNIFPAS PFQTTI PLYLKLTA 
LAVTFLGLLTTIiDLNYLTNKIiKI KSPLC 
AKKKKKKKI KKKKKKKNEG ' 


4759 


18660 


A 


4790 


2 


216 


SGRNS RVDGRQE AACRDYQS SLEDLTFN 
SKPHINMLTILAEENLPFAKEIVSLIEA 
QTAKVF IHPAAS YY$ 


4760 


18661 


A 


4791 


389 

- 


30 

. 


EKKSPPQKKKKKFFPPPPQKKFFSPPNQ 
KTCGGGGPKKPPPKKKGSPQKNPK3GVKK 
PPPKKKKKIFPPPQKKGPPPKILKKPPP 
PFFFFFFFFFFFFFPFFFFYFSLFFSGG 
FSYKIFL 


4761 


18662 


A 


4792 


164 


3 


LFPGFFNPGVKGEKGISFPPFFFFETBS 
HSVIQARGQWRRLGSLHPLPPGFKR 


4762 


18663 


A 


4793 


550 


413 


AEMG FHHVGQAGLELIiTSSDP PALASQ S 
AGITGISHRAQPHRPIC 


4763 


18664 


A 


4794 


456 


352 


RSQDGLYLLTS S FACLGLP KCWDYRRB P 
PCLAVFF 


4764 


18665 


A 


4795 


242 


3 


KPFSGGGGAFQTAFFITFGKKFWSRLG 
LLKKKVFLKFLPQAFFFFFFETESHSVA 
QAGVQWRSLGSMQPSPPGFKRLS 


4765 


18666 


A 


4796 


218 


346 


GFSCFVFETVSLCRPGWSAVVRSRLTAT 
SASRVQAIILPQPPE 


4766 


18667 


A 


4797 


383 


80 


KFFFNQG IGGFFPPFPPKNFFFSLKPFF 
FLGGFS PFFPPP KKI FFSKI P PG FFFS P 

«k m> mm mmm m m> mmmmmm m mm m* m* m- mi h& «m m»m^ m» »m mi w& mm. mm 

PLKKKI FFFPPRLILAPPRFFFKAPPPF 
FFFFFFFFFFFFFFRPI 


4767 


18668 


A 


4798 


1 


126 


ILIKLSSTWEGIQAGKELEBQHGIHCNM 
TLLFS FAQAVACAE 


4768 


18669 


A 


4799 


81 

• 


356 


ILFFFFFFPGGGTPKKSQVFFKNPQNPI . 
GKPTRMGKGEKPPPPRGKKKKKGCrPPTL 
GKNKNPVPGWPGAFLPPLEEKKKPFPGP 
PPLKKKK 


4769 


18670 


A 


4800 


1 


99 


GCSEPRSRHCTLSWATRAKLHLIKKKKK 
KKKutA 


4770 

■ 


18671 


A 


4801 


238 


5 


NHQNS VQNYFKNLRI FQLIYSLSMNLPR 
SSFKLKSMTHSISFSFLFSFFEIQSHSV 
AQDRI KWHDLGSLQPQPPGLK 


4771 


18672 


A 


4802 


166 


62 


I FLFS FFLFLRDRVLLCYVGWS VMVQST 
QTPGLK 


4772 


18673 


A 


4803 


102 


6 


AASTTES PS VAHAGVQWCDLGS LKPGSR 
RFQ 


4773 


18674 


A 


4804 


17 


374 


FGSRAATLFFFFFFLGRGGFFLGPRGPK 
KKTPGGEGFTRFFDPGWGR3PEKKTPPP 
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SEQID 
NO: of 
nucleotide ( 
sequence 

i 


SEQID 
NO: of 
peptide 
sequence 


M. 
eth 
od 

- 


SEQID 
NO: in 
,USSN 
09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ins tt% fir** 1 
lllg IV III SI 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyialanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 

VsAtiiornffin* PaPrn lino 

Q=Giutamine, R=Arginine, S=Serine, 
T=Tnreonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X= Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 










- 




WGGKGGPKKMfJLWGAGKI KFLE I KGKS F 
PKGTL 1 PNrrGPPIaFrSPPKIl^PGNFJjD 
FSFKLKN 


• All A 

4774 


18675 


A 


4805 


59 


175 


or xr r JuMMKiOuU*^^ 
LKKKTGGGPN 


4775 


18676 


A 


4806 


406 


300 


SSGVLYSRAPTYFCGQrLTFRQVDIKTS 
HLPQEAE 


4776 


18677 


A 


4807 


123 


1 . 


ICS IAWLGTMAHS CUPS ITjGGQDGQTTRS 
GVRDQPGKHGETP 




18678 


A ■ 


4808 


319 


406 


LYKSYSSWPNWAHTCNPNTLGGQGGQI 
I 


4778 


18679 


A 


4809 


236 


351 


GFWPAPGVHACNPS I LGGQGGRVTGP KL 
GNLLGQHGSI 


4779 


18680 


A 


4810 


173 


2 


EVLFLGFRSXLISLLQMNVtiLVLItAKFA 
FVAQAGVQ WHNLS SLLP PPPRFKRFSCL 
R 


4780 


18681 


A 


4811 


2 


94 


TRILKVWQKRKCSVKNGFLTISHGTVSI, 
LF 


4781 


18682 


A 


4812 


358 


2 


FFFFFSETESCSVAQAGVQWRDLGSL 


4782 


18683 


A 


4813 


362 


3 


YSEVCI I SLQQAHDSLSVINFHLAECVI 
QLPVNLREIiPGAVAHSPLGSELCFSDSG 
SVPSHTRS FPKAAIRLTANHRRGCTESG 
WPFSLVFLFERCMWPGTVAHTCNPSTLG 
AQGGQIT 


4783 


18684 


A 


4814 


341 

■ 


I 


WKnQKRAHITKSILSQKNKAGGINLPDFK 
LYYKATVTKTAWYWYQNRD I DQWNRTE P 
SEIMPHI YNYL I FDKPDKNKKWG KDSL F 
NKWCWENWLATCRKLELDPFLTPYTKIN 
S 


4784 


18685 


A 


4815 


2 


283 


VYTGEE I LQKKE IT I RDQEAP YLLRNLS 
DHTVAISSSTTLDCHANGVPEPQITWFK 
NNHKIQQEPELYTSTSPSSSSSSPLSSS 

sssssssss 


4785 


18686 


A 


4816 


266 


3 

* ■ 


AASTLQAHRAHQLAIDTYQEFEETYIPK 
tfQKYSFLHDSQSLRQPIiHTHTGSVGTRT 1 
LAASLS SGRTAHTRTHMHTQHTRKRTAH 
TNA 


4786 


18687 


A 


4817 


365 


1 


SFFFFFSETESRS IGRLEFSGTISAHCN 
LRLPDSSNS PASA 


4787 


18688 


A 


4818 


87 

■ 


372 


AS YKKDKYCMIPLIRVI KFPQTERKKWL 
SGLGGKGNKELLFNGCNMAVFLVLGGGF 
ETEPPSVTQAGEQWCDLSSLKPLPHRLE 
RVSCLSLP SGW 


4788 


18689 


A 


4819 


5 


175 


FAFLT I PLSRLFDNAMLRARRLYQLAYD 
TYQEFEEAYI LKEQQDS FLQNPQTSLCF 
S 


4789 


18690 


A 


4820 


235 


351 


FFXFFFXFFXFFFFFFFFFXFFFXFXFF 
XFFXFFFFFXF 


47VU 


loo91 

• 


7V 
n 


A <>*> l 

4821 


2 


155 


X f ioJuLt Iho WliEF VRr JjKSMt ANWIiVzlTT 
S DSDDYHLLKDLEEGI QTLMGRL 


4791 


18692 


A 


4822 


90 


2 


LFFFFFE TES PS VTQAG VQWHDLGSLQP 
P 


4792 


18693 


A 


4823 


247 


3 


KVKXSSPSHRRAMVAKGNIjSIjHERMKSPG 
I SNYMGNSKI LFSYFKNCWPGTVAQAYN 
PSTLGRGGWITRSGVRDQPGQYSET 


4793 


18694 


A 


4824 


93 


407 


QHPKKKKKWGGRGPLKNKEKVFKHKTFN 
FFSRQKKKKKKKKKKKKKKKKKKKKGGG 
GLLKKFWGGPKFPGGKKKFFFFFGGGKK 
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M 
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od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
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nucleotide 

location 
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* A 1* * A 
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residue of 
peptide 
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Predict- 
ed end 
nude- 
otide 
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correspon 
ding to 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, • 
K= Lysine, l>Leucine, M=Methionine, 

»T 1 * W TV 1 • 

N^Asparagine, P^Prohne, 
Q=Gtutamine, R=Arginlne, S=Serine, 
T=Threonlne, V=Valine, 
W=Tryptophan, y^Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=*possible 
nucleotide insertion 














NPPGVFLKKNFFWGGEKKKKK 


4794 


18695 


A 


4826 


2 


129 


TTAHC S LNL PGS S D P PTS RS LVAGTTHV 
CHNAWVNFVFCRRG 


4795 


18696 


A 


4827 


342 


140 


GGFFFFPPPPFFFKFPLFLPPPPPPGPS 
CVGGVQKFTFPPPPQLSRSLFFLFFFAP 
PPETDNPFFFS 


4796 


18697 

- 


A 


4828 


1 


259 


GGGCSEPRSCHCTPAWVTEYDKKKKKKG 
RGKG KKKI GKGG FQRKFKGPNPHGRFLR 
ESHNS PKKGFPKKFFGNPPPGTS KKVPL 
PH 


4797 


18698 


A 


4830 

! 


125 


I 

• 


KGTRTFETESRSVAQAGVQWHDIiGSLQP 
LPPGFRRLTCLSL 


4798 

• 


U699 

i 


A 


4831 


1 

• • 


323 


ARGEREREREREREREREREREREPPPL 
SRGGSESYLKSAWRERLLLERDPLFFFG 
GKMGNLFPPPPPPVGRGYTGGGLITGGP 
FWWGE CVS HTHHTHTHTHT xMS V 


4799 


18700 


A 


4832 


3 


297 


TRRERERERERERERERTL 


4800 


18701 


A 


4833 


1 


150 


AREREREREREREREREREREREKKKKN 
HSRGGGPGPQFPIiGGGVKKFGGGV 


4801 


18702 


A 


4834 


1 


98 


ARGERERERBRERERERERERERERERE 
RERGAGGGPT 


4802 


18703 


A 


4835 


2 


66 


HEERERERERERERERERERAAR 


4803 


18704 


A 


4836 


2 


73 . 


PEERERERERE RERE RERERETAR 


4804 


18705 


A 


4837 


146 


2 


FGLVYS I S QFCL FF VRRSLALVTQVGVQ 
WRNLGSLQALPPGFTAFSCL 


4805 


18706 


A 


4838 


354 


0 


PPPPPPSPPPPPPPPPPSSPPPPSPPSP 
SSSPP 


4806 


18707 


A 


4839 


62 

F 


358 


GLVHDKHS PHGGGG SHHVLP S LNPSF PT 
FSTKSPSVPPARALWPPDSRWALQVGHL 
GLTPTTRQESAQAPRRKLALLEGSFQGQ 
AQWLTPVT PALWEAE 


4807 


18708 


A 


4840 


121 


13 


RPGRPPPEFTPFSCIiSLPGSWDCRRPPP 
RPANLLYF 


4808 


18709 


A 


4841 


3 


221 


L VMG FVG FSS KP S P I YGGLAL I VSGWG 
CVIILNFGGGYMGLIVFLIYLGGMMVGF 
GCILDFS LNFNVGLNF 


4809 


18710 


A 


4842 


3 


84 


CHCT PAWTANLDS I KKKKKKRKKFKN 


4810 


18711 


A 


4843 


33 


230 


LTVT I S KAAEVLKHKKKKKKKKKKKKKP 
KKKTPKKNPNPPPQKGELGKKIFFFFWE 
RFLPQKKRP 


4811 


18712 

- 


A 

■ 


4844 


3 


368 

■ 


HELN P RGRS CGE P R P CHC T P AW ATRAKV 
HITKKKKKKKGFNLFKPGGSLSNFFPKK 
KETPWSPQNFEKKGPRRFSPRGKNPPLA 
LRGKKKGGKSLSGKIFPPFPRGKTPPWA 
FMGTQKGEYS 


4812 


18713 


A 


4845 


163 


2 


HUGHS I S KFLLT VS S PKS FKQLRPDA VA 
HACNASTLGGRGGWITRSGVQDQLV 


4813 


18714 


A | 


4846 


387 


234 


KGGGEPRS CHCPPAWATIAKLLFKKKKT 
KPKKNPKKQEKTFPYNLNRVCY 


4814 


18715 


A 


4847 


189 


349 


KFSRFSGQKALLLKDILRLGTAAHTCNP 
S TLGGQGRW IMR CQ VF KTS LAKMVK 


4815 


18716 


A 


4848 


3 


95 


HENFYCPGSALGRNEGNIFPNPEATFVK 
EM 


4816 


18717 

■ 


A 

• 


4849 


2 


325 


DSHTI IMGDFNTPIiAILDRSTRQKVNED 
IQDLNSALHQADLIDIYKILHPKSTEYT 
FFSAPHRTYSKI DHKI YSQGNHETDCRY 
GKKGGGWKDERFPDVDCEKFKSQ 
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26 
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peptide 
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Predict- 
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nucle- 
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ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LpLeuci ne, M=MetbJooine, 
N=Asparagine, P*=Proline, 
Q=Glutamine, R=Arginine, S=Ser!ne, 
^Threonine, V^Valine, 
W=Tryptophao, Y^Tyrosine, 
X-Un known, *=Stop codon, ^possible 
nucleotide deletion, v=possibte 
nucleotide insertion 


4817 


18718 


A 


4850 


241 


348 


SFFSQTQASEQLTKSKQGLALRTLPPVQ 
FPPPPPRP 


4818 


18719 


A . 


4851 


2 


176 


LVETGFCHVGQAGLELLTSGECWDYRCE 
PPSPAYCFLHTIVTIYPLLCLAPFSYLC 
DC 


4819 


18720 




4852 


2 


191 


VGRVGLELLAS SDLPTSGS PSAGITGMS 
HCPWPNFLLIFIGYFFTVLTPCEYYSGO 
MTVPLWL 


4820 


18721 


A 


4853 

• 


67 


376 

■ 


IKCTTFNIKKKKKKKXKKKKKKKKKKGG 
ALKKKPWGGQKKTGEKKKKFFSKRGGKK 
KRGPGRVLKKGEREKIFFWNFLKKNFFW 
GGGNLGQPPPKKLRALGKK 


4821 


18722 


A 


4854 


278 


363 


I WPD WAHTCNPS TLGGQGRWTMRSGVR 


4822 

• 


18723 

■ 


A 

• 


4855 


3 


282 


RG CS E PRLRYCT PAWATEQDSASKKKKK 
KKTQ I FWEGPGGGVS PFKPHFFKGPGGK 
i KPWGQKFKNPPPQWEKPLPFFKKKKTKF 
SPGGGGPP 


4823 


18724 


A 


4857 


382 


224 


PVIQLIGGVRQEKNFNLGVKGFNEPKSP 
PCPPTWVAKQNSLSKKKKKKIGKK 


4824 


18725 


A 


4858 


392 


102 


FFFFFFLRQGLSWQARVQWCFCSNLGL 
LGSSNPPTSASLSCFLNKKEYRWENVTN 
KV 


4825 


18726 


A 


4859 


3 


111 


FHRI SQEGLILLTTiGSAHLGLPECWDYR 
REPPRPA 


4826 


18727 


A 


4860 


282 


83 

• 


AQIYKVQNWGSSGNKCICVCVCVCVCVC 
VCI Y I YMLCMSRFKYKCIVGCMNTDS YV 
VYIPNIQCYI 


4827 


18728 


A 


4861 


3 


417 


NHSNLGGRGCSDPRSRHCTPAWVTERDS 
ISKKJCKKKKKKIFFGLREKKLTLFPFLG 
KKNENSGFPLNPNFFFGGKGKKATLGPL 
GLVLRGVPFFRGGETQNGBNQKPGTTLE 
KFEKHFWVGPPPLEKISPHKGKQTQK 


4828 


18729 


A 


4862 


1 


88 


MEEMYKKAHAAILENPVYEKKPKKKKKK 
K 


4829 


18730 


A 


4863 


274 


369 


VSLC CAG WRAAVRS Q I TAAL I S RAQAI L 
PPQP 


4830 


18731 


A 


4864 


37 


444 


DPRVRSEKiaPFKILLLIDNAPSHPKAL 
MEIYEEINVIFMPANTTSILQPMDQGVI 
S TFKS YYLRNTFHKALAAMDSDVSDGSG 
QSKLKTFWKGFTILDAIKNIRDSWEEVK 
LSTLTGVWKKLI PTLIDDYBGFKT 


4831 


18732 


A 


4865 


341 


210 


VYMCKVGUjYHQPVIYIRNRIEASBITP 
HIYNYLIFDKPSHQF 


4832 


18733 

■ 


A 


4866 


206 


412 


QLCLS QACTIARGNAKGFLKY IHRNNVS 
MPSVAGHTWGPDQLVKGQGBTCPATIHF 
SDSPSLKYTMYPM 


4833 


18734 


A 


4867 


248 


78 


SHFEAGLAAQLTVSWFPQGTPVFVHAGP 
FANI AHGNS S 1 1 ADR I ALKLVG PEGFVG 


4834 


18735 

■ 


A 


4868 


637 

• 


2 


EFDGRWPRQIVSS IGLCRYGGRIDCCWG 
WARQSWGQCQPVCQPRCKHGECIGPNKC 
KCHPGYAGKTCNQDEHIPAPLDQGSEQP 
LFQPLDHQATSLPSRDLNECGLKPRPCK 
HR04NTYGSYKCYCLNGYMLMPDGSCSS 
ALTCSMANCQYGCDWKGQIRCQCPSPG 
LQLAPDGRTCVDVDECATGRASCPRFRQ 
CVNTFGS Y I CKCHKG 


4835 


18736 


A " 


4869 


276 


426 


VSFFLFFFFWGANPPFVPQAGGQGGFLG 
SLNPLPPGLKHFFCLTPPSSGN 
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SEQID 
NO: of 
oucieonae 
sequence 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

Ou 


SEQID 
NO: in 

09/515,1 


Predicted 
beginning 

^_ milk a AZjlih 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 

< 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=oiutaratc Acid, F-Poenylaiamne, 
G=Grycine, H=Histidine, I=Isoieucine, 
K= Lysine, I^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R»Arginine, S«Serine, 
T=Threonine, V=Valioe, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


4836 


J 8737 


A 

• 


4870 


100 


485 

• 


EERNREMTERCSVTQAGVQWHDLGSLQP 
SPPGFKRPSCLSIiPSSWDYRIiECDGMNL 
AHGKLRLPGS SE S PASASRVAGITGMCH 
HAPLARLVSNTLPQVTLPPLGLQQAEEL 
PRRQEVEHNTPSLVPI 


4837 


18738 


A 


4871 


718 


408 


EVLTLQIAHPPGFLGAHWWNQQDAALGR 
ATDSKEPPEELCPDVLYRTARTLHGQET 

YTPRIjI lmdlh ydvs lg al vl taag yra 

SSCSQSVAVCTGSVLEARA 


A OIO 

■ 


15739 


A 


it OTl 

4872 


392 


246 


RGGCS ELRSCHCTPAWATRVKIjHLKKKE 
KRKCKI RPEGNE I IiVNNAKG 


483 y 


18740 


A 


4 Of) 

4873 


• 
1 


156 


ggggyseprs chs t pawttraklrllkk 
KKKKKGGVCFGVGFIjGPRPKGGPP 


4840 


18741 


A 


4874 


246 


2 


AHG I S I YASMLAMS I FSRRNVS FRRAGQ 
VKAVFPVSWFGPLHRHHLGEHFLFFLR 
QSFTLVAQAGVQWCHI/3SPQPLPPA 


4841 


18742 


A 


4875 


72 


2 


CFPAAPDEDSTTNI TKKQKWTVE 


AHA1 
4o*W 


IO/4J 


TV 


a one 


375 


221 


VTBQDSISNICIYINIFSWHGQ 


4843 


18744 


A 


4877 


151 


2 


RAPFFFFFFFFVAQ^JBVQWRVIjGSLQAP 
PPGFTPFSGLSriPSS ( WDYRRP 


4844 


18745 


A 


4878 


401 


70 

■ 


PHRREGGCVPPPPPKNFFFSPGGYFFWG 
GGGQNPPPPKGGFFPKTPPGFFFSPPQK 
KKIFFFPPPERGPPPGFFLRPPPPFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFF 


4845 


18746 


A 


4879 


4 


152 


LPYSTPSTPFRAKKKKKKKKKKKKKKKK 














EOCKKKKKKKKKKKKKKBOCKKKKRGGNFK 
KR 


4846 ' 


18747 

- 


A 


4880 


267 


1 


TPPKKKKKKI I PPPKNGPPPHI FKKTPP 
PLFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFRTRGRTRGRTRGRT 
RGR 


4847 

* 


18748 


A 


4881 

• 


2 


408 • 


LQDATSP I XEEL I TFHDHAL 1 1 X FL I CP 
LVLYALFXiTLTTKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKGGGGPKKECNRGG 
P KKLGGE KKI F F F FKGGKKKTRGGVGKK ! 
KIFLGGGNFGPPPPKKKKPRGKI 


4848 


18749 


A 


4882 


413 


69 


LPPPPPGFKHFWAPPPPGRGVPGPPPPP 
RVNFVFLGKKGVSPFGPGGPRPLFPNPG 
GVGAGDPLDPRGQGFHGPKFMPCPPARG 
TKREPPFPKKKKKBKEEKKBPERKEKKK 
EI 


4849 


18750 


A 


4883 


46 


394 

• 


KQYQVSVKLLFVTQNLGTKKKKKKKKKK 
KKKKKGGALKKKPGGGQKKPGGKKKNFF 
LKRGGKKKPPGIFEKXTLFGGGENWGTP 
KKKP PGKKKKI JjKGeXjUKKI Ux FGRG 
KNFS 


4850 


18751 


A 


4884 


399 


3 


FFFKKFFSPNEFWFFFPPFPLKIFFFPP 
RLFFFWGGLAQFS PP PKKVF FS KFPRGF 
FFPPPLGKNFFFFPPGYFWPPPGFFLRA 
PPLFFFFFFFPFFFFFFFSVS IRNAIA.I 
RMGTMRS RRLAMGMLLRRG 


4851 


18752 


A 


4885 


391 

• 


47 


PHFFWGGVFPPPPPKKKKFFFIiPQKGGG 
GGAPKKKKKKKKFLPPNFSGGFFFFSPP 
GGPPRFFFFIFFFFFFFFFFFFFFFFFF 
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Amino acid sequence (A»AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalamne, 
G=Glycine, H=Histidine, I=Isoleucine, 

xr x t t a /f —TV/I ^. i l. : : 

K— Lysine, L= Leu cine, IVl^Metniomne, 
N=Asparagine, P»Proline, 
Q=Glutamine, R»Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 




- 




• 


- 




FFFFFFFFFFFLS KMFTFI LKYTLKQNE 
FY 


4852 

■ . ■ 


18753 

- 


A 


4886 

■ 


32 

• 


379 


LLPVKKKKEKKGE KKKKKRKKKKKKKKK 
KKQKKKGGGPSIKI IGGGHLSSGGRKNK 
S FSHRGGLKKPLFFliGRTY FLGGGGLWP 
HPPPKKIAGGAKNIILRGWGEQKPLLFP 
PGEK 


4853 


18754 


A 


4887 


178 


33 


I FKRKWFF FRFF F FF FL Y YFFFKFFF F 
FFFFFFFFFFFNICHHPKLY 


4854 


18755 




4888 

i 


238 

• 


3 


KPFNPFKKKKIGPQKKKKKFPPFLKINP. 
PKYKTQNINIYKYIKIFFFFFFVRGSDS 
VAEABVQWHHLGSLKPASPGFK 


4855 


18756 


A 


4889 


93 


1 


DGILLLSPRIiKCSGTISAHCNLCPPGSS 
NSR 


4856 


18757 


A 


4890 


6 


219 


IEKAHKALGTVPGS TF VRYHHYHHHiJHY" 

HHHCHHHHPHQDNSEWEKAQSLEPDKLF 

LNPGFATSQLRGLR 


4857 


18758 


A 


4891 


1 


364 


TSSRSRAGRPMDFGFFFKKKGHREPPGG 
VEKKKKNWGSRKT P P S G VKNF PGFDP P K 
MWNKGGPPTPPVNFFGFKKKRGFHKGLK 
PPPPEIAPLNPQKGGEKKGGPPPPPTPF 
LGGKFQGKR 


4858 


18759 


A 


4892 


387 

■ 


81 


GVFSPPRGGPPPGGKTPPFFPRGGPPCG 
PGGRGPIFPPLSHKKFFFSPPPPLLGGG 
GGPKSAPPP KGVFKEFPENFFWGAFIKK 
SFFLKKGGFFGPPRGFF 


4859 


18760 


A 


4893 


213 


14 


KTPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFMLE VMFLVNRRGKI CRVPFTFFNL 
SL 


4860 

■ 


18761 

■ 


A 


4894 


384 


10 


PLPPFIPPPPPPPPLFGGGVCPYF3PPP 

• 

HLGSPSPPPGGRPSPPPPPPRSTPPPPP 
PPFFPPPPPFVSGAPPPPPRSCLAPPFF 
FFFPPPFFFFFFFFFFFFLPFFFLKKSD 
FIFSLQMKYFMLI 


4861 


18762 


A 


4895 


240 . 


355 


LI IT PALFFKI FVLWLGAVAHACNPSTL 
GGQGGQIMRS 


4862 


18763 

■ 


A 


4896 


339 

• 


3 

• 


GVGWHLFFPPQKKFFFFFQVTGASCAAK 
SSKESHKGIiHLNIYLPKIFFYRPPPPFF 
FFFFFFFFFFFLFFFFFFFFFFFFFFFF 
FFFLWREGYNQHFRGMGPIGRVGGRV 


4863 


18764 


A 


4897 


56 


335 


TCFLYKLKKKKKKKKKKKKKKKKKKKKK 
KKRGG PL KKNLGGP KKNRGG KKKI FFFK 
GGEKKNLWGFFEKKPFFGGGKICPTPPQ 
KKKPPGEKK 


4864 


18765 


A 


4898 


7 


206 


LCLQPDNRWNDDQS KSFLIEKKKKKKKK 
KKKKKKRKKKXKKGGGPLKKNPGGAPNN 
PGGEKKNS P P 


4O03 


1 Q7AA. 
10 /OO 


a 
*\ 


A BOO 


HA 




INICSEICLPKKKKKK1CKKKKKKICKKKKK 
KKKGKKIKRGLNSQTSLLKKREKKKKRG 
KI 


4866 


18767 




4900 


151 


312 


KYHFHKNYFFSLLYFSQNVSQLSPDGPL 
PQLPLPYINSSATRVPFGHDRRPADG 


4867 


18768 


A 


4901 


1 


364 


LNLGGRGCS EPRLR YCTPAWATEQDSAS 
QKKKKKKNPNF 


4868 


18769 


A 


4902 


1 


96 


GGGGCGE LKSCHCT PAWVAEQDS VS KKK 
KKRG 


4869 


18770 




4903 


3 


225 


GFHRVGQXX5FDLPTSWTARLGLPRGWDW 
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09/515,1 
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C0/IIIDII I*A 


Predict- 

■ 
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Inrfltinn 

IVHtUUU 
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ding to 

loot Qminn 
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I C9IUUC ui 

peptide 
sequence 


Amino acid sequence (A^Atanfne 
C=Cysteine, D-Aspartlc Acid, 

ITi— r^liito m ip Aciri fi^sPh An vlalonino 
ix-UI U la llili. niiUj X"~X licit jrlalaUIUCf 

G^Glycine, H=Histidine, P=Isoleucine, ! 

KssfjVsine. L^Leuoinc l\4=MethtnnlnA 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S-Serine, 
T=Threonine. V=Vflline 

M A UI v*JlJ Ijlwy r T oil If vy 

W=Tryptopban, Y=^Tyrosine, 

XsTlnktiAwn *— Stun rod An '^nnftsiHtA 
nucleotide deletion, ^possible 
nucleotide insertion 














RREPPRPAVWAQLQLLTS3DPPASDSQR 
AGITHLSHLTLFCHLRF 


4870 


1 on i 
1 6771 








lO/l 

iy** 


nn v unr v r uijnvjy iuic r r rrr juws i yu 

WGPPPPPPFFFFKKTPIFSQVFNPPPFF | 
FFFFFFG 


4871 


18772 

■ 


A 


4905 


3 


387 


TI I TP I LLTLFL I TQLKI LNTNYHLPLP 
AQYPLPQEOCKKKKKKKKKKK 


4872 


18773 


A 


4906 


1 


327 


PTRPIEPIPENPKFSVPPITPHPKKKKK 
KKKKKKKKXKKKKKKKKKKKKKGGALKK 

EFLKKTSFWGGENIJVTTPPKKKKARKKK 

knf" 


4873 


18774 


A 


4907 


134 


2 


ALYSLKVKWPGAVAHACNPNTIiGGQGEW | 


4874 


18775 


A 


4908 


59 


449 


ATl^TIIvlIAYT^VGi^l^AWl^Ka^ 

GCHSTHTVIiPPVTWVKTTGNSTVANFV j 
LILVCLS FLIiLVYRC IQQLQRDSNQREG 
AMMMVVVLSKRKGGYAGK 


4875 


18776 

+ 


A 


4909 


337 


3 


LiKTAW Ax RGK Jr FJj Y iU\JN 1 w x 5GAKGEGJP 
PIPLIGGGLSQKNFFTPGGENSINPDPP 
PSPPPGAKKETPPPGKKKKKKPTTKELG 


4876 


18777 


A 


4910 


14 


loZ 


ZvU Cfl 7i VRPT7A PPPTXPnOfSTlTnKPT .WT T V 

AAKKHMKTCLPSLAIREMQIK 


4877 


18778 


A 


4911 


331 


2 


LPIiLAPKEGQGKIIiGYKKPPPPGLPPFF 

GSTPPRNGEKGGPPPPRGNFWVFKKNGG 
pp p t /v;p np p pr po itp p p p p p n fcrtj irno 

GNPAAGPFFFFFETEFHS CCPGWSA 


4878 

■ 


18779 


A 


4912 


116 


316 


ACAHAHTLFFFFFLKKKPPFVPLLGGKG 
GNLG YLKLWL PGLKKVCLT P PRTGNYGG 

APPWP T T PPPTi 


4879 


18780 


A 


4913 


93 


16 


SPPGWPGTVAHACHPSTLGGRGGRIT 


4880 


18781 


A 


4914 


316 


2 


KPRPGNPLRARVFNPPGPPGETPFFPKN 

DirJ7TDn/2Df2fiD T VWDPDfiD VP PlfMPT.VP 
r&T IKouKourX liNfcT trvMt VKxr fiTfrXi X lr 

RGSRFHWPNS IPCPFAWGTKPNFFSKKK 
KKKRKKKBKLCFSASVPDAQ 


4881 


18782 


A 


4915 


3 


289 


TS CNPSTLGGRGGRI TR VGVRGQ PDQHG 
ptp qt.t.ttmvt yt vt ounwT.wpMWini 

CiX froUUfSJN X X X X X 1LVUV XUX VU'iJ VuH 

YIYICIOSYIYIWRxXSLTSEHTHTRAP 
GQ YRVTYFFCG 


4882 


18783 


A 


4916 


1 


222 


ARGEREPJERERERERERESRHQGGGAGP 

Xr XXlXJnvjQXliO V V XJOdJErvXVJ VKXiXWuUTXnV? 

CAAPVTiAPRVGRTPFGVD 


4883 


18784 


T 

A 


4918 


i 
1 


zol 


7APfSPPFPPRFRRRPRPP'RPFPFRPPRPF~* 

RERERERERGAP P P PPRGGVRERGTPCP 
SARGARALSPTISCGDPHRGPKIYRGVG 
VHTQRYFSVGSSLCINTHT 


4884 


18785 


A 


4919 


1 
* 


259 


ARGERERERERERERERERERVFVEKGD 
MVTL PAG I xTiRFGGG EKKYTKAVRVCVR 
KPGWRAHNRPADH FGARGPHVKFLAQTV 
YE 


4885 


18786 


A 


4920 


797 


910 


NTMQPGVVTHTCNPSTLGGQGGRITGSG 
VQDQPGQHGB 


4886 


18787 


A 


4921 


2 


67 


LEERERERERERERERERERERSS 


4887 


18788 


A 


4922 


1 


698 


TLLLAELGTICDPYRSCSISEDSGLSTA 
FT IAHELGHVFNM PHDDNNKKKKRGG P F 
KG PNFS P P VRAS KN FFFG P PNLNS WARF 



847 



■ 

* 

WO 01/64835 PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
Sequence 


SEQID 
NO: of 
pepuae 

eaniionita 

oetjuencc 

■ 


M 

eth 

oo 

■ 


SEQID 
NO: in 

YTCCM 

26 


Predicted 
beginning 

n u cicuiiue 
location 

correspond 
ing to first 
amioo acid 

leoiuuc uk 

peptide 
sequence 


Predict- 
ed end 

n nolo. 

ntiHp 
vim v 

location 
correspon 
ding to 

ln*t amino 

acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

Rr=f2liitamir Acid lfePhenvlalanine 
C^Clvcine. H=Histidine. l~L»oleucine. 
K=Lysine, L^Leucine, M=Methionine, 
N»Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Thrennine V—Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *==Sitop codon,/=possible 

nucleotide deletion \=no£sib!e 

nucleotide insertion 














LT PGEGKTPGVPP FNPFATKAEJj 


1 A OOO 

4888 






A CYT2 




z 


r jVCRGT I HKFRC VPHI/rGRRFEHGVTDC 
YTLFRDAYHLAGIEMPDFHREDDWWRNG 
QNLYLDNLEATGLYQVPLSAAQPGDVLL 
CCFGS S VPNHAAI YCGDGBLLHH 


4889 


18790 . 


A 


4924 


3 


225 


HEEALHIiFQTLMECMKRKELITVFHIGS 
nirunn t nra t t.t at .t .kgkpt iR tut i pvp p 

FILCMYMCVYVCICVCI 


4890 

i 


18791 


A 


4925 


371 


169 


HTQS ISVVLVERRFHHSGQENPLNPGGR 
GCGELRLCTCTPAWATRAKSRLKKTKQT 
KKDAIKCSLCN 


4891 


18792 


A 


4926 


2 

• ■ 


352 

■ 


AKAAxUUiK ± ± rvEjJjr r sec c\jr» ii AJvAiSJLiJj 
FWWPGGVKNPDLGNPPPWPPKGGGITGL 

GPREFFGtALQRGGDLGLNHRGPPFPFG 

V XJ& JLI*1 


4892 


18793 


A 


4927 


2 


379 

• * 


ARANMEETQQKSNLELVRISLLLIKTGL 

DLQEG I QTLMGRLENGRRRTGQ I LTQTY 
e VTTDTNTS DNHDATj T.K3ffYRLL YCFTKDMD 
KVETFIiRMEQCRSV \ 


4893 


18794 


A 


492© 


1 /J 


n 


TT^bT,pppAWLHPFLFLLLPFSRPSIiAV 
TEVAVQWRNLGLKQPLPPGF 


A OCi A 

4894 


18795 




492/ 


HCQ 


lf\l 


VDRT.PPCCPGWS PSLELNOSACL3FLKG 
WDYRCEPLCSVS 


4893 


1 6 /VO 






4 


lift 


ARGE PRSHHCTPAWAMS ETVS KKKKSGG 

ruvwurivuufiv x ^ An/it A4hm? u x ▼ v*\4\«v(wwv 

LFFLPRLV 


4896 


18797 


A 


4931 


150 


1 


KYVAPCRPLFLVLSPCRRSCFPFPFCHD 
CKFPES FSEATMFFLQPAEPRA 


4897 


18798 


A 


4932 


2 


155 


ARDDLNPGGGCSEIiRSRHCTPAWATERD 

g K Tf Tf TfMPnWTLJftWTiTrTnCWWIjTC 

Did IVTvJVIMN IT ^ ii -L « VJ1N UlVlVXVf <1<I Ui\ 


4898 


18799 


A 


4933 


144 


249 


ETLYPQPPQSLNASCVCVCVCVCVCVCV 
CLCVCV 


J AAA 

4899 


18800 


A 


AfVi A 

4934 


2 


153 


apse l?VP KTJTJP TIVROmSTiTfTiTiA <5f3fJP PN 

LATQSAGI IS ISHCTQPWRQGL 


4900 

* 


18801 


A 


4935 


1 


355 

• 


GTS QE FEGRKDRIjAS PLETGNAGAGRER 
LAEVSTCPSGASKPLQTPRPGGQBGTPG 

T/VlPPMRWCJARP^PWPAWR^TRriP^riT 4 ? 
TALL VACS IFHIHEDPILHDVLISR CMS 
IKTSHR 


4901 


18802 


A 


4936 


112 


358 


NIIKFSKAFFFSFYIFILINSVGLASTF 
MWGGELKFSFTSGILIIFQPFKIKGHPQ 
RG I TjRTGTFKEDPDDPEGNKVSSIxLO 


4902 


18803 


A 


4937 


1 


258 


GTSPY'VTQQFFSGWGRQGLTLSPKLEHS 

elrlchctps wvterds is kkkkkklfy 

IiAGPTFFPrXSGGNFNLALGRIxlaKRGGGE 


4903 


18804 


A 


4938 


380 


1 


FFFFFSETEPYSVAOAGVQWC 


4QA4 


1 OOUJ 


A 

* * 


4940 


255 

Aw<JJ 


i 


i/3nnlagykirglys i slrnnrhpspvi 
slcgnlfyvirffdmescsvtqagvlwc 
nigssqaltpgiapfsfislpstrqnsr 

P 


4905 


18806 


A 


4941 


10 


173 


KYIEYDTNKWKHNSCSQIGRINFVKMSL 
PPKAIYTFSVFPIKIPSVYLPVHLSI 


4906 


18807 


A 


4942 


144 


3 


DKC I RRPGTVTHTCN S STLGDQGGRVMR 
SGVRDQQGQHSETPSVRV 


4907 


18808 


A 


4944 


484 


274 


PSSQRS PRTARMIALSTRPTTRSMKFCH 
VAQAGLKLLGS SDLPASASQNVG I SGVS 
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correspon 
ding to 

to tr4* nminn 
InM UlllinU 
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Amino acid sequence (A«Alanine t 

C=Cysteine, D=Aspartic Add, 

jCi— viiu luinic Aviu 7 r— rQcnyiaianinc, 

. K=Lysine, ls=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Scrine, 

1 — A ill cU III lie, V — TillinC) . 

W=Tryptophan, Y«=Tyrosine, 

X=Un known, *=Stop codoa, /=possible 

UUCtcULlUc UclcilOUi V"poo9ioie 

nucleotide insertion 














HHTWPRFYYLS 




loouv 


A 


4945 


103 


i /Hi' 

412 


P6GAKKKR6RKKKI FFFKROBKKNPGG I 
FLRGKGGKNPPIPPGKKKL 


4909 


18810 

■ 


A 


4946 


309 


81 

* 


KFFFFLKGLIFLGGFCPIFPPPKKIFFF 
rvlfJrVsvirr Z>trrr lUvtUNr c iTlrrlrLtiBldlrir 
RVFFKGPP PFFFFFFFFFF 




1 OOI 1 

18811 




4947 


18 


264 


PTVPYTFFKNPHGTLCRTDHMLGHNTGL 
NKFKRTECTQSMSSDHNGIKYHSAP 


4911 


18812 


A 


4948 


418 


293 


QLLRRVRQENCFNPGGGGCSEPRSPPCP 
PTWVTKQVFFSKP 


4912 


18813 


A 


4949 


3 


252 


DSAKHLGLKHVVYCGLENVKRLTDGKIiB 
VPHFDS KGEVEE YFWS IGIPMT8VRVAA . 
YFENFLAAWRPVKASDGDYYTLGKTK 


4913 


18814 


A 


4950 


434 


270 


KRGFPPKTPPGFKKPPPKKKKKI FPPPP 
KIGPPQKIIiKKPPPPFFFFFFFFFFF 


4914 


18815 


A 


4951 

• 


399 

■ 


1 

- 


VGFSLFPP PQKKGFS PKPPRGFLFPPPK 
GKKKI FP P PGKIGPPQGFFKRP PPLFFF 
FFFFFFFFFFFFFFGPPSVTLLFliVTIL 
LQ IQTTALGRKSWGKGFRIPKPVCPVPA 
QQTKQGaKHIiDGQVV£ XNSCA 


4yl5 


15816 


A 


4952 


235 


50 


NKKIFLPPKFKGPFSFGAPLPFPPGQKK 

ynr. 
JUrJj 


4916 


18817 


A 


4953 


13 


194 


SILISLQISIillTFTATELIIFYIFFBT 
GGAF 


4017 

**y if 


loo Jo 


A 




Hi 


i 


P POKTCKITO T "PR PPOTC T OP POOPFTTRflPP 

LFFFFFFFFFFFFFFFFRFBGECSDLDG 
RV 


4918 


18819 


A 


4955 


450 


0 


LPNKADKKKKKKKKKXKKiCKKKKKAR 


*ty iy 




A 




"90 
zy 


1 JO 


KKKKKKKKKKGGGGF 


4920 


18821 


A 


4957 


397 


0 


PSSPPPPSPPPSPPPPPPPPPPPPPPPS 

Oct Jr XTnxr ctrtrtrtrCiDcirtr 


4921 


18822 


A 


4958 


257 


397 


FFFLSFTFVAQAGXQGXXYGSLDPP PPG 
FNLFSCPSLPRIWlTSrrAPP 


4yzz 


1 CBOI 

looZJ 


JV 


AO CO 

4yoy 


41y 


/OD 


IGVSHRAQS I KSLAPQYWYFPSF 




1 QCO/4 

1B5/4 


A 


4you 


inn 


|yu 


rrrrrr rrri vnr cere uc v^f jjr r r see 
FFXIFNNIF 


4924 


18825 


A 


4961 


3 


179 


DXWVL^FllAALAAiHLLFLHETGSNNP 

iMl loHOl/U 1 f fir X X X X t\l/AXA»XjJJXJg U 

LS 




loozo 


A 


4yoZ 


2 


182 


D KT> C PTWT .TPT.TTP A VTA C r*C I? W &UAt/T. 

K vru\j\±Joi\.w jj x rijriri/iVAov^xSiSAvy via 
LKHSADVNAIU5KNWQTPLHIAAANKAVK 
CAKL 


4926 


18827 


A 


49o3 


273 


440 


KIHIQHCWWEYKLLQSLWKAAWHFLKEL 
KAELPFNPAIPLLDIYPEBYKSVYLKDT 


4927 


18828 


A 


4964 


179 


3 


SLPFCC1JLTPKKKLFFKFCRKNGFFFFK 
ME SHS VTQAGVQWCNLN S LQPCPHGLKQ 
FS 


4928 


18829 


A 


49(,5 


268 


3cU 


TVCVIELLTSGDRPASASQSARITGLSH 
HAQP ! 


4929 


18830 


A 


4966 


374 


2 


QNFFLKKKKKKRVGPGVPPFNPPPFWGQ 
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Q=GIutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y«=Tyrosine, 
X^Unknown, *«=Stop codbn, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 




■ 










GGGFP RS I LGNP I KPLFFLNS PKIS PPL 
WPGPFFPLPWGVWAGKNPLPRGGGFPLT 
KFPPLPSSLGKKKKFRFKKKKKKKGQAR 
WLTPAI PTPREA 


4930 

• 


18831 


A 


4967 

■ 


79 


346 


DKIFSNISCGIVIDLLAIYNWDAYATRL 
GIYKHWDFII IDKDTSSCRLSFSSYPRF 
LESLDDFYILSSGLI LLQTTNS VFNKTL 
LKQVT 


4931 


18832 


A. 


4968 


3 


93 


FLHVGQAGLGLPTSGDPPASASQSVGLQ 
A 


4932 


18833 


A 


4969 


158 


1 


SKSNFAYIFPI KDNGLTGKKTINRWTK 
NWANDLNRHFS KVNTQMAKRYMKK 


4933 


18834 

» 


A 


4970 


2 


266 


VGQAGVELLSSSDPPASAYEGAGIPGVS 
HHTWPKHFFPALFVACISSLVNCLFQLF 
AC I S IGLS FFFFFFREGDLGNLLKI AGL 
TFLG 


4934 


18835 


A 

• 


4971 


3 


96 


GVRHHAHLIFVFLVETGFHHVGQDGLNF 
L 


4935. 


18836 


A 


4972 


253 


348 


NF I FLLFI YFEMESHS VAEAGLQWHCLS 
SLQP 


4936 

• 


18837 


A 


4973 


326 


2 


PMEIPQHKLSLWPGFAISVSYFERKLLF 
SADVSYKVLRNETVLEFMTALCQRTGIiS 
CFTQTCEKQLIGLIVLTRYNNRTYS IDD 
IDWSVKPTHTFLKRDGTE ITYVDY 


4937 


18838 


A 


4974 


3 


244 


DAS L VF KVAE TANE E E VKKMCM YKYPGN 
KCGREGGREKKREEERREGRKEGREGLR 
EGETDEEREEGEEGLSYSPFKNSI 


4938 


18839 


A 


4975 


6 


79 


THFSLTITSLQPEDIGTYYCQQYD 


4939. 


18840 


A 


4976 


15 


339 


PGEAGNCIiNPGGGSCSEPRSRHCTPAWA 
TERNSVSKKKKKKKKKKKIPRGRGLPPV 
S HP FW KAGG ADWFDLGTLE P P W PTGETP 
VFKKKKKLTGGGGAPLVAPTWGGG 


4940 


18841 


A 


4977 


204 


275 


I IEVDPDTKEMLKLLDFGSIiSNLQ 


4941 


18842 

• 


A 


4978 


3 


352 

— 


RRPSPHGLVGAVSVGGAGVMAVETLSPD 
WEFDRVDDGSQKIHAEVQLKNYGKFLBE 
YTSQLRRIEDALDDSIGDAWDSNLEPMA 
WRRLPYEQSSVLEL IKTENKVLNRWTG 
YAGL 


4942 


18843 


A 


4979 


219 


3 


AAS TMAMS FEW P WQYRFP P FFTLQPNVD 
TRQKQLAAWCSLVLSFSRLHKQSSMTLM 
KSRLTSLS I TVMLO C 


4943 


18844 


A 


4980 


142 


3 


FQGFFFLRDKVLLCCPGWSRTPBLKRSS 
CLGLPKCWDYRRELPRLA 


4944 


18845 


A 


4981 


351 " 


2 


FFFFFSETESRSVAQAGVQWRDLD 


4945 


18846 


A 


4982 


133 


3 


SFFFFLFETECCSVPQAGVQWCDLHPLQ 
PLPPGFRRFSCLSLP 


4946 


18847 


A 


4983 


3 


161 


LGSLHDTANTLWPRLERSRTIMAHYSLD 
LPGS SDP PTSASHVRGTTGMRSTRP 


4947 


18848 


A 


4984 


363 


3 

■ 


GPSNKKRGRSKKAHVLAASVEQATENFL 
EK3GDKI AKE SQFLKEELVAAVEDVRKQG 
DLMKAAAGEFADDPCSSVKRGNMVRAAR 
ALLSAVTRLLIIJUJMADVYKLLVQIiKVV 
EDGILKL 


4948 


18849 




4985 


101 


1 


LQVCYGRRWMNSREYGAWKQQVESKNM 
PFQDA 


4949 


18850 


A 


4986 


341 


1 


PPPMSPPDVLKTS PVADAAGWVDVDKET 
LQHQRYPNVFGIGDCTNLHTS KTAAAVA 
AQSG I LDRP I SVIMKNQTPS KKYDGYTS 
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nucleotide deletion, \=possible 
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CPLVTGYNRVILAEFDYKAEPLETFPFD 
Q 


4950 


18851 


A 


4987 


86 


1 


KCSGTISAHCNVRLLGSSNSGPSASRAI 


4951 


18852 


A 


4988 

i 


124 


3 


LP WLCS KPE TKTFQFQ I YYKAKGVKTVW 
YSHKDRNIDQWN 


4952 


18853 


A 


4989 


305 


150 


WLLNYRYSHSAEVQVCFQFIiTWVFSTLG 
S PDHFS KFFNQ YFFK I CFD YWKN 


4953 


18854 


A 

• 


4990 

• 


64 


340 


KKKKKKNLL KFPGQREKPLS PNFLGGLG 
GGKNGPPGGKPSRDPDWPPGIKPGEKKK 
NPLKKKKKKKKKPLSKGGGGKKTPLKKK 
KKPPPKGF 


4954 


18855 


A 


4991 


3 


158 


PS LVQTRLRHAGQAGLKLLTSTDSPAS A 
SQSAGTTGVSHHAMPKDCSLNSN 


4955 


18856 


A 


4992 


116 


66 


FFFFFFFFFFFFFFFXE 


4956 


18857 


A 


4993 


3 


107 


GQAQ WLTP VXAALWEAKVGGS FEVRSLR 
PAWPTQ 


4957 


18858 


A 


4994 


1 


105 


KLDRLARHGLYEKKKTSRKQRKERKNRM 
KKVRGTW 


4958 


18859 


A 


4995 


110 


3 


RR IGR YKTVFLCTQLE ILMARYPLP PS P 
KPIKIKN 


4959 


18860 


A 


4996 


109 


3 


DEVSILCPRLECNGTILAHCNRRLPGSS 
DSPALAP 


4960 


18861 


A 


4997 


1 


148 


ACCPFCTIYLLPMFMIIKAPLMGTSKKK 
KKKKKKKXKKKKKKKKKKKKKKKKKKKK 
K 


4961 


18862 


A 


4998 


3 


110 


VILQGSNDVELVAEGNSRFTYTVLVDGC 
SKKTSTRP 


4962 

■ • 


18863 

• 


A 

. 


4999 

t/77 


3 


330 


P IADRGAEWSAREWMTICFELLQLLKA 
HKKAI RRATVNT FGY I AQAIG PHDVLAT 
LLNNLKGHERQNRVCTTVAIAIGAETCS 
PFTALPALMNEYRVPELNVQNGVLK . 


4963 


18864 


A 


5000 


3 


110 


VILQGSNDVELVAEGNSRFTYTVLVDGC 
SKKTSTRP 


4964 


18865 


A 


5001 


25 

« 


329 


NSRRRRNDQGSPNLCQTFTFMTLPYLPE 
HRSLLLKIRSCABRETKKKDDIPEBDKG 
NI KQCE INYVKKFQS FQDHKLKI S KEES 
KILKXAQKDGFLHETLLN 


4965 


18866 


A 


5002 


278 


345 


EDEEGYNDGEVDDEEDEEBLGG 


4966 


18867 


A 


5003 


101 


3 


VRIIISGTGKKKKKKGKLPKNYDPKVTP 
DPER 


4967 


18868 


A 


5004 


18 


350 


VSHECLIFHIQNVLTGLVISLSCPSVPS 
HTHKHTHIHTHTHTHTHTHTFS 


4968 


18869 


A 


5005 


26 


147 


KEEVRMALFANDMIVYLENPFVSAPYVL 
KLI SCFCKGRLG 


4969 


18870 


A 


5006 


85 




S AS RVAG 1 1 GKHHHTRLI FVFLVETGQS 


4970 


18871 


A 


5007 


370 

• 


3 


RARKPLL W VLKKI WS P WI YKKKTR INFQ 
KEKKSPFFSRPFFLFKKGGFLPPGVFFS 
RGFLKKIPPFFFFFFETESRSVTQFGVW 
SVQWCNLGLLQPEPPGSKQLLCRGLPSS 
WDYRCT tSTRP 


4971 


18872 


A 


5008 


1 


85 


IRILSKIKNALTHFLPQGTPTPLIPILF 


4972 


18873 


A 


5009 


218 


93 


FFFFFFFFYFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


4973 


18874 


A 


5010 


3 


250 


RPRRRRLRQENHLNLGGEGCSESRSRHC 
TLTWATEQDS VS KKKKKRVFP P PHPRGK 
FFWAGGPTFFPPLKTVPFPGGGKPNF 


4974 


18875 


A 


50U 


336 


0 


SSSSPPSSSSSPPPPPPPPPPPPPPP 
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SEQID 
NO: of 
nucleotide 
sequence 

* 


SEQID 
NO: of 
peptide 
sequence 

- 


M 

eth 

od 

• 


SEQID 
NO: in 

ircCM 

USolN 
26 


Predicted 
beginning 
nucleotide 
- location 
correspond 
ing to first 
amino acid 
resiaue oi 
peptide 
sequence 


Predict- 
ed end 
nude- ■ 
OuOe 
location 
correspon 
ding to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=oiutamic Acid, * Phenylalanine, 
v>— vjiycine, ri— ttistioine, I— isoieucine, 
K=Lysine, i^Leucine, M=Methionine, 
N«Asparagine, P=Proline, 
Q^GIutamine, R=Arglntne, S=Serine, 
i^inreonine, v = vanne, 
W=Tryptophan, y=Tyrosine, 
A^unKnown, w== otop cooon l / c: possiDie 
nucleotide deletion, \=possible 
nucleotide insertion 


4975 


18876 


A 


5012 


2 


80 


SNQNGTGGES I YGEKFEDENFHYKVK 


4976 


« noil 

18877 


. A 


5013 


53 

• 


324 


r J-iVr J? VPcaN orii\J.oJj^jljrll lAI.iisflSSW V 
AQVNES 


4977 


18878 


A 


5014 


329 


1 


FFFFFSETESRSVAQAGVQWRYDGSIiQA 
PPPGFM 


4978 


18879 




5015 


193 


32 


KTKQQQQQQQNQGTKCMFLALLGQI 


4979 


18880 


A 


5016 


354 


235 


CGE IGMLLHCWWECKLVRPLWTNLFLRD 
QSTLTVGSKSSF 


4980 


18881 


A 


5017 


■ 348 


2 


RLLVGKLMELHGEGSSSGKATGDE ! 


4981 


18882 


A 


5018 


76 


1 


IFFLSQVS PLS KEDAGE YECHASNS 


4982 


18883 

■ 


A 


5019 


68 


356 


YFGGVGGFFFFFFFFFWFGPSGGVPFVG 
VQAI F F F F F FFLGGFF FLVRD YFFVAVF 
WWGLLFFFVFFFWFCFVFFFFFGWKRK 
NIFYGWVFLFCW 


4983 

■ 


18884 


A 


5020 


1 


288 


FFIxNLEKNIKMLSSYTDNGIWATAEDF 
MQNFKNLVGYHNS ITEENL PQLGANENL 
ES QSGNFS WF I F FNADRKRGMVLLL PN 
NEMTILKPRTSV » 


4984 


18885 


A 


| 5021 


133 


1 


WATETMWSTKPKMWPGTGAHTCNPSTL 
GGQGGQ I TS PQKFKTS 


4985 


18886 


A 


5022 


2 


337 


RRSDPNFKNRLRERRKKQKLAKERAGLS 
KLPDLKDAEAAQKFFLEKIQLGEKLLAQ 
GEYEKGVDHLTNAI AVCGQ PQQ I LHVLQ 
QTLP P PVFQMLLTKLPT I S QR I VSAHSL 


4986 


18887 


A 


5023 


1 


325 

• 


VDGC PANLLS SHRS L VIiRAET 1 3 LGBHP 
CDRGEQVTLFL FNDCLE I ARKRRKVI GT 
FRSPHGQTRPPASLKHIHLMPLSQIKKA 
LD 1 1 ETED CHNAFAIiLVRPPTEQA 


4987 


18888 


A 


5024 


13* 


2 


ATMFLNS KVS KYSG YLLGFHE CREKGWM 
TWDGERDPSPGILiQIjQ 


4988 


18889 


A 


5025 


2 


94 


KTATKLIGGXHYDSXNIKAIRDGLLARR 
HAL 


4989 

* 


18890 


A 

• 


5026 


2 


362 


QELERSWAQRCVCVLALVAMLLLVFPTV 
S RSMGPkSGEHQRASRX psqfs keerva 
MKEALKVFPTWSTSFIQHEWEEYSHL 
FTIQGSDPSLQPYLLMAHFDWPAPEEG 
WEVPPFSG 


4990 


18891 


A 


5027 


222 


84 


AASTFCHVSQAGLEUiTSDDPPASASQS 
AGITGMSHRTQ PMVnlj i 


4991 


18892 


A \ 


5028 


366 


0 


SGRSKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKGGR 


4992 


18893 


A 


5029 


231 


68 


AGMGSRALPKPLLMS HSSL KAVELPDS F 
S PELRSLIjEGIJjQRIJVNRKIjGCuGRG 


4993 


18894 


A 


5030 


2 


366 


LNLGGRGCSELRSCHCTPAWVTETPFKK 

QGERTGPGWGAPVHTGLLPRRGQAFTQP 
PSAFKGPFHPTNKSSSEAKKKKKKPRVG 
RKKNVGYIIi 


4994 


18895 


A 


5031 


241 


368 


QVERNFKSQSGAEAHICDLSTLGGCGGQ 
I TRSGVHDQPCQHG 


4995 


18896 


A 


5032 


279 


396 


DGGMWPGTVAHACNPGTU3GQGRWITRS 
GVRDQPGQHGE 


4996 


18897 


A 


5033 


173 


398 


SNESSLRVNPFFFFFFIiKTNFSPCPPGG 
RQGPNFSLLDPPPPKVKEI FCIiTPPKRW 
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SEQ1D 
NO: of 
nucleotide 
sequence 


SEQ1D 
NO: of 
peptide 
sequence 


etb 
od 

• 


SEQID 
NO: in 
ussn 

26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

a**^to to A*. A* ^m*m^L 

ing to first 
amino acid 
residue of 
peptide 
sequence 

■ 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaoine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, F==PnenyUlanine, 
u=orycine, u— Histidine, I— isoieuctne, 
K=Lysine, L=Leucine, M=Methionine, 
rv-Asparagine, r^rroiine, 
Q=Glutaroine, R^Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V= possible 
nucleotide insertion 














b x RrAFr c i*KNFtiFL»I KTG 


4997 


18898 


A 


5034 


2 


143 


SNPPALTSQSMGITSMTRHALPIACFLV 
FV FliF FEMBSHS IaAQABVQ 


4998 


18899 


A 


5035 


350 


2 


QEISSVQTSTQLFNGMTVKARATTREVM 
ATYTIEDI VIELI IQLPSNY PLGSI IVE 
SQKRVGVAVRLFI F ITQKSFI FLFS FLT 
LCLCLQHFHNDFLLLIVPILIAMAFXiML 

tbrfc 


4999 


18900 


A 


5036 


1 


100 


hecgsssqrtlsvqeaaaVl^WeirI 

y t y* n 

LIAIF 


5000 


18901 

i 


A 


5037 


247 


334 


LQIDI S AWAYT IAVKEDGBLNLMKKAA 

s ■ 


5001 


18903 


A 


5038 


2 


296 


DKAPMLXVIVNSLKNMINTFVPSGKVMQ 
VVDEKLPGLLGNFPGPFEEBMKOIAAVT 
DIPLGKVHLEALFJCKVIKFFYKFPLRCD 
IHTAKCTGLRYTAQ 


5002 


18903 


A 


5039 


152. 


335 


RPI WS YYLLFF FLFWGLYLDVNKRI FTF 
I LSEKYFDMKKNQCKEGLD I YKKFLTRM 
TRISE 


5003 


18904 

* 


A 


5040 


336 


1 

■ 


GGLT I S S LLKE KEGS E VAKFTLEQLCL I 
CNIMS TAEYCVAATQQ LEE KL KEKVD VS 
LFDRINLSGDMGTFS PVISSS IHLLVQD 
VDAACDPAMVAMS KMQ CQNVQHVG VKS S 


5004 


18905 


A 


5041 


3 


204 


LNLGGGGCSEPRSCHCTPSWVTARIjHLK 
KKKKKKRGKNPLKNGGKENFKFLQILVN 
PKNSLENLAV 


5005 


18906 


A 


5042 


382 


148 


WCNHRGPRSRKKRSEGSTKSRRLGATIR 
MVTPHTTRTCARDPELTSKEKCVYIEEH 
THTHTHIYIHTHTHTCIYVKTH 


5006 


18907 


A 


5043 


137 


1 


RPRRRYMKKFSTSLIIKKMKIKXTMKYH 
LSHLIPVKMAITKKTKD 


5007 


18908 


A 


5044 


198 


34 


KPASRFCHVGQADLELLTSSDLPASASQ 
SAGITGVSHCSQPNFITLCLWTDHFF 


5008 . 


18909 


A 


5045 


11 


357 

* 


ItLTYFIIMFKXIiBIYEKEEQLIIKQISB 
GQEKVKELRQFKEHRKAKDSALQS IESK 
MLELTNRIiRESQEEIQ IMRKKKRKNAAR 
GP YWKFKYSQRW KS P PGP FQWG WGPRKN 
FFV 


5009 


18910 


A 


5046 


2 


193 


GLQLLTSSDPPASASQSAGIAGMSHCTQ 
PQVHLMPSLYHFRFLQVDTKDLLRASAD 
LIHRGIT 


5010 


18911 


A 


5047 


1 


196 


KLCLVMNSPMKAAPRNFSCAPSLSLLPF 
S FRKTTTTLTSNTADDFAY FDLCKMEPC 
SMVLFFVFC 


5011 


18912 


A 


5048 


386 


276 


AQAVLELLDSSDLPPSASQSAGI SGVSH 
CTQPDSIF 


5012 


18913 


A 


5049 


1 


324 


VDAAAEKLEASTGWIjMRFKERSCIiHNIK 
MHGEATVADTEAAAG YPEDLAKI TDKGV 

QAVPGCKAS KARLTVLLWANAAGD 


5013 


18914 


A 


5050 


1 


129 


PRSCHCFPAWVTEQDSVSQKKKKKKKKE 
KEKKKNPVEKSAKVI 


5014 


18915 


A 


5051 


193 


356 


RSFIPSASASASTLCLKCSQNGQPGAVA 
HACNPSTLGGQSRQLTRSGVQDQSGQY 


5015 


18916 


A 


5052 


192 


341 


AKRVKRNNFFFFFbKQFCSVPQAGGRGP 
DPGSLKPLPPGLKGFSCPTPLN 


5016 


18917 


A 


5053 


340 


1 


KRIPDKPQKELRRIiATKLIREAPEKGQA 
QGKEIHKS IQEAKGE I FKAIDRI KKSQF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

- 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

■ 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A a Alanine 
OCysteine, D=Aspartic Acid, 
j£=Olutamic Acid, r— rnenyiaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K= Lysine, L=Leucine, M=Methionine, 
i>F=Asparagine, r^rroune, 
Q=Glutamine, R=Arginine, S=Sertne, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, v-possible 
nucleotide insertion 


• 






1 






DRQBALDILLVMQNAbESLSNRIEQVBE 
RNSELEDKVFDLTRSSKAKGKRIRNYDH 
S 


5017 


18918 


A 


5054 


2 


383 


VVKVATQPADNPLDVLSRKIjHLGPNvGR 
DVPRLSLPGKLVFPSSTGSHFFMLGIGD 
IVMPGLLLCFVLRYDNYKKQASGDSCGA 
PGPANISGRMQKVSYFHCTLIGYFVGLL 
TATVAS RI HRAAQP C 


5018 

* 


18919 


A 


5055 


2 


383 


GSVLSKKGDYLKYHYNASLLDGTLLDST 
WNLGKTYNIVIiGSGQVVLGMDMGLKEMC 
VGEKRTVI IPPHLGYGEAGVDGEVPGSA 
VU4PDIELLELVAGLPEGYMFIWNGEVS 
PNLFEE IDKDGNGEV 


5019 


18920 


A 


5056 


2 


383 


AVI DE VRTGT YRQLFHPEQL I TGKEDAA 
NNYARGHYTIGKENIDLVLDRIRKLADQ 
CTGLQGFhVFHS FGGGTGSGFTSLLMER 
LS VDYGKKS KLE FS I YPAP QVFTAWE P 
YNFILTTHTTLEHSD 


5020 


18921 

■ 


A 


5057 


26 

♦ 


452 


KLMSLRLRQAWHEAAI DEFRTGTYRQLF 
HPEQLITGQEDAANNYARGHYTIGKEKI 
DLG VDR I RKLADQS TG LQGF LVFHS FGG 
GTGSGFTSLLMERIiSVDYGKKSKLEFFI 
YPAPHVTTAVVEP YNFILTTHTTLEHSD 
CA 


5021 

■ 


18922 


A 

• 


5058 


2 


385 


AVIDEVRTGTYRQLFHPEQLITGKEDAA 
NNYARGHYTIGKEI IDLVLDRI PKLADQ 
CTGLQGFLVFHSFGGGTGSGFTSLLMER 
LS VDYGKKS KLE FY I YPEPHVCTAWEP 
YNFI LTTHTNLEHS DC 


5022 


18923 


A 


5059 


3 


390 


GDAANNYARGHYTIGKEIIDLVLDRIRK 
LADQCTGLQGFL VFHS FGGGTGSGFTSL 
LMERLS VD YG KKS KLE FS I YPAPQVSTA 
WEPY^S ILTTHTTLEHSDCAFMVDNEA 
I YD I CRRNLD I ERPTYT 


5023 


18924 


A 




103 


3 


KIFFFLRWSFALITQAGVQWRGLGSLQP 
LPRAT 


5024 


18925 


A 


5061 


262 


412 


KAS PFRTGTAFGNGKTSDYLLLGNFGYT 
FGG ITGCLKAGLETS YWTWFTH 


5025 

* ■ 


18926 


A 


5062 


293 


406 


V I IGS I FEVXWAVTKPGTSFGI S VLRAL 
RLLRIFKVTK 


5026 


18927 


A 


5063 


419 


15 


WEEEGPLPKKKKGGFSNKRGNI IMGPFX. 
KRPPEKKPPPPPRKINPFFFFFKRKGPP 
PPRVKGKGAIKGSRNPPLFGSRDFFSPA 
PQKSGAPGGPPPPPVNKNQFFFFFFLVE 
TGFHHVTQA3PELLSSSSPPTI 


5027 


18928 


A 


5064 


2 


216 


GLTNLFI FCRDRILLCCPGWSQTPSLKQ 
SSHIjSLPKPWDYMYEPIjCLYFLYPWFCL 
SLSLS FPLSHTYFFX3 


5028 


18929 


A 


5065 




36 


CCLFLRQSPFIAQAGVQWHYLKSLQPPP 
PGLKQFS CFSLLLVPRLA 


5029 


18930 


A 


5066 


2 


404 


GKGAPTTSLISVAVTKI IAKVLEDNKLP 
GAICSLTCX3GANIGTAMAKDERVNLLSF 
TGSTQVGKQVGLMVQERFGRSLLELGGN 
NAI IAFEDADLSLI VPS ALFAAVGTAGQ 
RCTTARRLF IHES IHDEWNRL 


5030 


18931 


A 


5067 


1 


400 


GENM I TGT S QADCAVLNVAAG VGE FEAG 
I S KNGRTREHALIiAYTLGVKQLIDGVNK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
oeotide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 . 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon ' 
ding to 
last amino 
acid 

residue of 

nentide 

sequence 


Amino acid sequence (A*»Alanine 
OCysteine, EMAspartic Acid, 
E=Glutamic Acid. F=PbenvIalanine. 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, ^Methionine, 
N=AsDaraeine, P= Proline, 

P f ^™ V P™ Pm ■ ■ • ^P p PL M P %p * ■ M pPVJ 

Q=Glutamine, R=Arginine, S"Serine, 
^Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion. Vsnoisible 
nucleotide insertion 














. MDSTEPPYSQKRYEErVKEVSTYIKKIG 
YE PDTIAPVP I S G WNGDNMLE PRANI PL 
VTGWKATPIDCDASGTTLIiDA 


sat 1 




A 


JUOO 


41 
*»i 


"2Q7 


GS PHHPCAHI ERKKKPYNSNIGFYTKRN 
ALRVAEVWMDDYKSHVYIAWNLPLENPG 
IDIGDVSERRALRKRLKCKNFQWYLDHV 
YPEMRRYNNTVAYGELRNNKAKDVCLDQ | 
GPLEN 




I 07JJ 

« 


*» 


j\JOy 

• * 


■j 
j 




KTQIiDQAIiKI CQAMLDEAANQGWLVTVL 
NNTNLIQMAIQGRWVKDSSLLTLPYIEN 
HHLHLF KKRKP I M KGPHAKSRTS IBCLP 
ELI RACGGKDHVFRS LVQQ 


* 






c/\*7A 

DU/U 




Wj 


rKnOu V k^kjC SUB J. X (w X X wtr V f Jf JS 

QTKGPLGI DPQKG I GTAYEGHVRIP KPA 
GVKKGWQRALAIVCDFKLPLYDIAEGKA 
SQPSWISQVIDMRDEEFSVSSVIiASDV 
IHASRKDIPCIFRVTASQLSA9 




1 OOIC 




C/V71 






FDREGDGYIDYYEFGAALHPKKDAYRPT 
SDAPKTFHQGTRQVAQC I CAKRFLVEH I 
GENKYRFF1XNHFGDSHQMRLVRILLST 
VWVLDGK3GWMALDQFLTT 


JVJJ 


1 &Q7£ 
15730 




ju/z 






T TP HIT I? T nfl T T.21 C! If DDTT V T VMTX/fl & n T 

FDRDGDGYID YYE FVAALHPNKDAYRPT 
SDAYKIEDEVTRQVAQCKCAKRLIABHI 
GEKKYRFFLGNQFGDSHQLRLVRILRST 
VMVP VGGGWMALDE FLVND 


5036 

• 


18937 


A 


5073 


1 


393 


GEDAANNYSRGHYTIGKEI IDLGLDRI C 
i\unuu « 1 wix^vjr u v riiACuuviuouv x o 
LLMEHLSPDYGPKSKLEFSIYPAPQVFT 
AVHE P YNS MLTTHTTLEHS DCAFMGDNB 
AIYDICRTNLDIERTTYTN 


5037 


18938 


A 


5074 


39 

- 


482 


LGLHS AWRDD K I G YN PDTVS FVP ISGWN 
GDNMLEP S ANMPWFKGWKVTRNDGNASG 
TTLLRALDCILPPTRATDKPLRLPLQDV 
YKI GGIGTVP VGRVETGVLKPGMVATFA 
PVNDTTE VKS VBMHHEALSEALPGDNVG 
FNVKNVSV 


5038 


18939 


A 


5075 


1 


390 


GVSMAVSLVI I YQYWRNMPDPHNLPI V 
AGWKKYPLFFGTAVFAFEG IRWLPLEN 
OMKESKRFPOAl^IGMGIGTTLYVTLAT 

y£+ W J>\p4V^ i* Vf* XAiXl pk J>^pf JW A 4P« JkJ A w 4 ppppT1> pk 

LGYMCFHDEI KGS I TLNLPQDVWVYQS V 
KILYS FGI FVTYS IQFYV . 


5039 


18940 


A ■ 


5076 


330 


462 


VNFFFREGGTES PS I AQARVP WCS LNS L 
OPLPPGFKRFSCLSLP 


5040 


18941 


A 


5077 


1 


403 


GGWGVEAEDFEYAPDVEPLEPTLSNI IE 
QRSLKWIL^jGGKGGVGKTTCSCSLAVQIi 
SKBRBSVLIIFTDPAHNISHAFDQKVSK 
VPTKVKGYDNLFAME IDPSLGVAELPDK 
FFEEDNMLSMGKKMMQEAMSAF 


5041 


18942 


A 


5078 


348 


1 


RPQGNNI ERWQGGEVLIIQLLRIAKSVK 
NVQLLWKTQFLI KLHI LLFYD PR I PFVS 
NYPKEKKTYVHKKTYAQIFISLPRTFLF 
LRQSLNMLPRQDLSS 


5042 


18943 


A 


5079 


248 


3 


PGQVLEREHTCRRP WAWNVC I C LAGPA 
KPNREDVSSSQEESLQLNSIPPTPTLTS 
TAVKSRQPLWGLKEMEEEDGSELDF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 

etb 

od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 
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nucle- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cystetne, D=Aspartic Acid, 
E=G!utaraic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=i.ysine, L=Leucine, M-Methionine, 
N^Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=*Serine, 
•^Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon,/ a possible 
nucleotide deletion, ^possible 
nucleotide insertion 


5043 

• 


18944 


A 


5080 


3 


192 

• 


QDOS ELRS CHCTPAWLTERDTS SQKKKK 
KEKKTNNPAPELFFFLLLEKKGQNQGKP 
NMKKVL 


5044 


18945 


A 


5081 


10 


171 


CFCLGPVGVCRSLLKJPQLCFLGFPWLQD i 
ARQKFRSVLVEATVKLDELVTRSTRP 


5045 


18946 


A 


5082 


94 


1 


DFWPGKVAHACNPSTLGGRGGWITRSGV 
QDT 


5046 


18947 


A 


5083 


142 


3 


AASTGSHSVAQAGVQWHDLSStiQPLPPG 
FKRFS FLSLPS S WGRKI A . 


5047 


18948 


A 


5084 


257 


177 


ISLGEVAHACNPSTLGGRGGQITRPG 


5048 


18949 


A 


5085 


143 


1 


GGESHSDTQAGVQWCYLGSTQPPLTLTF 
KQFSCLNLPSSWNYRCVPP 


5049 


18950 


A 


5086 


3 


87 


HVGQAGLVLLASCDPPSIiASQSDGITV 


5050 


18951 


A 


5087 


375 


220 


LES LSQHIATRS KKPAFNLFAFGTLS P V 
STNWVGFNP I S FEGLMLI I FFFFF 


5051 


18952 


A 


5088 


40 


203 


IFLPLLLVSFFF YI LTWGFTFFWGLTLI I 
NDLLNFFFGNSGISFSWFGSRAGELG 


5052 


18953 


A 


5089 


1 


145 


MDMRVPAQLLGLLLLWIiSGARCDIQMTQ 
S PS SLSAS VGDREDGD VDAA 


5053 


18954 


A 

* 




146 


328 


FMP FPLLES LEI FS SKLFNP PNLFFFFE 
TESCS VAQDGVQWYDLGSLQP PSPGFKR 
LSCLS 


5054 


18955 


A 


5091 


338 


1 


PNLPS VQIiP PTTS CTPLIGDLL VARAHF 
AGWKHQGINSPSLPMAPREGMEAPRNPF 
HPHI FLCLI HGFHIiAVS KLYS LFYYYYY 
YFSRWS FTLVAQAGVPWRDLGSLQPLPP 


5055 


18956 


A 


5092 


2 


327 


PPLTPPIFPSSPKNKNPPPPHYNPPPPP 
FRTPPPPPYPPLSPPKSPPPPPRVDPPL 
PTfSIFPRPNLISPPPYSPFYLLPQLQAF 
P 


5056 


18957 


A 


5093 


133 


3 


AQTCTPSTQINSKWITDLNVKGKTIKLL 
KDNIREKLDDLGCGE 


5057 


18958 


A 


5094 


170 


350 


AGGQGGNFYSLQPNPPWIRESSHSTLPK 
FWNYRHTPTGPPNFGFFFLKMGFGLVAQ 
TIFN 


5058 


18959 


A 


5095 


133 


274 


RDIIIFLESGIKGYFUFFSDKITSELVS 
KIGDKNWKIRKEGLDEVAG 


5059 


18960 


A 


5096 


2 

• 


188 


REMQI KTTVRYHLTTARMVLI KKSEKNR 
CWHGCSEKGTLLHCWWECKLLQPLWKIG 
WRVDAA 


5060 


18961 


A 


5097 


235 


81 


FPKKKKAQ IKKPQAAQSLNLSLSLYI YA 
FT YTHTHRHTHTQTHTHTY IH I S 


5061 


18962 


A 


5098 


355 


169 


KILLLITDSCAHISCRFSECMKYFNFFF 
HPHSVYVTDDKIRLLEEQLPHVFSNKME 
PFKVCN 


5062 


18963 


A 


5099 


160 


1 


FSASYFNSKKKLIFLLLKLRWNFFVCFF 
GMESHS VAQAGVQWCHLGS LQPLPS 


5063 


18964 


A 


5100 


364 


151 


QMGFCHIGQADLELLTSSDPSASASQSA 
GITGVSHYASQEFLKKEFHSAHLIPLQG 
TCVQGKTAQPYSEAL 


5064 


18965 


A 


5101 


205 


3 


AASPGVQWPDSRLLQPRPFGVKQFFPLS 
FPSSWGYRGAAPPPGHFFFFFPSRDEIS 
VCLPGWSPTPE 


50& 


18966 


A 


5102 


349 


207 


DGASLCCPDWGLELLSSSGLPALAIiQNP 
EITGVGHHTQPWTLSLLIY 


5066 


18967 


A 


5103 


425 


0 


CEFFKNKDFCFLSKKKKKKKKKKKKKKK 
KKKQSSSQTSYS 
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sequence 


Amino acid sequence (A«AJanme 
C=Cysteine, D=Aspartic Acid, 
E^G I u tannic Acid. F^=Pbenvlalanine. 

mmw • mmmmm9 mmif m *m m\ m\ mm ■ — m mm m ■ • 

G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methiontne, 
N=Asparagine» PHProline, 
Q=Glutamine, R=Arginine, S^Serine, 
•^Threonine, V=Valioe, 
W=^Tryptophan, Y==Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion. \==oossibfo 
nucleotide insertion 


5067 


18968 


A 


5104 


437 


184 


. LFLFFFPPPKKVFSQKTPRSVYSPPPKK 
KKKNSPPPLKLGPPQEIFKTPPPPIFFF 
FFFMFFFS FFFFFKGHRPRFI SLWSPPL 


5068 

JUUO 


1 07W 


A 


51 OS 

JlvJ 


401 


in 


MFSPPFGGPRGGVPLRPGFKNPPGPKGK 
TP F FLKNQNYLG PPLY S P FLERLNKKKA 
FTLEAKGPNNPNSSRSPPPGAQKKNPPF 
KKKKKKKRKI I FMPQAHPAVYWCSGNQ 
LPITFPYSIISGPLSSFT 


JUU7 




A 

mm 


JiUO 






PNKKPS PPPFS PPHKTPSS PSPPPPPNK 
PTPTPPPPNKKNFPPPQPPPKKKNPPQT 
PPPQKKTPPPPKKKKHPPPPHKKTPPPP 

o ^» W #>^»» #^ ^» mm*m^mm»m ^mmmp m* m* mj mm)mmm^m* my mt m> mM 

HTKNPPPPPPLPPSLSI PPLFPLYINSP 
FFI PFFFFFFFKI FFYLGRVGORV 


5070 


18971 


A 


5107 


269 


115 


GFFFFFFFS FFFFFFFFFFFFFFFF FFF 

V* *• * * m *»» mr m> m> m* m, mi m> ^ m> m m> m> m mm m* m m> * 

FFFFFFI KKCHFI I FKYIGSTSN 


5071 


18972 


A . 


5108 


238 


4 


QWLPLVDRIWVILLFFD^LlttikMFPCiYQ 
IGLQKIKVKNNTTFYFLFSFFLRWSFTL 
VTQAGVQWCDLGPLQPPPPRFK 


5072 


18973 


A 


5109 

mJ A 


496 


0 


STPSRASPKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKSSK 


5073 


18974 


A 


5110 


1 


243 


DHLSLGGRGCSEQRSCHCTPAWVTERDS 
VS KKKKS FRALD I FFFNGKKI LGTLRGK 
NQKPPREVSSGNYFHFKGLFGPCYK 


5074 


18975 


A 


5111 


78 


4 


LGMVAHACNPSTLGGRGGRITRSGV 


5075 


18976 


A 


5112 


2 


96 


AGVQWRDLGSLQPPPPGFEQFSCLSTMP 
NFL 


5076 


18977 


A 


5113 


128 


6 


IYKDVLEPGVLRLLDVDNRWLPIEAPI 
RIIIPSQDVLHS 


5077 


18978 


A 


5114 


132 


2 


DDLSSLQPLPPRFKR FC CLSLLDSVfDYR 
RP PLRPAX QEAEAAE 


5078 


18979 


A 


5115 


1 


146 


AVETGFCHVAKAALELLHSNDPLASASQ 
SAG I TGMSHRAWPS LFS ILS 


5079 


18980 


A 


5116 


213 


3 


GAVGFSSGCFUjYLYALLGPFPPLPLLQ 
LKF I CWPGAVAHVCNPSTLGGRGGRI TR . 
SGVHDQPDQHDET 


5080 

mJ \J 


18981 


A 


5117 


407 


0 


LGWAWWFTPVI PT LRE VKAGGS P EVRS 


5081 


18982 


A 


5118 


215 


414 


KIRPKEIRNNGGSQQRKRAQLGKLEQTW 
P VLNFKAP FFGBQGD YYKS FFKTCLDNL 
PRRGKGVFFF 


5082 


18983 


A 


5119 


40 

- 


326 


KKKKKKKKKKKKFKKTGGAKWAPFLNLK 
KKGI KPQKKKGFFQGKKKRGKKKK1PKI 
LIQGFFQKKKNFSKKKISQKKKKNGFWP 
UGKGFCPKKKT 


5083 


18984 


A • 


5120 


99 


2 


GRSTRHSPAHTHTHTLRHTGTLLHTHSH 
RHSP 


5084 


18985 


A 


5121 


3 


219 

• 


ELRLCHCTPTWATRVRLSQKKKREKKKK 
NFVGKKKGKKGKKP PNPKKKGSGLGFRG 
FLFSKFFKTRGTTI I 


5085 


18986 


A 


5122 


262 


351 


TRPGTIiAHACNPSTLGGRGGWIMRSGVO 
DQ 


5086 


18987 


A 


5123 


340 


0 


KKKKI S QAW WWVP WPATWEAEVG 


5087 


18988 


A 


5124 


94 


2 


LMP 1 1 P ALWEAE VGGAPGQE FQTS LANN 
VK 


5088 


18989 


A 


5125 


310 


68 


LSPFYKLGFWGRVIYPPPPPKVSPPVFG 
APPPPEKKKKPPPPPPKSSIKKPTEKKI 
CLAASGKGLEVCFCKKPLIQSYFI* 


5089 


18990 


A 


5126 


343 


131 


SRPRRPGLELLTSDDLPAPATQ3AGITG 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Cfutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, Msoleucme, 
K=Lysine, L=Leucine, M=Methionine„ 

w» www^» w»- j ^w^ ^^w^^r ww M V A A ~ ww V B v v v # V* Ira 

N=Asparagine, ^Proline, 
Q=Glutflmine, R=Arginine, S=Serine, 

^w- ^w— — — - ww w w ww www ^ wf W w| V Hw « « w www V^-w W «p ^P*W 

T=Threonine, V=Valine, 
W=Tryptophan, Y^yrosine, 
X«=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSNHTQ I W P I S VGDIjMNTGWT PSNR I V 
SELLREIKFLRICL 


5090 


18991 


A 


5127 


2 


145 


FSLLGS S VGRLEGTGT ISTHCTLPASRD 
SPASNYRLTPPCPADFLYF 


5091 


18992 


A 


5128 


1 


130 


SRSVTRLECSGTISAHCNLPGSSDSPAS 
DYRLTPPCPADFLYF 

™ Wwww wh H 4^ w^ «f W AwF w» w wB 


5092 


18993 


A 


5129 


1 


338 


WRKEG I RFEKE KS KD FKNHVI KYLETL 
LYSQQQVCKLWVTPI 


5093 


18994 


A 


5130 

i 


346 


3 


DFFFFHP I LFFFFFKKTLFFS PPEGNGG 
HTLFLKNTPPRGRI FFFLPPLLLWGGWA 
PPPPPKKNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFMP PTQSLFKDQEGPVQGGEEA 
PV 


5094 


18995 


A 


5131 




114 


FLHIGOAGLELLTSDDPPASASOSAGIT 
GMSHEAQP 


5095 


18996 

1Q77V 


A 


5132 




IRft ' 

lOU 


AASTDS I EGHGASLP S KKTPS EEDFBT I 
KLISKGAYG 


5096 


18997 


A 


5133 


338 


216 


RGENRL P PGGRGFS GP KSHFCP PAWATE 
RDSLS KKKKNPV 


5097 


18998 


A 


5134 


337 


3 


I WLGNGKGI IRNNQI FSNKEAGI YILY 
HGNPWSGNHI FKGRAAG lAVNENGKGIi 

ftudif V V kJVliilJhii i^^UVVul/l V Hull VwfXVwXJ 

I TENVI RENQWGG VD IRRGG I P VLRSTL 
I CFGYS DG VWGDEGKGLI EGUP I SAN 


5098 


18999 


A 


5135 


227 


343 

▼ w» 


NYVKEKLI PTWNWMVS I MDS TEAQLPYG 
SALTSWDPGQ 


5099 


19000 


A 


5136 


166 


375 


ATFVSLG VFCS AVI LL YFKNMMKLDLSL 
TPHTTINSKWIKRLHTRPEAIKFLiEVNI 
GKKFFDIGHBIIL 

^W* • w>^k W Wb WW' W w^i ^» ^wp WW wBw 1 


5100 


19001 


A 


5137 


90 


2 


KTTLWPGAVAHACNPS ILGGRGGRITRS 
G 


5101 


19002 


A 


5138 


213 


379 


PTFQSVGETGSLLKMHILGPGAVAHACN 1 
PSTLGGRGGRITRSGVQDQPGQYGETP 


5102 


19003 


A 


5139 


384 


2 


FFFFFFLSVMESCSATQAGVQWCDLSS 


5im 


jyuu 1 ? 


a 

** 


5i in 


f 1 


j I / 


RYSFSALKDRHNAVE VNW IDPNNGWETA 
TELVEDTQAIARYGRNVTKhSDAFGCTSR 
GQAHRAGL 


5104 

* 


17WJ 


A 

w> w. 


5141 




573 


VKAEAAYKKADD I WNLRKDDYPVNDEAR ! 

ARYWDDREKARLALEAARKKAEQQTQQD 
KNAQQQSDTEASRLKYTEEAQKAYERLQ 
TPLEKYTARQEELNKALKDGKILQADYN 
TLMAAAXKDYEATLKKPKQSSVKVSAGD 
RGEDSAHAALLTLQAELRTLEKHAG 


5105 


19006 


A 


5142 


432 


144 

< 


GGFLPKAFLPPKKKKGAFFPPPGKGAIP 
PLFWGGWTVFLIPEKNPPRGFPPPKRGB 
KPVGGPFFRIFAPPPQKKIGGGGFFLER 
FFFWDWGKTGP 


5106 


19007 


A 


5143 


403 


6 


FFFFKKOIGPRGGGPPGKSlPPLGGGtitefl 
GSPRPGVLTPLGPQGKNLFFLKKKKKLG 
GGGGPRNPNFLGGGPGKSLYPGGQRFQG 
PKIIiPFFPPRGKKKKI FFPKKKKKKSHR 
KLIQTLILQTGKLMKMLWRT 


5107 


19008 


A 


5144 


115 


427 


KKKKKKKKKKKKKKKGGGPKKKKNFSPG 
GGKFCKFPFNGAKKKKPRAPVKKTGEKKK 
GGKKKKKCFEKNPFFGGGKKKKKKCSSS 
YPCPKKQ 


5108 


19009 


A 


5145 


2 


79 


GCS E PRS CHCT PAWVTEGDS I S KKK 


5109 


19010 


A 


5146 


317 


444 


I HQP VCVAHA YNP S TLGGRGGH I TRSGD 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E^=GIutamic Acid. F^Phenvlalanine. 
G=Glycine, H=Histidine, I»lsoleucine, 
K=Lysine, L=Leucine, M=MethfonIne, 
N=Asparagine, P=Prollne, 
Q=Glu tannine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TrvDtoDh a n. Y c= Tvrosine. 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion. V— oossible 
nucleotide insertion 














RDHPGQHGETPYPL 


5110 


19011 


A 


5147 

• 


3 


413 


S FYRGFI PVLQTVTVLI IGDPSLHGDAW 
SWMEFFLTVIFIALWMLPLIGLTIVVNA 
IWFQDIADLAPBVSWTKPHPPP8VSQII 
ADMLFPLLLQALFLIQGIAVSLFPIHLV 
GQLDSLLHMTLLYSLYCSEYRLVIN 




i onio 


n 
n 




ACiQ 




OTKFRHVAOAGPH FLGSSDVPTS ASOS V 
G I TGMS LHADHQHFFTASMGRAFGEN 


5112 


19013 


A 


5149 


284 


1 


AEAS RAHMEAHARTQGTPGS KAEGVGP V 
EEYRIWRRPGPDRAHASDSBESGEPGGP 
PPHPANFVFLVETGF FRVGQAGLEFPTS 
GDPAASASPV 


5113 


19014 




5150 


439 


120 


FGS PRLECS GVI S AHCNLRLLGS SDS PA 
S ASHVAG I TARPNAR I IKAFGAPSTCTV 
VWERHEQQTCIQRSDLPAGRQKKTGPKN 
RPGLVAFPFNAS PLGGPGGR IA 


411/1 




A 


^1 si 


414 


n 


FFFFFFFFFFFFLPFVQSFYIYIYMYI 


5115 


19016 


A 


5152 


47 


179 


1 CI^LFLTVHVCVC VCVCVCVCVCVCEM 
VFGVSIPCN 


5116 


19017 

- 


A 


5153 


280 


2 


KTPFFLNLKEKTRAPFFLKKTFFKKKPF 
KI FFSKKRNFFFFPPPGKRPPTLFKKKF 
FFFFQKKKNFFFS PKKKKKKKKKK3RRS 
RSRTSPRV 


5117 


19018 


A 


5154 


296 


406 


DRVLLCHSS CSEVAESRLTAASTSWAQA 
ILLPQPPEW 


5118 


19019 


A 


5155 


114 


402 


VWRIKASVEKKKKKKKKKKQKKKKKKKR 


5119 


19020 


A 


5156 


110 


I 


QRFLLTLKS L I LRLGAVAHACNPS XLGG 
nnrii? tmrs 

UUOAXl'lAO 


5120 


19021 


A 


5157 


382 


95 


SSSSPHFLTPSQLRLFFPFFPLKIFFFP 
KAFNFCX3GVFP I FS P P KKKF F FQNSQVG 














FKNPPOKEKNFS FPPPVKFGPPKGFFKR 
PPPFFFFFFFFF 


5121 


19022 


A 


5158 


2 


350 


TIjQPGRQSETPFQKKKKKKKKKKKKKKK 
KKKKKKRGGGE'LKNIiGGGP KI WGVEKFF 
FFFFGGGYKKPLGFFIiEKIFFLGGGIWA 
PPPPKKISFWGKKKFFWGGGGKTPLFFC 
GGKK 


< loo 




a 
** 




i 

* 




PTRTLYLVKLGLMGPAEIFLSS BX3EIRL 
I D YDKMVDHRGVRWAFGQWAGVAGKYA 
RAYITHFLNWLGB 


5123 


19024 


A 

• 


5160 


1 


208 


PTRPLYLLKLSIiMGPVSIFLSSFQEIRL 
IDYEKMVDHRGVRVVAFGOWAGVAGKYA 
RAYITHFLNWLGB 


5124 


19025 


A 

* 


5161 


419 

• 


25 


AQKKKKKKKKKKKKKGGGPLKKNFGGGK 
NKRGKKKKN FFLKRGGKKKTGG I LEKKN 
FFGGGKKGEKPPQKKKALKEKKKFLRGK 
GGKKPLNWGGKKKW 


5125 


19026 


A 


5162 


475 


150 


KPRTSGTARVPGDPPPRCLDDKRLPSRP 
VIFVFLAETECDRVGQDGLQLLTSGDHP 
HLTFPNCYNYRREPPRPTSSGFQHDIET 
LHYRRGKADRFKAHLPK 


5126 


19027 


A 


5163 


237 


409 


FHNQKTYLDSALCTLVYFWFVLRDGIF 
LLPQLBCSGTVMAHCNLKLLGSRSPPAL 
A 


5127 


19028 


A 


5164 


373 


476 


LGFILLSFLETRSHSVAQAEVQWCDHGL 
LQPQTP 


5128 


19029 


A 


5165 


374 


132 


I F YSPFLYYHMLGKYHKI KNLKI KHMWL 
DAVAHACNPSTLGGRGGRITRLGDRDHP 
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sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
o^Oiycine, H=Histidine, J^lsoieuctne, 
KpLysine, l^Leucine, M=Methionine, 
N=Asparagine, P=ProHne, 
Q=Glutamine, R=Arglnine, S=Serine, 
T=Tnreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, i 
X=Ui>known, *=Stop codon, /^possible 
nucleotide deletion, V=possibte 
nucleotide insertion 














GQHDET P S LLFL WT FAS S LS GF PL 


5129 


19030 




5166 


332 


2 


RPQGLANCFPGALPPTKSEGFPGQSPGF 
QI w GGKAFFLRGFPFFPQ VES KGAVS PP 
CKVPPPGLRPFSAPTPPKNGDKRGPPPG 
RATGFFFFFVFLVETGFHHVSNDGLY 


m> * ^ j\ 

5130 


M* ^% J>V^^ Ml 

19031 


A 


5167 


238 


1 


GGFTGTGGKYE PAGIVHRGEFVFTKEAT 
SRI G VGNL YRLMRG YATGG YVGTPGS MA 
DSRSQASGTFEQNNHGSAGESRG 


5131 


39032 


A 


5368 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 

* 

P 


5132 


19033 


A 


5169 


3 


127 


DEDFSDFDEKADDEDFVPSDASPPKTKT 
SPKQVSYLIWVLS 


5133 


19034 

i 


A 


5170 


20 


254 


I AP PAR I RHE ERERERE RERERERERER 
E RE RERERDTRAE WRARLF FFFFFFLPA 
DQCFF I KPSGQRFLPPGGGGVP 


5134 


19035 


A 


5171 

■ 


36 


479 


TXGXDNKKDLL I S VGDLVDRGAENAECL 
ELITFPWFRAVRGNHEQMMIDGLSERGN 
VEmWLLNGGG WVFNLD YOKE I LAKALAH 
TADBLPLIIELVSGDKKYVICHADYPFD 
E YEFGKP VDHHQVI WNRERI SNSQDGI V 
KEIKGADT 


5135 


19036 


A 


5172 


278 


448 


RFLCPLLHPFFSSETKSLTLLPRLGCSG 

m mf*mi h ml m 9 ^Ma ^bi 9 mb mm ^bi mm a mbl mmi ^bj «t Mb n* bbbi a mbbb ^bi 

VIIAHCSLKLLGSSNPPTSASRV7VGMTG 
S 


5136 


19037 


A 


5173 


205 


1 


HARLI FVFLVETGFHCVGQAGLELLTSG 
DPPSASQSETPSPLKILKSAGOGGANST 
ALASGLL FSLGP 


5137 


19038 


A 


5174 


409 

- 


194 


FFFLRWSLALSARLVQWRDLGSRQAPPP 

^M Vshh ^bb, .mbiMp bbi b# MP mb. bmh Mfe. ^bl «h mm mi Mk awp bw »MA MM 

GBTPFSCLSLLS IS FQPSATYMRFLNTI 
TIVEYSFAYFPISQP 


5138 


19039 


A 


5176 


339 


472 


Mb M ^— & M M~MP Bk B b by M BP rfkUH MBh ^» MBV 

AKFTCI SSNI KLSNTRPGTVAHAOGP ST 
LGGRGG Q I TGSGDGDH 


5139 


19040 


A 


5177 


426 


148 


GGRGPPPLSHPFGGGGGGGPLRAGGQKN 
PGPKGET P FFGQTQKKNRGGATPPFSQK 
LGGEKHKNSFTPGKENSIKFDFPPAPPT 
WGEKKNFFF 


5140 


19041 


A 


5178 


328 


474 


NNTEYNKNRLGAVAHACNPSTLGGRGRW 
ITRSRDRISP 


5141 


39042 


A 


5179 


448 


330 


V^MnfeM^B n ^^iM W V^NM m Mt n MP M* ^Mn M| #M Mt ML mWj M^ ^M 

ETGFCHVGQAGLEIiLGSSDLPASASQSA 
GIIGVSPHAWQ 


5142 


19043 


A 


5180 


259 


14 


Mi M M w Mk m^ ^9 ■ M MV Mh BVtt MM V ^tf> ^ MM 1 MM * MBM M Bl w «Ml b V BB1 

LRYKAI LCS W IRTTNI VKMAELPKVI Y I 
FNAIPIFKIPDMFCRIARFILKCMWNVK 
VSTIDKTI FKKENR vGIPPRVLMVV 


5143 


19044 


A 


5181 


2 


423 


KFYATLVRYVGDRKNLVCREMSMALLSN 
LAQGDALAARAIAVQKGS IGNLI SFLED 
GVTMAQYQQSQHNLMHMQP PPLEPPSVD 
MMCRAAKALLAMARVDENRSEFLLHEGR 
LLDISISAVLNSLVASVICDVLFQIGQL 


< 1 A A 

J 144 


1SHJ45 




3loZ 


loz 


312 


pt?t n\7r* TTHw/an 2ir2T . vt ,dc c c c d dut .a c 
r r X U " v r nil VA^iiibu Aiir o so o r rnuiu 

QSAGI TGVSHCAQPG 


5145 


19046 


A 


5183 


301 


472 


GI PFFFFFLRRSFALVAQDGVQWRDLGS 
PQPPPPGLKQFFRNSTAWTKGLQPFPRL 


5146 


19047 


A 


5184 


216 


413 


KPXXKKKKKKKKKKWCKKKKKKXGGAVK 
KKGPQKTP VKPPGGFFFAPPLGVS PPPG 
VFFAGGGAPP 


5147 


19048 


A 


5185 


357 


465 


IKPTKILAPGPGAVAHACNPSTLGGQGG 
WITRSGD 


5148 


19049 


A 


5186 


467 


0 


FFFFFFFFFRFLFFFFFSEKK 
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5149 


19050 


A 


5187 


346 


3 


RFLPLGQGGVEILTPRSTCIiGPPKWWDY 
KGEPIiRPAQKCFFFKKKRRRKKVQNSVA 
VYMVYHSMCGENKLRWRWWG I FTHTHTH 
THTHTQKKQN I LS RTHKKLVI TTGSEEG 
TL 


5150 


19051 


A 


5188 


II 


109 


GETPS LLKVQNI XWAWWHMP VI PAMWEA 
BARES 


5151 


19052 


A 


5189 


3 


151 


QLLRRPRQENHLNPGSRGCSEPRLHHCT 
PAWATVQDS I SNTNNNNNKCP 




1 7 V J J 

• 


A 


J l?U 




J ID 


AVAADKGVPLYRHIANLAGNPDLILPVP 
AFSVINGGSHAGNKLAMQEFMILPVOAS 
S FKEAMR IGAEVYHHLRGVI KAKYGKDA 
TNVGD?GGFGP 1 1 LDNNDALELVKTAIQ 
AAGYPDKGAIGR 




IQ054 


A 


5101 


55 ^ 


421 


ARVAD VCES M KEHLLVLVERAKYI PGFC 
ELPLDDQGALVTAHAGBHLLLGATKRSM 
VFKDVLLIjGNDY IVPSHCPELAEMSRVS 
IRILDELVLPFQELHIDDNEYAYLKAI I 
FFDPDKETEA 


5154 


19055 


A 


5192 


163 


466 


TCFLGSQSAGITSVSHCPSREVFFLKLI 
HWRQGGQVALLVATPHS PCCPQVRLAP I 
PRARHDFACASL I FACILLVHVLLMPRV 
S AGRGVGVR PAG IQAGR 


5155 


19056 


A 


5193 


470 


0 


ECLWPGTVAHAS N PTTLGGRGGRIMK 


5156 


19057 


A 


5194 


491 


281 


RGLAUjPMGQCSGMMLAHGSLDLLDS sd 
PPASASQSVGIMGVSHHAWPSLVNISFV 
CLIHRSPKTKPEGR 


5157 


19058 

I y \J mf \J 


A 


5195 

ft 


3 


222 


LS FHS LHRCLYMLGTTS BNVS P FSLLEL 
IjSRLATLLGDYCGSLSEGTISRNVALVY 
ELLDEVLRLESRCVAQA 


5158 


19059 


A 


5196 


20 


191 


STW WNS SRGGGCS E PRSHHCTPAWVTER 
DSVS KT KKK INNKPALAL PAVH PTSHPG 
S 


5159 


19060 


A 


5197 


460 


350 


RRLRRENHLNPGGGGCSELRSSHCTPAW 
VTERGPSD 


5160 


19061 

• 


A 


5198 


174 


41 


SLGELLTECLFETKSHSVTQAGVQWRHP 
GSLQPPPQGFKRIPPH 


5161 


19062 


A 


5199 


131 


454 


FE YFKNRKPFF FFFFFGKGVS FCAPGGK 
AGDPFKLREPPPPRVKGFFGPPPPSGGN 
NGPPPPPRVFFWFFKKRGGSPFWPGGVL 
TPGGGP PPPAPQRGGINGLDPPAR 


5162 . 


19063 


A 


5200 


168 


51 


T VKT P I RP GAVAHACNLST LGGRGMW I T 
RSGARAQPNR 


5163 


19064 


A 


5201 


414 


497 


LLFXXXXDNGS I YNPEVLDITEETLHSR 


5164 


19065 


A 


5202 


381 


614 

w A- ■ 


VQPGAPPEWVALSTCPSAAPEGSQQPYI 
PPTFCFFPVKWLLSVTWLYLFIYFLSWI 
SVAQAGVQWCNHGSLQPRPSRL 


5165 


19066 


A 


5203 


377 


485 


PKEPGPQPCAPQPQPHAPQPQPRAPQPQ 
PCAPQPQP 


5166 


19067 


A 


5204 


54 


434 


MRTPENLELTNPQE FGSSWAAVECPDTL 
DPRDMCVLNPLREPFAKKECSHjLSEVF 
EICHPWDVTWFYSNCLTDTCGCSQGGD 
CECFCASVSAYAHQCCQHGVAVDWRTPR 
LCRECPRQSPEGTE 


5167 


19068 


"A 


5205 


201 


418 


GDAGAHAVLRLLPGHPGTSHLPFSSLTT 
S PLGLKRVTGGFNS KNRCDARTYCYLLP 
TFAFAHKDRDVQDETY 


5168 


19069 


A 


5206 | 27 


177 


NSFHHAGQSGIiBLLTSSDPPASASQSAG 
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ITGVSHRAQPKAGTFLSPFHR 


5169 

■ 


19070 

• 

• 

i 


A 

• 

• 


5207 

- 


6 

• 

- 


1175 


DLGPHTPAWTRPKREDLVYQSTVRLPEV 
RISDNGPYECHVGIYDRATREKWLASQ 
N I PLNVMAP PTS I E WAADTPAP FSRYQ 
AQNFTLVC I VSGGKPAPMVYFKRDGEP I 
DAVPTiS E P PAASSGPIjODSRP FRS rJ.HP 
DLDDTKMQKSLSLLDAENRGGRPYTERP 
SRGLTPD PN I LLOPTTENI PETVVSREF 
PRWVHSAEPTYFLRHSRTPSSDGTVEVR 
ALLTWTLNPQIDNEALFSCEVKHPALSM 
PMOAEVTLVAPKGPKIVMTPSRARVGDT 
VRILVHGFQNEVFPEPMFTWTRVGSRLL 
DGSAEFDGKELVLERVPAELNGSMYRCT 
AQNPLGS TDTHTRL I VFENPNI PRGTED 
SNGSIGPTGARLTLVIiALTVILELT 


5170 


19071 


A 


5208 . . 


401 


3 


FFFFFSETESHSVAQAGVQWHHLGSIiQG 
PPPGF 


j 1 1 1 






59 no 




77Q 


GFFKKGPPFFFFFFFFFFFFFFFKMVWIi 

It * At +. Mm Mm ± S, M* MmM»±.K±MmMm-l, +\A 1 V I I JUJ 

RLSGS KVE WVWGS Q KIiYMNFGCCRGRS W 
FT 


5172 


19073 


A 


5210 


198 


2 


PPQAKILSSSSPPIRPPPKKGFFPKNPQ 
VGFYSAPHKEKTFTLPAPVKFGPPKDPL 

■ V VB*» » ^ * • ^ Mm MMW Mm m W Mt^MW Mm Mm Mt^MmW Mm m\mJ 

KRPPP 






A 

*• 






77? 

• 


GSTLRLTGRSMPASSSTMAPSSKRIiIiLP 

M^ MAfJ Mt^mmmW • m mm «|fey mm* k** •» % MUm* Mmf warn* %^^^AJ<h*»# *fa 

RPERSVPAAAGTAGHHEASRNCGRGGAG 
ADBGPATKGDSHPKPGYCRAAHPSAAPW 
PPGPEKNFMRVG 


5174 


19075 


A 


5212 


103 


203 


DKVSLRCPGWPQTPGLKQSSCLSLPKCW 
DYKCE 


5175 


19076 


A 


5213 


409 


249 


RRNQVGGPRQLNPGKRIKSPGIDPYIYG 
QL I FDKAVRAI WWRKGS S FLQKALE 


5176 


19077 


A 


5214 


2' 


170 


ERECILGY I QLRKGPNWGPYGLLQPFAD 
AIKLFTKEPLKPATSPYPLQSPAFPLKP 


5177 


to 


A 

Mi m 


5215 


270 


464 


GQLNKNVWGWDPGKVTFRDGVSLYCLHW 
NAVMEGHRHSVPQPQTPGFKQS CLRLP 
SSWDYMPLY 


5178 


19079 


A 


5216 


156 


248 


PDKSYI ICVCVCVCVCVCVCLCVWVCME 
CP 


5179 


19080 


A 


5217 


412 


2 


PPPQIFFFSPPHFFSPPPKGGFFPPPPP 
KIFFFPPPLFFFWGFFPPF3PPPKKFFF 
PKSPPFFFFPPPKKKKFFFFPPLFFFPP 
PGFFLTPPPPFFFFFFFFFFFFFFFFFF 
FRMGCDRWHGEFWILRDGFD 


5180 


19081 


A 


5218 


165 


1 

m 


QVKYHNKKTHNPIKIGKRSEQTFHEKDT 
QMANKYMKRS PTS LRI KG IQIKT I MRY 


5181 


19082 


A 


5219 


408 


88 


PSSSPHFFSPPPLGGVFPPFPFKIFFFP 
PGLFFFWGVFPLFS PPPKRVFFPNSPPG 
FFFPPLFGKKFFFS P PPFFWPP PGFFLS 
PPPLFFFFFFFFFFFFFFFF 


5182 


19083 


A 


5220 


3 


259 


NLLLLI VP IL I AMAFLMLTERKILGYIQ 
LKKKKKKKKKKKKKKKKKKKRGGAFKKK 
PRGGQNFPGEKKKKFFFLREVKKKPRGN 
F 


5183 


19084 


A 


5221 


10 


479 


RLMPAESDGRHRERERERERERERERER 
ES S S ARARCVSLSLDR YKNRVYMKVAEA 
LCECRLLAYISQAPTQMFFLLRLIWIIH 
AHTLTQENDMCLHTTLEILTLARQRERV 
PLYLRVLQRMEHKKKHPGFVLNNSHNLV 
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RLWQQLYLHRDRDS I C 


5184 


19085 


A 


5222 


142 


56 


KFQPGAVAHTCNPSALGGWGGQIKRSGV 


5185 


19086 


A 


5223 

* 


14 


471 


LEMRSEIDSQNYCN CDAGRNEWTSDTI V 
LSQKEHLPVTQIVMTDAGRPHSEAAYTL 
GPLLCRGDQS FWNSASFNTETS YLHFPA 
FHGELTADVCFFFKTTVSSGVFMENLGI 
TDFIRIELRGKLATLDKSQGTHYLAIKA 
LTQKKKKKKTRG 


J lOU 


1Q0R7 


A 




*tOU 




FFFFFFFFFFFLIVAOFYFL 

M,M,±itMm±±MwM»+.m» fOX" ^ •» JU1 


5187 


19088 


A 


5225 


16 


422 


VRRTAR I RHEGKP YECNACGKAFNRSAH 
LTEHQRTHTGEKPYVCKECGKTFSRSTH 
LTEKLKIHSCVKPYQCNECQKLFCTRTS 
LIRHQRTHTGEKP YQCNECGKS FSLSS A 
LTKHKRIHTRERPYQCKKKKKKK 


J loo 

i 


1QA8Q 




DZZO 


oo 




YLFDB I MYGGH I TDDGDCKLCRVYLEB F 
MNPSLLAALSGTHACGEKPRLPANSHVS 
EPSWKWILQPQSSLQMTAAIiANILTATS 


5189 


19090 


A 


5227 

■ 


473 


9 


FQITATPHIjAVYDPTVQFEFWFSSKQIA 
DIRQVEASTRYLGTALYWIAAS INIKPG 

WnVV77VTPQUTJTVnTTQaPVRaVlf3PJV<5r>r> 
AL/Z IE 1 IRC) Vii 1 VVxAdZxrvT VOA ¥\3i\£%S9iJU 

AEGYLDFFKGKIAESHLGKELLEKVELT 
F.T5 T ASTJIiRRF 9 PCRWICDAKOIOTN AMWA VTC 

IEQTKVGERDVTSG 
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EADYTLGPLLCCGDKS fwnsasfntets 
YLH F P AFHGELTADVC FFF KTTVS SGVF 
MENLG I TDFIRI ELRGRIATLDKSQGTH 
YliAIKALTQKIOaCKTRGGAGPPFPLIGS 
RITIHGPPFNNAAMREK 


5191 


19092 


A 


5229 


170 

1 fv 


25 


KTPLLGPGTVAHACNXjSTLGDQSGCXMR 
SGVRDQLRQHDETPSLLKRI 


5192 


19093 


A 


5230 


465 


0 


VVLYPPQKAQKKKKKKBCKKKKKKKKKTK 
ARG 


5193 


19094 


A 


5231 


419 


56 


CVLLRSTKKKKKKKKKKKKKKK 


5194 

• 


19095 


A 


5232 

■ . . 


79 


451 


AGSNLQEHRGLRPESPFNPGGGGCREPR 

QGRHGPFFPGLGPEGFLPRRPGSRKRRK 
GRNVP G P ELKG PGAVL FSRKRRGGADLR 
PPRKGPRGGGGG 


5195 


19096 


A 

• 


5233 

• 


39 

- 


467 


VQQQQRAQEQQQQHPVLHLQPQQIMQLQ 
QQQQRQI SQQPYPQQPPHPFSQQQQQQQ 

QAHPHQFSQQQLQFPQQQLHPPQQLHRP 
QQQLQPFQQQHALQQQFHQLQQHQLQQQ 
QIAQLQQQHSLLQQQQQQQIQQQQ1VQRM 
HQQ 


5196 


19097 


A 


5234 


194 


45 . 


M I LKFFFNFKIHFFFFI1RQSI1AL VAQAG 
VKWCNVGSQQPPPPGFKRNST 


5197 


19098 


A 


5235 


204 


475 


ASITVHLWYVEKCYSQSTEKVTPMPSSS 
WFGPQPDAQLKHGLRPGAVAHACNPSTM 
GGRGGRITSSGDRAHP 


5198 


19099 


A 


5236 


59 


357 


NHWRKIYCKVYNRKKIKHNTTNTTKITYQ 
NFQRNRTKS S FT I FRIQSHYNKH I KKH I 
QRKRQS I ETDIjEMTQMLELVW I FKWIV 
ILTKIKTPKTLKING 


5199 


19100 


A 


5237 


447 

.. _ 


166 


ELLAFWQTQ YNNMNAVNLFFFMI KAPG I 
PTPSFASRRTHSPFFSFLFFLFFFLGBCK 
GFPLCGPGGPKSLNLKTPPLGPPKGGK 
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5200 


19101 


A 


5238 


34 


296 


LS VAE FLAF WQTQ YNNMNAVNLFFAW I K 
VPGIPTPSFASRRTHSPFFSFLFFLFFF 
LGKNGFSPCGPEGPKFLNIiITPPLGPPK 
OWE 


5201 


19102 


A 


5239 


340 

* 


472 


S LN ETHVRS PAHHRPGATAHACNP I TLG 
GRDGLIMRSGDQDHPG 


5202 


19103 


A 


5240 


460 


0 


PKKKIFTIYFFYFFFFFFFFFFFFFL 


JZVJ 


IQIOd 
J y i\rt 


A 
** 




161 

J U J 




SFYSTHRKPCHWPGSVAHACNPSTIjGGR 
GGRITRSRDR 






Tl 








LKYYLI FI YLLS F F FLRO S RS VGQAGVQ 
WRDLS SXjQPPNSTALQYR 


5205 

■. 


19106 


A 


5243 


11 


442 


DTIRWGLPTLGSKSTTNEKKREKRRKKK 
EOOOSEANEIiRNLAFKKIPOKSSHAVCN 
AQHDLPLSNPVQKDSREENWQEWRQRDE 
QLTSEMFEADLEKALLI hS KLE YEEHKKE 
YEDAENTSTOS KVMNKKDKRKNHOGKDR 
PLTV 


5206 


19107 


A 


5244 


459 


333 


• FLRVTQAGLKXiLSS S DPPTLAS P KCWDY 
RHBPLCPAQWSVS 


52U7 








900 


*tv50 


FN RTWRKRGLGGF IHLNTjIOTSFFAHL 
WSLTYVLCVSFFVCLFLRQSHFVTQAG 
VOWHNLGSLOPLP " 


5208 


19109 


A 


5246 


227 


30 


VFFARWVFLRQDLDLSSRLECSDVILVH 
CNVRI±AGS RGTP S S VSKVAGTTGMLYHT 
WVFFYEFHR 


■ 


1Q1 1ft 


A 


- 


173 


442 
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FLGSSSKAAITLYYCQYMEFFFFFFFFE 
KNFFFSPPGGGEGPNFILLEFPPPGLKK 
ISCPTPPRGWDYGVAPPPLVWFWFFKKN 
GVPPPW 


5210 


19111 


A 


5248 


255 


358 


TG PGTVAHTCDPS TSGGRGGQ I TMSGDR 
DHPGQH 


5211 


19112 


A 


5249 


3 


130 


QPQLAAPS ISWAPTSASQVAGTTGVCHH 
AWLI FLFWVESRRG 


5212 


19113 


A 


5250 


90 


1 


S TRLENNG TMS AQCNLRLTGS SNS FNS A 
SR 




191 14 


A 


5251 


358 


447 


S F IS YRRRSPSP Y YSRYRSRSRSRS YTP 


5214 


19115 


A 


5252 


55 


202 


CWNTSMRGPIWEAEAGESLEPGGRRSCG 
EARTCHCI PAWWRAKLVDAA 


5215 

• 


19116 

■ 


A 


5253 


330 


3 


EEMGFSHVGQGGFKIliASGDTPAWAFQK 
GGI SGVSPRAWAGFI IFIWNFSGGPQTG 
FPFYPFRGIFFPRAKINYIRVLNPKGLC 
QKGVFFFFFSGF PHYRCRLCQ I SV 


5216 


19117 


A 


5255 


175 


3 


MYLRFPLGLSLSERPGMVAHACNPCTYW 
ESKAGGSLSLGVQDQPREHGGTLCLQKS 
K 


5217 


19118 

1711U 

t 


A 


5256 


263 


i 

M 


RPRRRFNRDKTALLQE MPFRTFTADGKL 
ISDFKDSKDGLTLYFVETASRSWIAGL 
ELLTSSDPPTSASQSTGITGVSHFSQPG 
IiAF 


5218 


19119 


A 


5257 


1 


103 


CHVSQAGLELLASSDPPASASQSVGITG 
VSHRTW 


5219 


19120 


A 


525* 


128 


2 


IHKWRDCKMVQPLWKRAWQFLMNLNriHL 
S CDPTVLLLD ILTQ 




19121 


A 


5259 


381 


2 


FFFFFSEMESPSVAOAGVQWCGLGSLQA 
PSPGFT 


5221 


19122 


A 


5260 


2 


173 


ENYLNMGGGGCSE PS LHHCS PALAIERD 
SVSTTTTTTKQQQQNNQLGLFSYPFNPL 
I 
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SEQID ' 
NO: of 
nucleotide 
sequence 1 


SEQID 
NO: of 
peptide 

• * 

sequence 


M 

eth 

od 

• 


SEQID 
NO: in . 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine . 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H-Histidine, I^lsoleucine, 
K-Lysine, L=Leucine, M=Metblonine, 
. N=Asparagine, P^Proline, 
Q=Glu tarn ine, R-Arginine, S=Serine, 
^Threonine, V^Valine, 
W=Tryptophan, Y»Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 


5222 


19123 


A 


5261 


1 


175 


I KGNQ I GKKE VKLFLFTDDMILYMENPK 
DSAKNYYKQSKTPCLPWLGRSCKRTCP 
LA 


5223 


19124 


A 


5262 


2 


145 


QVSLKLLGSSDPPALTSQGAGIMGVSHR 
TRPRNTDFKNNTQKSKNTQ 


5224 


19125 


A 


5263 


182 


3 


STTS FLNFFLTEMVSHYVSQAGLKVLGS 
SDPPSSASHSAGITGMSHHTRPQQPLLN 
LKV 


5225 


19126 


A 


5264 


183 


369 


WNSLDFFFFFPSFFRGYSSMGKVQDAFI 
FYRRF IDKSKPRANTWGS IGGLYQQQIH 
PMDALR 


5226 


19127 . 


A 


5265 


329 


207 


RGCSELRSRHCTPVWVTEQDSVSQKRKK 
KSPCHLQLGTSQ 


5227 


19128 


A 


5266 


177 


13 


TLT I LLNFFFLRQS Y S VAQAG VQWHNLN 
SLKPPPPGFKRFSCIiSLPEFQQGHNIR 


5228 


19129 

- 


A 


5267 

■ 


474 

• 


122 


FICSPHPGGFIiTLGPKLNILGPSQPLFP 
LRGGS KKQGIPFYS YECCKGPPGAGPPGS 
ARYSNPFGGYGGRVPLPPBFPDPPGEHK 
PRFFLKKKKKGHQVPTSKYLPEYLPAYY 
NASLE 


5229 


19130 

• 


A 


5268 


55 


311 


GIQTFGKNWVAGRSKNVGMP LAMLLHT 
DGEHERPGGDATVT IAHRYTPKEQLKIH 
TQLADI X I VAAGSLS ISHfiGVQWRNHGS 
Ii 


5230 


19131 


A 


5269 


238 


133 


NKNIWVQKKCVFYIYLPLICVCVCVCVC 
VCVCVCV 


5231 


19132 


A 


5270 


336 


517 


PVLYILEHSPYNAS IYQLALKKYQSRPG 
AVAHACDPNTLGGRGGRITRSGDRDHPG 
QRGE 


5232 


19133 


A 


5271 


262 


467 


VLSMRPRIHGSAAREEDEHPYELLLTAE 
TKKVVLVDGKTKGTFPTTPGRTNSKGI F 
KVCDPEWKGKMS 


5233 


19134 


A 


5272 


213 


2 


MKIYINVYILLFSLKKRQGPTIjSPRIiEC 
SGM 1 1 VHRSLKLVGSSDPPALTSRIAWL 
IGARHTPDCSVCKF ! 


5234 


19135 


A 


5273 


284 


105 


YSCS YS YFDEP VELRKSSFFRWNHSSDS j 

YWKKKSSKDTEPVLKPPGYSARYECKTV 

GSS 


5235 


19136 


A 


5274 


290 


449 


I FYFSGRVRAKLS APIiAGMGNAKADSRG 
RSRTKMVSQSQRMYCLSALLYLSLC 


5236 


19137 


A 


5275 


248 


423 


NKVNI TLMTKLEKDTT KKENYRP ISLLN 
IDMKILSKILQNQIQQYIQS I IHHDQVK 
XiL 


5237 


19138 


A 

* 


5276 


425 


30 


TRRGMPHRQGS PRKAP VS VRRQRVREEC 
GREPLLWFLQSQLLGRLRQETHLNWGGR 
GCSEPRSCHCI PAWATRANS IFCNFQAS 
SVEVRRSARKKLFSDILKRHNTITWRVS 
GLLLVDSYFGRLATPVRTQ 


5238 


19139 


A 


5277 


25 


185 


SLFSVAEFAFNAYKVQTBKEEKEEARSK 
YKEAKESFQRFLENHEKMTSTTRYK 


5239 


19140 


A 


5278 


263 


483 


VFTWFFQLFCVYKLIFSFLTECRSVAQ 
AAHAGAHWHNLGSLQPPLSGFKQFSCLS 
LPEFPRMYSVLSPSP 


5240 


19141 


A 


5279 


25 


252 

* 

• 


LETLGLMTLVQCX3GI PGFCHVGQAGLKL 
LTSGDLPASASQGAGIAGMSHHTRPILL 
HFI YFILLYHPPCLAYFI IF . 


5241 


19142 


A 


5280 


326 


203 


IETESCYVAQDGLELLGSSNPPTSASQS 
AG ITGVSHHTWTT 
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SEQID 
NO: of 
nucleotide 

U %M w*w"V fcB%*W 

sequence 

■ 


SEQID 
NO: of 
Deotide 
sequence 


M ■ 

eth 

od 

■ 


SEQID 
NO: io 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ine to first 

WWW WW " ■ » *af wm 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle* - 
otide 
location 
corresDon 

wtfWjj m w www ww mw 

ding to 
last amino 
acid 

residue of 
peptide 

r r 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, IHboleucine, 
K=Lysine, I/=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, ■ — 
^Threonine, V=Valinei 
W=Tryptophan, Y^yrosine, 
X=Unknown, *=Stop codon, /=possible- 
nucleotide deletion, V=possible 
nucleotide insertion 


5242 


19143 


A 


5281 


210 


459 


NKNNKKFNBKKKKNYKKWARELYRHFS K 
DD IHVANRYMKRYPTSLI IRKIQIKTSM 
RYHLTP IRIAT I PR 


5243 


19144 


A 


5282 


262 


454 


LLFTDYHEGMLHSVKIiLFCFLGWSFTLA 
IQAGGHWRDLGSLCPLLPRLKQS SCLNL 
LSSWDYKR 


5244 


10145 


A 


5283 


451 


303 

mJ WW 


GTEFHHVGAGLELLGSSDLPVLASKSAG 
I IGMGHHAQPKQYS FVILTVQ 


5245 


19146 

1 S ITU 


A 


5284 


217 


489 


PVS IS IILRCLSLPTPGSRVLCMLEGMD 
GISLGWVPIiPiSLPVPLHPLDLSLPVCRQ 
VGGTKTG WR YVGE TDFAKGEWCGVK LD 
KPLGKND 


5246 


19147 


A 


5285 


247 


99 


FFFFFFFFFLYGFFFCFFFFFFFFFFFF 
FFFLKNKTQPYFIYKRNILSM 


5747 




A 


5286 


2 


117 

M X 1. 


PRVRPRVRPRVRPRVRPRVRPRVRKKKK 
KKKKKKKKKKGGGF 




10 140 

x y it? 


A 


57 R7 


121 


V 

ml 


KFWPGAVAHACNPS PLGGRGGW ITRSGD 

& 9 v mm wwm m\ m* w w> wmtwmt mtwtmywm 9 wm Wmw mm Wmw mm wmwmw WmT V W t WW m* w^mmw *mw • mmr 

RDHPGQHSETP 


574.0 


10150 


A 




416 


48 


IGKPGSFFPPPPSPRGAPPPPPKIFFLP 
PPPFFFRAGFPLFPPPPQIFFFPKIPPP 
FFFTPPLKKKFFFP PPPI FES PLGLFFF 
PPPPSIiFSFFFFFF^FFFFFFFFFFFFF 
FFFFFFIiGVR 


5250 


19151 


A 


5289 


503 


0 


FFFFFFFFSPFFFFFSSSFFFFFFFFFF 


5251 


19152 


A 


5290 


70 


236 


IFCNSQLTSPHKHQKEaaaCKKKKKKKKK 
KKKKKKKKKKKKKKGGGDFKKKIGGAK 


5757 

r 


1 71 JJ 


A 


5701 




2R7 


OFTKKKKKKKKRKKKKNKKKKTKQKKTI 

mm wm m* w mi *Wmm *m w ^ m wm>m* wmmm wmm\ m m^w,mh m wmt w * » w> ^ w » mt wwwww> w-wtw- WW W • wm* w mm mm m 

IPKPPLF 


5253 


19154 


A 


5292 


334 


115 


KNPNALFFFFFFVDR I SLLPRQWHNLGS 
LQPLPPGFTQFSCLSLPNSWDYRCEPLR 
PALLFFSSKSKIHIKPL 

mm mm m w*' wmw mr-w^mm* mm w mmm mm a mmm m>™Mmr mmm 


5254 


19155 


A 


5293 


413 


3 


SSSPPIFFPPQKKKIFSPPPPKKFFFPP 
KTFFFLRGFFQIFPPPKKNFFPKKPQNF 
FFYPPKKKKIFFFPPPKIFPPPKIFLKT 
PPPFFFFFFFFFFFFFFFFFFFFFFLVS 
LEKGF PDAWADAWADAW 


5255 


19156 


A 


5294 


291 


11 


IQPYIKRLMQHDRVGLSSEKQSWFRIYI 
YINVPGFQNKGFIIIGSSQKAQKMIFFF 
LDRVLLCHPGRS TWQSQPTAALKNWAQ 
KTLPPQPSB 


5256 


19157 


A 


5295 


2 


401 


NPRSTEAAI KYFLTQATAS I ILLIAILF 
NN I LS GQKKKKK KKKKKKKKKKKGGG P K 
KKKIGGGKKKPGGKKKIFFFIRGEKKNP 
RGKFEKKTFFWGGEKRANPPPKKKTLGE 
KKKILGGGGGKKFFFFVGKKK 


5257 


19158 

* 


A 


5296 


49 


363 


RLQKKNKEKKKKKKKKKKKKKKKKNKRK 
KKKKKJaCRRNGGGGKLLWKGGGQILTKE 
KIL ILVAVS YTCN INNIiGRQRKW I IGGG 
KSE PNYE FKRNIGAGRGGSLL 


5258 


19159 


A 


5297 


100 


323 


E YLNTCVRI CHSHAHFLPPS YVS FALLP 
RFLFKWLL YMQVFGEGTEAVKKS LEGI 
FDDIVPDGKVKINVCFIiF 


5259 


19160 


A 


5298 


595 


0 


VCVCVCMCACICIC 


5260 


19161 


A 


5299 


412 


141 


TVDTS YS EKKI LFQ I LLL I DNGPGHLRA 
LMEMYKE INWSMPANTTS IMQLMNQGV 
ISIFKSSYLRNAFCKAIAAIHSDSFDES 
GQSNLL 


5261 


19162 


A 


5300 


216 


2 


SQDWLRKFC YHPRVFQRQP I QKLMGLFF 
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nucleotide 
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correspond 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 
^-Glutamic Acid, ^Phenylalanine, 
G~G!ycine, H-Histidine, Msoleucine, 
K=Lysine, L= Leucine, M=Meth Ion ine, 
N=Asparaglne, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W«Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=posslb)e 
nucleotide insertion 














FCLPNMVKNGVFPFFEMBGRSWQAEVQ 
WHDNS PLQPKPPGLK 


5262 


19163 


A 


5301 

» 


368 


3 


IFFFPQGKEGGGLFYIFFPPKKKGFFFK 
KKIFFFYPPPKKKKKIIRAAEKRGPPIiF 
FLKKTRPNFF FFFFFFFF FFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLVREK 
VFNSTISTQ 


5263 


19164 


A 


5302 


389 


2 


AMAQASGSCLCSRLLVDGFIPWGVFRYF 
IVBAGSOTVDQGGIAILASSYPTTLVSQ 
SAGMTGLSHRARPPLNGPPPGLTPSLSM 
VPKANVFSLLEKPTGLTQALRITLANVS 
NAPGQTRSGLEGWGCFF 


5264 


19165 

( 


A 

* 


5303 


84 


333 


FHDCCRPERSRI LRNFLLSSAGIQS IAD 
AYRLALPQGLLYGPTNFAP I INHVARFA 
AHAAHQGTASVRWGGSGSGGNQGLP 


5265 


19166 


A 


5304 


485 


93 


PPliFRRFSPPPPPKFFFSPRAFFFGGjQF 
FSFFSPPKKSFFFKNPPGFFFFPPLKKK 
KI FFLP PLFLAP PKDSFLAPP PFFFFFF 
FFFFFFFFFFFFFFFFFFFFSRENI PFI 
IIIFIYLFRDRFLFCCLG 


5266 


19167 


A 


5305 


157 


2 


PPPPFFFFFFFFFVWWVITLNVELEPSF j 
SPNTESQIGPEEAMERLQENRVE 


5267 


19168 


A 


5306 


I 


133 


TQES IMDLENDKQQLDERLKKKKKRPAV 
LKNPWGGQSLPGMARE 


5268 


19169 


A 


5307 


239 


2 


PPRNWGFFSPLSPLKSSSPPKAFNFWGG 
VGPNFP PPKKRFFS KNP PGGFF YPPLKE 
KNIFSLPPVNLGPPRVFFKGPPP 


5269 


19170 


A 


5308 


454 


116 


FRLPSSSNSPAPASRVAGISGFRCVGRA 
GLQLLTASDPPASASQGAGIADGVSFTQ 
CSMVPRLECSGMISACYNLHLPAACLGP 
PKORDCSLCLAATPSGK 


5270 


19171 


A 


5309 


123 


1 


CS VS KW Y P I AFL KN 1 1 ML W EAEAGG S RG 
QE I KT I IiANTVKP 


5271 


19172 


A 


5310 


153 


33 


WPGVVAHTCNPPPLGGRSGLITRSGVQP 
QPGQHSESPSI 


5272 

* 


19173 


A 


5311 

• 


350 


3 


NSRNL. I LTQEHS PEE KNEFLFS LPLQS V 
SMNTTHSPLLSNSLPHFIISFLPTRFLV 
GPSPTRSFPPAQSPDRISFSSRLECSGT 
I TAHRS IiNLPGS SD PL ISVS Q VAGTTCM 
CHH 


5273 


19174 


A 


5312 


1 


392 


RTRGRTRGICKSITIIHHINRTNEKKNH 
MIISVDAESAPLQIHSAKLKK 


5274 


19175 


A 


5313 


1 


239 


LKTSFHHVGQAVLEFLASSDLPALASQS 
AGITCMSHCAQPYISINFIPAABTQQSS 
PELVPLPASQKTPS FSTPLFSLP 


5275 


19176 


A 


5314 


345 


237 


LYFYI YFLKSACVI ILSTLCVCVCVCVC 
VCVCLCM 


5276 


19177 


A 


5315 


268 


412 


GMISSNNSSDSFGNCLNPGGKGCCEPGS 
YHYP P A WATDTFS KKKKKKL 


5277 


19178 


A 


5316 


112 


224 


PWMLENELOLHASYLAMSTPLSPVEIBC 
ASKKIFTFC 


5278 


19179 


A 


5317 


377 


139 


KPPPRI FYLGPQKKXKLFYPPPFKNCFF 
LGPPPPPLFMI FFFFFFFFFF FFFFFFF 
PQWVMFLFFLLSASNQNSPSWSP 


5279 


19180 


A 


5318 


216 


385 


RGVLLCCPAWSQTPGLKGSSCLSLPKCW 
DYKHEL PRPATS CNS I LFFNLQGAFMYH 


5280 


19181 


A 


5319 


376 


3 


RIDFGGPKKKHiLPPPPAVKIVSLKGPP 
LFFFFLNSFFAPAGGQWGVFQLISSGDP 
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sequence 

• 


SEQ ID 
NO: of 
nentide 

sequence 

• 

- 


M 
od 


SEQID 
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26 
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nucleotide 

location 

correspond 
inp to firvt 

amino acid 
residue of 

peptide 
sequence 


Predict- 
ed end 

niirlf*- 

otide 
location 

rnrrKiinn 

ding to 

lacf Am inn 

lujl DIJIIUv 

acid 

residue of 
peptide 

>CL| UCI1V.C 


Amino acid sequence (A=Alanine 
C-Cysteine, D-Aspartic Acid, 

F^tTvlutnnnic Arid TT=Plipnvlalnninf 

ij viuuiiiiii> LIUf Mr X lieu jrjniniiifiCa 

G=Glycine, H-Histidine, I=IsoIeucine, 
K-Lysine, L=Lcucine, M=Methionine, 

Q=GIutamine, R=Arginine, S=Serine, 
Tt=Threonine V=Vflline 

A lUIWIIIIICj T T OIIUV) 

W^ryptophan, Y«Tyrosine, 

X-Va known, *=$top codon, /=possibte 

nucleotide deletion, \=possible 

IIUUwllUC IIJ>CI LIU II 














PPPAPESVRIPGVSFCAGVPQGFFFFFL I 

NSSVAQAGVQWRDLGSLQPLPPRSKRVS 

CLSHPSSWDYSH 






A 




J / / 


7R7 


ETjTjTSSOTjPAT.ASOQ.AGTT^MSITHTOP{3 

SF 


5282 


19183 


A 


5321 


381 

■ 


65 


SSSPSSRTSGPFFFFSPPKKKKNFQKKK 
NFLyPPFFFPPPFYKNPPKKKLKGPKKK 
KKISPPPPKKNLSLKNPPPFFFFFFFFF. 

C C c C c 


5283 


19184 


A 


5322 

i 


132 


396 


EGGFVLKXIREGIGPHFLGLEEEGSKFH 
LS KNP YFRVKQRVSHIEGFKNFFNEKKD 


5284 


19185 


A 


5323 


366 


1 


GFLKKGVPFFPPGKRGGGNQTPPKPFVL 
GGKQPPPPP PLKKKNPKGGPPQQKKI FF 
FLTRGGVSLFGPGGGFP FFFQP PKKPPP 
GGFPFFFLKKPPPPPLFFFFFFFFETGF 


5285 


19186 


A 


5324 


131 


1 


DGALS PRLEWCSGTIIiAHCSLRLPGS SA 


5286 


19187 


A 


5325 


112 


1 


DGI LLLLPRLESNGAISAHCNIiHLPSTS 


5287 


19188 

* 


A 


5326 


404 . 


18 

• 


PVSPHGKNHEKHTVTCVGSTSGAFLHGE 

HGNSFKNLWRS WAWWRTP VVSATQDAEM 
GR P LE PRGLS IiAWATOODPSLKOKOKOIT 

KPTHLWS YTLGHHOLP 


5288 


19189 


A 


5327 


109 


3 


KCVLTPVIRALWEAETGGSRGQDIETIL 
SDTVKPR 


5289 


19190 


A 

• * 


5328 


i 


1 17 


ETGFCCVAOSGT^IiLDSSNLIjATjVSQSA 
GVTGMSHRSRP 


5290 


19191 


A 


5329 


3 


120 


DAWVAAE AEGQNDT I EEPNKVQKRKRGI 
NDNVPAGOAH 


5291 


19192 


A 


5330 


105 


2 


RSGWAHI CNP STLGGQGRW I TRSGVQD 
OSVODG 


5292 


19193 


A 


5331 


204 


411 


S KKKQKKKKKKKKKKKKKKKKKQDKKG 


529"? 


19194 


A 




• 


o 


FFFETESHSTTOAGVOWRDI.fii3LOPT.PP 
GFKSLPSSWDYRCLPPCPANFCIFSRDR 
VSLCW 


5294 


19195 


A 


5333 


561 




WSTLHS FP IWLLNTGYYFKELTGFFF PL. 
TVALS PRLECSDT I IAHYSLRLLGSRDP 
PTLVFQSTKITSVSHCTQPSVRF 


5295 


19196 


A 


5334 


3 


96 


EGKAAFSQEKSRRVKEENPEIAVSASTI 
PR 


5296 


19197 


A 


5335 


375 


228 


LSPWI^CSGTIIAHCNIiRPLGSSNSPGV 
FGFYFFLTLIIiGSGVHVGLC 


5297 


19198 


A 


5336 


403 


0 


NISVPASKHISPFPKKKKKKKKKKKKKK 
KKSQKKKSRG 


5298 


19199 


A 


5337 


397 


80 


1 1 1 T S KKKKKKKICKKKKKKKKK 


5299 


19200 


A 


5338 


406 


34 


KRTLKDS KKKRKE KKKKKRKKKKKKKNR 
EKKKKP KKKKKKKKRGGAQKKKNNHPAG 
GKAYFFFLGGTKKKRGGGDTKPLGEKNS 
PAQTKKLGGNNPPFVERGKKKKTPGI 


5300 


19201 


A 


5339 


410 


0 


KKQKKKKKKDAPGGGGGGRKKKGGPQKK 
KKGSHLKTPKKEKTGGA 


5301 


19202 


A 


5340 


306 


392 


IQYKILPQKENDWDKIQGQIiAIPVSSLN 
F 



868 



WO 01/64835 



PCT/US01/04927 



SEQID ' 
NO: of 
nucleotide 
sequence 

'i 

■ 


SEQID 
NO: of 
o en tide- 

sequence 

■ - — 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 
beginning 
nucleotide 

99 \m mmt ■ mm* \# M im* mm* 

location 
correspond 

inp to first 

mm m tg^ U a w V* 

amino acid 
residue of 
oeotide . 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
corresoon ' 
ding to 
last amino 
acid 

m-m, w J v* 

residue of 
neo tide 
sequence 


Amino acid sequence (A~Alanine 
OCystetne, D=Aspartic Acid, 
E^GIutamic Acid. F^Phenvlalanine. 
Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methlonlne, 
N=AsDaraeine» P^Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TrvDto oh a n . Y ra Tvros ine. 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion. V=nossible 
nucleotide insertion 


5302 


19203 


A 


5341 


3 


173 


TFTATEL 1 1 FY I F FETTLI PTLAI ITRO 
GNQPERLNAGtYFLFYTLVGEGSLLARS 
R 


5303 


19204 


A 


5342 


247 


426 


GRAWWFTPVITPLWEAEAGVQWRDLSSL 
QPLLPGFKRFSCLRLPNNWDYRHVPPRP 
ANFL 

mv mm»m> mm. u 


5304 


, 19205 


A 


5343 


3 


149 


S WFLHFGEAGLELLTSGDP PALALPKCW 
DYRLVSNSWPEVILQPQPPK 


5305 


19206 


A 


5344 


464 


147 


LRRIiRWENRLRPGGGGCSBPRSHRCTLA 
RVTES LSRVQCQRDP RRLAQTSRRPGAS 
S F P FAG P AFSAVNSTYQRALP P AQGAP W 
TPRIiPLGPHGTTS S TS S YLPV 


5306 


19207 


A 


5345 


228 


1 . 


VIRL.EETYFLLFFVLFLFLFFRQTRSGS 
FAQAGVQSSHLSLPS SWDHRYTPHLANF 
CI FLQKQSLAMLPRLVSNTK 


5307 


19208 


A 


5346 


2 


106 


FVLELLGSNNSPSSASESAGITGVSHCG 
QPLFNF 


5308 

■ 


19209 


j\ 

A 


5347 


140 


436 


GGEFGL IENPPPG FNPFFLLNLPKKGEL 

PADDPADTPPPPPPF.KIJGPTPI/POEGI.N 

FWTWEPPPLTLPRGG 




lyziu 


n 




1 
I 




i ir< jivi\jc ctiAjxivvrvjo x lonur rciiinv v>jtr . 

FSDLQLRAGRTTTLFKAVRQGHSSLQRI 
LLPFVWOCPAPRGGVYRGRQASLSCSGL 
HPVRASWLLCFPSEAWAMSGTPPPASLP 
PCNLISDCCASNEQGSVGIGPSEPGVEY 
NLLLRHLLKPEEKRS 






a 

mTm 






JO 


ERDYEEGPGAVACACIPSTLGGRGGWIT 
RSGDRDHPDEIVDP 


5311 


19212 


A 


5350 


288 


404 


I EXNGM I LAHCN FQLPGSSNS PASAS PV 
AVITGMNHYAR 


5312 


19213. 


A 


5351 


413 


305 


AEAGLKPLS S TYIiRAS ASQSAG I TGVSH 
HTOPLLPF 


5313 


19214 


A 


5352 


91 


356 


T VPKRNTF S PKMVAI TTAGTVSTSVLAL 
GNRESALPTPJOGWALCCFIFWGGRGHRV 
PLCHPGWSTVARSRLTATSTSQVQAI LL 
POPP 


5314 


19215 


A 

■ 


5353 


2 

• 


410 


FVQWQ PLKTGSDRS HKAVESLGYE KVEK 
AMTVNYS FRSLNTKGKIiEIiLGKI S VNLY 
AKW KH FREKGRRYTNT LGPTATAAFYST * 
EKTEFLPWSQTCKPFIDSFNKYIiLPGA 
ADHAQTPSTLGGRGGKITRSGDRD 

# mm « mmr m mmm*mmw^m^m9m\^^m9^mT my^m^mm mm\ -mu ^mmmmr ^mmm mmmw fcAr 


5315 


19216 


A 


5354 


3 


288 


S LDDDLLKLLLPLMLQ YS DE FVQSAYLS 
PJILAYFCARRLSLLLSDSPNLLAAHS PH 
MMIGPNNSSIGAPSPGPPGPGMSPVQtA 
FSDFLSCAQH 






A 

* * 


j j j j 


187 


a 


KKILWOTPGVNIjFSFKGAPLFFFFIiRRV 

llccpgwsavarspltatsasrvqaill 

PQPPK 


5317 


19218 


A 


5356 


383 


261 


GYKINGQKSQAFLYTNNRQTESRLWYCY 
DSPQRLIHLSIK 


5318 


19219 


A 


5357 


367 


241 


FFFRHVAQAGLQLLGSNNPPASPSQSVR 
VTGVSHCTQPLCFV 


5319 


19220 


A 


5358 


217 


1 


S C DRENTGLKLLS I LLLIHCETDTR I S L 
LCISLSI KRE I WLGAVARACNPS TLGGR 
GGRI TRS EDRDPGBTP 


5320 


19221 


A 


5359 


255 


412 


TQLCRKRNAF FFIjATNHS QKGQS RWGTV 
AHACNPS TLGGRGGR I TRSGDGDH 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

■ 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

\ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- - 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
(^Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, PMProline, 
Q=Glutaimne, R=Arginine,S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y-Tyrosine, . 
X=Unknown, *«=Stop codon, /"possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5321 


19222 


A 


5360 


391 


271 


RSCH I AQAGIiKLLASSNPPASASQNAB I 
TGVSHHLTPICC 


5322 


19223 


A 


5361 


232 


397 


EGLKBPAGLFTMEYYTARKRNKIMPFW 
TWMQIiEAIIIjGESTQEQKTKFCMFSPV 


5323 


19224 


A 


5362 


213 


392 


LFIKVEGKFFLRS INILNESLLFLSSNG 
PGLLKKLGVWPGWAHACNPSTLRGQGG 
RTA 


5324 


19225 


A 


5363 


306 


418 


SDGITGTLPLASQBSAWEDLLYVLVGV 
DGRYVSAQP 


5325 

► 


19226 


A 


5364 


347 


48 


KKRWLKPGGGKNPPKKGGKKGGTPPFFQ 
KKGFFFPPFFFFLEKRDLSPQAGGKGEK 
KKPPQPWKAEGKKNPWGNPPQRGGYKGN 
PTRPPPQGFFFFFFL 


5326 


19227 


A 


5365 


412 


172 


PDSNLS FGAHYQAS LLGS FSLMPLLLYS 
KCSGQWDVARSAECTSKSSQNKERKKER 
KKERKKERKKERKKTRKQKKNPLT 


5327 


19228 


A 

■ m 


5366 


400 


296 


IjVEGGFHHVGRAGLEL1?TSSDPPTSASQ 
NLGLQA 


5328 


19229 


A 


5367 


162 


1 


APSSI^ETPTLLKKTLFGPGLVAGICNP 

stlrgrggritrsgvrdqpdqhgetp 


5329 


19230 


A 


5368 


152 

1 *J a* 


403 


TTVLCKLVQ PLWKTVWRFLKELE VDLP F 
NPAVPLLGIYPTEKKSYEKGTCTHVFIA 
AQPQIARIWNQSKCPPINEQKKKKGAPV 


5330 


19231 


A 


5369 


170 


367 


IVFEILVISYFKSFPQKTTEWNTFFFHP 1 

FKVFNWPSTVAHACNPSTLGGRGRQITR 

SGDRDHPAR 


5331 


19232 


A 


5370 


396 


1 


QMCVRAHTFAGGRARTHTHTHTHTHTHS 
WATPRVSHNHCVCVCVCVCVC 


5332 


19233 


A 


5371 


139 


3 


P KLECGAS 1 I AHCS L ELLGS KDP P PS AF 
QVABTTGCAPPCLAFFF 


5333 


19234 


A 


5372 


1 

■ 


341 

a 


PLINILSHLILLCRNYFSFVKFCIRHTS 
LLYWISLFHVLLCWSFLTDRRAHNAS 
VTYGFVL PVTFKNS PL PGAVAHACNPST 
LGGRGGOIMRSGVQDQPGQARSSRPALP 
T 


5334 


19235 


A 


5373 


280 


449 


ICHNLQLCPSKDTIEKVKRQPIEWEKIF 
ANHISHKGLSS 


5335 


19236 


A 


5374 


170 


1 


QKRG FSMGTPMVS I FRPQDLTPLG FPKA 
WDPRVNPWGPWPCPLFFKGNPQLFQAPG 


5336 


19237 


A 


5375 


462 


165 


PPTRFLHWQTGLE LPTSGDLPASASQG 
AGITGVSHHAQPRGHVFDVLLEYSQQGV 
FTLLSFLGFSVFFEPLFC 


5337 


19238 


A 


5376 


3 


451 


PRAKLGTRRGLRNLDTIGVAVDLILLFR 
ELRVEQESLLTAFPCGLQFLRNIASRHE 
DSQSIGWVHAFPPPVWPCLSLPPPHTGS 
SSSITLPPLLPHRPHPPPHSHPPMATLS 
RPPYPPPPHPPPHPLPPSLSPPALSPPP 
PPPPPPPSL 


5338 


19239 


A 


5377 


460 


300 


KVLLCCPRLECRGAIMAHHSLELIiGSSN 
PPTSASQSAGVTGVSHCRSPWEFL 


5339 


19240 


A 


5378 


452 


191 


DSRFHHVVQADLEHPTS SDLHASASQS A 
VITGVSHRTRPDIAFAPSISSYFSCVAE 
ISLIiHLSESFPKQFL 


5340 


19241 


A 


5379 


240 


395 


IiATSSVANLRNTFYLKLKSSVLGVEAHS 
CNPSTLGGQGVRITKSGVRDQPGQ 


5341 


19242 


A 


5380 


210 


411 


TSVNSAPNSGQKASVEECFGQVTKVKIQ 
NVFLGWAQWLTPVI P AL WE AG AGGS RDQ 
EIDTHiANTVR 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 

■ 


SEQ ID 
NO: of 
peptide 
sequence 

i - 


M 

ejth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

* 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to - 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleuciue, 
KpLysine, L=Leucine, M=Methionine. 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=VaIine, 
W=<rryptopban, Y^Tyrosine, 
X=Un known, *=S>top codon, /=pos$Ible 
nucleotide deletion, V=possible 
nucleotide insertion 


5342 


19243 


A 


5381 


406 


300 


FHYVGOPGLELLTLGSTCLGLSKCWDYR 
REPPHQA 


5343 


19244 


A 


5382 


364 


479 


INDG I S KISLFliSHLKDQAGQVWWLTPV 
IPALWEAEAG 


5344 


19245 


A 


5383 


182 


395 


HHLRGSAAS FLKSAQE PTD FRHTGPKRF 
SCLSLPSSWDYRCPPPRRGGGVLCLPGW 
RTWQSWLTTTSISQ 


5345 


19246 

• 


A 


5384 


37 


478 

¥ 9 %f 


GGIHSYPVICLFILMGSSVSLPFSSVAC 
LLAWLLPSFLPSLLPSFLPSFLPSLTGR 
AFL 


5346 


19247 

1 


A 


5385 

i 


201 


458 


WNP P FLS PRKWAARADTE WGVGKTAQEP 
GLS S RL.QAYE ERQ RHWQRE REALREDCA 
AYAQCAQRAQQLLQLQVFQLQQEKRQLQ 
DD 


5347 


19248 


A 


5386 


206 


407 


FFS PGGRAGEKFGFMEPPPPGFKKTLWL 
NPPGKWKQRAPPPIPVYFLEFKKKGGWP 
NMVQVGLNPRP , - 


5348 


19249 


A 


5387 


468 


o 


NFF P PRPAPP FFF FFF FVAQAGVQWRVL 
GSLQAPPPGFTPFSCLSLPSSWDYRRPA 
RVSPHC 


5349 


19250 


A 


5388 


136 


2 


LFFFFLFETGFCSVP^LSCSGTI IAHCS 
LKLPGSCNSPTSVSQV 


5350' 


19251 


A 


5389 


397 


0 


ILYPSQKKKKKKKKKKKKKKKKKKSQ 


5351 


19252 


A 


5390 


299 


3 


AQPI*APFPGSFILMYFFMLLLFGIPLLY 
MEVIMGQWLHVDNIRVWKQLVPWLCSMS 
YAHSLVNEGPSPRPSPISLCGLQVSAHR 
VHPQVCASVSLYNS 


5352 


19253 


A 


5391 


264 


202 


FFFFFFFFFFFFFFFFSBNKI 


5353 


19254" 


A 


5392 


2 


150 


PRVRFLIETEFCHVGQAGLKLLTSVDPP 
TSASQSAGITGVCHLVRPEKS 


5354 


19255 


A 


5393 


382 


0 


NSRAQAWWLMPVI S PHWEAEVGGSLBPT 
S 


5355 


19256 


A 


5394 


395 


239 


SLPS S WD YRHAPQCWANFI FIiVETGFSS 
VILCS FRLGGFGLS LNILCSLKSL 


5356 


19257 


A 


5395 


24 


265 


LLPHRKS RNYRTRPTRS LRQ KNGMNLGG 
GACSESRS HHCTPVWATERDSVSKKKLF 
TIELVYVFLK2CNSKGEPLPKVLGL 


5357 


19258 


A 


5396 


,174 


118 


FFF F F F F FFFFF FFFFRS S 


5358 


19259 


A 


5397 


406 


134 


PSAPSSPPPPPLIWGPPGFFPPPLFKNP 
PPKFFFGAPKKKIYLPPPRALNFVLLKG 
PPPFFFFFFFFFSFF 


5359 


19260 


A 


5398 


294 


64 


IFFFFFFFFFFFFFFFFWEIiHILWGAIiF 
LLKSR I P PGKKKNTCKIFLCTPLTGYI P 
LLKCFQKQNKKSQSQNTELKS 


5360 


19261 


A 


5399 


1 


211 


LLLPTP YHH INQDS SNKYHETKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKRGEKKKK 
KKKKGGGGGYKKKR 


5361 


19262 


A 


5400 


305 


1 


SPPRPPFLLPFLFFLPSFFFKAPHTINH 
FFSPPPLFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFLKLYIiFYKMHRINYTSNI 
LKINIFVFSSPGSGGRR 


5362 


19263 


A 


5401 


416 


3 


ENT E RDRGRQRRE DRS RAE TQE TNRAGQ 
QESTPPPQKKCNTYSIPSSSSSSPTPQK 
GKKGGPPGGPPPPAGERGGPKKKKRGPQ 
KKKIRPPPPGKKKPPPKTPPLFFFFFFF 
FFFFFHKPQTFIVGVNTYILLRSVS 


5363 


19264 


A 


5402 


154 


416 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
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SEQID 
NO: of 

nucleotide 
sentience 


SEQID 
NO: of 
nentirfe 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

. 


Predicted 
beginning 

n 11 clenHrfe 
location 
correspond 
ing to first 
amino acid 
residue of 

peptide 
sequence 


Predict, 
ed end 

micl^w 

1IUVI" 

location 
correspon 
ding to 

lfi<t Amino 

add 

residue of 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid. 

K^GIutflmic Acid Kt^Phenvlnlnnin* 
G=Glvcine. H=Histidine. J~T s n 1 pi 1 rl n p. 
K=Lysine, L= Leucine, M=Methlonine, 
N=As para gin e, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

X— Th reo nine V«Vflline 

M 1 III VVINIIV) T T QIIUVJ 

W=Tryptophan, Y=Tyrosine, 

X=Un known, ^top codon, £=possfbIe 

iiuucunuv UCICUU1I9 lr - UUaoIUMS 

nucleotide insertion 














KKKKKKKKDSGGGAF 


5364 


19265 


A 


5403 


327 


2 

■ 


PPPQKFFLGGVFFFPPFFYNPPPPFFFF 
SPKKKKYFSPPRGKIFFFFFPPPPPFFF 
FFFFFF FFFFFFF FFFFFFFFFFFFFFF 
LRWSLALVAQAGVWWLNHGSLQSL 




i 107££ 
J7ZOO 

1 


a 




177 


1 si 


VTCflT.AWTRrQTPNPT.NPTT.TTWJtfGlCPPiy ' 

AQAGLEPLGSSDPPTSASQSDGITGVSR 
RTRPINIC 


DjOO 




a 




110 
1 1U 


1 

1 


GGQGGRIT 


J JO/ 


i<v><g 

lyzoo 

* 


v ■ 


D4UO 


o — 


* 


PPPGHALEWTLENVLLAAPRPRKAQVLF 
AIVASETSSWDRBKLWTLSLEAKCKGIT 

T .PVT.AT /I W2VflTTJPT . APT. ZiPT .W3 AD CBft 

HT.TjRLOGV < 3 kpp. vntyaogftr 


5368 


19269 


A 


5407 


3 


181 


QLILFHDKLSPICLLSWGNVDIIjINFFLS 

liV^kJ f X-J X\_rt-CVXv J_I\^n XVI H I J. IN J_l V f^i | p a * ^ I 

GIN 


JJU7 


10970 


A 




171 


777 


S AE I TGVCHCARPNHDS YRRS 


S17fl 
J J /U 




a 




704 




KTnVnP WVT iP Jf fiTTiTJDT . T M ^ MT ,T7H7?T,WT? 
vv^civv vtuuuj x XA7i/ujLi*ienjjii ncjj v v Xv 

SKDEKNTKTWPGAVAHTC3^PSTLiGGRGG 
QITRSGVRDQPGRHG 


5371 


19272 


A 


5410 


236 


308 


ACaRPGTVTHTCNPSTtGGQGGRil l 4 


5372 


19273 


A 


5411 


375 


216 


GCSEPRWCHCTTPAWVTVQDSPPKKBKK 
RKE KKKKLLTT ENS KKALKRNG YL 


^171 


1077/1 


IX 




1 
1 


1B1 


TGMSPRTQP 


5374 


19275 


"X" 


5413 


144 


3 


RPPP I FFFLAJCKIJlAIiFFFFHS VTQAG V 
OWRNI/3S 1SOP LP PGFKDF 




1077£ 






417 


17 




5376 

■ 


19277 


A 


5415 


414 


2 


NNNSFVFFSKPIFLKTVGGAPPPSPIYN 
P I P I FLYI YP PHKWGGVYI PTHM YGGGD 

IWGYMYTPPPPPLFIYFFPPPPFYIEKK 

KK'TVPCFFAnGGVPT.r^KKinOJTCDPGHfG 

ivt\ x v l v«f cnyv»v lujuiJiuUuUiiwxujnvj 

DTMTMGHGGRLEMVANVCGLKRYW 


5377 


19278 


A 


5416 


1 

4 


240 


GGEGCS E PRS CHCT PAWATERDS ISKKK 
KKKGGPFKRTHFSSPGLPSFFFFFGAPK 
FNSRARFLTPREGKNPGLPPFYPA 


^17R 


1077O 


A 




7 


/11 




5379 


19280 


A 


5418 


322 


1 


W PLC FS RCSKKRGS PLGFP FS P PLGQS K 
KFVRFACKTPSAFKPFPSLS FFQRWAGP 

CCS ilyUllMDiJrAuiJC U ^V^>£ EEC C C JLIXVrt 

SLTLVAQAGVQWHDPGSLQPPPL 


SI Sift 


10781 


a 

1 


S/L1 0 


1 
j 


107 


PPM Afl Q YDPflAP A T T . a nTf R nOT?0 QP PLQ 

D PARVFHHNAWDOTE WSEEQAAAAERKV 
QENS I QRVCQEKQVDYE INAHKYWNDFY 
KIHENGFFKDRHV7LFTEFPELAPSQNQN 
HLKDWFLENKSEAPSQKK 


5381 


19282 


A 


5420 


273 


91 


RW Y I VLDNLI FX LENI CCETS LK I YLG P 
GWARAC^STLGGQGGQITRSGVRDQQ 
GQHGB 


5382 


19283 


A 


5421 


1 


149 


GGGGCSEPRLRRCTPAWATERDSVSKKK 
NIJCKFFCPGLGLYQKQNTWTL ■ 


5383 


19284 


A* 


5422 


381 


179 

* 


S SCLGLPKS WDYRCEPLCPARTALNLKT 
I KKS KDNJIRWRGCX3EKEMLMHCWNECDM 
VWLCVPTQISC 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

■ 


SEQID 
NO: in 

TIC CM 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

uig to iirst 

amino acid 

resio ue oi 

peptide 

sequence 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
add 

residue oi 

peptide 

sequence 


Amino acid sequence (A=>Alanine 
OCysteine, D=Aspartic Add, 
£r=vriutamic Acid, t— v nenyiaianine, 
G=Glycine, H=Histidine, I=Isoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
iY=Asparagine, r=Froline, 
Q=Glutamine, R=Arginine, S=Serine, 
I— jnreomne, v* 3 valine, 
W=Tryptophan, Y=Tyroslne, 
a— unknown, —Mop cod on, /— possiDie 
nucleotide deletion, ^possible 
nucleotide insertion 


5384 


19285 


A 


5423 


195 


369 


CSLSPIKECKLHISGLQDSSISLVNLGL 
AIRNIRNKPGVVAHSCNPSTLGGQAGWI 

T 


Of 

5385 


19286 


A 


5424 


253 


it 1 4 

414 


KKKKKXXRGGAAKKKPHSPAPAERDL 


5386 


1 /VIA*? 

19287 


A 


5425 


160 . 


2 


Nl^KAjbAFr aKKlbPr r ljviU^FIjr rrFF 
EMESLSVAQAGVQWRDJjGSLQPLP 


5387 


19288 


A 

■ 


5426 

t 


53 

■ 


426 


ASS S FFF FFFJjGKGGS I FP P KGGGEG KN 
FFL WKPGAPG<pGG P PP PTPRGGGKKGP P 

KKTLPEKMGLKK 


53 88 


19289 


A 


5427 


60 


1 




5389 


19290 


A 


5428 


87 


247 


KKKNPQKKKKKKKKKKKKKK^ 

DSPfirTlT.CT WDDRfOMMT T717WVT 13 


5390 


19291 


A 


5429 


209 


405 


Klil^WFSSLSSLKAMISAILLKLSVPI 

r frlK.i V 1 Lib 1 JjJN bub W b\iA V AJiALjJfc'b A 

LGGRGGRIT 


5391 . 


19292 


A 


5430 


324 


237 


VMNAFVC VCVCVCS C I CVCVCVCVYFL F 


**** 

5392 


19293 


A 


5431 


409 


3 


F FFFFS ETE SRSVTQAGVQWRDLGS 


5393 


19294 


A 


5432 


270 


402 


MKI I FHL IR ITG WAQWLTP VTPAL WEAE 
TGGSRGQEFETTVKPC 


5394 


19295 


A 


5433 


107 


277 


YNFNNHR YREAI Q KWDEALQLTPNDATL 
YEMKSQVI IMKNFTFLMKLCYVCSVSEQ 


5395 


19296 


A 


5434 


95 


1 


RNRKKNGQSGTVAHACNPSTLGGRDGWI 
TRS 


5396 

- 


19297 

• 


A 


5435 


412 


105 


GGEGCSELRSCHCPPAWATRARIiHLKIK 

KKEkWDLAMLLRVGYSGHRQVHCTO 

IiQPQPF 


5397 


19298 

i 


A 


5436 


JB V 

276 

i 


488 


PLPQPNKDSRIjLPIjLVCLGFLFAPLIjML 
GHG PKRARL P I F FPQEAYF ITFRLVFAE 

KlQb xKliFJj 1 Ml.liAfc'K 


5398 


19299 


A 


5437 


179 


2 


GISPLKGGTPSLKKKKKAKDFJMRRVSKE 
YLP 


5399 


19300 


A 


5438 


3 


156 


TPLYSQILI SLRQEHYFKAEGRGCNELT 

C VTJ<^**PT3 a MTV HTICftCV c VMrrrtiflT* 
O X HC 1 ir AW AiXXJoL/b V b IVM J\UTf 1 


5400 


19301 


A 


5439 


47 


173 


SLQLFLQTEERKDSDDEKSDRNRPWWRK 
RFVSAMPKGNFIKY 


5401 


19302 


A 


5440 


3 


122 


TTLTLTSLIPPILTTLV1WNKKNPPKLI 
RXuVjjirsfi rciv 


5402 


19303 


A 


5441 


390 


489 


KYKSYSLQPPPPKFKHFSCVTLPKHWDY 
RREPP 


5403 


19304 


A 


5442 


86 


275 


RQEFETSQQGETPCLLKIQKIiAGRAHLN 
PGGRGCS EPRLCDCTAAWVTRAGLSQIK 
KIKKKKK 


5404 


19305 


A 


5443 


424 


166 


CFFVSPKAKSQGWLGSPPPPPPGLKHF 

bAir X JrKVjljr XjC r f r rKuvrrtM&uuono 

PPPGVFPPWPPQKGGDSGGYPPPRAKIP 
LF 


5405 


19306 


A 


5444 


356 


3 


AFFWFTNPFPPGFKGFFCPHPLSGISKG 
GPPPRGNFFFFGKAHFFS PFFKFLPPI F 
LPFLAPQKVGFPGFGPPPKPSQGVFLFF 
FFFFFSFLFFFFFETESSSVAQAGVQWR 
DPDAW 


5404 


19307 


A 


5445 


198 


86 


PWPI^QAILl^SSCLNLPKCWDYRRE 
PPHTALLNS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

- 


SEQID 
NO: in 
USSN 
u9/515,l 
26 


Predicted 
beginning 

■ X * ft 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
. peptide 
sequence 


Predict- 
edend 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A»Alanine 
OCysteine, D=Aspartic Acid, 
EXIIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lyslne, L=Leucine, M=Methiomne, 
N=Asparagioe, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, ' 
Threonine, V«Valine, 
W«Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


5407 


19308 


A 


5446 


198 


86 


P WPLGAQAI LLPQSS CL^tPKCWDYRRE 
PPHTALLNS ' 


5408 


19309 


A 


5447 


143 


3 


TMIYFYFYFFLSQSLALLPSLEGNQTIS 
AHCNLRLPGSSDSSADAW 


5409 


19310 

< 


A 


,5448 


324 


1 


KKNSATS FFSVEPFSTGGVLANKGALTQ 
EKILPCFKI KKI RKRVGAVAPARNHFLF 
FSPFFKEGLQGIVYFFFFFFELESRSVT 
QAGVQWRNLCSPQPLPPGFKRTRG 


5410 


19311 


A 


5449 


403 


0 


LFFFFFFFFFFFFPKENH 


5411 


19312 


A . 


5450 


114 


409 


LNIWTLKYLISKHLLIVCLPDTILSSLC 
L IKYI GLQGKVLI FFFKQFSAIDKLGQN 
IAWGKFGFAHYS LLT KKWKLFGNITQE 
QNM I VTGGLS WWND 


5412 


19313 


A 


5451 


192 


406 


RNFFLGHLKYVKR FEQIGWGRDFMFKTP 
KARATKNKMDKGDL I KLKS FCPAKBTTF 
RGTGHPTKGEKI FAT 


5413 


19314 


A 


5452 

• 


254 


48 


NAFGWLCVHYALQFIDKSVKNSVCWIiGA 
VAHACNP S TLGGO.GGWITRSGVRHQPGQ 
YGETLRTRASTR 


5414 


19315 


A 


5453 


1 


288 


PLLFI LI PSPFSFPHPF03PI KFILPRE 
KKKKKKKKKKKKKKKKKKGGGPFKKKKI 
FPRGGKKKIFFFWAQKKKIRGGVLKTGG 
RKKPGGKKNKSQ 


5415 


19316 


k 


5454 


285 


415 


TSGCCFVFI^VLKLSIFCKGKVLNAIBDN 
GLKNS T FTYFTSDHG 


5416 


19317 


A 


5455 


372 


125 


ITPAFSFSPMGKKGVFPPPKFFGFPPGF 
PPPPFFKPPPGIIFLGPKKKKFFSPPPR 
VYFFFFKRPPPFFFFFFFFFFFFFFF 


5417 


19318 


A 


5456 


3 


159 


AHASAHAYDTI KDALGLLLF ILSLMTLT 
LFSPDLLGDQKKKKKKKKKRGGP 


5418 


19319 


A 


5457 


109 


392 


NIIIIKRSTSLFLRPIFPPKLFPFILYL 
FISLRTLLFFFFFKREFCFLPQVGGQGP 
FIiNSLKPLPRGLMQFSCLTLLRSGNYGP 
PPPPPINFFF 


5419 


19320 


A 


5458 


243 


402 


FLKIGFIHKYLKBFVWGILNIVFNRFNS 
HERAPHAI VFVNGTTI EGHVEKCYW 


5420 


19321 


A 


5459 


394 


105 


IPPPS PRPGGPGAAPPPGGGGSTTPPPK 
KKPPFFPKNKNQPGGGGGAPPSPPPRGL 
GPKIPLTPGTKVFKETKPPPFPPPRGAP 
PKFLSKKKKKET 


5421 


19322 


A 


5460 


56 


379 


VCFSPS I YGAWLNKSPLRSHGSSSTSG 
WGGLPSPFPAQRLPLASQVQEHNGHVFA 
S YQVS I PQSCEQCLS YI WLMDKALLCSG 
E WLPHQAPKPSMS PTTPHGS RGLQ 


5422 


19323 


A 


5461 


184 


420 


FFFFFLNNNLFFFFFFFFFFFFFFFFLG 
RN 


5423 


19324 


A 


5462 


295 


384 


LGAVAQARNP ITLGGRGGWITRSGVQDQ 
PG 


£ A*> A 

5424 


19325 


A 


5463 


a m n 

413 


257 


FliNPv^RGCSEPKSHKSTIjAWVTERDLI 
S KKKKKE KCW I PQRM PERGQWS PD 


5425 


19326 


A 


5464 


2 


181 


SLYCSGWSQTPGLKQSSCLSFPKCWDYR 
CELPR PVYVLL YKTS FP KRYVFGIHLCF 
MSQ 


5426 


19327 


A 


5465 


1 


229 


TGSCYVAQANLELLSSSDASTSVSQNAG 
TTGVSHHVQP YTS S FSQVPPLRAQSAAT 
ETTSILWVQKIiRIiQWRCSV 


5427 


19328 


A 


5466 


1 


368 . 


KERETKKEREEERRMRRRREGRKEKGRK 
ERKKEREKERKKREEKKRNNLGQSAGIT 
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SEQI0 
NO: of 
nucleotide 
sequence 

■ 

• 


SEQID 
NO: of 
peptide 
sequence 

• 




SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ins to first 

amino acid 

residue of 

Dentide 

sequence 


Predict- 
ed end 

nude- ' 

I1MVIW 

otide 

location 

corresoon 

ding to 
last amino 
acid 

residue of 

oentide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 

E E =Glutftmlc Acid F=PhenvInlnn5np 

G=Glycine, H-Histidine, I=Isoleucine, 
K~Lys!ne, L^Leuclne, M=Methionine, 
N=Asnftrfloinp P=Prnline 

Q=Glutaiaine, R=Arginine, S=Serine, 
^Threonine, V«Valine, 

W»Trv n f n nhnn V=TvrrKi n p 

X=Unknown, *=^top codon, ^possible 

ntideotMp riplpfinn Vsrnnccfhfp 

■ J ULILU 1IUC UCIGUUU) inUUSAIUIC 

nucleotide insertion 




* 






- 


- 


GMSHHTRPKSHNVIiSKFTNLCWAAFKAI 
LGHMWGRLDKLNVEQC FSTELSALPS 1 1 
ATSHMRLWVI 


5428 


19329 


A 


5467 


183 


3 


VSLCYGEKFSPQMNSSQLYFRPGAVAHA 
CNPSTLGGRGRQITRSGDRNHPGQHGET 
GRV 


5429 


19330 


A 


5468 


1 


128 


GGAGYS ELRLCHCTRAWATRAKLHLKKK 
KKKGPLRALKKIGG 


5430 


19331 


A 


5469 


1 


155 


NKFPVLISNVINI IGMCHCTPAWATERD 
SVSKRKKKEKKRSKIARHSVLCL 


5431 


19332 


X 


5470 

I 


ni 

1 u 


1 
1 


FFF FF PQKTS SRLGTV 
GGRITRSGD 


AHACNPSTLGGR 


5432 


19333 


A 


5471 


407 


9ft 1 


NGM I PGGRACSELRSRHCT PAWATBQDS 
IS KKQNKSKNPGLK 


5433 


19334 


A 


5472 


1 


177 


GGGCSEPRSRHCPPAWATERDSYLHKMK 
QS KTKGERMEVAC YCI I ILFMLTDALEV 
LMA 


5434 


1Q335 


A 


/ j 


to 




KERREEKEKEKRKRSERKGRKKEGRRDR 
EKEKEERKDERREGEEGEEGRKEKKEKD 
IKRRGWP 


5435 


19336 


A 


5474 


133 


2 


LCRMVVLFCFEAGSHpSVTOAGVQWCGIuN 
SLQPLPPGFKQFSCL 


5436 


19337 


A 


5475 


5 


394 

• 


yGVGTKGFRAGVKEFQALFLLITVHSQL 
CHIEGTRHHSRHI I WVDFINQIENGTF 
FPLVLSNGCIWLLIVYLFWNIFTRQEGQ 
FF YGKLSHYS VFVFCFLRRS FALAQAGV 
RWCDLGSLQPLPPGFKRF 


5437 


19338 


A 


5476 




7 


SEKPWVVALTLMTFSHFYFFIYFYYIFI 
FQMESCSITQAGVQWRDLNSLQPLP 


5438 


19339 


A 


SMI 


111 


2 


FLFLKQDLPGAVAHACNPSTLGGRGRQI 
TRSGDRDH 


5439 


19340 


A 


5478 


232 


0 


HLVWRMESGSVAHAAVQWCDLSSLQPPP 
PRFKPFPSS 


5440 


19341 


A 


5479 


388 


225 


ALFSSKKKKKRKKERKRKKEKKKKKESM 
QRKGGHLPRVCCGASMVSPGHKLSL 


5A41 


19342 


A 




15/) 


0 


HR VAVI YEVFFVCLFF1 
TQPGVQWRNLS KLQPPI 


fFLFLRQSLTIiV 
JPGFK 


5442 


19343 


A 


54521 


I JO 




DSVCTCVLHTCHSSGLSQGRSCLHCSIiG 
LPNSNCPCVCVCVCVCVCVCVCVCVSFC 
PLHQGVCVCVCVCVCVCVFLSAHFTKGC 
VCVCVCVCVCVCFSPPI 


5443 

• 


19344 


A 


5482 


391 


115 

Hi/ 


RACPDCRVRFPHTSPPCLPCGPEAEPGP 
GPALRELVQPLPGQliQPPFGMPPSGLGA 
APSPAPPACAWTRPPHLHPSSFSSSVPQ 
ISSLFLCF 


5444 


19345 


A 


5483 


392 


2 


FFFFFSETQFRSCCPRLECNGAISVHCN ' 
LRLPGSNNSP 


5445 


19346 


A 


5484 


381 


282 


PVPWGQAIFLPPPPPYLGIPGPPPPPGL 
FLFP 


5446 


19347 


A 


5485 


407 


3 


FFFFFFLRRSLTVAQAGVRWCDLGSLQA 
PPPGFT 


5447 


19348 


A 


5486 


2 


417 


QENHLNPGGVGCSEPRSCHCTPAWVTEY 
DKKKKK | 


5448 


19349 


A 


5487 


3 


413 


VYLHPSLQPSLSSLVPDQALSHDIWIjKK j 
KKKKKK2CKKKKRGGIS PPS P VGGVGPRE 1 
TWGVWGPGANLFQRGGGVRBLFSQKKKG 
EKGQRNPEKXPPHKRGPGGPKEKFIFFE 
KKKKKGGKPFGKKTKP PPPRVNTPGG | 
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SEQID * 
NO: of 
nucleotide 

■J IIVIVl/MUv 

sequence 1 

■ 


SEQ ID 
NO: of 
nen fi Ha 

pv|/UUC 

sequence 

* 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
corresDond 
ing to first 
amino acid, 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
dine to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F^Pbenvhilanhie. 
G=Glycine, H=Histidine, I-lsoleucine, 
K=Lysine, L^Leucine, M^Methionine. 
N=Asparagine, P^roline, 
Q=Glutamlne, R=Areinme, S=Serine. 
T=Threonine, V=Valine, 
W^ryptophan, Y=Tyrosine, 
X^Un known. *=StoD cod on. A*oosslble 
nucleotide deletion, ^possible 
nucleotide insertion 


5449 


19350 


A 


5488 


229 


411 


IHGPGFKNGGRGKTIiGLPNLMAIiKKIPF 
SPGGYKPKNP' 


5450 


19351 

* 


A ' 


5489 


190 


50 


YKHHHHCHHHHHHQCHHRHHHDHHYHGD 
NDYS SHHSRSTCHVAVTV 


5451 


19352 


A 


5490 


615 


402 


RWCLTLS PRL E CS GM I S AH YNLHLPGS K 
QLSCIiSLPSSWDYRCPPPGKTTFSRGIR 

^^mmmmm^ ^^^mwm^w ma^mw m*mr mmsw W w mm* mm fr^^aWMm mm mm * w 4w mm, mm **** * mmm m>^ 

DALKPRKPKRINLI 


5452 


19353 


A 


5491 


338 


53 


KGPPFFFFCKKGGLFCSNWFLTPGGKWF 
SPPTPPRSGGKRVPPEGGGKGFSLFFFV 
EGGS CHIAQAGLKUjGS S DLVTSASQSA 
GIMGKSPHAWP 


5453 . 


19354 . 


A 


5492 


266 


412 


NPNFFERKS CSVSQAGVHRRBLNS LKAA 
PSGYTPASCVSLPSSWDYRRP 


5454 


19355 


A 


5493 


285 


412 


RDGVSALSPGLECGGMI IAHCNLELLGS | 
SDPSAIASQSTWNY 


5455 

■ 


19356 

• 


A 

- 

• 


5494 

■ 

■ 


1 

- 

■ 

■ 


1350 

■ 


MGFMTGIC I EDNWGPS PESGSVLTVGG 
VR IQMLDRCHTAHCPVI PGLLRGQLWTB 
NIAERNSHDRRPFPTWCSHFAQDLWPE 
QSIKDSYQKVILRKFEKCGHGNLHFKRG 
CESVDECKLHKRGYNGLNQCLTTTQSKI 
FQCGKYVKVFHQFSNSKRHKRRHTEKKP 
LKY I EGDKAFNQ S STHTTHKXIDTGBKP 

\TT/f"»'I7"E , /""V' V7\ tFWD Q OUT TTUirTTLI'I'lJTJXrD 

xit^iSi!iU^i\/vriNK£>o£lLil In JV1 InlK&JxJr 
YKCEECX3KVFKYFS3FTTHKKIHSGBKP 

V T CRVCCZ\C A T7MYP VTT .TTWTT T TWTWRn P 

YKCKECDKAFNHPATLSSHKKIHTGBKP 
YTCDKCGKAFISSSILSKHEKIHTGEKP 
VTCC'EFPGKAPTRSSHLTIWIKTIHTGHICP 

YKCEECGKAFTWSAGLHKHRRTHTGBKP 
YKCEE CGKAYTTS SNLTBHKTTHTGBKP 
YKCKECGK&FNWSSDIjNKHTCRIHIGQKP 
RT 


5456 


19357 


A 


5495 


4 


146 


P PTRP TCYYLS FYVS YPTGY KEDTRMVN 
KHMKRCSTSLVYLIiWINFY 


5457 


19358 




5496 


5 


388 


CGCFFFS PPPKKKVFPPPFFFFAFGFFP 
PPLFFTPPPQFFFWAPKKKKNSPPPPGK 
KIFFLKGPPPP 


5458 


19359 


A 


5497 


231 


163 


FFFFFFFFFFFFFFFFXXTNKWF 


5459 


19360 


A 


5498 


163 


2 


SVKFTLGKVSQFLKKHTPAGWAHTVI PA 
LWEAEAGGSRGQE IETILTNTVKPH 


5460 


19361 


A 


5499 


229 


388 


RTHLQAS SDLFCLYS LKTGS CCVAYASV 
QRHDLSSLQPPPPRFKRSSCLRPPE 


5461 


19362 


A 


5500 


405 


289 


HV AQAQL KLLAS S D P PT PAS Q SAGVTS V 
SHCTHPRCLFV 






A 


JJvi 




37^ 


GDFVRFS F WPGGGAHS CNP S TLGGRGGO 
VT 


5463 


19364 


A 


5502 


20 


374 


IIiQFQNFWIVLFKWFYHPGKFLIYILNW 
FSGLFWYYFSELS CFSLSS KINI LNLLS 

m» hrW *■ *m It mm> mm »r i f Hi 1 1 WWMi«*4w AMil^MiV 

GILNWFSISGVRKKIF^IKRYCWU3TV 
THACEPSTLGGQGRQITRSGDRDHPGQR 
WKIQN 


5464 


19365 


A 


5503 


1 


185 


FHH I GQAGLI LLTS SNLPASASQKAGIT 
GMSHCTQPI I IFLYSLFLQLSFSWQDFV 
MRKKF 


5465 


19366 


A 


5504 


202 


402 


NSFSSTAKLREKYRQFLYTPIPTGTQPL 
PLLTFCTRPITTVAHACNHSTLGGOGGQI 
TRSGACNQPGQ 


5466 


19367 


A 


5505 


118 


7 


IKKENNWPGAVAHACNPSILGGRDGRIT 
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SEQID 
NO: Of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 


M 

#h 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidirie, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methk>nine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine J 
T=Threonine, V=Valine, 
W-Tryptopban, Y^Tyrosine, . 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possfble 
nucleotide insertion 














RSGDRDHPG 


5467 


19368 


A 


5506 


205 


422 


ANHFTS IRMAI INNNKQTBNNKCWQGQG 
EVGTLVHCWWBCKMVWMLRQIVWSFLKK 
LNIQLSRDPATLLLAI 


5468 


19369 


A 


5507 


3 


425 


RS CHCTPAWATERDS VSQKKKKKKKKK 


5469 


19370 


A 


5508 


3 


86 


EKKLQVLLNCMTEIYYQPKKDKAERSK , 


5470 


19371 


A 


5509 


395 


108 


GVLBCWDTITAHCSIiDLVGSSDPPTSAS 
QVTGTTGTCHHAQLGLPKCWDDRHEPRL. 
PAWNKTLRRKNWHS LIiPVVPAIQELE VG 
GSLRPRSLRLE 


5471 


19372 


A 


5510 


421 


59 


PFFPPFLRRSLAVS QAGVQ WWDLGS LQA 
LQA 


5472 


19373 


A 


5511 


168 


3 


EICKMCFPVPFGVFFLWIVFVFVPVFVIi 
RWRMESLSVSQAGVQWCNLSSLQPPP 






A 




AAf. 


1 AO 


FFFFFFFCFFETESCSVARIiESSGMISA 
HSSHNLCI 


5474 


19375. 


A 


5513 


74 


398 


DRI I DAG P KGNYARFMNHCCQ PNCETQK 
WSVNGDTRVGLFALSDIKAGKNHFRILQ 
LTSEFOGFCFLQTASS DYNFKHFYMYRS 

* mmw mmw ^rt 1 A 4b Art •m m iy rt» 4k ^ A A & A ■ mm m> 

LELS S FTFLCNHHHHPS PELFS S S 


5475 


19376 


A 


5514 


73 


335 

■J J m> 


EGGSLFCPGWSAGALFWLTAPSASRGWG 

Art mm* ^mw mmr i A a ^rt v v * w ? Art rt> a a m nrtrti mmmw m w^rt v » ^rt 

GAPGVPWGEVGQKNCWNPGGGGCNELG 
LHHCI PPWGTGKTLS PKKKKKKGATDMG 
GAS 


5476 


19377 ) A 


5515 


416 


0 


WLGPGAVAHTCNPTTLGGRGRWITRSRD 


5477 


19378 


A 


5516 


242 


423 


AGSPLIHQGGQAI ILTQNPTPGWGTMVF 
QPVLRPVKGLYNANHVTSSPVASQPIIF 
.TTAG 


5478 


19379 


A 


5517 


2 


151 


GR VG FVGMGS HCVAQ AGLELLG S GG P PT 
SASQSAI IAGVSHRAWPYFPS 

u 'mmr m «PNrt A B rtw rtV • A^rt — ™rt A aa rt> ■ • A m* 


5479 


19380 


A 


5518 


312 

i 


405 


GQVRPGAVAHACS P NT LGGRGR I TRSGV 
PDQ 


5480 


19381 


A 


5519 


3 


387 


ACSGPRS CHCTPAWATHQDS VS KKKRKK 
KREKPFLLEKKRGPTLTWGEPKKGXPXjG 
ERNGPPSHFKKGPPLAPKPPGGNLHPEI 

4b^*%rtx^^rt^ mm ^rtA^iA. A %**^rt A A Airtrt, Mrta ^^A* As V WAi'M Art mmt 

AQGPGS PTDGNKKGDLNPTPVFYKFKTP 
EQAINWAAPPFPSEK 


5481 


19382 


A 


5520 


292 


140 


KKGPPRARPGGGGSPPLFFFFFFFFFFF 
FFFFFWESTLKKRELHRSDLCV 


5482 


19383 


A 


5521 


1 

• 


399 

• 


KKWLFLS S AQ I PTKLSRRRRNRETWPKE 
RS KINLHKP ILKKQRHELPDKEFKIAL I 
KMLYELKVTMHEQNENINKETENTKKKK 
KKKENPPLFGTGKNQSPPBKFQMGGQPP 
PPPPQJOCKPPPPZ^RFFVFSPP 


5483 


19384 


A 


5523 


316 


399 


MLLYWPGTVAYACNPSTLGGRGGRITRL 


5484 


19385 


A 


5524 


400 


0 


FFFFFFFCFFFFFLFFFS 


5485 


19386 


A 


5525 


3 


429 


TTNIGRAMGATTS I VGS DTS QAERPGGT 
TWSPGASSTSQSSRPGTSVTPDSSAS^ 
SETVTTKE FSGTTAI S RTSHTGTPAASG 
GQATGSLTATTGVAPGTTVAPGS SNTEA 
TTSVGEKKKDQKKKKASSP 


5486 


19387 


A 


5526 


135 


3 


GGKICTLI KQSVIQESWPGTVAHACNTS 
TLGGRGGW IMKSGDR 


5487 


1938S 


A 


5527 


105 


5 


KKKKKKKSGRWPGAVADACNPNILGGQG 
GQITI 


548$ 


19389 


A 


5528 » 


411 


31 


GQRVPLSRGVAKKKKFLWNPGGGAGKNG 
GKNFLKKKPPGAPPPSGGPKKGPGPPPP ! 
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SEQED ' 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
Den tide 

sequence 


M 

etti 

od 


SEQW 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 


Predict- 
ed end 
nude- 
otide 
location 


Amino acid sequence (A=Aianine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenvlalanine. 
G=Glycine, H=Htstidiae, I=Iso leucine, 
K=Lysine, L=Leucine, M-Methlonine, . 


• 




: 




ins to first 

+ + mm^ ^mmj mm\m mmr^ 

amino acid 
residue of 
peptide . 
sequence 


correspon ■ 
ding to 
last amino 
acid 

residue of 
peptide 
sea uence 


N^Asparasine. P=Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V«Valine, 
W^Trvptoohan, Y=Tvrosine» 

» » • W mm V U M mm mj m\ mm, j mj mm mm* mmjm mfmmm 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 




■ 






■ 




KKPL I LVS PPPRKGGAHNMHFRQKKSGN 

KGGEGSTKGGKEKI flgkfvqknfffff 

DPEFHSCCLGWSAMP 


5489 


19390 


A 


5529 

» 


411 


142 


PRFS FLFS PPRKKEVFP P PKILGGFRFF 
PPPRFLKPPPHFFFWGPKKXKKFPPPPG 

mm m» mm m> m 1mm ma m m mt mm mm mm mt m mm mm mm w t ^ ^» ^» Mi ^» ^M 

EKFFFFKGAPPPFFFFFFFFFFFFFFF 


• «j*t7v 




A 




174 


401 


QVQPQQKKK3CKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKASPGGGGKKKKK 
ERGGGGERKNI I WVPLLFPP 


5491 


19392 


A 


5531 


410 


0 


QPQS KKKKKKKKKKKKKKKKKKKKKKAS 
QKKKKKKKKKKKKKKSSGGGFFKEV 




17J7\3 

t 








145 
if j 


GGFFFNPPAKKGSFPOPHNLGGPPVFSP 

UW4 mm. mm mrW -*r mm n4»tMJh'4 ^ + m>m\V mmmV+mmt Vm* ^ mm W m± mmr m\ 

PPVFKTPPRNIFFGAPKKKKFFPPPGAK 
KyFFKKAPPPFFFFFFFFFFFFF 


5493 


19394 


A 


5533 


418 


328 


KTG FRHVGQ AGL KLLASSDLPAXASQS A 




170 50 


JV 




1 CQ 
1 07 


•i 


I I LI I HPCSHVPRRETLGLKACTFKNLF 
WPGMVAHACNPSTSGGRERRITRSGDR 


5495 


19396 


A 


5535 


1 


254 


PTRPCKPPQTSPNSGKLSPSSNGCMNTL 
HISS TNTVGEVI EALL KKFLVTES PAKF 
ALYKRCHREDQVYAOd>SDREHPLYLHD 


54yo 


iy3yy 


JV 




A 1 


At\A 


fWlTiTVT .PPTiVT.N^WPDTCP ^ ^RTdRTtPKP 
mjwij x vxjtrXvLiV uvto h ruivnooi\ii\3iJiri3\< 

WDYRYEPPHQAN 


<yfQT 




TV 
*\ 




Z 


zi£ 


NSVSKKKKKNFRGGGLKGKGPGFYPLEK 
KFPGVGFFP KTNLV 


j17o 

■ • 


17J77 

- 


A 

• 


SS10 




1R7 
JO/ 


I FOEHKNCGEMSE IEAKVKYVKLARSIiR 
TYGVSFFLVKEKMKGKNKLVPRIiLGITK 
DSVMRTOEKTKEVl^EWPLTTVKRWAAS 
PKSFTLDFGEYQESYYSVQTTEGEQISQ 
LIAGYIDI ILKKKKKKD 


5499 


19400 


A' 


5540 


317 


407 


GLSHHVFFEEGKLKKPRIKDKDKKVPEP 
DN 


5500 


19401 

1 /TV 1 


A 


5541 


120 


3 


NFFFFFI1RRSFALSPRI1ECSGTI SAHCN 
LHLPGSSDSP 


5501 


19402 


A 


5542 


430 


0 


QKAQSNG PEKQEKRGVI QNFKRTLS KKK 


5502 


19403 


A 


5543 


172 


2 


FIFNFLHIFSFWFITKFGCFIVLFLRQS 
I1AI1LPRPGVQWRDI1GSLQPPPPRVRPRV 


5503 


19404 


A 


5544 


149 


2 


EKDIQELNSALHQADLIDIYRTLHRKST 
FFSAPHRTYSKIDHIVGSKAL 


5504 


19405 


A 


5545 


84 


276 


TLKKRP KE I RNQGG S PQRKRAQLGKLQQ 
KYP VLNFKAPFFGERGDYYKS FFKTCLD 
NLPRRGKG 


5505 


19406 


A 

• 


5546 


2 

• 


914 

• 


RPAWHEEQKS KNE PEDTEDKBGCQMDKE 

JrOA V l\m\m\jrl\Jr Xntr VJaXAialaXJAO loriUfi 

KADPGAVKDKASPEPEKDFSEKAKPSPH 
PIKDKLKGKDETDSPTVHLGLDSDSESE 
LVI DLGEDHSGREGRKKKKEPKE P S P KO 
DWGKTP P S TTVGSHS P PETP VLTRS SA 
QTSAAGATATTSTSSTVTVTAPAPAATO 
SPVKKQRPLLPKETAPAVQRVVWKSSSK 
FQTSSQKWYMQDSSSSSSSSSPQNQQQQ 
PQSSQGTRYHTRQAVKAVQQKEITQSPS 
TSTITLVTSTQSSPLVTSSGSMST 


5506 


19407 


A 


5547 


2 


244 


TRLIFVFLVETGFRRVGQAGLKLLTSGY 
VGTSASQRMTAHLCLPRCWDYRGEQLHP 
AQRFFKRMLSVAVDAAIiLHTSGGM 


5507 


19408 


A 


5548 


406 , 


1 


FFFFFSETESCS ISQARVQWCNLSSLKP 
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SEQID 

NO: of 
nucleotide 

sequence 

• 


SEQID 
NO: of 
neotide 
sequence 

■ 


M 

& 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 

sen u en r£ 

1 


Predict, 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last Amino 

acid 

reciHne of 

peptide 


Amino acid sequence (A=Alanine 
0*Cysteine, D^Aspartic Acid, 
E=Glutamic Acid. F s Phenvlalanine. 
G=Glycine, H=Histidine, I=Isoleudne, 
K<=Lysine, L=Leu cine, M=Methio nine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginiiie,'S=Serine, 
TtaTh reo nine. V=VaI in e. 
W-Tryptophan, Y=Tyrosine, 
X=Un known *=Stnn cod An. /—nossible 4 
nucleotide deletion, V=possible 
nucleotide insertion 


5508 


19409 


A 

* 


5549 


417 


43 

■ 


IRPARKGCSSGLFGAPLAGGGAPPPPPP 
RQTSPPPPHTKGGGGGPHKPPPKKKRPP 
PKKKGGGKKNPPKKKKKNTPPPPKSAPP 
PAAKTDPPPPLTVPPFLPPYIFTKKIPF 
LKSYFFFFY 


5509 


19410 


A 


5550 


3 


136 


QPR FPTLTNINENNPTLLNP I KRLAAGS 
LFAGFLITNNISYHSP 


5510 


19411 


A 


5551 


75 


194 


NKELSRNKKKKKKKKKKKKKKKKKRGGG 
LKKKNGGGAKT 


5511 


19412 


A 


5552 


343 


493 


IIFSFFFFFFLKGSFVFAPQAGGHGGNF 
GPLKPPPLGLKEFSCLTIiLSSW 


5512 


19413 


A 


5553 

* 


148 


3 


DAVSLFCPGSLEHSDTI IAHYSPDLLGS 
SSLS ASTSRVAG ITGICHHA 


5513 


19414 


A 


5554 


407 


252 


KKKF L KKLAGQGGMRbQ S LLLRRLRQ ED 
HSSSGVQSCSELWGGHLIYSKNR 










i 
j 




HKFVPT.VKTR FHTTVnRAfJT.FT.T.TRSYPT. 
exetc v cuv dim nnvwimvii iri i ii mu j rii 

ASAFRKAKARREDLLSPGVKDKPGQYRD 
TLTLKKKKKSFFIYLGECW 


5515 


19416 


A 


5556 


3 


131 


SWVS WBESKDAKELVG YY I EAS VAGSG 
KWEPCNNNFVKGSR 


j j Id 


1Qit1*7 

1?417 






J 




KWE P CNNNPVKGSR " 


5517 


19418 


A 


5558 


462 


255 


RSHHSAQAGLELQAS SDPPALTSQSAGV 

TMVQT .P AHP T T.AC Q Q VPVQ <3P (ViVfT^HV 

RRGKCPRAESYR 


30 lo 


10/11 Q 


A 




t r t rj 


£y 


ARE WHRTTKMS AAGT YAWMAPEVIRAS M 
FSKGSDVWS YGVLLWELLTGEVPFRG ID 
GIiAVAYGVAMNKLALPIPSTCPEPFAKL 
MEDCVJNPDPHSRPSFTNILASCQAWR 






A 






475 


GAVESSCTEEHMW I PEKAVEGSLTPTPE 
SRSVAHAGVQWHNVSSLQPPPPGFRRFS 
CLSLPSSWDYTSAGEGCAASYRSKVLK 


5520 


mix 


A 


5561 


248 


142 , 


NKNIWVQKKCVFYIYLPFICVCVCVCVC 

vcvcvcv 


5521 


mzi 


A 


5562 


262 


456 


NFPLKTTYFLKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKASSPPPLIKRGPKN 


1 


1Q49^ 


A. 






444 


LiKSMFCFVLFCFFE I ESRSYAQAGVOWC 
HLSPLQPPPPGFKRS 


5523 


19424 


A 


5564 


220 




LNSSHPESPLPLFGLVLQFFYPSNPKSF 
NKKNLEIRMLLTVDRGPNHPDSLFFVHS 
NVDTVFLPQEGPSFLQPLGQGITEAFEH 
NQAYV 








a 


<<<< 


1"7<? 
1 /O 


492 


ATRLASISGASPIFRSFPYLQPTAPGLV 
LPS I L ILVDVGLRXEKGPDTVAHACGPG 
TLGGRGGRITRSGVRDRPDRH 


5525 


19426 


A 

* • 


5567 


24 


470 


PMDXKXPXXXETPLNGGAGENQDDMFAK 
LKEKLFNEINKIPLPPWALIAIAGVAGIt 
LlxLTCCFCICKKCCCKKKKNKKEKGKGM 
KNAMNMKDMKGGQLPQDDDDAKTGLTEG 
EGEGEEEKEPENLGKLQFSLDYDFQANQ 
LTVGVLQAA 


5526 


19427 


A 


5568 


45 


485 


GXDGRLLRGHNQYAYDGLDYIALNEDLS 
CWTAADTVAQI TQRKNSAAREAEQLTAY 
LEXSLCVEWLRTHlxENGKETLQRADPPKT 
HVTHHPVSDHEATLRCWAIiGFYPAEITL 
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sequence 
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SEQID 
NO: of 
peptide 
sequence 


M 

etH 

od 

■ 


SEQID 
NO: in 
USSN . 
09/515,1 
26 

- 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid ; 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to ~ 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, 0=Aspartic Acid, 
E=Glutamic Acid, F=PhenvlaLanine. 
G=Glycine, H-Hbtidine, I»Isoleucme, 
K=Lysine, Lr= Leucine, M=Methionine. 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X«Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 




■ 










TWQRDGEDQTHDTEIiVETRPAGDRTPQK 
WAAVEVP 


5527 


19428 


A 


5569 


221 


422 


FFFLTGRVRQSLADLEQKIEVHSMPPPE 
SLALS PTL YCRGAI LQATSASQVAG ITN 
ICHHAQPSYLF 


5528 


19429 


A 


5570 


401 


477 


IFSRDKDVAMLPRIVUffSWAQVILL 


5529 


, 19430 


A 


5571 


102 


36 


EVENNDDI SHHHHHHHHHHST [ 


5530 


19431 


A 


5572 


106 


407 


AGRQG WIiEGCRPAAWPTLLTP PATSSR I 
GASRPPTATCPTPDPAPATTTWQPWPPR 
PLWAGVQWHDIX3SSQPPPPGIKQFSCLS 
CPS S WDYRGRRAVCHH 


5531 


19432 . 




5574 


253 


363 


YHSNQCVGLRKWPGAWHACNPSTLGGR 
GGWIMRSGA 


5532 


19433 


A 


5575 


1 


132 


MECHYVAQAGLELPGSSDLPASASQSTG 
ITGVSHCAWPGFFFFF 


jjjj 


tQAIA 




Jj /□ 


lo 


■i fo 


KKKKKKK 


5534 


J 9435 


A 


5577 


21 


232 


ILRCTTIETLLKVQDNHLNPGGGSCSBP 
RSHHCTPTWARERD SVS KKKKKGGAALW 
DPGGGQP8AGSQTR 








J J JO 


J / / 


9 


LSL I OT I TLCLGAI TTLFAAVCAVTOND 
IKKIEAFSTSSQIGLIIVTIGINQPHLA 
FLHICTHAFFKAI LFMCSGS I IHMLNNB 
ODIPJCIGGLLKTIPLTSTSLTIGSLALA 
GIPFLTGFYRTRG 


5536 


19437 


A 


5579 


2 


227 


I ES WLE P VRFLRS M FANNL VYD TSDSDV 
YHLLKDLEEGIQTLMGRLEDGSRRTGQI 
LKQT YS KFDTNSHNHVDAA 






a 




1 1 j 




N FP F FFFLiRDS I1AVAOAGVOHCDI1GSI1O 
PPPPWFKQ 


5538 


19439 


A 


5581 


184 


3 


I MP PHS RliS PTARFCLQKKTNNNNTP I E 
MAS IQCPGWSQTPGIiKPPSCFVLPKCWD 
YRCE 


5539 


19440 


A 


5582 


381 


3 


FFFFLGDRAPPGGEKNPLGGGLKTPGKP 
PGGWGVS P P PQKKKM P PPN F F P PGKKKS 
FGGGGPIFFPGKNFPPPTQKKKRGGGGF 
FFFFKKPGGFFFFFFPFFFFFSSFFI/jrK 
SFIKISFKKRRRG . 1 


5540 


19441 


A 


5583 


314 


389 


FSCLSLLS SWDYRCVLPHLGNFLYF 


5541 


19442 


A 


5584 


131 


• 


FPLWTSEQGVGRNKQTYVTWQADCKENA 
GGD Y YWTFFPQ PTFV 


5542 

• 


19443 

• 


A 

• 


5585 


484 


3 


MTEILLKQAMVGIVGS IGSAIGGAVGGG 
ASASGGTAI QAAAAKFHFATGGFTGTGG 
KYEPAGIVHRGEFVFTKEATSRIGVGNL 
YRLMRGYATGGYVGTPGSMADSRSQASG 
TFEQNNHWINNDGTNGQIGPAALKAVY 
DMARKGARDSNS AGDQ I FHH 


5543 


19444 


A 


5586 


136 


48 


NTMQPG WTHTCNPS TLGGQGGRITGSG 
R 


5544 


19445 


A 


5587 


306 


479 


DS FFPGVGSAFL YQVAAWTTKYFKKR V 
ALSTDIARSEMRLTFLLAPFTKFLIDVC 
DW 


5545 


19446 


A 


5588 


201 


2 


ILETGFHHVAQAGLNLLASSNLPASTSQ 
SAGITGVNHHAWPSFLIQGTKHSFKNSK 
PVFHIKLSKN 


5546 


19447 


A 


5589 

* 


30 


477 


TPSIARGVYPCWLYTGFLHDFLFSPKIE 
ERFMDVGVLRATBPSNSLFAI I YSTPIE 
LAXKS WHI XGDGLS IEL VAVERGNRTTI 
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NO: of 
nucleotide 
sequence 
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SEQID 
NO: of 
peptide 
sequence 
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M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

t 


Predict, 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine. 
G=Glycine, H=Histidine, I=Iso leu cine, 
KHLysine, L=Leucine, M=Metbionine, 
N=Asparagioe, P=Proline, 
Q=Glutnmine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=»Un known, *«Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


- 


• 




• 






ISRLPEFEKSSLSDQSLVTLTSGYFAVY 
RVKLTAKKLEGIHEGAFQITTDYEILTI 
PVKARNALC 


5547 


19448 


A 


5590 


208 


91 


KGHFWVTGKKKVPCPPPRFFFFFFLDQI 
SLCYPGWSAW 


5548 


19449 


A 


5591 


325 


467 


ADLCSPCDILQLDFRHIRKTVDTLLALG 
EKAP PPTSALRSRDLISFCF 


5549 


19450 


A 


5592 


332 


403 


LSLCVXHTHTHTHTHTHTREIFIY 


5550 


19451 


A 


5593 


292 


26 


FLLDMEGAEAGAEU3LYHCLLCAWETPS 
RLAVLQHLRTPAHRDAQAQRRLQIiLQNG 
PTTEEGLAALQS I LS FSHGQLRTPGEEE 
GTGE 


5551 


19452 


A 


5594 


387 


51 


LKKIFLGRGQGVSLFFSPPLGGPGGRFL 
GLE I GAL FGL PGKTP PF PK I KKNYPG L W 
GGALFPLPLEGLGGKMAFPRKGGAPFSQ 
KNPPPPPLK3GKTKPPPPLKKKKKKKVL 


5552 


19451 


A 


5595 


88 


362 


AETS PGQGS YQDGIjRRPGFAEGP PAGRN 
PRRKKTQTDRRGGSAGSQPQACADGVRIj 
CTRLQCSGAISAHCNLCLPGSSDCPASA 
SRVAGIT 


5553 


19454 


A 


5596 

*r *r m \J 


92 


366 

I 


DQPWPGQLPGRfl PEAGIRGGSTSRKKPQ 
EEENS DRS PGRQRG i PASGVRGRCAALY 
QAAVQWCTLGSLQPLPPGFKRLSCLSLP 
SSWDYSH 


5554 


19455 


A 


5597 


3 


138 


TRPRTRGVGQAGLELLTSSDPPTIASQS 
AGITGVSHRARPQSSS 


5555 


19456 


A 


5598 


150 


3 


SCVIYSWMKFFSFLRWSFTLVTQA3VQW 
CDLSSLQPPPPRFKLFSCPG 


5556 


19457 


A 


5599 


148 


3 


TQHLNLAKKSQ I T PGMVAHS VAHAGRQW 
CDLGSPKPLPPGLNRASCLP 


5557 


19458 


A 


5600 


195 


1 


MVY F KKFCG F FL FVRS R I ACS VAQVGMQ 
WCWHS S LQPRTPGLKQSSHFS FPSRWNN 
RHMQPCPGA 


5558 


19459 


A 


5601 


228 


415 


VNVS I PKDVFLLI I I ILFFFGPAPQAGG 
RGCNLNS LL P WP PGLKQ FS CLTLP RSWN 
YGLPPP 


5559 


19460 


A 


5602 


405 


2 


AATGVDRHAQRTQAGLPLKTPQTPCPNF 
TS KARGTPSAQHPS CTRQRKQASNPTQR 
RRRR P QAEHS I R TD AHG PG L PAS G HMAG 
ERPTRKAQSRDVSVLRQRLALLPRIiECT 
SAISAHCSQNLTGSGNPPVSCL 


5560 


19461 


A 


5603 


3 


236 


RQENGMN PGGGACS E PRSRHCTPAWATE 
RNSVS KKKG I KI KGHCS SAMHHQ YI*SP F 
GWRQVCSHCVCPKPWLLVRGH 


5561 


19462 


A 


5604 


184 


2 


RPFLRIQGKIRMFSGSRFIFNFIjXLETW 
SHSVPQARVQWCHRSSLQPKPIiVFKSSS 
YLLP 


5562 


19463 


A 


5605 


261 

■TV 4 


2 


GWFLKGPPLFLGPKKPGGLGFFQKKLVL 
QKKNPPFGIGPPKNLKKTPLFFFFFLRR 
S LALS PRLECSGT I SAHCNLR I PRTSNS 
PA 


5563 


19464 


A 


5606 


134 


2 


GSPFKNLVFFLGQSLTLSPRLECSGTIS 
AHCKLRLPGPHQS PAS 


5564 


19465 


A 


5607 


3 


269 


RQESHSNPGGGGCGEPRSRHCTPAWIAT 
ERDSVSKKKKRKRNFLNFIGKKKKPCLP 
KPKLTKKKKWGGITALVLKLYKRAIAPK 
TGGY 


5565 


19466 


A 


5608 


36 


380 


PPFXXPPGGETFGGTPRGPPPKKKKSPR 
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SEQID 
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sequence 


SEQID 
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peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN. 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict, 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F s Phenylalanine, 
G=Glycine, H=Histidine, Msoleudne, 
K=Lysine, L= Leu cine, M=Metbionine, . 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V-Valioe, 
W«Tryptophan, Y-Tyrosine, 
X^Unknown, *=Stop codon, ^possible 
nucleotide deletion, Impossible 
nucleotide insertion 




■ 

■ 


■ 


■ 


■ 




. KRGGGGKKKP PRGAPP PKHLLGGKNRGG 
GTPPPPRKTPfcLNPKKKRGGGKRGVFFP 
PKGGIFQKRGFFFSRGGGGGAPPPHPPP 
GGG 


5566 

• 


19467 


A 


5609 


3 


411 


SSRSRAAALFYFFFFSRRGQKEKEPFLG 
GKTPPVFVFFPGGVGKKNRGFPPPPPAG 
KGPFSGGGGREQTPIWSPPPPPRGKWGK 
PPKPVTPNFNFLGRVGFLGKNPSPGGVC 
PPPKTPIFFFGRPPPGGFFWGGGG 


5567 


19468 


A 


5610 


364 


483 


| I SWVLFFLFLKWS FVLVAQAGEQWRGLG 
SLQPLPPGLKGF 


5568 


19469 . 


A 


5611 


399 


285 


GGCSEPKSRPCTPAWGTERDFVSKKKNK 
NNYLVKKSN 


5569 


19470 


A 


5612 


408 


3 


KEVFS FP PGLEHGG VFYSLP PLPPRVKG 
FSPPPPPGGGGPRGPPPPPGYFLLFPKK 
GGFPPLPGGFSSPQIWPPRPPQKVGVPG 
GAPRPGPNFFFFFFRESFPFLARGPPKK 
KKKTCWNVTI SHSASVQSS VLL 


5570 


19471 


A 


5613 


1 


112 


LAQHVFFLLNTS IAS PADS SELLMEVHG 
NGKRPSPER 


! 5571 


19472 

* 


A 


5614 


2 


389 


FVREGERKKEKRKKEKKRKEGTKEKGKE 
EGRKERKKEKVRKKTVDDRRRPQVTFQA 
VNLESEGAPGNCGVTAPPGGHIWLKQVE 
VQTHLBNRCHRLKATVTTTTRPGAEPRS 
PPFI ITCGEIMNVFSDM 


5572 


19473 


A 


5615 

• 


385 

- 


3 


, KPPPGDPGKTPFFLKSQKITGGGGGAPL 
FPPPQKVKAKKRGYPRRQS FKRARLGPC 
PPSRGGKKKAGFKTNPPTKKKNQTKKPM 
KAKTS KLLQENRSGPGTVAHACNPSTLG 
GQGGW XMRSGDRDTK 


5513 


19474 


A . 


5616 


218 


402 


RS YS FSQhVGTVl KARKPESSCWLCAPN 
RKNAFWPGAVAHT CN PS TLGGRGGRI MR 
SGDGD . . 


5574 


19475 


A 


5617 


2 


235 


KSFG CASRIjQMHGRTHTGE KP YKCKQ CG 
. KAFGCPSNLRRHGRTHTGBKPYKCNQCG 
KVFRCSSQLQVHGRAHCIDTP 


5575 


19476 


A 


5618 


347 


421 


IYLXXXKKKKKKKKKKKKKQKKKKK 


5576 


19477 


A 


5619 


92 


1 


ENLALLPRLECNGTILAHCNLRJLPGSSD 
SP 


5577 


19478 


A 


5620 


3 


402 


APPPSFFFKKGLIMAISPTLGAIKLINF 
LFFFKKGKKKKKPPLEKKKKKKKKKGGA 
PLKKKKFS P PGGGRKI FFKGAPKKISRA 
RVKKQRRGE KTRGNP KKKRGKKKRPF PN 
TVSR 


5578 


19479 


A 

■ 


5621 


420 


3 


HPPPQGSANFFPPPRKIGINGPPPPAGI 
I FVFLECRGFPQVGQGGFKIRPPNKPPP 
PPPKNLGLQKKSPPPPPIFFFFFLWRWG 
LMTLPRLVSNSWLKQSSHLGLLKHWDYM 
SEAPHLAKNFLKEYEWSLSFHVEGS 


5579 


19480 


A 


5622 


246 


132 


EVAPHLLGC ZSPTMCVOJZVCVCVCVCV 
CVCVCLCSW • 


5580 


19481 


A 


5623 


219 


424 


ILNVMVLGIGTFGRIiLGLEGRALTNGIK 
ALIRRDMG PGAVAHTCNPN I LGGREGWI 
MRSGDGDHPGQH 


5581 


19482 


A 


5624 


32 


454 


GLEKVHSMTXDRNSRLSVQLRDWDGGGE 
LLQFSVHLDGEDTAYSLQLTAPVAGQLG 
ATTVPPSGLSVPFSTWDQDHDIiRRDKNC 
AKSLSGGWWFGTCSHSNLNGQYFRS I PQ 
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SEQID 
NO: of 
nucleotide 

• m mm Vtv W • m ^mmr 

seqaeDce 

• 
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SEQID 
NO: of 
oentide 

mr m* mM mrm mM V 

sequence 


M 

eth 

od 


SEQU) 
NO: io 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

mm mm m* mmm^j 

location 
correspond 

^m'mm m m m*^f mm ^0 mm mmt 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- " 
otide 
location 
correspon 
dine to 
last amino 
acid 

residue of 
peptide 
sea ue nee 

^ »-* ^JJ mm mm mm mmmm< 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 
E^Glutamic Acid. {^Phenylalanine. 
OGIycine, H=Histidine, I=Isoleurioe, 
K=Lyslne. L=Leucine, M=Methionine, 

mm ^» m^^^m ™^ m m « ^mm* mmm* mw mm ^m m mm mrm m m ^m mm mt m m m mm m m m mm- mmms 

N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine. S=Serine» 

------ — j ■ " o p ----- —j — ~* J 

T^Threonine, V«Valiae, 
W=Tryptophan, Y-Tyrosine, . 
X=Unknown, *=3top codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 

■ m mm mm m mm mm m ■ mmi mm i if ^# m m m m> m mm mm 














QRQKLKKGI FWKTWRGRYYPLQATTMLI 
Q 


5582 


19483 


A 


5625 


166 


2 


LFGLVKLTALKGLKMILPSQVWWLTPVt 
PTL WEAKAGGS QGQEFETS LANTVKP 

mm mm mmm w m mmmm mmm mm my^m* ^m mm* m*m* mm mm* mmmmt » ^» ~ 


5583 


194S4 


A 

» 


5626 


120 


406 


GEAATQENLAELRPEPELIiSPSTVLSRE 
PELPSPSTVLSREPELPSPSTVLSRKPD 
LLSPSTVLSRKPDLLSPSTVLSRKPDLL 
S PSTVLSRKPDLLS PSTVLSRKPDLLS P 

m^ mm, mm* m\ V 1>'«JAJW Wm m+J mm m mmmmmf fl>*li%l> mm* AJAM/ 4m 

STVLSRKPD 


5584 


19485 


A 


5627 


324 


16 


LSPPPPLPKTAPRPQKKNPPPKKJCKPAP 
PKKKNIiPFFFFFFFIiVFFF I WCS WRIiAE 
WNBTLGECDMRWGQAVGTGTSRELRPGG 
SRSQPRRLAGKELGPHCAP 


Djoj 




a 




77ft 
J to 


79 


GRGGE RP PPPOKSGAKKKPGVLNFFPOR 
GKRSGPPGPPGFFKIFSQEAPGPFSSFF 
WGRSLALFPSWGARGGFLVPANIjGPPAG 
APKGNPP S KKKKEKI ED 


5586 


19487 


A 


5629 


3 


204 


qesclnmggrgcseprs chctpawatek 
dsvskkkkkwkfggfyevignflhlfpk 
wsyppsfrif 


J JO/ 




A 

** 


i 5fi^f> 


177 
lit 


1 


sfhemrfwkli fi iri yvhferrscpvp 

mw A A m-mam m AAmmm WW m m^mS mmM mm mm mmy 4*ifB mm m mm m> mm UAVvvV ^*<fc V « 

qsrvqwhdlgslqplplgfkrfsclgll 

lg*' • » » ••^^ *«*^»* ^» w ^m* mm mm^m m mm mtmr mmimmw^mr mmmmm^ 

SNW 


5588 


19489 


A 


5631 


154 


3 


KWFQMGSRSVTQAGARQCDHSSIjRPPTP 

qlkrssclsfpggwdyrhvppc 


5589 


19490 


A 


5632 


239 


17 


egvrnylilqprslcrlncsffpnreke 
lchhssfssplaaakesqgekgrllsqd 
eglllwevfvedveins 


5590 


19491 


A 


5633 


2 


110 


GCSE PRS HHC I P AWVTEGDS VS KKQP P P 
QKKKCIAG 


5591 


19492 


A 


5634 


433 


0 


fffffffffflkkipppegggf 


5592 


19493 


A 


5635 


306 


433 


FS I S I ERSSSGRAQLLTP VMPAL WEAEA 
GRSRSQEIKTILAN 




1Q4Q4 


A 


JUJU 


9 

At 


*>OJ 


OSRTSVI FNSEKAERGEETTEEKFEASR 
GWLVKFKERS CIiHN I KVQS EAI S ADVBA 
AASDPEDLAKI IDEDGYTKQHI FGQALC 

4 ^^^m* mm mmm^^r mm^mm ^^^^^^^m mmm ^^^^^^ m^ ^m ^m ^m ^r^m^mm^m ^^m ^m ^p^pv ^^^^r 

LMSNPTTLGG 


5594 


19495 


A 


5637 


160 


1 


LYGGKIYWKFIRLNFFLSQGLTLSPKL 
ECTGTITAHCSLKLI3PSNPPARAA 


5595 


19496 


A 


5638 


3 


261 


KIQIiNPGSQGCSGLSLCHCS PAWATETL 
SQKKKKKNPPFGFKIKGEIEPLRBFGTK 
KKKGGFIiKKFPPHPLGFFLEPFTWGGFW 
N 


5596 


19497 


A 


5639 


2 


121 


SELRSCHCTPAWRQSKTTSHKKKKNIKK 
KNKFFLPFREN 


5597 


19498 


A 


5640 


370 


180 


VEME FCHVDQSGLELLISGDTPTSASHN 
AGITGVSHRTWQPSFFSFFKERRNLKSV 
TSLNVSP 


5598 


19499 


A 


5641 


254 


1 


FYILRKVFVFLFPAVSYGKGTYFAVDAS- 
YS AKDT Y S KPDSNGRKHMYWRVLTGVF 
TKGRAGLVTPPPKNPHNPTDLFDSVTNN 


5599 


19500 


A 


5642 


292 


106 


NFVILFYLFKQGSHYVAQAGItEPLSSSD 
PIATASQRAGITSVSHRAQTKQAETYW 

PFSLFT 


5600 


19501 


A 


5643 


3 


100 


DTFFEVEVEMEKEVCRDLMCTSPKDEEG 
FLLI 


5601 


19502 


A 


5644 


1 


282 


EKVPEAKRLYGKRGDPFYEAQENHNLIG 
VANVFLECLFCDVKLQYAVPI ISQQGEV 
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Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 

E^Glutamic Acid. F^OPhenvJfllnninp 

G=Glycine, H=Histidine, I=Isoteucine, 
K=Lvsine. L= Leu cine M— Methionine. 
N=Aspar&eine« P= Proline, 
<>=Glutamirie, R==Arginide, S=Serine, 
T^Threonine, V=Valine, 
W«Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion. \=D0ssible 
nucleotide insertion 














. STGQPWRTMVyQVFCPGLFWFSFYLSYV 
LEVDQETKAN 


5602 


19503 


A 


5645 


142 


67 


NKNRPGTVAHACYPNTLGGQGGQIT 






A 


J040 






GLKIjIiTSS dpp as asosagi tsmshrpw 
PPLASRLDLCCGTLYLI 


5604 


19505 

• 


A 


5647 


3 


180 


EHNABRAFLAQAHIjGPGAPCTPG fps F I 
PTQFPPFRSSGLPNIPVQTISRAAABKL 
FQ 


5605 


19506 

': * 


A 


5648 


366 


I 


n CWT?TT 1 VVTi , C CT.T.UTT infOOVS 7\ C »T"\T V 
K&Wr 1 1 X cVClKo JjLtnX ±£v VUKi&AAo/u./XiS 

TAAS YSEDIAKITHESGYPKQQI FNVDE 
TAFYWKMI PLSTC I AREEKS I SO FKAS K 

RALKNYASSL 


joUd 








J 




KVWFCI PPGPGGKLCFKKKKI FGLYPGG 
GGFWGPQTPPLAPTFPPGFPPPPGPNFS 

KGGTGP 


5607 


19508 


A 


5650 


348 


124 


RAPPVFFWVFLGQFPKKKSTNSPRGGGP 
PRS FQP FS LP P FFRNVFKKKPPP FFFBG 
GRGPPAPPFKNPFFFFSRL 


5608 


t nenn 


A 


5651 


194 


43 


LQPPTPTFERFSCLTLSLIAI P 


5609 


19510 


A 


5652 


27, 


244 


KKQQSTLFLFYFIiSFILFFFLVKTGSHY 
AAQAGLSLRAS SNPNTSVSL PKS WD YGH 


5610 


19511 


A 


5653 


38 


429 


GLFXiS CLFFFFLEKG VPLTPPGATGENP 

QKKFFFAPTI TGDQRAPRVFKG PGAQKE 
PFAQKKKERJGGGKKTRAAP^^KKKKPGAF 
FSLKKTPLWAGGGPPPFF 


C£l 1 
DDI 1 




.* 






Do 


KTT.VPHflnVPGV^ PCiKn^TiPKAPNPTfiOT 

PPSGEPPPLPPQGFFPCLKPKKRIjGKAP 
GPFFFFFl^Il^WFQISNTDSCVQVFI 


5612 


19513 


A 


5655 


159 


34 


GARFLFFSPVLSPLIEHCVCVCVCVCVC 
VCVCVCVCVLCACMCT 


5613 


19514 


A 


5656 


88 


388 


S IFCFI YVFFIiRQSFTLVAQAGVQWRDIj 
GPLQSPPQKKKGKETFSFLGENRYIjQPK 

nfptfltovcffeki ppnglingkrpkw 

*1 Ij £ 4 4 * V 4> 4J#>\4> 47 4> 4>^ *mt 4m§ mh mi 9 , \J 4\AV4T 4>X»F 

I PGCE FLKTLWGP WV 


5614 


19515 


A 


5657 


176 


371 


HFYFCFSDINLAAEPKVNRGKAGVKRSA 
AEMYGS VTEHPS PS PLLRSGTLL FITAL 
CPSVGIFSF 


301 J 


nolo 


a 


J ODD 




t \x 


CSEPRSRHCTPAWATERDSVS KKOOTKN 
LSYKKIHW 


JO 10 










0/ 


S SSNKP P TRP VF I L P FFGHG VPPJL.P PG K 

kppgpppgppppqveqmygrkggfalnl 
pprrqnknmlyspelgek 


5617 


19518 

• • 


A 


5660 


2 


377 


GRIS CSIWTPIJUJRSIjMLIVCGLPJ^PLS 
GFLPWVGQVFLALIPVNPQPSSLGVRTI 
MDMLR C CRDF I YG Q FWGQ FMARL WGACS 
QQYTE FFR FKI I F I SS IiLFNI VLE VLAR 
AIRQEKELNGKLQ 


5^18 


19519 


A 


5661 


256 


381 


NFNTGQAW WLMP VI LAI WEAEAGESRGQ 
EIETILANTVKPHL 


5619 


19520 


A 


5662 


19 


313 


LYSSIjAIIiGFLPLLINFIIIIIFFFKER 
DPGQQNQNFVSLKKRETPPPPFSLGVPG 
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W=TrvDtoohan. Y— Tv rosin e_ 
X^Un known, *=&op codon, /^possible 
nucleotide deletion. \=oossible 
nucleotide insertion 








• 


• 




KRSLLPPPPGPKHAJGPPVLPAGLWGGPE 
PPTWGRGAKQKREI 


5620 


19521 


A 


5663 


78 


365 


ETGRKRKETRWNLKKVCLLKMS LQGEKE 
ERKREKYRRTEMKRQKNSEKQRGREKGK 
KSKYQRRRKKEKKGENQTKPKPKVEESS 
SRQ 


5621 


10522 


A 


5664 


159 


28 


DTSTLPTPTGFKRFFCLSLPSKWDYRRP 
PICLPNFFFFLWTGY 


5622 


19523 


A 


5665 


350 


151 


I RCLCPDKFLAFCRDGVLLCCS GWSRTP 
GLKRSPCPSLPKSWDHRLCLADIYIYIP 
LKDTSTIMSF 


5623 


19524 


A 


5666 


394 


248 


LTPGGGGCGKLRLRHCT PAWATRANLPL 
KTKKKRKRFLFTLAETSKVY 


30Z*t 




A 


DOO/ 


i 
l 




P T .MOnnPftP^ T3T .P QPRf 1 T PAWA TP TVT.W 

LKKKKKGDQGFRGDHTKNIWLFPRVFLP 
LHSIJfTIiK 


5625 


19526. 


A 


5668 


2 


312 


RGONS ARG 1 1 1 IAI LKKVCVYI YRCGLT 

ASQ IARTADMHHHTRLI IASLTFCLLKK 
KKNVFCGTYNHCPPKGFYL 




j yji. / 


A 




i 
i 


74.^ 


RLNPGGGGGGE PRSSHCTPVWATRVKIiS 

LKKKKKKRPFFFFFGIRQPPQGGPFQTL 
TPLNLLKNLTTLLGAKKFFFFTGGK 






A 




i 
1 


1"?5 


ARTPS PSIiSESS EDEKPTKKHKKGKALR 
LKRRFWWLMSALPCIH 


5628 


19529 


A 


5671 


97 


2 


KKFTHVWPG WAHACS PSTLGGRGGRI T 
PTJvJ 


5629 


19530 


A 


5672 


362 


173 


VFSPPPPFKTPPPPPKKIPPPKKKKTPP 
PKKKKLFFFFFFFFFFFFFFGTRIVIDN 
IIjSGNRV 


5630 


19531 


A 


5673 


237 


13 


SPGGGGGFAPYWMFFLDFERQSCSVAQA 
GVOWHNLS S LOPS P PRFY F FSRDGVKNR 
PVPRYAPPSPKGISIiRPYK 


5631 


19532 


A 


5674 


245 


14 


ANFLKAHI VSTLGLQ X I WSLLQLLNFAI 
VFY FLRQS FALVAQAGVQ WRD I GS LQPL 
P PRFKRFACQRFS WEAEVAVS 


5632 


19533 


A 


5675 


2 


249 


SHPTHS HPQHLPLTPTWNWS STPVDFI F 
RKAP P VFPHWOHHRAG PGTS L I PTDAPO 
EASPPPPYPSSPSSTPTIPRETFLRQ 


5633 


19534 


A 


5676 


115 


241 


SSTLGGQGS I TLAQELRSCHCTPAWATG 
QDS IS KKKKTLI/3P 


5634 


19535 


A 


5677 


460 


2 


RI/JVTGRWQKLKPQERMRAWRENGKGPK 
ALP AS WEGPTVNIiliWKMSHPAWVSCHnrj 

SGFCAWCQGLGSEQSGLPSLI PSWDGLV 
LTS P PNPTGPPP PIjKTRSVLRGDDVLLP 
CDOPSNLAJRALWLIjNGSMGLSDGOGGYR 
VGVDGLLVTDAQ 








JD/O 


7 


34fi 


ENCLNSGGEGCS KlxR LCHCTPAWVTERD 
S I TKEKKKKNFKPVLEKFGE I FFLKGWK 
FFQLNPLKS SKENE FFNTTLPLEPCFNG 
NKPFWAAKI WTLNFFQTQKGS I FKWGGR 
KK 


5636 


19537 


A 


5679 


181 


4 


PKEVRTLFGRFFFFFHMESHSWQAGVQ 
WCNFSSPQPPPPGFQKFSC3jSLPISWNS 
RHM 


5637 


19538 


A 


5680 


549 


410 


FFVQTDFCHVAQAGXiQLLGSSNP PASAS 
PKYWDYRREPVCWPSSGF 


5638 


19539 


A 


5681 


III 


403 


SRQ INGFFWSRTKVAIGQFLVGPS VKI 
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nucle- 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^lutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=lsoIeudne, 
K=Lysine, L=Leucine, M=Methion!ne, 
N=Asparagine, P=ProIine, | 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W^Tryptopban, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 










• 




. FENNNEICMIKSIQSYIVLNHKHLKVYS 
SNI WQLKNI LLRGGAVAHACNPS ILGGR 
GRRITRSRDRDHP 


5639 


19540 


A 


5682 


189 


3 


NNNRKASWKKKDRVLLYCPGCNWCDLGS 
LQPLPPGFKRFSCLKLPKCWDYKHBPLH 
PAPHV 


5640 


, 19541 


A 


5683 


2 


227 


IKVCVCVRVCVCILREDSAQPARCVCVC 
VCVCVCVCVCLDPRRELSPTSQLESSCH 
PL YVPS GGSRINRPYPGGC 


5641 


19542 

m f *J • *m 


A 


5684 


318 


421 

W mm m 


DVPCFPVAFCCDCKFPRASPAMLNCESM 
KPLSFI 


5642 


19543 


A 


5685 


213 


398 


SLCCLLFCCCYFETES RS WQAGVQWHD 
HSSPQPGPPRLKSSSHLSLWRHDHSSPQ 
PGPPRIi 


5643 


19544 


A 


5686 


398 


1 


SFPTSSTYSSSVQGGGQQAHSYKSCTAP i 
TAQPHDRPLTASSSLAPGQRVQNLHAYQ 
SGRLSYDQQQQQQQQQQQQQQALQSRHH 
AQETLHYQNIiAKYQHYGQQGQGYCQPDA 
AVRTPEQYYQTFS PSSSHRE 


5644 


19545 

» 


A 


5687 

* 


2 


288 

mm W 


SDRPRSPSFSFPSSPAGVGRASSIFPIL 
SAIIiLLLGGVCVAASRVyKSKRNIILGA 
GILFVAAGERQREGATGAAHLGAHVCVC 
VCVCVRARARA 


5645 


19546 


A 


5688 


314 


2 


CRPGSQLQELGDGGGGES FVGGHGQQHP 
GAPQRGRYLQPLLMC3GASPHASPRQPLA 
ZLAAGVKFRVLRHEAGRPLRLLMQINPL 
PYSRWHTYRLPS CGCLTC I 


5646 


19547 


A 


5689 


122 


3 


MGGGEFSKKMNINGQGVVAHACNPSTLG 
GRGRQ I TRSGV 


5647 


19548 


A 


5690 


144 


1 


EAVKKYTCP TWAGAVAHACNP STLGGRG 
GWITRSRGRDNNGKHGEIPR 


5648 


19549 


A 


5691 


377 


65 

* 


LG VFPP FFFLRGEGLS P P P PGGLVKNQF 
FFPPGQKGP PGT PERGFQG FLGPKFFPP 
LP PQGP KG I F FLKKKKKKKI P I CALAVL 
TYQHFCYRTGQERVDGLREL 


5649 


19550 


A 


5692 


12 


147 


YERRFCHIGQAGLELLTSGNPPASASRG 
AE ITGVS YHALMRNIS 


5650 


19551 


A 


5693 


183 


395 


GHRFGFLHPPPPGFGGFFRPPPPKTEGK 
. RAPPPPPRFFFFLGGGGFSGGGPRGVLF 
LEHEN P P PGAPKGVG 


5651 


19552 


A 


5694 


177 


283 


THFLNRTDGRRWSLASLPSSGYGTNTPS 
STVSVSA 


5652 


19553 


A 


5695 


266 


460 


LSQRKYAFFFFFFFFGKKIFFSSPGGEI 
RAQFFLTEPSPFWVKGFSPLPLPKTWDY 
IGRDPPPLG 


5653 


19554 


A 

9 


5696 


l 


412 


MQPLKVNIiLEQKNLGLEFVAPFTWIHCP 
SYWSIAYNVYFSQYSFIWVFCTMFITVN 
IPSKKKKKKKKKKKKKKKKKKSSSPGSP 
GYAPKKAPKNKKKK 


5654 


19555 


A 


5697 


379 


0 


FFFPSQKKKKKKKKKKKK 


5655 


19556 


A 


5698 


1 


187 


PTRP PTRFRFENGVNLGGGGCGEPRLCH 
CTPAWVTERDSVS KKKKKKKKAVGGP WF 
FQKKKV 


5656 


19557 


A 


5699 


374 


0 


DFLLQK3CKKKKKKKKKKKKKKKKKKRGG 


5657 


19558 


A 


5700 


219 


413 


PFCPQGGGGGPNFVFLGPPPPGLKEIPP 
PPPRGGGKKKPPPPPRGGFFFFKKKGVS 
PGGPGGSPP 


5658 


19559 


A 


5701 


381 


249 


FHRAGQDGLDLIAPGSAPLGLPKCWDYR 
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^ oiuiaiuine, iv — Arginine^ o — oenne, 
T-Threonine, V=Valine, 
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X=Unknown, *=Stop codon, ^possible 
nucieoTioe deletion, v— possioie 
nucleotide insertion 














PR P PP P2VF I7T?T? WPP 


5659 


19560 


A 


5702 


382 


197 


LANFCIFSRDGILPCCPGWSGTPGLKQS 
AELCLPKCWDYTHKPPCPAPQVIKPSIN 
ILIVN 


5660 


19561 


A 


5703 


1 


381 


KRGSKHPPRGLKHIRGRTCFVLKGCMIQ 

GRE KKKNGNQGERVKKKKS KRKKKKRS S 
SS 


5661 


19562 


A 


5704 


291 


167 


NGPPFFFFFLYKMGPCHVGQAGLEFLTP 
GDPPaLASRLIajL 


5662 


19563 


■ 


5705 


367 

• 


23 


KNPePPP&Pbt^iJ^l^lPPPRfoWTflfe' 
VPPPLFLIKGPPFKRGKGPPPGGFPPLF 
CPLGGKRGGVFLSPRVGKPRGKGGNPLF 

tiwrrvywiiDti r> won wntj vi?d/v?vto 
PKKiUUCKJU<KRRKKliGIuCGR3UL 

TV 


5663 


19564 


A 


5706 


411 


116 


FL KR WGS HYAAQAGUSLLDS S DIiP VS AS 
C3SAGTTGVTHCSRLSYSALACQLCQPVW 
RI P WRGRGVGKETS FRGWGRR Y GRAWS S 
SGGGAESLPGLSPG 


5664 


19565 


A 


5707 


162 


2 


S x CFPNSDDFI PFQPIjPnTWFiiWPRPF F 
IFKTAHGWARWLTPy;iPALWEAEAG 


5665 


19566 


A 


5708 


1 


416 

• 


LPLLYKIPPPPAILFLFIKDSPPPPPPP 
PFFFFFFPPP 


5666 


19567 


A 


5709 . 


403 


142 

• 


LSKEKKWGWPRGFSPRIFEKPPPPKIFF 
APPKKKI FPPPPGEKKFFFKGAPPPFFF 
FFFSFFFFFFDCYNSSTWNSAWHIRCS X 
SIC 


5667 


19568 

• 


A 


5710 


339 


461 


PSTFILFIIiRRSFALVAQGGVRWRHLGS 
PQP P P PGFKRVPA 


5668 


19569 


A 


5711 


408 

i 


42 


TGARI X FSGPPKK.TYPSPPRAFIXJjSFK 
RAAPQL I YFVLNLFAYKRLVKLTTVIMR 
RRPTFFLLLFKTFLFVEPGFCHVAQAGL 

UFlAioa ICuoijJry UWUXKCbJryrtLiAKKo 

TFIKSESAV 


5669 


19570 




5712 


277 


y 


r*>'\7f~ r \.T(~ r %mr*'\T\T\TTiT3Mr i T /YTDCaDUTPT.CT Yv 
vj Vt*. V V, VoVjV V V MrrrHjLfJ 1 Vo/vlrWolJoiaA 

pplgtlcsvqwlaesihlcicqalve.pl 
rrql yqvp vrklllass i vsgfgds i wn 


5670 


19571 


A 


5713 


411 


0 


FLQILLFLIDIAFFLVFFFFFWKK 


5071 


Jy572 


A 


5714 


One 

205 


/inn 
4UU 


gavahtcspstlggrggritrsgdgdhp 
gqhdeapsl 


5672 


19573 


A 


5715 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 


5673 


19574 


A 


5716 


176 


392 


GVHLVLEWKLSMVCAEDDELFPLCSWKK 
FFFFLFLRQSRSVTQAGVQGRNFGSLQP 
LPPGLKRI SCLTRPSS 


5674 


19575 


A 


5717 


1 


89 


cr rf JUiiibKb vAyAL3VUWL.llAsfalJUPPXi 
p 


5675 


19576 


A 


5718 


306 


42 


GSGLRRCALNSVKCXiNTLECILLGIYX«R 
QIKICVYTKTCTCKFI VTLFI I AKKYEY 
P S VHQLINKMWHI Q 1 1 FLVI KRNEGOGG 
SCL 


5676 


19577 


A 


5719 


1 


263 


PTRPGRFFLILGDRGCSELKS C VCTPAW 
VTEGDS IS KQNKKQVELRS P AAHLLGGL 
VPNRPQASTRIGDPCTKKLLELISKFSK 
AAA 


5677 


19578 


A 


5720 


3 


278 


NQE I LKNVESSRT VQPHFLEFLLSLGWS 
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* 




. VDVGRHPGWTGHVSTSWS ItfCtDbGteGS 

■ 

IKYFLSH 


5678 


19579 


A 


5721 

■ 


127 


3 


KKKKKKEEEEQGEGRRRRRKKKKEKEKK 

E»lUsEJLJGSGiusKG£ 


5679 


19580 

♦ 


A ' 


5722 


204 


96 


VTSRSSLFSTPGiKRA^dLdl,PK!fcWDYR 
PQPPHLA 


5680 


19581 


A 


5723 


3 


261 


QIWQWKyNDDDDDQLFYTRVYLDPGLR 
VGKAEEWAHPAALWEWRPQAEWDLGSKG 
GCKEKOTCLGWAHACHPRIjIajGIjGRRI 
A 


5681 


19582 

i 


A 


5724 


134 


358 


PPPPPPPXLLFFVSAVPPSPSPSLSFPP 
HRPCYSPPSPTLSSLPPPLLFPPPPSSP 
PPPPRPSAPSPPPSSPFFS 


5682 


19583 


A 


5726 


248 


470 


LLNVQNG I RNVCS C PHYT P FHKWKYLS 
VFLTEVDSSQLRRQLCGGSQAAIERMIH 
FGREKVQILHIALIYFLV 


5683 


19584 


A 


5727 


1 


281 


NQHNRG F FDS CDGF FTNYNWREEHLERM 
LGQAGERRADVYLGVDVFARWNWGGRF 
DTDKGGGGFLS KGQRPSAS POTS FPHGP 
SIPVCTSKE 


5684 

■ 


19585 

*> 


A 


5728 

• 


460 

- 


3 


SQTRTLTLDRE ITIiPSSGTALISLVDGS 
GNPVSVEVQSVTDGVKVKVSRVPDGVAE 
YSVWELKLPTLRQRLFRCVS IRENDDGT 
YAI TAVQHVPEKEAI VDNGAHFDGEQSG 
TVNGVTPPAVQKLTAEVTADSGEYQVLA 
RWDTPKGGPVTV 


^ /* 

5685 


19586 


A 


5729 


70 


465 


SQCS PHHSVLCQKVQDLESGTIiQSNWKK 
ITPWDENVCTCVCVCVCPCVSMCVYVCV 
YVCLCVCVCVTVCMC^CLYIKGLLFLIiA 
KKETAGSFFHSQLTCLGVPCAFTIHPVP 
FLPQCRSHCVSGGRFWPGVL 


56S6 


19587 


A 


5730 


281 


427 


SWLTAAPNFQLQMESRSVAQAGAQWHDL 
GSLQAPPPGPQSAETQKQMIV 


5687 


4 a. Ann 

19588 


A 


5731 


418 


194 


TKFHHVGHAGLEIiVTSGDLLS S ASQSAR 
ITDMHHCSWPASVLSMNRPSGSTRCLRN 
GIFGPLKDFKPNDLTVQL 


5688 


19589 


A 

p 


5732 


229 


2 


PGGAPPLVRSG FPGWGGAPPFVFVPNGE 
PPKNFFFFFDGVSLCRPGwSAVAGSQLT 
ASSTSRVHAILLPQPGIVA 


5689 


19590 


A 


5733 


435 


138 


PPRYPPPLWGAPPHIiSRGGGSFTPLTPQ 
GKPRFFLKI KINPGGGPPPYS PP FGGGG 
PRNFFSPGAKGPLVLISPIiFFPPGDQKK 
NSFFKKKKKERBRE 


5690 


19591 


A 


5734 


113 


409 


KD VCGG VLTGLS G VLT S PEY PNNY PNS M 
E CHWV I RAAG P AHVKXj VrVD FQ VEGNE E 
CTYD YVAVLGGS S PTP VHQYCGS I SAPY . 
LFFLAKNIMSYSLPN 












12Z 


vyr vocj^i.oJ-Ai.JNovxrVjuloxlJli 
LFHLGAEEISR 


5692 


19593 


A 


5736 


14 


428 


FLALRWGHIRLPASGPRDTATIiFSTLDT 
QLLMTLYVGKDETGFYVSKALVHTGVAL 
VPRGliTIiAPADGPTTDEVTLQVSGEREG 
S PS TAVR YPSGS VALPS QWLL IGHHELP 
PVLKTTMLRVHPTLGSGTAETRPPEN 


5693 


19594 


A 


5737 


415 


1 


QHINTIiGQFWTTKAFLPRMLELQNGHIV 
CLNSVXAIiS AI PG AI D YCTS KAS AFAFM 
ESLTLGLLDCPGVS ATTVIiP FHTSTEMF 
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QGMRVRF PNLFP PLKP ETVARRTVEAVQ 

T TViTiT TTT DUTMUM T7TT VC TT DATl JV 


5694 


19595 


A 


5738 . 

• 


335. 


152 


GVEMGFHHWQAGLEFLSSSDPPASASQ 
SAGITGVSHQAS P FFIiFLSG I TLPLS I Y 


5695 

♦ 


19596 


A 


5739 


35 


462 


IDASDFGECSQAAINDNWIVYELLEEM 
VDNGFPLATESNILKELIKPPTILRSW 
NS ITGSSNAGDTLPTGQLSNI PWRRAGV . 

IfVTPM>TT?7k'VE*r , >» 2\ T?1S» TTJ74 T T TMPC DO 1*1 TO 7V 

EIQGVIDACL.KLSGMPDLSLSFMNPTLL 
DD 


5696 


19597 


TV 

A 


5740 


. 429 


325 


W J^GGKLrCbbLtKoHPC I PAW V X hyDAlJUG 

GRSGRIT 


5697 


19598 


A 


5741 


344 


423 


F WPGAVAHACNP S TLGGRGGR I TRSR 


5698 


19599 


A 


5742 


431 


103 


RGPFFPRPRQKKGGSPGGGPPQNKKNGP 
PLHTGGAKGGSFFKKKKKKKVPKKGPGlj 
FFPGPQGPRKGPVGAPPVLIPNRLGPPR 
IPPGWEPGRQIFFLAPVFFFPSKPK 


5699 


19600 


A 


5743 


31 


224 


EEMGFYHaEQADFELLSSNDPAALaFQG 
AGIASMSHCSWPGK^TFNMFCCIWKMKRIi 
AIHWECIA „ 


5700 


19601 


A 


5744 


'217 


400 


HSVMGGS PS FFFFFFFFFGEKGLFFFCP 
GGPKLGKGNPPP PGKKGFP PPPPPKGGN 
NGAPP 


m> mm f\ * 

5701 


19602 


A 


m> Mm M m* 

5745 


A 

4 


455 


DGLEF PGRRFRGQRCS S PPRRGRGQ VEA 
LLTSQTGRRGRGAPHVSDDGRRGRDAPH 
FIiDGLVAGKRRSSLP IWDGGRAETLLTF 
QTGQPGRGAPHVTDDGRFGREAFHFPDG 
VAAGQRLQSRHFGGQGRRLGGGGCIEPR 

ODUr"Pt5JitiT7i D 
oKrtL- 1 JrAWAF 


5702 


19603 


A 


5746 


381 


212 


P RGGGGGRYS P P PGGGGRGNS FYPGGGG 


5703 


19604 


A 


5747 


334 


432 


MFCENYKE IAWPGAVAHSCNPNTLGGRG 


5704 


19605 


A 


5748 


190 


1 


IPPKKGGLNFFSPPTPKKGVKSKPPQGK 
VFFFXRRS FTLVTQHGVQWRDLGS PQP P 


5705 


19606 


A 


5749 


425 


297 


ES AFHHVAQAGLELLGS CDLP VS ASQSV 
GVTGMSHHTRLYCS 


5706 


19607 


A 


5750 


425 


0 


KKKQRKKERKKERKKERKKERKKEKKRT 
KSDE 


5707 


19608 


A 


5751 


136 


3 


LLASGDIiPTSQVPRRP 


5708 


19609 


A 


5752 


3 


237 


GCSEPRSRHCTPAWVTEGDSVSKTNKQK 
lWKGIitvi»IAijr FAIHoIjLjFVVIKJjAIVIFI 
IiAMDIFPHPIVEFSYHVWEQM 


5709 


19610 


A 


5753 


1 


358 


GDRGCSELKS CMC I PAWVTEQDLPSKKK 
KAKKGGKKTKNGPQj. CGGDLiPPPPiiGGG 
KNFPKGET FWFHSGKKKRFFSLGRGEKC 
FFIPRAPKGPGKTLGGPNRGGGAKKKLK 
RPPGPVY 


5710 


19611 


A 


5754 


2 


191 


QGCSELCSYHCTPAWETRARPVSKKKKP 
KTKNQTKKNRIiGKKPPNLIFPPFQKKKK 
WVQIPGK 


5711 


19612 


A 


5755 


42 


408 


KHAAP PAS LS LSLLLHHGQKRACFP FAF 
CRBCX2FLEGSPAMLPVQPAKLTPRSTPP 
H PCS LEND SAHFFN 


5712 . 


19613 


A 


5756 


179 


410 


LQAFLRPSPPVLFLLYADLSNQNLWPGM 
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VARACNPTTLGGRGGRIT 


5713 


19614 


A 


5757 


166 


2 


GRPGPADFRVRPQLLQRFLFIYLFTEME 
SCSVTQAGVQWCNLGSLQPLPPGLPD 


5714 


19615 


A 


5758 


334 


3 


EEKVFGLPSP ILLSKGQGGAASGPPCPI 
SPRAMG PQKKGWETMVF ITKAPPS FGFV 
LVKQS PGTG PVPRNFFFFLRRS LALSPR 
AGVQWRDLGSMQAPPPGFTPFSCLNL 


5715 


19616 


A 


5759 


373 


470 


SSCWSLRNSSPGAVAHACNPNTLGGRGG . 
RITR 


5716 


19617 

1 


A 


5760 


210 


384 


EALDTTIFFFFFFETKFCFVPQAGGQGQ 
NLGSLGAPPTGIiTPFSCLTIjRKTWDCGP 
PP 


5717 


19618 


A 


5761 


257 


400 


SLGSEYTWVCFCVFETESRSVGQVGVQW 
CNLSSLQPPPPGFKRFSCLR 


5718 


19619 

UUI7 


A 


5762 


400 


2 


aravgsvgedpglgtgpsparetwewav 

VCRACRGSQLLWGKLFILRHWDLRKVLA 

rgngtedclkqlws qcgagmgrgp fqgl 

KGKPVGCLCSSREVFYLFLYLFRDRVSL 
CHPGWNAVKQTRLTAASASQ 


5719 


19620 

* 


A 


5763 


34 

• 


440 


RVRAPLGWEEGNYF VLRAKPRSGKRS CL 
PFLRCGHQEPLLWG ISTASPLRHLVGAS 
QI CX3VLFLCFQVSLKPAGS WKVVIGNNG 
ERKAQRLCPLRATRVEGIFCFRFVFLRW 
SFTLVAQAGVQWRDLGSLQPPPL 


5720 


19621 


A 


5764 


297 


468 


LVGAGVPTGCGQRNGWNSHKSVQAQWL 
MP VI PALWEAEAGGSRSQE I ETILANMV 
T 


5721 


19622 


A 


5765 


1 


205 


GFKLTDS FASGAKAI LRSQLLGRLRQEN 
RWNLGGGGOIGLKWHNCP P AWAIEGNLL 
FKKKKKKGGGTF 


5722 


19623 


A 


5766 


307 


I 


KERDPPPTPFGKTFFFFFFKRNFCFFPP 
KEGQGG I LGPRHP F FSRLS FLGRGVPRG 
FP PGPRNFFFFWI L VKMGVPHVGQADFK 
LLTSGDLNKTASQNAGIK 


5723 


19624 


A 


5767 


2 


319 

■ 


LLGNGEELSRFPFQRSSQAWVFESSAGI 
FGGKAGATGLE KLTGSFQQLTGHPDPRT 
PEEXjGDPE KS SAS E KTRGLQEANGVP E I 
LKAVT YTQAW P KEAKVD P I TP 


5724 


19625 


A 


5768 


787 

9 W f 


1118 


EAAWRELEAERAQLQSQLQREQEELLAR 
RKAEKEQLSEBIAALQQEHDEGLLLAES 
EKQQVREPWRGLCCSLSCSSSGAGPCSA 
TWQLGALG QATAL LG AS VS LPAGGED 


5725 


19626 


A 


5769 


2 


452 


NGAGTMSV5LEADENPFAQGALKSEDCF 
ILDHGKDGKI FVWKGKQANTEERKAALK 
TASDFITKMDYPKQTQVSVLPEGGETPL 
FKQFFKNWRDPDQTDGLGLTYLSSHIAN 
VERVP FDGGNLDT FTAMAAQHGMDDDGT 
GQKQIWRIEG 


5726 


19627 


A 


5770 


2 


130 


GTQGKIVDLVKELDRDTVFALVNYIFFK 
GKWERPFEVKDWKS 


5727 

■ 


19628 


A 


5771 


422 


8 


HIARWGARAPQGGWALLKAELCLNPAPP 
PIGARTAPLWPGCCFSIiRPVFLLIiFPFF 
SLFIQSFRILYPQPWPLQGRRSTTLSRM 
GTTDHVI VLAS TNRAD ILDGALMRPGRL 
DRHVFIDNPTLQERRKCMRRLGASRN 


5728 


19629 




5772 


432 


3 


YSLNLKKRWFVLTHNSVHYYKSSEKSAL 
KDGTWINSLCS WPPDEKI FKETEYWN 
VAVYGRKHCYRL YTKLLSEAS RWS SAI Q 
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NVTDTKAPIDTPTQQLIQDLKENCMTSH 
VVEQIYKRKKILRYTHHPCIAAAVEDHD 


5729 


19630 

■ * 


A . 


5773 


20 


432 


SRAAALLEAVTETLFYYEVAEKIWSNRA 

GQHRALGSGISKVQSLKLDTSVWSNEIV 
QLFIVLGNDRANRFWAGTLPPGEGLHPD 
ATPGPRGEFISRKYRLGLFRKPHPQ 


57J0 

■ 


1 (SHI 1 

lVo31 




5774 

i 


All 


*> 


urns T^CiT 1 ! swcWiT *ni± p\tt ,c;c;\7nT jTl\n TYZC 

CFKLRTVHNIPVPSNKVDDECPGLYGFL 

HVIVHFAKGFKQSATliSCALEVDSFGYF 

VSKAKTRVFRDTAEPKWDEEFB I ELBGS 

PSLRIIjWYEEYNNKSKVKNLYSAAVGFS 
awn 


5731 


19632 


A 


5775 


133 


436 


MLISLPHPSTHLTSLPFCADFGTRQTTQ 
AASHVLTLSAHS S VCS CPPLMGEATTS R 
AQDLPADEHTAFAHELEAPALSKGQGLS 

T T ,T3 JV T ,TY3 WT . P T /2m?VKrT . 
JL lj£\*\U l\j."ljrij\JV£i V IMJLl 


5732 


19633 


A 


5776 


397 


58 


KGTI S IGKLEEKLRGAARQALADAI IEF 
QLLPASLCTEDTPTGMQVKRSLGSRECH 
RSKQDSVAGERAKVGFRGSTSENYYIRT 
TYSQLSVSLSGHHPG^TLAAVNGCSCSP 


5733 


19634 


A 


5777 


1 


324 


MRRCV S AGGVELEE YYPAFLDMVRSMLE 
GSIDPTQYEDTLREMFTIHAYVGFTMDK 
LVQNIARQ VSRAGVG PALRTAEMALP S F 

H«!PT.RPr;PW<;TAWT .QPPPTiT FTK" 
norijOAur v v o j./xmijoi.c ruiLi in. 


5734 


19635 


A 


5778 


1 


398 


RLGSRPSLHDQSPLELRSEIQELKGDVK 
KTVKLFQTBPLCAI QDAEGAIHEVKAAC 
PJ3EIQSNAVRSARWLFKTRSLDVTNQDP 
SQVRL I RGISLEEGARPDVSATRWIFDT 


5735 


19636 


A 


5779 


281 


12 


VACNLGWRVRTPTPASYPPCVPQIiSVRG 
KNRKELLGGFLRNIVKSADEALITGMSG 
LKEVDDFFEHERTNLLEYHTLYCGRSED 
HATSR 


j /JO 

• 

1 






D7oU 


• 




IVKCSPLEHVLKYLDPPPPPTRCSRPAS 
LRSSPKHCVQQQFPIACRPGAAAPPPPP 
AQCPNPGPAAASARHEPKWGSRTEGGPA 
RARRREGTGRRKKRR 


5737 


19638 




5781 


415 


18 


TSQAGDDLLHGSLliRRGPEMGYLPGYDP 
D PTLAT P P AGQTLAVP S LPRATE PGTGP 
LTTAVT PNGVRG AG P SAPE LLT P P PGTT 
APPPPSPASPGPPLGPEGGEEEPPPTII 
TTTTVNTTVTc? PM YOGRSEG 


5738 


19639 




5782 


168 


432 


GMRRQTGLDQQSRGVGKNWRVDFFFPSS 
LSFFFFFFGKESYFVLKFGIHGLKFGYL 
E PLS PGLKE FSRLS PP KGGHLKCRPPS P 
LFFF 


5739 


19640 


A 


5783 


1 


423 


TDDDLNWLDHS RT FREQGVDENETLLLR 
RKFFYSDQNVDSRDPVQLNLLYVQARDD 
ILNGSHPVSFEKACEFGGFQAQ IQFGPH 
VEHKHKPGFLDLKEFLPKEYIKQRGAEK 
RI FQEHKNCEKI SERSAKV 


5740 


19641 


A 


5784 


273 


400 


DSVVLLMLSPGVCPHVQVWPISCTIiGAT 
FGYVAGLVTSPLWI . 


5741 


19642 


A 


5785 


445 


28 


DTNRAPNHMEIKGSGGTEAKVNGADS ID 
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. KEAAGDETMEARTME AEATGAkP*3^3VBA 
TGAKVTETKFtGABVREMESTEEEANME 
SKPTGAQATDTBTTGVEAMOVEATKTKA 
EEAEMQAYGVGAGQAE PPVTGTVLRPL 


5742 


19643 


A 


5786 * 


380 

* 


2 


QRQQESPEASSLHILERQVQELQQLLVE 
SQEKKESLGREVESLQSRLSLLENERGN 
TSYDVTTLQDEEGELPDLPGAEVLLSRQ 
LSPSAOEHLASLOEOVAVLTRONORTjMF 

KVQIMENFEKDBCI . . 


5743 

• 


19644 


A . 


5787 


663 


2 


FGVERJRATRT I RLTRTVLDLYS FLAGVS 
ENLRHATQDDAS RTRAPGLS S QH PKPDT 
TVSGDTETGQS PGVFNTKESGMKDI KSE 

LRQIiQEAAQGPTVTMTTNELYQAYVDSK 
IDALREELMEGMDRiCLADLKNSCSYKLT 
GLQQQCDDYGSSYIiGVIELIGEKETSLR 


5744 


19645 

■ 


A 


5788 

• 


398 


1 

m 


TRCYHSALRYGGSFAELGPRIGRLQQAQ 
TjKMPHTGSCTJHTJAIfVT.PPTiROVWMT.T.WT. 

EYKRNYGAKRGGPPVKRAAEPPWQPVP 
PAALSGP SGEEGLS PGGKRRRGCNSEQT 
GLPPGLPLPPPPLPPPPPPV 

W^rf^ *■ KJAJAw- dmJMm 4m 4m *¥ 4m* 4m- mm mm mm 4m mm W 


5745 


19646 

- 


A 


5789 


116 


371 


RHPKIWGLRGPPGLTLEKRGKGEEPLDP 
RS WGFKEAETP PRP FTRGKKGTPRLKKX 
PKERKTRERGGKKEGKKKGKKNEGKTNP 

Ii. 


5746 


19647 


A 


5790 


406 


284 


RRGFPHVDQAGLELPTSGDP PTLASQS K 
PGPIiCPALLHTP 


5747 


19648 


A 


5791 


200 


3 


GDALIYMEPEKQVMSRSSDECVVAIjCDQ 
WLVS YSKKI PLVNLIiTFDLKKNIS ICPL 
PGTWIMEMY 


5748 


19649 


A 


5792 


372 


278 


LQIQSPAGQSRCGGFLVREDFVLTAAHC 
WGR 


5749 


19650 


A' 


5793 

+m* r ^ m/ 


1 

A 


167 


VAPPKLDPHSGRVRAQAGLBLLTIiCDPP 
ASSSQTAAI TGASHHTRQVLGSYKALS 


5750 


19651 


A 


5795 


112 


3 


SSWGKVGAHAGEYGAEALERMFLSFPTT 
KTYFPHCI 


5751 


19652 


A 


5797 


356 


238 


FLFCNNGWYSLEHFRCIjAENAGDDAPVK 
DVTVLQNTDGR 


5752 


19653 


A 


5798 


379 


60 

• 


vkirhchtwsmsllknirlileeeqve 
gnysfcvldnqnlqqlwdwdhrsltika 

GKMYFAFNPRLCVS E I YRME EVTGTIQQR 
QSKGDINTRSNGERASCE 


5753 


19654 


A " 


5799 

4 — 1 


377 

ml t r 


3 


KVAFPGEGROGESWSAFOGHTRWSDLK 
WAVYEPDLMVTSS VGTYIYIWDI KDTRK 
LTVAL FWAGAS Q VKWNKKNANCLATSH 
DGDVRIWDKRKPSTAVEYIAAHLSKIHG 
LDWHPDSEHNLY 


5754 


19655 


A 


5800 


143 


316 


CPTSPPRLPYTEFFGGVSGLTVEQFRki 
NGFPNAFWGWGGEDDDLWNRYLCRSFGV 
F 


5755 


19656 


A 


5801 


390 


2 


LFFFFFFFLRQSLTLVPQAGVRWRDLSS 
LQPPPPGFKRFSCLCV 


5756 


19657 


A 


5802 


342 


465 


VI FFFKLKQGLTLS PRLECSGTIITHC3G 
LDLLGS S E S PTFT 


5757 


19658 


A 


5803 


1 


378 


VYSSEFENI KEEYRGRGFPTI CYFEKQR 
FLIQYDNYGSTAEDIVEWLKNPQPPQPQ 
VPETPWADEGGSVYHLTDEDFDQFVKEH 
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NO: of 
nucleotide 

W 9 ■ *m< ^M ^mw ^» 

sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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SEQID 
NO: in 
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26 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- ; 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutaraic Acid. ^Phenylalanine. 

^m^ ^mm* m ^™ — -■ — ™ * ^» ■ ■ ^™ ■ m ^m 1 m » _ m m MM> mmmmm m m> 9-M ^rjM 

G=Glycine, ENHistidine, I=Isoleucine, 
K-Lyslne, L=Leucine, M=Methiontne, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=*Threonine, V=Valine, 
W«Try ptoph a n, Y=Tyrosine, 
X=Un known, *=StoD codon. /Possible 

— mm ^m* mm mm^mm ¥ w MM M mmr m*^m r ir Mmr^^mjmi ^m- mm m w m*r WW! Mr m^m 

nucleotide deletion, \=possible . 
nucleotide insertion 














SSVLVMFHAPWCGHCKKMNPEFEKASEA 
LHGEADSSGVLAAV 


5758 

m* 9 v 


19659 


A 


5804 

m* w V » 


152 

* W mm* 


1 


KPE I FFFESRSVTQTGVQWCNFGSLQPP 
PSGFKLFSCLSLPSSWDYTAHV 


5759 

w m* m* 


19660 


A 


5805 


207 


31 


LYLQLLQQTASSGNLNTLSSLHPMGGKC 
FTAAEEQQHPGQQTQLTTVLGVWHRFCE 
NIF 


5760 


19661 


A 


5806 


199 


2 


GSECSCIiLSRGGEMHVSLSSFKVTFSCD 
AIIHFSFLFFWRQEFP3VQAGVQWRDLG 
SLQPPPPGV 

^ Mm ^m M> Mm* w 


5761 


19662 


A 


5807 


385 


239 


QDQPGLQKMCVCVCVCVCVCVCVCVIIC 
KSCIRLS YMKMPINFGLKNL 


5762 


19663 


A 


5808 


39 


184 


GRXIIKFFGEMHSHHAAQTGLEIjLGSSD 
PPTWASQSAAITGVSLRIRP 


5763 


19664 


A 


5809 


376 


2 

* 


waeptsfhvcitsytqffrgltaftrvr 
wkclvidemqrvkgmtbrhweavftlqs 

isrpylssplrapseesqdyyhkwirl 

HRVTQPFIIiRSV 


5764 


19665 


A 


5810 


271 


120 


flcfigktsergsfslysrdtglpgcqv 


5765 


19666 


A 


5811 


367 

* 


3 


dksfkewhkmledanapteerderiagl 
yeeierdlmllgasavedklqegvietv 

TSLSLANI KI WVLTGDKDETAIKIGYAC 

nmltddmndvfviagnnavevreelrka 
kqnlfgrci 


5766 


19667 


A 


5812 


294 


37 


FRGENPDDGVRGS P PED YRLRQ VAS SL F 
RGEHHS RGGTGRLAS LFSSLEPO 10 PVY 
VPVPKVSHWAFFPNSSLEPLTYKAIFLP 
TA 


5767 


19668 


A 


5813 


184 


3 


I IS YLKNYLLFSPLQRQNFVLLPRLECS 
GT I TAHCNLKL VGS GDP PAPAS 0 VARTT 

VMM * Mm MM M Mmm+^mMmim mmmfm^MmmmM ™ ^m**MM^m*mm* M* mm * M+m * MMmW ■ ^ « A 

GMRH 


5768 


19669 


A 


5814 


468 


1 


DDMEYEGSVSVFTQVPRKSVNKGLDYTL 
MVAGEFGLGKSTLVWSLFVSDLYRDRKL 
LGAEERIMQTVEISKHAVDIESKGVRLR 
LTIVDTPGFGDAVI STECGKRRAMY 


5769 

* 


170/V 


A 

mm 






314 


OAPFGPKCFNVNLCFSLSLSKGE PEFHY 
IAGAHGNEVLGRELLIiLLVQFVCQEYLA 
RNARIVHLVEETRIHVLPSLNPDGYEKA 
YEGVMALAHLTQ 


5770 


19671 


A 


5816 


474 


3 


T TLRKG Y SRH KGLLHS I RQEVQAT P LEG 
I IVSGQMSTMNLSSTLWPSPKRVRRLCI 
GRTLRSLRIRPCKRTPLQTRRMPQKMRA 
LYGDPGDSPPLRSRAIPASSPGSTGSFR 
ARPPHPIiHPPPHLTREGEPPLLPVIiQPR 
RGPPLPLHSQASRPIV 


5771 


19672 

* 


A 


5817 


511 


389 


GM YCTEGREWPTFRNE I E XEEDHCGRV 
SASIILTHCSNR 


5772 


19673 


A 


5818 


510 


2 


PLSCEVPNPIjFPVPGVRMIQHHIEACAQ 
VRGMQEIVLIGFYQPDEPLTQFIEAAQQ 
EFILPVRYLQEFAPLGTGGGLYHFRDQI 
IAGSPEAFFVLNADVCSGFPLSAMLEAH 
RRQRHP FTiT J iGTTANRTQSLNYGC IVEN 
PQTHEVLHYVEKPSCIAAAIEDQASSRQ 
R ^ 


5773 


19674 


A 


5819 


282 


152 


FSLFLFLLQSERHFIMQWCEATQCPDT 
RVSVRLYEKSVLHLS 
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SEQD) s 
NO: of 
nucleotide 
sequence 1 

• 


SEQJff) 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ1D 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 

. correspon 

-ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Giycine, EHHistidine, I=Isoleucine, 
K=»Lysine, L=Leucine, M=Methtoninc, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, $=Serlne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop cod on, /^possible 
nucleotide deletion, Impossible 
nucleotide insertion 


5774 


i 19675 


A 


5820 


347 


1 


EPHPVTPPAVGTSRYNPLIMMDRNRSAV 
TPPSHPPQQPSSMQTGMNPSAMQGPSPP 
PPPPSYMHI PRYSTNPITVTVSQNLPSG 
QTVPRALQILPQIPSNLYGSPGSIYIRQ 
TKV 


5775 


19676 


A 


5821 


482 


13 


I YEKL I TGC YN I LANHADPNS GXiDBS I L 
EECLQYLEKQLESSQARKAMEEFFSDSG 
ELVQIMMATANENIiSAKFCNRVLKFFTK 
LFOLTEKS PNPSLLHLCGSLAOLACVEP 
VRLQAWLTRMTTS 


5776 

■ 


19677 


A 


5822 

• 


608 


7 


FRFAE KWEGMF 1 1 VN3 IT I KIHSKAFH 
AS FELWQLQGYS VNPNWQQSDLRLTRI T 
DPCRGBVLTFKE I TWOTLR I EADATDNG 
DQDPVTTPLRLITNQGRIQIALKRRTKD 
CNVISSKMFIjLDDLLWVLTDSQLKAMM 
KYAESLS EAMEKS AHQRKSLAPEP VOI T 
PPAPSAQQSWAQAFGGSQGNSNSSSSPV 
LRPL 


5777 


19678 


A 


5823 


328 


3 


ASVASEDCSLPCFIiAVWNRI IEPVAAMR 
KEADMLRLFPEYLKGEELFGLTVHAVLR 
IAESLPGVESCONYLFRYGRHPLMEIjPL 
MlNPSGCARSEHKILTHYKRPSLY 


J / /o 


170/7 




5 894. 






TSVPSCVRCRYI I LRTS S ALTN I LGTHS 
NSSFFHASSSAIiHSCCFPFFSWLQTLDI 
NVKAPALMTKAWPEMEKRGYRE 


5779 


19680 


A 


5825 


424 : 


2 


LQRAFSES IRKLRGYQGADRKQI YHRRC 
SFANHSVRPSADEKCNSNFFEQRHGGSH 
QSSKWTPVGPAPSTSQSQKRSSGLQSGH 
SSQRTSAGSSSGTNSSGORHDRESYNNS 
GSSSRKKGQHGSEHSKSRSSS PGKPQTV 


5780 


19681 


A 


5826 


107 


2 


SSLTAGVRMGVPAQSTQGTVNGSSPQMS 
GTAALTS 


J/Ol 


17UO& 


A 

** 


JOi/ 




417 


FPTEDSRTS KESMSEADRAOKMDGESEE 
EQES VDTGEE EEGGDES DLS S ES S I KKK 
FLKRKGMTDS PWI KPARKRRRRS RKKPS 
GALGSESDKSSAGSAEHIGPCDSTGDME 
VSSGLPGSRRPENPVFLCIWVAVRAICP . 
GFLWS 


5782 


19683 


A 


5828 


487 


22 


MPEP VFPLSHFRQF IAAI KLQFQARLSR 
CVRDLVRSLAAPNHDTLRMLFQHLCRVI 
EHGEQNRMSVQSVAIVFGPTLLRPEVEE 
TSMPMTMVFQNRWELILQQCADIFPPH 


5783 

• 


19684 


A 

• 


5829 

• 


2 

■ 


871 

• 


RG I CSQR WRREGSQSRG PGLVI TS PSGS 
LVTTAS S AOTFPISAPM I VSAL P PGSOA 
LQWPDLSKKVASTLTEEGGGGGGGGGS 
VAPKPPRGRKKKRMLE SGLPEMNDP YVL 
SPEDDDDHQKDGKTYRCRMCSLTFYSKS 
EMQIHSKSHTETKPHKCPHCS KTFANSS 
YLAQHIRIHSGAKPYS CNFCEKSFRQLS 
HLQQHTRIHSKMHTETIKPHKCPHCSKT 
FANTSYLAQHIiRIHSGAKPYNCSYCQKA 
FRQLSHLQQHTRIHTGDRPYKCAHPGCE 
KAFTQLSNLQ 


5784 


19685 


A 


5830 


463 


1 

* 

* * i 


LPESRLPSPHKREEGSRARVIMTSYPIE 
PHERRKGSLADWATLKQKKLEEMTRTE 
QEDSSCMEKLLSKDWKEKMERLNTSELL 
GEIKGTPESLAEKERQLSTMITQLISLR 
EQLLAAHDEQKKLAASQ IEKQRQQMDLA 
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M 

eth 

od 

— ' 


SEQID 
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location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A- Ala nine 
OCysteine, D=Aspartic Acid, 
E^Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
K=Lysine, Lr=Leucine, M= Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R^Arginine, S^Serine, 
•^Threonine, V=Valine, 
W^Tryptophan, Y^Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RQQQEQIARHQQLY 


5785 

• 


19686 

• 


A 


5831 

• 


462 


2 


MILFDSSLYFLCGWYLSNLILGTYGVRK 
PWYFPFTASYWKSVGFLVBKRQYFLSSS 
LFFFNENFDNKGSSLQNREGELEGSAPG 
VTIiVSVTKEYEGHKAVVQDLSLTFYRDQ 
ITALLGTNGAGKTTIISMLTGLHPPTSG 
TIIINGKNLQTVY 


5786 

• 

* 


19687 

- 


A 


5832 


396 


2 


VP S TPAS KRKG IVPRCDI FI YBRGGALP 
KD I HVAG I LFASAWMNVWPLLS AX E FHS 
AWAMGMGVNFFVASTHHVSLNMSGSGRY 
APNGRKVYH YDMKTEMGELLLS BVDSHP 
LSSLAYPTADNWNAYATLY 


5787 


19688 


A 


5833 


384 


137 


EEICLNPGAGGCSEPRLWHCTPAWATKR 
D S S KKKE KNCLS KNKKEKE KEM F ACVCV 
CVLFS I YAFPSKAQELSNPCKHYGNF 


5788 


19689 


A 


5834 


171 


2 


FVEGINISGNFYRNKIiKYLAFLRKRMNTT 
NPSRGPYHFRAPSRI FWRTVRGMLPHMY 


5789 


19690 


A 


5835 


278 


404 


S VL I KFEKKTLKYKAI W PGAVAHACNP S 
TLGGRSRGITRSGE 


5790 


19691 


A 


5836 


417 


3 


SKGKEGR CE WQ YDLPS GTHAVFQFFLKQ 
QGRVPGLTQAVQ VAKMSL IDLAGSERAS 
STHAKGERLREGANINRSLLALINVLNA 
LADAKGRKTHVP YRDS KLTRLLKDS LGG 
NCRTVM XAAIS PSSMTNEDTYSTHV 


5791 


19692 


A 


5837 


409 


230 


AGRAYCYNGMCLTYQEQCQQIiWGPGKAL 
LGSTALPSPPIiPSQPPIALRWWNLGFK 
WHL 


5792 


19693 


A 


5838 


3 


409 


LRSVPCKDYLTQNHYITSPLSEEEAAFP 
IAYVMVIHKDFDTFERLFRAIYMPQNVY 
CVHVDE KAPAE YKESVRQLLS CFQNAF I 
ASKTESWYAGISRLQADLNCLKDLVAS 
EVPWKYVI NTCGQDFPLTTTRPV 


5793 


19694 


A 


5839 


385 


174 


GLAVEIGSRRIAEDGLELLASSDPPTSA 
SQSAGSTGVSHHAWPDTVSSVYWCFLCC 
TKQVWGALSIVSL 


5794 


19695 


A 


5840 


260 


2 


PIiPR YTAACSRLLVQ YKAAFRQVQGSE I 
SSIDEFCRKFRLDCPLAMERIKBDRPIT 
I KDD KGNLNRC IAD WS FY I TVMD KLRQ 
CI 


5795 . 


19696 


A 


5842 


299 


3 


FSGIKYICIIFTiFVCFFIiYLLGIGSPYV" 
AQAGLXIiLGSSDPPPSASQSPGIIGVSH 
YVQPIHTVAQLSPSSIFRTFSSFQTETV 
PIKHTSHSPSPSLY 


5796 

- 


19697 


A 


5843 


391 


2 


APHAPAFLRARGEPQDPLSHPRVPAVSA 
NCRMWKHIiPVHSSPTPRLTPLWKLQARW 
LLPQLVYLQ GWGS YSLLRPAAL1 SMVLL 
AREFLYPAKMSVSEVCSSGLSSPLLEQH 
KTNIi I FYASGD ICSANV 


5797 


19698 


A 

■ 


5844 


479 


59 


F VGME PG VGHAS LAMHGLH IR3LGKI KL 
RKRKCTLFFNTQEKSARRRGHLLGENIY 
LLLFAI ALR ILNCLLVQTSFVPDEYWQS 
LEVS KHMVFNYG YLP WE WTERIiRS YAYP 
LICASIYKVLHLLGKDSVQMLVSGNKSS 


5798 


19699 


A 


5845 


406 


548 


SVQTKPFFETESHPVTQAGVQWHNLCSL 
QPLPCRSKQFLCLRLPSSW 


5799 


19700 


A 


5846 


2 


394 


TLCHRAKDPTVHHESGRIMYLNGLCFLM j 
GPAQLTQRLSVSRQGEC 


5800 


19701 


A 


5847 


472 


201 


LSILSDTSLCLGRFFTYENGCAYFHEEE 
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Amino acid sequence (A* 3 Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamlc Acid, F=Phenylalanine, 
G=Glycine, H-Histldine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R«=Arginine, S^Serine, 
T=Threonine, V«Valine, 
W=*Tryptophan, Y^Tyrosine, 
X=l/n known, *«Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














REGLARI CRLS IHSRY EDF WDGFNVL Y 
NKKPVIYRSAPARPQLQQSIiCNQVMWYQ 
VAILNK 


SS01 


19702 


A 


5848 


127 


3 


SSNVKAAWGKVGAHAGBYGAEAIiERMFL 
SPPTTKTYPPRCI 


5802 


19703 


A 


5849 


238 


1 


MAETHPNPTGLPLAPALPCLAGSATHPC 
LPS PE 1 3LDVBADRDG WEKNNPKKVPS 
FQGRHHPEPRASEDTAPDPAGMY 


5803 


19704 


A 


5850 

- 

i 


423 


2 


LS PLGNWSSANQKWFDSVKPSRKDLFA 
VDTQVGTVPSLTAGEQGFGDGGKGRAGE 
QATQAAGDWS SNMWGSDG I LSHSAPLTV 
PQPLLTGPQGPCLCRPLPNQQTSFPALL 
PFSTQLCSMCPGPASARPPPLLLKPTMy 


5804 


19705 

• 


A 


5851 


326 


682 


PSYSLLSLIiQVKNEVEKLPRQQRKESMK 
QKMEEHTQKKQLLVS P WPQLG S AGAGAR 
PGMVTRPPLCLCPIPAPQIPRSEYLNLA 
SSFMAPLSPPPCSLHRLSCLHLEVRNS I 
SGYPNGI 


5805 


19706 


A 


5852 


263 


30 


HBKTDDERGPGQSARSGAITKPPGPPLP 
IEPHETTPEHPAPSGTIPEPPLPVELHE 
TTPQHPVPSGTIPEPPYLLSQ 


5806 


19707 

* 


A 


5853 


389 

• 


2 


GLPTQREKFGASMRTRMTIQS IAWTSF 
VCLVGEGNNVQG FRAESRCWRYDPRHNR 
WFQIQSLQQEHADLSVCWGRYIYAVAG 
RG YHND LNAVE R YD PATNS WAYVAPLKR 
RCMPTKAKRWRGRCISP 


5807 


19708 


A 


5854 


379 


2 


GRSLRYSGSCSGEENSTNNSAGQSRAVI 
AAAARRRGNSHNEYYYEEAEHERRVRKR 
RARLVVAVEEAFTHI KRLQEEEQKNPRE 
VMDPREAAQAIFASMARAMQKYLRAAKQ 
QNYNTMES ILQHC 


5808 


19709 


A 


5855 


374 


2 


SDAGAPVNIYEFRHRPQCLEDTKPAFVK 
ADHADEVRFVFGGAFLKGD IVMFEGATE 
EEKLIiSRKMMKYWATFARAGNPNGNDLS 
LWPAYNLTEQYLQLDLNMSLGQRLKEPR 
VEIWN5TIPPCI. 


5809 


19710 


A 


5856 


516 


18 


PYECKECGKAFNCGSSLVQHERIHTGEK 
PYBCKECGKAFSRGCHRTQHQKIHRGET 
PI KCKE CGKAFS WGS SLVKHERVHTWBK 
S YECKD CGKAFG SG YQLS VHQRFHTGEK 
L YQRKE FGKTFTHGS KLVHERTHSNDKP 
YKNKECGEAFLWTTYSNEKCIASAK 


5810 


19711 


A 


5857 


392 


3 


CERGMGGS PRALGRHWTS FLKLRLNCS V 
PGJDSTFYPDVLQALTGPVNLHGRSALPG 
VFTTQTN'S I PGSAVCAFYIiDE I ERGFEG 
KFKEQRS LDGAWTP VSEDRVPS PRPGSC 
AGVGGAAKFTS SRDLLY 


5811 


19712 


A 


5858 


391 


3 ' 

• 


ARRTTGM VPKAGGG KGRRGAVFRS Y IRE 
IEELRSKLVESEAMNESLRRSLSRASDR 
S P YSLGAS PAAPAFGGS PAS S MEDASEV 
IRRAKQDLERLKKKEVRQRRKSPEKEAF 
KKRAKLQQENSEETNGE 


5812 


19713 


A 


5859 


409 


3 


SLPGEGDPWWAGQEELbFVQEGKLSLPV 
AERVLIiRIACRYDPRSNSWAEIAPMKNC 
REHFVLGAMEEYLYAVGGRNELRQVLPT 
VERYCPKKNKWTFVQSFDRSLSCHAGYV 
ADGLLWI SGGVTNTAQYQNRQCI 


5813 


19714 


A 


5860 


405 


135 


NLSRLSLCRQPMTLVTWVLLFYDSEEKA 
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FWLLVALWKRMLPDYYNTRWGECPRVL 
PRLGLQVPAGSMRVAALPRPSSRLHVGS 
LGVRG 


5814 


19715 


A 


5861 


389 


2 


YYSELEEQLTDEFNAELNRVPLKRLDLI 
FVTFQDS RMAKRVRKD YKYVQCGVQPQQ 
S S VTT I VKS YYWRVTMAPHPKD I IWKHL 
SVRRFFWWARF I AINTFLFFLFFPLTTP 
AI IMNTIDMYNVTRPCI 


5815 


19716 


A 


5862 


525 


0 


SHLPESERIHHTVGQKEQVLMDTSkTRP 
NNDVPE PPMP IADQVSNDDRPEGS VEDE 

• ~ m\^ mmr v mmm mtkw mm • m mm m mWm\ UF V mmr mt v mmW mmWmti^mm mmm ^mw^mW W mmwltr mmm 

EKKESSLPKSFKRKISVVSTKGVPAGNS 
DTEGGQ PGR KRR WG AS TATTQ KKPS I S I 
TTESLKSLI PD I KPLAGQEAWDLHADD 
SRISEDETESKGDDGTHDKGLTIC 


5816 


I97I7 


A 


5863 


399 


2 


ATRRNRNRVPSGMTRTNVRBMIAAVGPG 
PSPYPLPPPPEGTSSIEYSNQGNTCQGH 
GNFDFPHGNPGGTSMNDFMHGPPQLSHP 
PDMPNNMAALEKPI SHPMQBTMPHAGSS 
DOPHPSMOODKARNTPOPMY 


5817 

• 


19718 


A 


5864 


3 


714 


RRP F I ALCLSNVAFMLP WQ FAQ F I LFTQ 
IASLFPMYWGY IEPSKFQKI I YMNMIS 
VTLSFIIiMFGNSMYLSSYYSSSLLMTWA 
I I LKRNE I Q KLGVS KLNCWL I QGS AWWC 
GTI I LKFLT3KILGVSDHI CLSDLIAAG 
I LR YTDFDTLKYTCS PEFDFWEKATLL I 

*** Jki JIV mm. m% mm* m* mm* m\. Mrt A * V»f|#* U4k imWrn 4 

YTKTLLLPWMVITCFIFKKTVGDISRV 
LATNVYLRKQLLBHSELAFHTLQIaLAFT 
ALAIIiILRLKLVL 


5818 


19719 


A 


5865 


423 


1 


APPVSTAVAQSNSSEEEAREVGS PAQEF 
KYQKSLPPRFQRQQQQQQQEQLYKMQHW 
OPVYPPPSHPQRTFYPHHPOMLGFDPRW 
MMMPSYMDPRITPTRTPVDFYPSALHPS 
GliMKPMMPQESLNGTGCRSEDQNCVPPC 
I 


5819 


19720 


A 


5866 


497 


2 


AVGAGQKGGRCGGGRELMPPQK I WGGGA 
FTNNAHVGPLKIMPRLIRTGMLEEIHLE 
PGWQGLFPCVDELSDIHTRFLSQLLER 
RRQALCPGSTRNFVIHRLGDLLISQFSG 
HSAEQMCKTYSEFCSRHSKALKLYKELY 
ARDKRFQQFIRKVYCGRCRGSQQGR 


5820 


19721 


A • 


5867 


382 


74 


LALS PRLKCSGQ I IAHCSFDLLGSSDPP 
PSASRVAGTTGARRHARLPYSI.RHYHFV 
LRLFNPSKHSKLKHFHPILIRNPGKVGS 
YSFQMSDLLQSRRARQQS 


5821 

* 


19722 ! 

A mw W mfmm 


A 


5869 


378 


3 


SFSRSANLI SHQRIHTGEKPFQCAECGK 
S F SRS PNL I AHQRSHTGEKP YS CPEGGK 
SFGNRSSLNTHQGIRSGEKPYECKEQGE 
SFSYNSNLIRHQRILTGEKPYKCTDOGQ 
RFSQSSALITPV 


5822 


19723 


A 


5870 


375 


3 


IHVT VFNQATGHVMAKRVFDTYS PHEDE 
AMVLFLNMVAPGRVL I CTVKDEGSFHLK 
DTAKALLRSIiGSQAGPALGWRDTWAFVG 
RKGEABCHWADTELNRRRRRFCSKVEGY 
GSVCSCKDPTV 


5823 


19724 


A 


5871 


373 


1 


QPEEVSGALSPPSASAYVKLVLLGLIMC 
VSLAGNAILSLMVLKERALHKAPYYFLL 
DLCLADGIRSAVCFPPVLASVRHGSSWT 
FSALSCKIVAFMAVIiFCFHAAFMLFCIS 
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nucleotide insertion 














VTRNMAIAHHLY 


5824 


19725 




5872 


382 


2 


KSFSQKSQIiI IHLRTHTGERPFECPBCG 
KAFRE KSTVI IHYRTHTGEKP YECNECG 
KAFTQKSNLIVHQKTH1GEKTYECAKCG 
ESFIQKLDLI IHHSTHTGKKPHECNECK 
KTFSDKSTLIIPHV 


5825 

4 


19726 


A 


5873 


370 


3 


MGRVRAQNISGVMSGPQKMLMQSQFPTQ 
GQQGFCEGKEPYQAMSQNMGNTQDMFSP 
DQS SMTMSNVGATRLSHMPLP PASNP PG 
TVHSAPNRGLGRRRSDLTISINQMGSPG i 
IGHLNSTTCI 


5826 


19727 

* 


A 


5874 

1 


162 


2 

mm 


GGKFLVLGNLPS KLBESMVQ YYRL VTAA 
SLVRGQISEYNI SLRASDGGS PPLSTET 
HITLHVIGINDNPPTFPHLSYSACIPEN 
NPRGAS I FSVTAQDPDSNNNAR I TYALT 
EDTLQGVY 


5827 


19728 


A 


5875 


369 


1 


RIRPRPTARLASARTLHEVSLQBSIRYA 
PGDAVEKWLNDLLCLDCLNIARIVSVCP 
LPEACDLSYVNRDTLFWCHKASEVFLQR 
LMAL YVASRFKNS PNDLQMLSDAPAHRL 
LNKCLLCPPPV 


5828 


19729 


A 


5876 


119 


39 


VIQDYTTPPNEELSRDLVNKLKPYMR 


5829 


19730 


A 


5877 


461 


2 


RRGWTSRRKPKEDPSGAAVPEMPKKSSK 
IASFIPKGGKLNSAKKEPMAPSHSGIPK 
PGMKSMPGKSPSAPAPSKEGERSRSGKL 
SSGLPQQKPQLDGRHSSSSSSLASSEGK 
GPGGTTIjNHS I S SQTVSGSVGTTQTTGS 
NANSVQLPQPLCI 


5830 


19731 


A 


5878 


503 


3 


PSPQVPGECPSPKKLGARAAFTTPDPAP 
LSPQSRVASSGSBQTEEQGSSRNSFQBD 
GSGMKDVPSWLKSLRLHKYAALPSQMSY 
EEMMTLTEQHLES QNVTKGARHKIALS I 
QKliRS RQS VLKS LE KD VLEGGNLRNALQ 
ELQQIHTPIKAYSVLQATVAAATLY 


5831 


19732 


A 


5879 


421 


34 


LVHKVLSASNVLVDAEGTVKI TDYS ISK 
RLADI CKEDVFEQTRVRFSVNAMHYRTG 
NKGDVWRLGLLMLSLSQGQECAEYPVAI 
PSGLPGGFQDFLKKCVCLDDKK3MESPA 
WETKQNKS PANVLRPL 


5832 


19733 


A 


5880 


379 


3 


AQGLIiLAWPNLNRQHCPCSCSHWLSTFI 
QNWKS PGLS LKTRAPGQGS VLSS S DL PR 
AVLTGT FAVMS VMVGS VTES LAPQALNG 
SMINETARDAARVQVASTLSVLVGLFQV 
WNSQEYVPPTTCI 


5833 

■ 


19734 


A 


5881 


429 

■ 


1 


SLVS YMDTESTAE KLGKE LGLQAKELS A 
VHSSHHEIGVNDSNLFSLEMREPLESSN 
TKSFHSAVEIKNRSQHELPCFQHHGIDT 
PTSLQKRSACCPPSLLRLEETASSSQDG 
DPQI WDDLPFSES LNKFLTVLES E IAVT 
QCI 


5834 


19735 


A 


5882 


436 


3 


WSIiVSYMDKKSTAEKLGKELGLQAKELS 
AVHSSHHEIGVNDSNLFSLEMREPLESS 
NAKS FHS AVE I KNRSQHELP C FQHHGID 
TPTSLQKRSACCPPSLLRLEETASSSQD 
GDPQMWDDLPFSESLIKFKTVLESEIAV 
TQCI 


5835 


19736 


A 


5883 


195 


353 


DSYSYVRSTAPAVAYDSKQYYQQPTATA 
AAVAAAAQPQPSVAETYYQTGGFS 
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C=Cysteine, D=Aspartic Acid, 

1fe=f2liifeimic Acid fftsPhenvlnlnrtinp 

G^GIycine, H=Histidine, l=Isoleudne, 
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Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=»Valine, 

X-Un known, *=S top cod on, impossible 
nucleotide deletion, \=possible 
nucleotide insertion 


JOOO 


10*711 








I 


. Jj9a& V r vn<9 V Oi/ynnnnniJOUUrAJNiinl 

SIAELRTASQKAHKLCGPMBIDSALNTV 
QTLKNEI^CAKMAAVESQLKPLPGBTLE 
KCAQDLGSTSKAVGSSMAQLLTCAAQGN 
EHYTGVAARETAQALKTLAQAARGVAAS 
TTDP 


5837 


, 19738 


A 


code 

5885 


313 


1 


CYC OUT T TT /VPO t5T CVS nAQT Tf fTT VMT 

SWVGWSVTAASRBTEAGIQVSSEEPGP 
RNVS PHSRLQDETNLRLEAENNliAAyRQ 
VREVQGRDREGRKGLMATLY 


5838 


19739 


A . 


5886 


432 


3 


GSGFCSPILCLWVWEEAVGIIRLHLWLR 

KLGCwHSrSIiPGTYSLBrljSTCTQV^ 

SWSGLLFASGNYLSLYLVPSRAETDSRG 

SQPLPVARHGVGACKCKPRQTRPGLSPQ 

VCVEKLMPLSSFCSAFQQNTYNKQPMSR 

PV 


5839 


19740 


A 


5887 


417 


2 


AS LMVAPDYAE I S PIiAMPAPSSGVVCT P 
IMS TS S S EAM STP LMLAPDSGELS P I LM 

f\t\lkATKt Ti 7M CJ TVl T3T 7 T) A Tl O W Ti M O T3 T T »TT"\ 

yUMWFbVWia rUlr V PAF^r KAJYlisFijU J- 1U 

EDTEAMS KVLMTAIiASGE I SSLLMSGTD 
SEAISSIiIMSAVASGGTSPQPTSTLY 


5840 


19741 


A 


5888 


420 


3 


KPDLIKKLDLKTMSSYDLDIEASDGGGL 
SGKCSVSVKVLDVNDNPPEIiSISSLTSP 

S I QDD VLFKLKP S VENFYRLVTEGALDR 
ETRAEYNITITITDMGTPRLKTEQSV 


CO A 1 

5841 


19742 


71 

A 


coon 
5889 


321 


At A 

414 


SAD 


5842 


19743 


A 


5890 


415 


1 


PGL PGTS VKGI PAS KQS PHES PRTLHLK 
TSPI IQQLGLYLSHTAIRYHPQETLKEF 
VQLVCPDAGQQAGQVGFLNPNGSSQDKV 

TKPPTTSTEGGADYPTSPTYSTPSLY 


5843 


19744 


A 


5891 


193 


3 


KPSS KVWSRRDPFLLSSVBQP I KDAVIT 
VPVFPNQAERRAVLQAARMAGLKEMQLI 
NDNTACI 


<0 A A 

5844 


19745 


A 


5892 


A 1 *V 

412 


2 


^JiUVbUWrJJor X VoJjI JrDr ^oijoKAXIjIJ 
WQFFKWKTVTVVYDDSAGLIRLQEIjIK 

apsrynlrlkirqlpvdtkdakpllkem 
krgkefhvtfdcshemaagilkqalamg 
mmteyyhyifttldlfaldvepsv 


CO AC 

5845 


19746 


A 


cor* "5 i 

5893 


413 


3 


IjJjjjJjn-oW 1 aLLKLr UvdUjISVIiVvjIuIA 

aeaklqepwscymclpqrchgvlrrrkd 
wnvrlqafftsdtgleyeapkl ypai pa 

ARRRPXRVLSLFDGXATGYLVLKELGIK 
VGKYVAS EVCEES I AVGTVKHE WR 


5846 


19747 


IV' 

A 


5894 • 


424 


*> 

3 


EAPGTRS CTP S HGQNTAAEATP AQKTPA 
K\ATYVFS T EMANKAAKAVL KGQ VETMVS 
FH I QNI S S SKTERS TAPLNTQ I SALRND 
PKPLPQQPPAPASQDQNYSQNTRLQPCI 


5847 


19748 


A 


5895 


400 


1 


ASVQN PALRLVTREE FAIMQTPAGELYD 
KSIIQSAQQDSIKKANMKRBNKAYSFKE 
QII ELELKEVS TRRHLRKVAAELGGLKL 
VAF P TAHMTjFTWN I IiRPMAHS SDAPDQC 
LVHWHTFPSPLLHSFFHPQIiY 


*848 


J9749 


A 


5896 


421 


3 


VSS IQMGAVNLLGGGLDSLLGSDLGGGI 
AGSPAVGQSFIPSSVQATCAPSPTPAW 
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E=Glutamic Acid, (^Phenylalanine, 
G«=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=ProlIne, 
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T=Threonine, V^aline* 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, * ci Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SSGLNDLSELSTW IGMAHGGYVAPKAVW 
LPAVKAKGLES SGTFTRRQGH I YMAMDF . 
TNKALQHMTDS AI Q FTKNS FGVI PS C I 


5849 


19750 

• 


A 


5897 


384 


1 


PRASRFIHEAGIVPLPMRTHTPKKPFTC 
EVAGVQFSRNVKVKIHMRKHTGERPYSR 
QSCSVRFVHSYDLKNHMHLHTGDRPYEC 
HLCHKAFAKEDHLQRHLKGQ I CLEVRTR " 
RRRKDDAPPHYPPPCI 


5850 


19751 

i 


A 


5898 

! 


429 


2 


QDIAAWQSLTQVLTPESWRKANIMTEPQ 
KSQERYKGIYVKEKLYRRARHDESLNWT 
SCDHHESOECKGEDPGRHPNCGKNLGMK 
STVEQHHWHVLPQP FTCNNCX3VAFADD 
TDPRAHPSTHLGEKSYKCDQYGKILSQS 
LY 


5851 


19752 




JOT7 


424. 


i 


GTMLOALAAHDAGSRAHVLLSLSOQDG I 
EQHMDFDSRYTLLELFAETTSSEEHCMA 
FEGIHLPQIPGKLLFSLVKRYLCVTSLL 
DQLNSSPELGAGDQSSPCATREKSRGQR 
ELEFSMAVGNLISELVRSMGWARNLSEH 

v 

w 


5852 


19753 


A 


5900 


135 


459 


YFMLKAEHVISYVCSENSLMVWNMRFC 
LIGLIHIQEFYFFEMfiPHRAIQAGVQWC 
DLCSLQPPPPGLKRCSCIPPSSSRDYRC 
APPYP 


5853 


19754 


A 


5901 


418 


10 


GRS PKPGDRLWE S VHLSFPGPADSLSGG 
SAPASSYEPSQRSSFSSNRSQRGSTSTR 
NSSQ KGSS VLS I KQKGKRELYMEKLQEH 
LIKAKAFTI KKKFSQNIiVAQLWYFVKCV 
YFGLSAYQ I PRGYPTRVLGNFLTK 


5854 


19755 


A 


5902 


402 

1 


2 


GYRHPELE I KS VDGFQGPEKQP VILSLV 
RSNRKGEVGFPVEDRRINVDVTRARRLV 
TVI CDYRTVSSHAFLKPLVEYFTQHGEV 
RTAFBYLDDIVPDNYSHEDSQGSSQAAT 
RPQGPGTSTRTKKQRQEGGMY 


5855 


19756 


A 


5903 


346 


1 


GICPLSHIKMTDARCQIYEMKGGEKSPK 
DTGKBPGHS EAKTGPPQVLAGVPAQPEA 
PQPGPNTTAAPVDSGPKAGLAPETTETP 
AGASETAQATDLSLSPGGESKANCSPED 
PMY 


5856 


19757 


A 


5904 


139 


212 


E VENNDD I SHHHHHHHHHHSNS KS 


5857 


19758 


A 


5905 


2 

*• 


345 


PCGCWGRCALLLISAAAKAKSKCGPTFL 
PCASGIHCI IGRFRSKGFEDCPDGSDEE 
NCTANPMLCSTARYHCKNGLCIDKSFIC 
DGQNNCQDNSDEBSCESSQGVGYVCGFK 
KA 


5858 


19759 


A 


5906 


432 1 


17 


PQTTPHRTFGGGKAAWLLAVGGQFLLC 
WSPYFSFHLYVALSAQPISTGQVESWT 
WIGYFCFTSNPFFYGCIjKRQIRGELSKQ 
FVCFFKPAPEEELRLPSREGS IEENFLQ 
FLQGTGCPYKSWVSRPLPCIAAALRV 


5859 


19760 


A 


5907 . 


407 


1 


pyacgkcgksyrqssslvshrrihsgvr 
phhcdecgeffsrkydllihqrvhsger 
pykgsbcgks fshs csliahqrihtgmr 

PYECSBCGISFIHSCSLITQQRVHLGTR 
PYMCSBCGKSFSQSCRLIKHRSV 1 


5860 


19761 


A 


5908 


419 


3 


VLAKKTI ITKSARDCHEFGNILHLSTNL 
VAS I QR P DKHES FGNNMVDNLDLFS RSS 
AENKYDNGCAKLFFHTEYEKTNPGMKFY 
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nucleotide insertion 




■ 










GYKECGKGLRRKKGIiSLHQRIKNGEKPP 
ECTACRKTFSKKSHLIVHWRTHTGVY 


5861 


19762 


A 


5909 

• 


441 


57 

■ 


KVMGI FKSSLFQALVDIQBPYEVTLINS 
QKSCEQKXEEANQVLQKWEKTSLIiAPCH 
DRLQKSSELQSLIGQHSSLBTTDGPQPW 
LFPRGSEGHRGKT YDHCHS IGLQ IAVDQ 
RKMLHVLS KI KKIRV . 


5862 


19763 


A 


5910 


3 


317 


CLLMLWRRWGSRGS RQRRLQATP S WEST 
KTADGQGDRRESG SQRHCSS PHPQSKST 
TTSSVPAAREGAQEAAQGGPRJCGQTSRP 
QVSTRQ CAMPPARHRKLENDA 


5863 


19764 , 


A 


5911 


183 


1 


QNS D S KD S LNR VAS RDHAKPNLT CHVS P 
AXQGTG3 XSESS I PSVSDTSTPRRSRRQ 
IiPPCI 


5864 


19765 


A 


5912 


395 


I 


KTIQFNI CVKVFHR FSNSNKDKIRYAGD 
KTFKCKECGKSFHVLSRLTQHKRIHTGE 
NPYTCEECGKAFNWSSILTKHKRIHARE 
KFYKCEECGKGFTRSSHLTKHKRIHTGE 
KPYICEKCGKAFNQSSTPV 


5865 


19766 


A 


5913 

« 


449 


2 


FGSHLEKEDEKKQELVDKAIKPS IEATL 
ESIQRKLQYKRAESSRPEDIKDMTKAQI 
ANEKVALQKALLYYES IHGRPVTKNERQ 
VMKPL YDR YRLVKQ I LSRANTI P I IEEE 
EGSEADSWKPDFMVNKKTDFSARMVKR 
PLGGSARPD 


5866 

■ 


19767 


A 


5914 


464 

- 


23 


S AHMTETRS KS FD YGS LSLTGPS APAP V 
AP PARVAP P ERRKC FLVRQAS LSRPPE T 
ELEVAP KGRQES EE PQPSSS KPSAKSSL 
SQISSAATSHGGPPGGKGPGQDRPPLGP 
TVPYTEALPVFHHPVAQTHKHEKPYLPP 
LYCGRCG 


5867 


19768 


A 


5915 


366 


1 


IiVSFINFFTSVIiATBWFAVLGFKANIM 
NEKCVVENAEKIIiGYLNTNVLSRDLIPP 
HVNFSHLTTKDYMEMYNVIMTVKEDQFS 
ALGLDPCLLEDELDKSVQGTGLAFIAFN 
EAMTHSPACI 


5868 


19769 


A 


5916 


246 


366 


TSVKEQQMPGAVAHACNPSTLEGQGGRI 
TRSGVRDQPCQH 


5869 


19770 


A 


5917 


400 


3 


NDNAPBFYQSVYKVTVLENAFNGTLVIK 
LNATD PDDGTNGD I VYSFRRP VWPAWY 
AFTINPNNGEIRTKGKLDFEEKKLYEIS 
VEAVDKGN I PMAGH CTLLVE VLDVNDNA 
PEVTITSL3LPIRBDTQPCI 


5870 


19771 


A 


5918 


410 


200 


CTPPQPVKCRFLNRDRVSSCCPGWSQIP 
G I KRS S YLS LPECWDLRPVI LLFCFLSH 
ISSKQPYFLPPSYR 


5871 


19772 


A 


5919 


214 


407 


MYFYIDRASLLSPRLECIWTIAHCSLE 
LL.CSRDL PAS AS Q SAG I TG I RHHTWLKT 
HFYSSFKT 


5872 


19773 


A 


5920 


476 


39 

• 


VPYGLFVGSTGRLGLPYLRGTSHPIiRGC 
LHAAAXiNGRRLVQPIiTRNKHEGCAEEFS 
ANDDVALG FCGSKS LAALP AWGTQDEGT 
LEFTLTTQSWQAPLAFQAAGWHGDFIHV 
D I FEGHLWS KVEKGQGTVLLLNS VPVTD 
AQPHM 


5873 


19774 


A 


5921 


430 


2 


SLRPSSTGPSPSGGLSEEPAAKDliDNRM 
PGLVGQEVGSGEGPRTS S PLFNKAVFLR 
PSSSTMILFETTKSGPAbGKAVSEGAEE 
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I SEQDD 
NO: of 
nucleotide 

sequence 


SEQID 
NO: of 
n en tide 

sequence 


M 

eth 

od 

■ 


SEQID 
NO: in 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ino to first 

lilt *V * 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude* 

otide 
location 
cor res Don 
ding to 
last amino 
acid 

residue of 

UUC 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
E^GIutamic Acid. F=Phcnvlalanine 
G=Glycine, H=Histidine, J=lsoleuclne, 
K=Lysine, D=Leucine, M=Methionine, 
N = AsDflrflffine P= Pro line. 
Q=Glu famine, R=Argin*ne, S=Sertne, 
T*=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, . 
X=Un known, *=&top codon, /^possible 

nurl^Atirip riMrtifin \=nnccthli> 

nucleotide insertion 




m 










AKLGVSGSRPEVAAKPALPTQKPAGTLP 
RSAPLSQDTKPP VPQEEAGQDHPPS KAS 
RC 


JO /** 

p » 


1QT7< 

J7//J 


A 




A1 1 
«*1 1 


o 
A 


DSQPKEQGQGVLKKKKKKKKGMVPKNYD 
PKVTPDPERWLPMRERS YYRGRKKGKKK 
DO IGKGTOGATAGASSELDAS KTVSS PP 
TSPRPGSAATVSASTSNIIPPRLY 


5875 

• 


19776 


A 


5924 

i 


402 

• 


2 


LQSGAIMNKFYQPHEAKIPYLLQLFIDY 
NLYGMNL I NLAAVKFRKARRKSNTLHAT 

ILEGVEPQSTCELEVDAVAADILNRLDI 
EAQIGGNPGLQAIWEDEKQRR 


5876 


19777 


A 


5925 


215 


403 


AETSTYFFFPLERQFPFAPQLGGHGRNL 
NSLNLL PRGLREFS RLSLPGG WI YGGAP 
r or icrL 


5877 


19778 


A 


5926 


320 


1 


PGRRDR WI WGQGE CLEYLLASMTQ WCER 
EISS lAPGELCCLLLSFLPQEECQNYVR 
VLIVAGRKVFMCGTNAFS PMCTSRQVGN 

r CTDT»n»17V'TWr , iniPr*DVT1T3'DTrn\7 
LjOXn. i. J. fiJV^i^fuVAK^r XiJJrrULKV 


5878 


19779 


A 


5927 


400 


1 


WATDGGLPLLAS S ATVS VALQDVNDNE 
PQFQRTFYNASLPEGTQPGTCFLQVTAT 
DADSGPFGLLSYSLGAGLGSSGSPPFRI 
DAHSGDVCTTRTLDRDQGPSSFDFTVTA 


5879 


19780 


A 


5928 


398 


62 


HNTE VLVGES VTLECSATGHP PPRI S WT 
RGDRTPLPVDPRVNITPSGGLYIQNWQ 
GDSGE YACSATNNI DSVHATAFIIVQGG 
YPSSVPEPGLSPILTPPLARAAIPTDAP 


<oon 


1 o*i© 1 








i 
i 

• 


VFTQPE YRI S I PENTLVGTR I LTVTATD 
ADEG YYAQ WYFLE KS PGETS EVFELKS 
TSGBLT 1 1 KDLDYEDATFHE IDI EAQDG 

r\ Tin* J. v v a » in/ v iwivui v : 


JOOl 




a 






■a 


VASDC5GD P VIiSGTS RI CVKVIiDANDNAP 
VFTQPEYRIS I PENTLVGSRI LTVTATD 
ADEGYYAQWYFLE KSPGET S EVFELKS 
S SGELT 1 1 KDLDYEDATFHE IDI EAQDG 
PGLLTRAKVTVTVLDVNDNA1IV 


1 5RR2 


J7/OJ 


A 




126 




P VLTCAHFHFSAGAGGS YFM I SRSLGPE 
FGGAEGLCLYLATT FAT AMY ILGAIQ IL 
LVSQAE PWSTLAM I KQFLWTPNRRLMVL 
I PTC 


5883 


19784 


A 


5932 


436 


359 


NKE WLPGSKVPEKS INDVKNTSGLL 


JOO*t 


17/03 








j 


AVVFVAFGTSVPDTFASKAAALQDVYAD 
ASIGNVTGSNAVNVFLGIGLAWSVAAIY 
WALQGQE FHVSAGTLAFSVTLFTI FAFV 
CISVLLYR 


5885 


19786 


A 


5934 


394 


3 


FQYFSRTDRVLKHERMCHENHDKKLNRC 
AI KGGLLTS BEDS VFS TS P KDNS LPKKK 
RQITEKRSSGMDKESALGKCDLKKVKND 
YSPLYS S S TKVKDE YMVAEYAVEMPHSS 
VGGSHLE YASGE IHPPCI 


5886 


19787 


A 


5935 


393 


1 


REDLIAGIDEFLDEVIVLPPGEWDPNIR 
I EPPKKVPSADKRKS VFSLAELGQMNGS 
VGGGGGAPGGGNGGGGGGGSGGGAGSGG 
AGGTSSGDDGEMPAMHEIGEELIWTGRF 
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SEQID 
NO: of 

n ttr\i*ntitifr 

U II VlvVtlUC 

1 

JCUUvI'vv 


SEQID 
NO: of 

o c ij uni c w 


M 

eth 

nd 


SEQID 
NO: in 

09/515 1 
26 


Predicted 
beginning 

n nrlpftfiflp 

IIUUVVIHIV 

in ration 
correspond 
ing to first 
amino acid 

r« ill ue of 

peptide 
sequence 


Predict- 
ed end 

nut* fo- 
il UVIC" 

ntide 
location 
correspon 
ding to 

last amino 

add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D»Aspartic Acid, 

F!=Cw I utn m ir Arid. l^sPhenvfalnnffiP 
G=Glvcine. H=Histidtne* l=Isolenrin#v 
K=Lysine, JL=Leucine, M=Metbionine, 
N~Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serlne, 
T=Threonine. V«=Valiiie. 

W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














. FGGLCLDIKRKLPWFPSCI 


i C0O7 




» 




Am 
410 


izo 


QLIFVFLVETGFHRVAQAGPELMTSQDP 
LAYNFLCSYPNEVFRSQASASDLQFVLR 

NLEPPDAGNV 


5888 


19789 


A 


5937 


426 

. 


1 

. 


DAT FHEI D I EAQDG PGLLTRAKVTVTVL 
DVNDNAPEFYMTSATSSVSEDSLPGTI I 

PT T7TCTC7VrriPT^Cr^A'WRn w T r rPC?T.PT7rtT.DT?TrT. 

\sLiK JMVfiiJitUfauyHAr 1 iL.oJjr'iSiJijrr rJj 
EKSVDNYYRLVTTRALDREQFSFYNITL 
T AKDGGNP S LS TDAH I LLQ VAD I NDNAP 


5889 


19790 ■ 


A 


5938 


438 


3 


ADPTKGLLRNGSVCVRAPGGVSQGNSVN 
LKNSKQACLPSGGI PSLNNGTFSPPKQW 
SKESKAEQAESKRVPLPEGCPSGAASDL 
QSKHLP KTAKPAS QEHVRCS AIGTGESP 

£vE>faAVi-ib uinfa r* nJSoiroKl* r! Air " 

SPW 


5890 


19791 


A 


5939 


3 


191 


YS VCLW SQ.T iTiP.P. T iRQEDHLS PGGGGCTE 
PWNTERLS KNKKTKKREREKQCGFHSM 
WYCNLK 


5891 


19792 


A 


5940 


399 


1 


QYSDDNDMS WKVTRAAAKCIjDAVVSTRH 
EMLPEFYKTVSPALIYRFKEREENVKAD 
VCHAHHSLFKQTS PVQSWLCGLDAMDQG 
GT P LAMLQS QVAN I VKAVHKQMKE KS VK 
TRQCCFTLLTELVNVLLGAVY 


5892 


19793 


A 


5941 


Aft 

411 


3 


fa F Ch<jr.f RKr yL. Ki bKK VlKj\jltV t-U VyKD 

CRDWSDELLKVWCGACLRPLAGLSLLPS 
PSPmiGSRPSSAPCPDTFCSGPLFQPMC 
RPMASHGAFRPQASGLHLYKVLRACPSQ 
VLKNYVFSHKLGLSSFLPRSDHV 


5893 


19794 


A 

.* 


5943 


431 

• 


26 


GVQGO KGEPGVGLPGLKGLPGLPGIPGS 
PEEKGS I GVPGVPGEHGAI GP PGHQG I R 
GEPGPPGLPGSVGSPGVPGIGPPGARGP 
PVGQGP PGLSGPLVI KGEVSRVR 


5894 


19795 


A 


5944 


396 


3 


PLPVELIRVPAFIiDLFMQSLFKPGARIS 

INKDELKSTSKAVETVHNLCQTENKGAS 
ELVAELSTLYQC IRPPWAMGVLKWVDW 
TVS E PR YFQLQ TDHTPTV 


5895 


19796 


A 


5945 


465 


32 


ERVTLADI TWCTLLWLYKQVLE PSFRQ 

EKMAQ FDAKKFAE TQPKKDTPRKEKGSR 
EEKQKPQAERKEEKKAAAPAPEEEMDEC 
EQAMAAEPKAKDPFAHMNKSTFVLDDCI 
AAAL 




lytyf 






414 


•> 


AT P P VP C T P T PMW WD TO PVORPKAO 

ACGNGHGI 1 1 IAETSTGCLFAGSSLGKR 
GVNADKVAIEAAEMLLANLRHGGTVYEY 
KC^QLI VFMALANGVSI IKTGPVTLHTQ 
TAILYDEQIVKVNCKENHSVDEQVY 


5897 


19798 


A 


5947 


411 


1 


EPCWRRIADSSVQTDDEDGESRYLLSR 
RRRARRSADCSVQTDDEDSAEWEQPVRR 
RRSRLPRHSDSGSDSKHDATASSSSAAA 
TVRAMSSVGIQTISDCSVQTEPDQLPRV 
SPAIH1TAATDPKVEIVRYISAPCI 


5898 


19799 


A 


5948 


153 


409 


LPFFPILLPFISFFRYRVGLLSPKLKCS 
GVI I AKCS LELLGS SNPPTSASRVI FLY 
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SEQD) 
NO: of 

UUUCUUUC 

sequence 

— 


SEQD) 
NO: of 

sequence 


M 
eth 

od 
uu 
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SEQD) 
NO: in 
USSN 
09/515,1 
26 
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beginning 

n ii rlRftiiflffc 

U U 11 U C 

location 

rnrrpcnnni) 

Wl I CjUUUU 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 

lutauuii 

correspon 
ding to 
last amino 
acid 

residue of 

pcpLlutS 

sequence 


Amino acid sequence (A=Alanine 
OCyateine, D=Aspartic Acid, 

TTxsfSlii+a mir Arid P^PhpnvlfllnninA 

xLr V-I i U li* II 1 1 C null) Mr X 11 VII j IMimHHC, 

G=Glycine, H=Histidine, I=Isoleucine, 

K—l ,\<iinp I i=T£iicine_ iyf=IVfcthirtriinA 

N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 

iiucicuiiuc ucicuuu, v— posiiuic 

nucleotide insertion 














LSLCLF VF EMES CS VAQAGEQ WRDLGS L 

O 
w 


5899 


19800 


A 


5949 


21 


447 


RAAAMS SRNFRFLSSHRELYFGVDIiPSG 
NL WRE P ADREQLCRAKAACVLTYDLVL 
EDPLELHKIRIHVLDTNDNSPLFPAGDV 
QLHIPEFLTPGARFTIiPNAQDDDBGSNG 

VL 


5900 


19801 


A 


5950 


268 


19 


QDNWAY YCTVPB YCKTKLNI LAWWLVIN 
YLSGDFYFFFIFFLDGVSLCHPGWSAVA 
RSQLTATSSSQVQGRFSCLSHCIAAAL 


5901 

* 


19802 

i 


A 


5951 


389 


3 


QQMDLARQQQEQIARQQQQLLQQQHKIN 
LLQQQ I QVQGHMP PLMI PI FPHDQRTLA | 
AAAAAQQGFLFPPGITYKPGDNYPVQFI 
PSTMAAAAASGLSPLQLQQIiYAAQLASM 
• QVS PGAKMPSTPQPMY 


5902 

- 


19803 

* 


A 


5952 

« 


2308 

- 


307 

• 


RRRPARXKTWVRDGGPHQGL/YPQLPSA 
VIJQPTQPGHGPRALGKGRELESWQGKRP 
GK/GQTHRRKTRGTASPAVYFSVEWGD\ 
S GG C P MTG KGAKQQRAGLGAAPNP E US G 
EQGTGRGRAQDS QGHGGGKGRRGQHPPG 
ND/PRGVSGMGQESVkSPEARPTGGGTG 
/EMLGRTWRHQQGLVGGTGNLLETGSWS 
GSGS WGRGWLRPQEGQVTTGMGLAGR / 
YQPARQ RAVELS PGAQGLRQRRGG WSG P 
PQETEQSGVPGG*GSWPLESHQGEQ\PA 
AGQKQSLQMLPGGFPCWS/ SGMGGGPQQ 
LLESBGAGPSPGGGRHHGKGRVAVTTTP 
REGD\RGQSPGGHTLQLFPCPLWS*ETQ 
GQGS RTVQHQEWGGEGREGEGAGSDQSR 
AIjGIA* I CPHTLRPTSMEPQSTAPAPGQ 
PPGPPS WGHRGHQGMGYGGRCPGQ/ XQG 
RGRAQ LG S TVG * RBG / QRG * MQGRDQQG 
PRSNGAGVGSCSHTSQK* IPPSSLCT*N 
SSHGPASG\QLWWSSPFIHSPGETNIPH 
TLTE PHS VPGWCWDTLRRHGAGQGHPGM 

KKQGLKEPG*RAAPTLKGATRPLCRCLR 
KVQKP KQDGD VGS * LLKVFRAPGALGTK 

XtUjCV X v- tVV3 £Jt\U C C lLt\ XhJNJl v nivo X X 

NSGFGACM 


5903 


19804 


A 


5953 


461 


117 


HPRDIIGPVHQGLPPSPQPPPPCKQAPT 
TPSS\M* P ♦TNRHDOXJLSGPVQVPGADW 
KPLNAP*LPELAFGEEPCRPGVPDPPYV 

ri JUsr U%Mt\tr ATll* V X TUil O JrOXtSX/XjXld X trtvK 

KQA 


5904 


19805 


A 


5954 


845 


610 


FFETGSY/ SVAQVGAQWCNFGSLQSQPP 
RLKRSCHLSPPSSWDYRGAPPRLARFFF 
FCIFYRNGVSPCCPGWS*TPELQ 


5905 

• 


19806 


A 


5955 


1 


378 


KT P VS DR \ ATKCCS ES LGNRRP CFSALE 
AYETYVPKEFNA£TFTLHADICTIiSDKE 
RQ I NKHTALAEL VKHKPKARQEQLKAVM 
DDFADFVEKRCMADDKETCFVEEG*TLV 
AASQAALGLLHRIK ' 


5906 


19807 


A 


5956 


723 


343 


GCQEG I CPSPCPVI PATREAEAGESLEL 
RRRRLH * AE I VPLHS \ KAQR3 LCSSDFX 
RILVIFSGMFLVFTLAGALFLHQRRKYR 
SNKGES PVEPAEPCR YS CPREEEGSTIP 
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SEQID ' 
NO: of 
nucieouuc 
sequence 

'i 


SEQID 
NO: of 
pcpuae 

CMIIIMirA 


M 
eth 

OU 

• 


SEQID 
NO: id 

26 
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Predicted 
beginning 

n tt /»l«M\firl a 

o ucicvuue 

■UvUUUU 

correspond 
tag to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nu vie— 

UllUC 

location 
correspon 
ding to 
last amino 
acid 

rcsiQuc ui 

peptide 

sequence 


Amino acid sequence (A=A)anine 
OCysteine, D=Aspartic Acid, 

IT=r^|iitn tnS<* AiMft KWPhfknvlalatiin* 
JC/^Viuiaiuib AUU) r i ucu YlaUIDIIIC) 

f3«C5lvrine H=Hi«tiHin* iBTvnknrin* 

K=Lysine, L=Leucine, M=Methionlne, 
N=Asparagine, P=ProHne, 
Q=Glutamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine> 
W=Tryptophan, Y^Tyrosine, 

A^UIIKllOWn, ^Olop vUUOIlf f™pv33HJlC 

nucleotide deletion, ^possible 
nucleotide insertion 














IQEDYRKPEPACSP 


5907 

* ■ 


19808 


A 


5957 

i 


55 


491 


AQHFNPHNWP VT * LLPHFTNEGGQVTRA 

AEVTPPVCNQHHWPQVEGIPSS/APAFQ 
WPLTMGHP C\ WES APRQDAS TS PRPGTA 
SHCPSAVAFQERGSSPCPPHHSDSGLCL 
LPRLLPAP 


5908 

• 


19809 


A 

t 


5958 

• 


2 

■ 


413 

• 


KKPDQ \ AE \REHLCTSL \ WSGRNTDKNG 

nor rr/iovninrDnT vtra i wt Yiunnunn ot 
EEijHGGKRVi^RLKKAVKIANs^MDPSIj 

QMQLYIE I LNRYI YFYEKBYDAGTIQVL 
NTLB HLRLRRE S PE S EG P I YEGLILYB 


5909 


19810 


A 


5959 


871 


564 


WMCHATFGSHLN*AWRLRPVNSG/LLM/ 
PSEGKAKKESTLMVGVTNPD\MKGKIRL 
LIiHSDGKDEY VWNAGHPD IZX3 \HPLTLQ 
CPI I KI KKKL* * PKSGRTADGPDHSRIR 
V*VTPLGKEPTTDBVLiABGKBNME*VTB 
TC* I DMN * VLQGQTWNLQR 


5910 


19811 


A 


5960 


*fe A 

390 


271 


GINFFSRN/RSIiTT*SRLV*NSWAQVIV 
LPWPSRVLGFQA 


5911 

• 


19812 

- 


A 


5961 


333 

• 


922 


GLS CRVFGRTLS CPPCILRHLSEVETLQ 
TLQKKEIEDLYSRIiGKQPPPGIGAPAAM 
LSSRQRRLSKGS FP TSRRNSLQRS E PPG 
PGIMRRNSLSGSSTGSQEQRASKGVTFA 
GDVGRM* IQNRSHVSPPHQGPPNSLCSQ 
NUMLSDQQN*ARKI PTLKG* KA\GGHGK 
CSS 1 1 VKS QTYVNCLLiCGGVSSAAYHLH 
L 


5912 


19813 


A 


5962 


705 


387 


CVAQTGVQ * HDLGS LQPLP PGFKQSSHL 
SLPSSWDYRRVTPRLPNF\* FFVETDFC 

VSHRARPTLTLDI QAE I PDLKPSG 


5913 


19814 


A 

»* 


5963 


425 


182 


ASNQ * DS VG VGPS E PGAG YNLLVRFUCC 
\LEKHSSWVGVTQFSRCHLSPLSLTRKG 
NSLTPCTSRVRQCLALLWLTHGSRTH 


5914 


19815 


A 


5964 


388 


14 


PPMYTQIiColKoTQAr CrVrrJaRDB/ o Jj 
ALLPRLVLNSWVQAItiLPWPPKVQG*QA 
QWLTLVIPALWBAMAGGLLBPRSSRPAW 
ATTQDLISTKENKTKSLSASDAAEDCVH 


5915 


19816 


A 


5965 


1 


373 


DDQRVKS V INLI»LAAYTGD VS ALRRFAL 
SAMDMEQRDYDSRTALHVSAAQGHVEW 
KFLNIiEACKVNPFPKDRWNNTPMDETLH 
FGHHDVFKI LQB YQVQYTP * GDSDNGKE 
HHTVHKNIiDGLS 


5916 

• 


19817 


A 


5966 


3 


329 


HEETSRYEIYELIHVHPYAVKQSFDLEE 
YSLSQSTLEQVGHF*GFVSSMVYVYKTF 
PGSCRGSTAIiFSSLYFLPPFQVFTiRTiSK 
EOELGDFE / EDFDPS VKWKLLPOEEP 


5917 


19818 


A 


5967 


2 


47 


MEDRRLVRGI P CPQHNARQCPAVP PGIQ 
AYGAAP FEDLQVDFTEMS KCRGNKYLLV 
LGRTYSG * VEAYPTRTEKAREVTRVLLR 
DLLPRFEL PLRIGSDNRPAFVADLVQKA 
AKILRITWKLHAAYWPQSSGKVERMNQT 
I KNSLGKVCQETGLKWIQALPMVLFKIR 
CTTSKRAG YS P YE I L YHRPP P I LRGLPG 
TP*EIjGEIEIjQRQLQAIjGKITQTI/Y5P 
SKWPVSLFS PVHPIiS PGDRVWI KNWNVA 
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sequence 
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SEQID 
NO: of 
pent id e 
sequence 
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M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
dine to 
last amino 
acid 

residue of 

peptide 

seauence 


Amino acid sequence (A-AIanine 
OCysteine, D^Aspartic Acid, 
E=Glutflmic Acid, {^Phenylalanine, 
G^GIycine, HNHistidine, Msoleucine, 
K^Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine. S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X«Unknown, *=Stop codon, /"possible 
nucleotide deletion, V=possible 
nucleotide insertion 














SLCPLWKGPQTWLS PPTAVKVE* I PS W 
IHHSHVKP * TA 


5918 


19819 


A 


5968 


2 


150 


KSQSI *QSSLLS YFEKLPQPPQPSA\TS 
TS KQDPPLATRL WLAEGSDDH 


5919 

■ . 


19820 


A 


5969 


3 


371 


HEGKSGPMTGWLVAVGEVAMKILLLCL 
CLILLRVRS CRRXAARAALGMEAADAVT 
D*SPDSRLLPD\PPHPVPPQSERPCFLC 
RLCMVCPLU5S PDPSFPCHPTSPQTLRA 
KDSAALDTLHV 


5920 

♦ 


19821 


A 


5970 

i 


396 


58 


YWVLS KGFVTCMLLNSVFS FQASS CLS F 
LINS*AP*PGIFliGMSVFILPFFVFVGQ 
AGLE LLTSGDLPDS ASQ \ AR I TGVSHCA 
OPHTALSOYLEO CLAPSRQL I S VS L I LR 


5921 


19822 


A 


5971 


442 


120 


ICMKQGLALS PRLDSSSAFSNTVQPMPP 
RFKS S YGLSLP /RS WDYRCTPPHMARSG 
NGV* P\ FMVETGSW* AAQGGLEL&SSSN 
LPP/ SAFQSAGITSRGHCTRPAVFLVC 


5922 


19823 


A 


5972 


459 


434 


MWPLPWFSISSPSSPPPT*VSSNNHPCA 
VPRAYPGADI PGPGWQRLPPPHRGLSGP 
PESCLLTTILCSKHPNGCPLTQNPRSLP 
GPRPCLIYSTPSPPt»*AQL/YAAPSP*L 
YTWREALIR 


5923 

• 


19824 


A 

• 


5973 

• 

• 


681 

• 


1758 

• 


VANKCQI PDI KAKTYCI CNTKEKRS YLK 
T*NLHRGF\ IKKQIEEFNIGKRHLANMM 
GEDPETFTQEDIDRAJAYLFPSGLFEKR 

YL FYTGKQS YYSLMHDVYGMLLNLEKHQ 
SHLQAKSLLPEKTVTRDVIGSRWLIKEE 
LEEMLVEKLSDLDYMQFIRLLEKLLTSQ 
GGAAEEEFVORFRRS VTLES KKOLIEPV 
QYDEQGMAFSKSEGKRKTAKAEAIVYKH 
GSGRIKVNGIDYQliYFPITQDREQLMFP 
FHFVDRDGKHDVTCTVSGGGRSAQAGAI 
RLAMAKALCS FVTEDEVEWMRQAGLLTT 
DPRVRERKKPGQEGARRKFTWKKR 


5924 


19825 


A 


5974 


1232 


980 


SLSLSPRLECSGVTAHCSLRLPASSNSC 
FSAS Q VAGT I GACHH I WLVFVFL VETG F 
HHVGQTGLELLT\*VICPPWPPKVLGLQ 
A 


5925 


19826 


A 


5975 


1 


259 


LTSYDYRRALPCLVN\FCVYFIiBQREBIiA 
LLPKLFWNSWA*ATLLP*PPKIFGLQAQ 
VAMLRLKFPAC LP ACLPAYL P P FLFS F I 
PP 


5926 


19827 


A 


5976 


24 


223 


PYLTQISTNESSVCSHARAEFLDHQNQL 
/WLGTVTYTCNPS/TLGG*GGRIT*GQE 
FKTS LGNTARPCL 


5927 


19828 


A 


5977 


395 


136 


GSWD YTHVP PRS ANF / LVETGFRHVA* A 
GLELLGNP PTS AS QS AKITAMSHCTWPS 
SYY* CEYNARFGSLHKRSGMTIYIKSYK 
RON 


5928 


19829 


A 


5978 


1 


785 


GTRLKSGTNTKKALQAVYSMMSWPDDVP j 
PEGWNRTRHVI ILMTD/GSEGTSLLSQP 
PHLLRPACGP*VHL*HYTHGWGPECDS * 
LAVHLSCDPS*GILPKPCDQLSLTLPQL . 
AHPAMCI PAFSQF I FLISYPHGPVSSAG 
LHNMGGDP I TV I DE I RDLL Y I GKDRKNP 
REDYLGE* PA*DPAPHFLRAWTLILPFY 
PSDVYVFG VGPLVNQVNINAIiAS KKDNE 
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mr ^ wr M ^m* ^0 

sequence 


Predict- 
ed end 
nude- 

mm %m 

otide 
location 
correspon ' 
ding to 
last amino 
acid 

residue of 

oentide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E^GIutamic Acid. ^Phenylalanine. 
0=Glycine, H-Histidine, I=lso leucine, 
K s JLvsine, L= Leu cine, M=Methlonine. 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W»Tryotonban. Y^Tvrosine. 
X«Un known, *=Stopcodon,/=possible 
nucleotide deletion* \=D0Ssible 
nucleotide insertion 














QHVFKVKDMENLED VFYQM IGRE IQGNK 
EPNSPQVPLK* 


5929 


19830 


A 


5979 


2 


419 


KAIAP IDSDSSDGSGQNKLKT* KBPI III 
NAI KS I YDSWBEVKI STLTGVWKKLI P I 
LIEDFEVFK\ ADLVE I ARERELEVEGBL 
/LQSQDKT 


5930 


t 19831 


A 


5981 


1 


154 


KKC*MMLTWWCMPVG/PTPPJVEAGGSPE 
TGNLRL* CTMIMS VNSHCTPAWAT 


5931 

• 


19832 

» 


A 


5982 

• 


12 


311 

• 


RKSLNFVEGCNIRLNKCRHTNFVPQTI F 
FFLLFL*LGQC+ *WVFSFFFFFFFCFV 
LFFVFVCVVFVFFCFCFF /GCIiFWFIil, 

/ YFYELFCI VI IMCYYLR* YV* IVYMLF 
FVCWIFVLYLCFHY*LFLFFFVFVFLLF 
ICGVFWFYELFCI VI IMCYYLR 


5932 


19833 


A 


5983 


3 


203 


IGQKRASEDTT*GSAD/PKKSSAGPKRD 
ARQIYCrPPSGKYSSNLGNFNYERSLQGK 
YRWPRLGHTRKS 


5933 


19834 


A 


5984 


368 


138 


STVGKMLSGQHPCYRE/ IL/REQK5QSM 

*HT <?T.T.QYFICKT»POP17nPfi ATTAINT ^OO 

PSAS RQNP P PAKRL * LAEGS YDH 


5934 


19835 


A 


5985 

* 


2 


155 


LANF* I YFC/RDKCLPVLPRLVSNSWPQ 
VILPPWPPKVLGLQGMSPCAPRHS . 


5935 


19836 


A 


5986 


325 


403 


FIYYLFFLSSLYLIITYALFLSSF*R*V 
VYHMVMVLLKVVICHHFLTLFFSFRSLI 
LYIiFFILFM/YLLIFVLFFIQFSFYLFF 

QSYYFIS* FIYYLFFLSSLYLI ITYALF 
LSSFSI 






a 
n 




51 


OjI 


LPJOTFTSQPOTIMQVTLLTRCVAHSKH 
LGTVSYSSWYIKLLFPKSHFIYIiFILRR 
SLAPVAPGVQWLDLGSLQPPPPGPKRFS 
CLS/LPR*LDYRHPPPRLANF\ *FLVEK 
GFCHVGQAGLELLASSDLPTSASQNAGI 
TGVTPAPSPQKATLNLILGGQFHI 


5937 


19838 


A 


5988 


301 


341 


FCFCPCKNRVSLW*WHAPPCPANF\*LF 
IYLFLVETGFSHVGQASLKLLTLDDLPA 
AASQSAG I PGMSHHDGLFL 


5938 

• 


19839 

- 


A 

• 


5989 

• 


71 

- 


1252 


TKEQRSLRSQQLGVRQQRSHRLKFGGGG 
APSARGVGGSWSIiATRI/3 PVSTANMSRP 
VRNRKWDYSQFQESDDADEDYGRDSGP 
PTKKIRSS PREAKNKRRSGKNSQEDSED 
SEDKDVKTKKDDSHSAEDSEDEKBDHKN 

QEEEDEAP FQEKDS GS DEDFLMEDDDDS 
DYGSSK/NEKQKDG*EVQT*KKRKENAQ 
TQTKGYSDAKSSERQRESGSP\QASKAS 
KEKTPS PKEEDEEPESPPEKKTSTSPPP 
EKSGDEGSEDEAPSGED*K* * WSGERFY 
*KKKBKKRKKREBKKEPT*DRTWFWLWL 
DSWGFQCFFPFVBSNISLSLSFFFFFFK 
ANHCMCKCLSYLFVYWSLCQPS PFPMKA 
MSN 


5939 


19840 


A 


5990 


630 


316 


RWNVNSVA\QAG\IQWCDIiGSTGNPRPP 
GF\K* FSCLSLLSSWDYPTHATMSSKFI* 
YF* WRRGFTMLARLVSNWLVLNS * PSDP 
PASAAQSAGITGMSHRAQPIYLS 


5940 


19841 


A 


5991 


2 


182 ' 


WQEQKAE/ CLRDVWTAMCXAALLTIAKG 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W»Tryptophan, Y*=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 








• 






R\ N I P C PMMKEWVRKMGP I YTFE YYSAV 
IMQVILKF+ * HATTGMNPDDVI S EVCQS 
QKDK* RILLS CNNAGNS 






A 

* * 


J774 


79 


278 


EAK*RVNGCSKPTWHMYTYVTNLHTYV/ 
I KPAHCAHVP* KLKYKI I IT 1 1 K* * RWG 
LTILPRLVSNS WAQVILLPWP PKI LGLQ 
A 










7? 


27A 


AGDEHRAWG YQPAM * ARLS4NTGPGDT / 
MPAMASQRAGITGVSH *GCPFFSLIiNKI 
WRSH* VAQAGEQRL FRE *TPGLS CYSCS 
LPSN*DYTSAPLCPPYDT 


5943 


19844 


A 


59,94 


347 


195 


YMHTKHTHTHTHTHTHTHTH P \ E KHLKA 
PNEV*KSKNNV* IKPKQSQSCPS 


5944 • 


19845 


A 


5995 


1 


122 


RYELEDELHLQMTTQLHALKENNYQLR* 
SSHVfl IPST*GYRHVPSCPANP/SSLTM 
FPSLVSSCWAQVIHLPWPYKVLGL*ALK 
ENNYQLRTI I IDCTLNNNIF . 






a 








RPPPCPANF\ *FLVETGFHHVGKAGLEL 
LTSSDLPAPALHQVFYA 


5946 


19847 

■ 


A 

• 


5997 


357 


17 


I LVDKINNVWNLFQNNL WC * RGWGYK* 
DRSFHLLKTWAB* WVYEGSLY* SLSSY 
YLF /C*RQGLTMLPRLVUTSWHQAI LLP 
WPFSLLLCILEVFQNRELKKKNQGKPMT 
GS 


5947 


19848 


A 


5998 


I 


207 


P I LYNLFQKI EAEGI LPNSFYEAS VTLL 
PKPDKDITR/ITTY*FILLT*NILNKVL 
SNQIQLYIKRITY 






a 

** 


S09Q 




760 


EDRVLVLLPKAKSAVAWPRLTATSASWN 
VQRSFHPSLPSRWDHRHVPPHSANFCCV 
CL*RQGLSLLPGLVLNSWA)QVILPPWP\ 
PKVLGITSMSHHARPLS FYYGHFKYIQK 


SO/10 




"A " 








bVRHA^LHIATFKK/ FFCRD/RGLSVLP 
GLVSNSWPQTVLPP* PPKALGLQA 


5950 


19851 


A 


6001 


365 


62 


AITAHCSLNLLGSSNPTASATRIAGTIG 
EAAMSYF* ISSTLTHFSYT/C*LHHVWL 
I FKFFFVDEVYVAQASLELLSSSDPPAL 
VSQNARFTGVSHCTEPV 






A 


6002 


537 


236 


DR VSLCCPGWSAVARFQLTATSAFR \VQ 
+SSCLSLPSSCDYRSAPPCPANFLSFCR 
DEGGGLPGLPRLVSNSWAQAIHPPQPPS 
VWITGVINGTWPKIPLS 


5952 


19853 


A 


6003 


22 


368 


NFFLQNKENI KRXI F * ERKHKLMQRTS h 
LSYFKQLPRPPTTSQQPSTWRQDPPPAK 
R/LRTYKGLDDR 


5953 


19854 


A 


6004 


327 


335 


NQSIFTFNNYPRPF/ CCKFKFKN/TLHL 
WADKVAHTCNLSTLEGQGGRIA*GQEFE 
ATLANMVKP*LLCKVK 


5954 


19855 


A 


6005 


2 


367 


WQFLKKLNTKLLYGPVF/ PYLGI YKQEK 
WKHVHAETCTQMFITVLFI IVPK* KQPK 
CPSMEE* ID*M*FVHMLEYRAIKENKLI 
HATWMNLRNTMLSEISQTQITTCMNLLH 
KMFRKVTETES 


5955 


198S6 


A 


6006 


1 


176 


PGFKQFFCLSFWTRGIFIEMGFHHVG*A 
GLGLLMSGDPPASASQSAGITG\VSHCA 
WTY 


5956 

• 


19857 


A 


6007 


1 


289 


EVLTKI P IR/ FFVE IGKLIQDLKDTGPR 
IA* TMLTKKTEMGR I VHS DLVAYYI VAV 
IKMVWYYW*RDRHINQ/WERLEISEIDS 
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correspond 
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peptide 
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Predict- 
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nucle- 
otide 
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correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleuclne, 
K=Lysine, L= Leucine, ^Methionine, . 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T-Threonine, V=Valine, 
W«Tryptophan, Y^yrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














QNMFNWVFTKMQKLJ? 


5957 


19858 


A 


6008 


439 


223 


FFSPGSH/SIT*ARVQW/PHHSSLQPBP 
PSLKQSSHLSLPSSSDYRCAPPHPADLC 
RDWVSPCCPGWSQTLGLK 


5958 


19859 


A 


6009 


1 


252 


MVSIS* PRDPPALASQSAGITGMSHHAQ 
LFFLN/ CIS/ELISSrVTPGLKESACLG 
LPKCWHYRREPQHPACTVLYLQFQSDFQ 
LK 


5959 


19860 

* 


A 

• 


6010 


131 


514 

■ 


PWFYSLSSSGLLSSFEN/VEKWVPEIIH 
\HSLFLGTTIDLRS\EPCTTEKLN\KDK 
YNPLTPQMADMMVCDLKVVB * LECSAFM 
WKRLK\ VFDEALWAALEED * RS SCRKBT 
STLLQSTPLLVSYALPCTP 


5960. 


19861 


A 

• 


6011 


625 

■ 


141 


ETGS RFVTHAGVKWHNHDS LHPQPPRLK 
*FS YFS LLLS S WD YRHVPPRPANLVYPL 
*RGGPPSMLPQASLEF\I»DSK* IPPCLQ 
PS/AKVLGLQVSHRPSPKVTFHQRAKBG 
DWSHWSRGKS I PDRGNSQCMHRSIPC 
VFEEQKGGQCYCSIESKEETSRI 


5961 


19862 


A 


6012 


302 


42 


GLAMLPRLPGTPGLKQSSCLSLPSSWDY 
KHILS * PAVY/C* FLKNIK 


5962 


19863 


A 


6013 


363 


82 

* 


VSVSLSPSLKTKVNYAVASASFTIAKIW 
KQLKCPSMDKQMK* I WHIHTMEYYSA/I 
KQKI MS YSTTWMNLBDIMLS E I S I SQAY 
" XGKHHMIYGI . 


5963 


19864 


A 


6014 


329 


99 


H I FGNT YF FQVAS S F I YLFRDS VS LCRP 
GWSAVA* S WFTATSG \ FRPSACIjGLPKC 
/WDYRREPPRPANVLFSMRVTH 


5964 


19865 


A 


6015 


324 


238 


LiAMIiSRLVSNSWPQAILLPLPPKVLGLQ 
A 


5965 


19866 


A 1 


6016 


2 


308 


FWKAI AG I EGDS SDKCRQ/S KL KT I WKK 
FISU)AMRWIHDS*EKVKMSIFWKMIPT 

PRDDF/ eglkts vekrtedwktarq* e 

LKVDRT * TDEELLLLDQQRK 


5966 

• 


19867 

■ 


A 


6017 

• 


2 

■ 


457 

• 


FFFLRWSL/ DS VAQAGVQWRDLGS LQAP 
PRGFTPFSCLSLPSSWDYRRPLPRPANF 
FYF* *RRGFTMLARMVSIS* PRDLPALA 
SQSAGI TDVSHRRAERVISKQRIVS VMW 
KPLPEIHI PCI LNLS LRFNHRS VAGLRN 
SLIVRMLSILTHG 


5967 


19868 


A 


6018 


38? 


142 


AHMLFAAQGDSS I PMLVAPLFTWKMWE 
QSKLPLADEWSGFW*HIHVTEY/YAIKR 
RE ILTHATTS LKALCYVT * ALTKGRTL 


5968 


19869 


A . 


4019 


384 


232 


LNLPSSWDYRRAPSHSA/NFSYF+RDGG 
LTML\PGWPQVIQMPRPLKAL*LQA 


5969 


19870 


A 


6020 


333 


84 


GLIDQEFATDTTKSTSLSLT*T/IPSRC 
SIiCLRGSSDSPASC/SQPPASYAGITDT 
CHHAHI I FVFLVETRVHHVGRANTEKPR 
LH 


5970 


19871 


A 


6021 


106 


366 


DTISFFFFLKAPIEPVFIIFIFFYFFFI 
FFFI*FFFLLFLFLFIFYFVF/IFFFIL 
LCLFFFFLVLLFYLLFFF/SLFFLFCFV 
FCFFF* FCYFYFYLFFILFFFFFYLALF 
I FFFFGS S FLF IVFL FL FFL FCFVFCFF 

F 


5971 


19872 


A 


6022 


384 


18 


REKKPGPGTCKPPKKKGGPKFGPKGPKK 
RGEKKKKGGP/NS FFPRGGGTPGGNF*R 
GFPPENQKKRGRKKFFQGGKRGANPGIP 
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Amino acid sequence (A=Ala nine 
OCysteine, D=Aspartic Acid, 

U^fjIlltQmi^ A Jllfl 1|WD||A|1l»lAlAHIBkA 

cj — v?iu utniic aciu, r = r nenyiaianine, 
G=Glycine, H»Histidine, I=Isoleucine, 
iv— ijysine, Li— Leucine, ivi— ivieiniontne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
a— i nreonine, valine, 
W=Tryptophan, Y<=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucieonoe Deletion, \— possioie 
nucleotide insertion 














PLLGP KAEGGF I PGKKKKKKKNSNKKS K | 


5972 


19873 


A 


6023 


3 


191 


LVKGKIRSWEETWVKTW/RPYPGVMAH/ 
rYNPNPLGGQGRWIA*AQBPKTSLGNW 

BT1DXXT .WWT 
JNx'rlLi X ruSJXd. 


5973 


19874 


A 


6024 


364 


142 


I INGLI IHLHNFKINGHTLSTWYTFHIL 
CVCVYIYICLC\CIYVSVCLCVCVYTHI 
QYTY*INKHIC*RLEAK 


5974 


19875 


A 

V 


6026 


1 O A 

338 


102 


WQFT1 IjARJjVSNS * Hli V I CHLiDIjP KFWN 
YRHEPPCLALFLIFVGT**VYIFMGYRR 
Y/C*YRHAMCNNHIRVNMVSVGLNF 


5975 


19876 


A 


6027 

• 


rift 

99 


77 


REICTPTFTEAFSIEAKGWK*PKQPLTN 
EWINTMWCIHMRGKF* P / YAKTSMKLED 
VFtffilSKSQKDKHYYI*FLSPTYVKIQA 


5976 


19877 


A 


6028 


2 


328 


TITYRGAKIRITSDSSSBTMQAREE*SE 
IF*VLK/EKKKTH*PKILYSGK1jSFKSK 

geik/yf*gkqn/lkefvssrpvlqgml 
kwlqregklyrsetqtykkkekaseke 


5977 


19878 


A 


6029 


8 


241 


GFAPLPRIiECSNTNMAHHSLNLSGSSDS 
PASANFS I F\ I ETR FGHVAQAVL * FLS S 
SNPPISPSPSVEITGVSHHARP 


5978 


19879 


A 


6030 


108 


362 


CCYLMVTEISAHQNPAKQGLA/HTAFP* 
KTFFLKVSWTGTVAHV*KPSTIiG3*GGR 
TS * SQKFKTSLGIMRPPSLQI KKKVFKI 
YP 


5979 


19880 


X 


6031 


1520 


77 


ASSSSSSSSVILSS*NIPNYNYIVIFM/ 
CL*IIF*AASSSSIYYVPGICYIYH\IL 
CVCV C I CMCI YVC I CI YS I C* Y I VYVS I 
YYIHY 


5980 


19881 


A 


6032 


405 

> 


100 

• 


EGFFLPPRGGGGKR/PPGPLTPGGPGNP 
RPKPPQKWGKPGPPPGGGPF*WVLLSPS 
LPGGQVESPGAVSGPRQVPPSGSRPFSP 
TAVGTTVT FJjS KKKKKQ V 


5981 


19882 


A 


6033 


42 


309 


CDKFFHKASNHIYVSTYIWVFQICVCVC 
VCVCVMCirjC*AY/CGCLGIMIWFLFVL 
*CI/CYCCY*SSLFSVHCIFLMCFVSY 
ERRFIIiFA 


5982 


19883 


A 


6034 


397 


2 


KGPPP *GG PD FF \ * PRGGPPQKNPFFPP 
GGGPPPPKNGVGGPPPQKAPFFGPKRGA 
PQKGGPPGGGPPRGEKKSLCPPPNWENT 
PRGLKKGPPPFFFFFFFFFFFFFFFVLR 

oloioKAF r VO Vr JL V £LiViSiJMo 


5983 


19884 


A 


6035 


343 


2 


LWPPQGS FKTAAPFFFFFFLFISDFITF 
HFLYM\YVFFIPLFLL*SFFCIK*IFYY 

SF*LIRIYFRFILT*FQ*DTETLFFYYS 
IP 


5984 


19885 


A 


6036 


319 


68 

■ 


SLGEICAPK* FFPPAQKKGPFQKFPSCK 

c oirD r vr cvx rvrr xuiivr i\\jr it rv p x v/i 

NPGKI \GPPKGS FKRPPLFFFFFFFFFF 
F 


5985 


19886 


A 


6037 


419 


191 


RIHTIPITALFAVAKR/WQSKYPLVDE/ 
MDKIWYIHTMED*AAYGKKBIMSHARTW 
INLKNIMLSEISQSQKDKYFPFI 


5986 


19887 


A 


6038 


434 


401 


YGKNFSFSLKV* IFCEGLVLVFFPPKKK , 
/ FFFKNFHRWFPLYVFF*TGRGVVFFK 
PPFLEKIFFFLTRVNLGPPRGFF*GAPF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFS 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidme, I=IsoIeucrae, 
K=Lysine, L^Leucine, M a Methtomne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arglnine, S=Serine, 
T=Threonine, V'Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














*NF YFMN* I FTVI 


5987 


19888 


A 


6039 

• 


12 


346 


SFSFVRRSLTVTQAGVQWCNLGSLQPP\ 
LPPSPK*FSCLSLQSNWDYRCLPSCLSN 
FCIF/M*RRGFSMLARI*VSNS*PQ/CDP 
PASASQSAGITGVSH/SHLV*NA*SFKT 
ESHF 


5988 


19889 


A 


6040 


275 


45 


KDVQHQWKIVQPLWKQYGSLKASTESPH 
DPS ILLU3TYPRKMKT* IHIK\TCIRMF 
AEALPLISKK*KLPKCLLADK 


5989 


19890 


A 


6041 


355 


1 


FGTI KNFFFYI FFFKNYF/ FFLYI I FFK 
KKVLFFF*KKKIFFTiFIPNKKIFWVFFF 
FFFFFFFFFCPRFFFFFFFFPFFFFPPF 
FFFFFFFFFFFFFFFFFFFFFLVTRHLF 

* 

EMRKGN 


5990 


19891 

■ 


A 

- 


6042 

• 


635 


209 


plni kktffflrqsftlvaoagaqwhdf 
gspqpppprfk*fsclslprswdhrrpp 
prpanlvf\dflvbtg fhrvs qdgldll 
t/s/gdlpalasqsagitgvshcarptii 
eflkvkemifcsttvkavldhanflilr 

FSIT 


5991 


19892 


A 


6043 


12 


348 


ESRYSSICLPFSSVYCSRI ISLQLYSYT" 

LLVYILFFF/CFIiKREFGFCPPtiGVQGP 

GLGSLNPPLLG*KQFSGLIFPGIGNtGL 

APPSPPNFGFFRKKGVSPCGPNRPAISK 

F 


5992 


19893 


A 


6044 


192 


1 

* 


I FHRFCT/H I TLP / TTALNNN * PWIX5TM 
AYAYNPNTL*G*GRRIV*AQAFKSSLGN 
MTRPCLYKK 


5993 


19894 


A 


6045 


338 


42 


WKTAW W / FLKKLD / MNYC * D PAI LLLG I 
YPKELKAGT*TDIRTSMFTAELFAIAKR 
WKQNVDLKNLRQNECRQFIWAKVEGNWL 
PSWFGEWSIQHRLQAGF 


5994 


19895 


A 

.* 


6046 


300 


3 


RLDCHYLHSHTHTHTHTHNPAGIYTHH/ 
HC WNLHTHNPGG I IX3VALDL* MQFLLNR 
GWSWGGEQMAWPCPQGHVMWLGDPGKMI 
LKPPLWVCQLQGSSYDG 


5995 


19896 


A 


6047 


332 


3 


SLES AGFT* YKMYFYYYYKI K\ YIYKS I 
FIIIiFL*IFFFFDLiFFIFFFFIFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFLECSP 
KRVIYFGFGTQNPQMFSLLTLKYVLC 


5996 


19897 


A 


6048 


2 


152 


KNLKISWT*WLMPWSAT/WGGSLKPGR 
LRLQ * AVI APLHS S LGDRARPYL 


5997 


19898 


A 


6049 


1 


288 


PIIIYVIiLiLLFFETDSHSVT\RL*CSGA 
FSTHCNLCLPGPGDSSASDS /RVAGVAP 
ACLF* PYRGFSGPATLAFRALDPGLPLH 
PGFSLQRPSCSRGG 


5998 

• 


19899 


A 


6050 


358 


1 

- 


FSIFSILIFDFLGGFLKFFAFFYGWDFL 
RFPFFYR* PFGPKSSFIFPHVF*R\WI W 
FLLLS SWFLKWSFDPQFFLKRFFFFPFF 
c elSJor c X JxK v£ r r rur b r roJMJLil il ooyva 
RGVWLI 


5999 


19900 


A ■ 


6051 


308 


45 


ADLSAEAL*TRREWDDIFKVLKEK/NKG 
QPKI LYPS KLSLINE/NEI KS FPDKQKL . 
REFTTRLVL*EMLKGILHMEAQGQYLPS 
*KHTKV 


6000 


19901 


A 


6052 


2 


124 


I FCVLVETGFHWVAQAGL* LLSSGSPPA 
S/DLPNAGITCVSH*AGITCVSH 


6001 


19902 


"A"' 


6053 


I 


330 


LC*PGWSR\FLTS *SACVGtiPKCWDYRR 
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ding to 
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■ CjlUUv VI 

peptide 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

F/=Glutflmic Arid Fs=Ph en v In Inning 
0=Glyrine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leurine, M= Methionine^ 

N^A^nnmoinp P=Pi*olinA 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine; 
W=Tryptophan, Y=Tyrosine, . 

XcaTTnlrnftwfi *=Stnn rnHnn /cnncoi Klo ' 

nucleotide deletion, \=possibIe 
nucleotide insertion 














EPLHLVFFFFFFFFT* IYLKGFYLNPRK 
I FKNPFPLLKNPFFC3VPPKIOFSPT PNIj 
P SLWPNMVF FKRVLGE KI FRG F FP P 


6002 


19903 


A 


6054 

* 


34 


293 


S PI LFLLCIxRHQS GCHPS WTAVAQSELT 

GWSHAALPRPDSNF* PQAILLPQPQVLG 
LQA 


6003 


19904 


A 1 


6055 


319 


217 


YHFTPxX3MAISKNRQKTGDNKCC*GCRB 
IGMLVCC*WQYKMVWI»L/WKTVWHSIiKG 
rK/L*ELPYDPAIPL*GIQRVSWKPIPH 
SSEG 


6004 


19905 


A 


6056 


341 


151 


VHAGGS PES/ Kar EiXfflAxNKARlJiuYIUt 
* ISQMWWCIPAVPATHAGPSLHFNLYYS 
PWLFQCDY 


6005 

- 


19906 


A 


6057 


2 

■ 


558 

* 


FFFLRWSFTLIAQAGVQWCNLSSLQPLP 
PKr KKroi-JjoIjPfabWDxRnVi'P \KUftNr 
FVFLVEMGFLHVGQAS LBLLASGDPPSG 
DPDIiR*IiAFQSIiGITGVT/HHAGQI FLF 

\GSE*FSCLGLPSSWD\YRHALTHPANF 
\SIFSRDGVSPSWPGWSQMPDLS 


6006 

• 


19907 


A 


6058 


320 


21 


EICMTTNHPSVSNTIW/SKMTLASVYLT 

RYFVFYFI IMLFV* RQGLTMLPRLVSNS 
WAQYILPPWPPKVLGLQA 


6UU7 












DTFE*SWSVI/PGiGVYSS*E*TRGLAM 
KPRQVYNSWAQAI LMPWLPKMLEVQT 


6008 


19909 


A 


6060 


616 


354 


ERVS PCHPGWS AVALSQ PTAAxVTS W \ VK 

RNE/SLLMLPRLTSNPWAQVICPP*PPK 
VLGLHA 


6009 


19910 


A 


6061 


214 


11 


AASTRPSAWQPPLLGSEEPLCPATTPSG 
KRLRNRLVAMSV 


6010 


19911 


A 


6062 


349 


54 


QSKSAFPKKRINRISDSSEGYGQSKLKT 
r rir*\jr x x u luxx iui x u» oom v n x x jupkj 
V/ WDDFBGFRTS VKE VAADVC/ ETARQL 
EVEPESVTG*QQSHVQP 


6011 


19912 


A 


6063 


332 


3 


SQPASGQI SKVTQSTIiRCLKMSVQQVKS 
CYKCTATYliCAKFNSHEFRS SGVCENPV 
SYFYTHSIRSKIL/WPGMVAHTCNPSAL 
ESQGGCMT*GQHCETSMVNMVTPRVSRA 


6012 


19913 


A 


6064 


174 


416 


NENELFFCFFFXIFFFFXFFFFFFFFFF 
FFPFFFGFFXPFFI?PFFFFFFFP l XP r I?VP 

FFFFFFFFFFFFFFFFFFFFFFFXF 


6013 

* 


19914 


A 


6065 


826 


571 


DGSHSLA\RLECSGVI S AHCNLHL PGSS 
DS VAS AS * VAGTTGTCHHARL I F/WFL 
VETRFHNLGOAGLELIiVTHPPWPPKILG 
LQA 


6014 


19915 


A 


606* 


1 


284 


GTSFFF*NRVLFCCPG*RAVPQ\SWLTA 
ALTSWAQGSS*NHRLKP/HMLSFFNFCK 
QGLTMLPKGALNSWAQAILQPQPPM*LG 
LQACTYAWSLGP 


6015 

1 « 


19916 


A 


6067 


36 


384 


VSKNKPIHLFXjFFFSKTKSWSVTXRIiEC 
SGVTSAHCYLPXPGSREGAASAL*liSAT 
TWIiCKVALPLGEDLGALCRNPWGAGVKG 
S SRES PPLGEARGGELPWGQVRI RLWGR 
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Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanlne, 
0=Glycine, HHHistidine, Hsofeutine, 
K=Lysine, L=Leucine, M=Methkraloe> 
N=Asparagine,P=ProHne, 
Q=Glutamine, R=Arginine, S^Serine, 
^Threonine, V=Valine, 
W^ryptophan, Y=^Tyrosine, 
X=Un known, *«Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GEPAP 


6016 


19917 


A 


6068 


393 


116 


PQPIFPRY*QNKDPNL*KTF*HKYMPKC 
PAK\LIIFIEKGFSRFGQPGL*HLSSSlff 
LPALASQSAGISGINPGGRARFPLFFFF 
FFSLLLLSF 


6017 


19918 


A 


6069 


53 


280 


FPRFCFC/LLESESHSVA*ARVHWRRIiG 
PG/LKRFACLKLPSNWNYRHTPTHPANF 
CIFIRDQGMTL*PSWS*T\PDIK 


6018 


19919 

* 


A 


6070 


3 


^77 


HEE POGCATVPGE FFKFLWRHGL ITYPP 
LVSNS WAQAALP PQ * METHI I FS IGRIiH 
MVK\RQYWSGAVAHAYNPRLLGS *GGKA 
A*AQEFETSLGYVMRPCLHKNLKNSPGT 
VAHPCGSSC 


601 Q 

Uv 1 7 

■ 




A 




j j / 


J 


KGARQQSHLRKSIYV*EIASAKALRRHM 
PGVFQKQ PGG / W PGWAHT * NPRTLGGR 
CGWI T * AQELETS LGNPEKPCL YKKYKL 
V 


fJY)(] 


19071 

1 77X1 






1 ft/17 


A*7St 

OJ.O 


f r ii i.ifmorno vn^nuvynuuiJUdijyruJJr 

PGFMPFSCLSLPSSWDHRRLPPRIANF\ 
FIFLVATGFHRVSQMVLIS * PCDPPASA 
SQSAGTT/GARPKVF* IFVYMYFGVRKH 
S ILMSMPOHDLFGYKI 


6021 


19922 


A 


6073 


27 


364 


MDLPSVS I FDLRSLWADISRATSTLFWL 
LFS WNCFLHLLS FNLFLSLGQK* VCCR* 
H IQS CVT *Q WRY\ WSGIiVTHACMPSTFG 
SQGGWVASV*EFETSLCNMAKPCLYKKY 
E 


6022 


19923 


A 


6074 


2 


68 


ARACSHGTVALTAS CFPLQLD YKCLQYS 
FVCFFFFETRSNFVT\RLKCNGPISVN* ! 
NFNLPGLTRSQA*ASREAGTTGTCYHA* 
♦ILLPSSTRL 


6023 


19924 


A 


6075 


330 


49 


KRRFALWQAGVQWRDLRSLQPLPPVFS 
CLSLPSSRDYRCPAS /RPS * FFL3LLTS 
GDLPTSASQSAGI IGMSHCAQPPLI YPL 
FQIKVILFFS 


6024 


19925 


A 


6076 


2 


267 


ARGVEVKVGKTQKFIiNMIRRTLVHCWWK 
CKLI QPLF / WRI LP KVKEELP YHLAIL P 
LGIYPKQMKSISL\KYICSYI * IVDHFV 
♦AILPLGIYPKQMKSISUJIHMFLYIDC 
RS FC IGTFLLL 


6025 


19926 


A 


6077 


379 


236 


PPPPQTRETPPPGHVHKDPPPPHHPYPP 
PQEPQPRTPVFNTHPGEEQNHBLETCSSL 
LEPWW\YYSSLLPPEQA*BPHIH 


6026 

* 


19927 


A 


6078 


7 


290 


RRSLPSSPLEDGRLSIAPGLTWEPPSHP 
PPQ*RSPPPPKKKKKKKKSPPGPKF/HK 
FKTCKPEKPPRVL*RGHWPPFKFF*LKP 
GLPFF*GTKPPKV 


6027 


19928 


A 


6079 


1241 


1487 


KMYCI FFVE* KPCLSCAIFIRLLKWS *P 
TVCNS ISGGPPHG* IMPSI IL* FLFLET 
VFHSV\TRVGVQWPDHGSLQPGPPGUC* 
SSCLTASQSTQI IGVSHHAWPPSITFEI 
FKRLAYSMSADGW* KS PPMKLDLGALKK 
KLSFARPG 


6028 


19929 


A 


6080 


333 


1 

t 


LCHHVQLS FKFFVEMEVSLCGPGWPFK* 

FSHLGLPXCWDYRHDPPHLAFFFSLS\* 
FFCRDRVSMLPRLVLKSWSQAILLL* PP 
KVFFLFVC FLRRVS FHHAEP YAVS SC 
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peptide 
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Amino acid sequence (A=Alanine 
OCy stei ne, T>=Aspa rtic Acid, 
E=Glutnmic Acid, ^Phenylalanine, 
G=Glycine, H^Histidine, I=Iso leu cine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparaeine, P=Proline, 
Q=Glntamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop cod on, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6029 


19930 


A 


6081 


2 


239 


ARGKYIYNRDKISPCCPGWNAII/RSLQ 
P*TSELKRSSHINRPNSWDYRCMPSHLA 
NF * I FRRDGGLASLPRLGTGWLKS 


6030 


19931 . 


A 


6082 


323 


185 


SACLGLPKC/WDYRREPIrCLV3R*VIiNS 
* AQAI CLPWPPKVLGLQA 


OLL>i . 


1 0010 
1 yyjjz. 


n 

•** 


OUOj 




*% 


GERI LE SCI YARSHRAEI TSLHCZSLS VS 
K/HSVAQVWKWPDFCSMQSMPPRFK*F 
SCVSWSSWDYGLPPPCAGLHIWRRNVT 
LALQGLPTTSLALV 


OUjZ 


i 7 7 J J 

* 


A 


DUO** 

! 


* 

fr.? 


l 


OKP K* * KTOTWS S I S LOSRKMKKKI LS V 
ITPFKT *GEEEEEKEKKRK\KEE*EK*K 
EPSQKKEVEEVEVEDERRGRK* KVDMEV 
KEEEKDKGEEKSGEEQ*KESAEAAVRE* 
VDS /SEIjVTFAVLPGDSSEEEQGMMVPR 
A 






A 


OUOj 




i 


SNILI PKAOVTKHI YSLETCSPDYPGIS 
QAI KANFS PCTVG * LPSNWAYRHLPPCL 
DNF* I FCRG\RS ILCRLISNSWLQQVKL 
OEFPOFLiGCHARFOGAGFLSHCLV 


6034 


19935 


A 


6086 


299 


46 


LDDLHLLS LCLS LFRDRVS LCYLGWS S V 
A*S*LTVA\*TPVILSIiRLK*SSHLNLP 
SSWD YR WSNS * PQdkLLPRS PKVHLLL 


6035 


19936 


A 


6087 


266 


48 


GIEIVRQMRSRTGADKVIYIVQSPLLKM 
QKI / WLGTVAHTCNPSNCGGRSGRIT*G 
QEFKQSSHLSLPKCWDYRL 


6036 


19937 


A 


6088 


2 


185 


ARVTMLPRLISNS \GFK* SSHLGLRKCW 
DYRCE PPCPAE I S F I S5GYLTRSGI AGS 
YCNSV 


OU J / 


1QQ-2Q 
177JO 


A 








LGYRDFIjHTjSFPIS /wnyrclpscpa/n 
FFILVETN* FHHVGQAGFEHLTSGDLPT 

sasq\saeitgmtyraqpaags 


6038 


19939 


A 


6090 


388 


187 


SQHLGKLSREDCL/RLIVPDQS \GQHSK 
TPSLKK* F*K*AWVW*HMPWPAIQEAE 

a» t«a# Aal aV^ ■ * a> * * ■» ~ T W » ■ •* *• *aW « * • ^( aVaM 

VGGSLEPRKSKLQ 


6039 


19940 


A 


6091 


188 


329 


TLGLKRFFCLSLR* F I YFF PRDRVLLYY 
PGWKAVGIHRRDYSMLYSIjEIiIiASKDS / 

aW * * afc 1(4 W mar aVA 4#>%aV a% V^aw ■ al aaav aa Vaaf AMvCafli MrAlaww * 

SCLSLRSGWDYCVWLIFFGCM*CKRQDL 
SIIPWQRSLIGQ 


6040 


19941 


A 


6092 


124 


342 


RPAWPTWQNP I S AKNTN ITWA* AARVAG 
IT\GTCQRAQVMPVFLAEMGFCHVGQAG 
LYPKI I ILYDKLSHYKCSHLQMRJCALBT 
YSGIYSLFTIYLV 


6041 


19942 


A 


6093 


169 


2 


SH*PFTSSTIRSIWT\GAVAHTCNPSSW 
GE *GGRMA* DQKI ETS LVNMTRPHLYPR 
A 


6042 


19943 


A 


6094 


2 


324 


FFFLRWSL/DSVAQAGVQWRDLGSLQAP 
PRGFTPFSCLSIiPSSWDYRRPLPRPANP 
FYF* *RRGFTMIiARMVSIS * PRDLPALA 

4* A ata ^^a> *^a» ^ afc • wa^k a«BB • » Na* aa% fca* (VArMA •» AHf a» 

SQSAG I TGMSHRARPAVYILTSTSN 


6043 


19944 


A 


6095 


76 


340 


LRKKFCFSVYPFIHRWTFELLPLNDFLM 
NPYFRIVFRFMKKLQR\WPGAVAHAYNP 
STIX3G*GGRIS*AQEFKTSPGNIVRDPI 
STKNM 


6044 


19945 


A 


6096 


349 


84 


FIiI*SLF*DTISLCHPGQSAWRTMTSQL 
KQSSHRSLLSSWDCRCVPPCPVNFFSFC 
RDE/SLAMLARLVSNLWDQALVA2AGVS 
VWARQE 


6045 


19946 


A 


6097 


346 


184 


LTMLPRLVSNS \GLK* SSHFGLPKCWDY 
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X=Un known, *«Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














RHEPPCIAKISLYITKFFHILSLPI 


6046 


19947 




6098 


378 


250 


FKQFSCLSLPISWDYSREPPQMANS\*F 
LVETRFHHVYSYVP 


6047 


19948 


A 


6099 


3 

• 


225 


QFKRFSCLSLLSSWDYR/HPAPYLAKFX 
*FLVEMGFHHIGQAGLELLTSGDPPTLA 
SQSAGITGVSHHAQPLI IF 


6048 


19949 

i 

t 


A 


6100 


1129 


457 


YC VS S KYRRI I S FS LI A* LLLTKLGS / V 
QKVGVQWRDLG* LQPPPPG\FKLFSCLS 
LPSGW\DYGHVPPHPA*FCVF*WEMGVS 
PCWLRLV\LNS + PSGGSRPPSASOSAGS 
LQGLI \HRARPEH* F*TS * PLGSGKTLD 
EIYIKTKTFYQFCIVCLKIiH*RIiMEFAL 
VAQAGMQWHDLfGSLrQPLPPQFKWFSCLS 
LPSS*DYRYWLLYHPALELKEAPSFPBS 
PCTH 


6049 


19950 


A 


6101 


3 


296 


EIIQALLITI VLGLYRTLLQAS * YL*SA 
FTISDGIYGSTFIiAA\TGLHGIjHVIIGS 
TFLT I C FIRQLI FHFTS KHHFGFEADA* 
YWHFVDED*LFLNG 


OUDU 


* 




o lUx 


i 
i 




P H RHttT . PffT T T .T . T * TQT ,T T (Z<1 T AT .AftT.T . 
iA-TXQVjyj i_i r tv x j. i lui lam njoununwiju 

FFTGF YSKNHI I E \ TAN I S YTNA* ALS I 
Til ATSLTSAYSTRI ILLTLTGQPRFPT 
I/TNFXENN\PALLNP\ 1 1 YVAAGSLFAG 
FL ITNKIS PAS PFQTTIPLYLKLTALAV 
TFLGLLTALDLNYLTNKLKJ KS PLCTFY 
FSNILGFYPSITHRTIPY\LGLLTSQNL 
PLLLLGLT*LEKLLP\NTISQHQISTS I 
ITSTKK\GILKFYFLSFFFPLILTIjLLI 
T 


6051 


19952 


A. 


6103 


377 


I 


SKAFGPPGF*APYGLKAHCFPPGFRRGV 
WAPSGFWARPPIGYPFGALIGAPVWPSP 
GGPPKPGPP\GPFGVPKLGGKRLGIGGP 
FPGSPGFLTTPGSKKKKSTKKGKPQKKI 
TPLDTSCSRTIVM 


6052 


1 77JJ 


A 


6104 


1001 




FFLRRSFTLVAOAGMOWCDLGSLOPPPP 
GFKQFSCFGLLSSWDYRHVPPTLANFSV 
FLVEM\GFSMLARLVSTS* PQ/CDPPAP 
ASQSAG I TGMSHCAQLPKP I FNYL * AID 
SVTTS ISLARDWPNSS ENRLLGYMGKPS 
KFKRQEQKMECYRDLAPS 


6053 


19954 


A 


6105 


376 


117 


CSDWSAVSRSQLTAAPNS*\VKQSSCHL 
SLSSCWN/YRCQPRHLANFVLFCYVLF* 
RWZVLxAMLPILVLNCWPQAVFLPWPPKEM 
GUQA 


6054 


19955 


A 


6106 


689 


433 


LCHPGWSAWQSWLIATSPSLV\KPSSH 
L SLL S S WD YRCTS PCPANFFVFF Y* R*G 
STM* PRLLWNS WAQAI LPPRP P KVLGLQ 
A 


6055 


19956 


A 


6107 


2 


176 


AREYPANFFVCLFFLILCRDG/DFTMLL 
RLI SNS WLKRSDRLSL PKW * DDRREPPH 
QAY 


6056 


19957 


A 


6108 


328 


69 


LYIFSFG*ERVLFCHPGWSAVAQSQLTA 
ASTFRTQ VLSLIGRWDNRHVP PHLADL / 
SFYRDGVSPCCSG*SQTQAILLPWPPKV 
LGFQB 


6057 


19958 


A 


6109 


111 


329 


FLEMGSHS VAQAG VQ WRDI/SS LQP P PPG 
FKRFSCLSFPSSWDYRRPRPRPA/NFFV 
FLVETGF\TILARLVSNS * PRDPPASAS 
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K=Lysine, L=Leucine, M=Mettaionine, 
N—Asoaraeine. P=Proline« 

m\ * m mm mm* m UElllvi m* M\ m mr m m m • J 

Q=Glu famine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJn known. *—55tnn codon. A=DOssible 1 
nucleotide deletion, \=possible 
nucleotide insertion 














QSAGITGVTYPRPALTQNFSNLTFGEEN 
RKPNPEDSCVRYLLLSIRQRS PTALLHV 
ILFQQMIQLVL 


6058 


19959 


A 


6110 


345 


309 


ASLIFV*LVEMRFLHSGISLFNSGKNGP 
TAFSQDPPG\SSVSKKAIIIIINKINK* 
IGHVTV I OI BVO WHNHVSLOPLNPELNR 
FSRLS LLSSWDHRHMP PHSANF * NFKN 


6059 


J 9960 


A 


6111 


332 


3 


RKVLVC P PGLS AVAGFQ VPARS AFQVHP . 
LFLEAEVAVSPGCAPAFKPGPQGPF*KK 
KEEERKKKGRRKEEE/EEEEEEEEEEEE 
EEREEGEEGEEEEEFYFLRAGKVKSPRA 


6060 


19961 

i 


A 


6112 


498 


191 


STDRVLLCQKARVQWCDPGSLKPPPPGF 
K*FLCLSLPKSWNYSHAPPCLANF\*FL 
VETGFCHVGKAGLELLASSDPPALASQS 
VEITGVSHHAWPMQEYFQI 


6061 


19962 

- 


A 


6113 

- 


1381 


172 

■ 


RNIDSKAILSKKNKAGGITLPSFKLYYK 
ATVTKTA* YWHK/NH/DVD* WNRIENLE 
MRPHTYNF\L I FDE PDKNKQWAMDS LFN 
KSCWDNWPAIWKRLKMDAFLMPYTKINS 
RWIK/DLNVKPKTVKILEDNPGSTIQDI 

KSCTAKETIKRVNRdtiTEWEKIVANYAS 

DMNRHFSKEDICGQQSY/VKK/CPASLI 
I REMQIETTTRYHLTPVQMAI I KMSENN 
RCW*GCX5EKNMLIHCWWKCK/LKCVQPL 
WKTVW /RTKTEIPFNPAISLLGIYPKE* 

KS/C/CYKDTCTRMFVAA\CPSVTDWIK 
KMWY IHTREYYTAI KRNKTD W AWWLM P V 
I LALCEAEVGGSLEVRSSRPAWPTW 

* 


OUDZ 




a 
** 




i 

* 


1Z>J 


HKXK" T SO VWWCMP WP / TI IjWWLR * ED^ 
LSPRKLRWi*SYI)CTTVLPAWATE 


6063 


19964 


A 


6115 


47 


368 


STPT PSAFLCLAF/ YRQRWLCWPDWGA 
V\IQirrYSLPT*LSII*KQD*PIiNIJjGT/ 
WSS RRVAACPENOI YRRNGVS /MIiPRLV 
LNSWTQVILLPWPPKV/LGLQA 


6064 


19965 


A 


6116 


1 


592 


RQRIFLLECGGAISAHCNLHLPGSSNSP 
ASVS * VAGTTGVRHHAQLI FVIIiVETBF 
HHVGODGLDLL /NLM IHPPRP P KVL* LO 
G 


6065 


19966 


A 


6117 


1 


306 


LWKTLWQFVLKLNILLPYNPAIVPIjGIY 
PNKMKI YVHTKTCTQICIGALFI IAKT* 
KOPTCS S IGEW / 1 KKL WY I OTMEYYST P 
ERNE PS SHENTWKNLKCI 


6066 


19967 


A 


6118 


2 


284 


QTGVHWHDQ\NLL* PETPG\SRDSPGSA 
SQVAGTTGMHQHARLNFFSFSFFLSFFF 
FFSLGKTLGLGLEFFPRWLQMSGPKKIL 
PT, YTjONP knkg 


6067 


19968 


A 


6119 


2 


315 


SRVAGITGVGHHARIilFVF/CFFFFFLB 
RDFLFVPRVGFQGPNLG*LKFPLPGLPL 
FSGLTLP KTGD * GAPLQPRVNFWIF* KK 
G\V* YCGPGWFE ILDLRGSPPF 


6068 


19969 


A 


6120 


42 


392 


LQWRNLCSLQPLPPVFKQFSCLRLLSS* 
DYR\HAPPRIjG* FCI FSRRI VFLHGWVR 
AGSRNS * PQDGSAPPQASQLAGITG\VS 
HHHTWAPHFLFFKNSGCSTQNR* QVHKV 
GDKETS 


6069 


19970 


A 


6121 


2 


202 


VhLSDQI^QPKCP *MIEWIKKMWHIDTME 
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X = Un known. *^=StoD cod on. A=DOSSible 
nucleotide deletion, \=possible 
nucleotide insertion 














YYAAI KKDKFMS F \ ETITLGEETQBWKT 
KHRLLSLIRG^ 


6070 


19971 


A 


6122 


355 


i 161 


FIXYLFFFFFFFFFFPFPLFFIiSFLFPF 
XXXXXXXXXLXVFTGMIS*LWVCLYVYM 
NVYM*IYMF 


6071 


19972 


A 


6123 


1 


133 


LC* PGWSR \FLTS * SACVGLP KCWDYRR 
EPLHLVFFFFFFFFTL 


6072 


19973 


A 


6124 


3 


212 


HPGLPS SWDYRCT* PCLANF/ SCCCC* R 
LGLTMLPRLASNSWTQVILLLWLPKLAQ 
NTSLINNFYIDQMLK 




1007/1 

iyy t*f 


<** 




J JO 




*GGWIT*G*EFNISLGNIVKHHLY 


6074 


19975 

i 


A 

■ 


6126 


345 


34 


HPSTIHRVFCSL*3*SSPQARNNGSCLS 
SEHFGITGVNHHTWLLFFKKNY/CILIK 
I ETGS *YVAQAGIiELLSSSNPP I SGSES 


6075 


19976 


A 


6127 


2 


328 


ARAYLPLCFF 1 1 FFI * V&ttWt Wtt* G 

LI * LNKRYLAHPK\ Y * LNVS YS VS FCKT 

HFFPRHKEKAI IQLGWAHACNPSTLGG 
(V3QW TT*QPRPETT T/TSNfEKPRtWY 


6076 


19977 


A 


6128 


1 


341 


GXRKFSDYRGKPPHPAKNRNIiLFSIFFY 
FETLSCSVLLCEVTSSQL/CNSFHFPDL 

FFVETGPDWS *TPG/LLAVLPPRSPRML 
ELQA ' 


6077 


19978 


A 


6129 


264 


2 


TAISENKVASRSEHRRHRML*LQVI/LI 

CNPSTMGGQGGWIS*GQQFQTSLSNMVK 
PPRA 


6078 


19979 


A 


6130 


1392 


1157 


LSLPKCWNYRREPPGPALCCPGWS*TPR 
IX.* S S PLGLPKCWDYR /R* TTAPGLFF T 
SKSSLRSPAYSCVTVCTLQLSHH 


6079 


19980 


A 


6131 


383 


108 


MRLCSQPHHHVDFSVIjETESCCVTQVGV 
QWHDLSSIiQSPPPRFKRFSSI»K*LGAIJL 
T iTjPSS WD YR (TP PHPAN /liCRF * HGPVT 

LYMFGEKVTP 


6080 


19981 


A 


6132 


307 


1 


LKTLPALCELESHSVSQWRNLSSLQPPP 
LRFKRLS YLSLPSS * DH6HP *AHQANF\ 
CIFSRDRETGFEHVGQENLIMLIVQQSS 
GLGFS I FATLLAVMQL V 


6081 


19982 


A 


6133 


351 


66 


RWSFTMLHRLLLKS*LQ/CDPPASASQS 
AGMTGMSHHTRLHNLF* TANNLTQSHTK 
FYSFISLTPHFAINVTOYIFLYYTSINI 

VnTFT.QFJCPVART 
vua* um* ivr xm%4 


6082 

. 


19983 

■ 


A 


6134 


2118 

- 


1392 

■ 


FFCKGK* IFIQCPDWGG/DLCV*QEKIQ 
KKTYHTGKDKYV* LF* KVKP ITKI *RQI 
HT SGNTDAQ LS YKE NHDQGH * QAKNFKF 

SLQPPPPGFKQF/SCLSLPSSWDYRRPP 
PRPANFC/IFE*RWGFTMLARMVSIS*P 
RNPPTSASQSAGI TGVSHCTRQEFKVKT 
GYHFHLLPSGCLFLFYVFYLYACIiLYVS 
VYYPLRQQESRKISKIEMFLTTWH 


6083 


19984 


A 


6135 


383 


47 


HLFLNFYYYLSHYNYLYYFLLHLIPSSP 
KIFFFFT*KIIFFLIKFNKVF*BJLSF*K 
KIF/IFSYTF+FWPP*NFF*KPPPFFFF 
FFFFFFFFFFFF*QTLFLGGCGYNTKLR 


6084 


19985 


A 


6136 




196 


TRPS ILVTALYYLYI FTTTQWG* \LTHH 
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•^Threonine, V=Valine, 
W=Trvntonhftn Y—Tvrosine. 

X=Un known, *=S(top codon, /-possible 
nucleotide deletion, V=possible 

nnoloc\4i(t a tnwrfinn 
11 uucvuuc iiidcrnwii 














INNIAPSFTRENTLMFIHLSPILLLSLN 
PDIITGVCS 


OUoj 


1 QOfiA 




ATV7 




All 


FLR WS SAS VTRLE * SGAI S AHCILCLLG 
SS YSPALAS* VAGTTGTCHHAQLI FVFL 
VETGFLHVGQDGI.DI1L/NLMICPPRPPK 
VLVLQA 


6086 


19987 


A 


6138 


377 


298 


QMKWKLLIiLFRYRVLLC/HPWSAVA*liQ 
T ,Tfi. A W\ QSfTLSIi /MI/TMI»VTiNS 
WPQVILLPWPPKVQEL*AIAAYTIKMWV 


6087 


19988 


A 


6139 


I 


199 


LYTTYFVTKTLIiLTSLFL* IRTG\YPRF 
RYEQLIHLL* KNFLPLTXiALLI *HVSIP 
ITISSIPPQT 


6088 


19989 


A 


6140 


381 

1 


28 


STPNRL I ERQEDP PES *NTNPPGLPRKT 
Pb r 1 ivN F KJJNliAK WtivJJrliU iriralraaifjrti 
KVPESPKRKKP*TKIRPLPSSPGDPTKP 
P\LKKKKKKKKSWPATQEAEMGGPLKA 
RSLRL 


6089 


19990 


A 


6141 


1 


209 


JjKJjIiASSDLSTJbGbQSAGNXVGMbnCTQ 
PKINMGILLKLVYRFNINPVKTPAGLF/ 
TKKKT*KTEIDKRILKFIWKCTEHRIAK 

YRFN I NP VKT PAGL FKKKKHE KQKLTS G 


6090 


19991 


A 


6142 


3 


282 


QG I FLTKR VKNF * RGKFKTL VKKNQGD P 

KKGKNPPGPKIGKNNPGKTPFWAKKI * K 
FNT.T \ PTCKTPPFFFOKLKICTGVKFFWAP 

KGPKGFLSKK 


6091 


19992 


A 


6143 


382 


158 


FFF*DSLAVSAPGVQ\WHDLCSL\RPLP 
PRFRRFWCLSLLSGWD\YRHVP\QYPAK 
ICIFSRDRVLPILAKASRNVSF 


6092 


19993 


A 


6144 


185 


316 


PCLANF* IFFCK/NGGFSMLPTLGLNSW 
AQG I LLPKPLKVLGLSA 


6093 


19994 


A 


6145 


489 


231 


SVA* AGVQWHDHS * LQPQ/ PLGLKQS3C 
LSLSNSWDYRHMAPHPTNFLKKFFVETG 
VLSCCPGWSSTPGLKGSFCLNLPKF 


6094 

• 


19995 

- 

• 


A 

- 


6146 


179 

. 


1011 

• 


GT I LNPQSLP FKPFFFS TL I FLKTVLLC 

UlrGW VoWooiabbV^KoijlrivCWJLiX *Hoi'rtt 

SAL* TFPWPFP*LVKDTS IHSTQLLKAE 
IFSASLDAAPS PTP YIRTHPSSGLQRWL 
LDAARSPTPYIRMHPASRLQRWCSTCTL 
LPSPSYHTPA*AMTTASSLISPFFPLPP 

rrinV kJVLlu'J X r WJ>Mr Vii/ LUf -Ql'Lfi j£tJ Vn 

QGWECSGAILGSIi\QPPPPGFK/RGFSC 
LSLI»SRW\DY\GHAPPCPG\NFCIFSRD 
G\VSLCWPGWSRTPDLWHPPRPPKA1jG 

LQA 






a 






5 


/ IPAKYFVDPDKLVLKFTWKGKRPRlAG 
TILKEKKVRRLILPNFKTHYKATVIKIG 
W*W*NNRHNHWNQIGSPEIAPHKYSQLI 
FDKEA 


6096 


19997 


A 


6148 


22 


228 


THRVALFVRT*1WIKQPKCPSTGEWLNTL 
C YIHTI E Y* YYSAI KRNKLS IHVCT\WI 
DLKGIVLNENNQI 


6097 


19998 


A 


6149 


406 


279 


RSIQGSSHSHASA/S*VAGITGARHHTQ 
VI IVFLVETGVMALTY 


6098 


19999 


A 


6150 


406 


10 


MRYHFTPDERLKSKGLKLEYW+GSGETR 
SLIHCWLEYKMSHPLWK* /TVWLFPIKL 
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OCysteine, D=Aspartic Acid, 

F!=f^lntflmip Ariri "F^Phpnvlnlanlnp. 

fl—flx/rlnt* U— UieitHina I — Ionian j»4 a a 

^j^vjiycine, ri— uisiioine, i—iso leucine, 
K=Lyslne, L=Leucine, M=Methionine, 

N=Acnflrfloinp PssPrnline. 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stopcodon,/=possJble 

miplontirto rlo lotion \— nncciHlA 

nucleotide insertion 














. NKNVP YGPAI SLLDIL * BK* KHMSTKNT 














YK*MLQALFIITKNWIiLSIKRWfrKQIM 
VYSNRNYYSTIKNELPIQTRI 


OUSrV 






OlJZ 

* 


Jo/ 


• 
i 


SCTPAWAPGS PPVSKKKKMQFHRRMGHV 
SSAWWPCGADVRNLRAGGYVRHRERFPG 

HLAGPRGGGFP 


6100 


20001 


A 


6153 


441 


38 


ILNPGEAITSEKYVQ* FNEMYQKAKRL/ 
QVNRKGPVLLS YNA* PQVAQPVSQKLNK 
LGHEVLSHRPYSLSLSPTDYHFFKCINI 
FLKEKRFYNQQBAENAFQEFTBS*SA/D 

TZCWTZ TKPFPT CTIMPtfVlJTkVKrD OVPH 
r\~X iKlNir JLoMvVl£x\.x VUxJWKoxrL* 


6101 


20002 


A 


6154 


308 


121 


GESFPAEKTFPKENDYRWPSFFQWSSVL 
/CKIHGLIjMIiPRLV*NSWPQAILLPWPP 
KVLGLQA 


6102 


20003 


A 


6155 


I 


420 


QNLQ PHWKTLRQF/ L YKVKHMMLS PAI P 
IiLDIT^GRCKPCPYK/ESIC/RMFIEAL 
FI IAKKLKLSKCGSAGE * INKV*DIYTT 
EYYSAIKSNKQiiTHll TWIKLKSITPSE 
* SQT * ETACFTMLLI WHSRKGN 1 1 VTEN 
RSMF 


6103 


20004 

- 




6156 


77 


415 . 


TZiGLKPTSSSHIiSijSKCnDxRQEPPCPT 
SEANFFFFFGGEFPFWSPS *LKQFFFVE 
MG\FTMLPRLVLNSWPQAHLILP/ASAS 
QSVGITGKSHRAPPQKLTFFFFLGGNFP 
FGPQAGCPGAESNYPEP *TPGLKAFFGL 

SPSCASRVIjNPQPR 


6104 


20005 


A 


6157 


436 


206 


LWYIHTMEYYSALKKNAILIHTI\STWM 
SLENMLNEIKQENHKRANI *FHIiYBIPR 


6105 


20006 


A 


6158 


416 


50 


RFFTHPPAGEFFSPPPPKNIFFPPPP*K 
FWGGGGKASPPQKKFPPLPPPQNGVLCF 
PPHKKK* FFPPPEKTVEPGPI*HVGQRPP 

ne\ PPT >f*SfPPT.Q PITT .VTrr*VPT PPDPVWP 

HFLPFFFFFIR 


6106 


20007 


A 


6159 


443 


145 


FF FFS I RFLFWWLYQG * EYLCFFMT * 














* IGVLLFLFEICEFFFFLLVGILFGMFL 

/ YFFATASPGLFFFCFFFI FFFFFFFFF 
pppppppp pp p pp pppppmir q 


6107 


20008 


A 

• 


6160 


322 


2 


TQDKNKRHRARHI YKRARKSARD *RSLV 
HR * K* KVQKRNE Y* P EGTLHTC I ALSQG 
NNI*DA/HSQE*WITPVI*TLWEAEAGA 


6108 


20009 


A 


6161 


421 

• 


57 


FCEMKFTPGNNAVNIV*PTTKDLEY/WI 
NI /DKTVAGCEKTDSNYERSSTMGKML\ 

QWQTaiffVTRTPPf3PTfQnTJWIWr»TQT.PQPir 

KIi*QPLQYSAT IALIRQQPTTSRQDPSP 
AKRLRLPEGSGDH 


6109 


20010 


A 


6162 


333 


41 


GAWACHLHVGAALNAPTPGLSRFHEEPG 
WSCRTAGGGSRLELS VFFVQMGS CHVGQ 
VGFFCI FCR/DRGIiAMFSRLVSNS *TQG 
ICPPWPPKVIiGLQA 


6110 


20011 


A 


6163 


2 


423 


KINNTKSQCFENI LKIEKTLSNLTKKE \ 
REDTNY*YWE*NMDITVESTDIKRIIRN 
TT\ETLHKFDNL 


dill 


20012 


A 


6164 


434 


5 


ICHIFFSISKLLNILRPVSKLFFSINFI 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=posslble 
nucleotide insertion 


* 






■ 


• 




FGPEIiYYL* IHFNP* *GKCLSFLFLQNI 
CIGFFFNSFMQI*SYFLRISLFIPVSPI 
LI/3 \ FVQ I FIVS INI LYLS I * LRLFFS F 
FLSLFI/FFFYYYTLS FRVHVHI VQVS YI 
RIHF 


6112. 


20013 


A 


6165 


1238 


1017 


ETGSHS VTQAGVQLHNSSS LQPAP PRLK 
*SSYLSLIiSSWDYRRTPPQPAN\FCIFG 
RDKVSPCCPGWSRTPEFR 


6113 


20014 


A 


6166 


504 


210 


FFFKTRSHDQGS LQ P * PJjGIiKGSS CLS R 
TLPHS WD / YRV/ PP PHPAKKYKNI FI FF. 
VETE FHHVAQAGLKVLDSSNS PVSASQS 
VGITSISHCAQPILCQF 


6114 


20015 

< 


A 


6167 


258 


65 


YKTKNTTSIFSLPSSF*TQILTF*NTYY 
YYY/CRDE/SLAMLPRLVSNSWPQVILL 
PWPPKWELQV 


6115 


20016 


A 


6168 


1 


156 


LARMVS IS *PHDPPAPASQSAGITGVSH 
CAQLFFFFFWRQPR FFLRAGGOGRDLGK 
VKPLLSGVRPFLS * TLPTEGRKSGPPTH 
FG*VLRE* FGGSH\ PVSSLW*QSAGITG 
VSHCAQLFFFFFWRQPRFFLRAGGQGR 


6116 


20017 


A 


6169 


3 


285 


hasahasgnilsyvgsfisltaai \li i 
imi*eafaskrnvliVeepsinle*lyg 
cpppyhtfeepvyiksrrirrslidqda 

ARKETQQGAAD 


6117 

■ 


20018 

• 


A 


6170 


645 

■ 


15 


LPFIYCSPCRNFNSLFWQERDSNNPLG/ 
ISLPHSDKITFNP\YYT\lKDALGLTSIi 
HSLLKDLHSFPHETS *GPQAYYSL/ QPT 
PLNNPSPHARPEWI FPYCAYHNFSGSGP 
L™W3GVL\ALLL\SiriHPKQ*FPHPPS 
YPNKQRH*YFGPLRPITFIDFLGRRPSF 
I SNRE SGGQPVS YP FTI IGQVAS VLYFT 
TIIilLIPTISLIENKIIiKWA 


6118 


20019 


A 


6171 


409 


25 


YTTPGGKIFFKSNPGSKLFFPPPF/PPF 
FSPFSP*KFFFSPKPLNFFGGFFPFFPP 
PKKRFFSKNPPLVFFSPPFKBKVFPFPP 
PLNFAPPRVFLKAPPPFFFFFFFFFKGG 
EVKWLSEALDCKS KDRG 


6119 


20020 


A 


6172 


73 


52 


VQLLKCRIHIF*DKVLLCHPGWSTVARS 
♦LTATSRSSCLSPPSSWDYRRVPPGIAS 
F/SF* YIALNN*A*AI ILPQPSKVLGL* 
AVAHAYSPSTLEG 


6120 

■ 


20021 


A 


6173 


325 


470 


I FTF FFUjAFI * FFCRNR/ S LALbP RLV 
LNSWT^AVLPPWPPKVTiGrtQT 


6121 


20022 


A 


6174 

■ 


2 

• 


228 

- 


TIiEDNST 1 PCKT I QS SSCTPWYVTQKAA 
CITLLSM* PKKLWYVYTKSCTQMFIVAI 
FI IAKNWKQPRCLSVSE* INKLWYI*1W 
E*YPVLKRNELSSHEKTW/MEM* IQLLI 
ERSQS * KAWCLHKKLHTDVYS SYIHNG 


6122 


20023 


A 


6175 


3 


408 


RYLTLAIRDMQI ITPVSYHFVPSGDYGQ 
KDNNKCW * RG*KMGSLMHC* W * CKMVQP 
LWKAAWQ FLK/DVKRRAI LFLG I YPREM 
KTYVHAKTCMHIFTLALFLYVHLNALIS 
VMDWMLGHPIEEISRDSYSCVCLL 


6123 


20024 


A 


6176 


2 


237 


WVWWLIPWIAAWEAEAKGLLEU5SSRL 
* \WIMPLH YSLGNRARLCIiKKKKKKKFK 
VFPPQKKKKKKKKKKKKK3GGAL 


4124 


20025 


A 


6177 


387 


154 


AWQWRDLSSLQAPPPGFE*FSCLSL\W 
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neottde 
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M 
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corresoond 
ing to first 
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residue of 
peptide . 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

eminence 


Amino acid sequence (A^Alanine 
OCysteine, D=Aspartic Acid, 
Er=Glutamic Acid. FMPhenvlfllanlne 
(XJIycine, H=Histidine, I=Isoleucine, 
KpLiVsine. Jj^ Leu cine. IVf = Methionine- 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine,~ 
T=Threonine. V=Valine. 
W=Tryptophan, Y=Tyrosine, 
X«=Un known, *=Stop codon, ^possible 
n ucleotide deletion, V=possible 
nurlentirie incprfinn 














DYRKKLLTSGDLPASASQSAG ITGVSHC 
AQPLSLFVFFLTNQRFVAALS 










i 
i 




FFFFEMECLTVSOAGVOWYNLHSLOPLP 
PGFKQFSC\LSLPSSWD*RVPTSRPAKF 
/CVI F*DGVSHCQPGWSAWQPPIfl 


6126 


20027 


A 


6179 


325 


50 


RKFPLTWRRVS F F S LLRS S TDWMPTPHR 

KNV* P NV\WHPMAKSS*HPKLTITVYPL 
STWHTYTSL 


6127 


20028 


A 


6180 


157 


338 


SQHFGRPRQEACSSSEVRDQPGQQREFP 
HLYKN*KEITQVWWHMPPBAQDY/LBPG 


6128 


20029 


A 


6181 


3 


273 


HEGQSKLETFWKGFTI*DVIKNIDDL/W 
RGVKIPILTGWEKLIPTLINNFEVFKT 
SLEBITEYWEIGRETiF.TiRVEPKDVTEL 
LQSHEKT 


6129 


20030 


A 


6182 


2 


359 


YYPAI PLLGIHPK* KNS IYQMHTCTHWF 

MHTHTHTHIHTMEYYVPIK/NEIISFAA 
TWMELGVI ILTB I IQAQKKTKG I FSHLY 
VGAKIPDYM 


6130 


20031 


•x 

A 


6183 


3 


211 


LLF\SFYGGQAIPLITPSVGTQIFRLKM 
/ PEVTHHFHSFP*EPLTAAALNQPSLPL 
SLLVDF/RDRGLFCHLGWSAVD* SHLTA 
ALNS \LVTQLS CPSFLSS WDYRRMLPCL 

nMp * T PP T.KTVIT .Tf <5 T .T T PTT.FHF QTI 


6131 


20032 


A 


6184 


707 


384 


LQI F FYNR VKCVKN KS TQNQ FY I FSR I Y 
MHANRHGKYE I / W I IKKMPMREG*AVQL 
RMKWKI KKNTHI KDG VWHGYNLSTIjG 
GQGGRIT*AQEFKTSLGNIARLHLYT 


01 JZ 




TV 


OlOJ 


Tin 
MX) 


in 
303 


WHHHSLLQP*TSGLKQSSHLSFQSSWDH 
RCAP PRVANFFLD FCRDRVCVAQAGLEL 
T JLQ <?NT> PTVASO Q JVf3T TRfi^TrRTWP PUTT) 

ILGIMYWGC 


6133 


20034 


A 


61 86 


439 


66 


KPKTLGS KKVNSNPRVPPGK/ PWGVGPI* 

GPGP*MGGSKP/S*NPSPPPPLFKTRFG 
FFPPG* ROCJHYSflFIinPPPPOGICTFFPP 

S PPKKWGHKKKI PFQKKKKKERKKRNI I 
GSWTTSETVGNPAPW 


6134 


20035 


A 


6187 


83 


287 


NKMFKI FRFRF/CGFLGQSHSVTQGGVQ 
WHDHSSLQP*TPGLKRSSLPTCWDNRHK 
PPHLVIiFCFYYRIi 


6135 


20036 


A 

► 


6188 


290 


609 


RRMDDPPFFKYFFERRVLVCHQGCS*V\ 
K* SSHFSFPCSWDHRCV/PMPHPAN\FN 
FFHTDKSLTMLPRLVSNSWAQVI LWPLP 
P * VLGFAGMSPCPOP PSOFCIEAV 


6136 


20037 


A 


6189 


2 


371 


RYIFEKVFAKYSELTNDSIiS ILSGKSGI 
LW KKRLSLFS *Q * S YKWFSIX3AI YCGMH 
* KRFLGTS IIj/ FTE* VETGFHHVDQGGL 
ELLTSCETLASAY*SAGITGISHCAESQ 
STLKKYI PGSTI 


6137 


20038 


A 


6190 


531 


205 


RWRFALVAQAGVQ W/ HDLGSLQPLPP * F 
K* FICLRLQRSWDYRCAPLRPA\NFFVF 
LVETGFHHFGQT\N1jQLLTSGDPPASAS 
QS AG I TGVSH* ARP WF 1 1 FTALLYK 


6138 


20039 


A 


6191 


52 I 375 


i^rcetsflffffekkfwfcppgggeWrn 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine } H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GJuiamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W»Tryptophan,Y=Tyrosme, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














LK* MEP \LPPGLKE FSCLTLRRGGN* GP 
FPPLQENFGVFRKKGVPHCNPG* FQTPD . 
PRELARLTS PRGGDGGGRPPPRVK 


6139 


20040 


A 


6192 


3 


158 


FFFIiLLNCGLKKETG/WLGWAHACNPS 
TLGGRGGWITR*ELKTSLA1IMAEPY 

mm 4VV w WW mm, mm, 4> mmt mml i\A mm* «M M»* » ** * 1 1 1 mm mm 


6140 


20041 


A 


6193 


2 


54 


FFFETESHSVAQAGVQWCDLGSLQPPPL 
GLKE F S CLSLLS S WD YRHTP PHPANL FV 
F/S FLVETGFRHVGQAGLELLTSGGP PA 
SASQSAGITGVNHRAWPBVFKK*RVSFC 
RPGWSAW 


6141 


20042 


A 


6194 

! 


1 


323 


FILPSSWDCMCAPP/RPDWVLYFLVETG 
FHRVGRAGLQIiLTASDLPASAS *GAGI A 
DGVSLTQCSMLPMLECRGVILARYNLHL 
PAACLGL PKCRD CS LCPAATPSRK 


oi 42 


20043 






oly 


OA 3 

24J 


K* S S YLSLLSNWD YRCA/ P PRIiANFLFF 
VQAGS CHLAQAGLELWS S CHLPALVSQN 
AGITGVSHHARLAVAWKRTPLYGLPSRI 
SNT.CVLT1KFLFFYI1Y 


6143 


20044 


A 


6196 


385 


19 


GGHFGAQE I S P PGVQLF FRLHFLSNWGF 
KRPPWPPG*FF/VFL*KRGFPPFGRWP 
PSIiFP/SDLPPPAPP'KGWVSGIiRPCPRT 
FFFFFFFDKEC/RIjSCPANFCIFSRDGV 
SPCWPPWSRTPDLQ 


6144 


20045 


A 


6197 


498 

• 


437 


ENXtQS S INtiVDKAMAWFER I DSNFDRSF 
TTVKMPSNLTS I INACCGE I FHERKS * S 
T WOTSLLS YF* KLLOLAPPLLATTILIS 

Am Tt\^ Am mj Utft t * *7 IVU^jyUnit & 1 »i mmm ^ Ami mm, mJ 

KEP*TSR/QDRLUiAEG*D 


6145 


20046 


A 


6198 


79 


394 


SVKLLALFFFFFGENFCFAPQVELKGPN 
U3*LNPPPPGLKNFSGLT/LPKKWELGG 
HPPPPSKFFFLFF* KKGMAPFSQGWFKT 
LGLNQFACLTLPKGGNLRREGP 


6146 


20047 




6199 


2 


313 


SASHSAGIIGESHCAWPIMPVYSLVLNF 
FFFLEKKPPFVTQLEGQGGNLG/SMBPS 
PSGLKGIFCLGPPKKWELRAPPPCPGYF 
WFF* KKGGLP IRPGRFWKSVP 


6147 


20048 


A 

- • 


6200 

• 


62 


732 


AFQQARKGTQ \GRLS S PAAGAIjP \PVLA 
PSGPPLPA*\GSSS\PASVAPYPPPQAT 
G/SPPTPPGELRRGGWSKGGLDHPRNTF 
YCWBSSEP/PPN*GQS*APGGVNSEGRG 
O/LPPS S I ARGVP * TPSHVPPHPLAFP P 
SLPHPGPAS RAPP / TGSSDLS ASAVSAP 
GREGHP * GTRMVPQGAGRQKPGHRGAS * 
GSSCSKRTLSPLSPRQSGFQSRCCFHPP 
RQNHSRC 


6148 


20049 


A 


6201 


424 

- 


7 


KNMKRS PIS CVTRDPQI KI TMR*HYTGI 
KMANV/RLTGNTKC* *GYGAKGPLIQC* 
WECKI V* PLRKTAWOFSGS /LKI V* /PY 
DP AIALPGI QLC * KCTS \ KKNCM * I F IT 
ALFIMAKKKQKPRYPSIGEWKYKLLHL 


6149 


20050 




6202 


371 


144 


RWALPRRTNF\* FFVETRSLHVAQGGVK 
LLGSVSPPISPSPSAVITGVSYCAQPYG 
HP FELS FSL IEYBYI YLDFF 


6150 


20051 


A 


6203 


409 


110 


LRRSS*LCHLGWSAMVQSWPT*FKP*IiL 
GLSDPLTSASRVDYRGMPPCPRNFI IFD 
M*FI*CVCVC/RTWGLIVLPRLVSNFWP 
EVLLHPGPSKVLBLQV 


6151 


20052 


A 


6204 


15 


221 


KASRKMQIETTLRYYLSPMRLAKMQMLI 
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26 
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n ii c fori tiff a 

II U vlw v UUC 

location 

rt*Acn n n A 
ing to first 
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Predict- 
ed end 

otide 

InroHon 

IUUUVU 

correspon 
ding to 
last amino 
acid 

ICSIUUC UI 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cystcine, D^Aspartic Acid, 

V*sf*lllrflinir > Arid P=Phpnvlnlnninp 
*-> vfi u iti in iv, miiif i x ii vn j mimiJiiCj 

G=Glycine, H=Histidine, I=lso leucine, 
.vsinf I .=1 .purine M=lVfethtfwiin#> 

UJ o 1 1 J v y U - JL/C U t-l 11 C, IwM lTlCUIIUIIIIICf 

N=Asparagine, P=Proline, 
Q=G)utamine, R-Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

Y— T In Lnnu/n C*r»r\ <*n<1nn AsnncelKlft 
/\— 'UUMIUWII, *— OIUJJ LUUUU, /*— pU9dlUlc 

nucleotide deletion, V=possible 
nucleotide insertion 




• 










. NTS CW / RKSQETRTL IC * W * EGKMVQPL 
S KGI WP YLTOARYP 


6152 


20053 


A 


6205 

> 


23 


371 

* 


KVACTCMFITVLFTIVKL*NQSRCPSTD 
DWIKKM/WX^CIYSSVCVCLYVCIDR* 
IKRHDI YVYIMEYYXALK* /DEIMFSAA 
TRVKLDAII*S*ITQNVKCCIVSTHSGN 

T.TMVQT 

UlrJvDi 


6153 


20054 


A 


6206 

• 


113 


417 

■ 


j SAVTLLVI YLNDLKTYVHTKTCTQMFTA 

S YQAMKRS YARNLWS SKKLWRNIiKCLLL 
p j?p q ri q i? ire & vfVT ptt 

XvIUvO fit NSH X V» V X c X X 


6154 


20055 . 


A 


6207 


510 


167 


GLGQAISPSLPKSQGIGAYGFGFFFFFK 

R*DLVCCPG\SLNLVASSQSSCLSLPKH 

WDHECEPPCLAFFFFLL*RQGLAMliPRI 

LPLLKALASGGGMAKKKCIRYFSMKIHL 
nan 


6155 


20056 


A 


6208 


418 


40 


S YTHTHTHTHTHTHTHI P* * LTGS/RLQ 
GICRKGSRKHPRGLSQLLP 


6156 


20057 


A" " 


6209 


1 


258 


SGFLLS * VPSVRDPTGNRTV* LTWQPLP 
APLEF\CPRLSD 


6157 


20058 


A 


6210 


229 


310 


DFSEAQKSRQCGGGISNVNHPGWLKPV 
IQH r * DG VXiIiCCrGWS AW * SQltff * LFR 
LK*SSQLSLPR/SWDYRHVLQ\HPANCF 
VFFVBMSLAMLPRIiVSNSWALAILLPQP 
PKVLDD WLEHHTWM I YTTYATTTLP 


6158 


20059 


A 


6211 

• 


496 

• 


1 

• 


CCQES FE I LGS I NS FGPGLKKGGVKNMR 

VSQRGS QKDRVP P KWF FFFFS F* DRVS L 
WPPGPSARKPSWLPVASTSR\V*QSSCP 
PLWFPGPPTMVSYFFFF\KRQDLPMLPR 


6159 


20060 


A 


6212 


3 


340 


VTAGIIIIFMI*EAFASNRKVLIV*EPS 

I * SLQV/ * WRGLLLLGSPPPYHTLEEPG 

YVTS KRKRKESNP P KLASSQPHGLHDFF * 

KKKKRGGPFKRILEGAPFGGGWGHREFF 
p 


6160 


20061 


A 


6213 


60 


493 


KSQQNF FC FETE SRS VTQAGI QWHHFS S 
LQPPPPGFKVISYL/SLKSNFSGYP*DF 
FCNGPPCS *GPIRFTYYICVIPSSWDYR 
RLPPHPANFCVF \ I KTVSPGWSQTPDLR 


olol 


20062 




6214 


414 


1 


T .rWnT) PTJT ,T .17 C PT 4 PT »T Pf*T iPP * VT. 
Xiv^iiNoiNJrwXiXJ vo^XiirlJuac>XiXtrwliJr£\> w AXi 

GPPRVL* \KGGPFL*SSFAPFF*EMBSH 
SVT\RLECSRAILVHCWLCLPGPSDPPP 
SAS * VAGR IARAQKVEAAVSKDUDAALQ 
PEQHSBTPSQQTNKNNQTKLTQKYPF 


olo2 


20063 


TV 


6215 


2 


4UJ 


D KREWGE FG \ *LKP WLPG * NNFSGQTIiR 
RSGD* RKTPHGGVNLVF * EKTGFSNLGR 
GGLKPPPSGNTPAWPPKGVGITGGTPPP 
PPVFCFLGFCENTNLRAKKKFI 


6163 


20064 


A 


6216 


2 


388 


KMEQTLWKTDWEFLTKLNILYPYSPAFA 

dlgiypnelxt/c/ CYTNTCI*KSIAPL 

FIVARTWKQPGCPLRDGWLNKIIGCIQAL 

E YYS P * KRNEL*NHNMAYKKLKL I FU«R 
QEDRFSLGVQEQPKHNGEM 


6164 


20065 


A 


6217 


363 


302 


NKEFKTTXANMGEPPFL* KRGFS \MLAR 
WLNSLFHDS PPPASQKAGIPGVSPPPR 
PLFFFFFLLETVSHYVAQAGQLISNYDL 
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nude- 
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ding to 
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peptide 
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Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E^GIutamic Acid. F=Phenvlalnnine- 
G-Glycine, H^Histidine, I=lsoleucine, 
K = Lvsine. L=Leucine. M=Methioninp 
N=Asparagine, P=Proline, 
Q=G1utamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W«Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 

nuclpntirif* incprttnn 

UUWICUUUv IIIACI Uvll 














KLPSCLSLLDTWDYRH*PPYPVFKLLNS 
TYKAC 


6165 


20066 


A 


6218 


576 


6 


RAVLRGLLVLLLTKHTFFLFPPRQSLAL 
LPRVGVQW\HDL\6SL\QPI*\PPG\FKQ 
FS\CLS IPSSWDYRYPPPRPANF\*FS V 

GITDLGQSETIHGGSGLDKPPDHKVDKG 
P TEE RACLLS YS AGRKYKEYH I LPEQGS 
EPPHHGDILS IS FQAASHIAQTPPAHT 


6166 


20067 

* 


A 


6219 

i 


44 


496 


GFRVDPRVRPRVRLNS *PRDP PTSGSAG 
ITGV/SSHHARP*NIFT*KLGISFLVFW 

/WPKFGLIEPSPSGVKEFFFFNPPEGWD 
\YGLAPPPGVIFWVLIKNGVSFFCRAWV 
QTSGLSWINRLNLP 


6167 


20068 


A 


6220 


1 


421 


QRGRLTAI S RETS FTLVAQAGVQWHDLG 
SLQHLPPGFQGFPCIiSLLST\TGNCSVT 

l^7±(~T\T/'\Xtit''l3T J~!OT . + 13 T DDPT.VDCCUT r?T5 C? 

UAb VU" wllAsol-t" JrJjFtrljjjJ\iro orlJ ofo 

SSWDYRL* KPHQGSD* VCVT\GAASPEP 
NTMTAWNKP PDI RMKE TLLLP PS S SEKM 
L 


6168 


20069 

♦ 


A 


6221 


3 


36 


AYCIVKPS P * PLTGALSALLMTSGLAM* 
FDFHS I TLL I LGLLANTLT I YQ * WRDVT 
RESTYQGHHTPPVQKGLRYGI MLFI TS E 
AFFLAGFF*AFYHSRLGPTPQliGGDWPP 
TGITPIOTI**VPLLN\TNE *IA*GVSIT 
*AHDP 


6169 

■ 


20070 

• 


A 


6222 

• 


895 

+ 

• 


13 


ANSWANN* FELATQI AFSGRI PLTDSHL 

MLYTASHS FSNPLTK\HIAYPFPVLSLI 
EA*FITSSHLPSDKQDLiKS\LMAYSS\I 
SPHSPS*LTPILIQTP\W\SFNGAGQFS 
I IAHGTYHAYI TYSA*ANFKI TERTSQC 

\SLTMGLTPPLINPTGENCSVASNQRFS 
/WTQISLLLPYRDSTILGSQALYSLYIF 
TTTQWGSLTHHINNI KPSFTRENTLMFI 
HliSPIIiIiIiSIjNPDTTTGF^S 


6170 

i 


20071 


A 


6223 


68 


405 


HS YALVYNP VGIRVKVHLS KC I L YTWH I 

IS*GISQLGAVAHTCNPSTLGG*GRW/I 
QQLKTSLANMAKPHIoLLKKKKNFPNFPK 
RG 


6171 


20072 


A 


6224 


591 


283 


GTSTQAGVQWCNCGSLQPPPPGFKRFSC 
LNliPCS WDYRR VP PHIiAWF\ * FIiVEMfSF 
HHVN/QAGLELLT S SDP PALASQS AGIT 
GMSHCARPRVAF * GEKNHRSL 


6172 


20073 


A 


6225 


302 


173 


GFFSFFLKC*DR/SLTMLLRLVLNSWAQ 
VILPLWPPKVLGLQA 


6173 


20074 


A 


6226 


417 


81 


TQTRGPPPPPFLGGPPSGPERTVPPPRG 
GVPQTGSRFFLFRQISrPGPALFPKBr/EGG 
F*KKFGPPLPPF*KTPFFFTNRGNPCPF 
FVGENPQAPPVFGGFNPFPRKKKKRNDF 1 
LS 


6174 


20075 


A 


6227 


41 


474 


TDATETFHFTSKH\HFGF*AAA*YWHFV 
DW*LFLYVSXY**GSC7PHPA 


6175 


20076 


A 


6228 


503 


28 


AAHAPPRVRKVIGH*AFFTQRCTNSRH* 
IIKFPKDKDKARILKGAREK/RTTYNGA 
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correspond 
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peptide 

sequence 


Amino acid sequence (A»Alanine . 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid. ^Phenylalanine. 
G«Glycine, H a Histidine> Msoleudne, 
KHLysine, L=Leucine, M=Meihtonlne, 
N=Asparagine, P=Proline, 
0=Glutamine. R— Arffinine. S = Serine. 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known. * ca Ston codon. /^possible 
nucleotide deletion, \= possible 
nucleotide insertion 




• 










PIWIiAVNFSVETLQ/A/RREWHDMFQMP 
KEKT FYLR I L YPMKI S FKHERE I KTPPD 
KQKPRDFISSRPVLQEMLKSWPGARVHT 
GNPTLLGG 


6176 


20077 

* 


A 


6229 


418 


175 


VSPLSPLKILFFPKGLKFWREGGPKCPP 
PKKRVLPKK/PPAMF* IPPPKRKKSKLP 
PPRKIWPPQGIFKSPPPQFYFFFFFF 


6177 


20078 


A 


6230 


45 


308 


ACVRLKH*TDN*QPNIYNQP/HKSLLP* 
KKKKKKKKKKKKKKKKKKKKKKKKGGGG 
FKKKPGGGPISWGGEKII FFFFGOQKKN 
TLGiF 


6178 


20079 


A 


6231 


440 


209 


IC2HFRGPSRRGLLGPEARASPGQHRBSR 
S FLKMQP\NKKFS * VWWCTPAI PASPKV 
EARESLEPGRRRLHLTEALQPG 




ZUUoU 


a 


Alio 

■ 






FSPP\PPKKFFFSPKA*NFWGGGGQKFP 
PPKKGVFSKKPKGVFKTPPKKKKKFFFP 
PGENWGPPRVF*KGPPPFFFFFFFFFFF 
FLS FNVN I LFKP VQAPCLT 


OloU 


■ 








t n 


flQaP TTY2VQHPTR PMT.PTQTPTJ32VmPNT 

AGQ KI S KD IADLNNNEQDHRD I YRN S TQ 
♦QQNTHSFQAAIRNKIDPILGHKKMLTH 
/ CKQSDQTWNKLEISNKYAKPQKIKHTI 


6181 


20082 


A 


6234 


2 


351 


MDQGI I / S DFE S YYLRNTFCQAI VARDG 
DSSDGSGQRKLKTFWKGFSLIiDAIKHTE 
D SW/EE I K IS TGVWKLI P I LVDDFEGFK 
TSVEE / VADVMDLSRELEL* VEPEDGTE 

mm mm* ▼ mm* mm* / V mT%mm+ W l^i/UVAUrfUUM Tii* UUW*U 

LLQSHSQT 


6182 


20083 


A 


6235 


3 


252 

• 


FHTYNS LLVCHSNLCYQNS LLFF I IH * / 
CI/HFHS*AKVS IWKHTYTHTHTHTHTH 
MHTHFFOATDRKI CVNVYMFSVAYIHKG 

LQ 


6183 


20084 


A- 


6236 


3 


294 


IITF*LPQLiSGYIEKSTPYECGFDPISP 
ARVP FS I KFFLVAI T FLLLDLE IA\LI»L 
PLP*ALOTTNLPIiIVMASLL* 1 1 ILALS 
LAYEGLQ KGLD* AE 


6184 

* 


20085 


A 


6237 


381 


121 


P VPAS F * FFFFYRDR \ VSLCYPG WSQTP 
GLKQSSCLGIiLSHWDYKYDPPHPAFISF 
IiLSSSLLWVL*RPWLIiCATVPAFLDIR 
EQ 


6185 


20086 


A 


6238 


395 


62 


PFFPIPKFFPWGPNFFF/PLGPPPFLGF 
FPGSFAPPSHFSPPD*NFPRPPKNFPPG 
GDLFPQKKGDWFFPKKRGVFFFLGGFSK 
FSGGPPFSLGPQKKKRFKIKGPKKFCF 


6186 


20087 


A 

* 


6239 


1066 


761 


S FFLRV7S F ILVTQAGVQWCDLS SLQLP P 
PGFK*FSCIiSLPSSWDHRRPPPRLANFC 
I F/M * RWG FSMLARPVSNS * P / CSDPPT 
PASQSAGI TGLSHRAWRLVFL 


6187 


20088 


A 


6240 


390 


27 


GELPTPIYLAPPASPKGGVSGGSPPSRV 
KFFPLEVPPPPPPGC* PGPFFYI WPQGG 
ARVP*QQIRFFSLRW\PPRPTRPKNPHL 
QVGPLRICLSPETGGGVIKVFRGSLLKK 
KKRAAASRSN 


6188 


20089 


A 


6241 


366 


1 


KIKNFDRPI/TSKGA*SW/RNLPTKKS 
PEPNGFPGVFCPTFKKKLTPVF/ sklfp 
KF*KEGILPNSFF*SSLALIPNPFYPSK 
KKKKTKKTTYRAISLMKIDEKIHYKY*QT 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 
eth 

Sd 

■ 


SEQ ID 
NO: in 
USSN 
09/5154 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first ' 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H n Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X»Unlcnown, *"Stop codon, /^possible 
nucleotide deletion, V=po&sib!e 
nucleotide insertion 














E FRS TLKVLYT I TK 


6189 


20090 


A 


6242 


23 


439 


IASGRPFFFFFFFFFCLPPPPGGNPSFS 
FLKKKKKGGGGGENPPNPPGKAPNPQGV 
GVPPFPPPPVFQKNPPPPFFF*KNPPPS 
PKGPPGGPPFFFPPGGGGGPPPT/PYFIj 
KKKGG WGGPP F F * KTNPPPKGEITPIiVG 


6190 


20091 


A 


6243 


20 


261 


WAQE I LKKFS CLS LLS S WNYRCAPPHVA 

n\fvffvetgsyhvaqsglblldsrdpm 
tgeshhaqphfc* ilytgesqril 


6191 

i 


20092 


A 


6244 


376 


76 


LLPRIiECSGVIKPFSHLSLPSSRDYRCV 
/PTMPNF*KFFVELKFNSL*R*GLTMLP 
RLVSNS WCQVI PLPWPPKVLOGCATTPG 
LTDFITTFKGYRHGDTY 


6192 


20093 


A 


6245 


222 


233 


KSFYFALIFEKNFCYVKKSQVGLVFFSK 
VKVLVRPFFPMVI KKNQP/ FNPL/RFFF 
FFFODRVLLCHPGWSAW* s ktls 


6193 


20094 

■ 


A 


6246 


2 


358 


ei iltfqsyy*entfneaitaidsdssd 
gsgqskwksfwkrftilgaiknicdl*d 
*v/kistlsgvwkkvipslmddfegf\k 
tsveeviadwe iar* lblevepedvi e 

LLOSHDKT 


6194 


20095 


A 


6247 


3 


322 


PDHSLLQTPTPGFRRSSHLGFSSSWDYR 
LE P PHS VCVCVCVV FCVYI YFFFFLEKT 

gfgfvpqaglnlgvkri fka* ppggvei 
pglthtpgpwggpikgkganpap 


6195 

• 


20096 


A 


6248 


1425 


1280 


sldfqglalpgwsaw* sclteastsr\ 
vkrtshpslpntwvyrhalprlansvyf 
l*rwgltmlprlgsns* tqailqpwppk 

VLAL* ADEEMKAKGHISTKWGGQTLNPQ 

w Ml Ml «lAr WW* ■* X^ *f£ «■> Wail 

PDCGATSVYSLP IRKFQ | 


6196 


20097 


A 


6249 

• 


189 


336 


HAPFPVSVISLSCQKRRR\WLAMVAHTS 
NPSTLGG*WGWIA*GEEFETSQ 


6197 


20098 


A 


6250 


2 


300 


FLPSFMGFWPVLGHGRASTVTPSEPLSL 
PFLLPAEPFLFFFFSLFRKRESIFGA/P 
GGLEGGQFNFLEPSPPGIKEIFWPNPPR 
GWEItKIWPPPPINFCDF*EKTGFTMVAR 
AGIiKL*TGGLEGGQFNFLEPSPPGIKEI 
FWPNPPRGWELKIWPPPPINFCDF 


6198 


20099 


A 


6251 


1 


198 


LTGRL/S KCE WSWRFDVQLKDLKRWQN 
HLLLS QQLAC I VLTSAG I MDHKEMR * KH 
TGGKSLGFFF 


6199 


20100 


A 


6252 


11 


214 


S LRLKS S SHLS LP S S WDLRCELPHLAN * 
KKKKKL/CKN/RSLTGLPKLVSNSWV*T 
ILPPRPPRALGLRA 


6200 


20101 


X 


6253 


402 


3 


KKGFVLS PQVEDKDAISGS FNPGPPGBK 
iPPPQPPEEVGPPGGPPQTRGNIiGI/LG 
N*GGGGGF*PPGAGKSPAPGPQKG\PFG 
KDKRE KP YGPPPKLF FF FLRQSIjCFAQA 
GVQWCDLGPLQ PPP PRFKQFCL 


6201 


20102 

- 


A 


6254 


362 


78 


VGVN/MTFFPQHFLGLSGI PRRYSDYPD 
AYTT*NTVSSIGSFISLTAVLIMIFII* 
EAFAS KREVIS VS YASTNIiB * LHGCPPP 
YHTFEEPTYVKVK 


6202 


20103 


k 


6255 


357 


2 


DLSCHSHGEATQKHSEKCKASCTHMFIT 
ALFLI AKHWNQPTC / PHTMBYNSVIK/N 
EQTIDTTTTWMDLKCIMRHBQRQS*KNS 
VLVCFHAADKDI PETGQFTKERGLMDLQ 
FHVAGEASH 



926 



WO 01/64835 



PCT/US01/04927 



SEQID > 
NO: of 
nucleotide 
sequence 1 


SEQID 
NO: of 
peptide 
sequence 


M 

etti 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to— 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleudne, 
K^LysIne, L=Leuclne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Un known, *=Stop codon, /"possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6203 


20104 


A 


6256 


2 


305 


Q WCNHS SLKSNPGERETVSSLKLPKAWN 
PRHMLLCPTTS F I LRWGFI YVAQAGFDT 
RGHASL/LA* AS PSVGITSKSQRVRPVH 
DIACFHRESTMFSAENFN 


6204 


20105 


A 


6257 


335 


3 


GPFQSPPPGVTP I SCFTPPKKGGNRGP P 
PRRGKFWI FKKNGGSPG * KRR VLFPDPG 
VPPPRP LKKGGLKGGAWAKI FFFFFLRQ 
SHSFTQAGMQWCNLGSLQPPPPGFK* F 


6205 


20106 


A 


6258 


219 


2 


PTLSSQKDTKAQRLYSFPLTTTKDERLV 
TLVS/WPGWAYTFIPSTIiGGQGRWVT* 
AQKFETSLENTVKPPSC 


6206 


20107 


A 


6259 


306 


117 


SFELLGSSDPLISALSSWDYRCAjPfeRLk 

h ^mm} mmmw^mmrmmr * ■ * m^ A* m>i3 ^mw w*m*m\ m^^mmfm^ V • * — ^ WAVMT> 

NF* FFVCVEMGSSYIVQ/SWPQ*ILLSW 
PPKVLRLQT 


6207 


20108 


A 


6260 


572 


269 


ETDILSVAQARVGGLHLGSLPAPPPGFT 
PFSCLSCSS *DYRRIiPPRPA/NFFVFSV 

EKGF\TVLARIVSIS*PRDPPPSGLP*G 
ALGLPG VS FLR PALCLLLK 


6208 


20109 


A 


6261 


357 


116 


FAQAGLELMGSSSPIASASQSVGITGMS 
HCA\RLLLFFCGGRW*GG/LK* FSCLSL 
LS S WNYRHE P PHPANLC I FS FNTIS Y 

mmmm^m* mm* W * m»m m± mm^mV A-mmm} mm ^ m\ mtm* m mmm ^m* mm ■!» A 4« mm mm 


6209 


20110 . 


A 


6262 


3 


337 


HERHERHERHEQSAEITGVSHHAGGGIG 
IiLNFFLFLFFEAGSCTVT*T/GV*WHDH 
GPQ*PQIPRFK*SSHLSLHSSWDYRHVP 
P* IiGLQACATMPG * FF I FCRDP I SEINK 


6210 


20111 


A 


6263 


515 


150 


FLRWS FTLVAQAG I QWCGVSSLQPLPPG 
FKQFFCLSLPSSWDYRCPPPCPTNF\*L 
LVEMGFHHVGQAGLELLTSGDLPASASQ 
SAGITGISHRARPGTLFF*AANGGGKOV 
FLLLRVLNGL 


6211 

■ 


20112 

■ 


A 


6264 


1 


296 


GTS FLPS FLP / LLS FLKFLNFFQI VSHC 
RPGWSAVA* SQLTAPS I \FGLKPSSQLS 
RLSS WDYRCP INFF\SLPMLPRLVSNS * 
PQAILPTQPPKFLGLQA 


6212 


20113 


A 


6266 


298 


182 


IiHFFEVSKIIKLIRAEAKWCL*VLFTIA 
KIWKLH* * ICKYLS IDEWIKKMWYIHTV 
EYYSAIGNKBI/PVIVTIWIYMENIMLS 
EIN* *QKDKCMIALL*GIYIQ 


6213 


20114 


A 


6267 


402 


234 


PPPPLFKNPPPPPFKNPPPKKKKIIFP/ 
SPPVNLAPPKEF*KGPPPFFFFFFFFFF 
FF 


6214 


20115 

• 


A 


6268 


125 


17 

* * 


KGQAQWLTPVI PALREA / EEAGG * LEP R "* 
SSRPAWAI 


6215 * 


20116 

• 


A 


6269 


1109 


454 

• 


GDSAGAEGTMENFTALFGAQADPPPPPT 
ALGFGPGKPP P PPP P PAGGGPGTAPPPT 
AAXAPPGADKSGAGCGPFYXiMRELPGST 
ELTGSTNIj ITHYNLEQAYNKFCGKKVKB 
KLSNFLPDLPGMIDLPGSHDNSSIiRSLI 
EKPPILSSSFNP ITGTML/VRLPPPHWP 
VAGAVSSDAYSASQEEE*AOAQTEPYPG 
S CPPRNTI * FRSQEEEKEKRRGS 


6216 


20117 


A 


6270 


111 


86 


WGI TGVSHCAWSRNGTLTHCWWECEKV 
/QPIiRKTV*QFIiMLKI \PYQMI * PLLGI 
YLRKNKAYIHMKTLK*MFSLFIiFEIAK/ 
TWKQPKYPSASEWIN*LDSFSLYWLGWS 
RTPDLK 


"S217 • 


20118 


A 


6271 


2 


222 


RSCESNSQTSACYKSFYLYPLKS/GGFK 
ILERM*PIYFLIPYWTIFNQRYYFYIW \ 
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SEQ ID 
NO: of 
nucleotide 
seouence 

• 


SJEQI0 
NO: of 
Dentide 
sequence 

< 


M 
od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

• * mat wv w v w M 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
i nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=>Aspartic Acid, 
E^Glutamic Acid. {^Phenylalanine. 
G=Glvcine« H=Histidine. I—Isoleucine. 
K-Lyslne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, - 
T^Threonine. V^Valine.' 
W=Tryptophan,Y=Tyrosine, 
X=Un known, *=*S>top codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 

■ P U WIWUUW s nun 














NLKKTRGIYTVLYKMYKSF 


6218 


20119 

• 


A 


6272 


787 


470 


LPFFFSFFETGSHCWRWERBSLGVAQVG 
VQ*CDLRSLQPPPPRLKQSSHLSIiPSSW 
DYRCAPP*LDKFFFCIFCRDGFHHVAQA 
GLKLLS SG\NIiPASVS PSAGTTL 














FCM2VI AAT DSS <5 QHGS PES \nXTija\ntW 

SILDAIKNTCDSWEEV/KVST*TEVWKK 

LI ptltddfqrfktsveevta\ ellwe . 

I TRB IELEVEIjDDVTEIJjOPHDOT 


6220 


20121 


A 


6274 

i 


175 


406 


KNS LGGPNLTGEGKKITFSL * NFFFSLE 
ALIFVGRLSSFVPPPKIRFFSKNSRNIiF 
/CPVGEFFKTRPPVNFFFSLP*RKSYFF 
SFPG* IWPP*GIFLKAPPFFFFFFFLRE 

lUvD V OnUtmtsXJrtaStO rf vy V X Xr JF lr W «r Jfc* IV V XI 

VLLGIVDVQ 


6221 


20122 


A 


6275 


2 


181 


PRVRGRVGHRKGKKS F VHCWWQWKLL * P 
LWKTVWRVIiKTLK\ IELPYYP/ATKLL* 


6222 


20123 


A 


6276 


397 


47 


PPPSKNFFFPPQKDFFLGGLPKKSPPPK 
KXFFFrOCTPKKNFPPPSFFKKPPPPAFI 

tf^PT.VTnf VfTPPPPPHTf WnPP"DITI?T?JfVPP 
IV. J. JrJuIVJftJVIVJVr K Jtr K ryJXf or tr r rvlr C INJtVJr Jr 

PLFFFFFFFFITTAP*KITKSIPYFYFF 
TKTFYL 






a 




JL 




Ci JL Don J. VAy<3V3\.OVJAl Xnluvoljvljjyuo JL 

NPPTTAS * VAGTTDACDHAWLMF*FL\ V 
EMGSHYVAQAELELLS SSDPP \ PQSPKI 
LGLQA 


6224 


20125 


A 


6278 


3 


204 


IRCYRKIFYERKHQWLQQISLLSYFKKL 
PQP\PPL*ATITLISQQSLISRQDPPPV 
KR* * LTEG * DDC 


6225 


20126 


A 


6279 


57. 

< 


306 


INKANRD KL/GT I DFLEFLT I KVRKMQN 
TKTESKKRN*ES ILHVE*EDTGHVRAAA 
VCRDMTYI<GEKLTDEEVDK\ I IREAKYW 


6226 


20127 


A 


6280 


419 


18 


SSSSPKACMFVYEWVCTQKCVWIYEYMC 
VFVYV*LVLMCCMNVCACM\C*SACV*M 
CVICBCVCM*GLS VI CEGVCM 


6227 


20128 


A 


6281 


459 


235 


FLRNGVSCCHPGWS AWQS* LTAASNS * 
AOVI L P S S WD YR P VTSHPAKTHNFYMO / 
GGNKKMCVTHS I VI FSLLQ 


6228 


20129 


A 


6282 


505 


470 


GVP*GQEFEPSLSHMGRPCFFIiKCKINP 
AWWQKLF* PRRQKFRLTQI /LALAPQPG 
QQKVNFFS /NKKKKKNPNNC* IAIiR 






*i 


OZ5J 






{3fifif^KifjQIWt^RKKRfirtRIfIfTPPK , *C5P " 

PRFFFSPPYKKKKFFFPPPFFFPPPNFF 
FYPPPPPFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFLNHFF\FF* LYVI TV 
LFMGHSDTLMHVHNV 


6230 


20131 


A 


6284 


431 


269 


YIKGGPPPEIETFLIFFYYIYLIIFIFF 
FFFFFFFFFFFFFFF * FVCLKYLIFNHI 
QRF * TY I S I PHFMLLM \ YTLHLFTFCI C 
SLFTI IWFYIFV* L* YCCTVKALCNSL 
LVSELDKYIKNNYNCKQ 


6231 


20132 


A 


6283 


298 


44 


YLFFFFF^VELVFFLFCMFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFEFKNKVSFNS - 
CFSLSLFKLYFLKYNFIQj^QGNLHFNNI 
I 


6232 


20133 


A 


6286 


386 


1 


ANLHIKVNFVVNFTXiNFSFLPVWFGFH 
FLW*RVYLSHRGFYVFFQNFLRI FFFF 



928 



WO 01/64835 PCT/US01/04927 

• * • 

t. ■ 



SEQID 
NO: of 

sequence 

• 


SEQID 
NO: of 

n< > ntid& 

sequence 


M 
cth 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 
beginning 

mi rlprtf irti* 

IIIIUCUUUv 

location 
correspond 

|ho to fir«f 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

mi c\ 
IIUHC** 

otide 
location 

tUI I C3UUII 

ding to 
last amino 
acid 

residue of 
peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

Fj=f«IlltaiTlir Arid IJ^Phpnvl Alanine 
i/^VJiu lain it nviU) ft' m uciljlafaUIUvi 

(XJlycine, H^Histidine, I=Isoleucjne, 
K=LysIne, L=Leucine, M=Methionine, 

JV^Asrifl rapine. P^ProIint* 

Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W»Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possJble 

nuucuiiuc insertion 














. SF* VG \ FFFLFFLLFFFPFFFFFFFFFF - " 
FFFFFFFFFLfclAIVFY* IWYISYFTKY 
IEEPRTRGRTRGRTRG 


6233 

• 


20134 


A 


6287 

♦ 


190 


134 


LEFGRNRFQFSRMILRGSGLQWRNYADM 

GQNKL KTFWKGFS \LDAING IHNSQBE I 
KMATLTGVWKK* IPVLKDDFEGFRTS VE 

T7 /\T 7*PariM\7lfT JitrvV T.PT.TTVTWDV EFTWT* 
Ej/ V / LJ\LfV'l \JP^±J\J\k \LidIid Vfiir \BUV1" 

LLERILF 


6234 


20135 

r 


A 


6288 


394 


2 


RRQASLSCGGLHPVRSSRLLCliPKQAWA 
MAGAPPPASLPPCSLISDCCASNQ*DSV 
GVGPSKPGVGYHLLVHRFLSPLEKRSIR 

UnraDPCDfDT QTJT OTTDvncr^T o\ Tiaf" 1 
V la yJtKK.oWJaJjo r JjoIjaiuUmN w UryltTsm 

ASRVRQCLTLLQLAHGALHP 


6235 


20136 


A 


6289 


233 


58 


DRVSLCRPRWHWSRTRLTVASTSR/LQ 

RrT.T)DrtDDT7*rjiDA&c%nDaT)DaDin' *wa 
ALJUir Jry c fB w JjUirUAo ANK3 if KH rlvLilAj 

LRG 


6236 

• 


20137 


A 


6290 


385 


48 


TLQ YLCTPTSTV * KKKKPEN I KYW *QCG 
ANRTLFQ * WQVCKVAQTLWKPVWHYTVL 
NLNTR I VYDPGM PLLGTNNNAYMCC I C \ 

VG 


6237 


20138 


A 


6291 


273 


36 


RNGRWTPCGGR* IHPS*GTDPVYLPVPG 
TLS S S QASNSHPLLKAPGSRP \ PWPTDL 
GYKDPRGPPSNPACRRRPSLFHSD 


6238 


20139 


A 


6292 


394 


5 


AKZ YFS INPLVFSKQ* FLKNVFPNXItHTi 
QFFSNINSLMLIKV*ALLQDFPLVHNVY 
NKICDTSKDTTTLFMYLMFVFKMKTFFS 

♦GQEFKISIiANMVTASLY 


6239 

* 


20140 


A 


6293 


536 


190 


RRSFALAVQAGVQWRDLGSLQPPPPRFK 
RFSCLSLPSSWNYRHAPPRPANFVFLVE 
TGFLHVGQAGLELLTSGDPPVLASQK\A 
GMTGVSHRVQPD*GHGGSRLSSQPFWEA 


6240 


20141 


A 


6294 


2 


238 


HEADLGTTQVLTHPSTTAMYFVHYCQPP 

* TT.VftTTM , T»PPTri7mfMPTWTirTP /COP 

LQASTAINPQLSHINC\NPKDTPH 


6241 


20142 


A 


6295 


2 


447 


FFFLMGVSLFCPGL/ WS AMGAI LGSLAT 

DV^JiAJVy V liJIJVvJjOiiirOOUIvljy Vw\X XT XI 

LG+FLYF*AEDGGFSPCWPG*VSNS*PQ 
VIHLP WAPKALG I TSMSHCCPACFHCTF 
FRHFRQNPHLNtiFFFLEGLGLSPPPPSQ 
PLEGGKGPGN 






a 

n 


K>l.y\> 


J. 


1 w 


AEFPRPEPFPFP'RPF\ PEPEPRPFPRPR 
R * VERERGGVPLRTYR 


6243 


20144 


A 


6297 


3 


244 


HDYSSFLAPPALGYMRMPLHPAYFILFY 
FIS/FYLFFSQDBVLI i C*IiGWSQTPEIiN 
OASa^LQKWWDYRCE'DOCLAECDS 


6244 


20145 


A 


6298 


1 


361 


GTRRS AMLARLVLNS GPQ/ C YLPALASQ 
RAGI TGVS H * GC P F F S LLKKRWR SHCVA 
QAGVQWFFRG*TPGLSCFSCSLPSN*DY 
KCAPLCPP*DTVF+VRAVSSBVPLTITY 
RGEGAGVWA 


6245 


20146 


A 


6299 


3 




348 


HERHELEEL I KNLF F FF FFGKKI FFFFP 
GGKNGGPF* FIGAFS FRPQG I FLP YPP * 
KW\DWGPWPPCRGDFWFF*KKTPFSILA 
RWLNS PPWGILS P YP PKGVRFPGGTPR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
oentide 

m* ^# u w 

sequence 


M 
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od 


SEQID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

seauence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspa rtic Acid, 
E=Glutflmic Acid. F—Pbenvlalanine. 
G^GIvcine, H^Histidine. I s lsoleucine. 
K«Ly$ine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine> — 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, . 
X^Un known. *«=StoD codon. /=oossible 
nucleotide deletion, V=possible 
nucleotide Insertion 














FGPV 


6246 

■ ■ 


20147 

• 




6300 

• 


1 

• 


436 


VDGEFSCLSLPSGWDCRRASPCPANF/S 
VFLVG IGFRYVGOAGLELLTSGDLLASA 
SQSPGITGKSHCTRPRKHH* IANSSSVL 
GDFRIDYFQI SRKAQWITLGYNPWAESH 
LLMGAGLRVLPAPRLMISQVDLLPATHP 
SSGRRG 


6247 


20148 


A 

■ 


6301 


370 

■ 


70 


PKG*STLPASASRSAGITAVIL*SHCTS 
PPM*LRPKHFTHSSTSTPP /CFFFFGKK 
ILI FVPQPGGKGPNLS *LKPLLSWLRKL 
PCLTL * RGGDYPG FPPP P \ S YFCFFI KK 

mm W<U Am ■fm%B*^B*B» A B* Bp a* 4h mm a» *" «» »VbVV#V 

GV*PCGPGGNS*PASASRSAGITAVIL 


6248 


20149 


A 


6302 


1 


285 


KKFFLFTYLFRDRVSVCYPSWSTWE/S 
L * S WL IAASN / CLG S K * FSHLS LLCHHA 
LLVDFNFL*RRRLIRQPRLVSNSWAQAI 
LS S OPPKALBLKT 


6249 


20150 


A 


6303 

• 


194 


351 


GTF ICI KKPVQ/ WLDTVAYTCTPSTLGG 
RDGRVT * AQEFETS I/3NI vrphvykk 


OZjU 




a 




IK A 




VT3HMP /^*TiariPT7TPn3rmiiT?MT.PWT«VT 1 
iiuu'ir / kj Linus ric wwjiir piurnuvu 

NL*AQAILLPQPPKV3jGLQE 


6251 


20152 


A 


6305 


1 


346 


GTSSLQLISLHSSPTASYPPKHLHTRLT 
FJiLSVSLMT^HVLSV^PATSS/PILQPHP 
IYPCTWIMFTHSSGSGKPSCQTHTHTYT 
HTHTRMCAHGSGHATIiPP * HPLHPRHSA 
YLLW 


6252 


20153 


A 


6306 


395 


177 


WLNTLW YIHTVDYFLAMKRSDLLIHS pw 
MNLQGIMLSEKRQ\SQR*HW* FYLYNN 


6253 

• 


20154 


A 


6307 

• 


2 


718 


GVLPYWPGWSRTPDLV/HPPTSASQIAG 
. ITGKSHRALP \ KF AFLTS SQ VMLTLLAH 
GTTAWFS KLS * F * EPPMTCAVFSWTG 

EKLTTGI S * QKEHE LPFSQS SD I FKLAY 
LIFFW/CWEMESHSVSQAGVQWHNLSS 
LQPI»PPRFK*FSCLSLPSGGDYRCPPPH 
MANFC I FSRDGVLP C * PGW\ S * TPDLRM 
DP PGLRP FOS AGDLTGMEA 


6254 

* 


20155 


A 


6308 


377 


67 


PPPPKIFFFSQTLFFFWGGCPQIAPPPK 
KFFF*KNPQGFFFPPPKKKKIFFFPPPK 
FLPPQKFF*KPP / LS FFFFFFFFFFFFF 
FFFFFFF PTI * I F YSMSFLP 


6255 


20156 


A 


6309 


1 


362 


QTYAKMHVTI I FFGVK\LTFLPQRFLGL 
SGMPRRYSDYPVAYTT*NILSCVGSFIS 
LTALILI IFMIRFAFASIRQELIVEEPS 
INLE*IiLGCPPPYHTF*EPGYLKPKRKR 
KESSPPKLV 


6256 


20157 




6310 


2 


411 


VLPTLWEAGARGSLEARSSRLQ+AMIT\ 
HCTPPWAT 


6257 


20158 


A 


6311 


59 


334 


THACGNMLSYAGSFISLTAVLLITFMI * 
EA\ FAS KRKVL I VKEPS INLE * L YGCPP 
PYHTFEEPVYIKSRQKRKESNPPKLVSS 
QPHGLHDL 


6258 


20159 


A 


6312 


3 


22 


KPSP * PLTGALLPLLMTSGLSM* FNFHS 
ITLLILCLLTNTLSIYQ*WRDVTRESTY 
QGHHTPPVQKGLRYGI ILFITSEVF\FF 
AGYF\WPFTHSSIiTPTPQI*GHWPPTGI 
TPLNPL*VPLLNTFVIiLALGVSIT*AHD 
P 
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od 
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peptide . 
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ed end 
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ding to 
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peptide 

sequence 


Amino acfd sequence (A=Alanine . 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
OGIycine, HNHistidine, Msoleucine, 
K=Lysine, L=Leucine, M=Metbionine, . 
N=Asparagine,P=ProHne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6259 


20160 


A 


6313 


11 . 


182 


INKMW*I*VRG*YSVIKK\DEILSFMAT 
WMEVEDILLSKIREAQKVTHPVFSFISG 

s 


6260 


20161 


A 


6314 




421 


IQASTMAGAPPQASLiLPCSLILDCCASN 
S/RGSVGVGPSEPCTGYNLLVOiLLRPS 
EKCSIRVGVT*FSRCRPSP/TRKGHSPT 
PCTSRMR 


6261 


20162 

i 


A 


6315 


I 


111 


DKGMNVFMPANTASVLWPMDQGVILNFK 
SFYLRN\TFSKYISAIDSDSSVGSGQSK 
PKTF*KGFTNLDAIHDSWBEIKIST*TG 
VGKKXiI \PTFRDDFKGFSTSVAEVSADV 
VE IGREL* LEVKPGDRTBIiL * SVLWPMD 
QGVILITFKSFYIiRNNIF 


6262 




A ■ 


6316 


440 


312 


TSSFSLSFF/CQDG/ SLTMLFRLVSNS * 
PQAILPSWPPKVLGLQL 


6263 


20164 


A 


6317 


5 


404 


PRYMIVKLLKI/ERNKILEAAREK*HFT 
SKGKAIWI rVDLSSETWEVRKNLHNFQV 
LKEK/NKTWE VRI L YSAKI S FQKKGKI K 
IFSDEGKLR * S FTSRPTLRE * *RKFSK* 
K*NDRRRNLKTFGRKKEYPKELKP 




*VJDJ 

» 


A 


UJIO 


394 


182 


MFIVA/VFSKAKEWRQPKCLSTDEWV/Q 
YM* YIHTMKYYSAIKSNELVIHVAA* IL 
FENTVLSERSHIWHGSHI 


6265 


20166 


A 


6319 


1 


378 


APSDFFLFQSLNKPLNCTHFSSVHNVKK 
TSLT* IjNSQKrPQFLRDEPNGWYHCIj/QK 
YLELDGAYVKK 


6266 


20167 


A 


6320 


286 


314 


NRCCFSLPTQVKCFNLYLGICCVCDWGY 
YRWLNF *NP I F* FI LRQFHSVAQAGVQW 
RDLGSLQPLTPRPANFYILVEMGF/TPL 
LARLVLNS * PSLLKYKN 


6767 




A 

* * 


6321 




397 


QKTLNMEKEMNI YIHKM* ITPNRSNIKR 
SSIoRHITVKFSKVKDKQRI /MKLAIEKQ 
PITYKETTIRLSEDFSAETL/NARR*WD 
DIVRYWLKKKKKKKKKKKKKRGGGLLKN 
SLGGPILTGGGKEIF 


6268 


20169 


A 


6322 


411 


2 


TEPPPPGGNF\L*TKPGKKNFLPPKNLG 
FFSPFSPLKFFFSPK\P*ILGGGGAPFC 
PPQKKGFFPKI PRGVFFSPP/YKKKNLI 
FPPPVILGPPKVFFKGPPPFFFFFFFFG 
QSGQVKLKSPKCK 


6269 


20170 


A 


6323 


395 


280 


MLARLVSNS * P\CDPPGLASQSAGITGM 
SHHVPQVNHF 


6270 


20171 

• 


A 


6324 

• 


3 


359 


QHYLVFRETQIKAAVRCHYI PTGMAKI I 
KTGSTKC*Q*CX3AAGTLIHCWEECR\LK 
QPLWKTAW*FLI*MLCD*TVPFLQIYPT 
EMKMCVCRMTC\RVFLSVLFIIASSWKQ 
PKYHKLVN 


6271 


20172 


A 


6325 


3 


174 


FFFETGPNFVNQAGGQGHDLG*IiQPPPP 
\GSRI FPA* TSRVAG ITGLNHHPHPLGS 
IF 


6272 


20173 


A 


6326 


265 


13 


taspiri i itsqdvlhs *avpslglktd 
aipvrln\qttfsdtrpgv*ygqcseic 
ganhsfmp i vleli plkife igpvftl 


6273 


20174 


A 


6327 


32 


259 


CYYEHL CTS LCVNWYFHFF * VYT * EWQ W 
PKYLSADEWINKM*YIWTVEYYLTIKIS 
EVL/SATTWLGLENIMLSKRS 


6274 


20175 


A 


6328 


1 


228 


PGLNHCSHLSWDHRHALQCPANFG\YFV 
ETRFTVLPGLVSNDPPALASQSAEITGV 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
jl/-~v» iu ia line aciq, h mt uciijiaianinc, 

cine, xi — nibiiuiiicj ■i^uuieuciue, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
tt kb i rypropnaii} i^iyrosincj . 
X=Unknown, *=S(top codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 
















6275 


20176 


A 


6329 


346 


1 


LPKKKKKKISRGLFPFQLPKKKKDNEKS 
LTFSSETMPGRRQWSKIFKVLRGKKQPQ 
QPRI ISIiAKLFLKSEDFL/ETQKLREFV 

RALK 


6276 


20177 


A 


6330 


349 


77 


HTTPKKNKHN/RKKIKLDVLRYNKVDDN 
SKMSGLH*QCPSDECDAGVFMASHKQ/F . 
DRHYC^KYCIjTYCFNKPBDKAQHCQoIiE 

prslrpawat' 


6277 

> 


20178 

i 


A 


6331 

t 


319 


46 


RPRRPETK* ALNK* KDI/ PMCSWKQRHN 
TVKMLII^KLINRFNAITIKISTE \FHR 
NKLILAIIWKCKGPRIAKTT*KGIKLEE 
LLYLISRPP 


6278 


20179 


A 


6332 


496 


309 


DRATAF*RQSFTLSL/RAGMQWHDLSSL 
QPPPPGFK*FSCPSLQSSWDYRCPSPRT 
G*F\CIF\VEMGFHHGGQAGLELLTSGD 
LPASASQSAGITGVSHCTWPNILNVJK 


6279 


20180 


A 


6333 


3 


173 


HHHSRL/ IFFCS VLGTSSHYVA*AGLEL 
LGSSNPPDLASQNPGITYGSHYAWPDIP 
VP 


6280 


20181 


A 


6334 


339 


153 


FSFPPPFFKGFFFPPPPPPF/ SF*PLRG 
GGPPFFFGPPGVRPPPSPWSWCSPPPV 
VLASFFFS 


6281 


20182 


A 


6335 


32 


331 


QLiLNHS V I VFUjS FDHS it JjRNLiSIjF LA 
FKLIGVKNS*YIILSFRYLLHP*HLLNF 
FFF/CKRQGLPVLTRLVSKSWAQGICQP 

W fir j\VKV9J_iKAn£3 IroJjoiJ 


6282 


20183 


A 


6336 


226 


177 


DGVSLLLPRL\GGAISAHCNHDLPGSGD 
SPVSAS*VTQLQYYFCICLGLFTFTTRL 
SAYLGTGKLTTALPNTPGLLDS 


6283 


(Oil 

20184 


A 


/AAA 

6337 


302 


59 


VMY T MTTMn\TDTJ T T /DtraVCDOIMirweO 

KIYLLJLdNi lMLoVJKo J.1*/ KKKIwi'KV^lUMcjo 




VSYFKKLPQPPQHSAASTLISQQPSSSK 
QDPPAAK* L* FVEGRLIAFPSNNVF 


6284 


20185 


A 


6338 


197 


28 


AEKVRFSSAVTWHIiGSEISL* *SKKNL/W 
G 


6285 


20186 


A 


6339 


46 


317 


EVPSS ISSQISLFSGDFSSVPLEELSNL 
FFMFCPEHI I VTCERVTSNYS / YYKWKP 
EFSYP * I F/CFKMS LTMLPGLS LNSWAQ 
VIIPPQPPK . 


6286 


,20187 


A 


6340 


4 


276 


QTTIjRGTTVQI TKNIiio b I MW Ab KUKKK 
NL* SAWGWGMWENNYQ P IL YS AKI S FKN 
GVEIKTFTN/KQKIKN/FITSITLV*EI 
LKEVLQAEG 


6287 

• 


20188 


A 


6341 


485 

• 


2 


GVWEFLETESRSVAQAGVQWCDLGSLQA 
PP \PGFTPF5RLS LLSS WDCGS PPPHPA 
NFFFVFFVETGFRHVSQDGLDLMVSIS* 
PREPPASASOSAGITGVSHRARPKHSKI 
LIAGSRRSIQMFPVKFFQKLAWENPQN 
TKDFLNPYSEILLHTSQIHSSC 


6288 

* 


20189 


A 


6342 


2 


339 


EDLRVKLKKEG YSN I S Y I WNHQDPMLN 
SNGSVTVVALIjQAS * YLCILQAS * assq 
SWTS/VTLAAVSFQRISNQEHIPVYQQE 
ENQTDVWTItliNGS KDDFLI YDRCX3CLVY 
HL 


6289 


20190 


A 


6343 


181 


12 


RWGLPSGWDYRSMPLHPVNF* 11/ CMRQ 
DFTVLPRLILNSQPPVILLRWPPKVLGL 
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E>Glutamic Acid, ^Phenylalanine, 
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K=Lysine, L^Leucine, M=Methtonine, . 
N=Asparagine,P=ProIine, 
Q=Glu famine, R=Arginine, S=Serine, 
T=Tnreonine, V^Valine, 
W«Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possf ble 
nucleotide deletion, Impossible 
nucleotide insertion 














QA 


6290 


20191 


A 


6344 


3 


295 


YWHLSCF*FLAITNKAAMNIRIiLWTCAS 
I FPR * RSRKEMAGS *GS / SLFNFVRNCL 
TIFSHDCTTL/HS/HQQWGOV/PVLPFP 
HQHLL* S IFLVFDILIGI 


6291 


20192 


A 

• 


6345 

• 


334 


8 

■ 


KPSNPGGEGPP* KKGPKVFSPWGKKBNF 
S PKKKKKI * CPQKPKGPPWGAPAFKKKX 
NF*NPRGSPQL*KKRGIIGPPTLIPPSS 
QRPPPKKKKKK/ GPPKGALLAFGGIKFF 
FFFLEKNFLFFPKGKTIiLAPFFKGGLPL 
PG ' 


6292 


20193 , 

i 


A 


6346 


68 


256 


WVA\ P VTHTCNPNTLGGRRGR I T* AQB F 
KTSLGNIVGPHLYKNLKKKKKKKOEKKP 
GGGFSPL 


6293 


20194 


A 


6347 


385 


60 


KKKKKNRTPYDAAI / PSLGTSSKEFKS3 
SSRDICTPMF1A/AITKRWK*LKCPSMV 
NI/VKKMWFIHMMQY*SAKKKEILS*MH 
LKDVMLSERMLSQSQKKNFCMILLIWGI 


6294 


20195 


A 


6348 


234 


61 


CHRIEHIKMVNL/WLDWVHl'tK^STW 
GQDTW IA*AQEFKMSLSNTAKPCLYKKY 
KN 


6295 


20196 


A 


6349 

■ 


2 

- 


358 


FLSLGIHFNYCLVTTESKLSKLRLTYFF 
FFFLETEFPFCSPA*RPW\FNLG*LKFP 
PPGSRETPALTLPRTWNNRRAPPHPANF 
GFLRKNGV* FFGPGGFKTLDLGGPTRLW 
FQKGGNN 


6296 


20197 

- 


A 


6350 

• 


346 


27 


KMV* SLWKAVWQFLIKLNIH* P YDSAVP 
LIG/IYPT*MTT* I YTKP * TQMLAAVS F 
IMAKNTRNESNI FNWPIDKLWYIHIMEY 
**AFKRN*liLINTKT*MNLKC*LF 


6297 


20198 


A 


6351 


36 


158 


KNS VPSQE * L YYL \ RQENRLNSGGIGCR 
EP I WCHYT PAWAT 


6298 


20199 


A 


6352 


2 


146 


RSSHLSLPSSWDYRRAPP/H/LANCCIF 
CRDR/SFTMLPSLVLNS*IQTILL 


6299 


20200 


A 


6353 


74 


303 


EPS I CGQI YLS FS I YKIQK* / WPGAVAQ 
ACKTSTLEG*GGWIT*AQFKTSLGlSn*4MK 
PCVFSKKKKKKKPHPGGGGRKK 


6300 


20201 


• A 


6354 


3 


210 


NNKKIWTLKNLKFFRDRVLLCFPDWSTA"^ 

VHSRLTAAS\TLGLKQTSCLSIiVSS*DH 

RCMPPCLALKRKKT 


6301 


20202 


A 


6355 


2 


627 


grvggfiivpmrdlaeasrgqqgqvklge 
mkpsvscvcx3gv*ciskfklkn*l\wpg 
tvahtcspstlggqggwia*aqbfets'l 

DNMVK\PCLY 


6302 


20203 


A 


6356 


584 


166 


RQGHNVAQAGVQWRDH3SLQPPPPGFK* 
FFHLSLPSS WDYTRVPPHLANF\ *FLVB 
TGFHHVGQAGLEL P TLS DLPAWAS QSAG 
ITGVT/HQARLQLL* KELINPSTLGPTW 

s 


6303 


20204 


A 


6357 


448 


202 


FTTLVS E C I * D I HDS W KDV* I STLTGAW 
KTWLI PTIiVNGFEG\ FKT* VEEVTADW 
EIARESELEVEPEDVTKLLKSHDKS 


6304 


20205 


A 


6358 


3 


237 


DRSSL* P * P PGLK* SYYXjSLPSGWDHKR 
A*PHPANFCLFTF/V*QRGLDLLA*HDP 
PASASQGN* ITGMSHRAGPPLTF 


6305 


20206 


A 


6359 


49 


242 


SSYliSL+P * PPGLR* SSYLSLPSSWDHK 
HA* PHPANFCLFTF/ V*QRGLDLLASHD 
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nucleotide deletion, \ppqssible 
nucleotide insertion 














PPASASQSTS ITGMSHRVWPPLTP 


6306 


20207 


A 


6360 


508 


223 


FVTQ AGAR * LDLSSLQLPP PGFKQPS CIi 
SLPSGWYYRHVP/STPIVF/CI PSRDGF 
HHVGO AGIiE LLTT * STCLGliRS AG T TGT . 

xxn v mpiwmajjj i x x u x \<uwui\wnui x vtu 

SHRAQQERGFEP 


6307 

■ 

* 


20208 

- 


A 


6361 


2 


595 


AIWG*GTHFLIPWVQKPIIFDHRSQPH 
NVPVITGSKDLQNVNI IPCILFGPVTSQ 

T,PR T PTR TnPDVDRRVT.PQ TTTRTT.Tf QV 

VAR FDAGEL I TQRELVSRQ VS DDLMERA 
ATFGT.T IiDDVST.THtiTPRKP.PTEAVRAlf 

*Tt X XT VJXJX, I.IUU V O XJ X UU1 XT AXVClx X unVA/UV 

qvaqqdaera/rns LATAGDGLMBLCKL 

EAAEDITYQLSRSWNITNLPAGQSVLLQ 
LPQ 










i 


9^0 


T iP S PYPr»T .WYV * 9 <5 * PT.PT .TfP T iT T.P T P 
xjxTiOc x r hi ix x x *v o o f xjivxjivCvLJXxjjr xr 

ETECCPVA\QLECSGTISAHCHLCLQGS 
GNSPAPAYRVVGITDSFRITHCILTCIP 
SE 


6309 


20210 


A 


6363 


60 


221 


NTTKLGMVAHTCNPSILGARGGRIT*V* 
pppfTQT.r>MT\nr\ prrppvtrcT Tirr.w 


6310 


20211 


A 


6364 


1 


393 


RPRRPQTGVQWQNLGS LQ PPHPGFKRFS 
CLDLPSSGDYRHTPPCPANFSGFS/HHV 
GLKLFDS SDL PWP pKfvg I ARHESHRKC 
T*P^pp/3PPTAnpr.pp 


6311 


20212 

• 


A 


6365 


19 


85 


Clx^SKrMaSSRITi^l<MVQPL*RAriWQ 
CLiKLNIPHPYDSAIPLLHFYQCK*RHM 
NPHN\ TFTSMFITALFITAKN* KQSKCL 
S IGQWMKK* * CIHTVQ*NSGI * *HE* IN 
SKIMQSSRITTLCKMVQPL 


6312 


20213 


A 


6366 


2 


298 


PPRxxAKVQKFLLFFYLVF*DRVTLCHPG 

WT fi W* Q HT .TV*? T . A RRTtfT ."R T . D Q mxTTYVR TT 
n xnv V Oyill V X xxrxCxKiN L»r\xJ <r OvJriL/ X flxl 

VPQRIiADFVFL/CR/DRGFSMIiVGLVSN 
SRPQEV*ATSPSQSLLS 


O.? J "J 




A 


OJOf 


Of 


OJO 


OnNKITG I RYT.PVYPBfl Q PAT .VAOxVnVOtt 
^xxl*xvx\ufx>£vx xjit v x nv^er/uiv/iunuvyn 

HD1VGSLQPLSP* FKQFSCLSLPSS*DYR 
HAP P RP ANFL / Q FLVETB FHHVGQAGLE 
LLTSGD * PASTSQGAGITGVSHCTGPRJ. 
RFLKT FS HLEAPGG FPEKRQ PLRSERLQ 
FRAFPOlTVPCAHAYLLiGCVSHLiEMGCRN 

IDIPELSFFLCPLQCLIYRGKSP 


6314 


20215 


A 


6368 


3 


251 


RSRHCTPAWAPLS TQDHFYVFYFFLHFL 
IGSIiFIFACFVSSYFIEFSYGLXYFFVD 
S FG F YR YTMK* NLNSNY F KLS AE ILPQ 


UJJ J 


9091*1 

X.UX10 


A 


ojyoy 


ill 
j j j 




x cvx\xj v xa'xnnmv x \+i\mst c x x yu v cr vnuiw 

/TGDLWEEPGWSQTPDLR*SALLGLPKS 
WDYRHEPPPRLANFCI/FLVETGFQHIG 
QAGLALLT/ S / GDP PASASGALE * EiGGG 
SCL 




90917 


a 


OO /U 






P PTR PFMP PHPVfiOAf; / PR Q PfiT.IT* QBr 
k r x i\rxii lAr nr v uy/iur/ r no r wUa qau 

LSLS KCWDYRHEP P /* PG* FSFFTYIYI 
YFFFLFFS FS FLEEMGFHHVVKAGLELL 
C*NDPPASASYSDRI IGVSHQARPFFFF 
FF 


6317 


20218 


A 


6371 


2 


375 


D I QM I DRN I KN/ CSTSQS I RKLQ I KRXM 

* YHLTSVP^AITKKTTHKC * RCGKK\LH . 
C*WQCKLVQPL*10 , IQR1^QKTENW1NI 

* PSNSKN/WKSAC*RDICTAMCITT*PI 
RAEHGGS \ PCNPSTLG 


6318 


20219 


A 


6372 


75 


372 


NGNSQKKKKKKKKKGGPPLXKPSGGPNXi 
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NO: of 
nucleotide 
sequence 

*< 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

• 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=Glycine, H^Histidine, I-lsoleucine, 
K^Lysine, L=Leucine, ^Methionine, 
N=Asparagine,P=Proline > 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Va)ine, 
W=Tryptophan, Y-Tyrosine, 
X=Un known, *=Stop codon, /possible 
nucleotide deletion, ^possible 
nucleotide insertion 


r 












PPGGAKKS FPIiKGGQKKPPGGFLBKNPP 
LGGAPL\GPPPPPKKRPRGKTKP*GPQ*R 
GKNPAKPWLGKKILKP 


6319 


20220 


A 


6373 

* 


605 


350 


ISTQICLGMFIAALFTIAKL/WKAT*PS 
FNR*KKKMCYIYASYSAIKK\DAILSFA 
ITWMNLENIMLSEISQTHKDKCCMISLI 
CAI 


6320 


20221 


A 


6374 


1*7 


378 


KKSQGGTNIKPPAITKYLSQMO(»*Kkiil* 
TGFLKKKS LLGGQAGVT ILLT\HRTFST 
KINDPDGGGDHILSQ 


6321 


20222 


A 


6375 


994 


747 


VSLALS PRLECNGV I S AHCNLRFPGSNN 
SPASAS * VAGAHHHTQLTFVFLVETGFH 
HVGQAGLELL/NLVIRPPWPPKVLGLQA 


6322 


20223 


A 


6376 


1956 


1592 


HEFEMESYSVTQAGVQ*HDLGI*LQPLPP 
RFKQLSCIiSLP/RYWDYCLAPPCLANF\ 
* YLVKMG FRHVGQ AGLKLLAS S DAPASA 
FQSAGITGMSHRARPKSAFKIQAI IGEG 
GVSELHRKINI 


6323 


20224 


A 


6377 


94 


402 


KTQELQNKI SHCLI RRVDRKM/ WLCTVA 
HTYYPSNLGGQDGRI A*GLEFBTS LSNI 
TRPRLYRKKKKKKNVFQAWPLWSNFLRR 
LRREIRAEKSF 


6324 


20225 

■ 


A 


6378 


463 


146 


GKLGPPKGSLKPPPPFFFFL*R/P/HPP 
PRGETPFFFKKPKKSPRGGGASPPFPPL 
GRVR PENFPP PRGQGS P* PKRGPTPALP 
PGGQKETLFPKKKKRKKKKKYMRLI 


6325 


20226 


A 


6379 


3 


207 


LE YQVAQTMCI I S SRI QLVAEKAS VPFV 
DKTVLS \TMGQL* EAKAEDGFFVPG* YG 
LAVS P LKFQ STR P 


6326 


20227 


A 


6380 


328 


70 


RPRRRLGLPRSWDYRHEPLHPDQLLVEI 
M5HYVAQAGLKLL\ ITGVSHHARPH/YR 
F*LGHC*SLVRRPWTISYSFLC1 J GFPMW 
EVSD 


6327 


20228 


A- 


6381 


13 


77 


VIFLLQPPK*SSCFSLPNRQDYRRVPPY 
PANF* FS /CRHSLTMLPRIiLF* TGLQAC 
ATIPS 


6328 


20229 


A 


6382 


342 


77 


KGQGFYPHLFFFFFFF*KDSPSIiPRVEC 
CGEISAPCNIi*LFGSNDFP/ASHAIVGG 
KLPKQRKGAFFS * GGGFFFFF* M* KHSV 
LNSSRN 


6329 


20230 


A 


6383 


1 


208 


RGRVDKI FYLFFETDSQSVT* AGV* WLN 
PGSLQPLPPGFK* FS*PQVI\TCIiDLPK 
CWYYRRBPLCPVY 


6330 


20231 


A 


6384 


277 


50 

« 


LVLKLGRCKTSSFI *NFKGPQIAKTILK 
XNKAGSLT \ LPNFKTNYKATVI KTV* YW 
HKH* YRHGFISGLS ILFH 


6331 

• 


20232 


A 


6385 


2 


354 

* 


AAASTNFRDR/ IL/IiCHPGWSAVAQS*L 
TAVSTLN*AQGIG\SSHLSLLNSIiDYRY 
APPLLAVLLCCPGWPQNSWPQVILLSQP 
PKALGYRCEPPCPAPLLLNFFFFFKMRL 
WFCCPGWSA 


6332 


20233 


A 


6386 


2 


188 


FFKYICY/ CINMYMCFYIFINIYSIiIYL 
♦ IHIYKYIFAYIFINTYL* IYICLYIYK 
YI FIYYR* * I YICLYI YKYIFI YYR 


6333 


20234 


A 


6387 


2 


241 


WATKCR* G* RRTGTLTGG* WDCKASGQL 
*KTFQQFLISITY/PYGPAISLLEIDPS 
ELKTYVHTKTRT*LFIAASFIITKN 


6334 


20235 


A 


6388 


142 


429 1 


YCVCIYI YI FLYLFI FFCTFFTYSSLHL 
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peptide 
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eth 

od 
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26 
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peptide 

sequence 
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Predict- 
ed end 
nude- ■ 
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location 
correspon * 
ding to .„ 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G^Glycine, NNHistidine, Msoteucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
•^Threonine, V=Valine, 
W^Tryptophan, Y^Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


- 


■ 


- 


■ 






LIFSFFFFLiERGVCFSPRGGGGGKYLGL 
WRFFPPE*IKKNVYiyA\QQYYSAIKQN 
EILSFAATWMELEAIMLSEIRQAQNDKH 
YMISLICGIKR 


6335 


20236 


A 


6389 


468 


250 


VHRRTPPCPIIFFFF/CKQ*VLPMLPKL 
VSNSWPQATLLPWP SLTVBFQALTPGWK 
NFKKMFLMVMSWS I KKR 


6336 


20237 


A 


6390 

■ 


616 


1084 


LCLCQYAGISVQCPDCVFQFFHNVDSTY 
EDYLRKLKTSLS FFWRRSFALVAQAGVQ 
WCDLGSLQALPPGFK*FSCLGLLSSWDY 
RHPPWHPANFVUjVEMQFIHVGQAGLEL 
LTSGDPPASASQSAWITCMSHRTQQE\Y 
FLNLKFSTSPGELLCIiK 


6337 


20238 


A 

• 


6391. 


415 

• 


200 


SPPPRFFFFCPPKKNFFFPPPRSKNFFF 
*RGPPFFFFFKKTPFF/YFFFKPPFPPR 
/RFKNPPKI FFFFNRGPKNPGGGGGKKK 
PPGPGGFSPLWGPPPPFKKPPPPPKKNP 
PLFGKKKKKKKKKKRAAPR*NPRARGVF 
FFPPPPLGFWAPG 


6338 


20239 


A 


6392 


3 


117 


KLKPSSH\DY*VGETYGKDITSRGKDKP 
IAVCKTFSPLS 


6339 


20240 


A 


6393 


3 


204 


DRVSLCRPGWSAVA*S *I»TATSNS * \ VK 
QSYHLS I P*GS WDYS CVP\ PPLG* LVIL 
LRQPPKVLGLQA 


6340 


20241 


A 


6394 


501 


187 


AGVKWYEHGLWQP P P PGLKRS S HLSL PS 
S*DHRHEYPCPANF* KI FF\ VETRSH YV 
AQTSLEFLDSSNPPTSASQNAGI \ *GMS 
HCAQPMQTFSLVKIGTNFLIF 


6341. 


20242 


A 


6395 


3 


393 


hacgaimnskphavtq yhtpllm * syii i 

TT\VLLLLFVT I LNDGITILLTDRNRNT 
TIiFYPSGGGDP ILYQHLF* FSGHAGG* I 
L/LLPGFGTISHIVTYYSGKKEPFGYIG 
MV*AMIS IGLLGI IV*AHHI 


6342 


20243 


A 


6396 


101 


135 


FDSPTPGIVPGTWQTIRSLNVYLTRLTT 
SFSLMPS* FLFI FFCRDG/SLTILPRLV 
SNSWPQAI LSPQPHKMLGLQA 


6343 


20244 


A 


6397 


395 


187 


SSYLSLIiGIWSYRHVPPHPANFLFFCRD 
GWYFI FFC\ KMGAMLPRIiVLNS WAQVI 
LLL * PS KVLGLQA 


6344 


20245 


A 


6398 


35 


334 


QNETLS KTKKI KNKKKKKKKKRGGPLKK 
PLKGPKLTRACKKKPFSLKG/WRKKNNL 
GIFEKKPYFGGGKKWHKPPTKIKSFKEK 
KKF*REKGEKQPQNPGC 


6345 


20246 


A 


6399 


464 


146 


HXQ I RE FQRTPTRMS P KTSTLSHFKIKK 
QLVMNKRSFIKQWIS*QKPCQMARRD*H 
NIFTVQKGTNCQIR\ILCPAKLSFKNER 
E I KPFLGKYKLRBFI TA*FI FKG 


6346 


20247 


A 


6400 


723 


632 


FFN*DKI LLCHPGWS AVARSQFTAASTF 
RVQ\QSACLGLPSSWDHRHIPSCLATF* 
* KRGFAPLPRLLSNFWAQAI LPP WPPDV 
\LGLQA 


6347 


20248 


A 


6401 


522 


46 


FFPPFLSFFFFSKIPKQSLGLVFYF*NK 
VWVFCPGWRAIR* FPFPA/APPLGSKNS 
/SCLNLPSTWGYRISGMVPPKAH* IPPE 
I FFFF FF / CRQSIaALS PRL/ WQWQVQA 
ILLPQPPE * LRIiQVPGRW 


6348 


20249 


A 


6402 


103 


261 


NI PFTCLI KKKKKKKKXKKKKKKKKKKK 
KKK/IK* KKKKNVWGGWKKKLGGD 
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26 
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beginning 

nucleotide 

location 

correspond 
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Predict- 
ed end 

nurle- 

otide 
location 

ding to 
fast amino 
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residue of 

pepuuc 

sequence 


Amino acid sequence (A=Alanine • 
C-Cysteine, D=Aspartic Acid, 

E ias 021iitAmic Arirl F=3phenvlnlaninA 

G^GIycine, H-Histidine, JNIsoIeudne, 
K=Lysine, L=Leucine, MHMethionine, . 

1 ' rt^jja 1 dKlilv) I I 1 ullUC] 

Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W«Tryptophan, Y~Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucicouuc deletion, v— possiuie 
nucleotide insertion 


6349 


20250 


A 


6403 


146 


384 


GFLPSLSGHKELSLFFLFETESHPVAQA 
\ ITLGS LKPI^P PfiFK"* PQ C\ fiT .DP QWTiVP 

RVP P CS VNFCVS P Y* PG * SQTPDLR . | 


6350 


20251 


A 


6404 


3 


395 


ETESHSVAGAGVQWRNPGPLQPPPPKPK ' 
RFPYLSLPSSWDHRHMPPCPANSAPSVE 
TGL/LHAGQAGLQLLTPGDPPT*TSQSA 
RTTGASHRARPSLKYFLISFVTSSLLHG 


6351 


20252 


A 


6405 


30 


205 


CRLT R P AE FRTR KTHT \ RE KP YECKQCG 
KFFSCKSNLIVHEKTHKVETMGIQ*VMW 

T.TJT 
Li if Li 


6352 


20253 , 

\ 


A 


6406 


3 


444 


ARKITGY* I SKTI VFLYT/ DELAEKB I K 
KAI PFTIVPNK*DN*ELNLTKQVKDPHK 
EDYKILMKETEQDTK/WKDIPSSWIRI I 
Si I V K V AVJjS KAX xKrSAI L. JL KI R V Tt? K.T 
ECHWKKKIWIGKVAHACNPTTLENQGRR 
SLEARSLRLS 


6353 


20254 


j\ 
A 


6407 


A OA ' 

480 


307 


nunc oo a irvniirwnnvo /uncpnc /tttic 

PPPPPC*TPPPEP*YPPPKGEWGARPPP 


6354 


20255 

< 


A 


6408 


923 


833 


KAMKHFLIKTRAFFRHVTAHLIDQSIVK 
TELLYVTSF I AI FALLWWYGKEPML/WP 
GAAAHTCNPSTLGD*GGRTA*AQEFKTS 
LGS *NKTKP* CLSAV 


6355 


20256 


A 


6409 


382 


152 


ECSGTLLAHCNLCIjLCSSDFP/SLFHNV 
SLSSNSSS* VAEI I G LCHHWL I FVSAK 

r T5T PDT3T7DMT rGDTDVMMVC? 


6356 


20257 


A 


6410 


553 


173 


CLLI IKNTLPAKKKKKKKKKKKKKKKKK 
AR/WPP*K*RAP*NPVKTPGGSF 


6357 


20258 


A 


6411 


38 

- 


562 


FLRQGL / D SAAQAG VQWCNHS S QRAP P F 
(aJ^K^5H\PASIiAoWSAVA* SKJ-iAATST 
FRVQVILLSQPPE*LGLQSCSITQVGVQ 

YRRLP PRPDS S I TFS RKFFQ S PQARNLP 
PTPELKPQQPVPAPSPSBLHLSWSSGNK 
CHFLWFLCP 


OJJO 






04IZ 


4 J4 


144. 


SKS PASAS * VAGTTSVCYHAQLI FVFLA 
DTGFYHVGODGPDLMFCPPR PPKVLGI.P 

RLPLICF 


6359 


20260 


A 


6413 


3 


455 


AEAGGLLEPRSSRLQ/CSQP*SHPCTPA 
WAT 


6360 


20261 


A 


6414 


462 


162 


TQGPRXiFPHPFGAPGGPVPLGWEVRPPP 

ADPnTTPPPDPVPftX WCDDWPrOTJCTDDD 

LEG*ARKSP*PPKGLVPVTPKGPPPPHP 
GQENPTPFPKKKKKKVN 


6361 


20262 


A 


6415 


499 


0 


HFTSKHHFGFEAA\A*YWHFVDW*LFL 
YVSIY**GSH 








o*no 


1J27 
lo / 




MCHINGTMOFFPPPPPRMEVFFCPDAP'r 
GN*R/WPNLG*LKPPFPGLTLFSCLTPP 
RS WNKRPP PPARVI FLFFNKKG VFPIiLP 
GRV* I PHT PE PLATTNLLSVT IDLLI 
VSYKWDHAVFFFFFF 


6363 


20264 


A 


6417 


158 


412 

§ 


SSDPSRRSCLFSF/ CFFFLEREFFFAPR ^ 
VEWRGPNFCLLTPPPPGKKHFFAPSLRG 
GGDYGHLPHGPVI FGTLRKAG FS PCX3PG 
GS*PPPPGKKHFFAPSLRGGGDYGHLPH 
GPVI FGTIiRKAGFSPCGPGGS 



937 
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M 
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od 


SEQID 
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1TQCM 

09/515,1 
26 
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beginning 

nucJeotiae 

location 

correspond 

ing to first 

amino acid 

residue ot 

peptide 

sequence 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

IahA _n i_ti_ ■ t Jl 

last amino 
ricid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
jLP=vri iiia otic aciu, r-* J rnenyiaianine > 
OGlycine, H=Histidine, Msoleucine, 
K=Lyslne, L=Leucine, M=Methionine, 
N^Asparagine, P= Proline, 
Q=Glutamine, R=Arginine, S=$erine, 
i* 3 i nreonme, voyaiine, 
W«Tryptopban, Y=Tyrosine, 
X=Un known, *=3top codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6364 

< 


20265 


A 


6418 


457 


11 


HI^USPiRCSECQK/ArTQKSAITNHQK 
PQQKGRKSMQ* LRI * ESFFD * SHQLRHN . 
RS SMNTM I VMGPFGTLVGEKP YRCNE YP 
KPSFLKISLVYRRDLTRRI IGQAHWPMP 
VI SAL WEAEAGRS LEVGSSRPAWPT 


6365 


20266 


A 


6419 


103 


376 


SALSLILGITFFF* I FLETGS HF X AQAG 
VQW/ PIHTCDHS ILQPQTPGLKPSSCLS 
LLSSWDYRHVP PVPRTSACFECTILRSG . 
SVAHTVLWKAKAGGSLEVRSLKLACATQ 
GAPCLTKKKCDPQD 


6366 

* 


20267 


A 


6420 

i 


463 


6 

• 


FPPNPHKKQNQQTP \ PPKKAPQQQKYTA 
DP /QDP PGND P KAQAYKKS PAATYPAAS 
SSPSSPSS/RPPSTAAHVLRVPL*GSPP 
GLVGNPRPPPPGGAQHPQPPDTWQKRPA 
HHGLQSLSASPPGI S * EPQLAPRNSPHW 
TRDL 


6367 

• 


20268 


A 


6421 


1 


479 

■ 


RGFAVLFS AVE FPS S S KSHLDLETS PKA 
YKHGAPCSAPGSLQASIjPKGRNQRNPFS 
NFFVLF*DKISLCCQGWSAVIQS*LTVA 
STSPGSDOPPTTPPPPHAPDTWDHRHVP 
PHPAKFYI FYRDR/ SFTMLPRLVSNSWA 
QTVCLSSLFMLLPSliiiWSLSP 


6368 


20269 


A 


6422 


11 


242 


FEIIFSAVEFGWSDHGSLQTQPPGXSSD 
PPTLDS * VAGTTGLHHHAQLMF*LFVEI 
GSPGLKWSSVSASESEPLYPA 


6369 


20270 


A 


6423 


488 


335 


SCQLRPWNESIGQRSGIRWLTPVIPALR 
KA/ EEAGG* JjisPRSSRPAWAI 


6370 


20271 


A 


6424 


454 


229 


RDPPASASQSAGITGVSHHARL/VSLFL 
IVM*AEGCYIjSGIWRTRIiKMNPPQ*RKE 
RQIQVKSTRSDI FFFYFLFIi 


6371 


20272 


A 


6425 


1575 

■ 


1309 


FESESHS VT \RTE CSGTIS\AHCSRHRL 
GSSDSPASASRVAGITGVHHHAQLIFLF 
LVELE FHHVGQDGLELLT\ * VIHPPWP P 
KVLGLQA 


6372 


20273 


A 


6426 


276 


437 


CFHPPEPWDQQWGPPPIK3NF* IFwKKRG 
PPFVPQIILNPWPQKNFFAWPPKTIiG 


/inn' 

6373 


20274 


A 


6427 


AAA 9 

445 


^ a r\ 

240 


RPRRADHIj/RSGV*DQPGQHGfcrrijRIjKK - 

IQKISWVWWHAGTLWPFWERJDGSHRLSA 

RTQMLGSWACSPGF 


6374 


20275 


A 


6428 


114 


450 


LLLLLL FE SES CS VAQA/ PSAVL / W X LL 
GSL* PliLPGFKQFLSIjSIiliS I WEYRGIiP 
PYPANFFKFLVEMGFHHVCQAGLELLTS 
GDLLAS AS QHSG I TD VS HRAQPVT I FLK 
GLS 


6375 


20276 


A 


6429 


380 


224 


T Tit TOT CTI/A 7\ TrtnAt?UPnT *Tl*OtV^T V/^ 

JjRHCLc 1 QAC» J.yRQEHGSIj w « l S PGLiKQ 
SSHLSLPSSWNfY\GVKLPCPNKFQH 


6376 


20277 


A 


6430 


509 


228 

* 


PHNKGRVSPPFPQNFr 5 PPSKNIiGGGWL 
PKIPPPKKRVFSKKPPRGKKK/PPP*KK 
KKNFS PPPFFGPPPKI I KKPPPPFPPPT? 

FFFFFFFFFFFF 


6377 


20278 


A 


6431 


3 


117 


HS WES I * KRGACFN I F X AAQFTIAD I WN 
* FMCLSGDE * IKKLGLI YT/MDCYWOVK 
KKEIWPIiATTWIELKVNILSEIGQA*KD 
KYHVF*MNR 


6378 


20279 


A 


6432 


417 


271 


LSKKTKFRNSPQPPLPCSFL*NDRK\WP 
RVWHTCNPSTLGGRGRWIT 


6379 


20280 


A 


6433 


385 


145 


TPGX KQS S CLHLP KCWDPSCE PPHLANL 
NF/CKR*YIPIVPNSKV*TFSQEVFSYT 
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\J VJIj i.llic, Jl xildllUlUC, 1 XjUKUCIQCf 

K=Lysine, L=Leucine, M=Methtonine, . 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Sertne, ~ 

M. 1 III Cull 1 II T V UlIUC) 

W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














. WPHS S S FTGKEVI I LFPKECPGMFK 


OJOU 




a 




OD 




KP*GDKIKTRQKKKNFFPKRGGKKNQPG 
KETCKKNNFWGGKKMEKPP \ K3CKKTCKBK 
KKI * KKKGEKKPQIAPEKKKGS 


6381 


20282 


A 


6435 


3 


454 


EIQRTPVRYYTK*TLPRHIVTRLSKSTL 

KLTVNF+ KKS *RREKWG\P I FNILKKKK 
P* PKI FYPAKLTFIREGKKKFFPKKQGL 

PQNTLKYIiAHNHP 


6382 


20283 . 


A 


6436 


424 


410 


NFFFPPQGGGGGGFFF*DKPPPGNPSSL 
FFFFLGGGKKRGPPPPPKG/QTPPPGGL 
F* KWLGGKKETPKKFS PPP PFFGGGGKI 
rr * KPPPPPGFFF* KS PPGGGGPPSVKK 
KKEKKHKKKKKKKKK*R 


6383 


AMD yi 

20284 


A 


.6437 


3 


214 


E IDCCAAVWKQPKCPSADE * RKKMWYLY 
TMEYCSATNKNEIQSF/VNNWKEVEIIM 
LS \ KKS QAE KEKHGX F 


6384 


20285 


A 


6438 


83 


420 


QSFP* ISCRKYS /WLGAAAHACDANTLG 
GRGGRAAWGQKFKTSLGNLEPGPKTADP 
RTSHGCELRRSS YTYALTHGFS STPETA 

ot37v dt t DnoomD'mDt'rvWBt'nT \rwvrvr t 
K P APJjJjF P PP P PTQKCiDi KEEDb xDDLjIj 

NK 


6385 


20286 


A 


6439 


163 


289 


KMYCYYYY/ C*DR VLLMLPRLVSNLWAQ 
EILPPWPPKVLRLQA 


6386 

• • 


20287 

• 


A 

• 


6440 


459 


431 


DKRIRTRITPLAIRKRQIKTTV/RKQIjQ 
SHYMVWKETTNNK/CGIFIHSWWECKM 
VQIJjWit T V SQVFtQQjDUPxNIiATLLI F I 
CQLQI KTLSSPKYLHMKVHS 1 1 IKIRKK 
Y I KKNL KC PSTDEW 1 1 KI * C * S I HTMEN 


6387 


20288 


A 


6441 


478 


36 


VFGCRVILVEKLFLCEPLRVTFLRIiGKQ 
LEKSL/WNFHDLELNISLHKNQPNRCVH 

nHDTAAWD Q*OGT7+ *19 /f^AWr VTQXTT. r P 

LIN* I PSCHTAMDI 1 1 IAQH/ ITHTVEY 
YSAIKKNEIMCFPATWVELEAI ILGEIT 
QKQSQIWRSPP 


6388 


20289 


A 


6442 


464 


206 


LVQRSGIHLPYSSTVPL*GL/HQKNEAD 
VYRNTRMFTAALFLIVQNWK*HNYLQTG 
EKINKLPYIHTME YFS VIKSNQHS IT 


OJfiV 


20290 


TV 


£ j| A^ 

6443 


A % A 

414 


216 


x ICFIjluObR^WVAT*AbbKJESWIN 
KLWHIHTMD*Y/SSNKKNKLQIOTTWTO 
LKS IMLTE I SLT 








CAAA 

6444 


7 


A 1 1 

411 


or IdiHor At bKVo zLWIVXsjX iAf ijl irih 

LSRSNSLHVFSW\WD/CKMV* PL/YETV 

WQFLTKLNILLTiHDSALrLLGIY*K/DM 

KSi^TKMCTQIFlJ^FIlAKTWKNL* 

CS S VGEWRNKLWYLOTME YSS SVLKKVn 

ELSSHEKTWRELLRERSQSKKAIYCIFQ 

QYDTLEKAKLWRQ 


6391 


20292 


A 


6445 


257 


419 


KSLPFFF/ CW * ETGSHSVTQVRVQWCNH 
IALQPQTPGLKQPSCLSLQGSWDYKFHF 
TQACPY*RP 


6392 


20293 

« 


A 


6446 


20 


417 


HQYCFAS FFFFNNFFKTESHSVAEAGAQ 
WHDLGSLKPLPLRFK* FSCLSLLSSWDY 
RCMPPHLDNF\* FLVETGFHHVGQAGLE 
LLTSDDPPAIiAS* SAGITGVSHRARPVE 
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nucleotide 
sequence 
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peptide 
sequence 
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M 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i . 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G°G!ycine, H-Histidine, I^lsoleucine, 
K=Lysine, L=Leucine, M=Methk>nine, 
N=Asparagine, P=Proline, 
Q=Glutam!ne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=»Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














LFLLRRQGLRLLQNIGIGRQ 


6393 


20294 


A 


6447 


371 


3 


VIGCQFL/RKTKRYLPYDLPIVLLCIYP 
R/EKKNLCRH*TCTSMFMTALFIITQH+ 
KYP KCHS I G E WLN I LWH I HTME YYAEMK 
RNKLLIHEKTYMDLKDILFSGGQRVYLK 
SSILSIKCIRRQE 


6394 


20295 


A 


6448 


46 


332 


LGEHLRQGPI ILKVCQVILTHQQYFESQ 
WSIMALSNRKYNANCRyjJLKCPNSYIKK 
EML/ WPGAAAHACJNIiSTLGGQGRHIA* A 

■■• g v ■ ^m^ mm mm m^m<^^ mm*^^ ^m^m mm ^m m^mim. mm m m m^ 

QEFKTIAKPCLY 


6395 


20296 


A 


6449 


2 

• 


381 

■ 


IiCSLVFTHI CS I FLFFFFFFKDRALLP/ 
H/PCWRAGP*SYLTAASNSG\VKQSSNIi 
SRGKPLCRANFGSF\ CFWFWKKGGPTMW 
PRLV*N\ ILLPQPPKVRGL* AAAHAGGQ 
GHDHTSLRPOTPGLKOSSNLS RGKPLCR 
ANFGS FW FIj VLE KRG S HHVAQ AS LKQS S 
CLSLPKLGDYRHEPSHRANMFNFRSLPG 
IiRKADCLYPY 


6396 


20297 


A 


6450 


2 

* 


136 


KSVGCWRGCG* IGAL/ LHCWWECEMVQL 
LQNAVWQ FLKNCKI VPHY * A/ 1 PHLG IH 
PNALKTATQRN I CAP M F IAALCATAKRQ 
CK/PKCPS IDKT* YTHTMKYDSA* KKCW 
LLARVWINWSPVHCWWECEMVQLLQNAV 
WQ FLKNCKI VPHY 




7ft?qR 


A 




314 


72 

• 


MLPLTNGSCRTKVSKDIEGIiSNS inplp 
L I GI YRTPF PKTAGTP FFFFFIiV/ PNGT 
FTKIDHILGHKTRLNI LKMI * ILKYIP 


6398 


20299 


A 


6452 


54 


350 


LRSFCSALFCFETVLLCHLGWSAVA*PR 
LPAGSSNPPPS/CLPIPSSWDHRLVPPH 
QVFLEAFFFFFFF/ CRKGVFFFLPGGV* 
NSGVKGTPRPNPPKALN 


6399 


20300 


A 


6453 


368 


172 


WLTSVI \ QRFGRPRREGHLRPGA * DQPG 
LR*STHLRIiPKYWDYRRESQCPARPVIL 
KEANMFHLG 




9ft301 

r 


A 

• » 




i 
i 


365 


N I LKLNNKLIiNNL \ VKEEVAKE I XK* L V 
LNKNENTTYKDLWNEAKTVLHL*H*VKF 

IAJbSA* ri vsnqcs k* astlrnhkkksn 

LNPRLI *QQKSITLKREKSMKPKAGSLI 
RSIKLIIRLT 


6401 


2.0302 


A 


6455 


364 


1 


PETRSCSVWPRLGGVQRQDHGSLPPLSP 
RLKQFSHLS LPSS +NHRGMPPCPANLLF 
KFYLFIYL/FCLFVEMG3HHVSQAGLKF 
XjS SRDPS ALVSQS VATTGMSRRAWQDI I 
LQQKQAATAA 


6402 


20303 


A 


6456 


91 


300 


GHARHFGRPRWGDCLGPEVRDQPGQHGE 
TLTLQK\ KNKN* PAWWHTPAVPATQEAG 
ITGVSHRARPCPPN 


6403 


20304 

. 


A 


6457 


2 


195 


RFSCIiSLPSSWDYRHEPPRLAIFFFF/C~ 
FWNFKKRGGFPLWARWVSNS * PRGIGLL 
WPPKGWGIiRA 


6404 


20305 


A 


6458 


522 


202 


THNWRPSLQI FTAALFLLAKKWKQPKCP 
STDEMIHKMWYVHKMKY*SVIKRNEVLI 
HS A\ KWMNFENMLNEEACWERPHIV* fh 
LYEMSRVGKTGETKSRLVW 


6405 


20306 


A 


6459 


377 


161 


EMNIKCPSTDK* VKKRY/ LY IHTMEHCS 
ALKGNEVLI IHAMTWMNLANIMLSEKSQ 
SQNTTYSMITWKCLE 




20307 


A 


6460 


92 


484 


LLQACFLRTVLIDYKMWAEKGMVGMCVL 
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sequence 


M 

eth 
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f SEQID 
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26 
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nucleotide 
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ing to first 

amino acjd, 
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peptide 

sequence 


Predict- 
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nucle- 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine ! 
OCystei ne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OKSlycine, H~Histidine, Msoleudne, 
K^Lysine, L=Leucine, M=Methionlne, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, $=861106, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *«Stop codon,A 3 pos3ibte 
nucleotide deletion, possible 
nucleotide insertion 


• 


■ 


■ ' 


• 


■ 




MRNMTVIII FRDRVLL* HPGWHAWRSQ 
LTSA/VKRSSCLKPPKC*DYRCEILCQV 
EIAFTHRSLFNSFLKRSMRVILLNYWKD 
PSLGFCSYDSTGMALLRNFS 


6407 


20308 


A 


6461 


483 


192 


MGLML ILRANFQSTKKKM I CLLKKSLTF 
LNCIVLSNWPCTFNMYIFVCVFAVYKLH 
VWAE \ PVAHACNPSTVGGQGERTA* AQE 
FBTSIjGNI irpcly 


6408 


20309 

» 


A 


6462 


451 


98 


KQVWQLL* KQVWQLLBSLN14KLLYDLAV 
LNLGAYPREEKTHVHTKTCTQMFTAAPF 
I \ 1 KK* KKNFS IDMKYEERDKMWTI HVT 
QK5fLIIK*SID\Y7VPTWMNLBNIMLSED 
SQTQKTTHCI IS FI 


6409 


20310 


A . 

• 


6463 


395 


67 


FLRFWCTHHWSS/DILYPVCSliIiSLNRS 
HS FPRVHRSP L YHS YAL CPHSLAPTYE * 
AHKMF /RLVNFLFLVETGFRHAAQAS PE 
LPSASNPPFSASQSAGTTDASHRAQTK 


6410 


20311 


A 


6464 


485 


220 


F F FETWSHSVTQARVQW CNLGSL \ KPLP 
P\GSSDPPTSAS * VAGIAGMSHAPTHEI 
PS IQARQQHLQGEKKIELLSLPTSASQV 
LRLLA 


6411 


20312 


A 


6465 


1 


371 


IS I P PKKYLR I NLTT YVQG LYE QNY KTD 
KRNQGRSS KWRDGPCS R IGKLNIRMSVL 
SNLIYSFNVILVIVPASYFVSIT/RLIL 
KFT»KCSRPRIVPLFHT,TiLPIiLVPFVCS 
FILCRPGASLVK 


6412 


20313 


A 


6466 


489 


175 


YKI IEISRCIK/GLL*LWRTTHLFSHPH 
WRRGMAYEKNADFRLGTVAHAFNPSTLG 
G*GR/WNQGFKTSLANWKPCLY 


6413 


20314 


A 


6467 


2 


367 


FGTETLGQAGIKGDPGEKSHWGEGLHQL 
RBALKILAERVLILETMIGXiYGS*LTEZj 
PAAAGSAGAPGQTGHPPGSHHLA\EPEL 
GS GAG P AGTGT P S LIiRG KRGGKATNYR I 
VAPRSRDERG 


6414 


20315 


A 


6468 


472 


260 


TTLGGAGNSG YRHI QPHVANYLFI CRDR 
/SLILLPRLILNS*L*AILIjSRPPKVLV 
LQT 


6415 


20316 


A 


6469 


88 


435 


RGVLLHMLSCLPLCKTCFCSSFTFHHH* 
TS FS FS FFFFLLGNKVS FYCQAEGQGPN 
LG*LKPLPPGLKQFSCFT/LPEKWE*QT 
EPPGPVKRHFGLSYFPLTRG 


6416 


20317 


A 


6470 


34 


106 


EFAIPFLTIYPIELNTHVHTKICI*MST 
AALSTR* KSPTCPSM\ INKMWYIHFMEY 
YSAISKGRNHANVHQWIHKM*YIHTVEY 
YSAIKSNEALTYATTW/MNLENIMLSER 
SQS*KICI 


6417 


20318 


A 


6471 


390 


92 


DFLIDAINHNLKVFFYS *FISMC/HPYN 
*SLSFSNIQKSIiIKTMINVSI*SFKY*YL 
IYFS*KKTF*N*RQGLTMLPRLLSNSWA 
QAILLPWPPKVLELQV 


6418 


20319 


A 


6472 


414 


167 


HIHIKTCA* IFSAA* FIIAKK*KPKCPT 
ADEWINKMWYVHTIEYYSAIKK\NKLLI 
PWMNLENIMLSTGNQTQKATYCMIPFI 


6419 


20320 


A 


6473 


3 


244 


TTILLGTTYDALFPELYTTYFVTKTLLL 
TSLFL* \ IRTAYPRLRYDQLI HLL* KNF 
LPLTLALLI *HVSI PITISSI PPQT 


6420 


20321 


A 


6474 


12 


224 


IASGRPFFCYDVCV/ VKRLCRHS I VI IM 
♦KKKKKKKKKKKKKKKKKKKKKKKKKGR 
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26 
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peptide 
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Predict- 
edend 
nucle- 
otide 
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correspon 
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last amino 
add 

residue of 

peptide 

sequence 


Amino acid -sequence (A=AJanine 
C=Cysteinc, D=Aspartic Acid, 

E=Glutamic Add, ^Phenylalanine, 

* 

G=Glycine, H=Histidine, I=lsoteucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, r^=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W«Tryptophan, Y=Tyrosine, 
X=Unknown, *=$top codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGALKKKPGGE PLFPH 


6421 


20322 


A 


6475 


417 


155 


FQSFTENKMLSSSITCSSEIFGERKSQI* . 
M*RPSLL/SY*BKLPI±LASTTVTSQRPS 
AL* VQPL / PTAKNL * LTAGS KGHYYFIiN" 
SKVFSN 


6422 


20323 


A 


6476 


1 


131 


RTRPPTRTRTRGRWLTPVIPALREA7BE 
AGG*L£PRSSRPAHAI 


6423 


20324 


A 


6477 


830 


513 


WNSVKESLPADTWFLPFLGPLIDIFLLL 
IFALCLVNLLVKFVSSRLQ * FHVKMMLA 
QGFQPIPSSDPEDKDILPLSLLEQVS\G 
ILPLQC*AGSMPITSAGSSYRR 


6424 


20325 


A 


6478 


399 


196 


S KQS SHLS LLSR * KYRCAP PWWAFF/ S F 
FLINI FCRDE / SLTMLPRLLLNS * PQVI 

LLPWPPKVPGLQA 


6425 


20326 


A 


6479 


1 


411 


NTGIMPANTVSIT1T1T.IDQEVLVTVKTCY 
LRNTFCGFSDGSGHSV\QTFWIGSTILD 
AIKSL/R/DSREDGKISTLIGIRKK* I P 
ILMEDLVGFTCTSQEEGTVDGLNLA*ELQ 
VEGPRDETELL* PQDQTCMDEKFLLMDE 


6426 


20327 


A 


6480 


381 

* 


162 


ag fthhnfffl p * dmvllchpg ys awq 
swltvasnsw\vk+sshlsfpkh*dylq 
avhfrmasnspgly£lyq 


6427 


20328 

* 


A 


6481 


379 


85 


S IL FFLFHP INF FF F FF* AW I F 1 1 PPPH 
\RAFKWRWGFFLADSVWFIKRLPQYY 
FFFFFPGFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFtiSVG 
GG 


6428 


20329 


A 


6482 


355 


117 


PKVKIFPPPPLPRKWDPRGPP* SPGTNF 
SIFFLVEKDFPHVAQAGLKLLDSSSTPS 
VAS\QSV* ITGMSPHAWLQVLLSL 


6429 


20330 


A 


6483 


402 

♦ 


97 


TAFHLLDIYHKS*KIMFTNQPLYKCPLF 
IIAKM*K*LKCPRRADKQIVA/WCIHTI 
E YYSARKRNVLLKYTRLWMNLKS IMLCK 
RSQSQDVTYYVI PFICHS 


6430 


20331 


A 


6484 


714 


365 


NGFALVAQAGV / QWCDLGSLQRLP P XFK 
QVSCVTLLNS *DYRSMPPHPANFCI FSR 
DGV*SYWSEWSLTPDLGLPKCCDYMREP 
P CP ANAKS FTVE ETHVTYQA YETMIif SIT) 
HDESG 


6431 


20332 

• 


A 


6485 


408 


146 


LKTETQTETCPPMFIAALFTLV/ RRWNQ 
PQCLSTDEWISQI*YICIMNINSDIKRN 
EALKHAVICMNLKNMVPFISPTEKGKKK 
EKKK 


6432 


20333 


A 


6486 


243 


11 


GMSQYQE YS KSNFFLSNRVLLCH PGWSA 
VAH/SRLTEALNRFK*SSCLGLPKLWDY ' 
RR VPLLLAQNATR * NS L * S C PA 


6433 


20334 


A 


6487 


403 


161 


DMVSPRWPVWSQNS \SLK*SAYLSLPRC 
WDYRREPS CQASRNTS IGRGNRKNQWNT 
CM YLL F I L I PLKKYMCNGRVME YFKT 


6434 


20335 


A 


6488 


292 


24 


DWRGRMAI P LKKGS KC * QGCRNTGLLRY 
* W/ME CEVAQPLQEMV* WFLQRLNI KLS 
YDPDFy PLLGI YPKE * KAGT* TDICMPM 
LVATLFSG 


6435 


20336 


A 


6489 


3 


203 


SSWDCRCI PPCLANFIiICCRDR/SLTML 
LRLVLNS * TQAVLLP WP P KGMPGLLFVF 
YGGIVLSYVNIS 


6436 


20337 


A 


6490 


413 


30 


RGP/ YQPGFLGFSPPNGLGFG* KVFLPL 
PPPPRKGPGKKGSPRARGFPGQAPPKKE 
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ding to 
last amino 

ubIU 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
Fy=f2lutamif Acid P^Phenvln Inning 

AU MIUlllllllV /IMUi A A II ^11 YI«lliUf |||C| 

G=Glycine, H=Histidioe, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 

N=AQnaracrinp PssPrftlinfe- 

1 ~ xA aglUC) I "A I Villi 

Q=Glutamine, R=Argimne, S=Serine, 
T=Threonine, V«Valine, 
W*»Trvnhmhan Y=Tvrosinc 

X^Un known, *=Stop codon, possible 
nucleotide deletion, Impossible 
nucleotide insertion 














. NQRGGG F WS RGRLAKKKGFWFG *GPAKG 
kkfffflg agvwglkvl i p i p p 

PPLFFFFFFTjQTIHPIRG 


6437 


20338 


A 


6491 

* 


101 


335 


SSSFLGFPKYWDYRREPL/WPS/LWQFF 
RXjNIHLLYNPSIPILAVYPREMKTCDLI 
KSCMRMFTVAVF\ * ITQTWKQPNVF 








ha no 


532 


127 


LYT\PLPFPSSRYQQNGPCLSSPKLSPS 
PELGTQSFPVHKSFSPOiPSPKPLTVPP 

ft T .D G TCI O Q n Q T . * KTT Q T . & Q T .T . WTWCXIT'D 

PAPS CNPE S WGLDATAAI STCFYO 


(LA 7Q 




7\ 


o4yJ 




1 


SWPLCLPTPASAMVDAPPPA\SLPSRSS 
ILDCCISSEQGSVGMGPAEPGAG'NLLV 
CCLLRPLEKCSIWAGVSRFSWYSLSQLP 

T.a P W Q DWPT -C PPTJP ffMPP DVC 
AArUuvvJ X\0 if XT XjO £ t*M iafwlXrXUr Vw 


6440 


20341 


A 


6494 


319 


75 


KKIWGPLKKKFGLQTRRPKKWFFKRPPP 
FFFFFFFFFFF/HVAI FLLFCRDRG\LA 


6441 


20342 • 


A 


6495 


333 


190 


LFFFFFFFFFFFFFFFFFFP*GFFFFIF 
YSFRDGVIaLCHPGWSAAMPS 


6442 


20343 


A 


6496 


1 


299 


LPRLE YSGTITASNSWAPAI LPPQLIK/ 
wn7T?or , r*#<^D\rr TV! "OUT t/t mod dotttt 

PPPPPKSAEITGVMHHVWPKKIISNEKT 
THCSCSALLAKQQISS 


6443 


20344 


A 


6497 


51 


266 


PVNVILPSSWNLRHSPPCPARFFVLFCL 

LFFGKN/RGLAMLLRLVSNSWP*VILPP 
od dp * Turn .n vn mtwit.q g 


6444 


20345 


A 


6498 


366 


78 


LLAHCHS SNNKSLL I LDNFNI YHYGPSN" 
TVTS*F\*MSSLMILLSSP/C*GTSWSY 
LFYALKKI FCKD/RGLAVLPSLV*NSWP 

y V v J.UJOVW rr ISJ&JLA^JuvV 


6445 


20346 


A 


6499 


37 


396 


FPRPTRP KSS S PAKATQQNTIS FKKKKK 
SKQKKGNTVI I IRVKNQRKRKQKN* KLK 

0*T<r*FO*K"T7JTrTG?fPT.aKT.T /VTCWWT 
V « ay " zixiYrv.xo A^rju/ijrvijx / xiajvxi v a. a 

*IANIRNERSSITT*STDIKRLIWKYY* 


6446 


20347 


A 


6500 


706 


394 j 


VPLLLPRLAVQWHNLGSLQPLP PGFKRF 
S CTiS LliS S * DHEHTPP("!PA /N<3 FVPT.VB 

TPXHHAGQVCLKLLTSG/DTPSSASQSA 
GI KGMSHRTQPIEI LFYSVFF 


6447 


20348 


A 


6501 


403 


216 


ILNVI^FK/Nr^Y*HYTTVTJRTVRPLSP 
R*ATVTCTYTSGWPEAAEDPQKMTFPHC 
DT.FLPHPN 


6448 


20349 


A 

* 


6502 


377 


116 


LLPY*EN*FLMYLPPICCLPFDFGNSGF 
CHQKICNFFFFFKTKSLSVA\RIiEOGGAI 
PAHCKLCLLGSSDSMKNFLNEAE FPNLA 
FNGIi 


6449 




A 


6503 


j 


401 


CVGHLRAS WS SRWAWAWRAPHSEQPAGP 
AAPGNEGLS IRASGCGGCTGYPSSASPP 
ALCSIS\PRP*LPSRRAGLATCSPPCLS 
LPPPPWAPVRPEPPRRVPPPAPWRPVPS 
TTQGLRSASARHRTGRQLHLQ 


6450 

— - • 


20351 


A 


6504 


406 


55 

* 


WPKGFTPPRFPRKEGSP\PPFPKVFILE 
TRQPGPQTKPPFFPPIFKLTRPGGRPP* 
FPPPQRVRAGKPP*PQSFRVPGAIFPPP 
PPPLKGHPPRP<JFQKKKKKERKKRPTRK 
RGGT 
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Predicted 
beginning 

nucleotide 

location 
correspond 
ing to first 

o m inn f»r»iH 

dllllllU aLIU 

residue of 
peptide 

cpniiPftoA 
5Ct|UCUUC 


Predict- 
ed end 

otide 

location 

correspon 

rl in a fr» 

last amino 
acid 

roiuuc vi 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

F^=Cslutflmic Aftrl KWPhenvlnlaninp 
L/^uiuiniiiiL m.iu, Mr Jt livll yinutniUC, 

G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methtonine, 
N=Asparagine, P=Proline, 
y-'Uiuuniuic, i\— Arginine, d* - oerinBj 
T=Threonine, V=*Valine, 
W=Tryptopnan, Y-^Tyrosine, . 

Veil nlf nrtWft •""a=^fV#*rt f*A#lnrk /^f\ /\ rciKIa 

vv B uoKiiown, ^^rop couon, / possioie 
nucleotide deletion, \=possible 
nucleotide insertion 


6451 

* 


20352 


A 


6505 


3 


281 


F FFCDRVLLCHPG W SAMVRVRS RLTATS 
/SFLGSSDSHACLSSSWDYRHLPPLLAN 

QPPKVLOLQA 


6452 


20353 


A 


6506 


i 


247 


LGS LQ PP P PG \ S RHS PAS ASQGAG IADM 

CUDTDnC T17T7T TD "TTVTXTTTTD V * TTTXTMMT'PT O 

CTTAYKYEILRHSFNKICARSLYEKL 


6453 


20354 


A 


6507 


34 


47 


FDPNLTPRHIRI ILWAVNKERMLTASRK . 
KKQI I +RSSNLSGDRLLNEEAS WEP* CK 
KSMQETIQATEGVJNDIFKVLKEKKNL\H 

FTVTKPIIiQKC* RENQRTFDLTQI 


6454 


20355 

+ 


A 


6508 


403 


106 


HYVPPPSPPS/YFFFSPPGKRGFFPNPF 
ILVS PGFFPS PLF * KPPPE FFFFAPLKK 
I era P PRELiKr F F F KGPPP FF r F FF 


6455 


20356 


A 


6509 


389* 


230 


LSF FL FFRDKVS LCNPGWSAV/ VTFELK 
RSSCPGIiPK/SQSHCTKPQPGF* KYIYF 
SFFIK\KR*SLTTLPRLVSSSWDQVIHS 
ALWHPKVLGL*AVAHAYNPSTLGCQSAE 


6456 


20357 


A 


6510 


101 


419 


ISFFFFFFFFLKKFFFFPPGWRGGGEF* 
LPQTPPFWVKKIFFPPPPKVGGFKNSPP 
PPFFFFFFKEKGFS PVCPGFFKTNPKK/ 
oPPPPPPKGV<j1j<^WPPPPGWDP | 


6457 


20358 


A 


65U 


41 


376 


EWWCETRGGAGQGRAHGAAGGATGRVIiA 
I QARKRRPKREKHPKKVSCS VAQAGVQW 
RDLVLLQPLPPGLRVSLCHPS/WSOGGV 
VoAki w JNI? CJjfv/ 1Kj^VLiPCCAQWS*TP 
GL 


6458 


20359 


A 


6512 


2 


332 


WEKVKIATLTGFW K* LI/SGSS RTLMGD 
FERFKTSVEKVIADWEIARELEliEVEP 
ENVTE LLQSHYKTPGTLAFS RLS PDVS L 
YSQLPETRSHEWQAAS KIQPRKASTSQ 


6459 


20360 


A 


6513 


3 


294 


FFFSEMESLSVTQVGVWWHNLGSLQPPP 

T5CJPV^\T«e /**T OT no PIiTTWTiYl'IV TITTY TV /VT*"» 

PE rKQFS ClaS-LPSS WDYRHAP PHIA/ NC 
FCIFNRDGVLPC*LGWVSNSHIjGFTO^S 
HCTQPLltALS FLHG 


6460 


20361 


A 


6514 


1 


526 


FRLGRKPR* GGVM* PVWSRGE PGSVGAE 
A«r/ K4S w b AirKKijJjHHPAAIjJ-iA iv?ljS ASG 

RRSAR WKMERASGLS PGGGLGATSRQMS 
PGTQLANPPDHGDKDCLGRISPGSGKQI 
QAAGQLPGPPTSLAPAQGRLRSLTPWGL 
QTPEHSEPEGIGHLQAATEAVLPHSTQN 
LITKRNLM 




ZUiOZ 


A 


ODI5 


747 


451 


SGWDYRYRPPNPANF\*FLVEAGFHHFG 
QAGFELLTSGGPP ASAS QSAG I TGMSHH 


6462 




A 


6516 

• 


AAA i 
'i't'w 




PGGGGPFFPPPWGGKPGKIS* /PPGGQG 
SNKPKLPPCLPPGGPK/ GNPPQKKKKKK 
GFGPGWAHACNP 1 1 PALWEAEAARGLP 
PDAGRYG PGGRREAVRRE AGPQECQDPE 
GQV*GDPRGCGAAHAPARGHPRKA*RGA ' 
EGSSPGLPTK 


6463 


20364 


A 


6517 


257 


575 


ASSVTWAGVRWCHLGSLQLLPLGFK*FS 
CLSFPSGWD Y /GHAPPGPAKFC I YSGDG 
VSLC*PGWSQTPHLR - 


64^4 


20365 


A 


6518 


337 


389 


SQHF V * I S CLHDM PALASQSAQ I TGVSL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
nentide 
sequence 


iM 

eth' 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 
peptide ■ 
sequence 

■ 


Predict- 
ed end 
nucle- 

II W vlv 

otide 

location 

correspon 

n i no to ■ 

last amino 
acid 

residue of 

nentide 

sequence 


Amino acid sequence (A=Alanine 
CXtysteine, D=Aspartic Acid, ! 
&=Glutamic Acid. JF^Phenvlalanine. 
(^Glycine, H-Histidine, l^lsoleutine, 
K=Lysine, L=Leucine, M=Methionine, . 
N=Asparagine, P^Proline, 
-0 = Glutarnine. R=Arffinine. S=Serine 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide deletion \=nn«ihle 

nucleotide insertion 


• 


■ 










. RARLIFIFR\MDSCYVPQAGLE/PPELK 
RSSHCGLPKPWDYRHEPQAQPVCRPIRA 
TKCAYVSTOALS PAP VOPGMTP PAKRGL 
PCQPPGSPCPSLCHHQLAGTPPYE 


6465 


20366 


A 


, 6519 


61 


323 


FMWDRVARKGS I EGASLBLPLGGSVGGH 
GSCISMWGALGHSS\THV*HTHrHTHT 
HTHTOTNTHTPS SDSKKSS IIJ3KBRAEG 
CMTP 


6466 


20367 


A 


6520 


399 


101 


KIQIKATMKY/HNIPIRMGFFFKLSSQG 

SVW/RVFLKLNIHPIiHSPBI PLIOTYPR 
/DMKT*TCM*MFTATLFITTYS 


6467 


20368 


A 


6521 


1 


276 


RTRGFEGLRFFLGFFGTVLLCCPGWSAV 

WDYRCIPPCLANF* *R/CGSWAQVILPP 
QSPKALGLQT 


6468 


20369 


A 


6522 


280 


111 


GDDPSCSMGGGHCIiASRSB*GSVGVGPS 
KPGAG \ YNLLVCYLLRPLEKRS I RVGVS 
QFSRCHQSWLPFDRKGNSPTPCTYWVR 


646V 

* 


20370 




6523 


2 


327 


olir bo W U/ x Kill FFrliAjWr Kir wK/oAIj 
TMLFRLVSYSWPQVILLPQPPSVGTTST 
SHCPQPCIMFLKADLNPVRCQFRCALAP 


6470 


20371 


A 


6524 


298 


414 


PGPQSETLTKKKKGGGRLKGSNFTPAGM 
BGN IFF/ CFFLS KSHSVAQARVQW/ P * W 
LTAGSLQAPPPRFKLFSSI»L*VAIYASP 
TLLSS *DYRCTPPRLANFCIFPRDGVSP 

AGDGFRM 


6471 


20372 


A 


6525 


376 


180 


KCKKYPGGVAHACSPS *DYRHVPP/RPG 
YFTiHF * OFCHFGO AGTjDLPTSGDPP A<? A<5 

Q3 AG I TG VSHRTQ P QLNYF 


.6472 


20373 


A 


6526 


415 


86 


LYLP YDLAI P FLGM Y PRGKKKCAHTKI C 
TRIFTWLFIIARK* KRSKNPSTREW/I 
KMWNIYTVESHSVIKRKBLLIHASMILV 
H* /WMIFF.NIILSDRSOMOKITYCMVPF 
T 


6473 


20374 


A 


6527 


391 


166 


FFFFFRDWFSLSHPG WNAE \S *IiTVISN 
S WSOARIiI FKFLKLROGLNLLTRLVSNS 
RLQGILSSWPAKVLGLQA 


6474 


20375 


A 


6528 


544 


176 


TMRYLLTP VGMAI I KKAK\NT* C * *GGR 
E/IGTL/LQCR*ECKLVLPLWRTVWRFL 
KKTAIELPY7^PFLFSLEYGLFST/5STN 

CAQLRSVYRKAVQNHQGVSGECAGFQDS 
SHSAPVAI PTARQ 


6475 


20376 


A 


6529 


392 


139 


TYVKMHCK/ WLGMVAHAWNPS TLGGRGG 
QIT*GQASTKLSHSWGLSRBSRFWLBCY 
SRSCS FCSCRIjPADCTSIjTDSKO 


6476 


20377 


A 


6530 


362 


87 


NGPP FFFFFGE I DKL ILILLWKCK/RPK 
I AKTVFKKKNKVGGL I * HYFKAYYKGTV 
IKLVQFWHKVRKTDQ* SRRBSPETELHI 
YVLVNARDSA * 


6477 


20378 


A 


6531 


1 


346 


APTRPRLKKTLS KLKTTNLQLP YDPAFS 
FLSVYP REMKT * R \ FAKI CV * MFMAS LF 
MI VTTQ/ NVPRCPLMGEWLNKWR\ * FIT 
*YHTGMKNT\VMHKTTWMNSQRIMLNEK 
SQSLTY . 


6478 


20379 


A 6532 


3 


294 


FFFFRDRVFLYHPGWSAVV* S* t'TAALs 
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SEQID ' 
NO: of 
nucleotide 
sequence * 

'i 

• 


I SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: In 
USSN 
09/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 

■ 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alantne . 
C=Cysteine, D=Aspnrtic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G^GIycine, H=Histidine, P=Isoleucine, 
K«Lysine, L=Leucine, M s Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginfne, S=Se rine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X^Un known, *=Stop codon, ^possible 
nucleotide deletion, V»possibIe 
nucleotide insertion 




■ 










PWLKRHSHLSLPGS *GHRMH/ PPHLGNF 
FV\FCLFL*RQCLAMLPRQVLRSWPQVi 
FLP W P P KAVLGLQV 


6479 


20380 


# 


6533 


2 


280 


PRY LVRNPTPTLGSCVTVDK* LSLWASV 
LYIYIYTHTHTDIYRKIYAP/IFVCVYI 
YIPI * XHIYYKEQAHAIMDAEKFQDLPS 
ASWRPGKLVE 


6480 


20381 


A 


6534 


96 


183 


DQIiNVFHRKLKRYDTYTFTVALUARHGG 
SC/LSLGSS*D*RYTPPLQTM*K*PFL* 
ISLPMLDWLVSNSWPQAILLPLPPKSSO 
LEA 


6481 . 


20382 

* 


A 


6535 


2 


241 


GFKRIDSNFESSVGKVPTN/TIFCBRKS 
LSMHQI / SLLS YFKKLPQLPQCSAATTL 
\ IS * QPYTLKQDPPPTRLQPABDSDGY 


6482 

< 


20383 


A 


6536 


485 


97 


PTTRP ILLLGFYTK*LKS \NKSDSCPPM 
FIATLF/ ILKCPSTDKWLKKMWYKHTTD 
YYSALKKGNHA/YM*QQMNLEGIMLSEV 
SQSHEGQLLHSTYVRYLK 


6483 


20384 

♦ 


A 


6537 


411 


57 


LFPSHI KFVLSWQSCFAKTSGPQTPSFF 
PP*LFPF/RDRVSLCHIiSWSAV/VSWLT 
VASTSQAQACLSIiPSSWDYRCPPPPPAN 
tfSRRGFTILPRLVSNSRAQEICPPWPSK 
VLGLQRRGI 


6484 


20385 

■ 


A 


6538 


438 


201 


I YPKEMKT+ VHTKTCI *MLIATLFTLAR 
KWKQTKCSSTDEWINKMWHIQ\TMBYYL 
AI KRNEAQLEHGE I IMLSERGQSQ 


6485 


20386 


A 


6539 


466 


131 


SFPKKGGYRAPPPPPPPFFFFFFFFVEA 
VLSLCCPGGSQTLGLKRSAHLVLPKC*D 
HRHKPPCIiASN/SFPNRHRMMTNNCLMC 
IKSWCLCIINPFYHLDSSLLWMLSLIYR 


6486 


20387 


A 


6540 


419 


50 


PLPFFKPGPRI F/CLGAPKKKI * VPT/L 
RE IKLFPLKGPPLFFFLNKGPLCFPGRK 
GRGVLFFLATPKSWGQKIFPPSKEGASF 
PFFFP FFSW*RWDLTMLPRLVSNSWPHV 
ILPSQPPRVLGLQV 


6487 


20388 


A 


6541 


1 


341 


SI I KTAREKCLVT Y * GTS I RLTGDF*KY 
I/SAKILQVRRQ*HDIFKVIK/EKNCQP 
RIQYPAKLSFINERKIKSFPNKQKLREF 
ITARPDLQEMI KG ILYLETKE * YLPS .* K 
HMKV 


6488 


20389 


A 


6542 


409 


290 


NCKQPTCPSTNE W/ 1 KL * YICI TEYYSA 
IKRNKLQLHTTT 


6489 


20390 


A 


6543 


32 


397 


AM I VFLQASLGNRS RKTLS LKKRKKKKG 
EKETLPGGST* KEKVRKKQNNLLV* KTH 
I KPG\ AGAHPGNPNI WGGGGRQ I TRSGN 
KNHPG*HGKTPFLQKKKKRGALLKGTNL 
TARGRQSKTF 


6490 


20391 


A 


6544 


402 


0 


LSWTQ * CKP IAP ATGEGGTGGS LE PRE S 
RLWCAM I APVNA /NCTPAWVT 


6491 


20392 


A 


6545 


1 


365 


RKVTSLPSTSRCLSILPHFSDGPRPVKP 
LQ* * LRCSICSASLPTHLHTARAAGLIiP 
\ LAS STAVNQN * S RELGACVKWEHSG I K 
YGLF* RD/RGLTMLPRLVLNS WPQAILP 
GQPPKLLGLQA 


6492 


20393 

• 


A 


6546 


1 


331 


ISTFKSYLRNI FSKAIAAVDSNSTEGSG 
QG * LKTFWKG YTI I DAIKNI YDLWEEGK 
TSTLPGVWK\LDDFEG\FQTSVEEVTAH 
WQIAKELELEMDPEDVTKWLQSHDTN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 

r r 

sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=CJutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=liysine, L=Leucioe, M=Methion!ne, 
N=Asparagine, P=Prollne, 
Q=Glutamine, R=Argij)ine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y*=Tyrosine, . 
X=Un known, *==Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


6493 


20394 


A 


6547 


1 


417 


TRPKINKTKNLLPERINKIKKP*VKLTK 
NKNKNKTPHITKIRNESEAFTIEKLKS I 
LRE \ YMNNCVYNLL YANRKPRQMDKFLE 
R YKQLKLTQEETE TIiTD \ YI TS KEIKFV 
I *NFSTK* S *GPNGFNSE FGRVLWFTPV 


6494 


20395 


A 


6548 


410 


288 


AHFLFFCINK\SLTMLPRLV*NMWP*VI 
FPPQLPKTLGLQV 


6495 


20396 


A 


6549 


1 


216 


IDRGFTMLPQLVSNSL\LKRSSHLSLPK 
CWDCRCEPPCPACFHFLSECNTCIKNQL 
HVTLFFPNT*GLTSEG 


D*f7U 


onio/7 


A 
*> 


*55fi 

I 






VPGFKOFSCDSLPNS * KYORAP PCPANG 
FAFSVETGFPHVGQAGF* FPTSGDPPAS 
ASQSAGITGLSLHAWP\ENL* KLRLHAT 
GLRLHDHVPLKVPNYLKVCS FKFEIADV 


6497 


20398 


A 


6551 


42 


484 


S MRWNSMNVTIMPANTTS I PQ PTDQEVI 
S VFNS / YYY YYRYCKATAAI DSNLSGGS 
RQSKLKTFWKGLT 1 1 YAIKNICDSWEBV 
KIST*TI * KKVIPTFMGDFGGVKNSIK\ 
E ITADVAE I AR* L* LEVEPRVGWIiTPVI 
PALRKAEVG 


6498 


20399 


A 


6552 


304 


74 


HTGKAAKKKEC/ WRGTVAHAYK PNTLGG 
*NGWIT*GO* FETSLINMVKPRFYSC I C 
I SGYTHKVIHR 1 1 1 CKGKKLE 


6499 


20400 


A 


6553 


432 


62 


FWAGNSRMQQWCTDT/VSAHCSLDrjLGS 
SNPPASAS * VARTTGARHHARLMLITMN 
EKPLGARYCARI FFENKLGI YPNESVPT 
KTNANWSKPRSCFSHDPDPGHWIYLMS 
SGLIVNKIGKVK 


6500 


20401 


A 


6554 


2 


174 


YCNLCL PGP SDLPTSAS * VAGTTCMCHY 
TWLIFFCI \LVETGFYHVAQAAKLFIYS 
T 


6501 


20402 


A 


6555 


3 


268 


DAWADNI KCR*G YGAI *TLIHCWWECK\ 
W*ATWWFLIILNMLLPYNPVIMV\PRTC 
IQIFMSTLFINTKTWKQPRCHLVGEWIN 
CGTCR | 


VJJUZ 

1 




A 


6556 


•a 


40? 


DAWTDN IGCR * RYGAI * TL IHCWROCX K 
W* ATWWFLI ILNMLLPYNPI IMDHK\TC 
LQIFMSTLFIDTKTWQPPKCHLVGEW/I 
KLWSMPAKE YHSWLARTEL / 1 VP *NT*R 
RLSCT*LWEKSQYEKATYCWPTV 


6503 


20404 


A 


6557 

■ 


396 


2 


KKQNAFQSPSFLPRL*TPKITILISFHN 
I YI YTHTYI YIHVCI FTYIHTHI Y IFF/ 
RDRFLLCIiPGWRAWQSQLTAASTSQAQ 
GIPG*FFFK/SIFCRDW/SITMLPELVS 
NSWAQAIPPPWPPKVLWDYRCGS 


6504 


20405 


A 


6558 


393 


175 


F KSQGLVLLPRLE FSGMLI AHCSLKLPG 
FSDSItfSA\SQ*LGL*GCTATPGPYLLt, 
IYAGELGLNKRSISSK 


6505 


20406 


A 


6559 


401 


188 


SPPEPFILGGPRVFPPPPFLKPPPRI*F 
WGPKKKKNF/CPPPGPKNFFF*KGPPLF 
FFFFFFFFFFFFHRSS 


6506 


20407 


A 


6560 


37 


343 


YEPAWLLDIYPKELKT*AHRKTCTRIL 
IASPFKTVKMWQPPD/CP*ADE/WDK*T 
ACLQTVE * CSARKRS ELSTM/ IEMWRKL 
KCVALSERGQSEQAACYVIPTL 


6507 


20408 


A 


6561 


792 


141 


LFPPPP/YLSTSPNAGA*TTSPLPPQFL 
YHSPRRQQSGTKPPFFSKK/PBPPSP/S 
PNKFYPGPWWGVSPFKFPPPPQKS *GPK 
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SEQID 
NO: of 
nucleotide 
sequence 

'* 


SEQID 
NO: of 
peptide 
sequence 


M 

* 

eth 

00 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucieonae 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucie- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
jL^Vjiucaroic acjq, r^-rncnyiajaDine, 
OGIycine, HNHistidine, I-Isoleucioe, 
K=Lysine, L= Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutaminei-R=Arginine, S=Serine, 
1— lnreonine, v— valine, 
W«Tryptophan, Y°Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=posslb!e 
nucleotide insertion 








■ 




u 


. KSPIjPPEAP/PAPPIjTQDCPPFVSNPGG 
KXKTPP/SAKKKXDPTPS/PSPPPRH* I 
GGPPGFSPPPLFKTPPPKLILGPPKKKK 
IFPPPGGKNFFFLKGPPPFFFFFFFFFF 
FSLLFPVGVWSRWLRSBAADLCGBCYSS 


m f\ *i 

6508 


20409 

* 


A 


6562 


* 

1 


356 


FFFNF I FI YLF / N FLRWS LALS PRLNG V 
QWSDLGSLQAPRPRUyiPFSCLSLPSSRD 
HRRPPPRLANFLYF* *RWGFTMLARMVS 
IS*PQ/CDPPASASQ\STGITGICHGTR 
LIPRNPMIA 


6509 


20410 


A 


6563 


416 


84 


LSLPSSW/DYRRPPPCLANFLYP* *rrg 
FTMVS IS * PHDRPTSASQSAGITGVSHH 
TQPI ISFLKSFFFTLSNRFNRVCIDSEL 
LLQCKDFYLQKFWLHIAFQEYTHCHGVH 
Y 


6510 


2041 1 


A 


6564 


2 


51 


FWREMQ I KTVLRYFSLKLVKI KK* DS T 
FFWKGC/GETGNLIHKANW/R/SI *QYL 
KKLYVHFIiFVPVI PPLEI YLKNKKILST 
K*LFAVFIVTEKC*KKKROG*T 


6511 


20412 


A 


6565 


2 


511 


VRW/SCAGL/LHTAV* IPGRIITSEKCA 
QQVDETHRN/LQRLQPALINRKGPILLH 
NNTCLYTAKPKLQKLNELGCEVLPHLSY 
S PDItS PTNYF FNHIiDNVLQGKHSHTQQN 
AENAF* EFVKF * SMDFYATGINKLIS /H 
WQKCIACNGSYFD 


6512 

■ ■ 


20413 


A 


6566 


3 

• 


248 


GDICT\SMLVPALFTVPK/TWNQPVSIS 
K* TDKMWYI YPVE YYLAIKKTKFC/SFS 
AT WM I LEDIMLS KPGTER * I LHVLYLCG 
R 


6513 


20414 


A 


6567 


3 


258 


WRDHS SLHS * P FGLKWFSHLS PLSNWD Y 
RDVPACPANFEKVL*K*GLIMLPRL\LL 
SSr PEW* XJjSSRXiPAiSSr UoVGi InMWxl 

cs 


6514 


20415 


A" 


6568 


392 


166 


GGGGPPRGGGLEPPGPPPFFLKNKK* /R 


PlrUOJbHaPurFPFUv^RblSlU r XIAjuaJNJ? 

H*TRVGPPPLSGGKKKKIRG 


6515 


20416 


A 


6569 


234 


411 


PGAVWHAYNPNTLGGQGHRIS*GQBFKT 

C /Ty^MTtTP PUT V 


ASIA. 




*\ 




i 
i 


1 10 


FLOHV*NKFOTMSDOIIGKIDDMNTLID 




LEKNVKDL\LITQTEVEELEGENKIPIL 
PG 


6517 


20418 


A 


6571 


392 


140 


IjPSS VTOYRHVP PCPANFVXjL»VETRFIiHv 
SQAGLEIiPTSASQSAGITGVSHQARP/E 
* *NS IRKGATTIQRDLGPVLQVRYLVRC 


6518 


20419 


A 


6572 


2 


228 


RTCTQMSVATLFLIAKNWKQPRCPSIGE 
W INKL * YI LTME YYL VI KK\NDFS SHKK 


6519 


20420 


A 


6573 


412 


100 


AIiHSHPLGGPGQGVPKSRI PNPPCPQGG 














tppppttwptct/tp pnpupT.PFPPT.inn'.irp 
EKFPNPG/ MPGVPLN* IPPPVSHPGVQP 
NFFSKKKKARGFCLVLNPSSS 


6520 


20421 


A 


6574 


406 


257 


NKREKKTRKMLDSIVIRKRHIKSQMRCY 
3APIQMAKV\SNRQSPKFWVRCRQTI3L 
*HC* *ECKIVQSMW*TLQKFLPAIVFLC 
IYHTEKLRFTQQP/CT*IFLAALLMIV/ 
RTV/KRLVCPSMGEWLSKLW*VNLSFSV 


4521 


20422 


A 


6575 


370 


68 


EQGSVGMRPSEPGPGYNIiLVCR*LRPLE 
KRT I WAGVS * FS RCSLSRLPLARKGKS P 



948 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: of 
oucleotide 
sequence 

• 

• 


SEQID 
NO: of 
peptide 
sequence 


M 

ejh 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1. 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid- 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
-ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
^Glutamic Acid, ^Phenylalanine, 
u^Hylycine, H-Histidine, 1— Isoleucine, 
K=Lysine, L=Leucine, M=MethIonine, 
N-Asparagine, P=Pro!ine, 
Q=Glutamine, R~Arginine, S=Serine, 
1 -Threonine, V=Valine, 
\V £a Tryptophan,Y«^Tyrosine, . 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NF \ CI £> * VRFSNGIjS 


6522 


20423 


A 


6576 


1 


259 


IDLIDVLRPSLTVT*AWQWCNLGSL*P 
*PPG\SGDPSASASRVAGTIGQHHHRWL 
IFVI FVEI WFRHILVYFHTAI KNYLRLG 
NL 


6523 


20424 


A 


6577 


418 

* 


218 


KGKTRLGR W P PCNPSTLG \GQGGW IT* 
GQAFRTSLANIVFPLM 


6524 


20425 


A 


6578 


I 


401 


EKAGCSGSYL*P*HFGRPRQEDHLTLGI 
*DQPGQCSENLSLQKNP/TNISWA*WPG 
IVAHACNPSTiiGSQGRRIT 


6525 


20426 

i 


A 


6579 

i 


3 


356 


HWDYRHEPPFPV*MHFFIWIiPGCHSLLA 
SPNLTGCFFS I S I TESLLS LF I YLETRV 
*DLQCS YFWBRRG\ SLCCP CWSRAPGLK 
RSCCLRLPSSWDYSRLTPPHLANYCILG 
GEGESS 


6526 


20427 


A 


6580 


411 

- 


158 

• 


IGSVAQARVQWRDLSSSDPSAN*APPGL 
KQSSC\PASLPSRWDY\NKCTPPDIiANF 
YIFFVETGFPYVARLADCFYPPPL*KMT 
GSY* VFTMCQIPH*AFYKF*FI * FRVGE 
DKNSLPAWQHRETLSLQKKYKN 


6527 

- 

• 


20428 

• 


A 


6581 


399 


218 

• 


ggaelkvpk*gggpgspplppplqkprp 
kgplwpgg*gpprppqeapsplknkn* / 
plpg/agpppvppnwgggfyprg*gfg* 
lpi^p*t1^gkqrdpvfkkkkkn*nvn 

KKKAKL * KS* QFFFFFLKTGSRCFPRAK 
VQGPNGS YPKPQPRG 


6528 


20429 


A 


6582 


392 


214 


GSSDPPNSTSRVAGTTDEVLPCSPGWS * 
NS W / VS K* SACIiGLP KCWD YTRE PQHLA 
RSNF 


6529 


20430 


A 


6583 


390 


257 


PKRPPPPAFFC* PK\SU3MLPRMISNSW 
AQAI I *RWPPKVLEKQA 


6530 


20431 


A 


6584 




251 


HYFNSFNFFLMGSSNHPTKNPG* FFFFL 
TFYKNK/SLTIIiPRLVLNSWAQGILLPW 
PPKGLGL* AQG ILLP WPPKGLGL 


6531 


20432 


A 


6585 


2 


143 


RHMPPCLSSF/ SYFL*RRGFAVIiPRLFS 
KSWVQVI LMPWPPKVU3LQA 


6532 


20433 


A 


6586 


142 


334 


GRWLSQLLILNFTTRSVLKQMSW/MACV 
SETWPA/STLGGRGGRIT*GLKEFKTSL 

i 

QNMARPYLYK 


6533 


20434 


A 


6587 


485 


377 


RTRLSCRFFILSLIFRFGNFYCFTLKF/ 
H**VFI 4 LSFVFYC*GHP/FBFIFK7FFI 
IFFHFKVFICVFKYLIiFFFILIAETFYF 
FVNI FS FKKFVSVMFVLLIBMFYYGRFT 
IPLR*F 


6534 


20435 


A 


6586 


241 


251 


NSL FTKKKKKKGGALLKEP I LRAGGGEG 
I IFFKGPQKS I PGPGFKKAGREKPGPSP 
*NWFF* KG PPFFFFFFCKE /RSFTMLPR 
PASNSWAQAICLPQPPKWGL 


6535 


20436 


A 


6589 


429 


54 


VPT DT DOT nrprinnP DP POT nnm 1 W 

SI/SSFLPSLLFYILSSPPLSSPPSSLF 
LSFFLSRVSFCHPG * SAVAQSRLTATS I 
SWAQAILP/CR/NRDFTMIjSRIAIiNS*P 
QVIRP PRP PKVLGIiQA 


6536 


20437 


A 


6590 


411 


137 


KCIGTITTHCSLHLLGPSDPPTSASRW 
GLQAHHHIQLAC\F* LID* LID* LNVEW 
ESRHVAQAGLELLGSSDPPTLASQNAEI 
TEMSHCIF 


6537 


20438 


A 


6591 


412 


196 


QPQPPGAKKTSGGGFPIR/WGHQPPPPG 
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SEQID 1 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 

. — ' 


SEQID 
NO: in 
USSN 
09/5154 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of - 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon - 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AJanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanlne, 
G=Glycine, H-Histidine, I»Isoleucine, ! 
K^Lysine, LHLeucine, M=Methioniue, 
N=Asparagine, P=Proline, 
Q=G lutam in e, R-Arglnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=^Tyrosine, 
X m Un known, *«=Stop codon, /"possible 
nucleotide deletion, ^possible 
nucleotide insertion 














AKKYN FFFFF F W * RQALAMLPRWPNCQ 
T*AIILPWPPKVL*LQA 


6538 

• 


20439 


A 


6592 

* 


28 


400 


IMPNCFPKWLYITFFFFFF*RQSL/DSV 
AQAGVQ *HSLQS TATSAFLGFK* FSCPP 
AFLE * LD YRHVPTHPTNFCVFNKD/MGF 
TMLARLVSNS * PQ/CNPPASASQSAGIT 
GMSHRAQPHLFVLKMEKPPPNFLFILLM 
NQSIHIYLLNFYRF 


6539 


20440 


A 


6593 


414 


23 


QTHLFWFPPGFSPPPFFKNRPRI FFLGA 
LKKNFFFPPPRHKNLFFLKAPPFFFFFK 
RSFFPPGWCAIGLFSLQPPPPRFI^FSC 
PSLPKKWGYRPLPPRPGKF/CFFFFFVI 
LVENGFHHVGQDGLDLLIS 


6540 


20441 


A . 


6594 


91 


274 


I LHWS * PLGNYHL/ WLGWAHARDLSTIT 
GGRGGRI AGGQDFKTSLGSKQGS IS ARK 
KEKKKK 


6541 


20442 


A 


6595 


391 


251 


HLKRCSTS S VI RE VQ I KTTTTCHFSE * K 
R/SE INMCW* ECX3QTELSALLIRC* \ WD 
S?>IVQPLWERKEYLKQISTESSHAPAI P 
LLDI * P* ERCASLSLRKGWGYRPEPPBT* 
ASFLLGSYI 


6542 


20443 

■ 


A 


6596 


465 


268 


ILLCHLGCSAVAQD\LTLCSLKRfiSHLT 
SPSSWDSRRGPSHLTKFF/C* *RGLNTL 
PSLV*NFWAQAL 


6543 


20444 


A 


6597 


3 


229 


CHGSPQPCPPGLK*SSHLNLPTPK*L\D 
YRGMPPRSANFLFL * RWGRCVAQAGLEL 
PDSSSLPASASCPRQPVLYF 


6544 


20445 


A 


6598 


122 


418 


I IVNELCHIMCFFFFFFFKQNLLGTQAG 
GQGGKNNG*LKPKPPRLRGSPWLTLPGG 
WNYGGFP P /HPS * FFFFFLGKKGP YFCP 
PAGGKFFNLKGPPFFA 


6545 


20446 


A 


6599 


1238 


930 


FETDSCS FA*AGVQWRNLSSLQTPPP\G 
SNNSCASAS/RIAGITGTHHHARVTFAL 
LVETGFHHIGQGWSRTPGPQAIRPPQPP 
KVLGITGVTSRARAISKHFV 


6546 

• 


20447 


A 


6600 


398 


78 


PSSP FYLFIFRQS FPFFARVEEKGGHPG 
FPNP PPSGLKKFFYAPAPPGSKAPGQGP 
PA/RFYLFIFRQS FPF/CCPGGGKRGPS 
GFPQPPPFGVKKI FLCPSPPRK*GPRAG 
PPRPRKIFFFFFSSLFFFFFVRWGFTML 
PRLVSNSGAQAICPPQPPKVLGLQV 


6547 

• 


20448 


A 


6601 

• 


3 

- 


473 


FPKGKKKGPPPPAGG/QLKNPSKKPNKK 
PGAPPRGPRGPKKGGGGLGSPPQSPQGG 
EKWTLKGGPNLPP PRGGLGGARPP PKIG 
GGPLFPKGGGPtoSGTPPPPLFFFFLRRS 
PALS PRLECSGAI *AQTSS/SPPARVGC 
PSPPAPTRPTGLTVSPGSN 


6548 


20449 


A 


6602 


398 


156 


NGRKHTDVKPEELPSWVPTRDLTPTGTA 
GAF/Q/RGYYGYYNKYVTVKKGS IAGVS 
TVLAA*VLFSYCLSYREIKHEQLLRAH 


6549 


20450 


A 


6603 


380 


77 


PFPKKSPPL*NFPPFF*KKKVPPPPPGK 
LPLS FLFFFFFF/RDRVLLCRPGWSAMV 
HGHGLSPVNFCISTRNRAL/SMLPRPIS 
NTWTQAIRPS*PPKVLGLQM 


6550 


20451 


A 


6604 


352 


204 


KRGPKI *NFFFG*RQSLHMLPRWPNSW 
AQMILPPWPP\KVQGLQA 


6551 


20452 


A 


6605 


631 


357 


FFF* DRVLLYHPGWSAAAR \SLI*TTTSA 
SWV\KQFSCLSLPSS*DYRCAPPCPASF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

• 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 
E>=Glutamic Acid, ^Phenylalanine, 
G^GIycine, fiNHistidine, I=Isoteucine, 
K=Lysine, L=LeucJne, M=Metjhionroe, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 








M 






CI FSRDRGFTMLARMVSNS * PQVIRPPG 
PPKVLGLQA 


6552 

■ 

■ 


20453 


A 


6606 


3 


397 


VLHKLTQRCSAVPLRAPTAFLEIGTQTP 
KRVWKCR* P /JRETQNNLLD KEQD FKTYQ 
KATVTKAVW CQHEDT I EQ WDG I E 5TE KK 
FHTQRAGGRDHSHSDSVRKKRKESLFFF 
LKRGLILFPNLBGTRGGISTP 


6553 


20454 


A 


6607 


323 


35 


YLKTKIGKPGGLAALQFLGPRPSKNTQL 
GNRFWCTHTHTHTHTHVHPHTSQYLYPH 
RSISFCSYLDSI/SP*VLTESSNSNPTP 
LKCC*HFPSPO* 


6554 


20455 


A 


6608 


439 


252 


LVPPLFFPRSFLLFS PPKKRGGPKPV* F 
GPPRAFYPPPF/V*RGPPPFFFGAPKKK 
IFFSPPPG*KCFFF*SPPPG 


6555 

■ 


20456 


A 


6609 


3243 


2602 


FFFGRDGVLLCCPGWSQTPRLKQSSSLS 
L\PKC+DDWH/RATSSGLQVFLSICCSI 
IS I FFMPS ICDKSTKO FSN / PORAYILV 

G*R*VIYIX3VVIiAFLFVFLRQHWNTVA 
QAGVQWCNLSLL* PPPARFKRFSCLILP 
RSWDHRRLPPRPANFFVFLVETGF\TIL 
ARLVSNS * PRDPPASASLSAGIMGVNHR 
TWPILVFLHPSVFFKVLQLFYL 


6556 


20457 


A 


6610 


81 


395 


TWAALKLYSTCI FSSSETEPDVSQSQIT 
DTTAC IME PGLFW / YRS 1 I WTL* FFHFF 
WRDGGLTIjIiTRLVWNSWAQVISLPGPPK 
VLGLQASATAPNSFILIREGIS 


6557 


20458 


A 


6611 


1 


126 


GRLRQENHLNPGGRGCSEPRLHHCSPAW 
A I ERDS VS KK/NKNGLKRC FHLRKKTLK 
GEKRIW.*PGGRGCSEPRLHHCSPAWAI 
ERDSVSKKKKMD 


6558 


20459 


A 


6612 


389 


67 


YLFMYLF* / DRVLIiCHPGWSAVAQS * LT 
AASNSWPQVILLSNRFKSWTVLLIjSSKI 
PWQGSTPCPRCYSWHPKCLQLQGLLGL 
DCSSPKYPRGPSLAAFRSLPKWHV 


6559 


20460 


A 


6613 


I 


384 


RCFKVKDLKMRPFWNIQVDPKSNDKCPC 
KRLKRRGHASL P P WE KRRR PGKDGGRGW 
NSAVTSQGT /VEATRSWKSQGSVLP* SL 
*RERSPV\ELAHF*QRLAMLPRLVLNSW 
PQDISPPWPPKALGLQEP . 


6560 


20461 


A 


6614 


385 


174 


CLRLSALKKP KLNS WY* K* LYIME Y* SA 
LKT\NKG I LSFAI T WMNLDD I LLNE IS Q 
TQKDKYHMNSLICE I 


6561 


20462 


A 


6615 


1 


370 


MNPRAHIiS FLtiRRKERRILWLKALGHI M 
/ FTV*GLTG I ALANS SLYIVLHDTYYGV 
SHFHYVLSIGAGFAIIGGFIH*FPLFSG 
YTKKKKKN I RGGG TKNFLLCPALVQRE K 
VNPFFALTNLWG 


6562 


20463 


A 


6616 


3 


342 


YRHEPPHSAIFAQLYKL*K*LIYALKTG 
KVYVYKLYLNKAIKSQ\SYNRASLSLT* 
* PRVKRKRS * PGTGAHTCNPSTLGSQDG 
L IA* VQE FETNLGN IAKPCLHQKKKK6A 
PV 


6563 


20464 


A 


6617 


1179 


915 


FFFETBS CLVTQAGVQWRDLGSLQS PPS 
G\SSDFQLIFVLLVHGQAGLELLIS*SF 
ALASQTAR ITGVSHHAQLVWSDF I KS IN 
LFCCL 


65*4 


20465 


A 


6618 


3 


390 


HAS VKMELPYDAAI PLPSI YPKELKARF 
♦DLCIPM/FITALFTIAKRWRQLNCPSV 
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SEQID ' 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

cth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict, 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

nentide 

sequence 


Amino acid sequence (A=Alanine . 
OCysteine, D=Aspartic Acid, 
E»Glutamic Acid, ^Phenylalanine, 
(^Glycine, HMHistidine, I^Isotaicine, 
K=JLysine, L=Leucine, M=Methionine, 
N=Asparagine, P= Pro line, 
Q=Glutamine, R=Arginine, S=Serine, 
T«Threonine t V=Valine, 
W»Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=po$slbIe 
nucleotide insertion 




■ 






■ 




VIRSWGREGVKESCLMGMEFHSAR*KSS 
GDTFHNTENCY'TVHLKI VSGKF* * L* C 
VLNYVGFFYHNKNIKRLS 


6565 


20466 


A 


6619 


392 


49 


QFLFERLFSSYSGYVSGVKMLGSTBVQL 
FPRKCEMKSCS VNPRXjECSGGI SAHCNL 
CLPGS RDCPAS AS * VAGL I FY / 1 FLVEM 
GFHHDG/RELVLNS *PQVIRPHQPPKVL 
VLHV 


6566 . 


20467 


A 


6620 


193 


282 


TPGLK*SSCLSIiQNSWDCIWDYRRI.PLY 
PVNCFCC*FVCFVC/IFVEPGSCIAQYC 
LELLASSNSPMSASQSTGI IGMSHGACL 
ILT . 


DjO/ 




a 






Q 
O 


LNDTVST FFF *DR ISL \ C P S WSAVARS O 
LKAASTSWVQAILPPQPKTNLLGSSDSS 
TSA* DQS PGFK* S FCFRLPGSWDYGCVP 
PFSANF* IFCRDKLSL/WAOAILSPWPP 
KVLGLQ ALW EAE EGG 


6568 


20469 


A 


6622 


395 


42 


HVS S TS FQGDELHLRIVALPLP \ PLHNL 
PLPLSPRSL*C/HSRGWTVLPPSRLTAT 
SLPD/SSO^LLSAWDCKRAPPHLSGFC 
WSAVA* SWVSLQPPPPSRLPWPPKVRRL 
QlPLPSRHPV 






A 

** 






327 


PSPPSRWNPKDVPPNQENIiGI FFFFL*T 
RDIiATLPRLVPNS * TQVI / WPP KVLGLQ 
A 


6570 


20471 


a 


6624 


395 


47 


KNRALFFFPPPRKRGFFPKPLIWGTPRF 
FPPPPF*NPPPYFFFGAPKKKFFFS/SP 
PGGKIGSFKRPPPFFFFFKQNFFPVPPG 
GGPRGDFGSLQHPPPRVKGFSCLAPI I F 
FSSF 


6571 




A 




100 


267 


LSLS KKANISKFWLGT\VAHAV\DPSTL 
GA*GRWIT*VQGFETSLGKIVRPCLYFC 


6572 


20473 




6626 


409 


126 


PSSSFSSSSSSSSPSSSSSSP/PFFFFF 
FFFS FW * KWGVS LCGPGWS QTLGLKQS S 
CliSLPKCWDYRNKPLHLAFLASLILFTF 
QEWDITD 


6573 


20474 


A 


6627 


506 


323 


REAPGTLI PC WQKYDMI KAQLVWQFLRK 
PNI YLPYNPAI PL*HTCPRKIHTQYT/C 
FITKTCTRMFIAALFIIPQNWKQPT*PF 
CPFTIISHSCSPRDNWF 


6574 


20475 


A 


6628 


330 


117 


DVCPRKI YTLVLQQLCI \RMFTAILFS I 
AL* FCKPMCTSILEWIKKL* S IRTTEYH 
LVMRRNKPLLYSMPQM 


6575 


20476 


A 


6629 


396 

• 


139 


FFSPPEKKGVFPSP*NLGTPGFFPPPPF 
KKRPP * FI FX3APKKKK/HI SPPPAEKFF 
FFKRPPPLFFFFFFFFFLFFFFFSAFKL 
KV 


6576 


20477 


A 


6630 


79 


405 


VSGRMTALESYLLASSA\RNKCSSNCKI 
KKMT ARLD VLAHRCN P ST FGG* GGR I C * 
AHKFETSQGHIIRPCLYKKKKKGRPFKR 
TQFSPARWQGNGFFWGTLKNSRAGF 


6577 


20478 


A 


6631 


2 


417 


S F I F P CKFLNQF I S FYKNVCHDFDCNCV 
ESKDQLG/WPDAVAHTCNHNTLGSQGGW 
I A* AQELKTNLDWMAKPCLY 


6578 


20479 


A 


6632 


74 


417 

» 


KTVFRYNGWAL* AS ENR IS LADLLXPS F 
HVLLCLP KAG \ PNFPLPS S IKKFFDLPD 
LIRSHKTHISEGVLFPFP/RRKEPRTBR 
PRRI * TDRSCWVS SLSLIVLDHAFFVQS 
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nucleotide 
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correspond 

ing to first 

amino add 

residue of 

peptide 

sequence 
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Predict- 
ed end 
nude- ' 

mm mm) w m W 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A==AJanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. F^PbenvIaianine. 
G=Glycine, H=HistidIne, l=Iso!eucine, 
K=Lysine, Lr=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, . 
X=Un known, *=S>top codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














OFYM 


6579 


20480 


A 


6633 


740 


776 


CSFFLKKQTRSRAAVQAGVQWCHYSSLQ 
PSPPDLKQPPASASQTV/KD*GAYFWCC 
FFLFLKRWGL/NSVVQAGVAiiiiPDLPVV 
RRTjVT PIiSIiWAVEPALWARADIj*riL / SP 

fVXV-LJ V L ■ ft ■ iJ"f^ lf mmWrnrn /lUrUVUI^U XJ±Jf mJ AT 

cplas qs prhqtgqs phq kcfpdi lglh 
lalrssfspglerptlpxppglaseht* 
atwlswplpyk/ ppkrvkmylvkadrpe 
pfl *gtrgrks vmqmthaktlqbtlysg 

PEAQAAASVTbEF 


6580 




A 


0054 

t 


L 




PRFK*FS(XSLPSGWD*RHAPPRPAIFV 
FLVEMGF\TMLATMV\ * IS * PGDLPTSA 


6581 


20482 


A 


6635 


407 


212 


GRPR*TPPPLGKKKKGAPPPKTKKKKNF 
W3GRDSYSQKKPFFF*RNKTRPGGGGPP 

RTPPPTiTfn*JXTfTnff3T.PPRPTrFPVT* /NTT 

PPPPPPGGNNQNPFPKKKKRKSTR* I*K 
IRI SRF ILYFFFFFFLERGFGCSPRGGG 
GGAFLSYGKLWPPG 


6582 


20483 


A 


6636 


20 


223 


PCKKLPQPLQPSAT\TLISQQPLTIiRKD 

T iTjPRTTP Tj* T AROQTTriS YOFTiVAICYFK T If 

GLS FGRNKLANI 


6583 


20484 


A 


6637 


866 


647 


FFLDRVL/IiCHLGWSAVMRSWLTAALTS 
GLKQSSCLSLPKLRDYRHEPQHQANNIS 
♦SSWSDSNMQPRLKTTL 


6584 


<*if\A O C 

204o5 




6o3o 


• 




Vn 21 P P P W T P P J? PT . PM T I? 7? P P P I? P P P P P P 

FFP INFXaQCLLYYRALHFNKVKVINNF/ 
PLNFCAYYHPL *NM* FS YI S FQNLYKFF 
LI/FGVFNLSGIYFNVL*IANF*+G*LI 


6585 


20486 


A 


6639 


t 


341 


NWDYRHALPCSANFFILVETGFCQLPRL 
VLNSCPQVSHKA\KFLFIFFFLFFLEEG 
GFVGPPRLEYREGSLQP*LPKLKGSTCL 
TLPKG*N*RGEPliPPARSSFFFISKFFI 
ET 


6586 


20487 


A 


6640 


391 


118 


NANFEKS FSMGKI L *NSLAWYKKILCD/ r 
RKNRSMQKTPLD* * YYFKKLPQPL*PSA 
NTTL I GRPALT S RQG LP P AKTL * FAEGF 
DPMILSLF 

mmr mm- m> m mm i* 


6587 


20488 


A 


6641 


352 


285 


KKKKKGGGGGAYSGIPTF/CFFFLNGVF 
LGPAGVQWQNQRSLQPPPPGVKLFFFLS 
LPSH*GPRHGPPRPGMFIYKNF/CFF/V 
RDGVLLHCPGWSQTPGLK 


OJOO 




A 






(LA 


TGNPFLDSLFYFNPGPLGGGG I LKGFFS 

4m- Mm mmm**** m^V mmmmm mm mm 4YW mm \mP 4m mmmmmTSmf \mT\m* mm 4J4VW mm mm %mf 

FPFFLETKFFFPPPAGGQWRYFSSL*PP 
PPKFKQFFFLSM/WPRPGKFFFFFPGIL 
VEAGFHRVAQAGLKLLSSGNLPTSASQS 
ARITAKSHRTVF 


6589 


20490 


A 


6643 


424 


11 


PPSIFLAPPPSRGGGPPPPPKKK/QSPP 
KKKKKGGGGGPKNPPPKKKGPPKKTTGF 
KKNPPQKKKKKNSPPKKKGGPPANK*KP 
PPRPQPFYFLTFYYKIFIYIFLLKFLLF 
YILFFFFLKSSKQGRPtAI 


6590 


20491 


A 


6644 


411 


2 


PPLRIWIiASPPSKIF*PQPLHI/YFSPF 
PP*KFFFSLKPLIFLGGFFPKFPPPKKS 
FFPKNPQGVF*NPP*KKKIYFSSPP*NF 
APPKIFLKTPPPFFFFFFFFFVEMGSRY 
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NO: of 

flUvlCwlluc 


SEQ ID 
NO: of 

cpri ii pn t*<* 


M 
eth 

uu 
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SEQ ID 
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uucicuiiuc 
Inpntinn 

correspond 
ing to first 

-amino acid 
residue of 

. pepuue 

sequence 


Predict- 
ed end 
nucie- 

ULIUC 

location 
correspon 
ding to 
last amino 

UUlU 

residue of 

peptide 

sequence 


Amino acid sequence (A s Alanine 

D=Cysteine, D=Aspartic Add, 

£/~vti uvaiii iv /will) i jriiciijmmumcj 
fi=f?lvcine Ifssl-I fctirlinp T=Tenk>nrlnp 

K=Lysine, L=Leucine, M=Methionine y 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threoninc, V=Valine, 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














IAQARTRERTRGRTRGRTRG 


6591 

■ 


20492 


A 


6645 


34 


381 


AKLGTRQAAKQBKEIKVTRI*VnCBIBLS 
yr lis i lyiuNjj/JN v iujsjji) w ijviritfajvi l 
GYKVNLKNINCIFHKHTNQQQKILKMPF 

TT2i*Tf /TTJT*T.* TUT.TKTlTnnT.VTiTCTVO 
XXn A./ 1 Xtv w Jj" XriXil pJJi\lULtZnaX iu 

TLKABL 


6592 


, 20493 


A 


6646 


17 


448 


HF YAKLGTRASLS CSGLHPV* ASQ PLYL 
PTQASAMADAPFPARLPPNSLI S YCCTS 
NEQGSVGVGPABPGAGYNLV*HLLRPLE 

QTPCASWVRQCPTLLRLALHGIiHPLSNQ 
SQGD . 


6593 


20494 

t 


A 


6647 


462 


336 


SLRLPGSNDSPASAS*VA\GITGPCHHA 
RLVFVFLVEMEGLGS 


6594 


20495 


A 


6648 


447 


62 


KDDIWIAITHMNRWSISLAKRKM*IKIT 
MKYPyTSI£CMA/lKQTNKN*KKSSSIKC 
W* ECRTMRTLIHCW* * HITG* PLWICTVW 
KFI I KLNLYL P YD PA\ TCTQMFTA/ALF 
III KY * KQS KYPS SGYCVPA 


6595 


20496 


A 


6649 


3 

■ 


1492 


LAYARAFIJ3DTNTKRYPDNKVKLLKBVF 
RQFVEACGQALAVNERLI KEDQLEYQEE 
MKANYREMAKELSEIMHEQI / CPPGGBD 
ERLTBFPSHLQRHQWDSNKHNGSRDDQL 
VFGRVI TSHGPCVIjTC rv XCKIiRHXjSKA 
NHWGDRAQ/ERTKGKGRERK*RTTLFLN 
R/RFYRSWKKVHIFF 


6596 


20497 


A 


6650 


495 


170 


G AAE F PAS Q SAG I AG VS HHARLEE FFS L 
DSGRVMRGTEVLKYLGGYTITI*SIFFF 
FSFF\WPGLGSLQPLLPGFK\LSLLSRW 
DYRCPAPHLISL 


6597 


20498 


A 


6651 


9 


222 


QTTRPERRNSGDPTHLSL*SSWDYGHVC 
HHAWXil r *K/ Cr SCjGIiAI <i iJ^RJjy y NSWA 

QAVLPPWPLKVLGLQV 


6598 

■ 


20499 


A- 


6652 

• 


241 


833 


KKTPLAMWNPWPGPPQIPHPLTKLVQMA 
G / QENGCLSGHDLE S I RPAGLHNSARGE 
VLGLSSSLGKELVFLQEELDLSEIH1PE 
AQETvErl^GHriUrltHV P VJrJJGKAJr I CKA 
SLSASSSLEPTPPEDTAISSLRPPSAPB 
MLTQHGAQEQVEDHPG/Q*QPSPHSQSR 

TRPG 


6599 


20500 


A 


6653 


1 


398 


FF FDIGSHSLTQAAVQ * HDHGHGSLHPR 
LPGFK* S S HLS FPES * D YSRHTP PRPAN 

SKSARITSVSHHVQPEVSIKW*NNISNL 
IKCFLLFYKYKCIFINTKFVFTV 


6600 


20501 


A 


6654 


489 


282 


PAS/WDY/ RHMPPLLANACAFIjC*RDFS 
MLPGLVSNWWARVICLPWPPRVLRIQAR 
ATTPG 


6601 


20502 


A 


6655 


953 


908 


SQSCSVS \RLECSGAISADCNLCLLGSS 
YS STSAS * VAGTTGARHHTQLI FVFLVE 
MGFHHVGQPGLELLT\ *VIHLSRSPKVL 
GL*APSSARGQNSVSQ 


6602 


20503 


A 


6656 


412 


44 

i 

1 ■ * 


R I S ERRKS LGAVHT KD KKYS S S RVRTSN 
PNCARSGASSKQSFLFGKKRAPHPGIKS 
F\KDALLWDGITGVHHHA*QIFVRSRE 
R* XRPVI PALWEAEAGGS RAQKIET ILA 
NTVKPRPSLLRA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
DeDtide 

Mmt ^*mm* ^mwmmm V* 

sequence 
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M 
od 


SEQID 
NO: in 
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26 
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beginning 

nucleotide 

location 

correspond 

big to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sentience 


Amino acid sequence (A~Alanine 
OCysteine, D-Aspartic Acid; 
E=G!utaroic Acid. ^Phenylalanine, 

^^mm ^m* m ^^^m m-^^ m ^B" my H mmm mm m^ mm _ J m mmj ■ mm 90 my V ^Brw 

(XJtycine, H-Histidine, I=Isoleucine t 
K=Lysine, L=Leucine, M=Mettaionlne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosjne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 

mM mM mm | Wm* m mW n»B mm m* m mm *7 ■ tlvll 


6603 


20504 


A 


6657 


3 


241 


F F I FLEARSHS VIQVGMQWCDHS SL * TQ 
TPGLKES\PASSFHRAGITGVSHHTYLF 
L*GH* SHSQGPHPHDLITSKRPCS 






A 

M Mt 


6658 


467 




FFFFFFKOGFGIiFPOIjEGKEGIPGPCNP 
CPPGANDSPPSTPLGNGGPGGPPPNPGN 
FGIFGI *GGEGGF* TPGSGKPSGPGS PK 
\IPFCWVTRERPHGPPPRPFFFFLRQSL 

% ^mm a> ■ ~ v v mm mtmm^mm' mm mm mm m* mmm mm mm mm mmMMm ~ > ^^m\ mmmw 

CFAQAGVQWCDLGPLQPPPPRFKQFCLR 
NS 


6605 


20506 


A 


6659 


2 


286 


KNLKINWT*WLMPWSAT/WGGSLKPGR 
LRLQ*AVXAPLHSSLGDRARPYLSVSQY 
LYFSLSLS LSLS LTHTHTHTHTHTHKI E 
LLGCVAEI IHFF 


6606 


20507 


A 


6660 


3 


249 


LSPRP* CGDT I SAHCNLHLLGSSSSPAS 
VSQVAG ITGACHHAWLI FVFS VEMGFHH 
AGQDCLHLL/NLVIHPPQPPKVLGLQA 


6607 

* 


20508 

* 

■ 


A 


6661 

• 


3 

• 


1507 


PEAPVPFLDSNQBNDLLWPEKFPERTTV 
TELPQTSHVSFSEPDIPSSKSTEliPVDW 
S IKTRLLFTSSQP FT WADHLKAQEEAQG 
LVQHCRATKVTLPKS IQDPKLSSELRCT 
FQQSL I YWLHPALS WLPLFPRIGADRKM 
AGKTS P WSNDATLQHVLMS0WS VSFTSL 
YNLLKTKLCPYFYVCTYQFTVLFRAAGL 
AGSDLITALISPTTRGLRBAMRNEGIBF 

ITL/SSSTDLWTHEIDIjHRNSL*NRDSN 
CSNF/LREQAI SDEDEEESFS WLEEMGV 
QDKI KKPDILS I KLRKEKHE VQMDHRPE 
<> WT.VKG TMTFTT.T.'MFLTN 1 ? KST.Vn.TQri 

PQAGLP PTLLS P VAFRGATMQMIjKARS V 
NVKTQALSGYRDQFSLE ITGP IMPHSLH 
SLTMLLKSSQSGSFSAVLYPHEPTAVFN 
I CLQMDKVLDMEVVHKELTNCGLHPNTL 
EQI^QIPI^RSSIiRNVVLRDYIYNWRS 


6608 


20509 


A 


6662 


326 


1 


IMIFLFFQTGSRFVA\RLECSGTITTHC 
S LML.PGSSMP\ PTSAS * WTHCRTRVI L 
NFSVDPGS RHWKGGLELLDAI LKPWPP 
KVLRTRGS TPE FRS GQSGVR I RGFN 


6609 

• 


20510 

■ 


A 


6663 


414 

• 


105 

• 


QGKVGFKVGIiNPLSRLLHFFKQKLFVLS 
*KGQVICKLSLKIPFPPGGFFSPPI*NnPP 
P *NKPORFKNPFMGFT/ IRKKI LFLAFK 
STKSGG*NPPSK/HKK1*FSF/ CFWETB 
SRSVTQAGVQWHYLS SLQPLP PRFK* FS 
FCKPHKRVFKPLGLVLWGG I SNRRRKKT 
SGGKRNFKA 

mm* ^mw *mmt m>^m>mm>m mm m>\mm My 


6610 


20511 


A 


6664 


1 


407 


GCVPWPAAREABAG* I A* TRET / RUR W 
AGIVPLHSSVGNKSETLSQKKKGAPSKK 
SLQGVALFLSQLGKLPPFLQRKKRPQRG 
GGRGQLAPGYNPPAGKTLVGPGPHLSFC 
S KKAWKELRP WKKPRGFWFLINST 


6611 


20512 


A 


6665 


1 


280 


FFFF*HRVSLCCPGWSTDLHLLPP*IjAR 
FKRSSHLSLPSSWDYRHTPT/HPGFFVC 
LFVFCRDRVSFVLPRLVSNFWAQAILLC 
GDRKVMGLQA 


6612 


20513 


A 


6666 


408 


89 


WLPNSLILPN*NYFVSYLSDRWSLCRP | 
G3TAVAQ * ELTA\ A* TLGLKRSAS FSHS 
HSCSLPSSWDYRCTSSHLANFFFKRWGF j 
TILPRLILNSWPQVILIiPQSQRH 
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sequence 
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M 
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USSN 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict, 
ed end 
nucle- 
otide 
location 
correspon 
dine to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alamne 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, ^Phenylalanine, 
OGlycine, HHHistidine, Msoleudne, 
K=Lysuie, L=Leucine, MNMethlonine, 
N=Asparagine,P=Proline, 
,Q=G)utamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6613 


20514 

i 


A 


6667 


484 


208 


I T WS AAGL I HYS FLN PG E PMTTE K YAQ 
EIDDIH*NLQHLQPALVDRKGPIFLHDH 
A* PHAAQPK\LQKLNKLGCRVLPHLP YS 
PDLLPLT 


6614 


20515 


A 


6668 


495 


227 


PAFVKVPPE KDQRQNSTHAGRNHLPME I 
LLL * KKKLI BTESHSVT \ RLE CSGVI TA 
HCNLKLLGS SD P PTS ASD YRLKPWHLAP 
SSFSMS 


6615 


20516 


A 


6669 


2 . 


65 


OSVLLTLSLLARLVL\NSWPQVINLPWP 
PKVLG + QAGLGTPGIjK 


6616 


20517 

■ 


A 


6670 


403 


224 


ILL* P * PPGLK* / FLPSS WDWRRASLCIi 
ANFLFFVEMGYPRVARAGLKFLGSDPSA 
SQSAGVTGVSRPGRFLIRTVL 


6617 


20518 


A ■ 


6671 


406 


162 


RS YAQAGTQWLDFSSLQP * P PGLI KQS S 
HFSFSQSASH/RHPDNYCI PCRNRVYVG 
QAS LELLGSCDLPAS AS QS AG I TGS CL 


6618 


20519 


A 


6672 


453 


32 


WSATLHTSVWCACATRCVGMCMTLGV 
CGCECVW*GCVCTSVN* \HTSTGWSEK 
V,*LCVCGCV* LCVAECMAFVSRQCVCDP 
VCCLPEKMRLSPISVA*VCAPMKVSPCP 
PQLSELHLLSLNIYINKCVSLSQNAWLF 


6619 


20520 


A 


6673 


378 


180 


NEVIRSYKVSLFSPRIEKWINK/I/HWY 
I YPME YDSAI KRNEVLIHATT *RKLENI 
TLNERIFDIIPFI 


6620 


20521 


A 


6674 


378 


94 


PVKNRRTKELH I KD VCTPMLI VI LFM IV 
K/S * KQPKCP * IDE WMNKMWYTHMVEYC 
S VS KKEGSPF I C YNEDITLNE I SHKKTN 
IYNSTYIRYLE 


6621 


20522 


A 


6675 


I 


380 


VCSL* VLCRHMSCFQNCSL* FSFSFWFF 
WRQCLPLSPRLECGGPMSAHHNLRLPGS 
SDSCASASLVAGIRGACHHARVIFVFW 
EMG\FHCVGQAGLELLT\ * VICPPWPPK 
VLGLQVLATVPGQPHIF 


6622 


20523 


A 


6676 


424 


135 


SfHRWGVKCFKYNI FFFRD\RV* LCCTGW 
S*TPELK*SSCLGLPKCQDYRHETLRVS 
Y*F*NSI*QIGVLAMVSKLVLNS*AQAI 
LLPWPPRVLGLQI 


6623 


20524 


A 


6677 


408 


91 


HTHFVWPPVLI P* * PMR*VLLLTPNTNF 
FLRDRVSPCHPGWSVEAQSQITAASN\Y 
GLKQFSHPSVPKCWDYRRELPCMVSLI I 
F IYVHLHMYVHARKE KPE YPL 


6624 

* 


20525 


A 


6678 


1231 


217 

■ 


P PYFNWPLTFPQKRFFKGDKTLKKWL * I 
REI I LQNRGIG* KTDSIR*MGKTEAPLK 
EKFLLGTALQPSFLPYMLGHFKEE*GBE 
FPGTHSWPFPSFPVWGLGLSVNHRGIHG 
P RVS LQTWGPGQEAMECE CGNHGTEDTG 
\GAVLQPG * VT * G FHTPTSEPGLVAPP C 
TCSGLKAEA\GFLKLPGKQG\SEGALPP 
* SPLPSACFPELGLRRLGSS PGLMIHPG 
WGLFTS PLGLKI LG PRHPAGPALPQI AQ 
RPSLKFR * RRLG WGPELLPAQTCGAQSL 
GPSTAPLLGQGPWGLG *HASRWGTLGLA 
VAPGNEVPSLSPR* PRGAGPLHRHRVPV 
RSWG 


6625 


20526 


A 


6679 


1 


178 

* 


RGWDYRRPLP/RPG * CFVFLVEMGFHRV 
GQDGLYLLTL*SSQSAGITGVSHRARLQ 
PPIF 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid. ^Phenylalanine. 
G^GIycine, H=Histidine, P=lsoleucine, 
K==Lyslne« LHLeuclne. M=Methionine. 
N=Asparagine, P=rVo]ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y^yrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


6626 


20527 


A 


6680 

WW 


3 


135 


KSEHLEREPI IEQLSTMFF/ TAKYGWHL 
RGH*HRCQKKPEPPEDR 


6627 


20528 


A 


6681 


425 


107 


KGPPTPS/PPPPSPVFPTEKILVPPPFP 
PPPPPKKPPPQNIFWGPKK/IKNFSPPR | 
GEKNFFF*TPPPKYTLGAKKKKNFSPPR 
GEKNFFF 


6628 


20529 


A 


6682 


424 


284 


CFLQPGP \ PGS KGAS PPS FQDNWGLRQA 
LPPP/RESFFLFLFLFFPIVQTGFHHVA 
RAGLE LLTS S DLPAS AS QS AE I TGVSH * 
A*PFP 


6629 

• 


20530 

* 

■ 


A 


6683 


1290 

- 


449 

■ 

■ 


PLSAPPPGDPSP*KSSPRAPNLPAP*PP 

KSPSPPV\PWATEASRGHQPQIT*ELG 
SLVLRMTEKGTWG*GIPDSIMPLPPJRRS 
EPRTCRE* GDTRWRWPJRRRRS TGAVSTF 
SPGFACGGGSGA/ PAHPGEQKIiGASARP 

LPSQPVH*RFKP\SPPY*PP\ALPPLSP 
FEPSAPPHTGSVPY*QGVPP/SEGCPST 
SKPLISVRPPVTINGCLPQSQPPKIHTV 


6630 


20531 


A 


6684 


425 


40 


E WAS LKN * HPP / GANQAVEAVEAPTHCW 
*GCKS IEPLWKTVWQCLDKLNI CLLYDS 
TPE * MSEHVYOAACKKSFTAVLFMI PPN 
♦KQPSCPSTCWLGQAWWLTLVI PAPWEA 
KVGGSLE PRSLRPAWATG 


6631 

4 


20532 


A ■ 


6685 


2 


107 


SWLTAASTSQ\VKRSSRLSLPSSGDHRC 
MPANFFFLKDDVS LLPRLI SNS WTQAIL 
LSLPTKGLEL * AHCSLNLPGS SDPPASA 
YQWGTTGACQIiIFFF 


6632 


20533 


A 


6686 


12 

i 


224 


TPPGSTHASGAPPCLANIKFNY\L*RLP 
RLFWNSWPQVILLPWPSKVLGXiQVWATV 
PSPFLIFDRDIDREI 


6633 


20534 


A 


#87 


404 


2 


TAARG 1 1 YGGP I KKLL PRKP ADVK/ SGS 
FKTAAPFFFFI/3TRTLTEFLFI I ION* K 
QPKRPST/DRINKIiWYISTIEYYSAMKR 
NKLLIHTTAWMNYKGIMVSKRGQSQKSI 
FSRTKTGFGRNHISGCFLYPZjPLRL 


6634 


20535 

• 


A 

• 


6688 


410 

* 


163 


PRGNTPPGFSPPEJCENQKVFNPPPGNIV 
* KKGGPP POGAOKKGPP Y*G PPLS FLP P 
KKKKKKGPPPPPPPKKNPPKGNS/SFSP ' 
KKKKKFFP PPPKKKGF *GGPPLKTLFFW 
GGGKKFFFFFWGKKEFPLGG FFFGGGGG 
GGPFFFFFFG 


6635 


20536 


A 


6689 


205 

• 


405 


KSPPSPGGPPLIGKLEIGGAPKPKNWPP 
PLGP KG ENP PLALGGGGGAKNHP PNPQF 
ILLFLGGKNP*KSPPSPGGPP/ANRKIG 
NRRGPKTQKLAPPLGAKGGKPPPCPRGR 
GGGQKPPPQSPVYSPFFGGKKPL 


6636 


20537 


A 


6690 


3 


199 


HGSLQH* PPGFKSSSSLSLPSSWDHGHV 
QPRPAI F/CFFLILEKRGVSPGFPGGPE 
TPGLKGLDLL 


6637 


20538 


A 


6691 


8 


350 


S VCLGLP/ RCWDYR YEPPHLA* LAFLOQ 
I IFHHMDWI YHVS ILWDGYLGCFYFIA 
I INNV\G* AGLELLTSGDPPTSASQSAG 
ITGVSHCTHPYCYTFLTREITTPKEVKI 
GLCM 


6638 


20539 


A 


6692 


373 


197 


LPSSWDYRFMP*HMLFFK\FLFLLFCRD 
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Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 

E=Gliltamic Acid F=Phenvlnlnnln** 

*-* VIUIBHHV, ntlU) M. M 1 * c "j'fl Hill 1 IIP, 

OGIycine, H=Htstidine, I=Isoteudne, 

K~L.VSlof» 1^=1 murine TVf =TVT i>thinnin» 

N=Asparagine, P= Pro line, 
Q=Glutamine, R=Arginine, S=Serine; 
^Threonine, V=»Valine, 
W«Tryptophan, Y=»Tyrosine, 

i\. v/IIKJIUVtII) ~OHlp CUUUUf /— *pux»iwic 

nucleotide deletion, V^possibte 
nucleotide insertion 














E / S LTMlkS RIjVfiKf S WAQATLP PO^ bidhL 
GLQV 






A 


ooyj 


joy 


inn 
IUU 


PMTiARMV 4 ? T PRnPPA<3JL<?n<l AflTTftV 

SHRARPIYFFK*R*SHAGYSAVPVHRHD 
PTTDKHNSFLPCFLQTGSCFWQVGMQW 
HDHISL* PRTPGFKQYSHPSLPRH*DYK 
PTPP / HP Q * PWO VGMOWHDHT SL 


6640 


20541 


A 


6694 


12 


182 


FLS S WN YRC VPPHLAN* KKNF FFYTDR / 
SLAVLPGLVSNS *TQAVLPLQPPKVLGL 

QA , 


6641 


20542 


A 


6695 


402 


248 


SSWDYRCAPPCPANFYKIL*TGS/LI*C* 

trcCr VoIMoWiJy V XJjlrirnslrJvVijKUV 


6642 


20543 


A 


6696 


301 


309 


VQEF * DR I LLSPRCHPGWRAVA* SRLTA 
ISASRLKARPSHLSLLSSWDYR*VPPGP 
ANFLHPFCR/DRGFAMLPRLVSNSWTQV 


6643 


20544 


A 


6697 


3 


387 


PCLANFF/ VFFVETGTHYAA* CGLRLLG 
SSALPV*TS*SAGIIGMSHCTC/LQITL 
LKTESHS \ VAQAGVQWHDLGSLQPLPPG 
FM* FFCLS IQSS WDYRRTSPRPANFCI F 

oHDb V JjPCWlrvJWo 1 IrDJ-iK 


6644 


20545 . 


A 


6698 


1 


287 


LWFLYQGNAG I TG I SHRAQ PNS FVFFKD 
RVIiLCCPGWSAVA\HSSLTVALTSQAQV 
I FLPQP PG*LGNFCI FSRD/RGFAMMPR 

Jji,oJNoriAy V JLXirJr 


6645 


20546 


A 


6699 

• 


412 


3 


FEMES CS VARLE CSGT ILAHCNLHT J ifi S 
SSSPVSASRVAGTTGACCHTWLFFFPFF 
V/FLVETGFYHVAQMGLNNWAKAILPQP 

VGYPAKFHS I VLLLL YLLFHLLCGVF 


6646 


20547 


A 


6700 


393 


88 


QKLYPYLTPYAKVN+QWIKNLSVTAKNI 
* FLGENTGVNLHS FRF I KGFLDM / TPKS 

KMKRQPTKYEKIHLTLNNWV 


6647 


20548 


A 


6701 


494 


239 


KAGSHS\VSRLECSGMISAHCTFCLLS* 

QMGFCHVTQAWVLI FDCGFES P INI KYN 
LL 


6648 


20549 


A 


6702 


2 


379 


R PTWVTWQNPAS TKKKXNFFFFKTKSPT 

RPEAPKGQSCSLMGGGLCPLLPP/WSGT 
PPQR 


6649 


20550 


A 


6703 


220 


47 


TGFSGSQIN*EK/ 1 WLGTVAHTCDPSTL 
GGAGGQVT * GQEFKTS RHEPLHLTVLCS 


6650 


20551 


A 

* 


6704 


80 


414 


ICYYFIVFAIYIiNQFFSNCYRSFTFFFC 
PFETESRSAAQAGGYNPTLTLLHPPPLG 
IKQFSCLTLPSS\WARS*KPPCPTWGNS 
VFVGJI YKNKLGGGGA I RRVOTCNOT YSG 


6651 


20552 


A 


6705 


69 


416 


NVE KRNN I HN I KKT P VYFKGFLL YCCKA 
E VFYVLDNPFI FYFLF YFFETHCVAQAG 
VQWHNLRSIjQPSHP\SSSDYRCAPPHIA 
NFCRDGVSPCWPSW/S *APGLKLIHPPW 
SPKVLE 


6652 


20553 


A 


6706 


373 


75 


HAQLTFKFYYFRNTFCXAVASTKSNSSD 
RSE * SKLKTFWKGLTILDAIRNISDSIA 
EEVNMSTFRGV* KKLTP 1 1 MDDFEG PDF 
TRESNHRGARHGGSRL j 
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beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
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4 

peptide 
sequence 


Predict- 
ed end 
nude- ' 
otide 
location 
correspon 
dins to 
last amino 
acid 

residue of 
peptide 

KM %M V* mm ViV 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=GIutamic Acid. F^PhenYlalanine. 
GMjIycine, H=Histidine, I=IsoIeucine, 
K=Lysfne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=Glu famine. R=Areinine, S=Serine, 
T=Threonine, V=Valine, 
W«Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible . 
nucleotide insertion 

iiuvivviiUv i a i jv i %a%#i< 


6653 

* 


20554 


A 


6707 


331 


379 


k*kinkn*kgltildaiknicdsweevn 
ist/ltgvwkkl/ itlidgsegfkisve 

EGNAGWEIARELSIEV/EPEYVTKIiLQ 
SHKKT 


6654 


20555 


A 


6708 


578 


152 


LQQHNPGSLQPLSP\GFKRFSCLSLPSS 
WDFRHAP PCSAN FFVFLVERGYHHVG * A 
GLKLLASSDSPTSASQSAGXTGVSKRAQ 
PLIGFKVNSVKKSVSVPTRRILRGNMSM 
YEOL 


6655 


20556 


A 


6709 

I 


392 


146 


FFETESLSPGLECSGTVSAHCNLGSSDS 
PALPS / RLIX3S SDS PAPAWLYSFKL * I 
Y/ FHKTCNYVLKINILLHVAI PYSFSWS 
C 


/L£<Z~ 


ZU557 


a 
j% 


D/1U 


o 
y 




Puj Xvir v>ll UIU9^0 X XvLi uriiyJjr 1c / oiv IT 

FYFLETF*ESHCVIQAGIHSGTIMAHHT 
LKLQGSSNPPASAPPSSWDHRCGPPPLA 

LVFELLGLKRFLLPQPS 


6657 


20558 


A 


6711 


34 


387 


QDFGFLSFFIiLFFFEKGLGGPTCRAGPE 
S NLME \ GGP PGLKQ FSGLTLPRTGNNGR 
ASRS RATCFWN * KGK&3P PGPPVELETFT 

AiJi\WftAIW Till AmJIvvwc « v F f V " Iff 1 

LRGPPHLSSPKGKN^GGGPKPDFFKKVG 
KKAPNP 


OOJO 




A 

<t\ 


O/ X£. 


i 
i 


loo 


T O AW W YTP WP ATWEA* AGGLiRX E PRRS 
RLQ * AVI WLHS CIjGDRVRP CLNNKKGS 
FPPQPLW 


6659 

• 


20560 


A 


6713 


377 


98 


SLGLKQSSHLNLPSSWDYRPVPPCPRNW 
FVPVP FFF\ RB * GLAMLPRLVSNS * AOV 
ILP PQ PS KVLGLQARTTAPGF FSHLLKT 
LVPNNVNLNT 


6660 


20561 


A 


6714 


353 


229 


SSCLGLPKSWDYRHEPPHLAYFYY**RQ 
ALV/MLPRX** Y* FLDVNM * L PLPFHVAP 
SHLLNSVGMV 


6661 


20562 


A 


6715 


371 


64 


ENHGDCTPAWATRAKLRFKKKT/CYQMF 
ITTLFTIAKRK*LKCPSSDK* INKMWYI 

mam. mat mm Arffc ^ A» <*i*i * 4H mm mm 

HMME Y / YAI KRNWT L VKLNQNP YAL * KV 
LLRE * KQNP * S VRNI FKTHI 


6662 


20563 


A 


6716 


462 


232 


HFNLITIKIIPFF/C*DGGLTMLPRLVP 
NS WAHAILLPQPPK* LGLQARACKL 


OOOJ 




A 


o / 1 / 




AO 


GPGGPPFKKARFGFI*HPF*TPPVLRPRP 

Wi. 47 JL *V4\* A4>\4« \mW mm **t4* J* A & & W 1 4A,\^ AVA 

PPVFFGGFKASPPKKKTPK/PPGGGGPP 
PYFHPFFLSPFFPIFFEKEPPPFFFWRG 
LGPPPPPFKNPFFFFFWVEGGKFLPQKK 
KKKFFQSISTRSVIHK 


6664 

- 


20565 


A 


6718 


3 

■ 


413 


GTFLTKTS LCX3TLL Y * VI S YMCV* FI S * 
\SYLVYIIYFKDICV/LLYMYIFYIYTC 
VHISVCMYMYVCMYVCMYKA* PANLKTF 

W mm A^te ~ fc • mm * » ^» » • ^» — ^mwm m> mm* mm * w * w ^ mmm%m> ^» ™ 

CTQKHLAWGWVQWLISVISALWEAEARE 
S PEVETNLAQIiGPTLSLPRKKKKKKKK 


6665 


20566 


A 


6719 


408 


71 


SSWDYRC/IiPPGPANFCIFSRDGVSPYW 
PGWSRTPD * RSLPGGDVMPALVIRHVPE 
LLKKQHSSVRSSRPAWPIW 


6666 


20567 


A 


6720 


375 


57 


SVIREMQIKTTMKYPFIPTGLIVIKKTE 
ra/RGWQRCR^IKACIY*WWGCKMTQIiL 
QjFCTIW*FLKKLNIEL/PYDPAIPPLQ^* 
LRELKTCLPRJjGWAHACNPSTLRG 


6667 


20568 


A 


6721 


391 


164 


RISETCT*MFPAALFIV/DQNNPDKWTN 
LMWS YTHTMDDYS AI KRNEVLLHSSTWM 
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peptide 

sequence 


Amino acid sequence (A B AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H^Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methlonlne > . 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X^Unknown, **=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














\ENIjENLMLSKRGQSPKTPPG 


6668 


20569 


A 


6722 


381 


278 


SSLQPQTPGLKQSSCLSLSNS\PPRFID 
♦SISLYRQGLTILLRLLSNSWSQAIVLP 
GPPKALGL*ASKKFLYSNI 


6669 


20570 


A 


6723 

» 


2 


363 


QRS LEHRPCKKC / CGLW* VS * KVQYLV 
YKESSSFKNLLRRIHCNFLFSGIVFHF* 
KLLRT PRS FCLCGLHI P INFT I F* I KI E 
KFKKC* PGTMAHPYNPSTLGGLGGWIT* 
GQDGLDLLTS 


6670 


20571 

* 


A 


6724 


352 


58 


HGGAIGARFFKVFNLI*FNLVCKDE/SF 
SLI*RAVINSWPRVILPPWPPK\VVGFQ 
AKTPLPRLKGFLESLPSLKKKKNLAYAR 
TITAQIGLLVCFIDQQ 


oo / 1 

* 

■ 




A • 








KSSPFFFFEOGLLCCPGWSAWSPIiOPK i 
CPRLKQFSPLS / LPSN * EYSCTPTTFS L 
HVCVSIHKYI*YIDQYIFKFF/CR/DRT 
LAMLPRLVSNSWSQAILPP*PPKVLGFQ 
V 


6672 


20573 


A 


6726 


3 


343 


LEV* ARATRQE * K I NGT * 1 1 KYKKVKQ Y 
SQNDMILYIQNPKDSAKKLPELVNKFSS 
FKVNRQNSVVFL*NSNKKS/EKEIKKKL 
/P FT IAS * RI KYLG INLTKEVKNIiGRAO 
WVTA 


6673 


20574 

• 


A 

• . 


6727 


3 


615 


HTSQGNL* IQYHTPWHPPSSERVERMNQ 
TLKSHLTKLVLKTRLSWTKCLP IALLKV 
RTVPQKEVGIjSPCEMLYRIiPYSHFTVDI 
PTFETKSQFVKSYVLGLSSTFSSLKAK/ 
EL FS TDATLGS S PGTSAFSLGDDVLI RS 
WEEGKLKPAWEG P YL VLLTT KTAVQT / D 
NKKMQTHHTP VKKAS PSSKS CAI VPRP I 
PTKLKIKKKKA 


6674 


20575 


A 


6728 


2 


228 


SGAI/SVHCNLHFPG/SSSDSSASAS*V 
AGTTGVHTMA*LIFVFLVETGFHPISQA 
GLELL/NHMICPPWPPKVLGLQL 


6675 


20576 


A 


6729 


98 


359 


KVEYTVLLF*GFKGKFPTLPLYLSFSSF 
LKNHLISNFFLNRVLLCYPDWS WQ* FT 
AHCNLNLSCLSLPSTW\DSRCVPTGAWL 
RFI 


6676 


20577 


A 


6730 


10 


428 


RSTRTDYAALREVLPDPRRCRPRTGAWS 
NGRRDRPGAGHSGPKAEAETRETLEKGL 
LFCCPGWSAVA* PRPTATFAS RAQSLTV 
SPRLECX3GWSAH*NF\SFHFQRWGLAV 
LRTLVLNSWSLALLLLWPRKVLGLYALA 


6677 


20578 


A 


6731 


425 


158 


SQTHDLR* STCLGPKCWDHRCAPPRPAL 
FLF/FNKCPMSIRKIENAQ*LPGI*KKN 
FCRDDHLPLLSRLILNSWAQGIL*CRPL 
KVLGL 


6678 


20579 


A 


6732 


431 


22 


RDRVLLCHPE* SAW* S*LTTVQNSWAQ 
GILSSI*DYSCVP\LPVNF*TFFKSVYY 
YFIIFIFEIYFCNSKALGL/LGVSHRAR 
PPRLVLNSWPPQVILPSQPPE*liGLEAL 
ATAPAIPRNSNLVWHQYSFKSIPDDL 


6679 


20580 


A 


6733 


44 


414 

* 


AI YDLPFLLIDLYPMELRAS IRAHTCT* 
RFVATLFTVTKRLNQPTCPSTDE*QNKM 
WSL/ YIMNYYPAI KRNEILIHENIMLCK 
RSQSQKAHTE*FHICEISRISKSAETED 
ELWATRWGKGRE 


6680 


20581 


A 


6734 


375 


49 


QRLTYHCRFS YNTASNKMRLS * TQMHQK 
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sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I~lsoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N= As par agin e, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Tbreonine, V-Valine, 
W=Tryptophan, Y=Tyros»ne, 
X=Un known, *=$top codon,/=possible 
nucleotide deletion, V=possfble 
nucleotide insertion 


• 






• 






SAC S MCPGRL * GVHAMRPKVLMRLSRS H 
/VARPIAATCAQ* VCDRIK/R/ SLLIEE 
QKIWKVLKAQTQSQTAKFKNETFPRNK 
N 


6681 


20582 


A 


6735 


381 


88 


SLFPRLETPGLRPSFFFFLFSLSH\PGW 
SWV* SRLTTALNS * ARFSLLSWCVPLH 
QAHFF\FFCRDR/S LTMLPRLVLKSWAQ 
AIPPPEPLKRSGLQV 


6682 


20583 


A 


6736 


384 


293 


AKLSLLSGWDCRCMPPCLDNCVCLC/RV 
FVCR*GQAMLPRLVSNSWTQVILLPWHP 
KVLGL*AQDQEPLWGRSGPGLPQAASG 


6683 


20584 

» 


A 


6737 


24 


300 


NILSAFLCLS*P*H+YGLVIL*NVPQFR 
FLPL FFPMQM YTHI FFKIR IRPGWSHA 
YNPS TLVGQGGR\ SHEFKTSLDNIARPR 
LYQVQNKIR 


6684 


20585 


A 


6738 


2 


353 


IHCW * KCKI T YTLEKT VWQPP IQLNI VL 
PYIPA/IPLLGIHTREMKTHIYTK/T/C 
MQMFG WLF 1 1 AKI VFQLVNR* I KL* S I 
HKVD YHSE/ IRNKQL YEAT *LNLKCTVL 
SE*S*TQKTT 


6685 


20586 


A 


6739 


2 


336 


IHATYTNP*HRH\VHTPTHVHT/HVYTH 
NGTHTHTARAWTPAPPTSLILPSPTESP 
PALQIPNLPFGLSSSPHPAGSSQGEPEE 
GGCQRRAKNRVGHRAEKWETELCAGDMN 
L 


6686 

■ 


20587 


A 


6740 


3 


385 


KKQATLLWIIHLATTFSVEI*QSKIEVH 
DI FKVLKEKKK* TTFYPKI TNPGKI AF I 
HEGKI KLY/IVK* QLRDF INSGLVLKAM 
LKRVL* SKRKGH * QTIRNLRVQNSLVTA 
NTQPNTEYCNTVXWGN 


6687 


20588 


A 


6741 


406 


155 

* 


TGSCS VTQA* / CWSAVI VHCSUDLPGSS 
NPPASASWIAGTTGMCYHAQLPSENLCL 
QFYCMGCVFSRFYLRFTFGLQS YTRKQK 


6688 


20589 


A 


6742 


I 


215 


LP YS PDLSPTD/ Y/HFSKHLDNFLKGKR 
FHNQQDAENAF * KF I ESQS I DFYAVR IN 
KL I SHWQKRVDCKGS YFD 


6689 

i 


20590 


A 


6743 


376 


109 


YHMEVWNLYSLKGSSYLRLPSSWDYWCP 
PP * QANF* K\ FFCRDKDLAMLPRWLNS 
*LQAILLLRPPKAEPLQLLGQFQFRRES 
GWN 


6690 


20591 


A 


6744 


382 


148 


NI PQFMFDI LFQ * LI TLS CPAS F*S KFP 
LSSLYFIQ*GR/WLGTVAHACNPSTLGG 
RGQR\ *A*DQELETSLAHKARPHL 


6691 


20592 


A 


6745 


403 


209 


H IV I NR 1 1 RKY CA* H YANKFNNLDKLDT 
FLE\NTTCKNLTQEETENLNSPVSVKEI 
EQGKLCSDRN 


6692 


20593 


A 


6746 


366 


40 


QGSVLS I LNVKI VFWFKTGCVHFLFPLN 
FLS L FQNYVCSHTS / YPLSL I PHESLWS 
FPPSFNHSS*Y*HFLKTN*MPDAMAHAC 
NPSTLGS*GRWIT*DEEFETSLANI 


6693 


20594 


A 


6747 


392 


57 


APKKKGGGVYPLYPKKSGRKKKPEG/PP 
PKPKKKLKTPGRKNDAKDI KPGGAKFFF 
SPKKKRGAPPGVPPRLF*KPPQATI*GG 
PKKKKKPPQEGGKKGAPPKAPPLFFFFL 
YF 


6694 


20595 


k 


6748 


376 


140 


LEDNI / GYSNQNAGAGRVS * INTPKIKV 
S\QSKIAKWDLIKPKRLCAAKVSMNRVN 
RQPPEWDRIFALASAD\RGFLNKILG 
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peptide 
sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycme, H=Histidine, I^Isoleudne, 
K=Lysine, L^Leucine,M=Methtonine, . 
N=Asparagine, P=Proline, 
Q^G 1 u tarn in e, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 


6695 

• 


20596 

■ 

* 


A 


6749 


540 


172 

• • 


F FLRQSL / DS VAQAGVQWRHLGS LQAP P 
HRFMPFSCLTLPSS*NCRCLPPCPANVL 
YF* QRWGFTVLARMVS IS* PRDPPASAS 
QSAGITGMSHCAQWNTLSLHAFSCDKV 
PQNDLL / W PS KS F CGTLSHE KACKDNVF 
TTWAQWLMPVI PAIiWEAEAGGSRGQE IE 
TILANTVKPHLC ■ . . 


6696 

• 

-■ 


20597 


A 

- 


6750 


1 

■ 


353 

■ 


WFLERESTRSENSVNIVBMTT*DLEYYV 
SKAVAGFEKIDSNIKRSFTVGKMLSTSL 
H/CYREIFHERKSQSMCQTSLLSYPKKL 
PQPPQPSASATLIVNIKAKISAKRIKLT 
ECSDDC 


6697 


20598 


A 


6751 


404 


231 


SPS/PPPPYFGGPPGFFPPPVF*TPPPE 
YFFGAPKKKKNFP P PGGKNFFFFKG P P P 


6698 

■ 


20599 


A 


6752 


77 


371 


ITCPVLYLTSROTKTKKYYLDCCCFWP* 
WNNRDQTYLHLFFF/CFIKKNFFFQPKR 
GARGGFCINETPPLGGKGNPPPPPPGRG 
EKRAPPKTTjCFFFFF 


6699 


20600 


A 


6753 


354 


169 


ETESHSVTHAGVQWHNLGSIiQPPPPGFK 
RFSCLSLPKK/WFSVKLN/PNT*SFCYH 
ILKALII 


6700 

■ 


20601 , 

• 


A 


6754 


1 


442 


CICRTAFELTGRATRSEKEERKKERMYD 
* KKERKKEKEGI LLKKDGQEBEEGRKEE 
RKEGRRQTQDE FGGQNRLSTVKQGS * PK 
KRK*GIQ\QTGAPTQERDKETSQNKNEE 
KARHDS *QPQA* RTQSRLQQEGRKLWKR 
GIA/SRRK 


6701 


20602 


A 


6755 


366 


262 


FF * RWGLTMPRKLVLNSWAQVT LLPP P L 
E*LG*QA 


6702 


20603 


A 


6756 


3 


223 


GLKQSSCFCFLSNEDCKHVPP\HRLFFIi 
FVCL FCFVLFFW * RYGGLAVIiPRLVLKS 
WP*VIPQPWPPQMLRLQA 


6703 


20604 


A 


6757 


418 


244 


FSLLSSWDHRYMPPRMATF/ SFVLFCIiR 
EGCLTRFLRPV*NSWTQAIFPQSPKLLG 
LQV 


6704 


20605 


A 


6758 


46 


256 


KLKNKTEARHS S S ACL* / LPGS WDHRYL 
P PRLANF * TFWRDRVSLLP KGVSNLWAP 
TILLPWPPQVLGLQV 


6705 


20606 


A 


6759 


391 


55 


FIFFFFPHSSHGVSPCCPGWSRTP\GSS 
GSPAS ASQHAG I TGRNHHTRLQD I Y* LE 
/TGQTVTLQFSVLADTTLTKRLSWSILC 
CYNKIPETG 


6706 


20607 


A 

* 


6760 


377 


2 


FGEPPGFSPPPVLKTRPRNLFLGPHKKN 
INFPPP/WELNLVPLKGPPFFFFFLR*G 
LAMLPRLVSTSWAQVILLPQHPKLLELQ 
VHAMVLSSRTSHSPRSRSLSPRFSQRLL 
PVSGNPAQPSPMA 


6707 


206O8 


A 


6761 


2 


259 


LTI A* KNF I FLGL* QGICQ I PKDYQTSL 
R* IKDNLNK/ WDTP CS WIGGLNI VEMS V 
LSKLMYRFRAIQMKTTSGFPFFFFWKXG 
IFF 


6708 


20609 


A 


6762 


1 


169 


FFETEFRS FA*AGV* WHDLGSLQPQPSG 
\SSDSPASAALPFFFFFLKKKIFFFPPG 


6709 


20610 


A 


6763 


1 


496 

9 


MATCTPKHINTQI CVS FLLSLKLS EFEQ 
MS I FKKGENELALGTX3NIRP YDS CPGNR 
P/APFATSPAPRKPSGPSWRPPTSGGVP 
ARAECPAS PNPLPPGFCGQS PSRGRKEB 
♦LRGESTSTLAIF* *APNRYHSF*LPRQ 
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NO: of 

nitrlentide 

dvlJIIVIIkiV 

• 


SEQIJ) 
NO: of 

y^yf uuv 
sentience 
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M 
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SEQ ID 
NO: in 
USSN 
09/515.1 
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niirleAttrfe 

UUUvVUUG 

location 

IvWa UUU 

correspond 
ing to first 

amino acid 
residue of 
peptide 
sequence 

* * 


Predict- 
ed end 

niiflfs* 

IIUUv 

otide 

location 
correspon 
dine to 
last amino 
acid 

residue of 

nentirfe 
sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutflmic Acid F=Phenvlalamne 
G m Glvcine, H=Histidine. I=IsoIeucine 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
0=GJutamine R=Areinine. S=Serine 
T=Tbreonine, V=Valine, 
W=Tryptophan, y=»Tyrosine, 
X=Un known, *=Siop cod on, /^possible 
nucleotide deletion \=nowihle . 

nucleotide insertion 














P * PHGD CPARKKKKKPND PRLQPTVT 


6710 


206 n 


A 


6764 


1 


229 


VLUiCFFFFVL* DRVLVGHPG WSSRLYL 
KTKTOKKONN I F VCRVGG \ LTTLPRLVS 
NS WPQMVLS PQPLNALGLQV 


6711 


20612 


A 


6765 


580 


273 


WRQSLTHSVAQPGVQWHDIiGSLQPPPPG 
FKRFSCLSLPSSWDYRRAPPRPTNF\*Y 

i-i vy 4 ctjc nra v\j^/-tvjj.ir» i jit 1 oouur lovoy 

SAGITSVSHHAQLDMIFLR 


6712 


20613 


A 


,6766 

i 


220 


239 


AQEFKAAASTTALQTETLOilOCKKKKKG 

UVOilWVOoJ^f KXOUUnOV XOrolF/ v. Jt? 

FFFF*DRVSLSGVQWCSLQP*TPGLKRS 
SSLSLPSSWEYRHVPLHPA 


6713 


20614 


A 


6767 


89 


387 


FLSPSGSQKEHLQIQLRSTFSPFFFFKR 
EPRFGRVGGQCPNLG*WKPPPPG*RQFS 

r*T r T T T. r P / C\TMT7T3Mi f T , DD'D'D r I T KTT?tffTI7WXrrt 

NSPWWKRGAKIFGPWDP 


6714 


20615 


A 


6768 


397 


223 


IREVGWINWQHKDPO)/KWNCSVY*Ij* 
HTQK 


6715 


20616 


A 


6769 


3 

i 


367 


SWKVCSKRPIPSPSJ^F*KHPRVIiRAD 

GT* *vslatl*hmqi^s*agsvkeaeprs 

MAP PHIf ATPT.RTiPPPTT) / <! <?T.TMT.PP TiV 
nnr rnxvAJir ur J-ir r \*±u / ooiiinurivu v 

LNWPQAILPPCPPKVLGLPARANSSSL 

SVFKLTSFCC 


67 Jo . 


206 J 7 




6770 


A OA 

4U0 


260 


SASQSAWITGMSHCAPJ*CGTHYHLCERF 
LVSFVTALLSQLLELGLAHSRC 


bill 


2Uolo 




6771 


A 1 Q 

418 


1 in 
130 


P Q DP HT JJT.I?DT ,NflP DT »PPP PI?* T>P VT .T if 

CLGWSAVASN\FDLK*STRLGLPKCLDY 
RHEPLCPAMWDTLS SIj* E I FGLFCHRPS 
LPAPRTGPGSQ 


an i q H 
O/lo 


20619 


ii 


6/72 


ill 




ASPAMLPY/QPVEL*AT*TGIEVLTPGC 
RFOAVFGLKVG FHOGPTPVYIjG I 


6719 


20620 


A 


6773 


465 


283 


CPSLPSSWDYRHPPPFJPANF/*YF*YHV 
GQPGLELPTSSAPPASASQSAGITGMSH 
RAWPK 




20021 




Of /4 


*no 


QI 


P DH^ PCIiO POT PRT.1CO Q ^JrTj Q LP Q D VP 

HMLPCLAGWLAGWLACQTDRQTDRQTDR 
QTDRQTDR*TDR*IDSRDR/SLAILPRM 
FSDSWPQVNLLPQ\WPPKVRRLWA 


6721 


20622 


A 


6775 


40? 


253 


LI FAFF \CRDGGLAILLRLVSNS * PQAI 
TiPPWPPKVLiGLOaRAPGVRPDljS 


6722 


20623 


A 


6776 


3 


278 


FFFFSLHLWAPLAFFFFGAISAHYNLPI 
PG\SSDSPASASQVAGTIGACHHVQLF\ 
VFLVEMGFHHVGQAGLEIiLT \ * VIHLPQ 
PPKALELQI 


6723 


20624 




bill 


79 


442 


CPGVSDPSLHILF/CFFKEN/ SCFAPQA 
EGGGHNLG* LEFGPPGLKQFSCLTPQRI 
WNNGGAPPP PLYFFFLKKRGGS PLWPOW 
A*TS*PRS 


6724 


20625 


A 


6778 


77 


417 


HI^IjHLSFFLF/SFLFFFLKRAALIjWPR 
EQGRGWIWWCTLSLRGQGISKAIjPS*E 

agtkggrphptlflvf*kktsfsqvdqg 
gidlwargnpppgnlkwfelqketlcpg 
hif 


6725 


20626 


A 


6779 


62 


235 


IQELRIREGKCLAYEHTARNGKSQDLHA 
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UUVICvUUv 
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SEQ ID 
NO: of 

npntfrlp 

sequence 
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M, 
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od 
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09/515,1 

26 
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nucleotide 

location 

corresnnnrl 

ing to first 
amino add 
i residue of 
peptide 


Predict- 
ed end 

nnolfSa 
n u tic* 

otide 

Inmtinn 

correspon 
ding to 
last amino 
acid 

rMidup nf 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Clutflmic Acid F^Phenvlalflninp 

(XJIycine, H=Histidine, I^lsoleutinc, 
K— lysine. T ,=1 .purine*. M=MethlnnInp 

N=Asparagine, P=Proline, 
Q=Glu famine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^ryptophan, Y«Tyrosine, 

YcaT Tn knnntfn *sR^nn rniinn ^znnMiKlA 

nucleotide deletion, V=possibte 
nucleotide insertion 














GLSCLCCCCCCCC\CNCWP*STRKFFTK 
PB 


6726 


20627 

• 


A 


6780 

• 


908 


566 


SKMEFRSCCPGWSDLGSPQPL\PLGFKQ 

T,\ Qr^.QT,p\P7iWVVPW7V.P'rWPIVMP\ #1?T. 
u \ o ^ .UO.LJ.tr \tuinl Xtvn/Ur Inrnnr \ r J_i 

VBTG F FHVS QAGLELPTSGDLPTSASQS 
AGI IGVRHHAHPECVFQKQLSEANLRRI 
WNQED 


6727 


20628 


X 


6781 


11 


483 


TTALLKAVRH/G*HLSLQRLLLSF\VCL 
C PAP RGG AYRGRQ AS * S CGGLHPVRAS R 
LLCLPNQAWAMAGAPPPTSLPPCSLISD 
CCASNQRDS VGVGPSE PHAGYNLPVCRP 
LS PLEKRS IRVGVTRFSRCRLSPLSLTR 
KGNSLTPCASRVSSASPCS 


6728 

• 


20629 

i 

• 


A 


6782 

• 


225 

• 

■ 


589 


SRRSEPGNFRSCLGVGVTCCMCLSLSLP 
SRSFLRPLFLSLSVCVFVCLGTRVPCAP 
EGGFLARRPFFWSASPRVSAWVLWPVGN 
RFPGGSGLGVCEGLGNVGI \WSEPQQPS 
FPPHPEQPLC *AGSRRALPNQGQR\LPG 
AHRLPAGGCPQS FKKVvvTPVALCPHRE 
M* PQLDAGTEKEAGKGMGQACLSLKGWP 
SWPSHPFDTPPRMPWVAWPPRILPGLW 
PLL* PPAGRNARRKKPETDRRGRSAGSQ 
PUACAJXa VK Vb LP KSGALVMS rvTOLP VC 
PWAALS PVARSRGDQNPATSGAAWGWVL 
PAVCVSLSLSLLALSFALSFSLCPSVCS 

SLPGCCXSRLAIVFPAVPAWGSVKftWATW 
AS ASEPQGFS FPPHPEQPLC 




zUoJU 




O7o3 


458 


170 


£TV.J. V liu/uUOAV7l 1\3 V ofLK. 1 KJ_iJ_i VIN J? 0 

KVGSVTISLG*EVGLRHWP/NI/GRLDK 
TGPGEKQHFILHAPQSAPSVYHCLGNTQ 
ELPPLSTAMISQPKR 




*yf\<i i 
ZUOil 


TV 




I 




WWflt TT.T3T.COD 7A^VnwUT*lT J'SCT .Pl'DDTJ 

PGFKQFSCLS LP \ RG* DYRACATRDGL I 
FCNFRLGD 


013 1 


ZUOjZ 


rx 


£Tfi< 

0 /oj 


1 
J 




/catgfvliylcg*qclcg/c/c*bdpf 
yceqlcvhvcs cvlgrgws vcvcisrll 


0/JZ 




7V 
A 


0 /BO 


/lab 


IZo 


PPGGGGAVLFPP PWRGRPENS PP PGGKR 
C/QPN* IFFLPPHPGHKTQGPPPKKKKK 
KISAEFLSW 


O/JJ 


ZUOJ*t 


a 

n 


0 /o/ 


1 
1 


Alii 


LIPTLTD/EY*GFKASVEEGTAHVLETA 
RELEVEAEAVTEL/LQSHDQT 


6734 


20635 


A 1 


6788 


3 


237 


FFFLT * VLTI V/ PG * SAEVQS * LTAAST 

^ \ V "1 vTi~m rri? r 1 Hrrlrrn rf lm j» 

FFFFGKKSLILHPGRGGPSLPTP 


6735 


20636 

&VUJU 


A 


6789 


2ft? 




PHPDNF \ RFLV* TGCIiHVGQNGLALLSP 
GYPPTLASPSAGITGVSHCTRPKRYF*M 
VLSNKSl^NRFGLCTVIiPTCLQAPRRTGP 
PPATVLGRGT 


6736 


20637 


A 


6790 

• 


62 


328 


FFRDRILFCHPRWSAVIQNSWLTEASKS 
W\lK+SSRIXaPKCWDGRHBPL<^NTT 
Y*CRQNTLPITLIGCFRVSLPYAKYTGK . 
FWGAEG 


6737 


20638 


A 


6791 


67 


353 


TNKHTYFLELNKREKPQLTVFFFF*REL 
RGL I LANGNFRLRG * SNSPA* PSKVAGI 
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NO: of 
nucleotide 
sequence 
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SEQID 
NO: of 
peptide 
sequence 


■M 

eth 

od 
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SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
dingto 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
(^Glycine, H=Histidine, Wsoleucine, 
KpLysine, L=Leucine, M=Methionine > 
N=Asparagine, P=Proline» 
Q=Glutamlne, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptopban, Y^Tyrosine, . 
X=Un known, *=&top cod on, /^possible 
nucleotide deletion, V»possible 
nucleotide insertion 














TGPPHQAGLIIiVFLEKTGFNYVG/RRGP 
PPPPPKELALQA 


6738 

■ 


20639 


A 


6792 


744 


382 


MTFKKTAHIYFYLYLFIYLRQSL/DSVA 
QAGVQWHNLSSLQAPPPGFMPFSCLSLL 
SSWDYRRPQPRPA/NFFYF* *RRGFTVL 
ARMVS I SRPCDPPASASQSAGI TGMS PR 
AWPKLHIFKL 


6739 


20640 


A 


6793 


372 


129 


RVLPCHSGWTAWQS * L IAVS TSQA\KR 
SPHLS IRHVPPHPA/NFKQFFCRDGLIM 
LPRS VSNFWAQAI LPPQPPKVLGLQA 


6740 

■ 


20641 


A 


6794 

! 


395 


25 


QPGLFCFTPRKKPDSFSKVFAVLSF/FP 
PSL* FS /PliSL* VQEMVSICCPGWSAV\ 
TPGLK* SSCLSIiPKGRGYRCAPPHSANL 
KFiE^*R*VIaFKLPRIjGI^\SvrVQVVLCO 
WPPKGPSLRACTSKI 


6741 


20642 


A 


6795 


412 


176 


KKS * / CLATPRSENWVL*NGPPLFFFFL 
*RRDLAMLPRLVSNSWAQVMLPPWPPK\ 
VLGLQAGATMDHSCVQYSHSFAPE 






A 


0/70 


AAA 




NFVRNCOS FLOSGCTFLHS YOOFLL / PC 
PCQH* IMSVFQILTTVTGVYMHLV* ICS 
ALMTFDLFMCHVLI CHLHI FFGEVSVQH 
FSLLLFLFESGSIY\ft7QAGLHLLFSSNP 
PASAS 


6743 

* 

* 


20644 


A 


6797 


378 


732 


FFFWLNLGSSQPPP\PAFTR\FSHLGLL 
NSRDYRHLPPRLANF\ * FLVETGFRHVG 
QAGLKLLTSGDLPALASQTTGITGVTAS 
PSLDEGVFKALPPPVGTTLCLPPWTTGG 
PFLGPAL 


6744 


20645 


A 


6798 


444 


1 


PLCFP PGGGGFLS S HQGFP PP P PKRKGS 
VGAKKKNI PPGPFPFFFFKKNFFF*GPY 
PF/SEPKKARWGPPPPGPGFSPPRSLVS 
PIiGEKKKKKKIPPPETQPPSFGA*WGKK 
RAPPPGPRAGGRNSFLEGGRGQGRTGGS j 
VREFRAGR 


6745 


20646 


A 


6799 


462 


83 


IEPLRPALCLS*SNPASI*KPHFLLFF/ 
CLPFLSFFFS /S FPFSFPFLFPFPFPS F 
FLIPFLSYSLCCSGWSTVA* *GLTIAS\ 
TYGLKQSSHL/SLPAS WFPCLTN* KKI F 
FRDGGLTMLIRIiAYLFtOT 


6746 


20647 

4 


A 


6800 


433 

• 


185 

■ 


PPPGGRLLVEKTRGEGQLKPGQPGRQAG 
LGMRGNPGKAQPRPASGAPEMQLPARPP 
SGGHLYGCGAKNNEGGANPLLCGGTGSG 
S VAAAGE VS KS APD S GLMGNS MLVKKE / 
AGGGGEPQANQETEN*EG*PMSPESGAL 
LLTSPAAATLPLPVPPQRRGFAPPSLFL 
APQP 


6747 


20648 


A 


6801 


395 


60 


IEFLLFLPLSVDGRLWSHQLLPVMNEAT 
/MSIP*HAFAWTC/SSFLRGGCLGA*ST 
UjYGG YMFS YLRNKQT I FQS S CC IVNGE 
QPRMRNRLFS*FIQHIiVFSVFLIRAILE 
GI 


6748 


20649 


A 


6802 


423 


98 


RGVPY*PGGLQPPPV/REFFFFFIGRVS 
VLQAGAAAPTCRAQSILPGSWDYRHLHA 
NMSS*FLYFV*RQVFSMLPRLVSNSWDQ 
AIHPSQPHPALFFTCWSVKPLATL 


6749 


20650 


A 


6803 


396 


194 


SLMKIaAHV*VLHTDGGFMHPDSAISCHD 
I FDS LHLTGEGSAKX CRALYBLI MQLME 
EHP\GEKQTTIA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 
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M 

eth 

oo 
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SEQID 
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* 

beginning 
nucieoxiae 
location 
correspond 
ing lo nrsi 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
location 

COrTcSpon 

ding to 
last amino 
acid 

residue of 

at A n aVa <«. 

peptide 
sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
F^niutamic Arid FcsPhenvlalanine. 

(XJIycine, H-Histidine, I»Isoleucine* 
K=Lysine, l>=Leucine, M=Methk>nlne» 

r\J— : A cnara ainp P=sPrrtline. 

Q=Glutamine t R=Arginine, S=Serine, 
^Threonine, V=Valine> 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucieouQc uciciiun, \— pujoiurc 
nucleotide insertion 


"6750 


20651 


A 


6804 


373 


192 


. PALNPWAQANLPHQT PALSSNFFYF/ CV 
EMGLTVLPRLV*NSWPEAILILLPWHPK * 


6751 

■ 


20652 


A 


6805 


370 


242 

■ 


KKGTSKFLFFPS YS FFSKFWYQKKVKIA 
FFFF* DRVSLCHPE WSAVW* S * LSAALT 

GVSMLSRLENWPFCLF*PVFQPGQHGNS 
ISTKK 


6752 


20653 


A 


6806 


464 


182 


DGPHRFEP PISS* FLPPWPPQKLEI KG * 

VYSVYSWVSSRLTAAS\TAVLKQSSHLG 
LLSSWDYKHHH 


6753 


20654 




6807 


3 


228 


Y T CAG F FRRQS LCHLGWGAVARS QLTVA 
HLACVNFNTVKD VC FSC ILS 


6754 


20655 


A 


6808 


506 

■ 


173 


NliHLPGLKQFSCLSPPPRSRDYRRAPPC 
TANF\ * FLVEMGS RHVAQAGLELLNSGD 
SPASASQSAGjTI tiVbrlrloU A otir &U1 Ij 1 
STRISEVIFLTLLIaRILHDLPCLKLFSL 


6755 


20656 


A 


6809 


676 


397 


VSSCPRDQPGHHGETLSVLKIQKLARHG 

RLSHCAP/AWATEQVSVSRKCKKRQLHQ 
KERIPEWVRS 


6756 


20657 


6810 


367 


31 


QVFQELYSQDLFIYLETLSLCHPG*SAV 
ARS*WTAASDSW\VKQSSRLSLPSSWD 
YR* VPPYSAHFDVNSS FLTYCTS PGLTQ 


6757 


20658 

- 




6811 


469 


25! 


LLSSWK/YQVPPPHLVFIKIL/CRIGRE 
TASCCI * PRLVVNSWPQVILRPQPPKML 

/>»r f O 7\ TMD/2DT C1PE'*'" , Trf" r T5Y'"'77 

bijuhn I rlr^yir vr u<Lf 


6758 

* 


20659 


T-. 


6812 

I 

I 


16 


462 


EIISYCGFNLHFSNDYKKKBRKRERERK 
KRKKENIGE/HI *DIGVGSDILDLIPKA 
WTTKLNI DKWYHI KL/ QKLLSS * RVAKT 

SQTSDKGLISNTYKELKQLNSTETNKQK 
NNFDLLSADKG 


6759 


20660 


A 


! 6S13 


492 


243 


RMVLIS * PRDPPASASQSTLVDTSRYKV 
YQNI HQQVNMF * NYGWCFF1) I FTASVGD 


6760 


20661 


A 


6814 


32 


308 


E YTLPHP YLITGPPPVPKPET YSHS * YF 
T . Q PTMTrtP T n\ P APRTLNTAS STCIiSRRV 
TARPVPIYQDPSLEPLQLPPNSLTLPTP 
LRHSPCWV 


6761 


20662 


A 


6815 


97 


274 


GRFLB L KVKRG S LGLGM WHAYNLS TLG 
G*GGRIP\QGFKASLGNIAKSCLYKNKK 

GAA 


6762 


20663 


A 


6816 


74 


376 


DFFIVYSVFTHLVNIKDVKDVLYKSLKY 
FFAIYRMGENIYKLYILGGINIQNISMB 
YYTA/ IKKD* ILLFAATWM/RLEDI ILS 
EISHEQKS\KRCMFSLICGS 


6763 


20664 


A 


6817 


1 

* 


1127 


MRVTAPRTVLLLLSGAIALTETWAGSHS 
MRYFYTAMSRPGRG E PRFIAVGYVDDTQ 
FVRFDSDAASPRMAPRAPWIEQEGPEYW 
DP»ETRNMK\ASAQTYREl^IiRIAl^YYNQ 
SEAGSHTLQRMYGCDVGPrX3PO»LRGYHQ 
DAYDGKDY17VIj1^DLSSWTAW>TAAQIT 
QRKWEAAP^lAEQLRAYLEGLCVEWIiRRY 
I*ENGKETLQRA\DPPKTHVTHHP ISDHE 
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NO: of 
nucleotide 
sequence 
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NO: of 
peptide 
sequence 
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M 
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od 
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26 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

* 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
C s €ysteine, D=Aspartic Acid, 
E^Glutamic Acid, ^Phenylalanine, 
G^Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M-Metbionine, 
N=Asparagine, PMProline, 
Q=G I uta mine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W«Tryptopban, V=Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, Impossible 
nucleotide insertion 














ATLRCWALGF YP AE ITLT \ WQRDGEDQT 
QDTELVETRPAGDRTFQKWAAVVVPSGE 
EQRYTCHVQHEGLP KPLTPERWEAI LPS 
PTSPIVGHCAWAGAVLASWWSSGAWAT 
VDV*EGRSSGGKGG\SYSQGWRPSRPVP 
QGFWMCLSTALKKP 


6764 


20665 


A 


6818 


34 


382 


STAGAIHVCMCMCVHVSTCVCAKMCVRV 
KF* I WDI ETGAAQIiSCXTVLVAQTEI I YL. 
QERNSSLSERECVCAK\CVCMCMCVHVS 
TCVCVCVCV* V* KRHNWPGMVAHVCNSS 
TLGG 


6765 


20666 


A 


6819 


1061 

- 


305 


FFLR WSL/ DS VAQAG VQ WRDLGS LQAP P 
RG FT PFSCLS LP SS WD YRRPLPRPANFF 
YF* * RRG FTMLARMVS IS * PRDLPASAS 
QSAG I TG VSHRARRRNT I LY * HLNFVI K 
SHKPPDI VFROL *GPRLHLENPVL *GAK 
SHSSCPWLNLSPPPPHAGSHCLSSSSPR 
MGARQDL L FNAS SHTGLFPFLQIiNQTAS 
RPTSQVPKSLCFPLPWVQPFALPWVRQD 
EGRASVNSPQDPPRVPSCAGFTHSCNLS 


6766 


20667 


A 


6820 


610 


248 


ERRSHPVA*AWKWCHLGSLQPLPP\GS 
SDP P\ T * AS * VAGTTGVRHHARL I FVFF 
IETVSHHVAQPTGF * TPGAQAIRLPQPP 
ENAR ITDVKPLCLAHFCYS KHF I MVPAP 
CISHQKVFH 


6767 


20668 


A 


6821 


3636 

• 


3332 


TCLGSHATPHCGHS FS SCKI LLF * FFVF 
EM*SHPVAQAGVQCRDLRSLPPPPL*FK 
RFSCLSLPSSWDSRRPPPRLANF\CIFS 
RDGILSCWPGWSRTPDLR 


6768 


20669 


A 


6822 


563 

■ 


146 


APGLSFNCYIPPAPAFMPGELTLFEVD/ 
NRSIFPTEY/TIHILISSBDILHS*TIP 
LLGLKTDAI PRCLNQTTLTTGRPGL YYG 
QCSEICES/N/HSFIPIVLBLILLKYFK 
T*STSTLKYHCKAIQHLPFKLKIDGVYT ' 
SLQ 


6769 


20670 


A 


6823 


399 


201 


KAEPPPAKSWSKGGYVLPRS / IRUDHVK 
WALE PDD IAVLNFMKERHPQS KS * TILQ 
KWIQPFSPIVQK 


6770 


20671 


A 


6824 


323 


323 


LPTVIVI ILSFYNFFFFSMRLIRTLSFI 
CMYVCICIYVCIYVCIYVCVYirYVCIY 
/IYINISF*NDRISFFFMLB*YVCVYIH 
EWIKKI 


6771 


20672 . 


A 


6825 


396 


122 


LPPGGHRPAVPGGREMGPSRGNPGNPGR 
P.* KKGPPSPGGGPAPGGPPS PEG * ARKM 
G*PRGPSLP*TQIPPPGSPP/WGPPGVT 
PETLVGFKKRGHPPRGGGPPPGVPLPRR 
AKPE KWVNPGGQAFHE P KS PP RG P PRGA 
KPETLSPKKKKKV 


6772 


20673 


A 


6826 


414 


64 


RGVEGGAAC I FKGLS VAGKPG / YLSAMR 
SCFRRMLLFKS*G/QVIATKHMLPTVLR 
MVGDPVANVRFNVAKSLQKIGPILDNST 
WQSEVKPILEKLTQAQDVDVKCFAQEAL 
TVLSLA 


6773 


20674 


A 


6827 


2 


519 


KTKTCLVEE I S * I TGRFLERINKI DKFL 
VNL*RKK\VQVLNIRNSKVDVTTDRT*K 
\ 1 1 KEHYGQLYANKFN/ 1 DEMDKFLKRD 
KLAKLTQEEIENLNIPVSI*KFLI*NIP 
KMKTP/GPDGISREL 
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SEQ ID 
NO: of 
oucieouac ^ 
sequence 

'* 


SEQ ID 
NO: of 
pepiiae 
sequence 

* 


M 

eth 

oa 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

rt aa^i fin 

nuueouue 

location 
correspond 
ing to first 
amino acid 
residue of 

nailf inn 

' peptide 
sequence 


Predict- 
ed end 

II u LIC* 

otide 
location 
correspon 
ding to 
last amino 
aciu 

residue of 

peptide 

sequence 


Amino acid sequence (A s Alanine 

OCysteine, D=>Aspartic Acid, 

mj \j i u ut iu iv aciu j x — x uciijriaiuiiinCy 

G=GIycine, H=Histidine, I=Isoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu famine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
it— i rypiopnan, i— lyrosinej 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucieoiiQc insertion 


6774 

* * 


20675 


A 


6828 

* 


413 


114 


GGKRGPGP I PHP P ARGPQ * GPGHPAVQV 
PEP PGLS FPH FRGGFP S S GGCQRGEGRK 

PG/APGPEAPGTGTTMSAPTWPEPP/G/ 
GRQVQDVPAFPEGSARQAR * HEE * LHWK 
KILL 


6775 


. 20676 


A 


6829 


479 


261 


PCFSFLSNEDCRHAPP/RPV*F/LVERG 

CDtn/ADJlPT T?T7T T*C/^T CA O TV PrV^Br" T 7\ 

rKKVAKAuJ-illir Lil ^o^ijFAoAi>ytiA\jlA 

GVSHCAWPLQVGIEASLQY 


6776 


20677 

i 


A 


6830 


492 


313 


HPIGWPVNWGRASAVGEFRDWKIiFYPEC 
B IRTMGGREHGQS PGAQRTFFQLLLS FX 

tnjotr 
VlsSK 


6777 


20678 


A 


6831 


378 


1 


SNGICLQASSASPNKIVPSTSLKVCSN\ 
VRVDGSRNPSVAFPPLTPMLFAVAPSIS 
SGMGIETIPIQGYRVDEKTKKCYIPF\V 
*ANRESPSGVYNINVHALvSGPPLSDDS 
GVNKPQMKHQHCSA 


6778 


20679 


A 


6832 


374 . 


76 


QLVEKWTGNVKSMTDRVLTMRCELRSST" 
*ALRTPGTWNLINDQIGMF\KFSGLSPK 
QVE CIjVNEKH I YLMPSGRINVSGLTAKT 
LGYVAAS IQEAVT I IQ 


6779 

i 


20680 


A 


6833 


369 


56 


LKYLGTSVENRYADQAKWLSPVISALW* 
FE/CGWIARSEDRDHPG*QSEIPS/PAN 
FLFLVEMGFYHVDQAGLDLMTSGNPPAC 
NTGMS HC i K PGRTDVLAI L»CS DP 


6780 


20681 


A 


6834 


457 1 


142 


DCLALSSRLDLQWCDLGSLQAPPPGFKQ 
FLCLSLPSSWDYRCLPPRPANF\*FLVE 
TGFHHVGQAGLKLLTSSDLPASASQSAG 
XTGXonKAWiruSo r eXsSSoo 


6781 


20682 


A 


6835 


407 


35 


TAP AI RG VHLRGRG WTVYGAV * AVP PAS 
I PAYPGWYQ YG FYGADL YG \ DMQHSDM 
RGLMVSQHSPMPHT\LQQS\SVTVMAGC 
TQPSHTMPLPLPLVY\ELALWRVYABVA 
TSYIiPPAEVMLAPCK 


6782 


20683 


A 


6836 


1 


428 


GKTLNPPSLGKSFGLIGLPPENDWPPDV 
SVPRAAFPLRGPRSVQSWREMEQSVAQ 

T T T * UT H » LJCP T*>T_f V U TOTV T3T3 7\ T \ ATIDVT TTT/ 

LlLIj * WJjTivIlrHKK I SAFRAIj \QHb YJjHK 
DEGDPE 


6783 

• 


20684 


A 


6837 


3 


612 


FKGSGVFNR KKKLEEHP LW QD VALGTAL 

CPQLPSLPGWGVIEWPRRAACLPCNPQL 

CPHCCGPQEAQYPL*AGGQ* GQHQEKPG 

ANOrnFPbCaryv PbAJPGrS/WlAI^^ 

S PAVAQNP PQRKNTLPRG P L PRGAPTTT 

P*SLRPR*TPVGAGLAAPIRSHHPGCSG 

QEALALCPDRPHKDRGVRRGAGGPRGRR 

VQLGGIPP 


6784 


20685 


A 


6838 


3 


412 


IKWbr IMljlKLiVIjbb*PK/(_UrFA£xftbU 
S VGITGVNHRVRLFFFFFGTGPHS \MAQ 

RGGPHTPPNF/C I FFGKNG VS P YCPGWD 
FGAKAILPPGPPKELELEGSPPAPGKVF 


6785 


20686 


A 


6839 


419 


26 


EETSRLWS IGQCPLLIQTAQVLS FTSGH 
GRHREHRKPTDS PLPS ACPAP / RIiTPPP 
SMCPASHSLPSSQCMPGPTLTPPTACAR 
PDTHSPPPSVCPAPLP/CRPVCPPPRSL 
PTPA* TKPRTQLPSSMYCGRSS 


4786 


20687 


A 


6840 


365 


66 


LLERQ IGRS VYGPS SKSSI FDVPSLPHR 
GIKLVFCLCHTSSWDTGVHYAP PA*SWH 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

* 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

* 


Predict- 
ed end 
nude- " 
otlde 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
EXJlutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KpLysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T~Threonine, V«Valine, 
W«=Tryptophan, Y=^Tyrosine, 
X=XJn known, *=Stop codon, /^possible 
nucleotide deletion, Impossible 
nucleotide insertion 








• 






TWHPRPRQVGQ/ AGFELLTS CHPSPLT 
SQTARIAGVSHRARPKFM 


6787 


20688 


A 


6841 


445 


188 


GFQ*FSCFGLLGGWDTRHVPPRLADFCI 
F /M * RWG FAMSARLS SNS * SAPSASQSA 
G I TCMSHCARPATLNS PI CGTSLPLGS F 
FF 


6788 

* 


20689 


A 


6842 


3 


412 


PQQSPQLTTVTAPRPRALLMGTTMANLQ 
TRVSTSHLQGPRLTSALTYQMAQVAVAT 
PPWSVLAWVLSARVTVLP/LN/VNGI 
SWIRQLQKAAGRQPPSPHQTRNNSLE*. 
VASNLS KFLILE VAELFCHVLT FFS YQ 


6789 


20690 


A 


6843 


113 


410 


NHKCNVNFYNGPRSIiEKRLITLPILHYF 
LNFILCKSSyi*TLLNLKXQTNLFCK/YE 
M/ESHFVTQAGVQWHDLGSLHPKQSSYL 
ShhSS *DYRGMPPHIiAN 


6790 


20691 


A 


6844 


2 


357 


FCPLAR I LQSGRFAVL PWNSGLVRGRRQ 
EGPHGAFTREHT/SLRAPQPRSENERR/ 
R I ACALRAAHQASGWRLTPRAP RS PSGC 
HCPPAEPTSR* EAWGARRKGHGCCPAPH 
QPAPQDIS 


6791 


20692 


A 


6845 


392 


279 


HFFFPCRD/ KSLAMLTRLILNSWA* VIL 
LPQPSKVLGLQV 


6792 


20693 

* 


A 


6846 


3 


275 


VNLVDKAAAGFBRMDSNF\BTSSTVGKM 
LSSS I ARYRE I FYERKS PSMQQTLFLSH 
FRKLP* PF* PLATTTLI Y*QPSTSRQDP 
LPAKRAR 


6793 


20694 


A 


6847 


1 

* 


454 


FFFFETQFHSLPRIiECSGAISAHCNFCL 
LGSSSSPASASQWGITGMHHHAWI IF/ 
VLFLVETGFHHVGHAGFELLTTCGPPTL 
ASIMCILVYLLVFQHLVSLKNLKGQNAS 
FYFSPFGQHLSGFVKCSIWCCGSLSGLB 
GHFLPPHSPPLH 


6794 


20695 


A 


6848 


13 


148 


GS AWRHT PAG\ELLKKLRHRYGLNP EGG 
GCTEP I WCYRTPP WTTD 


6795 


20696 


A 


6850 


204 


30 


TI FSRQVLR I QNVLS E KLRVSTVYANNG 
S \ VLQGTLWASVYHGKILIGTVFHKTLY 
CSL 


6796 


20697 


A 


6851 


2 


429 


EVWASGITGKAIMAMPIVIQPKNPSSYP 
CRRQFPLQLEAKBGFQLLTEKFQKHGLL 
IPYN/TPILPVKKSNEKYRLVQDLRIIN 
EAWPIHPMIPNSYVILPQPPDAQWFPV 
LDLKDSPFCISP/VDPSSQFLFAFE 


6797 


20698 


A 


6852 


3 


160 


SGWDHRRHTQLIFMYSFLFFCRDR/SLP 
MLPKLVLNSWAQVILLLQPPKVLGL 


6798 


20699 


A 


6853 


2 


243 


ACMILAAATI ILRW\PLRYI ILI WGIN 
KFTKKLRNPYS IDNNELLDFLSRVPSDV 
QKVQYAEIiKLCSSHSPLRKKRSAL 


6799 


20700 


A 


6854 


24 


458 


SRAARRAAGTTRRSSHTGCRPRPADPGA 
RCLPRGSPHPRMDPPESP/PSETSPGPP 
PMGPPPPSSKAPRSPPVGSGPASGVEPT 
SFPVESEAVMEDVLRPLEQALEDCRGHT 
RKQVCDDI SRRLALLQEQWAGGKLS IPC 
KEENGS 


6800 


20701 


A 


6855 


3 


183 


CCDPPVSASQRAGITSVTY/Q/WPSLGS 
LQPPSPEFKRFSCLSLLSSWDHRHPPLR 
PACKVFC 


6801 


20702 


A 


6856 


2 


242 


GNHPYARKLA\AQRRASTVSSVTQVEVD 
ENAYRCGSGMQMAKDSKSLKTHQTQPGI 
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SEQID 
NO: of 
nentide 

mJ W L# ^1 ^1 V« 
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M, 
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od 
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beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
oentide ■ 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Clutamic Acid. F—PhenvLalanine. 
G B3 Glvcine« H^Histidine, I™Isoleucine. 
K=Lysine, L=Leucine, M=MethtonIne, 
N=Asparagine, P^Proline, 
Q=Glutamme, R=Arginine, S=Serine, 
Threonine, V=Valine> 
W^TrvDtonhan. Y^Tvrosine. 
X=Un known, *=Stop codon, / a possible 
nucleotide deletion, \=possible 
nucleotide insertion 














. RRARDKAL \ LVANGKRLWC I E6S VC 


6802 


20703 


A 


6857 


54 


401 


EGERLQKFMHVLKTHTHTH / AHTHERAH 
THTHTRTHTPRSGMHRSNNRNLAKNLFK 
I FFFLSS S PPS LFFLI IFPFI*SLSIjSSL 
SLSLYKHTPLCILSLRTEHLHRMGALLG 
GHILE 




i 207U4 

* 

1 


2V 


Oo5S 

■ 




j/y 


PSQSSPPEQPQSMEMRSVLRKAGSPRKA 
RRARLNPLVLLLDAALTGELEWQQAVK 

FMNrnP^nPJTF1?fi TT T £Y42lvWVQT V 
&ruNjjiro v triNiLE^vj j. i/iunn/uvvUnn ioi v 

DFLITAGANVNSPDSHGWTPLHCAASCN 

DTVT CMAL VQHGAA I FATTLSDGATAFE 

KCDPYREGYADCATYLADVKQSMGLMNS 

GAVYALWDYSAEFGDELSFREGESVTVL 

RRDGPEETDWWWAALHGQEGYVPRNY7G 

LFPRVKPQRSKV 


6804 


20705 


A 


6859 


36 


269 


IALGS MDVLT ILMFNEHSKNE I KRTI PL 
KIATR/ 1 KYLGINLT \ RGKKDLDTENYT 
TLIEEI EDDTNKWKD I PCSWTGV 


6805 

■ 

■ 


20706 


A 

- 


mo 


47 

■ 

- 


888 


TLRARALQARPRTGS S CTAATWTS /SGA 

i>y HbJjKALib WKKJj x J-»o KAAljXvAooKl fa A 

LLSGFAlWAl^^VQLESDHBYPPGriVA 

T?Q A/T*TVT .\7TV\7WT .1? AT .VR7QTPT .T 4 DHTP 2L 

VSNIHNLNSVHQSPHQRLHRYVEIjAWGF 

DTPTPMVPTSRVPGTLAPVATSIiSPASN 
LPRSSASAAPSQAEPACPPRQACGGGGA 
HGPGWQAAMASTAI MVPVGLVFVAFALH 
FYRSLVAHKTDRYKQELEELNRLQGELQ 
*AV 


6806 


20707 


A 


6861 


3 


391 


NKI S F FCRD WGL PMLP RLVLN3 WAQVI L 
/LISTCQPPKVLGLQA 


6807 


20708 


A 


6862 


322 


175 


GCRCVPPPRASFKI FGRV/RGLDI LPKL 
VSNSWPOAI LP P WP LE S LVLOA 


6808 

• 


20709 


A 


6863 


700 


294 

* 


YVLRQGLTLLPRL/ECSSAI IAHCSLKL 
LG LRNP PTS AS R VAGT I GACHHTRVI F I 
VFWI ESCLLAOAGPKLIjGSSDPPAIiAS 
QSAGIAGISHHAWPPYYFYWVLSSTIPS 
VLSLHSHLILATILRGGDVTSQTDK 


6R0Q 


90710 


A 




371 


77 


RSMFANXNLVYDTSDSDDYHLLKVLEEG 
IQTLMGRLE VGS RRTGQILKQTYS KFDT 
NSLNHVALLKNYGI^YCFRKDMDKVETF 
LRMVQCRSVEGSCGF 


6810 


20711 


A 


6865 


273 


34 


DYLPT YLKIjFRLLF F / CLI I ETESHYVA 
QAILELLSSSNS PTSASRSAGITGI SHH 
TQPNFSNNTLLKPYLRLSYALLFA 


6811 


20712 


A 


6866 


] 


190 


GAKGMHHYTWLVRYF / CCVEMRSHYVTW 
VGLE / PPALKHS SCLSLPKCRDYRHEPP 
HLALFCFSF 


6812 


20713 


A 


6867 


506 


293 


EKPSNGHKKPYTWIS IAILFX IAKTWKQ 
S\PRCPSAGEWINC/WYTQTMEYYSMLK 
RNELSNCEKT 


6813 


20714 


A 


6868 


2 


116 


LFPFCRDR/SLTMLPRLISNSRAQGILL 
PWPPKLIjGIiQT 


6814 


20715 


A 


6869 


10 


141 


GS AWLFF I FCR/DRGLALLPQLVSNPWL 
QAILLLWPPRVLQIiQA 


6815 


20716 


A 


6870 


2 


603 


APTFINFPAKGKPKRGDTYHLQVRGFSA 
EQIARWIADRTDVNIRVIRPPNYAGPLM 
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SEQID 
NO: of 
nucleotide 
sequence 

"* 

i 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 
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SEQID 
NO: in 
XJSSN. 
09/515,1 . 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanme, 
G=Glycine, H=Histidine, I=lso leucine, 
KpLysine, L=Leucine, M s Methionioe, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, Vs=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 




- 


• 


■ 


■ 




LGLLIiAVIGGLVT LRRSNME FLFNKTGW 
AFAALCF\VLAMTSGQMWDHIRG\PPYG 
PKDPHKGHVNYIHGSSQAQPVAETHIVL 
L FNGG VTLGMVLLCEAATS DMD IGKRKI 
MCVAGIGI^WLPFS WMLS I FRSKYHGYP 
YSFLMS 




20717 


A 


6871 


375 


i 

M 


GFPGFPVPKFG I PAPVAPQNPFPPPSF/ 
PQGPPPSPGGPNPNGKNPFFPKTPFFLK 
NSPQKKKIFLKKFKRGPP\GPNQKPPFK 
DFKKKRGLKPPKKGGPLKKKKKKKKRSN 
YSLKKIYSPPGFHFF 


6817 


20718 . 


A 


6872 


255 


23 


GRVD PTSTS QNAGI TGVSHR / GVQ WRDL 
GSL/QFLSPGFKRFSCLSLSSSRDYRLV 
/PFCI FSRDRILP FWPGWSQTPHLK 


6818 


20719 


A 


6873 


5 


157 


YLLFTSYTHAHTHTHTHTHTQFIjLLWTS 
GLS\SCCYIFSLSGPNTKCNMIF 


6819 


20720 


A 


6874 


351 


132 


S PRRCS AYGVA/ 1 YliFLYFLINLLSLYS 
I FYGFAPNS FLCEVQEPS LG \ SGWEPLS 
GNfcFSSIKMKIAHREVNM 


6820 


20721 

< 


A 


6875 


99 


337 


AGSFFQKKGKKKISRVNRQPIQLBKfFH" 
NCASDKGLVPRI YKELESAR\ KYQS I PS 
KVG 


6821 


20722 


A 


6876 


69 


306 


YVSHHLSFFF/ CFFGGGEKALFFCPRTG 
IKWPQFGLLEPSPSGLKQFFGLNPPETL 
EYRVFFPPPGKFFFFFFFFFFFFF 


6822 


20723 


A 


6877 


335 


169 


WHDLGL.liQPPPPG\SSDSCASASQVAGr 
TGMHHSFSNFIYLFCYFBKVFLTFTLD 


6823 


20724 


A 


6878 


2 


237 


ELGYKVLPHPP YS PDLS PTDYHFLKHLN 
NFL\FHNQQCAENAFQVFIESQSTDI»YA 
TGINKLI SHWQKCVDSNCSYLN 


6824 


20725 


A 


6879 


2 

mm 


199 


RGRVGRQFQKEIEELKKKLEEGEE/YAP ; 
SSSSSSSDPLISEPDISGSEEDDDEBGE 
VGEDGE KKKKK 


6825 


20726 


A 


6880 

Www 


i 


206 


FF FFE TBS C YVAQAGLELLGLGDPPAS A 
SQ WGTTGARHHAQLI F/VFVFLVEMGF 
HHVSQDGFDLLTS 


6826 


20727 

#*w ™ * 


A 


6881 


1 


123 


WY I HTME YYS ALKRRKFLS FATTWMNLE 
\EIMLSKISRRRG 


6827 


20728 


A 


6882 


15 


105 


SQLL / WRLRQKNRLNRGGGGCS FPLWCH 
CTL 


6828 


20729 


A 


6883 


223 


1 


AASTFLFPNLKNSLRG\SLRTFSSVTNV 
RKTALTWLNSQD IQFFFSRRSLTLVARL 
ECWRSWPAASRRLGCL 


6829 


20730 


A . 


6884 


193 


3 

• 


QGEQDFLPPCLPNFFNFNFFCRDIVSML 
PRLISNSWPQAVILPEPHKVLELQARAT 
TPSHVFIF 


6830 


20731 


A 


6885 


2 


355 


RLLTSSDLPASASQSAGVTGMSHRARPR . 
NF/CILPNLSPCNTHSPLSLPQPLAPTI 
LLSIiWICL/SLDXS/WDYRHNPPRLAKV 
FFFGDGVSQ \ YRPG WS A WP FAAS TSGV 
EAILPPQPPK 


6831 


20732 


A 


6886 


1 


151 


HSVAQAGVQWWDHGSLRPGTPGLKQ/FL 
P PLKQS S CLSLLS S WDHRCATL Y 


6832 


20733 


A 


6887 


314 


203 


FLFFCGDKVLLCP \RLVSNS WTQVILQP 
WPPKVLGLQE 


6833 


20734 


A 


6888 


334 


86 


QNRSTFKRGM/ MG WAWWLMFV I PATWBA 
E VAAKIAPLHYS LGDRARLS KGGMKCTS 
IFCSKSAKDOQTLPVKKKISLPTLPSI 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQIO 
NO: of 
peptide 
sequence 

— 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

tng to first 

amino acid 

residue of 

peptide 

sequence 

• 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
G=Cysteine, P=Aspartic Acid, 
E=Glu tannic Acid, ^Phenylalanine, 
G=Glycine, H=Histtdine, I=Iso leucine, 
K=Lysine, L=Leucine, M-Metbionine, 
N=Asparagine t P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V«Valine, 
W^Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


6834 


20735 


A 


6889 


345 


152 


QHARIMSVFIVETGFHHIGQSGUELMT/ 
S/GNMHASASQSARITGVSPHARPSLPY 
YKCSKKQLCNE 


6835 


20736 


A 


6890 


2 


169 


ARECSGAITAHCSLDLMGSSDTFTSASQ 
VAGTVGVTIM PRY I FK\ FFVEMESHC IA 


6836 


20737 


A 


6891 


361 


184 


NQSSCFSLLSSWEHRLMPPHLANF/SFF 
DRDE/SLTVLSRLVSNSWTQAILPPRIF 
FENS 


6837 

* 


20738 


A. 


6892 

i 


3 


330 


HESAAITGVSHCTQTFFFF/ CETGPRFV 
PQVKGQGHDLS SGE FGP PRLRNWPGLTL 
QNAGNTGTL P PHRTNFGD FRRG S FS P CG 
PGGFGTRDLGGFPHLTGKRDTGLDPP 


6S38 


20739 

< 


A 


6893 


2 


348 


ARAFFFWNFFCX/NRGLSMLPRLALNSG 
AQGILPPWPPKKLGLRALPTWPKTGSSG 
LEGHSSLYPKHSLIFPHHCSKFCFIKQW 
RLFILRITENQGLTPLWAAKPAEPVMNE 
ARTQA 


6839 


20740 


A 


6894 


340 


240 


CRD/RSLRMLLTLVLNSWXQAILLSWPP 
KVLGLQA 


6840 


20741 


A 


6895 


155 


1 


HLCF WAG\ PVAHS CHPGTLGGRGGRI TR 
GQE FE TS LLG PGTQ DVLAKCS RA 


6841 


20742 


A 


6896 


454 


294 


PSSWDYRHA/PPMHNWPDFFCRDE/SL 
PMLPRLVWNFWAQLUjLPWTPKALQLQA 


6842 


20743 


A 


6897 


49 


333 

• 


I Y I CLS F YQS 1 1 YI* YDKQHDVLETVSHS 
V5LDLPGLRFSHLSLWDHRLILPWLASS 
NNFF\CRNRGLLVLPRLVSNSWVQAILP 
PWLPIVLGIiPA 


6843 


20744 


A 


6898 


434 


324 


FFFFCRD/RSFAVLPRLVLHSWTQAIRP 
PWPLKVLGDQA 


6844 


20745 


A 


6899 


127 

* 


365 


LPERRRYVS I SGYR YHLPQS LLFFYAQH 
ATFNNELW\LGVIAHACNPSTLGGRDGR 
MALGQEFEIGLDS I VRLHLCNKIK 


6845 


20746 


A 


6900 


208 


1 


FFFFFWILVETRFHHVA/RAGIjELLSAG 
NPPTSPSQSARITGIWFFVWFSSNLSQF 
CYAHSFFTCWYSRA ! 


6846 


20747 


A 


6901 


1 


162 


gtsgtrlftigkrwqqpkcppmaelink 

MW\YTMEHYSVLKRKEIFCTNLTQLL 


6847 

i 


20748 


A 


6902 


120 


313 


dkvl/wrlrqenplnpggggcseprshh 
ctpawatraklylkkkknfkgtlsnwys 
frrvantnlqmqsgsnymq I 


6848 


20749 


A 


6903 


340 


234 


iffcrer\silprlvsttwpqaillprp 
pkvlglqv 


6849 


20750 


A 


6904 


1 


327 


gtsgtsrilpwgpgcirtpblkqsahls 
lpkcwdcwnyrheppvysfamliiklff 
elhlsvilwflllanpqtqtesktfskt 
/prlvlnswpqavlpwslkalasqa 


6850 


20751 


A 


6905 


532 


359 


rdg/dlvwwprmvsmpgfkqsshlghpk 
cwdyrceplcqdfsysiitwfppsrel 

RR 


6851 


20752 


A 


6906 


270 


3 


ggttvyprktkt/wpgavahacnpnilg 
grggwitrsglpvstkntkisrahccaa 
wlatpeaeqenrlkpgdgdrdhhgqhg 

ETPSLSC 


6852 


20753 


A 


6907 


1 


312 


gtswkkmlptiimddfrg\ktlveei.ipd 
weiarevele\mehedvtellqphdkt 
wtdebllltdeqrwflevgsvpvedav 
nivemktkdleyyislliky 


6853 


20754 


A 


6908 


331 


61 


FRHVGQAG/ LQKLLTSSDIiPAPASQSAG 
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SEQID 
NO: of 
nucleotide 
sequence 

'i 

• 


SEQID 
NO: of 
peptide 
sequence 


M. 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict, 
edend 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A a Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Prolme, 
Q=Glutamine, R=Arginine, S=Serine, 
Threonine, V=Valine, 
W=Tryptophan, Y»Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


• 


- • 








■ 


ITGMSHHIRPKLILRKKSMRAKLDSRQT 
RLQPP WEQGLLBEQHBVYNVYMAHVS K 
SSSRKLS 


6854 


20755 


A 


6909 


419 


199 


SAVQDGV\SWCDLGSL\QPPLSRIiKQFP 
CLSFPSTWDYKCAPPHPATFCI FIRDGV 
SPCWSDWSQTPGLK 


6855 


,20756 


A 


6910 


341 


69 


EAEVGGLLKARSLRPAWAIQQDP IS ILK 
KS FTLGGQGG WI TRS GDRDLRASASQSA 
GTTGVSHRAQ\L I FAFLVERRFHHAQQQ 
GLDLLTL 


6856 

■ 


20757 


A 


6911 


174 


356 


KTYLMPLRSSLRPGMVAHACNPSTLGGQ 
GGWITRSG/ESRPSPVLVNRLTRCQANT 
VKPCLY 


6857 


2075^ 


A 


6912 


391 


3 


SPPPPPFPPSFPPFPPQNFFFPPGASSF 
FGPPPP/YFPPPPKKSPPPNPPPPFFFP 
PPLGKQFFSPPPPFFFPPPGFPFPPPPF 
FFFFFFFFFFFFFFFFFFFFCDKKYFIC 
LMICMLFFCKYILNFCRYS 


6858 


20759 


A 


6913 


43 


331 


IRDTYS FS LS APALAPMLGTKGVIJIJ£VA~~ 
SLGSRGASLCVFVCVCLCVRIH\ IGVQA 
SGCVCVCAC/CVCVCVRVCVCVCPHWPA 
SLLTKDLQdiPFTV 


6859 


20760 


A 


6914 


250 


11 


TPNLWGL FIjVFC FEM / EVHY I AQAGLE / 
PPGLKPSSSLSLLSSWDYRHVMPHAAQI 
CISVFSLAQKGVPPSNWILITHYSIKS 


6860 


20761 


A 


6915 


532 


53 


SQMALHEGFLLLFLKEKELHGKFSHPPP 
S APNS S MS KWFYFS ETESHS VAQAGVQ 
WCNLSSLQPPPPGFKQFSRLSLTSSWDY 
TYLPPHLANFLFLVEMGFLHVGQAGLKL 
PTSGDLPAItGSQSVG I TGVSHHTR\ PRF 
LFKIGPLFI.FSKSMSQDEARI 


6861 


20762 


A 


6916 


2 


373 


WHTP WLATHEAEAGGS LEPRRAKLQLA 
MIVA/NCTPAWVT 


6862 


20763 


A 


6917 


1 


270 


GTRQS PRhShLSSS CDHRRV5 PHPADF / 
SFFYFYFFCR/DROFTMLPRQVLIWAQV 
MliLSQPPKLLGIiQVVGVSHDSARP PHCUG 
YSWVYNKP 


6863 


20764 


A 


6918 


2 


367 


RCAPHCPDNSSFFK/RVKTGLTI FPRQT 
SNSWPQVILLPWPP/KVLGLQA 


6864 

• 


20765 


A 


6919 


3 


354 


HEVNIVSTTAKDLEYSINLVDKTGFEKI 
DSSFKRRSTVSKMPSDSITCYREIFCBR 
KSQSMLQTSIiLSYFKKLPQPPQPSA/TT 
TV I S QQ P STLRQGP P P AKRLRCAKGLND 
FQHFWL j 


6865 


20766 


A 

* 


6920 


366 


41 


STS GEDAVH I VEMTTKGLE P / YTAVPGF 
HPSSERRSWGNMLSNSVTCYRGIFGER 
KSQC/RQTS LLPYFN/ KPQPPQPSATIT 
1*1 \SQQPSTSRRDPLPPKRTjQLTESSND 
H 


6866 


20767 


A 


6921 


3 


368 


QLLMGLHLE PATLDLLQLKT\ T IQGGSH 
SHHITAIP\PS\FTRENTIiMFIHLSPIL 
LLSLNPDIITGFFLLKKKNKPKNHKNRG 
PS YKRTPEGPLIiLIiVYAQKTLP PKGGIiY 
KIAPLRLFGEDS 


6867 


20768 


A 


6922 


3 


91 


FSVLPRLV\QNSWPQVICLPWHPKVLGL 
QV 


6868 


20769 


A 


6923 


334 


°l 


ERCX3KCAHISBIFI*SSLDYIYIFVFY\F 
FFLFFF FFFFFFFFFFFFFFF FFFF FFS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E^GIutamtc Acid, ^Phenylalanine, 
G-Glycine, H^Histidine, I=Iso leucine, 
K^Lyslne, L^Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V^Valine, 
W»Tryptophan, Y=»Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














QFKFNTKS KEKMFNKTMKKVQVTF 


6869 


20770 


A 


6924 


375 


37 


TKAGVQWHDHGSLQPQPPRIiKPSSCPSL 
PSS WD YRHAPP /RP /ALFLFL FGLVWFG 
L VFLVE IGSCY I AQAGLKLLGSSDP PTL 
VSQSARI IGPTHHAWPTLTTS IQCT IRG 
PG 


6870 

■ 


20771 

ifc V tlx 


A 


6925 


647 


1054 


CFWKAHIQAGKTNNIQEGSERFHFCPAD 
EVYVHSGPAVEDQPQRRSFAL/SAQAGV. 
QWCNLSLLQPLPPRPKQFSCL\SLPSSW 
D\HRHVPACLAKFC I FSRDEVCHVGQAR 
LBLLTSGDPPASASQSAGITGMNHHV 


687! 


20772 


A 


6926 


3 


204 


DAWETTGFGMIYDSLDYAKKNEPKHR\L 
ARHGL YE KKKTS RKQRKERXNRM KKVRG 
TAKANVGAGKKK . 


6872 


20773 


A 


6927 


1 


112 


PTRPRTRGVASVLYFTTILIIjIPT\ISL 
IENKILKWA 


6873 

- 


20774. 

• 


A 


6928 

- 

■ 


1811 

■ 


478 . 

■ 

■ 


DRARSPRSERA\RRSSSRLRQRDPSLGR 
RRLRREIRPGLPESEPRPPRPPAALTAD 
Q PP PRRLS ESRGGGGMS E \ AGE \ATTTT 
TTTLPQAPTEAAAAAPQDPAPKS PVGSG 
APQAAAPAPAAHVAGNPGGDAAPAATGT 
AAAAS LATAAGS \ EDAEKKVLATKVIjGT 

I KKNNP R KYLRS VG DG ETVE FD WEGE K - 

RGYYGRRRGP PRNYAGEE E EEGSGS S EG 
FDPPATDRQFSGARNQLRRPQYRPQYRQ 
RRFPPYHVGOTFDRRSRVLPHPNRIQAG 

A» & •> AA T W)( A a> Vavavwi* v Aril At AAriK A * JWriA \^ilw 

EIGEMKDGVPEGAQLQGPVHRNPTYRPR 
YRSRGPPRPRPAPAVGEAEDKENQQATS 
GPNQPSVRRGYRRPYNYRRRPRPPNAPS 
QDGKEAKAGEAPTENPAPPTQQSSAE 


6874 


20775 


A 


6929 


324 


203 


LIFVFLVETGFHHVGQDGLDLL/NLVIR 
PPRPPKVLKLENP 


6875 


20776 


A 


6930 


3 


364 


HEASLTQTRTTMTHCSRTTGCSTASGRT 
WTRSRHS CAWCS AALWRAAVASRCPSS / 
IPVTPPQCLSWP/WKVPLQCPPALS 


6876 


20777 


A 


6931 


2 


165 


GRVGFCS VAQAGLQ WHDLS SLKPLPP\G 
SSDCPASASRIVGITSLAFHFLNNVF 


6877 

■ 


20778 


A 

■ 


6932 

* 


1 

■ 


370 


TE LFJ3 YS PSNGAI S SQCNLCFLGSGYS / 
RSSTSQVAGITGA/R/HHTOLI fvflve 
TE FHHV\ AKAGLELLTSGDPPASAS QSV 
GIAGVSHCAWPHVYILKEIMIK3NLVRCL 
MSICHHHTESICLFL 


6878 

- 


20779 


A 


6933 

• 


i 


441 


RRANT PNNPEAP PN / QKKKKKKKKKKKG 
KNPPGGPPPFFKTPPPFKNGGGSPPGKK 
KGGEKKRGGKIFFPPPGGKKPPPQKKKG 
GGGGKKI FFFLFGPGGGGNFFKKKKGPP 
PFPPRKKKKIFPTGGGPKKKKKKKAPPK 
KPRGFYFP 


6879 


20780 


A 


6934 


S18 


340 


PKPKNFPS / PSPPKI PPPKKKVFSKKPP 
GGFNNPPHKRKKYNFPPPEKLGPPKEFL 
KRPPL 


6880 


20781 


A 


6935 


396 


299 


WEVC/GVCVCVCVCVCVCIHSLCKSTL 
HRTGV 


6881 


20782 


A 


6936 


486 


272 


PTRAPTRPAPPHLADFLIFCSDR/SLIM 
LPRLVLNSQPQMILPLWHPKWGLQA 


6882 


20783 


A 


6937 


1 


264 


YSNLRDRARSCLSLPSSWDYTCSWDYKC 
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NO: of 
nucleotide 
sequence 
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S£QU> 
NO: of 
peptide 
sequence 


M 
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od 
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SEQID 
NO: in 
USSN 
09/51S,l 
26 

• 


Predicted 
beginning • 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid . 
residue of 
peptide 
sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid t 
E=Glutaraic Acid, F=Phenylalanme, 
G-=Glycine, HNHistidine, l^Iso leucine, 
K^Lysine, L= Leucine, M=Methionine, 
N=Asparagine> P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














. ASLRLANF/SNFCRERGLAMLLRIiVSNS 
WAQVILLPQPSKVLRLCIAVSVPFGEKQ 
GQCS 


6883 


20784 


A 


6938 


2 

mm 


407 


TTGTCRSVWLTFVFLWTaFHHVGQAGL 
ELL/RPPQPPKVLGLQA 


6884 


20785 

I 


A 


6939 


422 

~mmmm 


65 


LKGPPPSFFFFFFFFFFFFFFFFSP\LP 
YILYFDCASNACLFFKNNTWRYIKRIFF 
SPTPFIPFFHEVENVTSLSYMVPQYCFL 
FF 


6885 


20786 

* 


A . 


6940 


462 


209 


LSBFWDYRREPPLPAKFAIFMLCRDRGL 
DTLPSLVSS/SWHQAILAPQPPIMLGLQ 
VHSAYKGFCLPVLTFEEVIGNRPLGLLP 
P 


6886 


20787 


A 


6941 


491 


299 


CAEYS PESGPTHASAHAS / SNVQMVYSR 
IS CKEELLLGRTS PSKNYNMMTVSG 


6887 


20788 


A 


6942 


429 


256 


FSHIiSLPSSWDYKHLSSCPANFCIFVQT 
GFHHVGQA\GDPPASASQSVGVTGMSHH 

s 


6888 


20789 


A 


6943 


932 


661 


FKTGSYSV\TRLEGSGEISAHCNLRFLS 
SSDFFHLSSQVARITGSSQYARLIFVFF 
VEMGFAMI*PRVGLKFLGFKPLHPP\RTP 
RVLGLQE 


6889 


20790 


A 


6944 


381 

JO 1 


55 


PASLPPCSLI SDCCASNQRDSVGVGPSE 
PGVGYSL WRRFLS RSEKRNI RVGVTRF 
SRCV/LS PLSLTQKGNSLTPCASQVRQC 
LALLRIiAHGACTHWPAPTVWHSLVR 


6890 


20791 


A 


6945 


816 


513 


ff i f i yffeme s llprlecsrmi sahck 
vclprsssy / pasasqvakitga/trhs 
rlifcvcvlverrfhhlsqaglell/nl 

VIHPSRPPKVLRIjQGVTRTA 


6891 


20792 


A 


6946 


383 


234 


LI PS LMDNSEGLKTS VEE VTAD WKI VR 
ELELKVKPE/NVTDLLQS \ YDKT 


6892 


20793 


A 


6947 


423 


(0 


LNPPPPPF\YKPPPKKKKFFSPPPLKFG 
PPTKFFKRPPPFFFFFFFFFFFFWHGTF 
PLNSLVGPPKVEGWVSWAQLPRAVGPPL 
PNPKWGLNKPPAGGLNKPIHFPSLNGSG 
EATPPLIPWC 


6893 


20794 

■ 


A 

■ 


6948 

• 


1 


327 


MRLIAAALLLLLLALYTARVDGSKCKC\ 
SRKEPKIRYSDVKKL\EMKPK\YPHCBE 
KMVI ITTKSVSRYRGQEHCLHPKLQSTK 
RFIKWYNAWNEKRRVYEEAQGLRRRIG 


6894 


20795 


A 


6949 


952 


796 


FFLHFIiVEMGN FLLVGQAGLELPTSGDP 
PALASQSAGITGMSHRARPSVSS 


6895 


20796 


A . 


6950 


364 


6 


ITTHKNYFLSRTATPPKEGLPQRKKIFL 
F FTP I F I FFVGF I FWSGGGRGFLVFFFV 
FFFFFFFFFFFFFFFFFFFFFFFF\RVK 
TFIGFKVLFFKINVFFFLIRAVPLDYMN 
HALTIFQW 


6896 


20797 


A 


6951 


418 


266 


ISFFFXXXXXFFFXXXFFLLFFFPFFFF 
FFFFFFFFFFFFFFFFFFFFFFN 


6897 


20798 


A 


6952 


459 


7 


PSYLYKPGPSQPNRRRHSVP/HIRSNKD 
PPPPEVMKKWGPPTPQKKKYFPPRKKKL 
GGGGGPIYPPKKKVFFKKKPGGFKKPPK 
KKKKIFFSPPEKRGPPPSILKNPPHPFI 
FFFFFFFFFFFFFFFFFFFFFFLIGEKA 
MLLDMGA 


6898 


20799 


A 


6953 


325 


143 


LPSSWDYRRVPPHSAHFKFFCRN/RGLA 
MLPRLAKPRFPS IS I FIiNHPQ VS E ILGA 
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Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutnmic Acid. F^=Phenv (alanine. 
G^^GIvcine. H— Histidine, I=lsoleucine. 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V^Valine, 

mm A mm m m0 mm ▼ ■ ■ ■ • * V J 

W=Tryptophan, Y=Tyrosine, 
X^Un known. *« a StOD codon./^oossible 
nucleotide deletion, V=possible 
nucleotide insertion 














TRLAR 


6899 


20800 


A 


6954 

• 


585 


306 


KTESHSAIRSQCS \GBISAHCNLRLPGS 
SNS P AS ASR VAG I KGTHHQVQLI FVFLV 
EMGFHRAGQDGIiRSPALVT CP PWPPKVL 
GLEAWPALHC 


oyuu 


ZUOv 1 








i j j 


FliVETGFHHAGOAGLELLTSGDPPTIiAK 
S\DSQDVRITGLSHRAWPFLDIY 


oyvi 


9AOA9 










FT/HVGQAGLELLGLS DVPTS SSLDAGI 
TGMSHCSRPRPEISSRDVHTIPGALQET 
WHTPDLASS IRSLRCRLDCPHNHCPLDK 


6902 


20803 


A 


6957 


1 


224 


QCGR I RSRKDARKGNVQ I IPREGRRRTS 
PRYLFPVTGQETDGWG\MGLPAWLLKTR 
S TS P PRS PRTiHRPRGATS W 


6903 


20804 


A 


6958 


102 


344 


GLIiKPKSLKLQCTX ITPVN \NHCTAAWA 
T 


* 


ZU5UD 


7V 
A 




: ■» 
j 




EEEEEEEEEEHAEFTGHVESITQCKLLL 
CHLS LRVEAG I THLLCFTVRAVTRSQRL 
LTKRRNGSVGHTSE , 




OAftftA 
ZUoUO 


A 




1 
j 


IT) 


PAIiVSQSDGIKDVGHDTEPQDFYTLGQD 
P 


6906 


20807 


A 


6961 


2 


224 


LALLPRLARSGTI IAHCSL\NRSLRLGL 

QnPPIlQAC'pCTTlTTV^QHrC'npiVVT.T.QT 

CIRNSVAETKYYLMFKCK 


6907 


20808 


A 


6962 


2 


330 


KGTLRRQVTY KGTL IR \ LVADTLAETFH 
ARKEVTODIFKVLKKHCQPRILYLAKPSF 
KNEGESFSQRKCIlTlfVARSKMAKGVLH 

T.P'VimT ^ TT T KHMK TP 9<3T.VRfl TWTTW 

UP V AX/XO X X, X £\.%/l T JX\.XIvOOXI V Ov2XJC3XvVW • 


6908 


20809 


A 


6963 


1242 


929 


ETGSCSWQAWQWOGHGYLKPQPSHAH 
/DDPPTASS I AG'ITEAHYHAQLI FKKFF 
NSLCCPGWSQVSWRXLGSSDPPASASQN 
AG ITGMS HGAWPQIiS IYRKGN 




one in 


n 

A 




1 


j to 


jxc xr xjvXjxt xjtio r x x Eixiv xj r r 0 r ur r x* r r Xj 
RPS FAFVAQAGVQWRDliGSPQPLPPRFK 
QFSCLSlxIiSRWDYRHAPPQSANF\EFLV 
ETGFLHVGCAGLELPTSGDPPTSASOSA 
GXTGVSHRTRLAFY 


6910 


20811 


A 


6965 


393 


43 


VS PFPLKNF YFS FTPKTFVGGGGPSGPP 
PQKRFFFHTPNSPFFPFPPF/YKSGPRP 
GFFFTPPPEKGKl^P\PPIiRLGPPPVFF 
TRPPPPFFFFFFFERQRTCSFLHFLWR 
PLFRFYL 


6911 


20812 


A 




225 


2 


TRVE CHG VFFGS CNPIJjPGLKEFSAPAP 
RGSGNSGPFPPPR /RNSFFFFFS WLLE 
MGFHHGQADIiEILTSSDPP 


6912 


20813 


A 


6967 


g 


341 


FVS VPP I LHYLFKI FFFFLRRSIiCLPPR 
LECRGTISAHGSLGLPGSSDSPS/ STKN 
TKLPLRGGTCIiGPQLI^LRHEirrWTLG 
GGS GS YP KSG KPAMALQGG VQNKTCVPH 


6913 


20814 


A 


6968 


29 


362 


Dx^CKHPHSSELKKKKKKKKKKKKKKKK 
KKKKGGPPKKKP/HGGPQFPPGRQKKIF 
PFKGGFKKPPRGFFEKNPFFGGGNLGPP 
PPPKNKPPGEKKNFLGGRGQKNSFLSAW 


4914 


20815 


A 




391 


17 


VFFFFFLKKFFPP PDKKLFF P P FPLKX F 
F/SPPNPFFFLGGFSQISPPPKKGFFPK 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
OGiycine, H=Htstidine, I»Isolencine, 
K=Lysine, L=Leucine, M=Methlonine, 
N=Asparagine, P=ProlIne, 
Q=Glutamine, R=Arginine, S=Serine, - 
T=Threonine, V=Valine, 
W=Tryptophan, Y«*Tyrosine, 
X=Un known, *=Stop codon, /"possible 
nucleotide deletion, \=possibte 
nucleotide insertion 














. IPPGV/YFSPPFKKKNFFFPPPLNFGPP 
RVFFKGPPPFFFFFFFFFFFFFFFFLGL 
RGNAGDCNGYGDVSYK 


6915 


20816 


A 


6970 


374 


268 


j IMLPRLVIiNSWPQVILLPWP\PKVFRLQ 
VRAPVPG 


6916 


20817 


A 


6971 


614 


315 


FFFGDALS PGLB CGGAVLAKLQA/ CT JiG 
S HR S PASAS R VAG TCKRP P PRPSG FFVF 
LVEAGLHRVGQDGLNLL/NLVIRPSRPP 
IVLGLQGMSHHARLFYFL 


6917 


20818 

• 

i 


A . 

■ 


6972 

■ 


861 


528 


FBTESRSVTQAWQWCNLSSLQP/LPPG 
FKRFFCLSLPSS WDYRRTPPCPANFCI F 
SRDGVLPWWPGWSRTSDLVIRLPRPPQC 
WDYKRBPLRLAKSRHPWNHLPBQ3VSHF 
K 


6918 


20819 


A ' 


6973 


274 


119 


P PN \ FVTLVETGFLQGGVKLSTPGDPPP 
PASRRAGITGVSHPACPGKBYIFKN 


6919 


20820 


.A 


6974 


370 


194 


1 KITNFAXLFGYILPYSHYFWCLEKNRNR 
/ SLTLL PRLVSNS WAQAFLLLW P PKVLR 
LQA 


6920 


20821 


A 


6975 


2 


239 


ARGPCSSI/SDCCASGEQGSVGVGPAEP 
G/MGYNLLACCLLRPLEKCS I KAGVSRF 
SWYSLSRLPLT\RKGNPPTPCISQVR 


6921 


20822 


A 


6976 


361 


270 


PQWLVPVIPA/LLETEVGGSLEPRSSRP 
A WAR 


6922 


20823 


A 


6977 


363 


48 


GPPKEMCPYPN\PRPYECNLFGKRVFAD 
VI KFKI LR W\ I I LDYTQVDPKSNDECP Y 
KKQKRRGHRQIKRKSQADQGGRDWSGAT 
ARNAKMITSVGEDVRKRKPLAYC 


6923 


20824 


A 


6978 


9 


163 


AG YN FL VC C \ LP RLL EKCS IRMGVSRFS 
RYHLS WLPFARKGNS PTSCTSWVR 


6924 


20825 


A 


6979 


408 


212 


IETGFCHVAQDG/ LELLASDDP PASAS Q 
SAEITDVSHRAQPHSAFLPWFSSKIFAY 
GSVWLVFPHS 


6925 


20826 


A 


6980 


395 


254 


LIFVFLVETRFHRVGQAGLELLTSSDLP 
SLASG\ ITGVSHHSRPSSAF 


6926 

* 


20827 


A 


6981 


408 


2 


KVSFFFPKGPRGKKKNGAGGFSPFFPPF 
LGGGTKNFFYPPIFSQGGGKK/QKPPPP 
E KKKNP PNPAFWG P P LVLKNWGGRPGGP 
LDPKVSNFMGAPPFLFGAKKKTFFSPPP 
PKKKKPQKKKGGGRSRSRTSRTRG 


6927 


20828 


A 


6982 


2 

- 


302^ 


RKHLP PH \ PVI FVFLEQATLRHVGQAGL 
ELLTSSDHRAS ASQS AGITGVS HCSLPA 
TSFSTPPFSVAKMLRGTQKMKVPLIGPL 
PQPIRLVAGHYFIYIT 


6928 

■ 


20829 


A 


6983 


2 

- 


375 


RGRVGGPVGGPVGRTFI FFQAGSDSARI 
LVLFPYCTLPHPLPGLFFFFFKTGI KSW 
VLLNPSPRFVIFLEKKIFLPQPPFF/HI . 
LSPPHPGGAGGGNPTARKIKGLSYRDPK 
EGHKRG WAHTPT P 


6929 


20830 


A 


6984 


1 


146 


PRPANFC/ DFLVETRFRHAGQAGLKLST 
SSDLPTLVSQSAGTTGMSHRA 


6930 


20831 


A 


6985 


2 


95 


LTML P RLE CN DT I P AHCS LNS / SGS SHS 
LTSTSKVAGTTGVHHHFWLMF/PLFI IQ 


6931 


20832 


A 


6986 


340 


169 


VCS SGLS S PLLEQH KTNLI FYASGD I CT 
/ANGKSGFNQPQPFIiKTFCCTHRIMSCT 
YL 


6932 


20833 


A 


6987 


346 


218 


P CLTNF/ S I FCRDB / S LTML/ PRLVLNS i 
WAQAI LLPRP PRLLRLQA 
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SEQD) 
NO: of 
nucleotide 
sequence 

- 


S£QQ> 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQW 
NO: in 

lie CM 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
iv=ijriutamic Acio, r— Jrnenyiaianine, 
G=Glycine, H=Histidine, Msoteucine, 
K=Lysine, L=Leucine, M=Methtonine, 
w s Asparagine, j^=JrTOiine, 
Q=Glutamine, R=Arginine, S=Serine; 
T=Threonine» V=Valine, 
w a i ryptopnan, jr^iyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, v = possiDie 
nucleotide insertion 


6933 


20834 


A 


6988 


345 


141 


NRP KCP SMIEW I KKMWH IDTME YYAAI K 
KJJJVC WoJ? \CilX il_AjE*V XVanlvl KHKJjJ-io 
LIRGS 


6934 


20835 


A 


6989 


154 


236 


ELISFSRSVERCISSFIKVItFVETDLFT 
LSLSPFFSVLFFFC/RRWGFTMLPTLVL 

XTCUOn^/TT T OUT TiV~\TT DT nip 


6935 


20836 


A 


6990 


2 


156 


FLVE / TGFVHVGQVGLELLTSGDLPASA 
SQSVGI IGVSHRVRPCFLFVCICLV 


6936 


20837 


A . 


6991 


310 


96 


KNFFFFL VKKp P/ LNVAQGGLKLLGSND 
PPAS VSQ5GG I TGVS PRARPLNFYKEAT 
DPNTSVLKGLNIKKKKN 


6937 


20838 


A 


6992 


. 15 


279 


NLLCFPS ISPLIGI FFFFERGFHHVGQA 
GLKLLTS GD/LRAS PSQRARI TAMTRHP 
HPLVCGLLNSTHCDWRGRIPHWGFDLRF 
REGPS 


6938 


20839 


A 


6993 


127 


352 


KLADTPQSRFVLVCV/LIETKYSIiCCPG 
GLQLLAS SNP PTS VS QSAT I TDVNHHAE 
SQ VFLNL WPKS S KPTNTGL 


6939. 


20840 


A 


6994 


323 


119 


SSSEDHKHVPPHSAMFCFF/ CLRQSLTM 
LPRL VLNS WAQ VI LP PSPPKPFLDALKK 

Trr> trt^%A n rvr t ^ m 

KSSKWMSSLLIT „ 


6940 


20841 


A 


6995 


230 


1 


FFFETESCSVTQAGVQWCDIiGSLQPLPP 
GFKQFS CLS / LPNS WD YRHALPCPANFC 
IFSRDGVSPGWSGWSRIPDLG 


6941 


20842 


A 


6996 


346 


121 


RD I CT P MFVTALL I AKI RS PHKCPS VDT 

WIRKMWF\IYTMEYYKAIIKNKI 

HASLPVFNHMCKLKYLIS 


6942 


20843 


A 


6997 


2 


198 


SNNT WAFREPG I TGMCHHAQLiIVLLYM 
QFHHVDQDGLDVk/NLVICPPWPPKKI/j 
IQALNQPSPDI 


6943 


20844 


A 


6998 


183 


31 


MRHLKS S Q IQQGTVGHACNPSTLGDHGE 
RITQGQEFKT \gnt vrpcl ykd 


6944 


20845 


A 


6999 


3 


471 


LALSCSGTILAHCNIjHLLGPSLPPTSAP 
raagttgvhhhavjl i fvffgkdgvaqpg 
LKLSFK/VIRPPQLPKVLGLQA 


/A A C 

6945 


20846 


A 


7000 


483 


239 


GNNNXLir Ijc Lir c XETV»bnr V 1 \RLiECQW 
VQSSAHSQPSAPRAQVLIFVFLLQTGFH 


6946 


20847 


A 


7001 


132 


353 


YAKLGTRG FARGP WPGG FLTVAVWM S F 
AQAGINP FVCI FSNRELRRCFSTTL/ PL 
LQKIQVTKGTLLCYMREHL 


6947 


20848 




7002 


2 


399 


EDAQE VELQEGKVHSESDKAI TPHSQBE 
FQKQERESAE\SELTES 


6948 


20849 


A 


7003 


2 


135 


TIVSDT t Try TTtTTTTT Tf* /*\ 7\ f^t CT T /\TT UTITTj 

X PQgXiVcWGf HHvngflUT,Rr .r.^ mt.mtmp 
PRPSKVLGLQAYWHSTRP 


6949 


20850 


A 


7004 


2 


114 


IFQFLVEMGFHHVGQAGLEIiL/NLMIHP 
PRPSKVXjGiJQA 


6950 


20851 


A 


7005 


455 


1 


ALFPAWIjWLPS S S KGGLKLS HGTTLTLF 

PRKADRliAPGVQNRPGQRGETPSRKKGG 
EGNWGGGGELHIYTYTHTHTYVHTYIHA 
NWTWWCVPVVLAFQETEVGGPLEPEVAP 
AWVTVRPCLKKK 


6951 


20852 




7006 


371 


172 


QVILCXSLPGSWDHRHTPPHPA/NFLVE 
TRFQHVDLELLASNDPPASASQSVGITG 
VSHCARPCYDL 


6952 


20853 


A 


7007 


486 


252 


PQTVFFFFFFFFFFFQALFSTFYILHVF 
FFFFPPFFPKFQFFPFP/LSLIKQGKFF 
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'i 
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M 
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od 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
ji^vj iu uimic Acta, r^rneDyiaianine, 
vj^vjijcinc) n— niSLiaine, i^isoieucincf 
K-Lysine, L=Leucine, M=Methlonlne, 

Asparagine, i^tTOiine, 
Q=GIutamine, R=Arginine, S^Serine, 
^Threonine, V=VaIine, 
ty — a rypiopnan 1 t^i yrosiacy 
X«Un known, *=Stop codon, /"possible 
n ucleotide deletion, \-possi ble 
nucleotide insertion 














. crcc LA_J\o Jur J. V w xy I rr Vela 


6953 


20854 


A 


7008 


1 


113 


KNFCR/DRSLDTLPRLISNSWPQVTLLP 

UrrJVVlAJijyA 


6954 


20855 


A 


7009 


105 


383 


SLHRPAHPGLLKFGKKAIHS F IHLCRDK 

r«T^T.7i TT DDTArCMCUDA^rTT.T DUCT /WITT 

oJJ LiAl Xi r KJj V £>JN o n f U V 1 lilac* Wo 1 / W Vli 
RLQA 


6955 


.20856 


A 


7010 


609 


348 


ESrioVTQAl^/0WYDLboL0tLt>t>dfi\St> 
SPASASHVAGI TGMRHHTQL\ I FVLLVE 
lRSHHVbyJ. KJrV.br DI 1 Rr VVIr xoUKK 
TRJRL 


6956 


20857 

i 


A . 


7011 


382 


233 


DLE I FRGKTFLHHI FPFFFFFFC/RRRG 
LiARLP KLVXiNS W PQ VTLP KCWD 


6957 


20858 


A 


7012 


359 


199 


DLLPSLPPFLPSFIAFFLFETBSHSSPR 
LECSGAI PAHCSLGDRARHRLKINK 


6958 


20859 


A 


7014 


3 


274 

• 


CFFFFFETGSHSVTQAGVQWRHLDSIiQP 
PPPGCKRFSCLS /LPSSWDYRHV/PSSP 
LLVT^SSICLLSCFEMTPBLFDGVXiAIY 
CCVTNDLKA 


6959 


20860 


X 


7015 


372 


54 


LPSAWEAEMRGPLEPKGLR5QRAEIAP \ 
CTPAWVTEQAYVSNKTNTERPFQKIBHV 
EGLSCTTQKFSSND 


6960 


20861 


A 


7016 


548 


158 


KFDT / GATLFDGRPAVFDEA/ DFPS VAV 
YLTGAEYTGBELDSDTWQAELHIEVFIjP 
AQVPDSELDAWMESRI YPVMSDI PALSD 
LITS MVASGYDYRRDDDAGLWSSADLTY 
VTTYEM 


6961 


20862 


A 


7018 


396 


34 


QKNEI KPTMQLHIiTPARMAI IKKILKNG 
GWHGCGE /METLLHCWQECKLVQPLWKT 
MWRFLKEPKVELPFDPAIPVLGIQQSQH 
WESSNPSTGDPAI P PLG I QQSQPRRKv I 
KKRYLHTRL 


6962 


20863 


A 


7020 


2 


241 


FPGPTAARRRQKEDRYEEYDKVXiSDKIjK 
EABTRAEFAERSLTRLEKSIDDLQDEL\ 
YAHKLKYQAISEELDHALNDMTSI 


6963 


20864 


A 


mm am. m 

7021 


347 


3 

i 


NDLKSLTHS VAQARVQWQ \N I S S LQRPP 
PQFKRFSCFSLPSSWDYSLLNCKFYHLH 
FADEEIETLTSQRAQLECKSNMYKDIHN 
TVRSYKLKYKVPQKLERSRNRTKSSTRE 
VRS 


/A/ * 

6964 


20865 


A 


7022 


2 


160 


IjKolJDrU^GKUrnilr GQAGLibltli I USD/ li 
PTSASQSAGITGVTHRARPDWSFCQF 


6965 


20866 


A 


7023 


3 


248 


DRVSSVAQSSLBLLSSSNPHLGFPKCWD 
YRPNPPLPFSSRS LLFLPS FL I FCRDRV 
S^PK1J^SFW\WPPWPSKVLGLQA 


6966 

. 


20867 

■ 


A 


7024 


483 

* 


143 

■ 


PLNPLKTFFFPKAFKFWGGVGP ICSPPK 
RRVLSQKSQVGFYFPPQKEKGYTFPSPG 
KrePPKEILKRAPPFFFFFFFFLRRDK/ . 
S LTMLPGJjVLNP WPQAVLPIiQ ppkalgl 
OA 


6967 


20868 


A 


7025 


414 


81 


PHPLKFFFFPETFYFWWGVFSPFSPPGR 
KVPPQKSPGGFLLAPPKGKSFPFPCPFK 
IGPPRGIFKRAPPYFFFFFFFNRRDK/S 
LIT^LPGLVIjNPWPQAVLPIiQPPKALGIiQ 
A 


6968 


20869 


A 


7026 


14 


218 


KFAPRCRN S ARENTQNQPRKHSDTPFLK 
K / HKINl^NNKLAGRGTHLWSQLLGRLRW 
EDHLS PRQSRLQ 


6969 


20870 


A 


7027 


28 


272 


EFVGVGPSEPGAGYNLL\CAFLSPSEKC 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Pheny!alanine, 
G=Glycine, H-Histidine, I^Isoleucine, 
K»Lysine, Leucine, M=Methionine, 
N=Asparagine, P*=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SIRVGVTRFSRCHLS PLTLTKKGNSLTP 
CAS R VRQCLTLLRLAHGALHPLS S TP 


6970 


20871 

• 


A 


7028 


277 


480 


LG I FHMLSDS FLLLLLFCLFVFKI FFG I 
FCRD \ RS CYEVWAML PRL VS NS WAQ VI L 
PPWPLKVLGLQA 


6971 


20872 


A 


7029 


24 


153 


SVWWNSP/RSSGEFRKTGVHDADFEAHI 
IDMLAKASKIEVYRGN 


6972 

• 


20873 


A 


7030 

i 


2 


470 


IETPRPLWWHS AYRGRQASLSCGGLHPV 
R /AS WLLCLPNQAWAMAGAP P PASLLPC 
SLISDCCASNQRDSVGVGPSEPGVGYNL 
LVRRFLSQSEKRNIRVGVTRFSRCV/LL 
PLSLTRKGNSLTPWASQVWQCLALLRFA 
HGARTHWPAPTVWHSLVR 


6973 

- 


20874 


A 


7031 


487 

- 


107 

■ 


SP PAPGVAPAPPP PTRVFSKKKKKFFFF 
FLKKIFFFKPGGDPGGIFFFPGGPPPGP 
KKFS P PLFPKKGGYRVFP PP PGKI S I FF 
FFFFFFFFFF/CRDR/SLAMLPRLVLNF 
WAQVILLPWS PKVLGLQA 


6974 


20875 


A 


7032 


576 


36 


GYTSQMGRPGRGAPHFPDRAAGQRRSSIi 
PR WGG WAEALLTSQ MGWRPGRGAPHI PD 
DGQPGRGAPHLPDE£qLGRGAPHLPDEE 
RPGRGAPHFPDGAAGORCASHPTRGGRA 
EVLPTSQTGQPGRGAPHLPDDGRPGRGT 
PHIiPDG\GPGRGAPHIPDGAARQRHLSP 
PRRGNCTTRIVDQ 


6975 


20876 


A 


7033 


2 


129 


YGPTHASGAMLRSCAARLRTLGAIiCLPP 
VGRRLP\EASRDPS 


6976 


20877 


A 


7034 


1 




RLSLALVARDGVQWCDLGS PQPPPTGFK 
RFSCLSLPSSWDYRHVSPRPANFVFLVD 
TGFLHFGQAGLELPTSGDPPALASQSAG 
I\TGVSHRAWPK 


6977 


20878 


A 


7035 


3 


315 


HASAHASALFYFI FFETRSRS VAQAEVQ 
WRKLGSLQPPRFKQFSCLSHPYRHAPPH 
FANLVFLVETGFA\HVGQAGLELPTSSN 
LASQSGGITGVSQRARPGLNF 


6978 


20879 


A 


7036 


430 


304 


VI FYLI F FCKDGV/ LTMLPRLVLNSWLH 
AILPPQ P P KVLGLQA 


6979 

i 


20880 


A 


7037 


3 


256 


FFF FNRQKDLFGNFQTCQKWHS PS PRPT 
/APHQAPKGNTSPPSLPIHLTLASTCTY 
PIPQAMLENWSVPSPAQAAGEAALWHPR 
R 


6980 


20881 


A 


7038 


13 


481 


FARLHEFGTSRVI YLIiARLVSNSWPQVI 
CLPWPPKL/IiGLQA 


6981 


20882 


A 


7039 


455 


255 


SCLSIiPSSWDYRRAPRHPANFR\FLVET 
GFRHVALAGLKYPPALASQKAGITGVSH 
HARPBQFILTN 


6982 


208*3 


A 


7040 


490 


172 

• 


KCWQQ/ C/ GETGTLVHC/ WIGRNCTMMR 
PQWQQAWQVFKKIiHRELPSDPAMPIiLGV 
QPGGLKIDTQT/RYMYMALLTVKRWKQP 
ECPPVDK/LDKLWYIHTTEYRVTIKKK 


6983 


20884 


A 


7041 


438 


37 

• 


I FPNLPPKLHYFPQFS PPEKLPKPLLIK 
MAKRQSQGKVFETPKEK/SRAGPPPKFT 
TG FSAEKNGGPKPGGYI SQRVQKKKNNC 
HSRILHSAKLSFKNRSEIKTFLNKQKLK 
/DFTASRPALQKILQEAEFHRTD 


6984 


20885 


A 


7042 


292 


463 


IKAISFLPFFFGDSLTGAQWRDLGSLQP 
LPPGLKRFSCLSLPSSELP/CSGRCQS 


"1985 


20886 




7043 


59 


257 


LASLVSQSWMKKMWYIYTMKYYAATK\G 
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OCysteine, D=Aspartic Add, 
E=Glutamic Acid, ^Phenylalanine, 
(Xllycine, H=Histidine, I s Iso leucine, 
K»Lysine, L=Leucinc, M=Metbionlne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possibte 
nucleotide insertion 










* 




NEIMSSAGTWMKLEAIILSKVTQEQKTK 
HHMLSLRSGS ' 


6986 

■ 


20887 

■ 


A 


7044 

> 


279 . 


14 


RTADTEIGRGSSLCWRYGSLQIQVSKSH 
NFFFETGSHSATQYGVQWHNLGSPQPQT 
YGLQQSSHCGLPKCWDYRE/LPPSMECS 
GT I SAHLS LKLMGS SNPLTVAS QSAG I T 
GNSSSRLHDINYGLQ 


6987 


20888 


A 


7045 


.578 

* 


253 


AAASTFSCFFVFFETEPNSVA\RLECSG 
ATSAHHKLCLPGSSNSPVSAPQVAGITD 
AHNHIQLX FVLVETGFHHVGQAGLELQ/ 
NLVISIiPWPPKVLGIiQAWSHHAQPFV 


6988 


20889 

i 


A 


7046 


3 


229 


FFKTGSLSVSQAGVQW/PNLSSLQLPPP 
RFKQFSCLSFPSSWDYRHPPPHPANFCS 
FSRDRVPPCWSGWSQTPDLR 


6989 


20890 


A 


7047 


367 


151 


LSLLSSWHYGHVPTSPANFFYF/CVDTG 
PRCVSNSWTQAVCLPQLPRVIiGLQA 


6990 


20891 


A 


7048 


370 


141 


CQP PHPANFC I FVE I VFCHVGQTGFKLL 
TSNDLPAWASQSAGIWD\TGVSHHAWPS 
RFNSEVQFDYVSNKNVDFSLS . 


6991 


20892 


A 


7049 


104 


361 


VTAPGLEAAFKERVWPGAVARTCGPSTL 
GGRGRRI \RGQEIKTILANTLKPCLY 


6992 


20893 , 


A 


7050 


2 


325 


RFSCLSLPSSWDYRHEPPRLAIFFFFFF 
GI IRKGGFTH\ VGRGGFKP WTSGNMPAL 
PSQGVGFPGLFSPPRPVRGFFYPLPKWG 
GLFLQNGVKFKKGGSGDLFKMEKK 


6993 


20894 


A 


7051 . 


14 


208 


AHE I LS E I EN PLE PG WDHRCV S SCPAHF 
FVF/CYRDGGLPMLPKRVLNSWAQAILP 
WAPKVLGLQV 


6994 


20895 


A 


7052 


103 


368 


GNRVHTYMLTALFMTAKKWNKPKC/PVS 
D EQ I RQ I WS VHAMEYY S AI KRNEALI CV 
TTWVNLENIMLSESSQKTMGCMIiRRSIY 
MKGPB 


6995 


20896 


A 


7053 


421 


2 


LFLP PE VARGFP KGRGRAS PS PGGFF FG 
PRGGPIFPPP\SFFPTRPWSKGEVFPSP 
TPGKGFRGFFFP P PPKGKGGVF PLFFKK 
YLGGEGAPQNKGGPVFLLWPPHSGPPGP 
PKKFFFFFFFFETESCYVYPGCSAWQS 
Q 


6996 


20897 


A 

■ 


7054 

■ 

■ 


489 

* 


40 


IPEAPGGPGFKREDSPGANKSGPKALYP 
PKTFHPPWEGRFRSFQKGAPPPPVFFPP 
FFPLPFFFLRQSHS I PQAGVQWRDLGSL 
QPPPPPGFKQFSCVSLPCPP\RQMGFRH 
VGQAGLELLTSGHPPDSASQSTGITGVS 
HCAEFHRTD 


6997 


20898 


A 


7055 


3 


261 


TGSGSVTQAEVQAEVMAHCSLNLLGSSD 
LS ALAS RAAGAAGTCHHA WL I FVFFIEM 
GFHYVPRL/GLELWARVIHPPQPPRVLG 
LQV 


6998 


20899 


A 


7056 


1198 


1040 


FLWKXALLFYFKKLPQSLQPLVATTIiSS 
QQPSTSRQDPLPAKRL\RLAEGSDDR 


6999 


20900 


A 


7057 


271 


33 


LFFFFKMESRSVARLECSGAILAQCNLA 
RL FLVE \ QG FHHVGQDGIiGLL I HP P WP P 
EVLGLQAQATPPSPKITYEAATVM 


7000 


20901 


A 


7058 


2 


194 


CYRRQPPHLAMNFYFIjIiNFFCI*FFG\KA 
LSLLARLLLNSQLQASLPPQPPKVLGLQ 
ACGTTPCP 


7001 


20902 




7059 


3 


272 


NMWKLNNILLNNQWVNGEIKMKI FKNCK 
E \NGNTTYQNLRDAAEAFLKGKF IAVNT 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

• 


M 
od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
B=Glutamic Acid, F«Phenyla)anine, 
G-GIycine, H=Histidine, Mso leucine, 
K=Lysine, L=Leu cine, M=Meth ion in e, 
N=Asparagine, P=Proline, 
Q=G lutam ine, R=Argi nine, S=Serine, 
T^Threonine, V=Valinej 
W=Tryptophan, Y«Tyrosine, 
X«Un known, *=Stop codon, /^possible 
nucleotide detetion, \=possibie 
nucleotide insertion 














YIKKDYRKTCLRLFYSYHFFNKEKEYTL 
KTMIKV 


7002 


20903 


A 


7060 


397 


129 


FWGPHKKNFPLPARGRKLGSFKTA/LPL 
FFFFFSFSLSLFFFSGSCSVS\RLKCSG 
TITAYCSLEFLGSSNPPASFSQAfcGTTF 
TFFLFIG 


7003 


20904 


A 


7061 


3 


351 


SCFRLLCLPKEAWAMAGAP PPASLPPCS 
LI SDCCASNQCDS / VGVGPS E PGAG YNL 
WRRFLSGSEKRNIRVGVTRFSRCRPS P 
LSLTRKGNSLTPCS SQVRQCLALLRIiAH 
GARTH 


7004 


20905 


A 


7Q62 


60 


324 


DDFVSWQTGVHWCGRGSLKPQPPRLSS 
TSRLSSPSSWDYGRVPF\NIFCRDRVSL 
CFPGWPRTPDLRGS SCPSPKWKLHTRV 
TAPGQ 


7005 


20906 


A 


7063 


481 


160 


RPGAPG/ PDDFARMGPRPKGGAFTLVHP 
MPSGAAFPGLLSEP\PHRGPRSFAARSP 
TLRYPGRAPSKGSGEDPARPPATGPGPN 
LIQDGLVLMSLTENVC 


7006 


20907 


A 

■ 


7064 




481 


CLCPAPRGGAYRGRQASLSCGGLHPVRA 
S RLL CLP KQAWAMAGAP P PAS LPPCSLI 
SDCCASNQRDSVGVdPSEPGVGYSLWR 
RFLSRSEKRNIRVGVTRFSRCV/LSPLS 
LTRKGNCLTPCASQVRQCLALLRLAHGA 


7007 


20908 


A 


7065 


392 


183 


GGLKP/ IPGNPGNSLFSQKKKKQPGAGG 
SPRGSPPPGGLGGGIFFGPKGLGSINPN 
FGPAPPPGGQKKKSLF 


7008 


20909 


A 


7066 


538 


248 


FFLKQSCS VAQAGVHWGYLS SLQTLHPR 
F\KFSCHSLSSSWDYRCAPPRLAN\FVL 
LAEMGFHFHHIAQAGLELPTSNDLPTSA 
SQNAGITGVSHHT 


7009 


20910 


A 


7067 


2 


127 


VTGQAWHACSPGTLGARAG/WIVWAQQ 
FRTSLGIMVRPCLYL 


7010 


20911 


A 


7068 


379 


217 


KKMFCKNE/SFTI FPRLVLNSWAQTILP 
PWPPKVLGLQVQTTVPQSYSPLSLYKE 


7011 


20912 


A 


7069 


386 


176 


KRDKVDKWIiKKMWCIHIMEYYLAIKKBT 
LPFATWVNLQ\TFILSEISQEQDKYHMI 
SLICGIKMLIYRIRE 


7012 

■ 


20913 


A 


7070 ' 


30 


408 

• 


NSSELGSVGVGPTEPGTGGNLLVCWLLR 
LWEKCSVWA3VSRFFQ/ CTDYDGFPWVG 
KGNPPTPCASQVRRHPALLRRTLHGLHP 
CASHEKNQVPQLEMQKSSVFCIDLTGSC 
RVELFPFGHLGSKNP 


7013 


20914 


A 


7071 


310 


414 


HVP WRS FAPAHCNLGS LQP P P PGFKRFS 
CLSLPSSWDYRCPPPRLAN\FCSGSLQS 
WLTATSASWVQAILLSQPPE 


7014 


20915 


A 


7072 


353 


243 


I FI FCRV/ RGLT I LSRLVWNFWDQVILL 
PEPPKVLGLQV 


7015 


20916 


A 


7073 


1 


276 


ELWDNFKEPNIYLKSLKERKECVCERGR 
RDRNKTLED IMAKN FS NWMK I INPQIQE 
AQQTPN\QETCRKIJU*RHITIKLLKTND 
KKKILKAV 


7016 


20917 


A 


7074 


362 


133 


GGGGGGGRYSPQKKGTLPLTPPSNWGPK 
GAPPPRGNFFFFFFFS \RDGOiAMLPRL 
VLDS WAQVMLLPQ PLKVLKL 


7017 


20918 


A 


7075 


3 


177 


KWSPHHSLPNN\WLYRHMPPCPANVBFF 
CS LAVLPRMVSNSWPGAILSPWPTRMLG 
LQT 
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SEQID 
NO: of 

n it f 1 m\ ii ffl a 

II ULIvVllUC 

sequence 

• 


SEQID 
NO: of 

sequence 

• 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
to/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 

location 
correspond 

Inp to fir*r 

1MK *w All SL 

amino acid 
residue of 
Dentide - 

sequence 


Predict- 
ed end 

nude- 

otide 
location 

rnrrt?snnn 

LUI 1 UUUU 

ding to 
last amino 
acid 

residue of 

n entitle 
sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
EF=Glutamic Acid. F—Phenvlalanine. 
G=Glycine, H=Histidine, Msoleutine, 
K=Lysfne, L=Leuclne, M=Methk>nlne, 
N=Asoa rapine. P= Pro line. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^TrvotoDhan. Y^Tvrosine. 
X=Un known, *=Stop codon,/=po*sible 
nucleotide deletion. \=oossib)e 
nucleotide insertion 


7018 


20919 


A 


7076 


1 


169 


. PQLPSSRNYRHVPPCWANFCIPS/RRRC 
FTMLARLFLNSWPQVTI^HWPPKVLGLQ 
A 


7019 


20920 


A 


7077 

> 


402 


231 


PSCLSLRSSWDYRCMPPHLTNYYYYYYY 
YFVEMGSHSVTRPQDILPPWPS pkvlgl 
QV 


7020 


,20921 


A 


7078 


422 


316 


FCKDK/SLPMLPKLILNSWVQTTHLPQP 
PKVLGLQA 


7021 


20922 


A 


7079 


58 


293 


DPICTKSKKLKKTQ\TDKEDEITTKKAK 
KA^SKIKIEEDGEEGKWWEEEVSVKK 
KKDKKKPWIX3MVAHTCNPNTLGG 


1022 




a 

A 


/UoU 




12j 


uvr f lujfxruv^vf ouniuvuivniOKu / n 

VLTHFKEFLFHWFLTIHTHTHTHTHTHT 
HTPSFPE LGKAVPMHR I VI VPTNYQH 


7023 


20924 




70ol 




lol 


yptiTi\ PPwnwvTiPfHi^RwmcrtWQfYrPfiT. 

x c 1 1 1.) \rr rvui\ v uir^njcr\j«<3 auhijux fssij 
GQSAHLSLPKHWVCRREPPHPALFLLFK 


7024 


20925 


A 


7082 


453 


142 


FFXXFXFXXXXXFFPFFFFFFXPPFPFP 

FFFLKNSFPYFFLFFFFYPLFLPLFFFF 

FPPNFFFFFLFS*NPFFFFFPPPPLFFF , 

PFFFIjFFFFFFFFFFFFF8F 
r (T t -t jur t c ffffv f ceo* 


7025 


20926 


A 


7083 


268 


92 


KNNLPS S WG \ YKCS PPHKGNRFFFYI KD 
GGGLIMLLRLI SNS WAQVILLSWPSKYL 
ERQA 


7026 • 


20927 


A 


7084 


413 


0 


SCSVAQDGVQWDDLGSLQRLPPKNKPFP 
CT,<3 CfPAA\ FPSDWDYR 


7027 


20928 


A 


7085 


400 


125 


KTRSFSWRGGVQL/CI/GSLKPRTPGV 
K"«3 ^ <? P <3 A <? PVORNTOMPP P POO T FFFTW 

ETGSHYAAQASLKQSSCLGFLILFLSQP 
LGEKEGPTICT 


7028 


20929 


A 


7086 


377 


38 


NPGPGFYFWGPKKKIKPFPPPGVKLVSP 
KRAPPFFF /CFFFETGSPFVAOTSIjEIiIj 
DSHNPPTSASQSAGITDMSHHARPMIISI 
CM I QLWKSYHRFHF I SQGSRDCSSSRBE 
T 




mom 


a 


/Uo/ 


i 
i 


i ji 


S FCRD /KRLIMI*PRIiVIjNSOAOVMIiLPO 
PPKVLSHHTWPI IKFSFMHFNLF 


mm 

/UJU 


OAQ71 


.*» 


/Uoo 




in 
in 


PRXi\TLN\ SELRRSAHLSLPKCWDYGPE P 
PCPAQMHVLKSGTAAHKVHDFLASWSHL 
PTHSPFPVLCSKTQISVAFFIQNLRAGQ 
RSGQRHFGKQCGSIH | 


7031 


20932 


A 


7089 


1 


309 


KQINKSSLRISGKPPNCECVLHIiGKSVN 
KSLLEPHKLASDP /HLCDFLESSVEQVK 
SVKELNCHV\TSLYTLGALESDMAETYF\ 
.NKYTLGPGMLAHAYNSSILGG 


7032 


20933 


A 


7090 


414 


240 


FSLF/CKNIjKKGLTMIiPKLVSNFWPOAI 
LLPOPPKVIiGL 


7033 


20934 


A 


7091 


460 


328 


RHP P P YLG K/ F FCRDR/S LTML PRLVSN 
SWPQVILS PWPPKVLVL 


7034 


20935 


A 


7092 


388 


199 


KQFSCLSLPSRWDY/ SALPPRPAHFCIF 
NRDGFS PYWSGWSLGLPKCSD YRRRIFL 
RAKAHVRE 


7035 


20936 

• 


A 


7093 


18 


271 


MHSFATYLLVNLHLIiVLCRLCPLSMYSG 
EKSYILLTKRPMLWF/SSGPFPSWSFSS 
ITLRIFLTCLTLKLSCCSCCCCCYCCCC 
C 


7036 


20937 


A 


7094 


415 


291 . 


GDLGSPQAPPPG\SRHSPASASRIDGTT 
GARHIMNFKNLAIK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
nentide 
sequence 

• 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

■ m mm mw mmmrmr mm mmm mm 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- ' 
otide 
location 
correspon 
ding to- 
last amino 
acid 

residue of 
oeotide 

K# mwmF WW mm mm 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 

m^^m ^^m- ^ ^^m^m mm* w ™ mm m mm ^mmr ^mr mm mm m ™ -^r » m, mm, mmm u my 11 in 

G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Prollne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 

mm mm, mmj m WVH'II V V VWM^P^ 

W=Tryptophon, Y=Tyrosine, 
X^Unknown, *=*S>top codon, /^possible 
nucleotide deletion. \=D0Ssible 
nucleotide insertion 


7037 

■ 


20938 


A 


7095 

■ 


21 


428 


DKGLTVTQARVQW / HDPSSRQPPSPGLK 
RFFRLSLPSNWDHRCAPPRVANYYFFYR 
WGS YHVAQ AGLE LLASSNTP T S ASHS AG 
ITALFLRDTIiVRFFMGSFCFPQSFGAAF 
TAQIPQRGRTEELFIFLFLNDYLFS 


/ wJO 




A 


7096 


320 


134 


IjPL \ PKTTFYLFLNRDRVLLCCPGWSR \ 

mammm mmmw ■ m> mm mm mm mmi mm mm/ mmmrmm mmmmmj W mM mmmw mm. m w? mmm> mmtm ^mw ^mwmm mmmr m v) mmw mm m m, 

ILSSKQSTLIiGLPKCWDHRHEPPQQAVD 
HTLDFS 


7039 

* 


20940 


A 


7097 


402 


221 


DYRHVPPFPANF/ LVEMGF WHVS QAGLB 
LLGSSDPPNTjSQSAGITETSHQAQAYTF 
IFIIY 


7040 


20941 


A 


7098 


427 


259 


PPPRFKRFSGLSLLRRWDYRPPPSCPGN 
FFFFF\CIFVQMGFHLVGQAKMLKLHD 


7041 


20942 


A 


7099 


422 


275 


VQWYNLDSLQPASSGFKRFSHLRLLSTW 
DYRH\ PRPANFC I FSNTKDTFI 


7042 


20943 


A 


7100 


3 


404 


HAS GAT WRNP VS TR VWWHI P IVPATQEG 
EMGGS LE PRK/ LKAAVSHCTPYWVT 


/LKf J 






Tim 






FETE «5 RS PTEARVO WRDIiGS IjOP PPPG F 
KR/DSPALASRVTGTTGAHHHAGLIFVF 
LVEAGVSPHWPGWS 

* 


(VH** 






Ttm 


Wo 


AO/ 


WLILFFIFCRD /RGLTMLPTIiVSNSWAO 
AILLPQPPKVLGL 








/ IVD 






S FIiR WS FTLVAOAG vrwrglgs lopl PP 
WFKQFSCLSLLSSWD\YGRPPPRLANYF 
F WILVETGFHRYWPRMVSNSQPO/CDL 
PAVGRPNSAGI \ TGVSHCAWPKMFLNNC 
TCTHSPYLQGIVSLG 


7046 


20947 


A 


7104 


227 


55 


FSQKKINIQFYISLSPTHITMSFFLCRD 
/RGLAMLHRL/VLNSWPHVILPPWPPKV 
LRL 


7047 


20948 


A 


7105 


479 


314 


PRLLLIFFFIERGL\LCSPGWSQTFGLQ 
ESSCFGLPKSWGPRKEPQPCLALKWTLQ 








/ I uo 






NSFF / CKRRGLTMLSRLVSNSWPOVILT 
PQPPKLLEIi 


7049 


20950 


A 


7107 


441 


106 


GAPSPASLPPCSLISDCCASNQRDSVGV 
GPS EPGVG YSLVVRRFLSRSEKRNIRVG 
VTRFSRCV/LS PLSLTRKGNSLTPCASQ 

W *m> mj^mmr m m m ] m 'm ^mr W M mmmw^mr mr- mm.mr-mmr' mmm mm, +mmmm ww^i mmw mmm mm Mm ^mmm mwrnrnw 

VRQCLAiLRIAHGAHTHWPAPTVWHSLV 
R 


7050 


20951 


A 


7108 


413 


207 


CCIQAGPAFFGGPPPPRWGWGGVPPPGY 
/MPS P P VFRVGLL VFRGPP PNWFL PMGL 
QPDGPGLGDPKKFAK 

\^m\\ mm mmW mmm* Mm WMNJVtl m» M X 


7051 


20952 

• 


A 


7109 


92 


409 


PHLPDGAAGQRCPPPGRGGGCVRAGPQL 
PPGRGGWPGGGLPPPPSRTGWIiPGGGAP 

mm mmm ^mmr mr m ^mW ^mmw w ■ mm ^mm- ^mmm ^mmw mmt mm mm mm mmm mg m mm w m mm^r^m mmmw mmm mjmmm mmmm 

HFSDGAVARQRVS S LLRRGGRAETLLTS 
QTGSRPGRGTPH I SDDGRPGRD 


7052 


20953 


A 


7110 


408 


13 


PPWGPPGGGPPEARGSNPPGPPWGTPLS 
PBCKPKNYSGQGGGPLNPPPPEG/SGPGI 
PFSPEGKGSLNPGGQKGKIGAPSPPPGG 
PKKKPPSQKKKKGKEKKEKVIKGESSQI 
LLFKVS KI LSVNFLHLSSL I 


7053 


20954 


A 


7111 


290 


175 


LKKNCR/DRSHTMLPRLVLNSWPQTILP 
SQPPKVLGLQI 


7054 


20955 


A 


7112 


1136 


552 


I IjFKEMGCE AGQRCSCPV I P I TLGRPRR 
ADHLRCGVRDQPGPTWLKPRLYSNTKIS 
WAWWRVPA\ I PA\ I RRPEATRFTFNPRR 
WRLPLNLRSHPLHS HPGRQS KI PI KKER 
KKGQDRKGKGCGQNWGQKQKRRKEGKKE 
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SEQID ' 
NO: of 

mirl<kfktifiA 

IlllklCUUUii 

sequence 

• 


SEQID 
NO: of 

sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

mm M W* W V 

location 
correspond 

inp to first 

■Ug t-V 111 dl 

amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
rnrresnon ■ 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanlne , 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenvlalanine. 
G^Glycine, H=Histidioe, I-Isoleucine, 
KHLysine, Lr=Leucine, M=Methionioe, 
N~Asnaraf>ine. P=Prolirie. 
Q=G lutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Unknown, *=Stop codon, /"-possible 

niiclentiHe (folptfnn. Wnn vtihtfi 

llUVlwUliUv UvlvllUII) ^JfWSwIMIv 

nucleotide insertion 




■ 






■ 




. MGMGRI GG KNKKEGRKERKEGRKEGNKW 
WAELGAFPE(^KGIjRRGPSEEAVGVSP 
L 


7055 


20956 

• 


A 


7113 

> 


570 


199 


LCLLKQAWLMAGAPPPASLPPCGLISDC 
CASNOODS VGVGP S E PGWYSL WRRFL 
SRSDKRNIRVGVTRFSRCV / LS PLSLTR 
KENS LTP CASQVRQ CLALLRLVHGAHTH 
WPAPTVWHSLMR 


7UDO 


2U507 






400 


OCA 


PTRGTGWTAJJPJ^ItATOPGI/rFFIILIHK 
QEGGNVGSQKGQRD\VTNSAPHWRLHDQ 
TANCLS 


7057 


20958 , 


A 


7115 


474 


224 


DSNFEKSSTLGKMLSNSIKCYREIFCER 

Tf ^OTiMnOT^MTA FK"KTiPG*3Pf)P^ATTTT. 
fNirm ti i^4\4 i- JlLLJ \c JuUJryuryrwni i m 

ISQLPPTSRQNPPPAK\RLQLAEGSDGC 


7058 

• 

► 


20959 


A 

■ 


7116 

- 


377 " 

* 


111 


FXXNFXS FLFFS LSLTRKNWVAHITFFF 
FFK'TiPTPPPPVFTPTJiTFPFFFFT.PYTF 

FFFFFFSFFFFFFFFFFFFFFFFFFFFF 


7059 


20960 

■ 


A 


7117 


417 


148 


VPPCLAN F \NFFVETGS P YVAQAGLELL 
GSSHPPTLASQSSQITRVSHCSLLyFLM 
HSNTS I S FFSFWALVS FLEMP S S CQDGX 
TVLAR 


7060 


20961 


A 




2 


1 Q£ 


PVT3P"Dr , P2X \ MPVFPVn'FAFf'HVAfi ARPTf 

LtiASSNPPASASQSARIAVLSYCPLPSL 
RLRSHLDFF 


70o 1 




ft. 


*71 1 Q 

lily 




41 / 


A WAT 


7062 

■ ■ 


20963 


A 

• 


7120 


801 


450 


GPKFLGPPGVRIN/RRPGGGGLPRVYKK | 
NNYPEDRPIVLPAFFSGGGR /GPGPHDK 
HPLPRKKQKNSPGRGGEPPFFPPPGFPP 
GGERKNPIiTPQRKRVHWGQGLAPPFPPS 
IiSAKPKLF 


7063 


20964 


A 


7121 


3 


199 


DAWGFAL WQAGVQCCDLS S LQPS P /RL 
ANFVFLAETGFLHVGQAGLKLLTSGNLP 
ALTSQRLRLQ 


7064 


20965 


A 


7122 


407 


164 


NVAGLTGAYLDS FFGRDRV/ LTMLLRLV 
TiNSWAOAI LPPWPP KVMGLOACTI BVSS 
LVAQ IAS KRLSGVQESDEAAW IRDC 


7065 


20966 


A 


7123 


33 


192 


RWDFT MLP RLVLGS KRS AHLGLP KCWD Y 
KHEP/PCCPGWSWAQRDLPTLASQSAGI 
TSMSHLHQAIPLKFLEQFIRFSILVF 


7066 


20967 


A 


7124 


3 


399 


SIPVDWPSRAPTPRGRESAAHQGN/YPV 
GQRN\RMAGLSPRSFCWREVSFSR/SHS 
C / ICWVOLGKSAVLPOQSRGPGAHEGSR 
RRGGSSPPCPPLQTQLVFSPQELSMSAP 
I HRLS VTLQGDC I S TGGAPFRCRLA 


/UO/ 










1J ft 


FFEMESCSVAnAGVOWHDRGSLOPLPPG 
FKQFSCLSLPSSWDYRHLPPHRANFFVF 
LVE TGIjHHVGQ \ S GLELLTLGD / LLP AS 
AS QS AG I TGVSHHAWAKNPCFLTS IDT 


7068 


20969 


A 


7126 


408 


234 


LPS SWDYRS PPPQANF YI FSRD/ MGFTM 
LARLLTSSDPPALASQSARIAGVSHHAQ 
PVYF 


7069 


20970 


A 


7127 


88 


277 


KLSXEYKKQRPEDS FFLFL/VE IGFRYV 
AQAGLBLLSSSDPPTLASQSAGITGVSH 
HIQPPFKK 


7070 


20971 


A 


7128 


3 


140 


QRW WCVP ATRKAEAGESLETES SRLQGA j 
MI TP \ CS PAWATEQDLAS 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

- 


SEQID 
NO: In 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystein e, D=A spa rtic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, LHLeucine, M B Methionine, 
N=Asparagine, PHProline, 
Q=G luta mine, R=Arginfne, S=Serlne, 
T=Threonine, V=Valine, 
W=*Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V-possible 
nucleotide insertion 


7071 


20972 


A 


7129 


1 


212 


PLLFVSPPLLS VS PPLLPFP/ EPAASPP 
PSTHSLLPSVQSESLGQFLHPPDPPPLA 
FPAPLRHQDLHISHL 


7072 


20973 


A 


7130 

• 


361 


3 


RNNISQDIAVIGNDSSNRSGQSKLKIFW 
KGFT ILHAI KD F CDS / WEEVKI SRLPGV 
WKKL I PTLMDDF / EGFKTSVEEVTADMV 
ETETELER LE LEVKPE YGTGLLQS RJDKT 
LTDEKLLLMD 


7073 


20974 


A 


7131 


421 


201 


NFCILVD/TGFHHVGQV^LELLTPSNPP 
ASASQSSGITGLSDHSQRKSELVLSFNF 
FALFHILLCVMFLAIDRND 


7074 


20975 


A 


7132 


221 


529 


LIKFTIHSFYFIYLFIFFSGDGCFALSP 
RLESSGVISAHCNIRLPGSSDSPDSASS 
VAGII/CMRHHTQLILY/CLVET/EFHH 
VGQGGVDLLTSGDPPASASQSV 


7075 


20976 


A 


7133 


1 


227 


KG IHFSSANDVKKKTVLTWLNSQDPQLF 
RDGLDGWYHRLQKCL\ELDEVYVEK 


7076 


20977 


A 


7134 


1242 


300 

• 


DFVLPDGRVIP \KGI VCLINI IG\THYN 
PN\LWPTPLEVYDPFRFNP\ENIKERSP 
IiAFIPFSAGPRNCIGdAFAMAEMKWIiA 
LTLLHFRILPTHTEPRRKPELILRAEGG 
LWLRVEPLAGTSRGNPRASSRPSLSDPW 
VLRRPLPPPI FLFLLLLTLPLLFSVMAR 
EP PS PLRVEAPG P PEMRTP PAI EAT PKG 
TLQP AGGRLRFL S GCVP I LHQ VAGHR YG 
KDKMGI LQHPDGTVLKQLQPP PRGPREL 
EFYNMVYAADCTDGVLLELRQYIiPKYYG 
IWPPPAAPNDLYLKLENVTHKFNKPCIM 
DVKIGQKKL 


7077 

* 


20978 


A 

* 


7135 


I 


259 


GTS C I LASMIiTRHRLRFS F/ CLFETEFC 
SVAQSGVQWYDHSSLQPQPPGLKLFSPV 
NF PS S WDYRHMP P CLAS AS I LKLLFT ID 
LGFYS [ 


7078 


20979 


A 


7136 


1132 


900 


CSGVI SAHCKPRLPGSRHS PAS ASRVAG 
TTGTRHHRRLNFCI FFLVEMGFHRVSQD 
GLHLL/NLVVCPPRPPKVLELHA 


7079 


20980 


A 


7137 


1 


501 


FFLRWSLTVTRLECSDVISAHCNLHLPG 
SSDS PASAFREAGTT/GHDAQLI FVFLV 
ETG/FHHWKDGVDLLTS 


7080 


20981 


A 


7138 


2 

• 


229 


WHSHGLLWSQP PL \ YLSLSSS WD YSHVS 
PCPAGLCLFLVETGS PYVAQAGLSLRGS 
. ASQSTG I IGMSQPAGFWKQL 


7081 


20982 


A 


7139 


255 


58 


IFGKGFPPFFCQNQGFGVPPPPRAFFFF 
FF/CPAKYYTMLPRLWSNSWAQAIHPP 
WPLKVLGLQA 


7082 


20983 


A 


7140 


422 


274 


TMLPRLVSNS \GLKQSSRLSLPKWWDYR 
YKPPCLVYTFF INRPPEVPKE 


7083 


20984 


A 


7141 


417 


60 


VWRFLEELETELPFDPAIPLLGIHLKEH i 
KLL YHKDTC\ WNQP KWPSMVDW I KEMWY 
INTMEYYTAIKKINAIFRDPLTSPWDPF 
S WRKFCLPLE I WPAS YFLPG INTRSLDE 
PLHIKGER 


7084 


20985 


A 


7142 


391 


70 


SPPPHPPRKGDYRGGPP/RPGKKREIFF 
FFFFLVDTGFCHVAQAGLKLLASSN/LP 
TSASQSAGITSVRHYIQPTLLKI IPQFL 
DILL FLKI FHS FLKS FNFNLGS FY 


7085 


20986 


A 


7143 


405 


304 


FCRDR / S LLMLPRLDLNSWAQG I LLPWP 
PKVLGL 
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NO: of 
nucleotide 
sequence 

- 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 
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SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
OCysteine, D=Aspartic Acid, 
E»>Glutamic Acid, ^Phenylalanine, 
G=Gtycine, H«Htstidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Methlonine, . 
N=Asparagine, P=Proline, 
Q=Gltitamine, R=Arginine, S=Seriner~ 
T=Threonine, V=Valine, 
W^Tryptophan, Y^Tyrosine, 
X^Unknown, *=Stop codon, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


7086 


20987 


A 


7144 


3 


159 


HAS E KI W YM YTM EHYA \ SQKKNKIMS FV 
ATWMRLEVN I LS KLMKNEMMSVNGR 


7087 

■ 


20988 


A 


7145 


343 


115 


CEFYFLSCIYVFIYFCR/DKGLSLLLRH 
VSNSWPQVILLPWPPKVSHQAQPBFYFR 
NKQTKI DLS FEETAADLAYYS 


7088 


20989 


A 


7146 

* 


453 


119 

• 


RCWKGCGLWI SLRTT/ WRBWQLVQPPWK 
TVWRGLRKLKVE PPYEPATPLPGIHLKK 
TKAVAQRDMCTPLFIVAPQHHLLKRLFF 
SRCVFLAPLLKIA 


7089 


20990 


A 


7147 


3 


105 


TIEWGEVTS/S S YS STMANNEGLFSIiVA 
RKLSRPL 


7090 


20991 . 

* 


A 

■ ■ 


7148 


295 


23 


FANIFSHSAETLQTRREWGPIPNIL/EK 
NFQPRISHPAKLSPISEGEIKSPPDKQM 
LRNFITTRPALQEFLKKEPMSPPC3TWMK 
LETIIIS 


7091 


20992 


A 


7149 


1 


161 ' 


RTRGFF YLDGVRSLAML P / RLVLNS WPQ 
AILLP^PKVLGSQTLAMVAAYTVFV 


7092 


20993 


A 


7150 


1 


219 


DYRCTPPHPAHF/SIFIFCRQR/ SLTMF 
LSLVLNS WTQAI LLPWP PKLLGLQACTA 
MPIPGVELLWVSGCFVKES 


7093 


20994 


A 


7151 


177 


34 


ELVSFLS FSLSLS F FPS FLPS FLPFLS F 
LPSFNPSF\LPSFIiPSFL 


7094 


20995 


A 


7152 


398 


40 . 


MNDRVLLFHPGWNAVAQSWFTGASN\FG 
LKQSSHLSLLCSWNYRHTPQCPASLLAY 
VCMYVCM YVCM YVCR D/RGL TML PWL VS 
NSRPQAI LPPQ PS KIMG PLLGAVAHAYN 
TSPKGSQRR 


7095 


20996 


A 


7153 


418 


179 


RNSCTQMRNEALPNSGYFPKLKYYYDF/ 
CQRQDLAILPRLVYNSMVQAILLPWPPK 
VLGLQHEPPCLAVIQFIiKGNLKIV 


7096 


20997 


A 


7154 


767 


537 


QALLCIiPGWS\AVAQSWLTAASNSSDPP 
ACLPSSWDYRHVPPRQAEWSFRVSVSSP 
VKRDPGSPWRRMRPCSE 


7097 


20998 


A 


7155 


421 


301 


HAQLFFCRHW/ SFTMLPWLVSNS WAQAI 
LLPWPPKVLGLQV 


7098 


20999 


A 


7156 


410 


269 


AGGPQTGSFFFF/CYERGVSLLPRLVSD 
TWSHVILQPQKPKVLGLQA ' 


7099 


21000 


A 


7157 


3 


237 


MQIKA/TMLYHFTPTNMTVLGFKKNSRG 
WQARGEIGTLVHCWWGCIMMQPLWETVW 
HFLKRLNKLOWVHACSPNYSGG 


7100 


21001 


A 


7158 


435 


211 


IjFLVEMG\FAVIiASIiVIiTLE\PRDSPTS 
ASQRARMTGMSHHTQPLFCLFVSKFHLA 
KKLSLI WQNLRRAG I CRLS 


7101 


21002 


A . 


7159 


3 


220 


SSSASQSGGITGVSHHARPSERQ\SCSV 
AQAWQWCNYSSLITPTPGLKRSSRLSL 
IiSSWDY\GALPHPSVAS 


7102 


21003 


A 


7160 


1 

* 


172 


FRKVLGRTGSNGLCTQVRVEFMDDTSRS 
IMRNVK\ SP VREGDVLTLLELELETLRL 
R 


7103 


21004 


A 


7161 


1 


180 


DAAPVLKATARP TRPDRWK/ IDIDSLDI 
HGETP SLQKNTKTFRAWWCI P WPATRB 
ADMF 


7104 


21005 


A 


7162 


3 


108 


KKKKTGRFKLV\MQYNRRFWVVPTFGK 
KKGPNANS 


7105 


21006 


A 


7163 


409 


110 


GQHGLTQS LPLSAP PRGAVPRWAPRPD / 
SGLSLS PLIiPTAFATTGASSANR FVS IG 
PRDGNFLNT PQQS QS WFL 


7106 


21007 


A 


7164 


35 . 


371 


YFHHLSLLLKf'LLASMMVFLTVVVRIjLE 
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Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. {^Phenylalanine. 
(XJlycine, H=Histidtne, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
O^Glutamine. R=Arcinine. S=Serine. 
T=Threonine, V-Valine, 
W^Trvoto d h a n . Y^Tvrosinc. 
X^Un known, *=Sfop cod on, /^possible 
nucleotide deletion, ^possible . 
nucleotide insertion 


• 












LLFFFLGETEPAFVAQPGRNOGPNLGSL 
RPHPPS VKHFPGLTPPRAGTSGLDPPS P 
AIFGFFKKKRGSPCGPGRAWTPGPRGGP 
P 


7107 


21008 


A 


7165 




382 


YTGPAWWSMIiVVIjATE eaevggs leprs 

skloAqpplrhctpawat 


7108 


21009 


A 


7166 


1 


305 


TAVTVDP YHSVY I E VNAR PGACMNGGGL 
LNGEQNSLKRKQLVQTLNKRHGEELAMV . 
DS LTLGPKLEEA/AS P PAFARVRLRARO 

U Immf mmM mm, m%mf\mV mm m \ k f 1 ■ 1 1 if ^ I ilU ^ £ *U V * A^JiVU \ ^£ 

ACRRKGSRLCWEHLVSKV 


7109 


21010 


A 


7167 

* 


383 


55 


RDLS MVRMKSMLAI ALCFTALMGMFNS I 
FYGRWAKLPF P PLS Y I QGL SHRNMMGD 
DTTDCS F T FLY I FCTMS TR\ ON r OKTVQ 

LAPSRAATKQAGGFLGPPPPCGKFS 


7110 


21011 


A 


7168 


465 


166 


S PQGAQ I KGVGFQKLRPWGP \ PGFPRGK 
PFFWFKKKLPPQGGGPLCSRFSGGLSQK 
N\SGPWGEI FPWGQNGPPPSPPGGKETF 
FS KKKKKKKAVD YLNS 


7111 


21012 


A 


7169 


2 


323 


IQGFKTLSEEVSADEVEIVRELGLQAET 

T7'H'\7 , TT7T>T.lTVMCTBr , r , DWTT7UT7I?'irDnOMO 
cuv xcxjixririMo J. *w~. v^rcci j. r ri is rUNJty 

QTSLVSHF\RKLLQSPQSSAIATINQSQ 

PSTLRQDPMPATRLHLTGGLDNH 


7112 


21013 


A 


7170 


3 


509 

* 


FGTIPIiVLCADLNSLPDSGWEYIiSTGG 
VETNHKDFKELRYNESLTNFSCHGKNGT 
TNGRITHG FKLQSAYESGLM P YTNYTFD 
FKGIIDYIFYSKPQLNTLGILGPLDHHW 
LVENNI SGCPHPLI PSDHFSLFAQLEL/ 
YTAFPAPSQRHPPSWQEWKHLQRTALI 
HL 




• 




71 /l 


1 lJLi 




dow>r si * ooxuv^i? v i£tf v ztvjvjij ir rii'i^ xnci 
DVFPEPQAR FYVACWLGLQ FLHEKKI I 
YRDLKLDNLLLDAQGFIiKIADFGLCKEG 
IGPGDRTSTFCX3TPEFLAPEVLTQEAYT 
RAVDWWGLGVLLYEMLVGECPFPGDTEE 
evfdcirlhgrpptpaflsvog\vkfio 
KLIjQKCPEKPLGAGDQNAEEIKVQPFFR 
PTNWQALLARTIQPPFVPTLCGPADLRY 
FEGEFHRAA\ PAL T P PAPHS LL TAROOA 
AFRDFDFVS ERFLE P | 


7114 


21015 


A 


7172 


355 


3 


GPLI^RLGMNILFLTLFLLLPLSMLEGG 
VCGNF/ITNCCLEIDDNRKVIEDITAKI 
QKLAHVPVQIWKGWSPDTLFGGWFSSLG 
RFKTLVGIVLVILGVCLTLPCLLPLLVK 
NIQSANV 


7115 


21016 


A 


7173 


184 


2 


EKKVEDDDDDDDFEME SRS V I PAGVQ WR 
NL\LHPLPSGFKPPSCLSLPRSVfEHRCA 
PLHQCI 


71 IA 
/ 1 io 




A 
n 


71*74 
Sit** 


i ion 


dm 


MGEASRPEESRMRLQPFPAHIGRSALVT 
SSSSLPSFPSTLSWSRDESARRASRIiEK 
QNSTPESDYDNTPNDMBPDGMGSSRKGR 
QRSMVWPGDGLVPDTAEPHVAPSPTLPS 
TEDVIRKTEQ I TKNI QELLRAAQENKHD 
SYIPCSBRIHVAVTEMAALFPKKPKSDM 
VRTSXjRLLTSSAYRXiQSE CKKTLPGDPG 
SPTDVQLVTQQWIPVCVRPFAKAAKQL 
VTITTKENNN 


7117 


21018 


A 


IMS 


3 


136 


AIHLASLKVF YRYRVLSL/ LKRLVSNS W 
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Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
(^Glycine, H^Histidine, l-Isoleudne, 
K~Lysine, L=Leucine, M=M etnlonlne, 
N=Asparagine, P=Proline» 
Q=Glutamine, R-Arginine, S=Serlne, 
T=0Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, Impossible 
nucleotide deletion, ^possible 
nucleotide insertion 














. PQVILLPWPPKVLGLQV 


7118 


21019 


A 


7176 

IMF \J 


3 


316 

•J 1 V 


GIVKFFFFFFLTKCRTVTQAGVBWCSLG 
SLQLPPPRFKQFTCLSPPVSWDYRHLPI 
CLANFCI F\ VETGFHHVGQMGLBLLTSG 
YPPTS AP PDAE I TGVIHRTQP . 


7119 


21020 


A 


7177 


529 


217 


LLPRLE YSGMVIAHCGLQLLGSSQAI LG 
AC YCAW /HNFKAFFVETKSHYVTQAOLK 
LIAS S NP P ALAS OS VGI TG VSHCAQATL 
PFKTVDKGPSFNLFSFTPPLL 


7120 


21021 


A 


7178 


3 


150 


YTLCKS KKDIKS I YH IMT/ HLKTHTHTH 
THTHTHTHKPSTTFSYTETFIM 


7121 


21022 . 


A 


7179 


540 


69 


ACRGAHVQ VGDGS ALT YVE DGTAOG PTM 
LSLDHLCLPALAFNFSTCTDSGERRICS 
HHGVTAWSLGWREKLTRGDRPLLTSPGP 
ALPIG\ csnegkcicopdwtgkdcs ihn 

PLPTSPPTGETERYKGPSGTKIIIDS1A 
GAVLVAAIVLDBTGWGYK 




71071 


a 

ri- 


71 HC\ 


4S7 


970 


C I DS I YOE I YHLS I S I LS VYIjS I YRS I Y 
LSIYLPIFW\SLYLGPRFLRTRDPQLLS 
LT 


7123 


21024 

* 


A 


7181 


501 


183 


HVRCLAF I CAL I G F PSGYFS I MPWT/V 
DLVG I EHLPNAYG I I ICVNGI SALLGPP 
FAGWI YD I TOKYD FS F YICGLL YMIG I L 

A J«W"f J» X *■» X A ylv A 1/1 wii A i I *ir*nT I I 1 f X 1 1 A>^>7 X AJ 

FLLIQPCIRIIBQSRRKYMDGAHV 


7124 


21025 


A 


7182 


446 


245 


IiRVRS LALNQ VS SASRYWDTGVATKLAT 
F/SLFWKDRNLDILPSMVFHSWPQVIFP 
SWPPKVLGLQM 






A 


71 HI 


Aon 


71 
ID 


CDSLEDFKISLTGAWKNLI PPLMDDYBG 
FKTRVEEVTAD WE I ARELELEVBHGLG 
AVAHAYNPNTLRLRS LKPRSLRPGQHS K 
TLSL i 


7126 


21027 
* 


A 


7184 


567 


174 


FF LRDRVP S LTQAGVQ WQDHS S LQ PR P P 
RLKQSSH\ PALLSS WD\ YRHRPLCLAKF 
LIFLQRRGLAI/FARGWSNSWSO/CNL 
PASVSPSAG ITGBSHYSQPFLLSDSQTK 
VYKTHRSTISSTITEMGCERKD 


7127 


21028 


A 


7185 


345 




RNGSSCLWQVIQSLTLLPRLECSGVITA 
QCSLGLVGSSNPPTSASC\VESHCFAQD 
GLBLMGSSNPPTLAGITGMSHCAQPLYD 
LFLYEYGLSVHY 


7128 


21029 


A 


7186 


1 


209 


LSWLBIGSHSVTQAGVQWVHSSSLQP+ 
MLGLKPSSRFSLPKCWDYRHBPWCLPGL 
LRLKFDPSVGGVA 


7129 


21030 


A 


7187 


3 


225 


EKTPVSDRGAKGCTESLVNRRPCFSALE 
VDETYVP*EFNAETFTFHADICTLSDKV 
RHFLKP I AL VI LNHH I ? 






a 


71 RR 


i 
i 


177 

j /./ 


E S * RNHNGDEDDS H VRAOGAC I EAKHDK 
YMADMDELFSQADDNRKKRDIPDYLRGK 
ISFELMRQPCITPSGITYDRKDIEBHLH 
RVGHFDPVTRS PLTHEQLI PNLAMKEVI 
DAFISENGCVEDY 


7131 


21032 

- 


A 


7189 


1 


359 

* 


SRATDHVQRQGHG * G I HNYSDWENTDWF 
RETKNQS VPLSCCI ETVSNGNGS IiAHPY 
DL YAEGCEALVGKKLLEFMMHVI WAELA 
FAAI QLLSMLS AC I VMCRKSTDPAYELL 
ITVGTYA 


7132 


21033 


A 


7190 


496 


118 


QNQKFPFE PGTS * LGL Y PKKYKL I YHKD 
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Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
OGIutaroic Acid, F=Pbeny (alanine, 
G=GIyeine, H»Histidine, I^Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=G)utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, * sa S!top codon, /^possible 
nucleotide deletion, ^possible . 
nucleotide insertion 




■ 










PKTCMFTAALLTIEKPWNHHKHLSMADW 
IKKVWYGTVGHACNQSTFLFWFFPEMES 
HAVAQGGVQWLNLCSLQAPAWATERDSS 
QKRPKIDPHKYIVN 


7133 

- 

• 


21034 


A 

* 


7192 

* 

! 


1337 

■ • 


342 

■ 


ISRATRTFRES I WQFQSSPREDDRNRQL 
SASGSRQSHPGPRSTPGPGQGHRSSHQG 
CAPVSSAARPGGPS FRLSEGALSRSQFS 
VLVYI P * VAAGQFVKACLGAS S PP SRL * 
GGLGTGHRTPGKEARSWPETSAPPAWHL 
LRSLCLPSRGEKLAQARSQKHLGWKWLS 
KLCPKTKYSWSGPDDSPSVLRDPEAGTT 
RWAPS GNPEEEWLQLKAKYLQNI SEVL 
SVLHLLHIKALQLHSTNPFFVNARHLLV 
GQISSALTLNNLSQILFLLLRFPFLSQV 
ERHSP LQKTGTTETNLVL FQDTVPMNFP 
SS I LLP SKY S CPALDKEASHFYQ 


7134 


21035 


A 


7193 


469 


130 

• 


SASWDYRRPPPRPSNFLY* *RRGFTILA 
RMVSVS * PRDPPASASQSAGCAYS I YLQ 
ERFARLHPACLAPRYMASDPLSSLHHSV 
PPGLHLEHEHWLWHCLLPGOICRPGKLW ! 
Y 


7135 


21036 


A 


7194 


430 


149 


TVWQFLKQLNIELSCDPVILLLGICPRE 
LKTYVHTKTRT *AFI VALCTI VPKQKQL 
KYPS VDEWVI KM WYVHTMDYFS VI KKNK 
LAKKRGSHL 


7136 


21037 


A 


7195 


257 


412 


TATSNKEPLKKVRCLIWRKKDCWARGFT 
ACNPSTLGC*GRW I MRSRDRDHPG 


7137 


21038 


A 


7196 


1 


124 


PL ISGWHPCNPSTLGGRGWRI A* VQEF 
ETSLGELAEPCLY 


7138 


21039 


A 


7197 


2 


120 


YGI ILF ITSEVFVFAGFF* AFYHSSLGK 
KISKKESVIIY 


7139 


21040 


A 


7198 


234 


416 


GVHAEVLISPGPQDRLRALWIPIIFGI 
LFAI LLVLVF I KKVAKKPTN KVGHP * E P 
GTEF 


7140 


21041 


A 


7199 


372 


3 


VHYPVPQTGS PCCVPHPRGLSTIHPGLH 

s rytglgmvahacncstlggqsgkia* A 

LEME I SLGNMVR P C L YERTNKQS KPRHM 
GITFGLTFSLTLTDKYSLI isgfylpqv 
LISLHPLGPY 


7141 


21042 


A 


7200 


396 


21 


glkeshkccai paclqrdplvrvlgkag 
f*rkslsistelqslfdspdfskitgkp 
ikltqvehragfewnedgagttpspglq 
pahltfpldyhlnqpfifvlrdsdtgal 

XiFIGKILDPRGP 


7142 


21043 


A 


7201 


2 


148 


REPPHHSLANF* KFFVDKGFTLLPRLVS 
SSWPQWLPPRPPRELGLQA 


7143 


21044 


A 


7202 


1 


291 


AEGFRMHALVSQAACGTSGYKLSVGYMV 
QGPLTHLMIiGLL I SLI FLSRGSGRTWAF 
SHS*YKTSDLLPCRNRREVIEFIjHYRNL 
HSH I CLS VTKTFL 


7144 


21045 


A 


7203 


262 


406 


I LARMTS WNS IKHLILFLK* SSHLS I P 
SSWNYRCVLPHQANFLFLFL*R* sftlm 
PRL VETVLL I QN"S P TKIDSS PDDFSGKF 
YQTLNIVFENIRRGNTSNLLHKNSRILI 
PNLTKTLHD 


7145 


21046 


A 


7204 


2 


247 

- 


DTVLBRSGQGTAALPRS TSLDRD WAITF 
EQILASLLTEPALVKYFDKPECMMARIT 
S AKN I RMMGKP LT * ASD YE I SAMTG 
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NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

etii 

od 

■ 


SEQID 
NO: in 
USSN . 
09/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino add 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
fast amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I=lso)eutine, 
KeLysine, LHLeucine, M^Methionine, 
N=Asparagine, P=Proline» 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V^possible 
nucleotide insertion 


7146 


21047 


A 


7205 


2 


168 


FFFLVETGFTVLARMVS V5 * PRGLPASA 
SQSAGIAGVSHRAQPALRFLMLBNRMC 


.7147 


21048 


A 


7206 


1 


237 


GKCAHTHTETHTRDTHIUiAHAHTHACTH 
THTYVQGCSLSQWSQDCVRMWACHIFGV 
PRE *ASR YTS SL VYKPGCTGVOG 


7148 


21049 


A 


7207 


2 


239 


KVDSEHKLERSSGGVLGRFKGKGSGGLN 
LGNFFASRKGYSRTGFDRLNTEGSDQDK 
EDDGSESE * EYS * PMPALEPCSS 


7149 


21050 

* 


A 


7208 


1 


405 


GSRDNMNVILICFPNAPKVSAESVKNDA 
QLpKYLECTAQE 1 1 *NHWEDVPDIiFHVM 
RTLPSDNMPSIiPPGGELASMRTVTEAVY 
YRLNP.YINDDTDSTSTDDMWFNCSSSHG 
VYLDLHRRVQVNFDETFSMHPBIi 


7150 

- 


21051 

- 


A 


7209 

■ 


2 


761 

■ 


QDFGTRFATVLLFYGYFKCTRTLKGTCL 
YNAIL YKVCS PRNDQPDVCYNPS FFLLP 
IKTGELLGFLVYASREKRS IAIGD *KDD 
KWPPERI IQYYGPAT*AQDSSWGYRIPt 
YMLN* 1 I RLQAVL E I 1 TNKTGRALTVLA 
LQEIQMRNAIYKSRLALDYLLAAEGGIC 
GKFKMTNCCLHI DDQGQ I VEDI I KITKL 
AHVPMQI PDI VWKNIVKIPVLFCSVLIT 
GTCSPQSGTPCWLSRSRPSHVDPFRWS 
P 


7151 


21052 


A 


7210 


147 


390 


KTGLS DQMPRAI QLS YS YLS PLLY * * S Y 
KKKKKKKKKKKKKKKKNFFKKGPGN* SF 
WGGEKKSGPGAGLKNPGGGKNTFY 


7152 


21053 


A 


7211 


433 


214 


WGRGNAIiTIHPP6*VPFQESETTPSPL*G 
ITLLDNPFSPIPKQPPPPH*SITVPYI»L 
SSPQPGFPHPFQAPGLA 


7153 


21054 


A 


7212 


532 


253 


KWINVFHKAAIMSDFSDRSRHSTLKAfl 
E^GFTMIDAIKNICDL*KEAKISILTGI 
* KKS I PTLMDDF * GFKTPML* VTANVGF 
QRFSKKFYW 


7154 


21055 


A 


7213 


456. 


24 


DAQAFSWYSQGYFNSSQGFSISLPGDTS 
P VKI KRMGGPLFFS * LLKSTCWACSKTS 
RVPGKVLKTGLPRPR* PVYKDPLGGFYP 
RF 


7155 


21056 


A 


7214 


2 


445 


ILRDLCQRVPTWSGFPSWAMELLVEKAI 
SSASSPQSPGDALRTVFECISSGIILKG 
SPGLLDP CEKDP FDTIATMTDQHRED IT 
SSAQFALRIiLAFRQIHKVLGMDPLPQMI 
QRSNIHNNRKRRRDSDGVDGFEAEGK*D 
•KDYDTF 


7156 


21057 


A 


7215 


203 


3 


VRPGWAHACNPSTLGG * GGRIA* SQE F 
E TNLGNTVRP CRNS GE VLNRKTVS IRRA 
LRVFSPLHTA 


7157 


21058 


A' 


7216 


1 


285 


GKCAHTHTHTHTHTHTHTKMHTRTHAHT 
HTHM CRG AV* VRGARTVCT CG SAP FLE S 
PGKEPKDTLRPLFVSQAARAFIjGEKAFC 
SVYRQREPPPI 


7158 


21059 


A 


Till 


1 


386 


GDSNCEWSVLYVI I ATFG1GVALGILS W 
TVI CMLKRQNGKPKMKSKY KNLDATDQE 
SL+LKPASRAGIKQKGLLLSSSLMHSES 
KLDSDDAI FTWPDREKGKLLHGQHGS VL 
NGQTPLKATSPREEIIj 


7159 


21060 


A 


7218 


70 


379 


NILYFNIFFFFK*FLKFLWVFFFFFFF* 
VFFFYNFFFFVFFLCVFFFFFVKLKIFF 
WGPFFLFFRFFFCIjGFFGTFQSWRWFFA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
3&=Glutamic Acid, FMPhenylalanine, 
G^G iycine, HHHistidine, I^Isofeucine, 
K=Lysine, L=OLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *«Stop codon, /^possible 
o ucleotide deletion, \=poss ible 
nucleotide insertion 

M m mm mm m W w m mm W m m\ mm m mm mm m M 














LFPWGLMFLFPLLFFLVFF*LVFF 


7160 


21061 


A 


7219 

(ill/ 


i 

X 


208 


PRLAASVSPGDMLGMQVLGLGTVASACN 
CSTLGGRGMG IT * G * EFETSLVKKARTC 
FYKRLKRQKSANIi 


7161 


21062 

* * 


A 


7220 

■ 


455 


96 


HYLYGSQGLSRF YFVTQGRVQWHDHGS L 
QPPTPGVKQSSSLSLPKCWDYKCEPPCL 

m^m tm mm m» mm ^mm w m mtmm^mmw mm mmw mmm mm mmrmr mm. mm ■ ■ mm m* mm^mmm-mmm mt mmmmmw 

ARPCPFFFFL*TSSVDTGLLI»NQQPIL* 
IMTFPYPLLRKSLSFHHQIYEPTSICSH . 
LLSFCYT 


7162 


21063 


A 


7221 


201 


319 


IYEIiWPGPVAHACNFSILGG*GEQIAQV 
QEFKTSLGHLG 

mm mm m my mm mtm mm} ^m m mmmwmm 


7163 


21064 


A 


7222 


307 


3 


NRRQAGPFPAKLGRPSPKGGFPNFFKLF 
FFKSSF*KNPKGQGWGFPPLIPGFQGPQ 
VGGSLGAPGLKPPWGTPQNPFS KKKKKK 
RSNCTDTGTTIAHRRQM 


f 1W 


z IUOj 


A 
n 


1 LCD 


■ 


idi 


S ETI TANVS PNIi * DANDVP I OCE I S PL I 
SYAGEGLERYVADKEFHAPVI IDENGVH 
EWKNGI 

mm) V V mmmw » mm 


7165 


21066 


A 


7224 


157 


314 


WLPGFRDGGEDGVCHVAQAGPKLRT*GD 
PSIKASQRAGITGLTHRTGPK*KF 


7166 


21067 


A 


7225 


2 


392 


RDVRSLQPLPPSLK*FSCL»SLLSSWDYR 
CAPPTT * L T FT LLVEMGFHHIGOAGIjEIi 

LTSGDP P ASASQS AG VTGVSHRTRPPS F 
VLVNPIPPLLPPVPSPDGPNISSSTTSW 
RGYP EVLRWGLCS FWGCG 


7167 


21068 


A 


7226 


84 


204 . 


FIFFFFKR* SLTMLPRLVSNS WARVALP 
PQRPKVLGLHV 


7168 


21069 


A 


Till 


3 


307 


DRAAP RGAGRAGAVG PHRTG D WGLAGT P 
AGRGHARHIGRGPKQGRAAGQRGHLHPG 
GIiIiGRGGNSEP*RKQRTPGSMRRAAPLL 
STGQVWEGPASVGDRVP 


71.69 


21070 

■ 


A 

• 


7228 

■ 


52 

■ 

■ 


1176 

■ 


GSQQPLLHISEAQGDTLIKKAPAEAAWP 
PLIAFSSGGRLPCSHES*SAAGYKRLDD 
TAAGGPPS PGGS PRS PATAPGLHP VWGG 
QLRPRGGRGLPATGSGEAFQALTQIiSRG 

AGSSSCAIPDSAASCICPQGPPWRW*SR 
APARNRCSASSGRSRC*OLRRRRYI PGL 
LHPVLHEALPDSWGCETQLGPPLASEAA 
PPRSRRREEG*RGSGRC**KPSLPRSWS 
LCRNQAEAGFVS ILVWLSGLAGAPSRPA 
AAAPQIAPNQHRSPPPRIAFVPAASGSP 
WPWERDLGVIiETLNVGNFAQDPNPSACS 
GPQSTPDTYYPVQKPKTGLRLHFHSRPW 
T I VKS VGNMCL 


7170 


21071 


A 


7229 


3 


112 


ASNLGSRGCSEPRPCHCTPAWV*E*DSI 
SKTNKQKN 

mmm- mm mt mmm mm m mm m, -mm^ mm — 


7171 


21072 


A 


7230 


103 


299 


AMSQIRNAIPFTIAT*RL*YIiEIQLTKE 
AKDIYWENYKIPLKEIRDVINKWKNIPC 
S*IGAQLCK 


7172 


21073 


A 


7231 


2 


137 


SLTLSLSICFCL*RWGLSVLPRLISNSW 
PQAIFPPWPPKVLELQV 


7173 


21074 


A 


7232 


3 

* 


273 


LTHIVDGPNFLTTMYQCLMCVC*QELGY 
QPVTTERCHNVCNDCLQRS FKAQVFS CH 
ACRHDLGQITYIMIPlTEIMQTLIiDLAFPG 
YSLGR 


7174. 


21075 


A 


7233 


338 


136 


RPRWEDCLSPGVQNVCGQHSES PFLQEI 
* KI SQTWWCKP WPS YQEVEGEIiLE I RG 



992 



WO 01/64835 PCTYUS01/04927 

i ■ 



SEQID 
NO: of 
nucleotide 

• ■ mm m* m ^m* ^m 1 n mmr 

sequence 

'i 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
(09/515,1 
26 

• 


Predicted 

beginning 
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nucle- 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G^GIycine, H=Histidine, Msoleuclne, 
K=Lysine, Lr=Leuciae, M=Methionfne, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Ttareonine, V=Va!ine, 
W^ryptophan, Y=Tyrosine, 
X s Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














. STCFGLPKCWD 


7175 


21076 


A 


7234 


437 


261 


HLGFPECWDYRHEPQHPT*PLEHFM*RR 
GLTTFPRLVWNSWPQAIFPRWSPKVLEL 
QT 


7176 


21077 


A 


. 7235 


41 


264 


VTVMEIKGKINLVLDIIKDIYVEGFSTY 
YF* RQGLTMLPRLDLDS WAQAI LP PQPP 
E*MGLQECNHHAQLSRYF 


7177 


21078 


A 


7236 


i 


154 


GTRDFS VKTL * ARREWHD I FTGMKEKNF 
YPKKVYALKI PFKNEABTKKVBV 


7178 


21079 


A 


7237 


1 


179 


SCL»SLSSS*DCR* PPPCPANFVKSFFRN 
GVLTMLPRLVSNFLPQANLLAQPLSGFI 

ILL 


7179 


21080 


A 


7238 


340 . 


140 


KINWVWWPVPVIPVTPEA*AQKLP*PGK 
LKFPLT* IPPLPSRLGHQSQTSPPKKKK 
KFYILSSDSV 


71 RO 


x, Juo l 


A 


77-10 


A 


17S 


OFBSKPSPAWATEGDPVSKKKKKKKKKK 














KKKMKKKNSS KFKM I PFTKRQ Y # EGNWK 














N 


7181 


21082 


A 


7240 


336 


192 


QSETPSQSNQKFIF*GQGIALLPRLVLN 
SWAQAILPPWPPKMLGLQA 






A 


794.1 


l 


186 


DRPGQHGEMLSLLIFCFFCRDGGLTMLP 
RLVLKS * AQVILPSRPPKALGLQDTVHC 
SKFDAV 


7183 


21084 


A 


7242 


2 


111 


VKTLNKLGIDGTWLRIIRAl^*KPmfI 
ILNMCSFL 


71 JM 
/ lot 




A 


77A3 
/ — *t j 




717 


LPSRGDYKRKP P CLVNFYFL* R * GLTLL 
PRL I SNFWAQM I LL PQSPKC 


7185 


21086 


A 


7244 


35 


276 


VQQQRVKOGRKYYS IVLKNKKKLCNYLR 
VGE * MG * RNIG WVQ * VAHTCNPS TLGGR 














GGRI I * PWEFETSLCNIG*TCISG 
















7186 


21087 


A 


7245 


1 


356 

• 


IQHTFS LL1 FG I KYS YYVLKI VS WLAAV 
AHACNLSTLGGRGGWIT*GHELETNLFF 
AVKP WVYDKNS FP PTLNLRFAKS GHLWE 
PRGKNHGPPE TPPLSRVSP PQRFFPGGT 
RPLKKE 


7187 


21088 


A 


724* 


3 


293 


CKCX3KAFHNFYSFQNHEISHTGEMLYDI 
RNVGK*SIFF*HHCQHKTTHRAEKLCEC 
KICREAFSHFGNLKVHEAGRSGSRL * SO 
HLGWANCLTSGI 


7188 


21089 


A 


7247 

- 


6 


387 


AS I P CLTY PSWE FL * FI * QAGHFSFS KK 
KLFI CAFFSNRDEVSLMLPRLGLNSWAQ 
WLLPWPPKVLGLQVRATTPAKLGIFWG 
FMSVSLGQGVCLCRSLLCPLPLGPGQVH 
SGCSQS FVEKMNKEL 

*m^^ 


7189 


21090 


A 


7248 


373 


146 


HLGKKVLI FWPQYLALLNPPNGGVSRVY 
PRCRPLMVFFFFF FFC* GRGLAMLTRLV 
SNSWSQV I FQLQLP KVLGL 


7190 


21091 


A 


7249 


2 


184 


AILRG I * KKF I STLMDDFEG I KTSLEEV 
TAD WE IAREVE LEVE VEP KDVTELLQS 
HDTT 


7191 


21092 


A 


7250 


795 


544 


EMKSCSCCPGWVQWRDLGSLQPPPPRFK 
RSP AS AS LVAG I TAACHHARL I FVFL I D 
TGFHYVG* AGLKTPEA* ATAPGP I FLYF 


7192 


21093 


A 


7251 


34 


242 


KNTSLKKKFFWCQKLRNVSLVEEFAPWP 
GAVAHACNPS TLGDRGGRVA *GQE FQTS 
LSNIGRPRLYRKI 


7193 


21094 


A 


7252 


216 


57 


N IQVLKYVLTSYLKKKKCAFRLGP VAHT 
CNPGILGAQGRRIS*AQEFETSLS 
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SEQID 
NO: of 

npntirfe 

sequence 


M 

eth 
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nucleotide 
location 
correspond 
in a to first 

amino acid 
residue of 
nentide 
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sequence 
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Predict- 
ed end 

niirlpw 
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location 

Wl 1 VJUVH 

ding to 
last amino 

arid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamlc Acid. ^Phenylalanine. 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N s AsDArflf ine. P=ProHne. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Trvntonhan» Y=Tvroslne. 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7194 


21095 


A 


7253 


126 

• 


366 


INLTECQQYLDKAIFFFFLKKGLFAP*A 
EWOGGSFASLO PKPPRLKKSSHLTLLST 
WDYGGALPCRANFGIFFFWKKGG 


7195 


21096 


A 


7254 


2 


213 


RPTWATWQNPASTKKKKNS FFFETE FRT 
VAQAEVQGGNLG*LQSLPPGLKRPSCLK 
MPEAPKGOSCSLIG 


7196 


21097 


A 


7255 


150 


263 


EGVSLLPRLACNGAIIAHCNLRLPTSSD 
LCAFS *TTTRYYKYSGYCFSHPNLMLIC 

HS CPGWRAMVLS WLTATFASRRQATFAP 
SLELSR 


7 I y/ 


o f noo 




/ZOO 


380 




P PfifJT . D Dn Q T P <? PP AH r.TfP If <Z P Pf3 TTtTPfl 

GPKPGGSQPGFWGFFFGPGPPFKRGGGIi 
NWVLGAKN* KGVKKNQKKKKGKMPQETK 
jwjjs* iiji ui'jrvc vjo 0 


7198 


21099 


A 


7257 


153 


369 


KENSRLQGYGPLFSHS* FTNSCPGQVQW 
LTPVILALWEAEVGGSKDQPGQHGKTPR 
LLKIQKFARCGGGRL 


n i on 

7199 


21 100 


TV 


7258 


1 OA " 




GRIA*GQVFKTSLGNNVKTCLFLPSPHN 
QQ 




21 lUl 






£2 




nT.p a t pnT T.npT .MP^n t t.qr tt.t/^ c;n 
y ij rot x jt uu ivxj u tr jjfic<o x xjo h. 1 jj xaj 0 v» 

TVAPICDPSTLRGQGGWIA*VQEFKTSL 

GNMAKPHLYRKKKLLGQVTKLEYKW 


7201 


21102 


A 


7260 


326 


.2 


KS AMQ I C I VD PLKAFCLE I CCHANLHCR 

KNFSW?GWAHTCNSNTVRGQGERIA*A 
QB FETI LGNTGRP CLYFLKNNRKS 






71 


/zol 


i 
1 


Oo 


\7NT , C V T f .WT JCi PR W2XH 71 PWP <3 * fifiT? CZd 

RIA* P* ELETCLSNTA* PGWAHACNPS 
T 


7203 


21104 


A 


7262 


282 


176 


GLKKTF FFLVRERVLLCYPGW SAW* S Q 

HVAEN I C * CHMP WP ATWKAEAGEQLE P 
RSSVCSEIi 


7204 


21105 


A 


7263 


588 


351 


NNTVCLLES ISSVNLCLPHLQHSPVKLE 
I FLFFFS SDNFPSTHH I TVRPLSPS * AI 
IFPVTCTYTSRWPEATKDPOKK 


7205 


21106 


A 


7264 


327 


2 


KIKERINNFIFICRDRVSICCPGWNAVV 
TLNSWAQAIRFCLSLMSS*NYRCAPTHQ 
QVTTIFFCPJ)QIEIHMVPKYHLGNTLNY 
FFVLFCFVLFLRHP FSAHRNFRLP 


7206 


21107 


A . 


7265 


121 


270 


AQEAGFELLTSGDLPTSASQSAEITGMS 
HRALPSCY* FYVTIiHGLTVACA 


7207 


21108 


A 


7266 


3 


227 


SSQVSLPNNWDYRREPPCLRFVFLFFVE 
TRSCTVASLELLGSGDPTVTiAS *SARVT 

X Auv«l V /Ijj liD iiJJVjiJ w L/x A. V lino JrU\ V X 

I»4GHCDWPNLSEILIiHVR 


7208 


21109 


A 


7267 


300 


1 


KMLLNTIMCNREILCKRKSPRVQQTSFV 
SYPKKLPQPPQPSATTTLISQQP3TSKQ 
DPPAAK* L* FAEGRLLAFPSNNVLYQLR 
YIMCFLPHTG I VHLVN 


7209 


21110 


*T 


7268 


29 


•244 
j 1 'i 


ASLLALKKKSGLGAVAHAHNASTSGGRG 
GRIT+AQEFATSLGNMARTCLYKKKKKK 
KKKGKNFTGVGGPQRGAPI FGPWGGGKS 
PAPGKPGGPNPFPGKGAPTLF 


7210 


21111 


A 


7269 


3 


209 


LHPLSPGPPAPVAAGIPGPQASR*GPCR 
PDPSPDSASVDASLPGPALSLWSWGSNR 
GEYGTTPASQIH 
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SEQ ID 
NO: of 
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M 

etti 
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SEQ ID 
NO: in 
USSN. 
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26 
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nucleotide 

location 

correspond 

ing to first 

amino acid 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoleuciDe, 
K=Lysine, L^Leucine, M^Metfakmine, 
N=Asparagine, P=Prollne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, * ss Stop codon, /-possible 
nucleotide deletion, ^possible 
nucleotide insertion 


7211 


21112 


A 


7270 


86 


206 


. cqpgwahaynpstwggrgrqit*gqbf 
etslgntarp£l 


7212 


21113 


X 


7271 


45 


274 


GUGTVAOACDNPS TLGG *GGWIA* GQBF 
KTINNDKSNKWQYLAI IEQINNRMLICK 
NKKELTDICNNTDEPQKHRB . 


7213 


21114 

♦ 


A 


7272 


1 


122 


VLVETGYLRVGQAGLKLLTPGDPPALAS 
QSAGITGI SHRA* SYMHF* LVKTGYLRV 
GQAGLKLLTPGDPPALASQSAGITGISH 
RA 


7214 


21115 


A 


7273 


3 


242 


LLKQLDYRPPHPAKFFFFLEFLVKKGFH 
LWKGGFKTQTSGGLPG * ASKOVGI PGV 
SHCPGLIGVFNLTLKVAVSLMDL 


7215 


21116 


A 


7274 


2 


169 


LALVAQAGGQWHDLGS LQPLS PS FPQPF 
YLSLP S S * DYRHE P PHSAWFS F FSNYR 1 


7216 


21117 


A 


7275 


353 


76 


FAHLGLPQCWD YRREPLCAT* KLFLKGK 
* PSKFQVI FSSLGTGHFSNACFS CVMWT 
IVLLIINFWVTGIDGRTSGSSSVKCX3NH 
SIYPMVLL 


7217 


21118 


A 


7276 


6 


87 


RRGLTLLPRLGSNSWPQAILLPWPPKVL 
KL*AQTPGLRQSSCLGLPKC 


7218 


21119 


A 


7277 


1 


203 

• • 


FQVKMLCS GWMWLTP VIQALGAVAHTGN 
PSTVGGRGGQII*S * * FYTSLDSGNPPA 
S ASQS AG I TLA 


7219 


21120 

* 


A 




272 


51 


IFFPVN* CGGHDVQHHFNPS *NNLSSWS 
SW5WPI ILPLWEA* AGK*LEPRSLRPAW 
ATKLDSNS KKKI IE I HA 


7220 . 


21121 


A 




1 


131 


S RDYRHGP LHPTDFVF FVEMRFHHVAQA 
GLKLLGLSDLPALT8*SRDYRHGPIjHPT 
DFVFF VEMR FHHVAQAGLKLLGLSDLPA 
LTS 


7221 


21122 


A 


7280 


411 


36 


FLECVPNFFPPKKKFFFKKNTGQVFFPV 
TFL*KGPALFFFIFPFYGKKFFFLSSAK 
DFPPRVFF* KGPLFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFCFKHFIYKKVHF 
♦SSVHFQPIIFF 


7222 


21123 


A 


7282 


395 


181 


RKHES IHHDSVISVTFNMQSLRLFFFIF 
FILSL*LFRFIFFFSFFLVFFFFFFCFF 
FFFFFFFFFFFFFFFFF 


7223 

• 


21124 


A 


7283 

■ 


392 


2 


I KFFGYNV* FNYI LKNGLF*QLLI * LIN 
LFILRQGLIFWQVCNVLVFFFFFFIFF 
FFLFFFFFFFFFFLLFFFFFFFFFFFFF 
FFFFFFFFFFFFPFFFFFLTQNMLFIVN 
FCKDTFKQQNIHLRTRGR 


7224 


21125 


A . 


7284 


113 


291 


YVCVCV CVC VCVC VME Y* S AI KRDKIVA 
FTVTWMELETIILSEVSQEWBTTHHMLS 
LIN 


7225 


21126 


A 


7285 


327 


44 


LIKVSPNFSPPKKKVLPKKPPCTIFPPS 
PF* KTPPEPIFIKPPHKKKNFNSPPPQK 
♦GPPQGFFKSPPPFFFFFFFFFFFFFFF 
LGFSSKLLIR 


7226 


21127 


A 


7286 


3 


215 


VILIIFMI*EAFAWKRKVLIVEEPSINL 
E * LYDGG YYDGLVLTSGLGPY 


7227 


21128 


A 


7287 


367 


119 


LKKAPAPPFLNPPFWGKNFILSPPEKMG 
PPRGFFKRGPPFFFFFFFFFFFFFFFLG 
FFFFFFFPFFLQYWNMYFH*ILIINQ 


7228 


21129 


A 


7288 


73 


259 


INKYLQN* KKKKKKKKKKKKKKKKKGGP 
FKKTPRGAQ I S PGVKKKNFPPKGG FKKK 
QAPGPF 
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SEQW 
NO: of 
nucleotide 
sequence 

■ 


S£QU) 
NO: of 
peptide 
sequence 

* 


M 
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od 


SEQID 
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USSN 
09/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
. peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

dins to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alan(ne 
OCysteine, iNAspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K»Lysine, L»Leucine, M»Methronine, . 
N=Asparagiae, P=Proline, 
Q=Glutamine, R=Arginlne, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=$top codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


7229 

9 ****** 


21130 


A 


7289 


337 

mm mP ~ 


53 


LRSFAPFHLPINNLLTDIHMVIPLPPSS 
LCSHYLFNBV^VTTLFKTTTHLQLGTMA 
• HTCNPNTLGG * GGRS A* AQESBTSPGNM 
AKPRLYLLNF 


7230 


21131 


A 


. 7290 


1 


96 


KHHFGFEAAA* YWHFVDW*IiPLYVS I Y 
**QS 


7231 


,21132 


A 


7291 


288 


2 


SRVEFKGPFMAHCSVTIPGSSYVFTSAS 
* VAGPHGARHHAC* FLF* FLV*MLLBMG 
SRHWQAGL*RVSSSDPPARACQSGGI P 
GVKPHAQSVSV 


7232 


21133 


A 


7292 


121 


233 


YFPKKKKKKKKKKKKKKKKKKKKRKKK* 
KGQGRSRKSYLG 


7233 


21134 


A 


7293 


3 


346 


NPLNPLVNLrVS PKRNSSLDTRKKPCRE 
RHREKETLRKKRNQHTEARGRQRENHPE 
TE* ETGORDRDKQTKKOKDTDIQAYTYT 
DREKRDKKRHRQIR*AEAERIKDTERQR 
DI 


7234 


21135 


A 


7294 


46 


346 


ETGIiKKWGDLSNGTVRAVLNLAQADFCV 
S I PSYHIKHGNTKNNCX3GEGKKKILVVT 
AKKSEPGVVTHACNPNTIjGG*GGRVT*G 
QWFKTGLGNIVRPCST 












12 


RKDAENNGS RAB * I KRVKDLVQS KLKFF 
QELEKEGILPNPFYEASITLIPKPDKDT 
SKEDYRPISLMNIDAKILNKILAN 


7236 


21137 


A 


7296 


258 


1 


CIGQCVLVAVGREIiljCVSRSWCGLLHVD 
EVGGEALGRLLWYPWTQRFSESFGDLS 
TR DAVMGNP KVKAHGK * VLGAFS DGLAL 
LD 


7237 


21138 


A 


7297 


191 


263 


VCSKCE P F I YIMD YYAAI KRNKI MS FVA 
RWVQLEWSELTQEQKTENKMFS*WFG 
FLLLC 


7238 


21139 


A 


7298 


2 


164 


SRVAG I TGAHHHAQL I FVFLVETGFHHV 
GQAAFKIiLTSNHHARPQ*IiHFFYFFK 


7239 


21140 


A 


7299 


307 


8 


VMI ITFWLPHLSGYIEKSTPYEOGFDPI 
SADRVPFS I KFFLVAI TFLLCDLEKALL 
LPLP * ALQT FNtiPLI VMSSLLL 1 1 ILAL 
SMAYE*LIiXGLDCSE 


7240 


21141 


A 


7300 


205 


317 


KSSPACTPSTLGG*GGWFA*VQEFETSL 
DNMVKPQLY 


7241 


21142 


A 


7301 


140 


22 


FNFLFLYP *NYFFFFFFFFFFFFFFFFL 
FFFAMNINVFY 


7242 


21143 


A 


7302 


382 

mf VJ<£» 


151 


SIHRSEFRPRRSHSVSRPESRGI ILVYC 
SLYFIiGSGDSAFWS * VAGTTGVCPWAS 
LIPGFFFFFFNRDGVLIiSCFG 


7243 


21144 


A 


7303 


318 


17 


L V PTKR P Q VSNY Y P VHTYI WYCT KGKRK 
WAGIGGSCLSSQHFGRPSWQDPLSLGV+ 
DQ PGQHRETPSLQK* KNWWQBP I VPATQ 
EAEAGEWCEPGRQSLQ 


7244 


21145 


A 


7304 


310 


132 


QWCDLGATTPANFCI FFVRNGVS PCWSG 
WS * PPDLKLSARLHFQSVFFNPI I SIGL 
FFL 


7245 


21146 


A 


7305 


338 


168 


RE PPLPAANFVFFVEMRSHHVRQAGPEP 
PSSSDIiPASASQSAGITGVHCHAWP**T 
YK 


7246 


21147 


A 


7306 


276 


161 


I INAAI KKNEFMSFAGTWMKLEAI ILSK 
LTQELKVK* KLRKFNTDTILLLLLI FKI 
F 


7247 


21148 


A 


7307 


310 


61 


ITLVYCNFFYFINFNFLKLFDSFVI ILS 
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peptide 
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Predict- 
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nucle- 
otide 
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correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=G)ycine, H=Histidine, I=Isoleucine, 
K— Lysine, L=Leucine, M=Methionine, 
N=>Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Scrine, 
^Threonine, V=Valine, 
W=Tryptopban, Y^Tyroslne, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, ^possible 
nucleotide insertion 




• 










LKQIYSCTAIVYILIL*AFVQFKFLCFY 
FLYFKNIFIYKNIFYKIKYV*NMYFYF 


7248 


21149 


A 


7308 


257 


1 


CX5RVDLH*AVFFFFKRWSLTVLPSLDLD 
SWAQVILLP*PPE*LGQQSCTTPPSSLS 
TFLYAFLHIPVNKMSCFVGKDRGMFFSY 
H 


7249 


21150 


A 


7309 


2 


181 


RDGFCHTGHTGL * LLTSGDPPASASQSA 
GITGVSHHARPCLHSLTAFFKNWMVWPG 
QQE 


7250 

< 


21151 


A 


7310 

i 


25 


340 


FCVFNTMDVKNI * IFLIFVPDVHLG*QK 
NLLLDTI*KKKSIRIQE* *QYKSISKRN 
YNTVS DKS WLG TVAHACNP S TbGGQGGR 
TA* TQBFQTSLDNTVRPCVYY 


7251 


21152 


A 


7311 


2 


193 


FVDFSMHILDMLSLYLSIVICYITLFLS 
IIDNI YVNGPHKI IIPYFYCIKYIFYIY 
I WFCFWFFETVLLCCFS VFTFND YCS YL 
HY FLLLGVFGL FHF F * FL YL YMVL FLVF 


7252 


21153 


A 


7312 


] 


245 


RTPGIiKQSSHFSIiLSSWDYRHVPLHPAN 
AFFFFKGP*SEPLTLASSLHWYFPGGHF 
LQG P AKI DDPNFS P I LGKLKKKS PF 


7253 


21154 


A 


7313 


167 


2 


GKPCLLVVMCRDSQKCGEKGTGLKREAL 
RMGK* GQARWLTP Vt PALWELEAGGS P 


7254 


21155 


A 


7314 


334 


152 


CHVAQAG VQW YNHS S LQ P * PPGLR* FSH 
LSLPSGWNHSWTPPHPANFFIMGSCHLP 
LINK 


7255 


21156 


A 


7315 


2 


164 


PHVGQDGLELLTS GDPPAS AFQS AE ITG 
VSHHARPTFYLS * QVGLTVTS FRNPQ 


7256 


21157 


A 


7316 


155 


9 


QRIKH* FHWLGAVI LTCNPSTLGGQGGW 
ITGGQKFETSLAYMATPRIiY 


7257 


21158 


A 


7317 


1 


293 


GIPSTWDYWCTSPQPHF*FFFFFF+*KR 
DLAMLPGGVLNSWIPAILQPKGPKVLGL 
QAWTIIjYKGYYGKEMWHKEICKGTHDVL 
SKLLTTKRFKRLL 


7258 


21159 


A 


7318 


262 


73 


SNSI»VYLI KPDSVY I FLLFVETVSSHVA 
QVGLELVASSSTPI*ASQSVEITGVSHH 
ARPKKVT 


7259 


21160 


A 


7319 


169 


265 


GGRMALSPGRGGCSEP*GCHCTPAWBTE 
QNPA 


7260 


21161 


A 


7320 


169 


260 


KPQKI PVACDQQFTDKRNCSHGDYECNM 
VDFFFFFFRERVSLCHPGWSAW*SGDH 
TTALQPG 


7261 


21162 


A 


7321 


580 

* 


400 


" FFFFERESRSVARQPGVQWRDLSSIjKSP 
PPGFKRFPCXSLPSSWDYRRAQPCPANV 
FEFW* RWGFTTLARI VIiLS * PCDPPTSA 
SQSAGITGVSHCAQP * FFFFERESRSVA 
RQPGVQWRDLSSLKSPPPGFKRFPCLSL 
PSSWDYRRAQPCPANVFEFW 


7262 


21163 


A 


7322 

# **** 


130 


26 


SPGPLFGLIEPLPSRFKGFFWLNLP*TW 
EYRLALTAPIKFFKFLIKTGFPH*GIKL 
RG 


72& 


21164 


A 


7323 


17 


303 


KSSGTIKGRAQSPFI^QLVAIEQCQDCFH 
FRSQF* YLKKQLQPGMMMHAYNPfi YSKG 
*SRRIT*AQEFKSRLGNITKLCCLKINN 
SLIFKITENIE 


7264 


21165 


A 


7324 


194 


8 


DWVEVLKKSTYRPDTVAHACNPSTLGG* 
GGSIT*GQKFEIRLGNWGPCLKKNLTN 
QU3VGG 


7265 


21166 


A 


7325 


413 


72 


GVP I FLKTLTGKT I S SERES KVPLEKGK 



997 



WO 01/64835 



PCTYUS01/04927 

» • 



SEQIB * 
NO: of 
nucleotide 
sequence 

'i 
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SEQ1D 
NO: of 
peptide 
sequence 


M 

eth 
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SEQID 
NO: in 
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26 
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beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide , 
sequence 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Htstidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Prolioe, 
Q=Glutaraine, R=Arginine, S=Serine, 
^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


• 


• 






• 


• 


ANFRDKRGI P PHHKRVI FAEKKL * KGRS 
PFDDGFQKKLTQPRVSAFKGHPPLQPLY 
AFIKSFREGTHHQDCPFWHLTLLSARRQ 
Q 


7266 


21167 


A 


7326 

» 


394 


177 


NVNHTHTHTHTHTHTHTHGWCKSDTGYS 
SDS* CITPRCIHDPTWPSSSQTVNIiAMG 
. LTLAKKVLANMTQTEV 


7267 


21168 


A * 


7327 


232 


22 


SRPSGSKKGAPALQQSSCLSFLSSWDYR 
HKP P YLTNFCI FL * SQS FPMLP RLLSNS 
QPHSNLPKWELQV 


7268 


21169 

• 


A 


7328 


232 


3 


YHRSSGSLISLIQIRLR£KI*APSIPII 
KEWLSSVAHVYNPNTLGGQGGRLT*GQE 
FETSLQNIARPCLHENCLKN 


7269 


21170 


A 


7329 


364 


3 


PKARRATVRRTR VS KAPSKRQCLGVNTH 
VLGHRRPVKGNAKTRWSASESQAHTCNI 
TTLGGQGGLIT*DREFESSLATMQNPVS 
VSGWIiQKMFS KQKLYSKFFPNVAYKAFR 
IWLLCARA 


7270 


21171 


A 


7330 


177 


2 


DPHLHKKFKARCGGMTL*SQLLRRLR* K 
D QLS I/SVGGC FELL WCYCTPAWVTERD P 
SC 


7271 


21172 


A 


7331 


413 


40 


RHMYLMF YTY I YVLD I YT YTVFFMYI Y I 
P*CVCKYISCLHIMYRFISMYPCVYVYV 
CMCVCNVIHVIVENAGSCLALSALIEME 
GMARDSKMWTFNPDEKMRLSYKKQLEKL 
NFMILCPTWKEF 


7272 


21173 


A 


7332 


147 


49 


KNYIYFCRRGLPMLPRLGLNSRAQEILL 
PQP PM * LGLNSRAQE I LLPQP PM 


7273 


21174 


A 


7333 


148 


361 


S WLGTVAHAYNPS TLGGRGRQ IT* VQE F 
ETSQSNIARPLLYSVLAYAVQPPLPMVA 
MRKLWLIPTSRITAL 


7274 


21175 


A 


7334 


305 


2 


CPGGFVTRVHR* YVRLSQPDLRVFTSGA 
MPAS AAQSAGITGMSHRARPVCS 1 1 IFL 
NIFKVTCI FVELTI PNQNKQNT FLS STV 
ASCRILASRPTPIWVL 


7275 


21176 


A 


7335 


1415 


1164 


FFLRQSLLYCPGWSWAQSQLIASSITQ 
A*AILPPCSWDYSSQHTWLKFFFC*DGF 
SMLPRLVSGSWAQMVLXiPQLPKAQAKL 


7276 


21177 


A 


7336 


1 


206 


GTRADHLMSG FQDQPGGHTETLS LLKXQ 
RSQAWWHVPVMLPTR* IDARBSLERLMT 
IALLKVRNRYIF 


7277 


21178 


A 


7337 


65 


302 


KSSLFYFLKIBTGFHYVAQTGL*LLSSD 
DPPASVFQSAGITSMSHHTQPKBLLKST 
RNTLEE PNREWAKAREWAHRRNT 


7278 

> 


21179 


A 


7338 


208 


38 


NPVSAKNEEYPRGTFDLGTQAEVQWHDL 
GSLQPTPPGLK*FS*LSPRV 


7279 

• 


21180 

■ 


A 


7339 


2 


198 


ARADHPRS GVRNHPGQHGETPVS TKNTK 
N * AWWRVPLVPECWD YILFYMFTWCMDN 
KLPKWLTLC 


7280 


21181 


A 


7340 


2 


227 


ARGEIMS FAATWMQLEANILSKLTQE* K 
AKYCMFS L L S GAKHWVLMNI KVGTGDTR 
DYLGCSTQAVLKNLGTRAB 


7281 


21182 


A 


7341 


3 


300 


HEGRNYTLTGRDSCTLAASAEKACGACP 
LWGKCDAESSKCVCREASDCEEEGISIC 
VDVNCKEQTMPEC *AVAL*CTVHLI YVT 
INNALCLPQLTLFL 


7282 


21183 


A 


7342 


210 


351 


RG LGT VAHACNLNT FGG * GRW 1 1 * AHEF 
ETSLRNITKP2LRENPDML 
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peptide 

sequence 


Predict- 
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nucle- 
otide 
location 
correspon 
ding to 
last amino 
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residue of 
peptide . 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartie Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H==Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N a Asparagine, P»=Proline, 
Q=G 1 u ta m in e» R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


7283 


21184 


A 


7343 


306 


164 


IYMLYFIYITLIYIHTHTHTHIiYUDKYI 
YL * CVHCFL * * AYMI CHQR 


7284 


21185 


A 


7344 


442 


8 


SQMSKLKTWFRAVAHAWNPSTLGGQGRR 
I T* VRE PETSLGNTAR PHL YWN 


7285 


21186 


A 


7345 


3 


248 


HELSLSLLLSPFFFKNEKGSR*VAHAGL 
KLLTSSDPPT*ASQSARITGISHCARPL 
LSHFLAYLAIjPCHQNRIiCrGHLPST 


7286 


21187 


A 


7346 


2 


246 


HELYQS ILIiSPIiFFKNDIGSR* I AHAGL 
QLLT* SDP PT* AS QS ARITGI SHCARPL 
LSHCLAYLALTCHQNRLCTGHLPST 


. / AO/ 


21 IRK 


A 


7347 

1 


1 
1 


130 


GTRLLKRLTQETRIiNPGGRGCREPSLHY 
CTPTWARQSQTVAHN* KRLTQETRLNPG 
GRGCREPSLHYCTPTWAROSOTVAHN 

^B^ ^ vvb* vbbBf ^b ^# bbBp* *B ■> ^BB" bb bb bb < Iff bbb v W ™g B> ™ BB ■§ BV ™ 


7288 


21189 


A 


7348 


516 


288 


FFETESRFITQTGVQWCDLGPLQPLPPG 
FKGFSCLSLLSS * D YRCLPPHPANFCI F 

M MW^BT VBr MB/ BMMfhr*' BMW BB • ^ B*VB% #H> » ■ V BpB w ■» ^B> BBl A> 

SRDGVSPC*LGWSQTPDLK 


7289 

/ AO/ 


21190 


A 


7349 


417 


148 


STSVWOAGVOWHDHGLLLPGPPRLK*S 

BMW BP %jb» ~ W ■ It* J»^B» w ■ " ♦ B Bm» A * %Bf *jB>I*— B>faaT BB & BB BjB* 

SHLSLPHG*DYRHILSCLANF*NIFVEA 
GFRHTAHGLEPLGS SDLPASASQSAGIT 
GMSHLA 


7290 


21191 


A 


7350 


143 


2 


TQKFKTSLSSWDYRRVSAHVANFCIFSK 
DRVSP Y* PGWSRTPD^ARA 


7291 


21192 


A 


7351 


189 


37 


EAIAVRRI I *NLFFFFFF*RWGLTVLPK 

BB^V B>4^B^ ^ V ^ v^k B ^ V Atf ^B *B ^B ^B ^B B> » w W BpBlV ^B V BBB*BB> 4B ■ 

LVSNSWAQAXLSPQPSKVLRLQA 


7292 


21193 


A 


7352 


131 


2 


KS RLGTVAHA.CN PS TLGGRKGWIT*SRE 
FKTS RANMVKPR PRA 


7293 


21194 


A 


7353 


1 


106 


GTRGQCLTMLPRLVLNYWAQM ILPPWPP 
♦VLGLQV 


7294 


21195 


A 


7354 

■ 


939 

1 


653 


KGNFI FLRQS FTLVAQAGVQWRDLGSPQ 
PPPPGS K* FSCLSLRSSWDYRHGPPHPA 

^B BB ^B ^B W A ^ ^B ^V^^H^BV BBB V^Br B^ * W JBr ^B W ^BB^B ^B ^» ^4B^B Bl B 

NFAFLVEMGFLHVGQAGLELPT*GDPPT 
SASQSAGITGR 


7295 


21196 


A 


7355 


270 


26 


EARFLGLLRDI AQLCYPGWSAAV* S *LT 
AASTSWAQVKASTHLSLPNN*D*ARATM 
PG*LKKYVFF*RQGLAILPRLRLVF 


7296 


2H97 


A 


7356 


329 


206 


HHLANF * TFCRDRGLTMLARLNS WPOAI 
LLPQPPKVLGLQM 


7297 


21198 


A 


7357 


128 


253 


MRPDTVAHTCNPSTLGGQGGRIT+THEF 
ETSLGDMMKPYLYK 


7298 


21199 


A 


7358 


2 


108 


AREGSGGQGCSEP * WRHCTAAWVTEQDP 
VSKTIKK 


'7590 

* 


51200 

A t«Uv 


A 

*• 


7359 


1 
1 


3dO 


GTRERNWHLA*AIjHTHTHTHTHRHTHTH 

^0 ± b\IJbi»i f I4AMT« * ■ ■ «V 4a bV «V«> ^ 14 4k JV«bb»«b>A 4» *» «V A A 

THFSFPANS * RDTFVI F*QSLLLE I MTS 
FSPSLGLKCS FSQQPSGLLSS IAGERDS 
RIHGENTVSMCLHNRLKAELRSDVISLR 
V 


7300 


21201 


A 


7360 


385 


1 


SVGPSKPCAGYNLLVCHLLRPLEKRSIR 
VGVT* FSRCHLSPLSLTRKANSLTPCTS 
WMRRCLILLRLTLGVPHPMSCTHCPTLP 
S EMNLG PQLE MQKS P I F CVAHAG S PRLE 
LFLFGHLGSTPNLPSC 


7301 


21202 


A 


7361 


234 


29 


LSTEPKGS KALKFNVTLLLNIAF* CNSG 

CNSNIj*S*QQY*RVIQSHHLSQTLWLTP 
VI PAIiWBAEAGG 


7302 


21203 


A 


7362 


68 


339 


DRD I ANKPAWQLCQSRI LYLAQLQFI DQ " 
GELLPDFRIHYKVI*TCTAMH*RRNRLI 
DQWNRAD I PDLNP CQANRLTP V I LVLWE 
ADVGGS 
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Amino acid sequence (A=»Alanine 
OCysteine, D=Aspartic Acid, 
E>Glutamlc Acid, F=Phenylalanine, 
G=GIycine, H-Histidine, I=Isoleudne, 
K=Lysine, JL^Leucine, [^Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R«=Arginine,,S Ba Serlne, 
T=Threonine, V=Valine, 
W=Tryptopnan, Y^yroslne, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, \= possible 
nucleotide insertion 


7303 


21204 


A 


7363 


168 


2 


CFYSTHAHIFKELRQRPGAVAHAYNLST 
LGGRGGRLT*GQKPBTSLANMVKPHSC 


7304 

• 


21205 . 


A 


7364 

■ 

i 


399 


62 


QPGQYGKHPVLI KNSKIKPFWGDPPWP 
NARKG * A* KMVE P6KVR VQSAQI KALE F 
NLGPKKKVPFKKKNPWLAKGGGLRPBIP 
AFGKASKGGSPRSRILRPPGFLPYPAHF 


.7305 


21206 


A 


7365 


289 


76 


VCDVLRESRAQIiSRLKCSGI I *AHCNVS 
IAGSSNCHASAIiRVPaSTOVWVESEROD 
VDRFEAYGSRGYVCA 


7306 

• 


21207 

• 


A 

• 


7366 

- 


364 


182 

■ 


Q WHDHCNFKL LGS SNP PTLAP * VAGTA6 
TCHCAWLI * FLTWKIGFKTSLRI IVLLG 
YFWF 


7307 


21208 


A 


7367 


155 


2 


ELGTKREGWEKSATVAHACNPSSLGGRG 
RWIT*GPEFPTSMANMMKAHPRA 


7308 


21209 


A 


7368 


342 


184 


LSLPS S WDRRYTP PRLANFFKPFFKR* S 
L1MLPRLYRPFYITTHQTLNSAMTP 


7309 


21210 


A 


7369 


2 


211 


ARBPKCL*TDEWINKMQYIDAVEYHLAI 
KLNE I LTHATT WMNLENMLI GRS *TQKA 
TCCVIPFMDSIYV 


7310 


21211 


A 


7370 


190 


344 


YKNVTVKRILTWPSTVAHACNPSTLGGQ 
GGRFG*GQEFETSLGKIARPCLY 


7311 


21212 


A 


7371 


24 

* 


254 


KPQGIGAHASSQHYGRPRREDHLSLGV* 
DQPGQHGE X P SLQKKKKKGLPGRGGVCL 
KPWFLKILGEKNVWVLTPKG 


7312 


21213 


A 


7372 


43 


358 


D SNLNYS Ir FFHGEAD LGTTQ VLTHP S TT 
AMYFVHYCQSP * I LYGTINT* PPWHKN 
PIHIKTPSPCIiQASTAINIjQLSHINCNS 
KATPHPLGYQQTYPPLTVHST 


7313 

■ 


21214 


X 


7373 


2 


288 


IHI I LAFTSS LLG I LVYRSHLIASLLCL 
HGIILSLTIIAPLIPLNTHSLLAPIGPI 
AILVI AA* QAAVGLALLVS ISTTYQLNY 
VHNLALLQCCN 


7314 


21215 


A 


7374 


335 


1 


VQWTOFGSIiQNPPPGVSPFFWLSIiPNNIj 
GFKGPPPRPGPFLKF * * KPGFSALTRMV 
SLSRPRDSAPLAPPKAGVSRLNPRARAP 
FFFFFVFCFHFFLLSGEKKSLAHKGGI 


7315 


21216 


A 


7375 


399 


91 


RIFFFPPSLKAPPHRQKPTPFPPTKKPL 
VFSIPPIFPPKFLKKNPFSPWEVFLFLS 
FFQKDAFFGPQV* KGNFFFFKGPPPLTQ 
NSPQGVPPKKKKKKKKKKN 


7316 


21217 


A 


7376 


377 


131 

« 


YLYIKFYNFIJILFYIFIFNIF*FNMIF 
NLLFYIFYYILINFNILIILIINFFYFI 
LLIILNTILILYIIIYYFIIKNTIKT 


7317 


21218 


A 


7377 


2 


156 


FHHW*AGLELLGSSNPPTWAFQSAETT 
GVSHHAQLPHFILRKQELLSLGL 


7318 


21219 

■ 


A 


7378 


319 


59 

• 


DTKMNSQDILYSIQSNRT*W*LIKRQ*T 
KQKSAPRNGNEVP * THQHSTATTLISQQ 
PSTSKQDPPAAK* L* FAEGRLLAFPSNN 
VF 


7319 


21220 


A 


7379 


308 


12 


FSPT JSNFNYKSNPSPLIYLFQ* PITQL 
I PTNISTNTPLNL* FYLRLI YSTSISLL 
PISNNVKIK*QFEHTKPTPFLPTLIALT 
TLLLPISPPILIIL 


7320 


21221 


A 


7380 


11 


169 


RRDLAMVPRLFS S S PGLK* S LHLG FPE H 
WDYSHEPPSWLLFFLFLITEGYFF 


7321 


21222 


A 


7381 


300 


2 


KFFPPPGFFFKGFFS PL* SHQKKTQI FF 
GEFFWGFKKFFLFFFPPGGFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLFFFF 
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Amino acid sequence (A 8 Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutarofc Acid, ^Phenylalanine, 
G^Glycine, H=Histidine, I=IsoIeucine, 
K-Lysuie, L==Leucine, M«Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R«Arginine, S^Serine, 
T=Threonine, V=VaIioe, 
W=Tryptophan, Y=TyrosIne, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














FF+INSLFIQNSFIiT 


7322 

- 


21223 


A 


7382 

- 


163 

■ 


160 


PGTSNDE I QRRNSRNR I SP I AER*GLS L 
LPRLE WMGSNEAHSNLELQGS SNP PASA 
SKSTMITGVSHCA* PRLEWMGSNEAHSN 
LELQGSSNPPASASKSTMITGVSHCA 


7323 


21224 


A 


7383 


1 


183 


VCVCVCVCVCVCIKWDSYSAIKKNELVA 
FWTWKRLETI ILSEVSQE*NTI YHMSS 
LIC6S 


7324 

< 


21225 


A 


7384 


2 


196 


K I QM WPG VVAHAC YPNTLGGQDGRTA* G 
QEFQTSLGNIARPCLYQQQQQQQQPRCS 
LTYDGDKS 


7325 


21226 


A 

V 


7385 


3 


192 


RNLSSLQPPPPGLK*WSCLSLPSSWDYR 
RPP PCPANCFMLCEFFRTNGVFPCS PGV 
YFEPRN 


7326 


21227 


A 


7386 


266 


198 


TMLS GFLRSAGLTM IS* PHDP PASASQS 
AGITGVSHRARPLVLFLRDSLYSGTQAG 
VPWRDLSSLQSRLAEQMRFCFP 


7327 


21228 


A 


7387 


164 


311 


SKQQKGLSGVAHPCNPS ILGGRGRWIN+ 
GQEFETSMASMVKPCLSLLKL 


7328 


21229 


A 


7388 


3 


218 


EIMSFAATWMQLEANILSTLSHE*IAKY 
CMLSLLSGAKHWVLMNIEVGTGDTRDYL 
GCTYPSSTEQLGYYS' 


7329 

* 


21230 


A 


7389 


521 


58 

* 


SDCGLTPIAQTGVHWHHLSSLQP*LPWL 
G* FSHLSLPSSWDYKYVPPCPASFLYYF 
LVETGFCHVAQDGIiELLDSNNPPSSASQ 
SAGI TGMSGCAGLRRF* KLRSWRYLVLL 
AMQVNEQGTSQAGEAVSEADAGQGCCCG 
LCXjKRILSQQYALP 


7330. ' 


21231 


A 


7390 


169 


322 


APGNFKTN*RKNGFWAGNTLGVSVKNNK 
NGWAWWLTSVIPALWEAEAGGSP 


7331 

• 


21232 

■ 


A 


7391 


1 

• 


353 


GTSPHYGSFHGHQKDVTGKGSQSRPRER 
VLGSHARKNS LPVHEVKAYLL* KYLS IL 
RATHILISPDGCWVPIFIAQSYIFLSSC 
GFFLLFFIiLLIiTLS FCSWHLI IS FIALF 
VIPLY 


7332 

i j 


21233 


A 


7392 


372 


3 


IQAGACCAECIPSKGIRWNPS IVEGLHS 
DMSWSLCLFLSGAISVLCNFRLSGSSDS 
PMS A*QAARTAVIRHHTRI»I FC I FSRDR 
ETGFHHVGQENLLMLIaLQLFSGLGFSIS 
ATLLTLMLLV 


7333 


21234 




7393 


362 


\ii 


ARMECSGAI * AGLRGS SDSLASAS *EAG 
YTGKCGHAQ* TFVFLVEVRFHHVGQNAI 
NIRTSL 


7334 

» 


21235 


A 


7394 


2 


304 


ARADCCARNE*GSVGMGPSBPGVGYNLIj 
VFRFLRPLEKHS IRVGETRFSRCCLSQL 
CLARTGNSLTPCTSVn^CLALLRLMLG 
ALHPLSCTQRPTSHSE 


7335 


21236 


A 


7395 


256 


3 


M YYAAR YWNKAK I TTE S LREWAGHGGSH 
PYNPSTLGG*GGQIS*AHKFETSLGNMV 
KPHLS KNKQTNKQTNKQTN I S WS WCTCL 


7336 


21237 


A 


7396 


346 


2 


RDHFYRESHSLITYICLENVKSNLKCGE 
GPLNFTFHLQRKNCNLGHTRS PGGVYHV 
GRTETATGP I KRRNGLGAVAHTSNPSNL 
GGQGGWIT*GQEFKTSLSLPRWLTPVIP 
LV 


7337 


21238 


A 


7397 


2 


242 


RRYSD Y PDAYTT *NI LSS VGS FI SLTAV 
ILIIFMI *EA*KKK3CKKKKKKKKKGGPF 
KKTLGGPKFNRRMEGKNFSLKGGR 
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Amino acid sequence (A=Alanine 
C^Cysteine, D=Aspartic Acid, 
E^Glutamic Acid. FHPhenvlalanine. 
G=Glycine, H=Histidine, I=IsoleucJne, 
K-Lysine, L= Leu cine, M=Meth ion in e, . 
N=Asparagine, P=Proline, 
0=Glutamine, R=Areinine, S=Serine. 
T=Threonine, V-Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion \=nossJhl£ 

nucleotide insertion 


7338 


21239 


A 


7398 


1- 


289 


TKLLGLYYTLLQASNDFKS PFS I FDGI Y 
GSTFFVTTGEHGLHVIIGSTILTI*PIR 
QLI FHFTSKHHFGLEADA* YWHLADARR 
HSESITHQCCSL 


7339 


21240 


A 


7399 


346 


40 


III FTNKI Y I KINKRIiKLF 1 1 KI LDLY I 
I* IFFFRK*FTFKQ*KF* IININKIYYN 
NIK* * IFIYLFFFFFFFFFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLKK 


7340 


21241 


A 


7400 


1 . 


334 


NS CRRQRLIjDLLLL p itl* lpql igyi e 
KSTR YE CGFHP I S PARVPFS I KFFLVAI 
TFTiLFDLiOIATiTiT.P LP * ALOTTMIiPTjT V 
MSSLLLIIIIiALNIiAYE*LPKGLD*AB 


7341 


21242 


A 


7401 


2 


302 


HTINLLGERYVLATTY* * STMTLLLTGL 

TVLHTGLYS\7YIiMTTTOMG*t*TP?ITTNM 
a v jjnxvjjjxij v iiu ii, i lynu u if nA "j 

KP*FTR* DTLVF I HLS P I LLLSLNPD I I 
TGWLVLIiSYBLYLLT 


7342 


21243 


A 


7402 


57 


338 


JTTHNVIGYIEASTAYESGCDPISPARVP 
FSIKFFLVAITFLLFDId3IALLLPLP*A 

uy i luiiruivjiwuiiiiui iiiNuiounxo 

LQKGLD'AE 


7343 


21244 


A 


7403 


407 


68 


QEELVDPLTTVREQCEQLENCVKAR*RL 
*LCDERVSSRSHSEEDCSEDLFDFLHAR 
DHCVFL I L YINLKYMCELT 


TXAA 




A 




AQA 


I 


Cr.T7TP*VT.Vr.TWT.T7# * TfWWVVlTVWT .X CT 

our x r ijjx uir jjr rr cm rrx n iLai 
FSFKLFFFFLLHFFFHMFFFLFFSPPNK 
I FFPKI PPRFFLFPLF* *KFFFFLPFKF 
LSPLGFFF*SPPFFFFFFFFFFFFFFFF 

F 


7345 


21246 


A 


7405 


1 


400 


NPGRFLS TSNS S L YERTRE I RPTSQSAF 
PRK* YHLKKKKiCKKKKKKKKKKKKKKKK 
KKRGPIjKKECPRGGOK*TGGSKKKFFPK1? 

GGKKKPPGKF*KKTFFWGGKKMGKTPQK 
KINPLGKKKI FKGKRGKTPPL 




71547 

X lit / 


A 




*> 


i. j / 


RASLCHPGWSAW* SWLIAALTSWLKOF 
SHLNSLKCWDYRHEPPHLA*LRFEHRHS 
GSS I CTLNR* ILLS FP I LS I RYKNGLIL 
Y 


7347 


21248 


A 


7407 


259 


47 


YVFFFFLFLLFFIRLQSVSPFFFFFFFF 
FFFFFFFFFFFFFFFVK*SFIEMFSFW 
LYLLRQVS I S IAYTLFG 


7348 


21249 


A 


7408 


3 


248 


VTFL * CDVE IAVL I PLP*ALPTTNLAL I 
DMSSLLLI I ILALRLAYE*LRGGLD*AE 

mr% ih/WaMMJA rite riWrikJ* brikflJk^tfHw 9>rib aril AMVWoMr WrikV 

EHQATHSAYERAP PL 


7349 


21250 


A 

* 


7409 


2 


350 


RSGMPRRYSDYPDAYTT*NILGSVGAFI 
CLTAGILI I FMI * EAFAS KRKVLI VEQP 
SINLE*LYGCPPPYHTFEEPVYIKSRHI 
RKESNP PKLVSS Q PHGLHDFFKKKLGTI 
TEKN 


7350 


21251 


A 


7410 


2 ' 


232 


LDQTYAKIHFTIVFIGVDLTLLPQHFLG 
LSGMPRRYSDYPDAYTT*NILS S VGSFI 
ALTAAILIIFMI*EAFAKKF 


7351 


21252 


A 


7411 


2 


321 


TSLLAVLLI IMTL* LPQLNRYIEKSTPY 
E CGFDP I CPGRVP FS I KFFLVAITFLLF 
DLBTALLLPLiP* SLQTTNLPLIAMS SLL 
LI I ILDLNLAYE * LQKGLD*AE 


7352 


21253 


A 


7412 


2 


194 


S I ECFLVP I TLLLCDLE IALLL PLP * AL 
QTSNLPLI VMSSLLL III LALNLAYE * L 
QKGLD*AD 
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peptide 
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Predict- 
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nucle- ' 
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peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GJutamine, R=Arginine, S=Serine, 
T=»Threonine, V=Valinej 
W=Tryptophan, Y^Tyrosine, , 
X=Unknown, *=£top codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7353 


21254 


A 


7413 


1 


197 


LYTTYF VTKTLLLTS LFL*GRTAYPRLR 
YDQLIHLL* KNFLPLTLALLI * HVS IP I 
TISSIPPQT 




21255 


A 


7414 


i 


185 


STPLP * ALQTTNLPLI VMSSLLLI I ILA 
LS LAYE * LQ KG LD * AE KKKKGAAVLKD P 
SGGPS 


/ j j j 


21256 


A 


7415 


347 


228 


LE * L YGC P P P YHTFEE PVY I KSRLKRKE 
SNPPKLVSSQPHGLHDFFKHIKNYRKNK 
RKEGLAS VKENDRG P 


7356 


21257 


A 


7416 


1 


296 


HAYDHAS AQLNGY I EQSTP YECGFDP I S 
RARVPFCI KVFLVAITFLLFDLEIALLL 
PLP *ALQTANLPLIGMSSLLLI I ILALS 
LAYE*LHKGLD*AE 


7^S7 




A 


7417 




202 


TCSI *HVIiVRMT* L I YDLEI ALL* PVP * 
ALQSTNLPIilSMTSliLLI 1 1 LALSMAYE 
*LQTGLD*AE 


7358 

- 


21259 


A 


7418 


1 


321 

■ 


SNTLLALVLITITS *LPQLSGCVGECTP 
YECGICRILNARALCCMGIFLKAITFLL 
FDLEIALLLPLP*ALQTTNLPLIDMSSL 
LLII ILALS LAYE* LQKGLD *AE 


7359 


21260 


A 


7419 


1 


212 


RTRGI IFRPLSQSLY*LLAANLVILT* I 
GGQPVS YPFTI IGQVAYVLYFTTILILI 
PTICLIENKILKWA 


7360 


21261 


A 


7420 


2 


257 


GR VGGRVG YS IAFFLQAITVL VLDLESA 
LLLALP*SLQTTNLSLIVMSSLLLIIIL 
ALSLAYE * L PLGLD * AEYWGEMMTTTLD 
A 


/Jul 


21269 


A 


7421 


367 

JU / 


269 


HCDLKLLGSSDPPASAS * VAWITGTCHH 
TKLS 


7362 

f JO* 




A 


7422 


2 

mt 


116 


ALQTTNLPL I VMSS LLLI 1 1 LALRLAYE 
* LQKGLD *AE 


7363 


21264 


A 


7423 


2 


336 


KPECGREG I LPCCS S S AWPEGS FRP FQM 
NLGSFLSFFFLFFFFFEK3LFLCPPV*S 
PYGKSNLLEPSASRFKQFFCLGPLNNWD 
YRLFPPFPGNFGVFKKNRVFPFWPSLV 


7364 


21265 


A 


7424 


3 


154 


LFL*IRTAYPRFRYDQLIHLL*ENFLPL 
TVALLI*HVSIPITISSIPPQT 


7365 


2P66 


A 


7425 


3 


154 


LFL* IRTAYPRFRYDQLIHLL * ENFLPL 
TLALLI*HVSIPITISSIPPQT 


7366 


21267 


A 


7426 


1 


314 


INTLLTLLLIIITL*LPQLNGYIN*STP 
YECGFDP I S PARVP FS I KFFL IAITFLL 
FDLEIALLLPLP * ALQATNL PL I DMPSL 
LL 1 1 ILALS LAYE * LHQGLY 


7367 


21268 


A 


7427 


1 


331 


ILIINTLLALLLI I ITF*LPQLNGYIEK 
STP YECG FD PISPARVP FS IKFFLVAIT 
FLLFDLE I ALLLPLP * ALQTTNLPLI VM 
TSLLLI I IIiALRLDYE* LQPGLD*AD 


7368 


21269 


A 


7428 


2 


373 


SDRNHI IHALVITILLGLYFTLLQASEY 
FESAFTI SDGI YGSTFFVATGFHGLHW 
IGSTFLT I CFIRQL I FHFTSKHHFGFEA 
AA* YWHIVDVAGLFLHVSI YRGGS YNHQ 
IIPSVAQESTPA 


7369 


21270 


A 


7429 


378 


63 


PSFFPPFPP*NFFFPPRPLFSPGGFPPF 
FP PPQKG PPPKI P PGFFFS PPFWEKFFF 
PPPPLILPPPRFFFNPPPPFFFFFFFFF 
FFFFFFFFFFSFFFWDFLGSGC 


7370 


21271 


A 


7430 


1 


109 


QTTNIiPL I VMSSLLLI I ILAIjSLAYE*L 
QKGLD*AE 
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NO: of 
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sequence 
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sequence 


M 
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od 
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USSN 
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26 

- 


Predicted 
beginning 
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residue of 
. peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
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dine to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIantne 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=lso leu cine, 
K~Lysine, L=Leucine, M=Metnk>nine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serlne, 
T=Threonine, V«Valine, 
W»Tryptopha n , Y=Tyrosine, 
X=Unknown, *=StoD codon, /possible 
nucleotide deletion, ^possible 
nucleotide insertion 


7371 


21272 

- 


A 

■ 


7431 


358 


290 


. FLL VI PNFLFKFL I YI 1 1 S 1 VpttVSl FFfi( 
IIYILSIFFPFIIFILLLSSFNFILFFF 
FITSFPFFYFFFFFFFFFFFFFFFPFFF 
FFFFSFFFFFFF* FFFTN 


7372 


21273 


A 


, 7432 


1 


95 


PTRP PTRPHNS LQL* TPGLKQS YPRLPS 
SWD*RHMPSHPANFRKFF*RWHLYCPGW 
S*TPGLKQSYPRLPSSWD 


7373 


21274 


a 


7433 


1202 


870 


HNIFKIRPHCRLTHRGKPFYWGLLVTPV 
FSPSLRPSFLPSFLPSFLPS*LPSFLPS 
FLLFLS YS FCPLS FLCLRCI PHSAFFWG 
NPRQVQENCVIVTIYRYLSFTALHQL 


7374 


21275 . 


A 


7434 


163 


I 


S VYVKR I G LGR VA YTCN P S TVGGQDG * I 
T*GQEEETIIiAKRREPLFLACFLFLI 


7375 


21276 


A 


7435 


148 


362 


NDDNSVEFGIRtiHKGLGTVAHTYKPSTL 
GGQGGHIA* AQE FMTSLGHMT* S SLYHK 
LTEYGWRDTASLIiMC 


7376 


21277 


A 


7436 


61 


233 


GGDLRHW P S SDS FFLSLS FF F * NKNGGG 
IiALL PRLDLNS WSQAI LPLKPPNALGLQ 

A' 


7377 


21278 

* 


A 


7437 


44 


242 


FVCLFLFFFFEKESTFQHDLSSLQPPLP 
RFKRFSCLSLPSTWDYRIiAPQCLANF* I 
F**KSGFTFG 


7378 


21279 


A 


7438 


J J J7 


300 


YCVSLLI FFI FI I YI FILLI FKFLFITL 
N YI S FLFFYF *H * F YF*KN F I FVFPFGY 
FPRP 


7379 


21280 


A 


7439 


1 


221 


LKLQHHDPTTISLVITRNWKQPKCPNVQ 
QIVH*SAIKRNELLIHAKTWMNLRGIML 
SEICQPOKILYLEI PFI 


7380 

* 


21281 


A 


7440 


35 


345 


FSFCCVCVCVCVSLPKHAI IEMLNQTHP 
YVPKGVAYLPEREPFIVPMDPELTA*YE 
DYRTHESAQEPSEPHQLSRRSLPH*PGH 
ATRDHS PPTDS WGKRRTSH 


7381 


21282 


A 


7441 


3 


384 


VIHL* AGAVI IGSRSK3TNALAHFLRQG 
TPTPVILVLGIIETMNLLIQPVALAERL 
TDN I TAGHLLMHL I VS ATLAI LT INLTS 
TLIMFTVLILLTI LQIAVALMHAYVFAL 
LVCLYLHDDAYDHGG 


7382 


21283 


A 


7442 


15 
i j 


277 


GtiAILPRLVPNSWPKVILGVWPPKMLG* 
QA 


7383 

• 


21284 

■ 


A 


7443 


21 

• 


694 


NGGLNAHliASAS E FDHSGVQL I EREEE I 
CIFYEKINIQEKMKLNGEIEIHLLEEKI 
QFLKMKI AEKDRQI CVTQKLL PAKRSLD 
ADLAVLQ I QFSQCTDR I KDLEKQFVKPD 
GENRARFLPGKDLTEK3MIQKLDKLELQ 
LAKKEEKLLEKDFIYEQVSRLTDRLCSK 
TSGCKQDTLLFTKKMNGYSRR I KNATEK 
MMALCA*LSMKQAPTIELQKEVQGERRP 


7384 


21285 


A 


7444 


141 


428 


DRS LLEAGDGR iagd FFFF* TGTNFVAQ 
AKVQWGIHSSLHP*PPGLKQSSYLNriiG 
SYHHWLRPHALFFFLKNRGSLFCPGWFG 
TGFRGTTFWGAK 


7385 


21286 


A 


7445 


35 


343 


GYS I IGCPKFLDEKI FDSGHEKERKINS 
LLF* KKKKKKKKKKKKKKKKKRGGLLKK 
TSGGPKI KRGGGIKNFP I KGGVKKTSGE 
FFGKKPFFGGEKFWEILP 


7386 


21287 


A" 


7446 


403 


193 


EYIKQGAE*NNQPKLHAHRKRLTWKETP 
RC IHEG IAPAIMNSDCLVFDTS I AQLFA 
ENGNLGINVTISMC 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

#h 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- * 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 
sea lie nee 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
(XJIycine, H=Histid/ne, Msoleucine, 
K=Lysine, L=Leu cine, M=Meth Ion ine, 
N=AsparagIne, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V»Valinei 
W=Tryptophan, Y«Tyrosine, . 
X=Un known, *«Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7387 


21288 


A 


7447 


411 


110 


LTDSLINIVYR* KMSY IS *TNSA* VYLY 
IFCIYIYIYIYIYRRIYSIYRC*YIYYG 
PAHIVLVYS SSYI LVYYE SHKYGY IMGL 
NILSYIFIYSYFSACS 


7388 


21289 


A 


7448 


1 


380 


PTRPQKLKKNCNGVSNGDLCFFGGGQFL 
IWGFFFFFCILIEGGLNSKNYELKSLIK 
TYGAKKAE I EHTEN* KNRETLLERAAQL 
AIKVTOPNAMIiRTVEHDYAMYMHNFWLN 
KLNNCHKVAISCKQ 


7389 


21290 


A 


7449 


200 


437 


VQSNNNKNFKRRRL* PGWAHTCNPSTL 
GGQARRIS * AQELETGLNNLMRPCNYNK 
FINVLDDVHGVTSPVSQYYKKVC 










WD 


•a 
j 


PPRFFGFFFPFYPIjKFFFFPJCGFNFFttfS 

FFfclFFPPKKKFFFKNSPGGFFLPPFLG 
KI FFFLPPFKFGPPRGFF*RAPLFFFFF 

AX «U X *> * X7 XT AVJL \J * CVVJ X £ £%fX&T X X X X 

FFFFFFFFFFFFFFFFFFFFFLQRINFT 
ARPGTPLSRAVPLRGALPPSA 


7391 


21292 


A 


7451 


2 


348 


PRVRPRVRYKLQTKRCDLTLPS * GHSHV 
STiRNTTFDAIjKS * DTnST.Ti^ P KT iECHGV 
NTPNCNLRLPSLSDYPASLSRKAWIAST 
RLRAWE I FVFLAQMRQD YCMNPGGGGCN 

ELK " : 


7392 


21293 


A 


7452 


1 


168 


LVHDGHAGLKLLTSGDRPA* AYQSAG I T 
GVRPLGPPTYYYFRTAS FMTFICEFATV 


7393 


21294 


A 


7453 


184 


260 


SSRGGSLASRSSR* STLLGLPKCWDYRC 
MPPSLANFFFFNF* I *NFCFW * RRS LGM 
LPRLWNS *MQAILPLRIRLQGIHFAGT 
QTFSL 


/ jyt 


7170^ 


A 


1A^A 




(LA 


FFFFFNFF*KKKT * DVFAKT FF5CTFFFP 
x x 1 1 x xi x x i\i\Avx xv v x «vi\x x x ivx i £ f r 

SGLKIFWGGGFQNAPPQKKFFF*KIKSF 
FFIiNFFF*KNIFFFVAGWFFLLNFFFR 
GAPLFFFFFFFFFFFFFFFFFFFFFFFF 

mJAftAi flift A ft ft A; A ft ft ft ft AAAA>AAAA A » A 4- JL 

FFFFFFFFFFFFF 


7395 


21296 


A 


7455 


368 


2 


SQSFGRPRWENRLGPEQHIKISRYRPAG 
VHACHPSTLGGRGRWIT*GQEFKGSTHL 
GLPKYWDYMPEPLLLVYIQCFHFRTFCN 
KAAIHCFKPEQLLYNKTSYFKESYKVKA 
ALVACKPTRP 


7396 

t ^ *r \J 

* 

i 


21297 


A 


7456 


2 


183 


TERDS VSKNKYI Y IHTHTHTHTYICVHG 
VIYVYI *SLLLI YNIAYILI YHVFY ILL 
IFKA 


7397 


21298 


A 


7457 


281 


127 


YNFC I FM * RCX3FNNVGQSGLKVLLRG YS 
PASASQIAGITGVSQCAQPVAYS 


7398 


21299 


A 


7458 


2 


194 


TSVRKYF* YP FT I TDRI YGSTWWATGL 
HGLHVI IGSTFLTICLIRQLI FHFTSYH 
HFGS EAAG * YWHFVEPARLFL YVCIY * * 
GSTWVYATGLHGLHVI IGSTFLTICLIR 
OLIFHFTSYHHFGSEAAQ 


7399 1 


21300 


A 


7459 


22 

■ 


319 


GGS VTGAYHRI LDHDRNQIFLA* GGS VT 
GA YHR I LDHDRNQ I FLALL I SMLLGLYF 
TLLQASKYFECPFTICDGIYGSTFFVAT 
GFHGLHVI IGSTFLTICWIRQLILHFTS 
RHHFGLEAAA*YWHFVDW*LFLYVSIY 
* *GSVTGAYHRILDHDRNQIFLAIiLISM 
LLGLYFTLLQASKYFECPFTICDGIYGS 
TFFVATGFHGLHVI IGSTFLTICWIRQL 
I LHFTS RHHFGLEAAA 


7400 


21301 


A 


7460 


1' 


222 


TILLGLYYTLLQA*EYFKATFTIYDGIY 
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SEQID 
NO: of 
nucieoiiae 
sequence 


SEQID 
NO: of 
pepriae 
sequence 


M, 
etb 

00 

• 


SEQID 
NO: in 

09/515,1 
26 

• 

■ 


Predicted 

beginoing 

□ucicuiiue 

location 

correspond 

ing to first 

amino acio 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing u> 
last amino 
acid 

resiQue oi 
peptide 


Amino acid sequence (A»Alanine 

C=Cysteine, D=Aspartic Acid, 

jL^vviucaraic acio, r— >jt nenyiaianine, 

G=Glycine, H=Histidine, ]=lsoleueine, 

K-Lysine, L=Leucine, M=Metb ionioe, 

N=Asparagine, P=ProIine, 

v — vviuiaroine, i\ B A.rginine, t>™ocrine, 

T=Threonine, V=Valine, 

W=Tryptophan, Y=Tyroslne, 

a— un Known, — otop cooon, / = p05SiDie 

nucleotide deletion, V=possible 

r% (|a|aa4i/4a SncAip^iAn 














GSTFFAATGFHGLHDIIGSAFLTICFIR 
QLIFHFTSEHHFGFEAAA*YWHFVDAT* 
LCMYVSI Y* *GSTFFAATGFHGLHDIIG 
S AFLT I CFIRQL I FHFTS EHH FGFEAAA 


7401 


21302 

• 


A 


.7461 


3 


246 


LNYS L F FHGEANLGTTQVLTHPS TTAMY 
r VxiXuyirir w Xljiv»l JLBil "IrFVVilKJNFXK 
KKKKKKKKKKKKKKKKKKKRGGGF 


7402 


21303 


A 


7462 


2 


155 


SRSRAALLLPLP*ALQTTNIiPLIVMSSL 
IxLIIIIjALjSIjAYE*IjQKGIjD*AE 


7403 


21304 

■ 


A . 


7463 


1 


136 


PTRPAPSHLLY CKNI KKLFPSGTANBAL 
I IADI FLRCKKKKFKVK* *VLKASBE*N 
GG KHLHGMH * NS AVCTVFWLKQ I MNS D * 
**KVNTQ*KKKKKitl<J<KKKKKKKKIU^ 
KKKKKKKKKKKKKKGGENNK* KNXKKLF 
PSGTANEALI IADI FLRCKKKKFKVK 


7404 


21305 


A 


7464 


166 


192 


NIN*MWYMHTVECYSDFKRKEILIHA5f^ 
WMNLED IMVSB I SQSQKDKY CM I LHI E Y 
KLYIHM 


7405 


21306 


A 


7465 

♦ 


416 


55 


LiFFLRLGYFFIiPNI RVFFFL5TLFFFFF 
VWGFF FFGRFFFFFPPPQEVCFFFI F* G 
I FFFFLFK* KNMFFFFFFLWGPPPFFFF 
SGAPLFFFFFFFFFFFFFFFFFFFFFLR 
KANTKKMF 


7406 


21307 


A 


7466 


A S\ f 

295 


84 


I WGNVTLLQLVS CL FCFL FLH * G FFCCC 
CCLFLFLFVCLFVC WFLNLKPS I KCLLP 

COT OT73r , IVT7TDG017 


7407 


21308 


A 


7467 


142 


258 


PGTVAHTCNPSTAGGQGGRI P*GHEFKT 
NLANMVKLHLY 


7408 

- 


21309 


A 

• 


7468 


3 


192 


PSEYFESPFTI SDGI YGSTFFVATGFHG 

r UtfTTOPiPPr TTPPTOOT T LIU UMIggtJTTTJ 

JjHVX XGSTrXiTXCr XRUXilFarxSKHHF 
GFEAAA* YWHFVDW* LFLYVS I Y* *GS 

♦PT7T7\ra.TV217H'flT.t-n7T Tn.QT l T7T. r T 1 Tr , T7TT3rkT. 

ir c VAiurnUiinv x luoiriiiiuciKyL 
I FHFTS KHHFG FEAAA 


7409 


21310 


A 


7469 


2 


no 


GRVGKHHFGFEAAA* YWHFVDW* LFL Y 
VSIY**GS 


7410 


21311 


A 


7470 


380 


8 


T PQKRKKLKKAGEGVFPP P PKKTTPPP P 

DTWT VT?T17I?T?T?I?T Vt?I7T7T?T71?T71?l?riVMCT7 

f ir X X ri r errc iir rrrrclf r ryinor 
FYSHIMIQRHFIiNF*ESFYSCISILQ*L 
FKFFSFLGGLIKL 


7411 


21312 


TV 

A 


7471 . 


1 


107 


D'TTDT5ijTj'C'r*i?i?n7\a+vwiJir\rri , \ri7#T.WT v^r 
flKFrtlir V?r X WnrvlJVV w iJirXixV 

Oil VJO 


7412 

• 


21313 

■ 


A 

■ 


7472 


2 


360 


VNTLIiALLLIMITF*LPQLNGYIEKSTP 
YFTYTf.TiP T S PAP VP PS T TfT .FT .VAT T17T .T. 

LDLE I ALLLPLP * ALQTTNLPLRAMSSL 
LWI I LALGLAYE * LORGLD * AE WLHSI* 
TQPTNKW 


7413 


21314 


A 


7473 


423 


167 


VKVHKCFLPGVAPHAFNPSTLGGRGGRI 
A*AQEFKTSLGNIVRPPSDTCNPIVliGL 
QA 


7414 


21315 


A 


7474 


1 


73 


I KPE * YFLFAYT I LRS VPNKLGGV 


7415 


21316 


A 


7475 


131 


3 


IS WVGAEWLTRVI PAL* EAESGVSRGQE 
I ET I LANTVKPDAW 


7416 

• 


21317 

* 


A 


7476 


2 


103 


PRVRSTGFS S I AM TRDVNYG * I IRYLH 
ANGAK 


7417 


2131$ 


A 


7477 


2 


325 


NTI.T.AT»TiIiIIITF*LPQliIGYIEKSTPY 
ECGFDPISPARVPFSIKFFLVAITFLLF 
DLE IALLLPLP * ALQT TNLPLI VMSSLL 
L 1 1 1 LALS LAYE * LQ KGLTERDT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

ejth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G)ycine, H=Histidine, I=Isoleucine, 
K=LysIne, L=Leucine, M=Metbionine, 
N=Asparngine, P-Proline, 
Q=Glutomine, R-Arginine, S^Serine, 
T=Threonine, V^Valine, 
W«Tryptophan, Y=Tyrosine, . 
X=Unknown, *=S»top codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


7418 


21319 


A 


7478 


423 

• 


170 


FFSFFFFFFFFFFFFFEIDKNVYLIM*C 
TNPQEKRNLLFCFLVMVSPVCITPHTYI 
CVCVCVIYIYIYIYIYTYIYPFIFSCCD 


7419 

i 


21320 . 


A 


7479 

• 


433 

• 


50 


RVFFFFFFTPEKKTFFPQE I YLFFSPFP 
L*NFFFSPKPLFFLGGFAPNFPPPKKVF 
FLKFPPGFFFSPPFKKKFFFFFPRFFFA 
PPRFFF*GPPS FFFFFFFFFFFFFFFFL 
TVKFTRG FRGFCGQ I 


7420 


21321 


A 


7480 


401 


337 


FFFFFGKFFFPQEFTVFFPPFPFKIFFF 
SLRLLFFWGGFAQFFPPQK*VFFSKFPR 
WFFFPPLFRKKFFFFFPG* FLAPQGFFL 
KGPPPFFFFFFFFFFFFFFFRPI*VGGR 
VGNNPNPNTNYLPN 


7421 


21322 


A 


7481 


1 


84 


KNFLPLTLALLI*HVSIPITISSIPPQT 


7422 


21323 


A 


7482 


562 


412 


LFSFHVSGTMAHTCNPSTLTG*GRIT*G 
LEFKTSLGNTVRSHRYRKKKIA 


7423 


21324 


A 


7483 


12 


233 


DI FHFTS KHY FGFE AAA* Y WHFVDWRL 
FLYASIY* *GSLL 


7424 


21325 


A 


7484 


385 


56 


GNFPTPKKFNFFYPFYLSI FFFFPKKLK 
FF* E I FP I FPPP KFLFLPPNFP VLFFFF 
FLFWBFFFSFLFWFFFFPFFFFFFFFFF 
FFFFFFFF* KTTEFYFVNSNFQHTK 


7425 


21326 


A 


7485 


424 


6 


F FFF FS EGS GL I RED P IGAGALYD YGR * 
LVWTG* TLFVGVYIVIEIARGNRLCD 


7426 


21327 


A 


7486 


16 


129 


; FXXXRXXLIXKBPSINLE*LYGCPPPYH 
TFEEPVYIKS 


7427 


21328 


A 


7487 


183 


387 


I ASGLD I FLLLLIiETGSHS VTQAGVQWH 
NHDSIiQPRLTGLK* SSRRSPRGSWDYRV 
LLCHPGWSAEA 


7428 


21329 


A 


7488 


351 


169 


EKKI FFFS PRGYGPPPFFFLKAPPHI FF 

FFFFFFFFFFFFFFRGGGGDHKNFFFFF 

WGFLKNFPLFFFFPRV* I* I FFFFFFFF | 
FFFFFFFFLGGGGGTIKIFFFFFGGF 


7429 


21330 


A 


7489 


47 


212 


THASAIiFL* IRTAYPRFRYDQLIHLL*K 
NFLPLTLALLI*HVSIPITISSIPPQT 


7430 

• 


21331 


A 


7490 


9 * 


273 


TSGSCCSAHILAFKP*AGMATFITKSL* 
*CHTHTHTHTHTHTHTHYMYIYTHMHTY 
IF 


7431 

* 


21332 


A 


7491 

■ 


70 


405 


LYGCPPPYHTFEEPVYI*EAFASKRKVI> 
IVEEPSINLE*LYGCPPPYHTFEEPVYI 
KSRQKRKESNPPKLVSSQPHGLHDFFKK 
KKKKKKKKKKKGGG P L KKPRGG PNLTGG 
GKKNFFF FRGGE KKP PGGFWKKTLFLGG 
GKLGTPPPRRF 


7432 


21333 


A 


7492 


22 


208 


RSSLPARPTRAVI LI I FMI * EAFASKRL 
VLIVEEPSINLE*LYGCPPPYHTFEEPV 
YIKSYV 


7433 


21334 


A 


7493 


2 

• 


204 


KLPETVKKKKKKKFYSFRSIVMLLLYLF 
S*SLSFFLLYSLLFFLFLLSFYIYSFLC 
r rixJivi Al lbH 


7434 


21335 

* 


A 


7494 


2 


104 


TTLLIiSRT*NKLT*IiTPLIPSTLIiSIiGG 
LPPLTG 


7435 


21336 


A 


7495 


291 


160 


PRHF* IFVEMGSHYVAQAGLKLVASSNL 
IiAWASQS AGVTGVSHGS LLITS CL 


7436 


21337 


A 


7496 


422 


180 


LGAPQGVF*KGPPFSPSSSSSYSSSSSS 
PS * QNI FWKI FLFVCFVLETBS VTQVGV 
QWRDLGSLQPLPPGFKRFFTITTH 


7437 


21338 


A 


7497 


244 


296 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
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SEQID 
NO: of 

nucieuuuc 

sequence 

- ■ 


SEQID 
NO: of 

sequence 


M 
eth 

uu 


SEQID 
NO: |n 
USSN 
09/515,1 . 
26 


Predicted 
beginning 

n ii clenti rl p 

location 

corrfesnnnd 

vl/l 1 vtfUUUU 

ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

IJUUW 

otide 

Inrntfnn 
correspon 

dim* to 

last amino 
add . 
residue of 
peptide 
sequence 


Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 

R=f2liitnm it*. Acid F^PhenvInlnninfc. 

G=Glycine, H=Histidine, I=Isoteudne, 
K— l,vsinf» 1 ~\ murine M=IVfethlonine. 

N=Asparagine, P= Proline, 
0=G lu ta mine R=Arcinine. S=Serine. 
T^Th reonine, V^Valine, 
W»Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon,/=posslble 
nucleotide deletion, \-possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFFFFFFFG 

gnf* fyk*ll6mrdgvclppssyl 


7438 


21339 


A 


7498 


1 


410 


PTRPDQLIHLL* KNFIiPLTLALLI *HVS 
TPTTTGCTPPYT 


7439 

■ 


21340 


A 


.7499 

• 


51 

• 


478 


PQKYTLIMKICIIPPGKGFRPVEMLAGG \ 

IFSSLLYKYNMIYCLMLCYYIGYIFIFY 
DMYYRQ YSVL YYKL YNYVML I * IYILYL 
FMCT * KYIYFTFM* YKYIFIYIiYLY* YK 
YIFYIYLCVYKNIYTLYLCNINIYLYTY 
IYININIY 


7440 


21341 . 


A 


7500 


3 


110 


FTS KHHFG FEAAA* YWHFVDEV* LFLYV 
SIY**GS 


744 J . 


21342 




7501 


loo 


111 


NPDINPYICGQMIIFLQLKSLHKI 


7442 


21343 


A 


7502 


400 


159 


NNFS PLRNLiQGFFGPLKFPMGFQAGPPG 
*GFSFFLRSNFFLDVFRGKY*TFGF*DF 

L* 1 1_ £r O IriN Cj V tr tr ijnutaDi AJrurc if 


7443 

• 


21344 

F 

* 


A 


7503 


401 

- 


257 


TSQKKKKKKKKTPKKKKKNREKPPKKKR 

P,P * P QLVKRGGRE KKKKNLWKKKGAF * K 
KXKKKKKNPKJCICKKKPGKTPKJCKKGDPG 


7444 


21345 


A 


7504 


44 


344 


LI^IANIALPPTINIJ^ELSVLVTrFS 
* SNI TLLLTGLNI LVTALYSLY I FTTTQ 
WGSLTHHINNIKPSFTRENTLMFIHLSP 
ir.TiTiSTiNPDTTTGFSS 


*7A A< 

7445 


2J346 


TV 

i 


7505 


200 




J. " /if ixn v uynuu£i rux ci/urna/uydA 
RITSVSQHAWPVHNIFYSLL 


7446 


21347 


A 


7506 


3 


213 


RBLPGLKR FFLLS LLSSWD * WRMPSH PA 
TFCI FCRDEVLPCCPAGLLILLVWNLKN 

t it T r-i n r-i T T 1 


7447 


21348 


A 


7507 


400 


20 


S LPQKKVS LLKLKSAVQLVMF * DRFSLC 
HPGWRA*GQSGVTTASSLRAR*SSCLTW 
EYRPSPLPLADFLFW*RRGLPI*PRVVS 
NS *GQ VGZaLHQPS KMLKFS SLS PCARPL 
P PP PN RSTNT J? VQS 


7448 


21349 


A 


7508 


349 


227 


SDFLTSANPPASASKSFGITGVSHHTRP 
* PRFLKVNLI YS EMEVYN* AS FRHVD * G 
GLELLTL* FTHLGIiPKLWDYRREPPHPA 
IVTKT FKGltP YTO 


7449 


21350 


A 

* 


7509 . 


371 


21 


FLVETGFHHVG*AGLEPLASSNLPASAS 
QTAGI TGMSHCAQ PNPGSLLNKTMLAVA 
TEQWVGYMWPSVFGHITARPEGKRLSDC 
LHSDTNVCWCVCSGLI FVCFETGCRFA 


7450 

• 


21351 


A 


7510 


330 


14 


DKVCSVTQARVQWHDHGSLIiFLSSGLRQ 
SSHLSLLSSWDHRHMPPO«ANFFYF**R 
IxRSHYVAQAGLKLLASSNPPTLTCQSAG 
I TGVS HNTW PKAL FS VTDLKH 


7451 


21352 

. * 


A 


7511 


417 


33 

* 


APPCGGGPPKRIFGPSRKRGKGPPPPKR 
FF*KNKPPKGGGKKSPPPPKNFFPKKKN 
LGPKKKKSGFWGRFAPPRAPGGGVFNFP 
FKAGGEKFFSRGEINPPRGAGKKKGFFL 
GKKNTKKKTPKKKDL 


7452 


21353 


A 


7512 


81 


296 

0 


CLREGASHSAAQAGLHWHDHSSLHP*NP 
GLRGSSRLKLLS IWDYRCVPPSLANFES 
CLLKIILGSPFPKNL 
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SEQID 
NO: of 

micicDuuc 

sequence 

* 


SEQID 
NO: of 

peptide 

sequence 


M 


SEQID 
NO: in 

09/515,1 

XO 


Predicted 
beginning 
nucicouue 
location 

t% f% mm mm nn *\A n #1 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
acia 

residue of 
peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
n/^vFiu lainic acju, f^jrnBnjriaiflninet 
G=Glycine, H=Histidine, I=lsoleucine, 
k— juysine, \j— i^eucme, ivi— ivi em i o n i n e, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine,-S=Serine, 
T=Threonine, V=Valine, 
w™ i rypiopnan, i b tyrosine, . 
X^Un known, *=$top codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 






A. 


7313 


A17 

412 




AQFETS LGNTGRPCLH 


1A CA 

7454 


21355 


71 

A 


. 7514 


Tit 

74 


126 


RGPPWPRPGPTVWKG*NKKN* PAPRGSN 
PPF FFPGNPGPFKLRGWPGS S * GS FQNC 


7455 


21356 


A 


7515 


332 


20 


GBRQVGSYFFMNTERLFGMTKKF* KWIV 
SMVAQHSSS I SGVWVFRGQELAFPLSPD 
WQVDYSSYTWRKLDPGSEETQTLVREYP 
SWbClAryHVGKArNyGK.lr l\. 


7456 


21357 


A 


7516 


1 


167 


RTRGQRFTVLARIVLIS*HCDI,PASASG 


7457 


21358 

* 


A 


7517 


2 


223 


GRMEEMGSHYVA*AGLEFLGASDLPA*A 
PQI^II^LLGGHOTGLTILPRLSSNSW 
PHVKWPRWPLKAUGIiRG 


7458 


21359 


A 


7518 


151 


476 


SLGKFAVS FS FS FLK * RRALAVLPRLCS 
NSWP*AVLLPMLPRVGIAGMSYHTQPDY 
H FQARAVAE I LGL VSTQHS SVTGLP VEN 
S FRAGHGGPRLQS QHFGRLRWADC 


7459 


21360 


A 


7519 


3 


134 


CSFRLGAVAHICNVITIiGGRGGRIA*AQ 
EFBTCLDNIARPCLY 


A *f ft. 

7460 

■ 


21361 


A 


7520 


14 


219 


APS IHGFGFLYTYIHTHTHTHVYMYI * V 
YIHTH I ESI KCQPLTQLQSNKKANR I VP 
RTQFMLTCY FMR 


74ol 


21362 


A 


7521 


134 


334 


RKHSC * I FFNFCF YLSLETRSHYVASTG 
LDLLASSNPPALASQRARITGMSHHAPP 
VLTRFKKSLF 


7462 . 


21363 


A 


7522 


406 


145 


FLKTGFSPFGPFVF*TPPPGFPPPPPPP 
IFGVPGGGPFAPPLFFFFPQTKHFPPFL 

till y\ f> Tl f> /%T1 TT> TH T > ■ jT'if tmnX i if mi n T*^ T*% 

NPPS PP PC2RGFFRGFFLFPPAFFLKTP P 
FPF 


7463 


21364 


A 


7523 


3 


382 


HPQGEGGRL TSLE * NTS LRLKKKKKKKK 
KRGGP FKKNP WGAQI * PGKKKKNFFLKR 
GAKKNHLGNFGKKPYFWGGKKLAKPPKK 
N*TFKGKKKFLRGKGGKKTPKPWLLKI F 
FSGFYLKKIFPPGP 


7464 


21365 


A 


7524 


319 


106 


GTRS CS VT* AG Vy WHDHS S LQ P * TP WIiK 
RSSWLRNRWDYKHEPPCLNNFYFFCRDG 
GLLCHPGWSQS PELK 


7465 


21366 


A 


7525 


433 


228 


MQP YNWE VNSS SQLS LS S ARQRVTHVLT 
GNFFIK* ERWDLPVLPRLVSKSWPQVIF 
PPWPSKVXiKLjQV 


7466 


21367 


A 


7526 

* 


26 


351 


CRS LVFSLP KPGLRF VS I LHIQVFSH FD 
rFFIil\5NTyK*KJ\ w TFFIFFA w liF^vTPA 
PKPGMVAHTCS PS YSGGLDGRIA*AQBF 
RTSLGNKTNPSLEGGRKKKKPSGS 


7467 


21368 


A 


7527 


438 


62 


QLSKIHRPPAGGRFFKSGGGFKPFGPVG 
LPPFGKKANQPTPKNPPKKQKPSGGGLS 

PAPKDYCPIV*AALIPSKKKKKSKPG*V 
KKGKPNLYSIYNK 


7468 


21369 


A 


7528 


3 


132 


FYYL*RRGFPVLPRIiVLNSWI J QVIU J PW 
PPKVLN*AAHHPHG 


7469 


21370 


A 


7529 


2 


628 


FFFLPGWNRVLLMGRGLGEANSREGRG 
PSQPRGMHGQLEVRGGRSQGDGWHGTLS 
PPWGR I YRGAPPTFAAPQAPKPFRQLLP 
M*LPGRPSSCLLPETPALSPLPSAEWGG 
FKLLTGTSETPGPSGSPSVGSANAQAAG 1 
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SEQID » 
NO: of 

nucleotide 

sequence 1 

'i 


SEQID 
NO: of 
nentide 

V mf V* \M \i 

sequence 


M 

etb 

od 

• 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ine to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
corresnon - 

Vvl 1 vj Uv*l 

ding to 
last amino 
acid 

residue of 
peptide 
<?pn ii price 


Amino acid sequence (A^AIanine . 
C=Cysteine, D=Aspartic Add, 
E^Giutamic Acid. F^Phen vlalanine. 
OGIycine, HHHistidine, I»lsoteudne, 
K=Lysine, L=Leucine, M=Methk>nlne, 
N=As oa rapine. P=Proline. 
..Q=Glutamine, R=Arginine, S^Serine, 
T^Threonine, V=Valine. 
W=Tryptophan, Y=Tyrosine, 
X=Un known, * e3 $topcodon,A s possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


• ■ 


■ 






■ 




ATTIVARGHTPCTRGPLDTHAHSSLRWH 
AHTRP P PRPLAHEHGTHTHAHPHTPGAW 
VGBGPFQGWGRA 


7470 


21371 


A 


7530 


63 


205 


GRPTRPRTRGVKI LI* QFBHTKPTPGLPT 
LIALTTLLLPISPFILIIIi 


7471 


21372 


A 


7531 


390 


56 


KTFFFFNLIAFTEAKSVLLFLLLFFBTG 
SRSSAQPGVQWHHQSSLQPQPL*LKGSS 
CLSLLSN*DYRCAPPCQ*SPCLSLLKCW 
DYKREPPHPAKESHMQQSYLIYSLYIA 


7472 


21373 


A 


7532 


543 


335 


LAAYGGTCLQS * LFS RLRWEDHLS PGVQ 
GCNELQLQHCS PAWATE * DPVFKGEKKE 
RVLVKVFS 


7473 


21374 


A 


7533 


3 


145 


WCDHGSLEPRSPGLKQSSCLSLPSNWDH 


7474 

• 


21375 


A 


7534 


386 


106 


SWVTDEITME I * ECVLNDNSDTP YQNLW 
DTEKWLRAKFTALNACMKKSERAQIRH 
TCVHTHTHTHTHTHTGS KCLLRKYLLLT 
IQQQNIRIN 


7475 


21376 


A 


7535 


3V8 


I 


RGGKFFFFGKKEFPPVGQGGF*FLPPGV 

ISYPILAKQLKLCFLKXKKKSIQAIKLY 
HHQ KNP I TKS SVFWESQPRA 


7476 


21377 


A 


7536 


2 


344 


HSKCVPAEWTMAYYIRYLS ISSLLHIL 
ENIVHFFFHFLY*GPNNLFFFYFAGTRL 
TORS WK* PMT I * * VHVQ I KREGO I PNK+ 

IFCMIGRGG* ITCGQEFETSLVNMVKPC 
LY 


7477 


21378 


A 


7537 


116 


224 


LHTHTHTHTHTHTHTHTQFNKLP *LFI P 


7478 . 

> 


21379 


A 


7538 


1 

- 


383 


WHERTHSRI I ILFQGLETLLPLIAF* * L 
LASIiANLVLPPTINLLGELFVLVTTFS * 
SNITLLLTGLNILVTALYSLYIFTTTQ* 
GSLTHHINNIKPLFTRENTU1FIHLSPI 
LTjLSLNPDIITGFSS 


7479 


21380 


A 


7539 


31 


332 


DNYLSSYSSSGSWMRELILASQGIRWEP 
IVDRTPSHHRTHTHTHTHTHTHTHTHTV 
LFRLGP WTHAREPRGHICGN* IRNQSP * 
KNPRTRGKNIoRTPHGW 


7480 


21381 


A 


7540 


199 


15 


NNVQ I K* Q FEHTKPTPFLPTLIALTTLL 
LPISPFIIillL 


7481 


21382 


A 


7541 


337 


157 


AEIVPIiNSSLGNRVRLH*MESYSVAQAG 
VOWHDLGSVQPLPPGFKRFSCFSLIjSSW 
DSRLYFRCHD 


7482 


21383 


A 


7542 


28 


313 


RARIGSDQCLSAGTQWENKSQISGDFM 
WNDFRS * S VTRVGVRWRNHGSL * PWIPG 
LR*SSGFGLPNCWDYRDRNEVWGGKKBV 
TPPTVNLOKSO 


7483 


21384 


A 


7543 


493 


180 


GGFRGS WLVPRDGGQGVGFCSLPPPPSR 
VKQFFCLRF PRRWGGREGS PRPS YF FFP 
♦EKPRFFFLGRWLNFLGQGVGPPWPSQ 
RVGFPGLTPGPRASFFFNLI 


7484 


21385 


A 


7544 


403 


426 


FF*KFFITFHTIjFFFFFFFFFFFFFFFF 
FFFFFFFFFLAPPPGGFFFSFFYQKIFL 
KFPGQPEKFALPFGGGFFF*GKFPQFFF 
CFLVFFFFFFLFLENGVSLYCPGRSRTP 
GLKLSSRLLASLRAGITGVSH 


7485 


21386 


A 


7545 


3 


335 


RHYSDYPDGYST*NILSSEDSLISLTAA 
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sequence 


M 
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26 
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nucleotide 

location 

correspond 
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.amino acid 
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peptide 

sequence 
* 


I Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
KHLysine, L=Leucine, M=Methionme, 
N=Asparagine, P=Proline, 
Q=Glutamtne, R-Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y==Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


• 






* 






LLMVFMI *EAFAYRRTGLIA*EPSMNLE 
♦LYGCPPPYHTFEEPGYIKSRLKRKBSN 
P PBCLVS SQ PHGLHDFFKKKXNRGGPF 


7486 


21387 


A 


7546 


424 


140 


RRVKFVGGGAPQGP P PKRGVLPKI PRE K 
VFPPPPG*NGPPGPAFKTPPKKKKNISS 
PPPGNWAPPGGLLKGPPPFFFFFFFFFF 
FFFFFIiNPLK 


7487 


21388 


A 


7547 


2 


348 


FCHVAQSGLKLLAS SDPPTLASQSARI T 
GMSHHGGLK*GE FLGPSSNKGALLLWEA 
TWLQSFMKSASLFVKPRPTFVYFVFAFV 
GYRDIYFTASSLIKVIQEGLKYTMKCAP 
LTL 


7488 


21389 


A 


7548 


2 


90 


RKQAYTHTOTHTHTHTHTH* KKTGXHAH 
KHTHTHTHTHTLKAEKQIBGGK 


7489 

• 


21390 


A 


7549 


256 

* 


405 

• 


CELYSGKEMELVFGLFIFTGVTRLETFY 
KKECSSFFHSQSRGHKLQPSILNEWWHI 
RNPLK3LTHFLIiL*KTGVAEYKNSLNVVH 
HPFFIiSYAVS FLLQGWSFAMLS KLGSNS 
* VQARCLPQS PKVLGLKES PQERTVNVS 
SIRGKKWSWYLDYIiFSQGLQGLKLFIRK 
SVHHS S I PRAEGINCNHQY 


7490 


21391 


A 


7550 


2 


362 


ILIMNTLIiALLLI I ITF*LPQLNGYVBK 
STP YE CGFDP I S PARVPLC I KF FIAAIT 
FUjYELEIAI4LLPLP*ALQTTNLPLIGM 
SSLLLI IIIALSIiAYE*LQKGLD*AELK 
EEQKTLQC 


7491 


21392 


A 


7551 


1 


317 


TTFDNSALLFFWDGGEIEFHSVSQAGVQ 
WHNLCSL*PQPPGFKQKLILPQIVKENV 
SKISSQLLFSRVNINISPSBQCITPPSL 
QLARIGIFYAAKIHLTKGLRG 


7492 


21393 


A 


7552 


479 

* 


183 


YLFHVHTTASHSNGACTGPTVGDIIVSS 
TL* SMTEQVTMPLAS AVSDGTVPSVRTA 
SRGSEQAAESMVSSEHIEILEHAGELVI 
ASPEGQLEVQTVIV 


7493 


21394 


A 


7553 


454 


105 


KPPWGNKIRPPLFPGRKVPPPRF*NPVR 
GS PPKK* FVFPVGRVLGTGVTPLS PFLK 
TTPLLLWDP PLS Q P PGGVKPENSLYPGK 
KRFR* PKFPPCPPAWGTKRKPPLQKKKK 
RQVL 


7494 


21395 


A 


7554 


439 


1 

* 


LLKRC VRKDS PP PQNK I FFF I LKKFVFF 
FAPY*VRKFYFLTAHFGKRPPQIYIFGP 
PPVFLIFCCFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFNKQVFI 
ERLLCARHCFRP P PEE KTKNKIEAANPN 
SEGEER 


7495 


21396 


A 


7555 


323 


82 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFR* *AYL 
VRSPLLFIFVFSYF*LI*HVYHRYSVYI 
ELDVLLTGSKIILI 


7496 


21397 


A 


7556 


194 


412 


KMYRTSLGHLVEAKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKDSRGGVV*KKFT3GG 
HNTRGVKI I FFFSLGG 


7497 


21398 


A 


7557 


414 


219 


KGFKKLFPPPVF*FFLCPPPIFLKGFP* 
GRKPPPPKNPQGGPPFFFFQTPPPLFFF 
FFFFFFFF 


7498 


21399 


A 


7558 


377 


25 


THVGGVFWGKFFFSRRVFFFFYHLIQIL 
SPPPQKRPPPKGTPEG*IFPLFKEINFF 
FF*DFFFAPPPFFFFFFLFFFFFFFFFF 
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Amino acid sequence (A»Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, HHHistidine, Hlsoleudne, 
K=Lysine, L= Leu cine, M=Metnionine, 
N=Asparagine, P=ProIine, 
Q=Glotamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=^rryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFLKVKTGV 
YCVPPIAVILYR 


7499 


21400 


A 


7559 


386 


2 


FFFFSFGEI FPPPGISGPFPPFPL*NFF 
FPLRPLFFLGGGAPFFPPPKXGFFPKIP 
RGVFFPPPLREKFFFSPPP* IWAPPGFF 
LKGPPLFFFFFFFFFFFFFFYRKARTKP 
ICLWQDVSPSKHFQCI 


7500 


21401 

4 


A 


7560 

• 


1 


396 


KI CHRI LRP I KKVK* EKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKGGGAS 
LKIPSGGPQLPRGGEKNPPPFLGGEKKP 
PGVFLKKHL FLGGGHFG P P P P KKNS P WG 
EXKNFWGGGGKKKQNIPGGKKI PSHKKX 


7501 


21402 


A 

■ 


7561 


1 


390 


N YVFKE FSDKLGAVAYTCNPSTLGG*GL 
WIT*GQEFETSLGNMVKPYLYQKYQKXK 
KKKKXXKKKGGG P FKKTLGGAQ FNGGGE 
KKFFFFLGGVKKPPRGFLGKNPFLGGGK 
MGAPPPPKIKALGEKKNF 


7502 


21403 


A 


7562 


406 


182 


FVFFFFFXXXFXXXXFXFXXXF*TXXFF 
FFXXXXFFFFFFFFVCFFFFFFFFFFFF 
FFFFFFFFFFFFFF FFFFF 


7503 


21404 


A 


7563 


2 


113 


FHFTS KHHFGFEAAA* YWHFVDW*LFL 
YVSIY**GS 


7504 


21405 


A 


7564 


313 


97 


SPTYPCYIRNSAQSFSLIAKS FCSHPHT 
YHHDPHLEKSLTYCPQSPS*PIIPFVTC 
TYINRWPEATEEPQNK 


7505 


21406 


A 


7565 


3 


144 


DAWVF VATG FHGLHVI IGSTFLTI CF IR 
QLI FHFTSKHHFGFEAAA* YWHFVDGV* 
LFLYVS I Y* *GSTFLTICFIRQLIFHFT 
S KHHFGFEAAA 


7506 


21407 


A 


7566 


1 


130 


FIRQLIFHFTSKHHFGFEAAA*YWHFVD 
W*LFLYVSIY**GS 


7507 

■ 


21408 


a 


7567 


434 

• 


1 


PDIiNKVI FFPHRKKKEPYKGHI VGKNLR 
NEMYS IQDTNSLPRRER*RKTRTCI I FP 
HKIKTSEHSFDIKEKH*NYVTLCKVLRK 
LSTNQRQS * I FLS TNKNDTLGRARWLTP 
AIPALWEAEAGGS *GQEIETILANKVKP 
RGRV 


7508 


21409 


A 


7568 


441 


86 


NFSFREKGGEFVPPPPLKI FFFPPPPNF 
FGGGGPPSPPPQKKVFFPKPPRGFFFPP 
PKKKKNFFPPPQKIGPPPSFF*TPPPPF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
MFVCLY 


7509 


21410 


A 


7569 


1 


155 


PTRPDLRSLRFEAKSPNSRRTLHKPGVT 
I * EAFAS KRKVL I VEEPS INLE * LYDYG 
FFYYKIKEPVYFYSNQKKE 


7510 

• 


21411 


A 


7570 


434 


1 

• 


S PTEAGGKNFFKKNPGRKTPPKKKKNI F 
FSPLTPKKFFFPPRG * IIK3GGGGPNCPP 
PKKGVFPKNPQGVFNTPPKKKKKKFSPP 
GKNGGPPGVFLKGPPPFFFFFFFFFFFF 
FFLRQSHTLLPRLECSGA 


7511 


21412 


A 


7571 


493 


196 


S SLDCS * S LQGS I * FLTLMMLAVDFVDV 
LY * V* RVLYVLVQVAITKCH I VSGLNNR 
HS FLTAVKARKS KI KVP ADLVPGEVS IiP . 
SLFS 


7512 ' 


21413 


A | 


7572 


1 


239« 

4 


•KGLTILDAIKNICDS*KGVKILIFSEVW 
KKLVPTLTNDFERFNTS VE EVTAHWET 
ARELELEWPQDVTEL PQSHDQT 


7513 


21414 


A 


7573 


391 


32 


FFPQPSPPGVKGFFCLNLPNNWGFRAGP 
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Predict- 
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dins to 
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peptide 
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Amino acid sequence (A~Alanine 
OCysteine, D=Aspa rti c Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^GIycine, H=Histidine, l=Isoieucine, 
K=Lysine, L=Leuclne, M=Methionine, 
N=Asparagine, P=Proline, 
0=Glutamine, R=Arctnine. S=Serine« 
T=Threonine, V=Valine, 
W«Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


• 


• 


■ 




• 




PPPGYFVFLVNKGVFPVGPPGFELPSPS 
*VAPPASQKGGVSGVSPCPRVWVPFPKK 
KKKLRGVEQRCTMHLLVYS KNVPVKPQK 
KSSIGSL 


7514 


21415 


A 


7574 

• 


2 

■ 


383 


PRVRSTAFSS I AH I TRDVNYG * I IRYLH 
ANGAKKKKXKKKKKKKKKRGGALKKKPG 
GGQKKGGGEKKNFFFKGGEKKKPGGIFE 
KKTF FGGGENGAH P PQKKKT PGGKKKI L 
RGKGGKKPLYPWGEK 


7515 


21416 


A" 


7575 

i 


402 


69 


KIFFKNSPLWKISHPPTYRGFFPPFPP* 
NFFFLPGPYFFLGGCSHLGPPPKKGFFQ 
KIPPGLFLSPPFKKKIYLFPPRIFLAPP 

m-^tmmmm mm mmw mr ^mwmtm 9 mm mm> m» 4>^4>^4t^^4» mm mlmfmm mm mm mi^imm. m* MTU mm 

GVFLKAPPPFFFFFFFFFFFFFFFFF 


7516 


21417 


A 


7576 


1 


119 


LI FHFTS KHHFGFEAAA* YVJHFVNW* L 
FLYVSIY**GS 

Mm mmmw m\ w %m* mm 


7517 


21418 


A 


7577 


219 


69 


• NDISANTAIKKFWKLPKCPTTEGWLNT* 
LYIHLVEYYATLKVCGDLYVR 


7518 

■ 


21419 

« 


A 


7578 


3 


287 

■ 


HASAQLNG YI E KST P YERGFDP I S PARV 
PPP T KFU^VAI TPTjIiLELB TALIi^PIiP * 

ALQTTNLPLIVMPSLIiLSIIIiALRLAYE 
*LQKGLE*AE 


7519 


21420 


A 


7579 


10 


125 


ALQTTNLPLWMSSUiLI IILT*ALQTT 
NLPLWMSSLLLI I ILTLILAYEGLQKG 
LN*AE 


7520 


21421 


A 


7580 


3 


299 


DAWVRLLDFDF*VELPTEARIRIIITSQ 
DVLHS *ADPTLGLKTDAI PGRLNOTTFT 

mm* w mmmm-m+m 9 mr^Mmwrnm mm, MW^liVA i/fAA *• W A A a, X 

ATRPGVYYGQCSEICGANHSFMPIALBL 
VPLKI FE IGPVFTL 


7521 


21422 


A 


7581 


3 


116 


AFAS KRTGLGVEEPSLTLE * LGGGPPP * 
HTLEEPVNT 

m>m> m± 1 f fti * * i V 4>^ + 


7522 


21423 


A 


7582 


398 


2 


RVFPAPKNSLQ I FPPLFFFWGS PWKFFF 
TPFPHFVFWFGNFKFNPFFAFLFRLPFP 

» mm Mm Mm m»m\mm W mm »»* ^» » " • »^ 4 4 M4 *» M* 

GEKIPLVFFFFFTD*NFVENFFFFLSLL 
FFFGFFFFFNM*S*LF*SPANFNV*VTH 
LFIFMVFFHPDFWSFGPILW 


7523 


21424 


A 


7583 

( 


288 


9 


RGSKNLE KNS F P Y KVNNHGNGS PLFPKL 
FFSPRGVGPFFFFFETRSRSVIQAGVQW 
HDLSSPPGLK*SSHLSIQSSWDY*RAPP \ 
HPANFVYF 


7524 


21425 


A 


7584 


1 

* 


124 


RDGGFTMLPRPILLTGLKQSSCISLPK* 
WDYRCEPLHLASC 


7525 


21426 


A 


7585 


339 


76 


THPGHNGIS VS PKKKKI PRGGGP PPLFP 
PLPRVGGGDSL * PRKGGFN* TKPAPFPS 
SWGKKKELPFSKKKKKKKRKKGS ISTPF 
SRIB 


7526 


21427 

«V * • mmm m 


A 


7586 


205 


87 


SISLSIiS LS HTHTHTHTHTHTIiSHPRCV 
PQPIGLPISKWAKVKELIT*R 


7527 


21428 


A 


7587 


353 


256 


PHPSVTTLVTSQQYQDPPPAKRL*LAEG 
SDDC 


7528 


21429 


A 


7588 


159 


195 


QICRHRWHINFLMSMGSIMITPPSFLF 
LNFL* S YVAQAGLKHLSQSDPPASASQS 
AEIRGMSHNAWPHF* YFVEL VHFI YINK 
FVGIELFILTS 


7529 


21430 


A 


7589 


li 


205 


RKKPE PEKTCFDN I TGNTYPCLS T YEHD 
SD*YYVNRT*LCAER 


7530 


21431 


A 


7590 


218 


20 


LS L YLASHL WLKP PHPLE WAPS KS 3 LRL 
GTVASPCNPSTLGGRGGWVT*GQESETS 
LANMAKPCFY 
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SEQID ' 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

■ 


SEQID 
NO: in 
USSN . 
097515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidinc, I^Isoteuclne, 
K=Lysine, L=Leucine, M=Methiontne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7531 


21432 


A 


7591 


142 . 


12 


QIDY YYYLFI * RKGLTLMLRL VSNS WLH 
IILLPWPPKVLGLQM 


7532 

r mJ -J m- 


21433 

mm M 

* 


A 


7592 

i m* m* mm 

i 


2 

mm 


224 


QIIEALLITILLGHYFTLLQA*EYSKAP 
FTISDGIYGSTFFVTTGFHOLHGIIGST 
FLTICYNPQLIFHFTSEHKFGLQAAP*Y 
WLFVD VARL FVP AS I Y* *GSLFYPPTSL 
RVLQG S LHHFRRDLRLNI FCNHRLPRTP 
RHYWLNVPHYLLQPPTNISLYLRT 


7533 


21434 


A 


7593 


1 


323 


HAYGNTLVA* LWTMTF * LPLVDGYIKK 
FTPYEGGFEPKSAARVLFS IKVFLVAIT 
FIiLSDIiQSAT»LLPLP*AVQTTNIjPLIVM 
AALLLDI ILALSLAYA*LLKGLE 


7534 


21435 


A 


7594 


1 


104 


RACVCVCGVCVCVLCLLXMCVCVCVVCV 
CVFCAC+ KMCVCVCWCVCVFCAC 


7535 

i 

» 


21436 


A 


7595 


383 


43 


VFISETLCBHVAWSQTTIBSLGLGIVAH 
TLNAS TliRG * GRM IA* GO* SKTRRGNMD 
S ISSHSQT IAQS PHLLGRHQPKWCMCVQ 
AAEAVLKGQVETDASFQIQKIPINSLIM 


7536 


21437 

* 


A 


7596 


2 

• 


331 


WPPTGITPLNPLEGPLLNTGGLLA*GVS 
MT* AHHSLGEINNRNOVIOALVI TI LLGL 
YiFTLLRASEYFESPFTISDGVDRSTFFV 
TTGFHGLHVI IGSTLLSIWFIRQLICNF 
TSKHHFGFKAVA* YWLNFTKYIiVHPPTN 
M 


7537 


21438 


A 


7597 


384 


23 


LFKTKQYRFLLPPFPLKIFFFPLKA*IF 
LGGFSHYFPPPKKGFPPKSPPWF* IPPP 
YRKKHFCSHTPKNIiAPRWFF* KRPPHFF 
IFFF**RWGLAMLPRLFSNS*AQAVLLL 
QSPEVLGL 


7538 

/ mm* mm\m 


21439 

mm* 1 ~mJ ^ 


A 


7598 

§ mm mT%M 


401 


189 


RVLPCFAGWS*TPGLKCPTCLGL* KCWD 

^ ^ w mmmmm mmr^m ^ mmmmr ' * ^» mwmmm my ^mm' mmt mm* ^m mmimr^mw tmmmm m> mm^mr w w mmW 

YRREPPCLIRFYLFVYFLRQGLAMLPSL 
vlns WP * VTLLP * S P 


7539 

W mm ww ^ r 


21440 

mm m • F X* 


A 


7599 

€ m* mr m* 


81 


368 

mW W 


KCVI YPFLS ITLGKAKYDFFFFLEKELA 
FFPQGEMQGGNLG*LKPPPLR* RDFSCL 
ALPRGGNGRGAP PS PTNFGFLGGNGVYP 
SGPGGFBTPDLK 


7540 


21441 


A 


7600 


264 


25 


AVEHP QLRL FCS HY * NNI VE ERLGVLiAR 
TCNPCPLEDKAAWIT*RHBFETSLANMV 
IFCLDIGSKSFLSMRFSRCLLWKL 


7541 


21442 


A 


7601 


1 


196 


WE PSLVGETNVNS FNQKYINW PGAVAHT 
YNAGTLGGQGGW IT* GQELE ITLS KKVK 
PSTYFTNTR 


7542 . 


21443 

mm m v 


A 


7602 


315 

mf m mm 


82 

*mwmm 


TCTQVF IALFITAKKFKLICPLTDNWIS 
RVWYVHTMKHISAI KRKAVI HATMWVKL 
ENIILSEIKQSQKATGCMISFI 


7543 

. 


21444 

- 


A 


7603 


334 


H8 

• 


PKIKTEGGPPPKKNAGGMYK* KKF ILVL 
AAPPHPPFGVFSVFS FYSFWFFRFFFFF 
FFFFFFFFFFFFFFLR 


7544 


21445 


A 


7604 


3 


288 


DAYTT*NILSSVGSFISLTAVILIIFMI 
♦EASSSSSSPPGGLRPRAFPVRPPGAGL 
VF*VSVPPDI*PQCSIiAGLQVLRFMELAR 
LAPHIRWIIQ 


7545 


21446 


A 


7605 


354 


33 


EGRARYTRVPQRPRFHLLGIFFAPQLGK 
KKKFS FS KKKKKKNQLWLG AVSHANNLS 
TKGG*GRGSRFT*A*EFRTSQKLGTKGN 
MVKPISTNTFKNERGGAELSQIi 


7546 


21447 


A 


7606 


401 


161 


YLVSHLLQI I LPPALF WI FFF * DR VLL 
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SEQ ID 
NO: of 

niiiivuiiuc 

sequence 

■ 

• 


SEQ ID 
NO: of 

n pit ti ft a, 

sequence 

• 
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M 


SEQ ID 
NO: in 
USSN 
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26 


Predicted 
beginning 

nucleotide 

UUIrlvVIIUV 

location 
corresoond 

Vvl 1 WUVII1I 

ing to first 
amino acid 
residue of 

nentide 

sequence 


Predict- 
ed end 

n uci a— 

11 UVIv 

otide 

location 

correspon 
-ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
T^=Glu tannic Acid. F=Phenvlfilnnine 

(^Glycine, H=Histidine, Msoleucine, 
K 5 ^ Lysine. 1^=1 -purine. M— Methionine- 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Trvntnnhan Y=Tvrosine. 

X=Un known, * e =Stop codon, /"possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PS S WDYRHLLWRFLMPRGI LCVYV 


7547 


21448 


A 


7607 

■ 


354 


50 


SGAQDPAGQHGQTPFSQKVPKKVYPAWW 

FFVYLGVWLSNGEP P PLGGGGS VFLBAL 
EKSVGFSSIjGFGWVFFF 


7548 


21449 


A 


7608 


370 


115 


TI KNI CNS WEVKI PRFTGVWKKLIPATL 
MDDLEGFKTLVEEPTEDW* ITGELEVE 
PENVI E FLOSHDOTS TDEGItLLMDEORR 

w 




Z143U 


Jri. 


/ouy 

t 


i on 
IxU 


J JO 


KDPILFLTPFPCFHASSAMNLQLSHISC 
NSKATPHP LG YQQTY P PLTVHMT 


755U 


Ol Act 




/Oil) 


■a 

D 


1 fLA 

104 


WF *2iQTP *NTMTHTHTH*THTHTHTHPYir 
net AOir riinimninxniiiininr ia 

EENLSHLWNI SC I FLGAHKGKKKMN 


7551 


Z145Z 




/0l I 


85 


3<&4 


T?PPPappi?F*Klf TCJFPPnAOfWVJTTiP^ 
v nr r/xr rr r zv.ftj.Dr "\^r^j\Jv/\j\j a. jjxto 

IiDPPPPGLKKFSGLTPPRSWN*GPCPPP 

GLIGVFLKKRGFPIiVGREGPKLWT 


7552 


21453 


A 


7612 


268 


317 


FLENCFNPGGGGCSQPKLGPCPPAWGAK 


7553 


21454 


A 


7613 


101 


339 


AVPLTMVKIHALWKRVWRFHNKSKL*LP 

mpftT GT.VQMVPTrRlvrtr^UPTiTmVPQPPT. 

V^L/ irn IQJJVOrlir XVEjI V >— JJ Ja>L/ V i»o IT XVIJ 

I TGPLT I AGMWNP PNRSSMDYSG 


7554 


21455 


A 


7614 


366 


208 


LELLTSGDLPASASQSAGITGISHRARP 

(ZTTtWV"* 2VVMfif3^3WnVT« , r»TtT>P VT»l\IfZT. 
u J. r nviiuUwii^ v J? i ix* i_jxv v jjivvju 


7555 


21456 


A 


7615 


30 


328 

■ 


NYCLDFHGETELGTTHFLTHPSTRAMVF 

EPYCQAL* IL*GVINT*PTVLHRNRSHI 

ItAACTjCTjOAPTWNIjO TflH TTMWKY*TTP 
Ann^uuxjynni v viiMU^unMinivi, Air 

HPL*YQQTYPALEYI 


/jjD 


Zl4D/ 


n 
ri 


/OiO 




1 

i Jir 


TPHT iKnKFHT*P»TiPP CWDYRHEPPPPPPN 
iVlYLI*ISF*CV 


7557 

• 


21458 


A 


7617 


290 


91 


KMSFLIFFRDMASLCCPGWT*TPGLKQS 
S C FHL PRS WDHR CTS LHLAVKT FLYY FL 
KMFYYQNVWH 


7558 




TV 

/» 


/Olo 


51 


Z40 


HTHYWI YTWKHTIjKGPKIiF* EKTKMGST 
PIISWKERLSLYRS 


7559 


21460 


A 


7619 


23 


190 


IPPQPCPFLIjSLFCT*rDMSC2aMCACVC 
VCVCVYVCVICDMVLLP FCFCKLECSK 


/JOU 


21401 




/ozu 


JO/ 


Z 


PPGGFFPLPGFLKPPPPPLF*NPPLKKK 
WFFFPPPENWAPPRVFFKBPPPFFFFPF 
FFFFFFFWSSRFLIFPQFFFLNNKIGKK 
KTPRHLGA(5 


7561 


21462 


A 


7621 


6 

• 


25 


SVIj^SIIIFIATTNLIiGIiLPHSLTPTT 
QIiS INLAMAI PL* AGAE I IGFRSKI KNA 
LAHFLPQGTPTPLI P II*VTIETISIiLIQ 

ISTI*LLYYNP 


7562 


21463 


A 


7622 


383 


67 


AFLNPGKKPFPPGFKRFFCPPPPKKWGK 
NRGPPPRGKI FFFFFFVKTKFFHVGPGG 
F * I LTSGDPPGQQGGliGGFFFFFQIGGT 
GDLNIiSHKS PMPFTINTHNKG 


7563 


21464 


A 


7623 


372 


35 


TGGFWGVFFFFFFKQGFPFKSTGGNLGP 
KNPCPKGPRVSPPPPPF*NRGPQGGPPP 
PGKKFPFFFFFFFIETGFPHVALAGLEL 
PGSGNLPTS VSQSAGI PGISTWPNGVFQ 


7564 


21465 


A 


7624 


287 


6 


LSGVEWVDHMINICLFFKKLPPCV* KWL 
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SEQID * 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

- 


Predicted 

beginning 

nucleotide 

location 

correspond 

ins to first 

m mm mr mw m^ mm \^ m* 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corresDon 

mm mjf m mj w BF ^mr m m 

ding to 
last amino 
acid 

residue of 

DCDtide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glyctne, H»Histidine, I s lsoleucine, 
K=Lysine, L=Leucine, M=Methionlne, 
N=Asparagine, P=Proline, 
Q=GJutamine, R=Arginine r S=Serine, 
T=Threonine, V=Valine, 
W==Tryptophan, Y=Tyrosine, 
X=UnknowD, * sa $top codon, /-possible 
nucleotide deletion. \=Dossible 
nucleotide insertion 


- 


■ 










YQPVAVAHACNPSTLGGRDRWIPGGQEF 
KTTCLSLPKRWAHRRBPLYIAFCPFYNG 
IGCLLIIGF 


7565 


21466 


A 


7625 


271 


277 


NRGKGKTGS LG I LE KNP FFGGGP FWANP 
PP*GLKFWGGVGPKRPPPKKRVFFQNSQ 
GTS FPLPPVLKSGPGLVLKKAPQKEKNI 
TFQPPGKFGPPRGSLKRAPPFFFFFFFS 
LRKSSVIEK 


7566 


21467 


A 


7626 


2 


269 


LLGELS VLVC* FPPKHTTFLLTGLS ILF 
TCLYSLYILTTTQWGSLTHHI INLKPLF 
TPltNTLMFIHLYPILLLSLNPDIIKKKK 
KGGAF 


7S67 


X1WO 


A 

** 




^Q7 


164 


F P P PKKVF FOKN PKKKFS P PRVF * TP PP 
PPFFLPPPKKKKFFFSPPP*IWPPPGIF 
* KP P PLFFFFFFFFFFFFFFK 






A 


7fi7R 
/oxo 


•l 


174 


DGMNVSTL YGATGSHGLHD I MG3ALLT I 
WY I RQLILHFT S KHHFG FEAAA* YWHFV 
DVD * LFLCGS I Y * *GS PGGPKFSRGGRG 
IAVPHWGDD 


7569 


21470 


A 


7629 

< 


2 


246 


SGLGTTLS * KN I S VLLTGLKi LWG" L YS 
VYI FTATQWGS LTHH I NKI KP * LTRENT 
LMFIHLCPILLLSLNPDISYPGFTS 


7S70 


714.71 


A 

** 


/OjU 


1 1? 


7Qft 


LI SGKGOWTOMRLPVLVTLFTDSFS K* K 

hmf \mf IVWS^ t 1 A A AX * 1 ^ V mm* V A «V *m mm mm* mm* mm ttmf £V £V 

NP STLGEQCGR I T* GQELETS LGNIVRP 
HLY 


7571 


21472 


A 


7631 


248 


2 


KVPASGKRPE PHW YPYHAVPGGGR I PAT 
AI WTGKWYOGFPPALS PRLE CSGAI SAH 
CYFGFPGPSDS PASPS*tiAGTTGTRP 


7572 


21473 


A 


7632 


383 


106 


GWGQCFKPAI PALERPNPQECFRPGVLN 
OP * PONKSPFFFFFFKKKKKYIYICMVV 
HAYNPSYSGG*GGRIT*AQKFKTSLNNT 
VRPHLYKK 


7573 


21474 


A 


7633 


435 


133 


YACLGLPKCWDS RHE P PGLAKFCI FS KN 
RGFPISDRWINSWPRVFG* PKPPKMGG 
FQTLVQGASLGFLLI FLKTGFPFFSQHE 

• T» ™^ • WmmW mmm ^mr mm mmwrnj^ mm> mm W mm ^mr ma mm mm mm mmW mmmmmt 

EQGWDTSSFQHRFFRGK 


7574 


21475 


A 


7634 


1 


209 


NAYRIVILCQKLFPLLSGKMNFKKTRCW 
LGMVAHACSPSTLGG*GGRIA*AQEFGT 
SLGS I LRPCHCKK 


7575 


21476 


A 


7635 


117 


495 


STSFIDKVQLRHSFLFFSLFFETKSCSL 
TQTGVQWHDFGSL* PPPPRFK*FSHLGP 
LS 3 WD YRHAP PRPAS F FVFLVEMRFHHV 
GQAGLELLTSGDPPASA5QSAGITGMSH 
HALAHLYFSSGKS 


7574 


21477 


A 


7636 


1 


241 


LPGSSDSPTSASQVAGITGMCHHAQLFF 
FLAF* L I FNF* FFFGNSFFFVPQAGGHW 
GIGG* LNPPPPGLKRFSCLTLWGG . 


7577 


21478 


A 


7637 


3 


93 


EAEAGESIiEPGDRGCSKLRSCHYTPAWQ 
QSVTLVS KKETKNTKTSEVP FGVLNLLY 
KVSINILIFKEIFYQAPLILV*DNPDSS 
TLSSALQPGQQSKTVSNKTKQKTVS*PG 
NRV 


7578 


21479 


A 


7638 


46 


228 


GIPHIQPSKSVFNNVHGPGTVAHTRNPN 
TLEY*GGSIT*GHEFETSLGNIVRLPPP 
SLQSI 


7579 


21480 


A 


7639 


1212 


.1023 


KNHSNEWIKKM*YICTMEY*AIRRNEIM 
AFAAT*MKLETIILSEVTQEWKTKYCMF 
SLVSGS 
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NO: of 
nucleotide 
sequence 

— 
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SEQID 
NO: of 
peptide 
sequence 
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M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence <A=Afanine 
OCysteine, D==Aspartic Acid, 
B=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysfne, U= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X«Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possibte 
nucleotide insertion 


7580 


21481 


A 


7640 


479 


318 


AEFLNLGDGGCSEPR*HHCTPAWATRAK 
FHLKRKKK 


7581 


21482 


A 


7641 


470 


172 


RNIRPRENQYPVFMVAFFLIAKNW*QPK 
CPSSEEAMNKLRCIHTTAVLHSSEEEQP 
TDTHNNSDLRS IAPS WRSQTRAAAHRMI 
PWHSGKGKTPSTENA 


7582 


21483 


A 


7642 


2 • 


201 


AGAPPPAWLPPCRLISDC*ASNQRDSVG 
VGPSEPYATVRKYLELLLCQMHHDMCTY 
RFSIRIVLNL 


7583 


21484 


A 


7643 

• 


521 


385 


GGSPLETH* YTSQGGG I P YTDLTGHHPS 
QGR I QEAPKLTHL 


7584 


21485 

i 


A 


7644 


331 


3 


DLVPKKGGGGKKTTPRIKKFPPKGEDFP 
FFTPHYFWFVWFFWGGAFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFT*DYYIS 
F*EERFNKFYRCQS TLVQIGHTSNL 


7585 


21486 


A 


7645 


88 


367 


HCS FIiLIiGKKKKKKKKKKKKKKKKKKKK 
KKKKTKKNEYKNLLGGAPP * KP P SGALY 
KYRRENPIPFSFERVSKYPPRDLCKKNL 
FWGGLNIPS 


7586 


21487 


A 


7646 


378 


37 


FFFPXFFFFFSPPPXFFFLFXFFXXPFF 
XXSFFSLPPLPLXX£PPPPPLFFFFFFF 
FFFF FFFFFFFFFF^FFFFF FFF FFFFF 
FFFFF FFFFFFFFFFFF FFF FFFNI * T I 
VL 


7587 


21488 


A 


7647 


34 


293 


TG I PGVI S AI FM I + EAFVS KRKVLIGKK 
PSINLE*LYGCPPPYHTFEEPVYIKSTQ 
KRKESNPPKLGSSQPHGLHDFFKEKKKI 
IK 


7588 


21489 


A 


7648 


22 


106 


KNFLPLTLTLIi* KNFLPLTLTLLI * YGS 
IPITISSIPPQT 


7589 


21490 


A 


7649 


1 


231 


FDRT I S KD I S K* MVLVSKKKKKKQKKKK 
KKKKKIOCKKKKKGGGPFKKTPGGAPKNP 
GEEKI IFFFLGGGKKNTWGFF 


7590 


21491 


A 


7650 


79 


286 


WSRCLVSLRNPCI IKTCS*AKKKKKKK 
KKKKKKKKKKKWGGAFLKIPWGGPIFPG 
GGKFFFFFFGGGY 


7591 

* 


21492 


A 


7651 


57 ■ 


375 


SGKSEFHRVPQWPGTGADACNPSTLGGR 
GRWIT*GQEFETSPQGDPISTKKKKKKR 
GALL KDS LGGPNL PG FGNLKVFS FRGG I 
LKPTWEFWEGTP I LGGEKIGPN 


7592 


21493 


A 


7652 


3 


357 


LAFLLALSKITHAS I PIPVSSPSKSPRS 
KGTKKKKKKKKKKKKRGGPLKKTLGGPK 
INGGKKKKIFFFKGGEKKTPGGILEKKL 
FLGGGKMGPNPPKKIKPLGEKKNF*GEK 
GEKKP 


7593 


21494 


A 


7653 


113 

■ 


378 


MGAFNLGPFIiLMGGGNPSGPNGHWVLGA 
GSYLGPGKTPPKKSQKGGPPPDL*GRGG 
QCSPGTKPGGGGEKTGLAPP FTKS PLGF 
LQKK 


7594 


21495 


A 


7654 


341 


147 


NFFFSLKPFIFFGGFCPIFPPPKKSFFS 
KIPPWFFSPPF*EKIFFFPPPLNFAPP 
RVFFKGPP 


7595 


21496 

« 


A 


7655 


395 


3 


FFFFFFFFFFFFRGKSE*FVLFPVSPAP 
SLRG 


7596 


21497 


A 


7656 


2 


162 


ES KGYTHTHTHTHI YTHI YTHTYTPREH 
* PKKADVAMLILNKLFFKPRCVALL 


7597 


21498 


A 


7657 


333 




IiEFFFWFFFSYFLFFS * RVWGPERPPPP 
LKKAV*KFFIiFDI FLFFFFFIFGLGFFF 
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peptide . 
sequence 


Predict- 
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nucle- 
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ding to 
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peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leudne, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y«Tyroslne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=posslble 
nucleotide insertion 




■ 






• 




FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLEHIRLLMFVTFSSHR 


7598 

• 


21499 


A 


7658 

• 

* 


1 


137 


HSSI/3DRVRPCLKKKKKKKEKKKKKKKK 
KKKKKKKKKKKKGGGLF*EKKKKKKKKK 
XKKKKKKXKKKGGGLF 


7599 


21500 


A 


7659 


236 


32 


1 1 FL I KFLYF* FI F I * F* FFLTFFYPKR 
LKILFFFFFFFFF FFFFFFFFP * KNP I K 
NI I YFFI FYLFYYLKFFI YI KTFLFUCI 


7600 


21501 


A 


7660 


301 


65 


KAPPKI FWFFFFVFPFFFVSFYI FFFFF 
FLFF FFFFF FFFFFFFFFFFFFFFFWKS 
LAAT* FLRGFETF*LAYILKTT 


7601 


21502 

mm m mf\fmm 

1 


A 


7661 


70 

9 X* 


273 

•** ¥ *m 


KHS P PHKPASD * NTE LTI KKKKKKKKKK 
KKKKKKKKKKKKKKKKKGGGALIKNPWG 
GPIHRGGEKFFFSFLRRD 


7602 


21503 


A 


7662 

9 \/\/4i 


1 

m 


353 


I LI INTLIiALLLI I ITFGLPQLSGCLQK 
STPYBGGSDP I S PARGRFSLKGP LQAMT 
FLL*ELEIALL* PLP * ALQTTNLPLIDM 
GCLLLNI ILCLS LAYE *LP*GLDCSRYH 
SQR1Q 


7603 


21504 


A 


7663 


1 


131 


F I RQL I FHFTSKHHFG FEAAA* YWHFVD 
YA+LPLYVSIY**GS 


7604 

9 VW ¥ 
* 


21505 


A 


7664 


2 

mm 


192 


PLSQSLY*LLAADLLILT* IGGQPVSYP 
FTI IGQVAS VLYFTTI LILI PTI SLIEN 
KILKWAY 


7605 

■ ■ 


21506 


A 


7665 


356 

* 


2 


TFFFLSS PGGSCGSPTPARPKKNPPPWG 
GGLPLFSRRGAI/FPKNFFWGGYP FFPLF 
* KKPPFFPPPGPQSPVTSPKDWPPLRM 
PPPRPHVRPLGLPKKS FSSPRWEKQVKK 
TKKRAA 


7606 


21507 


A 


7666 


312 


2 


GFFFFFFFFFFFL*DRVLLLLPRLECNG 
VISAH 


7607 


21508 


A 


7667 


1 


233 


FWLSSNAEFDSANS CECLEVQRMIQDQD 
SFPTYHYFDMYVCIYV* *RSLAVSPGLV 
SNSWPQAILQPQPPQSLGriQE 


7608 


21509 


A 


7668 


174 


295 


I FFLPFCGEHSLAVLPRIiVLNSWTQAIL 
PPLPP*ILGLOA 


7609 


21510 


A 


7669 


183 


330 


NKFKL YWS PVAMAHACN PRTHRGGN IT* 
CEEFETTLANMVNMVKPCLY 


7610 


21511 


A 

- 


7670 


3 


319 


TSNTLLAIiLIiIIITF*IiPQLNGYIEKST 
PYESGFDP I S PARVPFY I KFFLVAITFL 
LFDLE I ALLLPLP * AVQTTNLPL IVMSS 
LLLI I ILPLALAYE * LQKGLD 


7611 


21512 


A 


7671 


1 


313 


ARGERERERERERERERERERESGGGGP 
TQTDCKGGRNT*RGREIYRESE*DDRPP 
FLPTYRVNLQRPVGLRRLKOAGDKTFCL 
I LTLARDYVWPD YRMKRADHM 


7612 


21513 


A 


7672 


3 


91 


TRRERERDR* REREREREREIFREKNSQ 

s. 


7613 


21514 


A 


7673 


1 


115 


DELANLFI I * KAGFPVLPRLVLNSWLQV 
ILIiPWPPRLN 


7614 


21515 


A 


7674 


3 


232 


TRRERERERE RERERERE RERERERERE 
RYRE*DRERERERBRDCVWGGGVISLSR 
AVAIjSGAP AVGRQTRER I S R 


7615 


21516 


A 


7675 


1 


234 


ARG ERE RE RERERERE RERERERERE RE 
RE X ERE RERLTQRERERERGRG VWHAP L 
AIERG*KPQGGFWWKEERVKKEPLGKLL 


7616 


21517 


A 


7676 


1 


262 


ARGERERERERERERERE RE RERERERE 
RE RERE RERE RERB RARAPQNL * ERW AH 



1018 



WO 01/64835 PCTAJS01/04927 



SEQID 
NO: of 

IIUUCUIIUv 

sequence 

• 

- 


SEQID 
NO: of 

rtpntirl a 

|JC|JtlUC 

sequence 

• 


M 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
niir Ipntfrfp 

location 
en rrps nnn ri 

vui ■ upvuu 

ing to first 
amino acid 
residue of 

peptide 
sequence 


Predict- 
ed end 

IIUUv^ 

otide 

InroHnn 

correspon 
ding to 

last amino 

C1IIJIIIV 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

JT— (~Z | 1 1 f-n m i r Arirl P^PhpnvIalanin^ 

Li^VJlUltlllllL ZtLIUj Jl * 11 VllJrltliaillllC) 

G=GIycine, H-Histidine, Hsoleucine, 

K=I/v«fnp 1 .piirlnA IVT=IVlpth inning 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Argiriine, S=Serine, 
T=Thrennine V=Valine. 

A ■ III Will II w, T T OIIIIVj 

W=Tryptophan, Y D Tyrosine, 
X^Unknown, **=£top codon, ^possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














j> xr xr^urv? v otto *»xvC»jj a. c\u\3c rn\JM\JBiwn 

GRLGHPPPSILRAKGG 


7617 


21518 


A 


7677 


214 


2 


QQASVAQAGVKWPDLCSLQSMPPRFK* F 
SdtS LLS S WD YGPPP PCAGLH I WRRNVT 
LALQGLPTTS LALV 


7618 


21519 


A 


7675 


3 


341 


GGKKRGPIWFLKKFFSPPQKKPPPPPPF 
LRE IGAMAPQPGGGLLI *NKTPFRPRGQ. 
GGPKPPTPGKPNPPPPPKGNFPL*GPNP 
QPPGSPTPPPPQKVIFPFEGHQA 


7619 


21520 


A 


7679 

! 


1 


357 


GTRINTLIiAIjIjIiI I ITr *LPQI»TGY IKK 
S TPY ECX3FDP IS PARVPFS I KFFLVAI T 
FLLLDLE IALLLPLP * ALQTTNLPL IDM 
SSLLLI IILTLSLAYE*LQKGLD*TDFY 
SVGGEAN 


7620 


21521 


A 


7680 


365 


1 


PDASQ * HCTNGQSNRIjIjNLL I KALLTTA 
KIWIQPKCPPVDARIKKMQ* IPRMEHPS 
AIKKEILPSVATWMSLEDTVPSRISQAQ 

KDK Y H 1 V o UM WftiiKHuNAlS FKlaKFV V FK. 
GVQGWGAARA 


7621 


21522 


A 


7681 , 


387 

■ 


3 


FFFKKEFCPRKKAKK*NWGPGNLGPRG* 
KNFP PPP PQKRGKKdGPPQGGE ILGFL * 

KKGFPPMGRGGSKLWPIjGFSPLNPPKRG 

r* wt?/^t?T3 Tin d daopvott 1 coo irr* T ireoT?i? 
O JSJ\vjC< F F F F FUKCi JvFvjoF Frltbiirrrff 

FFLRQSLTLSPRLLV 


7622 


21523 


A 


7682 


107 


290 


ELNKRWGLGAVAHACNPST* *ELNKRWG 
LGAVAHACWPSTLGG * GRWI T * ALEFKI 


7623 


21524 


A 


7683 

• 


1 

* 


348 


GTRERKPTWLWYHRERGMERDCSCVPGS 
Z)Kj X bcbK V W V y V vjInB V I FA1 1 1 JjuuEFK 
YLVISEFSPAFRIWFLGLSVYAHTCNPS 

FFLF 






TV 


fO&t 




77 


ftttPS T PPRIiAIiTjflGGGPOnPPLPPnNPS 

FFNFFVKKGVFFPPGVFPPLPPGVLPPP 
PPFWAPG* PRPPNFFLGAPGFFPFFFIjG 
GGFFPVAPGGV 


7625 


71526 


A 


/UOJ 




2d 


HEET 1 1 OVKREPVE* KKI FANPTFNKGL 
TS E I YKKQLNRKKPNNP I * KQS S K 


7626 


21527 


A 


7686 


1 


125 


GTRPGMPGTYSNYPDTYTA* KX ISS IGS 
F I SKTAFQ PGP WB 






A 
*i 


7687 






LS IjP S S WNHMCAIjPHPASFFIi * ROGtAIi 
L P RL VS KS WAQVMLL P W PS KVLGLQ V 


7628 


21529 


A 


7688 


374 


248 


FPHHNVHEVIVRPLSPRYPIISHVTCTY 
RSRWP*ASEGSOKK 


7629 


21530 


A 


7689 


284 


163 


GTVAHTCNLSTLRGRGGWIT+GQEIDTS 
PAWPPKVLGLQA 


7630 


21531 


A 


7690 


83 


255 


KVD YVS I KS E FF * DRVS LYH PS * GTVAR 
S * LTTTPNPGIKGSTCPTVPSS WDYRHV 
LPLPTRTIFSVPTQSLATILI * ITLNLY 
INLGKIHFL 


7631 


21532 


A 


7691 


3 


193 


HERLD PADFS FNFFS VDVGLAMFPRLVL 
NS * HQVILLLWPLTVLDYRHBLLRPAP^G 
FYYIOTT 


7632 


21533 


A 


7692 


1 


364 


NFKSFFP*LFYLYHLLFFGFGLFLFFLI 
KLGSSLF INFS KKS VL * FTNWI YFSVLY 
FTDFCFI FI I SFFGFSLI YFI IAVFPLK 
FIGFII*SFI*ITVFYGQAQWLTPVIPIL 
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SEQID 1 
NO: of 

ii uvicvuuc 

seonence 1 

1 

'i 


SEQID 
NO: of 

seauence 

■ 


M 
eth 

nd 


SEQID 
NO: in 

09/51 5.1 
26 

■ 


Predicted 
beginning 

II ULIWUUc 

locution 

correspond 
ing to first 

^ allllllU DUU 

residue of 

peptide 

sequence 


Predict- 
ed end 

u uc ic- 
nti He 

location 
correspon 

riino tn 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A >= Ala nine 
OCysteine, D=Aspartic Add, 

G^GIvcine H— Histidine I«ko!eiiHnA- 
K=Lysine, L=Leucine, M=Methionlne, 
N=Asparagine> P=Proline, 

O^Gliitflmine. R=Aroinine. S=Serirw» 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 

I1UVICVUUC HloCJ HUH 














WVAEVRGSL 


7633 


21534 




7693 


370 


151 


GKKYSVASPGKLWPPQGFFKTAPPFFFF 
FFLK*ROGFTMLPRLVLNPWPQGILPPQ 
PPKVLGLQVLRHHIWLR 


7oi4 


21 jjj 


A 


, 7o94 


3 


J/J 


QHFLGLSGMPRRYSDYPnAYTT*KILSS 
AGSFISITAVILIIFMI*EAFASKRKVL 
IVEEPSQSAGITGVSHCARAEYLFIDRR 

TV2T .Q T .PUDf^UT 


7635 

- 


21536 


A 


7695 

- 


406 


394 


KFF*KKFLFPPTPTFIiLBACLFPPLPPI 
KGFLPKYIAPGIKTPPPKKEKFFSSLK7 
IjF btfitxc r if 1 IrFFFXr r r If r r r r csbeb 
FFFFFEFFFFFFFFFFFFFFFFFFRFEG 

PPWPT.* W* TTVT? VPS?KT?GDUP T D 

W r I f I f F xvCjO rAV. X XT 


7636 


21537 


A 


7696 


2 


334 


I L INNTFLALLLL 1 1 TFWVPQLKGY I KK 
STPYECGFDPISPARVPFSIKFFLAAIT 
FLLFDLEIALLLPLP*ALQTTNIiPLIVM 
AS LLL 1 1 ILALS LAYE * LQKGLD*AB 


7637 


o i act 

21538 


A 


7697 


2 


95 


T KTT TTVTTT T"TUTI T\ TTtT T T vtt MOOTTflT t T 

SRT*NKLT*LTI YI I LTTTAFT J iLNLNS 
STTTLLLSRT 


7638 


21539 


A 


7698 


423 


326 


RQCLTMLPRLVSNS *SQ VI LLPWPLKVL 


7639 


21540 

• 


A 


7699 


400 


32 


FHKEYESYFFSPNQPQFFFFFPHR*NFY 
VGVLKKQPPKKKFFFLLFTPERFFFPFL 

+ VIfMTCP17D4 Vl?t?17DT TTTT7T7T #BDT)T PI? 

FFPPPPFFFFPFFFFFFFFFFFFFPFFF 


7640 


21541 


A 


7700 

♦ 


125 


407 


FINFSSTFWKPTTCNMQKHTPIARTKD 
LCIT I F FFFLB KNFLFVPQVGGQGG I LG 
*LKILLRG*KQFSCFTLPGRWTNGGIPP 
PPKNFLKKFF 








/ /Ul 


5U 




RIA*AQEFETS*VTQ*DPVSVIKKRKKN 

Civ 


7642 


21543 


A 


7702 


2 


146 


EIALLLPLP*AiQTTNLPLIVMSSliLIiI 
IIIiALSLAYE*LQKGIiD*AB 


7643 


21544 


A 


7703 


1 


93 


ILI INTLLALLLLIMTMGLPQPKGYIKK 
GTP * "RCG PHP T PPAP VPPQ T TCT PT .T ATT 

FLLFDLETALLL PLP * ALQTTNLPLIVM 
ASLLLI I HjSLSLAYE*IjOKGLD*AIiI*L 
LIMTMGLPQPKG YI IOCSTP 


7644 


21545 


A 


7704 


2 


112 


GRVGKHHFGFEAAA* Y WHFVDVG* LFL Y 
VSIY* *GS 


7645 

■ 


21546 


» 


7705 


1 


325 

- 


TAGQFLPKLSILLSYNPAITFLGIYPKI 
LKTYVYMlCrCTT^FIAAIjFIVVOTWKOO 

S KLWY I QT I KYYSVLK * NELSS YENTS K 
KLRCILLRERSOYKKPPYFLIPTM 


7646 


21547 


A 


7706 


2 


169 


SRSRAGTI*AI*TINLPCTLMIFTILIVL 
TILEIAGALIQAYVFTLWSLYLHDST 


7*47 


21548 


A 


7707 


263 


346 

• 


MLINVPLGLLFVGVI LS KES PS VDQGGV 
QWINLFSMQPPPTGFK* FSCLKA* ASDN 
IiSPHEQYRLALSFLKIjTL 


7648 


21549 


A 


7708 


343 


163 


PKEF* *RQGFFHVGQAGFKLLRSGMPPP 
S ASQNG K I TG VNP LAWQTNNS I P PMAP 


7649 


21550 


A 


7709 


459 

■ , 


144 


FWPGAS SHAFDPTTLGGRGGRIA* AHBF 
KTSLGNIVRPPSDTCNPKVLGLLA 


|7650 


21551 


A 


7710 | 2 


83 


NFLPXjTLALLI*YV3IPITISSIPPQT 
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SEQID ' 
NO: of 
nucleotide 
sequence 1 

■» 


SEQ ID 
NO: of 
peptide 
sequence 


M * 
etti 
od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 

• 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanfne 
OCysteine, D=Aspartic Acid, 
E=GJutaniic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serlne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X«Unknown, *«Stop codoo, /"possible 
nucleotide deletion, \ppossible 
nucleotide insertion 


7651 


21552 


A 


7711 


19 


181 


KYVHTKACT^MTISSLFIITKNWKQPIF 
SS IGE* VNKT* YIHTMBYLLFRNKRK 


7652 

■ 


21553 , 


A 


7712 

» 


392 

- 


49 


SDIFNSDIFSHY*CSVF*SYIAYNRYLS 
FL*LFLEFIYSYP*IDHAFHCFIiHPVSS 
FWLDPFI»FIFSFFIiFSYFFSYFYF*FYF 
*FIFNFIFNLFFILFYF* FSFFSKCLKF 
PC 


7653 


21554 


A 


7713 


334 


208 


FFF* EWVLTTLPRLVXiNPQAPAILLPWP 
PKVLGFTGVSHIKI 


7654 


21555 

t 

1 


A 


7714 

• 


2 


408 


WNGNE WNGL KWNRLER YGMQ WNGLE WNR 
TB * TRMVWHAME WTRI BLNGLE WNGMEC 
NGMHLNGLE WNGMERNGTDPE WNGTDPE 
WNGTDPEWNGTEWNGMEWNEVDSNGMEW 
NGIDSNEMGWTRMEWNGMESTRV 


7655 


21556 


A 


7715 


362 


223 


VWWFMPI IPTLWEAEVGG*IiEPRSSKPA 
WATWODAI FKKKFKHQHP 


7656 


71557 


A 


7716 


2 


335 


III VI INTLLALIiLI I ITF* LPQLNGYIE 
KSTPYECGFDPISPARVPFSIKIFLVAI 
TFLLFDLEIALLLPLP * ALQTTNLPLIA 
MSSLLLI IILALSIAYE*LQKGLD*AE 


7657 


21558 

• 


A 


7717 


2 


142 


SITL*LPQLNGYIEKTTPYECGFEPICP 
ARVPLJSIKFFLEAITFLLFDLEIATiTjLP 
LP*ALQTDDLPLIVMTSLLIjIIILTIiSL 
AYE * LQKGSD * AYMPRPRPFVHKILLRS 
YYLLII 


7658 ■ 


21559 


A 


7718 


6 


92 


FFEAAA* YWHFVNW*IiFI.YVSrY* *GS 


7659 


21560 


A 


7719 


417 


164 


PLEPLI9GRGLPQIAPPPKKGSPPKSPR 
WFFFPPP*KKKNYFPPPPKILPPPGFF+ 
NPPPPFFFFFFFFFFFIFFYFFKTALPL 




71 5ft t 


A 

** 


7770 


75 


"TV J 


TENS * DPNWD * I PHYA* P* TSTVKKKKK 
KKKKKKKKKKKKKKGGGL I KKFRGGPKY 
TGGGKI I FFFFMGGKKKPLGDFLKKNFF 
LGGGNIXSKHPPKKLSLQKKKKNFKGRGG 
KKTPLCRRGKKFS 


7661 


21562 


A 


7721 


388 

- 


2 

* 


APPFFFYFFPFI FFFYFFLGVGFSFK*N 
SKVFWISNFLKKILRVFICSLEKGINPF 
LEKCFYMFFFFVAF I IRLNFSLQAKFFL 
LNFALIiPFPEI FFFFPFFFFFPQIEDFW 
HPYIEQVYQLSLFSLS 


7662 


21563 


A 


7722 


389 


71 


FFLPHQKQVFFPPPPFKIFFFFSRVFFF 
WGGWAQKPPPPKKFFFLKTPPGFFLPPP 
♦KKKFFFFSPRFFLPPPGFFFKPPPPNF 
FFFFFFFFFFFFFFFFFFFFIiN 


7663 


21564 


A 


7723 


77 


220 


TPRRGWAYWLMPAYPAILEA*AGGLLEP 
KSLRSTWPTWRNS ISPKRK 


7664 


21565 


A 


7724 


16 


258 


ISSFVFKRRSFAMFPRLVLNSWPRWLP 
PRPPE *LGLQACTTTPSFNFFDQ IFLRP 
LKEDVTARDQRE I L YVFALFYYGGA 


7665 


21566 


A 


7725 


353 


in 


I FGAGPRFVFKAGGQGANSRSLKPFPPG 
NKWSFPFSPSKKWEGRG*PPRPPFFPPF 
FFIREGVSLCCQGWSQIPGLKQSS 


7666 


21567 


A 


7726 


2 


185 


MSMGHTRLS SAWTG KPPLS VEDDFEKL I 
WEISGGKXjEAEIDLDPGKDEDDLLLEIiS 
*MIDT 


7667 


21568 


A 


mi 


346 


143 


SQAFLYLESFLLLLLLY* FLFFEMESHS 
VTQDRVQWRHLSSLQPLPPRFQQPSRLT 
LPSSWEAEVGG 


7668 


21569 


A 


7728 


411 


221 


nlrppgs s yspasas * iagi pgarhhtq 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E^CIutamic Acid, ^Phenylalanine, 
0=Glycine, H»Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionlne. 
N=Asparagine, P=Proline, 
Q=G In tii mine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














LIFAFLAEILKNLAKLYLI PRKPNI YYL 
PNCYINI 


7669 


21570 


A 


7729 


2 


303 


AAASTNNLLFFXiORHLWEWLQHMVOLI 
FTFKKLPNQAGRRGSHL* FQHFGRQRWE 
DHLRSRVQDHPBQHSKTPS SLQKNKMS I 
ESNLLNIHKLFSTGRM 


7670 


21571 


A 


7730 


67 


65 


SICEYCFVSSVFVLKRLFSVMQAFSLEG 
CVSCQS*AFLQHYLFKKLYFLY*WGLTM 
LSRLVLNTWPQVIHLPWPPKVLGL*AFC" 
GNEA 


.7671 


21572 


A 


7731 


354 


238 


Q S I LDDVAMVLD * QAE VFI VKM WRLL I Y 
VHEAKKIGLVK 


7672 


21573 


A 


7732 


33 


186 


RGALAVLSRL I LT PGLKES S CLDLPKG W 
DYRWEPPRPGCFF*I*MVLVLSF 


7673 


21574 


A 


7733 


1 


181 


VNAGADCSS IGGVP FLQHKKCHGKDYBP 
RGITTLERS YVBETTEHLVSKSK* PLRA 
QINL 


7674 


21575 


A 


7734 


336 


80 


HKLKEPPGVFPVFPFKNLEFGPGPPKFF 
FKKAFSP ILSFFFGKI FKI PRFRGENFA 
P*NFGKNS * KPRFS PPPPKKKGFFFFFF 
F 


7675 


21576 


A 


7735 


2 


163 


TPVSTGTPVPTLT* VPSPI I FPVSEKWA 
GCLHIiCLNFTCTEIiRLLTSLLTIRS 


7676 


21577 


A 


7736 


87 


258 


KAPSVCLFSALLMLLRMSARTSVCTVRP 
LSPS * AVI S PVTCTYTSRWPEATEDSQK 


7677 


21578 


A 


7737 


1 


103 


LDS KG I YMTLQPS KLE PKLEANV2 IREH 
MLENSSRF*RDLYDSAAFKARTKARSKC 
RDKRAHVGEFF 


7678 

- 


21579 


A 

• 


7738 


193 


300 


GGLPPPPFFFFFFUCKRGFPWVTRGGPN 
PPPIiEMGGPTPPKGWNYGGGPPP*FHPF 
GGVGPP I S KGGGLGPPRVTQGKPLFFKK 
KKKKNGGGGKPP YS PFLGGVKQKKGVNP 
KGGGSKKPKFPPPPPPGGKKKKPFLKKK 
KKKK 


7679 


21580 


A 


7739 


150 


17 


GRVAQVWWLMHAI PAIi* KAEAGRSLEPR 
SWTPDWGTWHLPISTK 


7680 


21581 

■ 


A 


7740 


1 


358 


SPPRPPPPPPPPPKEKFKKLENPPPPPK 
NPQKQNPPPKTKKKSPTTTPKKKPKIGT 
PPQTP KKINCS * KKKNNPPENKKGLNYP 
NKPNHPHAPPKEPKQREKPLVPTKKKKG 
PSPKKKK 


7681 


21582 


A 


7741 


235 


147 


F* FFLGDEISLCCPGWSLTPGLEQASCP 
GLPRRWNYRCEP PHSVKTIF 


7682 


21583 


A 


7742 


131 


12 


AASTYGQ* FKICGAILRLMPI VIEFIPL 
KILQIGPVFTL 


7683 


21584 


A 


7743 


1 


184 


NQYPWCFCNH^MGKESKTLNRSGMVAH 
TCNPSTLGGQGGRIT*A*GSKSSLGDKV 
RPHLY 


7684 


21585 


A 


7744 


347 


202 


SLFVETG YH YVD * AYLELLASGS P PASA 
SQSFGISSVSLCTQPMFQFE 


7685 


21586 


A 


7745 


138 


263 


KKRKTYIHTKTCM*MFIAALFIRVRKWK 
QS KCHSADEWINKF* YI BCI CIiS IHQLND 
IWIASTF 


7684 


21587 


A 


7746 


358 


169 


AAAVAERTQNTEKTEDtVl^LWIKK^IY- 

WPGWAHVCNPITLGGRGGWIT*GQEIE 

TSLPTRR 


7687 


21588 


A 


7747 


78 


346 


AIQMLRNQVKIKLPKSPVEIIPFKIVPQ 
FQVQKIKSQNNFLLLFQESQIFFATSTG 
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SEQID 1 
NO: of 
□ ucieo iiae 
sequence ' 

. • 
■» 


SEQID 
NO: of 

mm gm m* ^» *m ^ 

pepnae 
sequence 


M 
etb 

OQ 

• 


SEQID 
NO: in 

09/5154 

• 


Predicted 
beginning 
nucieonQe 
location 
correspon o 
ing to first 

amino 4Ciu 

residue of 
peptide . 
sequence 


Predict- 
ed end 
nucie- 
otide 
locanon 
correspon 
■ •ding 10 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
C=Cysteine, D=Aspartic Acid, 
Hi^oiucamic acio, r^jrncnyiaianine, 
G^GIycine, H=Histidine, I^Jsoleucine, 
wv—uysuitf Lr-jueucine, jyi—iricnjionine^ , 
N=Asparagine, P=Proline, 

("¥— !t#o m in a De A r>nirt inth CsCanna 

Vc^ijiuianiine, jv— Arginine, aeriRe, 

T=Threonine, V^Valine, 
W=Tryptophan, Y-^Tyrosine, 
X s Unknown, *=Stop codon, /^possible 
nucieotioe deletion, v-possiDie 
nucleotide insertion 














CUP DT .C DC * 2Y T T C D\7T<TV'r G P WDRUTT? 

KSQKK 


7688 


21589 

* 


A 


7748 


222 . 


98 


KKKARHGLGAVAHACNPS TLGGQGGR I T 

w VyC/iW J. VK 1 V-JJ I 


7689 


21590 


A 


7749 


314 


52 


IiNl^DLQFFLGPRKKKRNIiin^SSLFVSA 
EEVRLMYCFNKLAILHYF+ FSNKFIVFV 

*VEVTJT T rjUTSTMr 1 0 VCTTiTJ T r* MXT7V MTV KIT/T\VT 

ASK 


7690 


21591 


A 


7750 


371 


216 


YHTHTHTHTHTHTQI P SLPLNLEGPVGY 
LLSCGSIiHLKNL W* KCMRKERKK 


7691 . 


21592 


A 


7751 


3 


238 


FLIQTG FGHVS QAG WELLR * GNLPALAF 
QSAGI TGMSYLCVASSLYLLPPFLKILC 
So IiiiTJU S P Y UT WKVAIjIj US IT V 


7692 


21593 


A 


7752 


2 


247 


NSDIFDI YI IQYIFLIYLI YI IQYI YLI 
* VFLI I FFKFTLK* EFIFI SLS ISFRLA 
F I VYRDVTLLFL F FFS * NFVGDS I S 


7693 


21594 


A 


7753 


359 


2 


TLKIRPIiLYLPLFFFFFF* KKKIFFFPP 
GGGKGV * SGFYLMGRGTFSLFFKRFFLF 
YP P KEGGLRGS P PP PS FFFFFFKKRGFP 
FLARGV* KKGPQGTPLPWIiPKSLGLRG 


1694 


21595 


A 

* 


7754 


3 


174 


CS VIQAGGQWCGHSSLQP * I PGINQS3 Y 
LSFPSSWDYRHATMPS*FLFFIEIKMLQ 


7695 


21596 


A 


7755 


292 


2J9 


IWPIiSHVHS^PLTHSPSQVI^GrjFTTti 
PCLSGiWVQFPIAEMPFISSLLLENAC 
ItSLCEGS EAQ KPLLWEGLS YLP PLNQLP 
LCL*LCVCISGGVCRCLMCILGVCVCMH 
VCVSVCPCTHTLAQETKGC 


1696 


21597 


A 


7756 


2 


195 


VALMADGAIDTESNDYGAJWPIjGIERGL 
DRIWEMPELW1jRPNEF1XWTD*PCIQHA 
PSVSCGIaA 


7697 


21598 


A 


7757 


4 


280 


DHIVDMI TPSFTRRTI AVF* DLNLYI VI 

KGNEILIHATIWINLENMQDEINQTQKD 
IYCTISLI 


7698 


21599 


A 


7758 


352 


107 


r Li rRAoRGHNGhTI VT5WPGAV AHVCN P 
SSLGGRRRWIT*GQEFKTSIiANMVITFF 

loluVoiilKr udyr rlounrXJlZAi VuO 


/077 


9 1 Ann 




T7CQ 




** 


Y 


7700 


21601' 


A 


7760 


92 


4 


RPFJUIRGGWTA* GQE FKTSLGNMVKPCL 
Y 


7701 


21602 


A 


7761 


328 


124 


PLFSFLFFPFFFFFFFFFSRRDGVLPCC 
HTAHHNF 


7702 


21603 


A 


7762 


1 


167 


VDSTDKRPGAVAHNYNPSTVGSQGGQIT 
♦GKFOTSLANTVKLFKKKKKRKRGALF 


7703 


21604 




7763 


309 


226 


RPRRKJj 1 tioPHli* CSGAlbAHt-KJjRliPG 


7704 


21605 


A 


7764 


349 


7 


QVFCFLFFVCFSSIFMTFNE*QGI*KVTS 

EYGSPEQHNLGIVRNASSRLGWAHTCN 
PITLGG*GGWIT*GEGFETSLANMGKPL 
LY 


7705 


21606 


A 


7765 


43 


339 


F FF FLFFFFFF 1 1 FLLLSF FF * DHFFVS 
LLFI SNFVFLFP I Fl FTVFQAHLF ICLL 
FFP I CM FLNLFFPNFNYYFYPFPIiFFYL 
LFFFFG* FFNHLFYL 


7706 


21607 


A 


7766 


352 


102 


HTSCLIIKSSLSKMNISCVSSSISFFF* 
FILIEIGSR*WOAGIiQLLGSTDPPASA 
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SEQID 
NO: of 

n it rl f>f\t iH f 
IJ UCJvU UUc 

OCUUvllvG 

• 


SEQID 
NO: of 


M 

eth 

od 

OO 

- *» 


SEQID 
NO: in 
1ISSN 
09/515 1 
26 


Predicted 
beginning 

II If LlCVUUC 

correspond 
ing to first 
amino acid 

■ CblUUc III 

peptide 
sequence 


Predict- 
ed end 

IJUGJC" - 
uuuc 

location 
correspon 
ding to 

loot aminn 

acid 

residue of 

|JcpiJUc 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 

Vz^fl 1 ii tci m 1** A/'irl — P kpnvlalonmA 
j-r^v»iu Lamii. /«CIU) r*— riiciij'iaiHf/luC) 

CJ^filvfin^ H=H!«Hrtin^ I=Tcnfoiif»iri» 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparaglne, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

T= T Ki ronn in a VsVal in a 
M A Ui vU 11 111(7) t — ▼ allOVf 

W=Tryptophan, Y=Tyrosine, 
X«Unknown, *=Stop codon,/=possib)e 

nucleotide insertion 














SQSARITGLSHHAQLSSTSLWENVEW 


7707 


21608 

■ 


A 


7767 

• 


346 


90 

• 


TPPSSPFSFPPFPFFSFIPFRFFFFSNR 

urv v jj v t»» x JL/jWo iraljZv Ooyf ULrRvW 

DFRCGPPCLTSTAFYIS ITVNFSPSNCY 
e 


7708 


21609 


A 


7768 


278 


87 


AASTGVGPSEPSAGYYLLVCRLLRSLEK 
PCASRVR 


lino 

+ 


OKI A 


a 




c 
J 


0*70 


KKKKKKKKNKKKKKKKNKHYE* FNTFPG 
V7ARKINLFLCIQKVDAGGGTLWGGGPPI 
FFFFL 


7710 


21611 


A 


7770 


198 


33 


QRPQSRQCRAWLGMVAHAYNPGTLGG*D 


7711 


21612 


A 


7771 


97 


410 


A*WLMPVIPIIWEVKWRLLEPGSLRPS 
WATWRKP 


7712 


21613 


A 


7772 


5 


415 


I LCVYLHFVGTH P FVHL * YMH FLPLTLA 
LLI * YVS I PIT I S S I PPYT 


7713 


21*14 


A 


7773 


1 


342 


WRVTSGHSG * AAAYAT * Y I LS AEGS FF 
PLTEELL I * FMKREAFT * KR * VLI IHEP 
SINLD*LYGCPTPYl!TFPDPVYLNSKRR 
RRDSHP P KLASRLPHVLHD r ¥ KKKKGGR 
RF 


7714 


21615 


A 


7774 


398 


122 


SPPPPFPSSPSFFLPPPSSSFFLFFLFF 
cr rrrrr r if ire re r b rr rrrt.Kl>*VbIiU 
CPA* SQTPGLK* YS YHGLPKFWNYWHBP 
PCLAYFL 


7715 


21616 


A 


7775 


1 


336 


FNFLLI I I IEMESRS VSAHCKLSLPGSS 

NSS AS AS RVAGS RGAHHHTWLF FFFFFF 

■ 

r * KKKGAP PGGQG G r L lliluUCN P P PRAP 
QRGG I NGGTPRPQNGGGKFFPWGEKKKF 


7716 


21617 


A 


7776 


1456 


1232 


FAIESHCVTQAGVQWCNLASLQPSPTBF 
RDRVSPCWPGWSKTPDIiK 


7717 


21618 


A 


mi 


400 


2 


NTEFPYGPPISLLGM*P*EMETYIPRKP 
CTQMFLEVLFTIAKK* KKPKRS * LING* 
ILvYPLMEYYSATKRFDVMIHTKNVDAI 

f21\TT TT . C PP D OM * VTOTTTVC T ITD 7A Q PT 7A T? 

VRIKVSYKRAATW IKS ILIA 


//lo 




i\ 

n 


T7HQ 

/ 1 to 




Ho 


VI LSLWP PRVLG * QRGGRVEQRGARGNW 
MS KCPEGGDCLCVTVAGA 






n . 


1 Hy 


i 
1 




pp vp q r <5Vi\ wanvnMHnHPnPPPT/5C! on 

DPSVSASCISRITGASHCTQG*L*LLTK 
VL*VSAFCLSVNTLRLTSLLKSDTFLGS 

SLFPRFLVKHKK 


7720 


21621 


A 


7780 


372 


219 


ATSLLS YFKKLPQPPQSSATTTI ISQ*S 
STSRGDLPPAKRLELTEGSDDP 


7721 


21622 


A 


7781 


358 


145 


I KNIHDS WEDVKI STLTRV*KELI P VLT 
DNSEXjLKTSM*EVTAEVVEIARELELEV 
E P KDVLVQFHDT INR 


7722 


21623 


A - 


7782 


317 


146 


GRVX>CKCPFYRFQMLLVRI^MVAHVCNP 
STLGGRGGWIP*GQEFKASVSRMPRPHL 
Y 


7723 


21624 


A 


7783 


13 


186 


DRVSITQTSWAHPPTSAS * VAWTTGMHH 
HTWLNFVFFVEMGFHHVGQVGIRLPLLS 
WK 
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SEQID \ 
NO: of 
nucleotide 
sequence i 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino add 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
OGIycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M m Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutarnine, R-Arginine, S=Serine, 
TeThreonine, V-Valine, 
W=*Tryptophan, Y^Tyrosine, 
X»Unknown, * e =Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7724 


21625 


A 


7784 


339 


221 


KWDLAMLARLVSNSWIjQGSSYIiSL*NCW 
DYGS EPLCPAL 


7725 


21626 

i 


A 


7785 


355 


3 


FGNS G R VDL F FF F F FLDRVS LCCPQCSA 
VTNLEVHHSALQPQTPQLKPSSHLSLLS 
SWDYRRVPPHPS *GLF* *GK*YSITQLR 
FLNLHLESGLRSLLPKPVASTVRIRTQS 
GLGLK 


7726 


' 21627 


A 


7786 


1 


201 


QQFSHVFRLLFQFSFSCQEPPSSHIQPG 
VCTROTKVSQAW * CVPVI PANREAETGE 
SLEPGRRSTRP | 


7727 


21628 


A 


7787 

• 


270 


1 


RPRRHLVPCVLMLGPKCPGLPHTLTTLP 
YTLTTPSPTRSLGPPECK*SEm*SFPSS 
WDYRRMPQRLANFCRGRILLCYPDWSSS 
PGLKQS 


7728 


21629 


A 


7788 


229 


2 


DG WVR I NWAQ PGTVAHAYN I S NLGGRGR 
RIT*GQEFKTSLTNMEKPCLYSDHBVRS 
LRPARPTWRNPISTKNTKM 


7729 

» 


21630 


A 


7789 


118 


263 


CSKSYYWPDAWHTCNPSTLGGQDGWIA 
♦AQEFKTSLGNIARPPISVF 


7730 


21631 


A 


7790 


246 


359 


TF I FS ETRSHS VTQAGV * WHDHGSLQP * 
PPWAQVILSC 


7731 


23632 


A 


7791 


70 


102 


AA*SRLTATSLSRVQAI PLPQPPEQLGG 
QGGRXS 


7732 


21633 


A 


7792 


7 


312 


FLDFQIiRLHSNSYCEDKGVFDSEIYEI C 
IVLICKK*RKFLNQKKXKKKKEKPGLI * 
KKKKNSQKKWFDFPWLPPNPETKNSLFL 
PKEFLWLKTKPPFPIiTLT 


7733 


21634 


A 

• 


7793 


1 


341 


I KPE * YLL FAYT I LRS VPNKLGG VLALL 
LSILILAIIPILHISKRRSIIFRPLSRS 
LY*LLAADLLILT* IGGQPARYPFTIIG 
QVASVLYFTTILILIPTISLSENKVLLW 
A 


7734 


21635 


A 


7794 


1 


143 


YGSTFF VATG FHGLHVI IGSTFLTICP I 
RQL I FHFTSKHHFG FEAAA* YWHFVDW 
* LFLYVS I Y* *GSTFFVATGFHGLHVI I 
GSTFLTI CFIRQLI FHFTSKHHFGFEAA 
A 


7735 


21636 


A 


7795 


2 


257 


KWAI IEEFTKNNSLI I PTI IATITLLNL 
YFYLRL I YSTS ITLLP ISNNVKI K*QFE 
HTKPTPFLPTLIALTTLLLPISPFILII 
L 


7736 


21637 


A 


7796 


22 


326 


RDASDCS FQNLP VPLWVEK* MVFLLTKK 
KKKKK^KKKKKKKKKKKGGGP FKKNSWG 
AKI I PGKKKI IFF FLEGQKKKLWGFFKK 
KPLFWGGKKRPNPPKXN 


7737 


21638 


A 


7797 


506 


281 


RGAN * NRSGCGKRHEERERERERERERE 
RERERERERERES PRPPKRQRERETEXQ 
TLSLTVSIiAPPPTCVF 


7738 


21639 


A 


7798 


435 


336 


MRSNIHFFSHTHTHTHTHTHTF* ILKQH 
TFSK 


7739 


21640 


A 


7799 


411 


106 


RNPPPLFFSPPLQRGGFPPIYWGPPRFP 
PPPPL*KTPPKLKIGAPPKKKKPPPPPG 
EKMVSF*PPPPFFFFFFFLRRPFPLFSP 
GGGPWGHFRGPPPPPPGV 


7740 


21641 


A 


7800 


2 


296 

* 

• 


F VPSTANWCFFLWS VF * TGGSNYFFVHR 
SYSQAGVDLIFIRLANTISNLSFI * *RG 
GLAMLPRLVLETRAQTILHSWPPKVLGL 
QVWATVPS FQFLKN 



1025 



WO 01/64835 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 

• 


SEQ ID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K= Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosme, 
X^Un known, ^top codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


7741 


21642 


A 

- 


7801 


344 


105 


PGFFFFPPPGKRGFFPKPFFIiGTPGFFP | 

PPLF*NPAP*FFFWAPKKKIFSPPPPGK 

KIFLLKGPPLFFFFFFFFFFFFF 


7742 


21643 


A 


7802 


68 


216 


VHTIAKIWKQPQCPSMDD* IKNMRYLYT 
VE* YSVLKBRE ILLYWGHYAM 


7743 


21644 


A 


7803 


407 


3 


GFIH* FPIiFSGYTLDQTYAKIHFTI I FI 
GVNLAFFPQHFLGLSGMPRRYSDYPDAY 
PT*NILSSVGSF ISLTAVILI IFMI *BA 
FASKRKVLI VEEPS INLEWLYGCPPPYH ' 
TFEEAVYIKSRQKRKESNPPMY 


7744 


21645 


A 


7804 


2 


156 


THRRI I ILSQGIi*TLLPLIAF* *I*LASL 
ANKKKKKKKRGGPFKKFPWGAKI 


7745 


21646 

i 


A. 


7805 


2 


309 


THSRI I ILSQGLQTLLPLIAF* * LLASL 
ANKKKKKKKGGAPLKKIPGGAKIKRGME 
IKNFSQKGGEKKTQRGIFGKKPYLGGGQ 
NWEKLPKKFKGLKGKKNF 


7746 


21647 


A 


7806 


110 


362 


TI*SDLERLLMKAVSHFLMIDLLEKC*CV 
LKNQS SKKKKKKKKKKKKKKKKKKKKKK 
KKKKKGGPLLKNSWGGPNFPGGEKIFFF 
FFRGGF 


7747 


21648 


A 


7807 


295 


108 


KQRG FVFFFFEME S RS WQARWQWCNL 
SSL* PPPLGFKQFS CLRIVYRKDSLSFN 
KPRLLP 


7748 


21649 


A 


7808 


2 


142 


GSTFFVATGFHGLHVIIGSTFLTICFIR 
QLI FHFTS KHHFGFEAAA* YWHFVDW* 
LFLYVSI Y* *GSTFFVATGFHGLHVI IG 
STFLT ICFIRQLI FHFTSKHHFGFEAAA 


7749 


21650 


A 


7809 


390 


2 


PASLLHCG*ISDCCASNE*GSVGLGPSE 
PGAGYNLIiVCHLLRPLEKCS irvgvtrf 
SRCCLSPLPLDRKGNSLTPCTSQVRQCL 
ALLQLTLGALHPVSCTHCPTISGEMNPV 
SQLEMQKSPIFCVTHAG 


7750 


21651 


A 


7810 


1 


157 


FLHFGQAALELLT SGDPPAS AS * SAGI T 
GVSHRAQLVCTF I T I YVFLKNS S Y 


7751 


21652 


A 


7811 


368 


44 


QNFP PPKKRLPPQPPLFFFIiVPS FKGEG 
FFFFLRGFFFFS SAQRALL* FI YFIFFF 
FFFSFFFFFLYFSISPDSKGDTHHDLLL 
GVSWWTRSLPLWIARHNMHKWGW 


7752 


21653 


A 


7812 


400 


80 


PQNSFSPPGIGGFFPPFPL*NFFFSPKA 
FFFLGGFSPFFPPPKKSFFFKI PPGFFF 
SPPLKKKIFFFPPPFFIAPPRFFFKAPP 
PFFFFFFFFFFFFFFFKYFRHI 


7753 


21654 


A 


7813 


119 


230 


SEEFETSLRCIVIPSL*KKKKKKKKKKK 
KKKKKKKKKKKKK 


7754 


21655 


A 


7814 


3 

• 


412 


HEQEHL*LLLP**PLAII*VISTLAETN 
RTPFDLAEGETELDSGFNIEYAAGPFAL 
LFIGEYTDI I I RNTLTTT I FLGTTYDAL 
S PELYTTYFVTKTLLLTSLFL* MRTGYP 
RFRYDQLIHLL* KNFLPLSLALLI 


7755 


21656 


A 


7815 


1 


183 


LRUERLS Er STRRERERERE KKkERERD 
TRIDI YI VSQKRKKYIV* IRRNT FVYHA 
EFSKR 


7756 


21657 


A 


7816 


484 


97 


QPRTPDLK* FACLGLPKCWDYRHKPPCL 
ASDGNHS * S SPGLLVLS SASVSP SGHMS 
PSC3QTSPWVSEESLLLGRVPGFLYFPPV 
DAPGPGAWLAHWQAVLKLLGSSDPPVL 
PSQSAGFAGLSNCAWPW 


7757 


21658 


A 


7817 


2 


219 


ADRLRNS * EA* EREREREREREREREPY 
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SEQ ID » 
NO: of 
nucleotide 
sequence 

. ■ 


SEQU) 
NO: of 
peptide 
sequence 


M 

etlr 

od 

- 


SEQID 
NO: in 
USSN 
09/515,1 
26 

* 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
OCystelne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycme, HNHistidme, I=lsoIeudne, 
K=Lysine, L= Leucine, M=Metb ionine, 
N=Asparagine, P=Proline, 
Q=fjGlutamiue, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknowh, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 




* 






■ 


• 


PPSHIYTDRNRH*MRHBERTLFLWGSGE 
VRDAAPWVSHIPNTAL 


7758 


21659 


A 


7818 


190 


292 


NSHLIFPWTKVFFMS*CTY1JLVIjPOSKK 
KFYSHA 


7759 

- 


21660 

• 


A 


7819 

* 


498 


139 


RRAATPAPAAAEPPPPLQRP* PPSEPPD 
TQRDARGRRRGEDPGNSPFHPNPDRQPS 
RALCSTPRMHRLWIGPAFFLMTSLSVSG 
AVI PRNGGPGGVS SGPCLLQLLOGQAGS 
STIRKIPS 1 


7760 


21661 


A 


7820 


489 


330 


ELGFLHVPQAGLELLS * SNPPASASLPT i 
SWDYRHDHRTGHHPGIYDSKMCIP 


7761 


21662 


A 


7821 


490 


260 


FFLXXNXFXFFFFXXFXXPXFXXXFXFF 
FFXXXFFFXFFFFFFXFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFLY 


7762 


21663 


A 


7822 


493 


482 


VGQAGLKRLTSGDL PAS AS QSAVITGVS j 
HRARP I MYFR YVQ * AKGSHV* RWY 


7763 


21664 


A 


7823 


456 


121 


ASFFPIQYKGLGGATPPQEGGCG*GIPI j 
KWGYKRRPGGPHGGSKRPPTQ*KKSYFI 
NVLVLFYIiRDKGLAI PTLVSNSWAQVIL 
PPWPPKVLGLQTRVTVPGQNILKENMFF 


7764 


21665 


A 


7824 


294 


457 


LCIGFVILISYFNIMENWYCRPGTVAHT 
CNPSTLGG* GRQ I TRSRDRDHPGQHG 


7765 


21666 


A 


7825 


26 


235 


SVWWNSQTGNKVKFTISW* IDKQMRCIH 
LMEYYSAIKRNEVIjINATTTWINLNTIK 
LSKRSQTLKSNTIj I 


7766 


21667 


A 


7826 


30 . 

- 


303 


SYVSWEFPGSSGPTLPSPWHQrPRTYL 
NS VASA INLTQCPH * PEKKKKKKKKKKK 
KKKKIOCKKKKKKKKGGAFKKKGP*KTPK 
KTPGGF 


7767 


21668 


A 


7827 


464 

• 

• 


21 


REPPHPAPADILILHFWPLEL*ELISW 
LNTRCWICDDTPRKLTCQPMPRliLLQT 
EPL* VLFSNLGS FGSLSKS *MGLVNFFC 
KGTDS RS FSLFVAVFCLFL * RWGLAMLH 
RLVLNCWPQVILPPPPPKLLGLQVEATT 
HEFHGTE 


7768 


21669 


A 


7828 

> 


372 


205 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFI KKFFI 


7769 


21670 


A 


7829 


463 

• 


313 


DIYPERRNSKIMSFAATWMKLEAI ILK* 
LKNRKPNIVCTHL 


7770 


21671 


A 


7830 


449 


266 


VVQMRFIiHVGQAGL * LLPSGDPRASQVQ 
SAWI TGVSHHAQRS I FFKKRNNDCDRPD 
NTECT 


7771 


21672 


A 


7831 


480 


148 


IELRLSHGAAEFHRPREWYGIFKVLKER 
DFYPRI I YPAKI ILRHEGEMKTFPDKQT 
LRHFINTRPVIjQEMLKKVHQSEIKGC* * 
ATNNHLKVKKKKNSLGW 


7772 


21673 


A 


7832 


23 


131 


QRGNSRGYHLKMIQQEEIRKLEEEKNQL ! 
EGE I IHFYKMKAASEALQTQLSTDTKKD 
KHGKKQ*FL 


nil | 

7773 


21674 


A 


7833 


16 


339 


krs s cpdlpncwdyrhgpp ylas fvllk 
*tlsilllpyyhkmhtcvpmvlc«tpcg 
.gvcpgmellclkvygylslkhtfh 


7774 


21675 


A 


7834 


355 


70 

* 


KKTPRGFSGVFLGPFYKKGPIjFFFFFFF 
FFFFFFFFFFFFFFFLFLQFFILCQQFIi 
S*STEZTVHSKYII*SFPFHCLIFCPFIL 
LCSTFCNGAHY 


7775 


21676 


A 


7835 


494 


282 


&ENGMIQGGGACSEPR* * HCTPAWAKER 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1. 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E>=Glutamic Acid, F=Phen>laIamne, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lyslne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W»Tryptophan, Y»Tyrosine, 
X=Un known, *=J>top codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














DSVSNAKTIQNKKNI/3N*HSISTAKAKN 
IIIS+YKSELRKKM 


7776 


21677 


A 


7836 


305 


185 


LVIETCLTVRPLS PS *AI ISPVTCTYTF 
RWPEVTEESQKK 


7777 


21678 


A 


7837 

• 


179 


191 


LGCRKYLF YLNGGLKCLY I KCVCVCVC I 
YIGVYIGMCVYICIYIYLMAS*NQVTSF 
LSFYVFLLNYR* IPVKVITVLPNLRMTG 
LP 


7778 


21679 


A 


7838 


466 


343 


U3RLRQENHLPPEGRGCSBPRSRPCTPA 
WETEEDAVSKINK* INK 


7779 


21680 

» 


A 


7839 

i 


451 


89 


LRDTR* SGVI CLPKQAWAMVGAPPPASL 
PPCSLI SDCCASNQRDSVGVGPSQPGGG 
YNLLVRRFLSLS EKRS I RVGVNRFSRCH 
LSPLSLTRKGNSLTPGASRVRQCLALLR 
LAHGARTH 


7780 


21681 


A 


7840 


442 


164 

■ 


AHTNQFSQCIKKSVTPDLMEEMYKKAHA 
AI*ENPVYEKRPKKEVKKNRWKCPKMSL 
AQKKHQVA*KKASSLRAQEQAAES 


7781 


21682 


A 


7841 


494 


359 


I CMLPRLVLNS * PQVILLPWPPKVLGLQ 
A 


7782 


21683 


A 


7842 


577 


224 


IFFFFNKI FLFRPGLNLMGDI WVPSTPP 
LQG*TNFPPQPSQEVGLQKAPPPPGLIL 
VFLSRQGFPQLGRVGFHFPPPGDPPPRP 
FKKI WSHPGGGKS P PKKKKKKE IAEVLG 
AFRLR 


7783 


21684 


A 


7843 


404 


56 


HTLS TNVCSS YLNLDFFSLRGRGLIMLP 
RLALNSWAQEILLPQPPE*IiQXjLRRLRQ 
ENLL3PGVQGQPGQHNETPTS 


7784 


21685 


A 


7844 


1 


323 


INTLLALLLI IITF*LPQGIGYINNSTP 
YEGGFDPISTARVPFCIKFFLEAITFLL 
FDLEIALLLPLP*ALQTTNLPLIAMASL 
LLI 1 1 LALSLAYE * LQKGLD * AE 


7785 


21686 


A 


7845 


419 


159 


FLFFFFXFXXYVLSCGIY* YWCNLFSS * 
Q YFFFYVMAFRI FLtTFLFLHFYYNMF* 
CTYFYLSCIj*FEYICSHIIHYFIYFLHL 
ENF 


7786 


21687 

• 


A 


7846 


475 

• 


124 

■ 


FFFFFQPPQKEGGPPPPPKNFFFFFTRI 
FFFGVFFFFFPPQKNFFFF+KTKGVFFF 
PFF* *KIFFFFPGVFFFFPVFFFLSPPP 
SSSFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLKRGQAS 


7787 


21688 


A 


7847 


6 


189 


LIDTVS PSVALAGVQWCNHSS PQP* PPS 
LKRSSRLALPRFWDYRCAPLCKAHSFNS 
NHEK 


7788 


21689 


A 


7848 


20 


249 


KINFIWELTCSNTVHTFYVYGFDKCIL 
PTTQIjFFFL*ETGFHSVARLECSGVIMA 
HCSLDILGSSWRPANFLNFL 


7789 


21690 


A 


7849 


101 


398 


LFQKJGGCKKXKKKKKKGGPLKKKPGGGK 
NKGGEKKKIFFLKGGEKKNPRGNFKKKT 
FFGGGKKGEKPPKKKKSPEEKKKF*RGK 
GGKKSLICWVEKNIiG 


7790 


21691 


A 


7850 


87 


226 


GSLSPIMLVLPFLIHLL*KNFLPLTLAL 
LI*YVSIPITISSIPPQT 


7791 


21692 


A 


7851 


2 


440 


GAIIRRLLH*FPLFSGYTLHQTYAKRHF 
TIIFIGVNLTFLPQHFLGLSGMPRRYSD 
YPDAYTT * NILS S VGS FI SLTAAVLTTL 
MS * EAFAS KRKVLLVEEPS IDLERLYGC 
PPPYHTFEEPVYI I SRPRRKESNPAKLV 
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SEQID ' 
NO: of 
nucleotide 
sequence 

* 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

i 

od 


SEQID 
NO: in 
USSN . 

AO/SIC t 

09/515,1 
ZD 

• 


Predicted 
beginning 
nucleotide 
location 

^•k^tt mmmmmu.** bbl.A m *M 

correspond 
fing to first 
amino acict 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Asparttc Add, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I-Isoleuane, 
jy-Lysine, l^Leucine, M=Metnionine, . 
N=Asparagine, P^ProIine, 

9 ~\ . 1 i-L in Am A mm^mm mm. m mm. A. ^ A** - _ . f _ _ 

V£=L»iutamine, K=Arginine, y=Serine, 

T=Threonine,V=Valine, 

W-*Tryptopnan, Y=Tyrosine> 

X=Un known, *=Stop codon, /^possible . 

nucleotide deletion, V=possfbte 

n ucleotide insertion 














SSQPHG 


7792 


21693 


A 


7852 


1 


80 


SSIXLI I ILALSLAYE*LQKGLD*AE 


7793 


21694 


A 


7853 
> 


5 


135 


ATFLYFS * K* SLSMLPRLVLNSWTQAIL 
PPWTLKVGLCFFDPL 


7794 


21695 


A 


, 7854 


38 


419 


FVN3WPLHSSLGEGVRPYLKKKKKXPPWE 
GGSPPKNFKNQ IFA* NQKPPFFFFIPPN 
KKWGSPPGGGASPL* SHPSGGPGGPNNK 
VKISKPPCPPGGTPVFTKTPTFT+PWGP 
IPEKEGRKTVCPRTH 


7795 


21696 


A . 


7855 

• 


398 

- 


61 

■ 


FFSPRPRGGGFPPPPPKNFFFSPPTIFF 
GGGVWNFFPPPKMVFFLKISPVVFFPPL 
IRKKI FNLS PGS FWS PQGFFF * GP PPKF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


7796 


21697 


A 


7856 


1 


121 


LDASGRLPPSPKPIKIKNYNRP*EPK*T 
KICSLHSLPPQS 


7797 


21698 

* 


A 


7857 

• 


1 


332 


TPPIARPAENSARGAEGRRR* I IALQVC 
NKKWPPTKLHCIKKKKiCKKKiCKKKKKKK 
KKXKKKiOCKKKLWGGGGGKNFFKKGGKK 
KLGGG F KKRGRKKKK 


7798 


21699 


A 


7858 


3 


191 


SPRHLTRAkGIRHEGQTGLKLLTSGNLP 
AS AS * SAG ITGMSHR IGPRMLDLLTLVS 
WKQHGL 


7799 


21700 


A 


7859 


503 


62 


FFFFLNCCLSGLRHLIKLFEPQFLHL*N 
GGMVLSIILSIKSPLRSPSLSYSWL**P 
REKIKWWGGQVRWFMPINPRTLGGGGGQ 
IA* SQEFKS SLDNMVRP YLSB PP 


7800 


21701 


A 


7860 


355 


86 


YI KG *QMVGNRGAS QAGMTG YGMPRLIL 


7801 


21702 


A 


7861 


336 


225 


PQLAGPVKTVRKVYKKEKARVI PEEKKN 
FKAFAS FRKGRANGRVFG IRAKKAKEAA 
KQDV* KKPPIPLAGGSPTREPPSQAHLF 
PQNPPRGNPP 


7802 


21703 


A 


7862 


465 


170 


GGPPPKTHPFGGBRGGKYHRGKKKPPGH 
QRVNP PS PKKKKKNWGRGKNPFTPL IGG 
GQAKKPPQPGRGBCP* KKVPPNNSVLGK 
KKNFFPKKKKKKKK 


7803 


21704 


A 


7863 


495 


275 


RSFFFFFFFFFFFFFFFFLVARK*YYFL 
ATKKKKKKKKKKKKKKKK 


7804 


21705 


A 


7864 


493 

* 


70 


PLTASISLPVNWRYIAPHRVVKIK*KSV 
Y*KYLA*CLAQSKCPKSVCsjHLSSP*tiH 
FVPPPNPAE AMWTARQC* CIRGLGGKGT 
LCVAHTYNPSCSGG*GRRIV*AQBFKAS 

■■t mmtm. mwmmi mmm nm«H mm9 1m+ « b»bv aBBaaaBiB, mmmt^tmrnm «k .^b. ^b b> ^bb b^b» mm* ^b& k ^ 

LGNSEPLILKKERKKESQAWWLI PI IPA 
F 


7805 


21706 


A 


7865 


67 


385 


VARTTGMQHHAWLIFFIFIIFVBTGSHY 
VAQTSLEFLASSDPFTSVPQSTWITGVS 
YSWS*PGIAHTCNPSALGDRGERIT*GQ 
EFKRSLGNIVRPCLYKNNKNKKN 


7806 


21707 


A 


mmt f*l 

7866 


m\ mW m 

454 


0 


LS FFFFP IXSDRVTLCCPG* S AVARSQLT 
VALNSWAQAILLPQLHKHVPPCLANF*N 
FF * RQRYTMLHRLVLNS WAQVI LQPWPT 
CPTAAQA 


7807 


21708 


A 


7867 


1619 


698 


PATSSSSS SS SSSSAAAAVAAAATAASL 
PPCRPPLRPPQPLAPHEVATLAPTOG* K 
PSFTSSGKRWLTSEPQTSGSCASCL*SM . 
RASSPSSG* SKKKPPLPAEAAASYAACA 
VCWRVRAPP YVAATTAYTMAVMGWMAFP 
WE V I WTRWRMMSQAI RPLQTWTNS VTAP 
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I SEQ m 
NO: of 
nucleotide 
sequence 

- 


SEQID 
NO: of 
peptide 
sequence 


M 
etb 

od 

• 

• 


SEQID 
NO: in 
XJSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G lycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionlne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X«Un known, *«=Stop codon, /^possible 
nucleotide deletions-possible 
nucleotide insertion 














S 


7808 


21709 




7868 


52 


187 


LDFN* RRCLTKLPRLVLNS WAQ VILPP W 
PPKLLGLQV 


7809 


21710 


A 


7869 

• 


33 


373 


AGVQWRDLQLLQPPPPGFKRFSCLSLPS 
GWDYRHAP PHLANFWLVEMG FHHVGQA 
GLKLPT*GDPPTLAYQSAGITGMSHRAR 
PGISVFHLKLHSFDPPKI I FRPGAETMP 
S 


7810 


21711 


A 


7870 


100 


119 


VFVBIVFPEGLLFSPCSFHLCGFRIHLK 
AEF* FFVETGSMLPRLVSNSWAQVILPP 
WPPWIi*LQA 


7811 


21712 


A 


7871 


471 


446 


LLMLPRLVLNSWAQVILPPQPPKVLGL* 
AYI 


7812 


21713 


A 


7872 


449 


33 


FSFFFFFFIiGKKDLFFSPRGGGGGANSA 
FFPPPNFGKKGSAPPPPPLKSV*NPAAP 
TGVFFFFCGGGA 


7813 


21714 


A 


7873 


334 


41 


PCPSLFFFFFLRQSLTLWPRLECSGVIS 
AHCNLCLPSSWDYK*HCPYLATGAPGLL 
SQSELS YVTL* LQESRPLCI PVDFMTFL 
PSLCFQSPNSTTL 


7814 

• 


21715 

• 


A 


7874 


342 

■ 


716 

- 


STGI FLFVCF *DGVSTLAQAGVQWPDLG 
SLQPSPPRIiK*FSCLSLASSWDCRHVPP 
LLANYLYF* *REGFTVLAMMVS FS * PCD 
PPTSASQSAGITGVSHCAHPGMFFIiNWP 
LKSCPSESGAAV 


7815 


21716 


A 


7875 


454 


215 


PPPARCH* PP* * TLVHAVEFKSHHYSLI 
SSTQGHKHCGRPQGPLPRKTRDLCSLVY 
LLTFPSLI*SYDPAKSliSARNTQB 


7816 


21717 


A 


7876 

- 


492 


176 


P CGGRGAGGNWG WP P WHLRAS AS S R WAW 
ADQPHTRS SQPAGPTG PRQ* GAQHLGQQ 
LLCSISHQALAAPPQGRAQDLQPAMPEP 
SPRPPLPWAAQ 


7817 


21718 


A 


7877 


281 


439 


TWSIDL*HMAYFLFFS FLFFAFLSFSLL 
SFSLLFFAFLFL*VPSFSLFQRQSli 


7818 


21719 


A 


7878 


507 


338 


LLRRLK* EGHLS PGGRGCSKPRLHHCIP 
HWATE*EPFSKTINK*N*S*MKMKMKIN 


7819 


21720 


A 


7879 


55 


298 


PPCLANFLKFFL*RQSLPMLSRLFSNSW 
PQAVPLPRPPKRNQLYLYFGCCSQLOJP 
QHLCS KQHT FKI LFD I EKHFRLREH 


7820 


21721 


A 


7880 


145 


292 


YFGGKQIYFIFCRDRISIiC*PVLLTSIiV 
SNSWPEAILPSQPPKVLGLQA 


7821 


21722 


A 


7881 


3 


203 


LSLLPRLECSGTFTAH*SFKIiIiGSRDSP 
TSAS * lAKTTGMRHHAQIiRFFFFVPGGV 
CVCVCGCVYV 


7822 


21723 


A 


7882 

• 


324 


211 


AWWLMPVIPTL*EAKMGGSLEARSLRFV 
WAHNETPFLF 


7823 


21724 


A 


7883 


2 


124 


QHFGRLRQVDHLRLGVHYQRGQHGETPS 
LLK* INVYI YMT 


7824 


21725 


A 


7884 


367 


141 


KCWDYRREPPRIAQGIiEFLFNFCFEIVT 
EPPCPALAYF*NFL* RWSIATLPRLVLN 
SWPQAILPRWPPKALGLQE 


7825 


21726 


A 


7885 " 


471 


215 


RGMCVSHIFLCVGARYNISSARESIiLKE 
FTECTBRKHTHTRAHTHARARTHTHTHT 
HTHTHTRV*NSRSQPYCRVHACS PAYLG 
6 


7826 


21727 


A 


7886 


462 


204 


RHSS*LGLPKCWDYRHEPPHLACISSF* 
RKSDISLSDSLrGTKCHKWWPQWKSRAG 
SVGRDPCSGHFIIiAGFGQLGA 
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SEQID s 
NO: of 
nucleotide 
sequence 

t 


SEQID 
NO: of 
peptide 
sequeoce 


M 

etn 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

fng to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon . 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaoine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
OGIycine, H»Histidine, Msoleudoe, 
K— Lysine, L= Leucine, M=Methiontne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7827 


21728 


A 


7887 


409 


152 


PVFLALERPNQQDFLRPGVLDQP*QNNK 
ISFFFFFFKKKKKYIYICMVVHAYNPSY 
SGG*GGRIT*AQKFKTSLNNTVRPHLYK 
K 


7828 


21729 


A 


7888 

« 


1 


152 


GVHVCCDCKC I CVCLCVVGVHKCLCVCV 
SK*VHMCHREGTIFGNISWEGD 


7829 


21730 

• 


A 


7889 


101 


399 


EI INI VFFFYFFLPI*LFFFFWGGRFKKQ 
LNPEFLTNSWKKWHKTIGGKNPMGG* KK 
AITNPQKKGRKRS FSF *NKAQPTQGKNG 
QNSQGS KAVGALTAH 


7830 


21731 

■ 


A 


7890 


478 


142 

m wmm 


KFSKKKKRGVLNFYKKKAFFSKPPKPVF 
KKKKLLKNP FFKAKKQFFFKTPTRPLFR 
EKFFPGPPKKKKKRPPGLFKNN*KGPPW 
PREKKRVKKKKKKVKSHYYNNHMPISVH 


7831 


21732 

•» A ¥ mm 


A 


7891 


497 


320 


PPWKFEFEFWFSP * LHKYYPPYVCVCVC 
VCVCVCVFDFRFFKPAPSPQSHFGINLS 
FLL 


7832 


21733 


A 


7892 


217 


23 


I PPFKKFF* EMGSHAVT*ARWQWHNHA 
SLQPETCGLKGFSHLTLPNSWDCRSVPS 
SSARRQSH 


7833 


21734 


A 


7893 


652 


470 


SSSSSSSSSSSSSSSSSNSSSSSQFERK 
TRS APQVEGKG PNFCSLEAP PPGLSLFS 
CLNF*RSWDPGPPP*PHATPKTRAGGDL 
LCATHHRARHCTPGLRFPLFLAL 


7834 


21735 


A 


7894 


25 


178 


GILFFWRDGGLVILHRLVSNFPVSSDSP 
ISASQSWDYRCDPPCPACPLIFW*LLKH 
SF * FFWRDGGLVI LHRLVSNFPVSSDSP 
ISASQSWDYRCDPPCPACPLIFW 


7835 


21736 


A 


7895 


104 


451 


ILSFMTTWMEAEDIMLSBISQTQTDRYC 
MILLVES KNVNLRPGMMAHAYNPSTLGG 
* GER I S * GQB FET S LVNI ARHYFYKRNF 
KI KS KFKKI VRKSQKKNVRRWLLEARNK 
GRYG 


7836 


21737 


A 


7896 


510 


335 


SCI VSLHRSWDLWHVPPFLANF* IFRKG 
GVLPCCPGWQIYTFQLQIPFLSITPVAF 

sis 


7837 


21738 


A 


7897 


37 


260 


AGNSQNQGGGACSE PRWRHCTPA* ATER 
ESVS KWKQTKFLCS VP YFESTHFHCTNE 
ATGAWG VE KVALGHAAS W 


7838 


21739 


A 


7898 


35 


466 


THTCGGIRKKLTNKRENQK* SQQKPSAP 
KEPEVKNKQKKTKNKPKKKKKTPQIQPT 
KKP FRWPNIiRVFTFFVTTQNKTPTRGRK 
TFQKVKLFPPNPCLGNKPQTLLPRGGLQ 
KKKKKIPFFYSHPPPLAQKKKATPWQNG 
LGFT 


7839 


21740 


A 


7899 


494 


149 


PARLLSLHSYEVKSVLPKEBGLIiNFFVQ 
SVTAHPTSCIGLEEI *LLD . 


7840 


21741 


A 


7900 


494 


114 


GAEGGPLSKKKKKNLLPDYHTGPMSLSL 
YLWPFLSNGHTLTLMCAHTHTHTHTHTN 
H*LSRVGLPVLKPGQYQANGNRWSP*YN 
IKE* P R KTR VMP VTP QLRNDG PMLTFW C 
S S RNS FAMHVLGKL 


7841 


21742 

■ 


A 


7901 


19 


454 


VWCNSSVS I HTHTHTHTHTHTHTHTRS W 
VGWHMP I VLSTQEAEAGRLLGSGSLRIQ 
RAMIVPLHSSLGNAARPCV*HTGSCCVS 
QAGVQWHNHSSLYPQTPGPKQSS SLSLL 
SR* DYRHVP PHPAS CVCVRVCVCVCVCV 
CVYGDRGITPH 
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SEQED 
NO: of 
nucleotide 
sequence 


SEQ1D 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

• 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid; 
E=Glutamic Acid, ^Phenylalanine, 
&=Glycine, H=Histidine, Msoleucioe, 
K=Lysine, lc=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=^Threonine, V=Valine, 
W^Tryptophan, Yc^Tyrosine, 
X^TJnknown, *=Stop codph, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7842 


21743 


A 

■ 


7902 


3 


393 


D AWD P LNPL VNL FVS P KRNS S LDTRKKP 
CRESKKFNTHSRPKSSHQLRKRSSSTPT 
T*KIPNI*LNSSTPSRAKKKRKKKPKKK 
KK* KNS QKKKKPMRKT KKKKGQQLLKKI 
LGQAKPFGGRKEKIFFF 


7843 


21744 


A 


7903 


424 


139 


LFFFFFFFF*G* GFVI FCYVGQVGL *LL 
TSGDLHISASQSAE IRGVSHHAPPLSLI 
WFISQCVGILNHTVHLQYIQCSFVHHF 
LINQEKIDIIK 


7844 


21745 


A 


7904 


1 


181 


SQ*LY*LLAANLFILT* IGGQPGSYPFT 
IIGQGASGLYLTTILILIPTICLIESKI 
LKWA 


7845 


21746 


A 


7905 


3 


281 


HAYAHASAT FFVS TCFHGLHD IMGSAFL 
IWCFI RQLI FHFTSKHHFGFQAAA* YWH 
FVD W* LFL YVS I Y* * GAVLKEPWGGQS 


7846 


21747 


A 


7906 


1 


84 


PTRPLTMUPRL VS T PQLKQS SQLSL PKC 
* D YRCEPACTAYS F * LTMLPRLVSTPGL 
KQSSQLSLPKC 


7847 


21748 


" A 


7907 


3 


392 


DPDNYTLANPLITLPHIKPE*YFLFAYT 
I LRSVPNKLGGRLALLLS ILI LAI I PIL 
HISKQQSIIFRPLSQSLY*LLAADLLFL 
T*IGGGPRSYPFILIGQVAFVLYFTTIL 
ILIPTISLIGDRILKWA 


7848 

• 


21749 


A 


7908 


2 


231 


GCVEKGTLSHC * WECKLVQPLWKTVWGF 
IiKELKIDLPFDPAIPLLGIYPEENKFTIi 
GPLHSVGRSSRKLPGYSPRV 


7849 


21750 


A 


7909 


529 


297 


KIFIGAPVFCPPPHFFFLIPPF*GVEKI 
FFSFFVLLPFGFFFLFLGLFFFFFKLFF 
FFFFFFFKIPFFFSKKVGVGK 


7850 


21751 


A 


7910 


410 


61 


FNNQAMQWFPPFPLKIFFFP*RV*FLE 
GGWSNLSPPPK*GPSPKFPTRVL*GPP* 
GKN*NFGFPG*NWAPHRFF*RAAPFFFF 
ODRVLLPRLVSNS *AQVIHPPWPPKVL 
GLQA 


7851 


21752 


A 


7911 


512 


314 


ANFC * LFIETGFPH IGQAGLELLTSNNP 
PALASQIAGTTGVNLRPGQDFSSYHSEN 
PLVYGSLYIA 


7852 


21753 


A 


7912 


1 


264 


S STTTLLLSRT *NKLT*LTPL I P STLLS 
LRGLPPLTGFLPKWAKKKKKKKKKKKKK 
KRGGGGWKKKPKKGGGGETSSKKDTFFK 
KGGP 


7853 


21754 . 


A 


7913 


416 


129 


FFFFFFFAPPPKIKTPPFFFKREVYIY* 
AJjSVFQSVCKALYIHFRI ISSQYPC*LG 
FFFFFFFFFFFFFLKGSLDLLSAYCVPG 
SVQGALHTLSH 


7854 


21755 


A 


7914 


397 


34 


IFSNLFFPPQKKMFFPPPPLKYFFFSFT 
ALFFFLVFFPFFPPPKKI FFFNNP PKFF 
FYPS *KKKFFFFI PVYFLAPPEFFF*AP 
PPFFFFFFFFFFFFFFFFFFFFFFFFFF 
FSVSKQPLFCLKI 


7855 


21756 


A 


7915 


396 


86 


IFLSPRK*GFFSPPPP*KFFFSPKPLIF 
FGGFFPKFPPPKKNFFFQNPPGVFFYPP 
FKKKKFFFPPPLNFGPPRVFF*RPPPFF 
FFFFFFFFFFFFFFYLWVC 


7856 


21757 


A 


7916 


351 


112 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFPI*KYVFIYISLSLIPFCKFLK 
GRECDIHFCDIQHSYSKYLGHFKCSTND 
RQMNK 
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SEQID ' 
NO: of 
nucleotide 
sequence 


SEQU> 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D^Aspartie Acid, 
E^GIutamic Acid, {^Phenylalanine, 
OGIytine, H=Histidine, Msoleucine, 
K=Lyslne, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V<=Valine, 
W^OVyptophan, Y-Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


7857 


21758 


A 


7917 


445 - 


2. 


FFFFFFFFFFFFFFLSYF*LIHYAEGIG 
ASPCYIMLGYNFSSFPOGTISIAPGFNF 
YRLYFI SHASADAWVDPRFLDRYRRG IN 


7858 


21759 


A 


7918 


2 


407 


P PP PQ VYFNI WRT I FFGEGLI*HI FPPQK 
RFCFYKTYTNYIKTLFIKKKNIFSLAHI 
KMVPPRS I Y*TPPPLFFFFFFFFIiFFFF 
FFFFFFFFFFFFFFFFFL 


7859 


21760 


A 


7919 


107 


11 


KMWYIYTLECYSALKKKKEILLFDTTLI 
NLKDIMLSKISQHRKHTHT* FKT 


7860 


21761 


A 


7920 . 


1 


70 


FFFF*AAESCSRIKCSGTIIAQCNPELP 
GLSDPPTSAT*VAGTKGVCHHTQIjRFIF 
IV*LSGRILLQDQVQWHNHSSV 


7861 


21762 


A 


7921 


123 


260 


G IQARK£ PGAVAHAYKLSTLGG * GGWIT 
*GQELETSLVNMVKPCL 


7862 


21763 


A 


7922 


184 


399 


LiHLLMR VLKKKKKMGG PLYRS PRGAKVN 
PALQRLDSLLIGSRILSNLGIFGKAPHS 
WGG * AFFAATGLHGLHVI IGSTFLTIGF 
IRQLIFHFTSKHHFGFKADD* YWHFVDV 
A* LFLYDS I Y* *GSRILSKLGI FGKAPH 
SWGGTKWDNPPPDMKSS 


7863 


21764 


A 


7923 


2 


407 


GRVGMI*EAFASKRKVIiIVEEPSINLE+ 
L»YGCP PHPLHTSNKKP PHTKKKQKKKKK 
KKKKKKKKKKKKKKKGGGIjLKKLWGGP k 
NTGGKKKNFFFFWGGKKKNLGDFLKKNL 
FWGGGKIGPTPPQKKNPIiGEKKF 


7864 


21765 


A 


7924 


1 


398 


PTRPPTRPTSSRSRAPPFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFVSAALFFGKKKNFFFFF* FFRG 
* GFYFFP PLYFF ICGGGGVFFFFFFAGW 
NIFFIPPKKNGIFKTLRGKGG 


7865 


21766 


A 


7925 


2 


335 

■ 


NLTFFPQHFLGLSGMPRRySDYPDAYTT 
* NIIjS S VGS FI SLTAG ILI I FM I * EAFA 
SKRKVLIVEEPSINLE*LYGCPPPYHTF 
EEPSIKKKKKKKKKKKKKKKKKKKKKQ 


7866 


21767 


A 


7926 


1 


328 


RTRGRTRGESNPSLRIQNSPCHLSHPIL 
KKKKKKKXKKKKKKKKKKKGGPLKKNPG 
GAKI * PGPKKKNLFFKGGLKKTPLGNFE 
KKPFFGGGKKRKKP PQKKKTLKE KKKF 


7867 


21768 


A 

• 


7927 


27 

• 


226 


LSR*KM*NKSHIYSFEKKKKKKKKKKKK 
KKKKKKKKKKKKKKQKKENRPKNGDSEN 
GHPKSKIFWL 


7868 

- 


21769 


A 

* 


7928 


398 

- 


2 


GGGGGPGGRGRP* *NPHKKEGRDPPHPK 
KKFFPRRRKNRGGGGREKSPPQKKKAPQ 
JOCPGRGFKRAPQKKKKTPPPPRKKGPPP 
KNFKKGAP P FFFFFFFFFFFFFFFFFFF 
FFLGILARTTTTTNDMKNHR 


7869 


21770 


A 


7929 


3 

* 


156 


HASAHASAPVFV* SVLMTAVLLLLSLPV 
liAAGVTILIiADRILNTTFFDPA 


7870 


21771 


A 


7930 


405 


249 


ASAHAS APLFV* S VLI TAvLLVLSLPvL 
AAGIT I LLTDRNLNTTF FDPA 


7871 


21772 


A 


7931 


921 


699 


DHPGQHSE IIiSLQKMKTMSQCGGAHL*S 
QVLRRLRREDCLSPGSQSCSEL*SHHCT 
LACETE * DLVFQKKKKF 


7872 


21773 


A 


7932 


423 


259 


HDLGSLQPLPPGFR*SSYLCLPSSWDYR 
CEPLRPAQRRGX LMAATSRILTRS KGW 


7873 


21774 


A 


7933 


395 


35 


PRVKRFSCFWLPSKWGPRPRAPWPGYFC 
FFC*NGGSPFLPGGSWTLAPNYSPPPAP 
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M 
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od 
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26 
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ino tn first 
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Predict- 
ed end 
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otide 
location 

ding to 
last amino 
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residue of 
peptide 

sentience 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=C*lutnmic Acid. F^Phenvlalnninp- 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lyslne, L=Leucine, M=Methlonine, 
N=Aennrflpin^ P=Proli nf* 

Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine. V=Valine. 
W-Tryptophan, Y«Tyrosine, 
X=Un known, *=£top codon,/=possible 
nucleotide deletion, ^possible 

niirleotirie insertion 


• 












QNGLVQALRP PGPPTFPPLI FPTKAPWS 
WPFYGGKKKKKLSPKGHSERHLHHKYIR 
LLSQHG6M 


7874 


21775 


A 


7934 


1 


118 


NF*IFCRDWGLALSPRQLSNSWPQVILP 
POP P KVLGLQA 


7875 


21776 


A 


7935 


322 


383 


KKKTGL I KIS TLCKVKNTFKKTKWQAPE 
RKKKVP I FLS YKRPVFKI YKKLF PFCRK 
KIKKKIPPPGFTSFSCLSLPSSWDYRHP. 
PPCPVNFFCIYNKNGVSPG*PG 


7R7rt 
/O/O 


71777 
z,\ 1 1 1 






j 


in 


AGLSHRSQPLTWSL 


7ft77 


XI / /o 


n 






1*K> 


UPTM^TJffTfrMVMVQCCfJAVWCCPT QT.PVttf 

WIX^QEPPCPVIMFLKRGINHALYSPSR 

KT.T.WPT. 




/I //If 

i 


k " 






Of I 


RFSR CCLS PLS LTGKGNSLTPCASRVRQ 
CLAIaLWLVQGVLHPLSCTHCIiALPSEMN 
PVPQMEMQKS P I FC I ADAGS CRPBLFLF 


7879 


21780 


A 


7939 


2 


95 


RLNLGGGGCCE P * S CHCTPAWVTKRNS I 
SKK 

.... it 


7ft 8fl 


0 1*781 




/yw 


i 
i 


101 


Jr 1 Kt'oy JL Jroxi J\ w B orlLtoIiir JU? WJJ XKV_i£jr 
PCLAVILSIPQNSSMPSNIFAASYSS * F 
GLYSSRW 


tool 






70 A 1 


177 


JO J 


c vn c\ro t trr .r .p an a v arse VD'rvxiLjr'D xyfi 

AVAHTCNHSTLGSRGGRIA*GHEFKTSL 
GNIVILPLYKKNKKKKKRGARLKEPNLT 
p 

mm 


7882 


21783 


A 


7942 


235 


3 


KNI LPG YFCFFLREKP PPPGLESLFS KK 
KKKKKLARHG VAPLLGRMR* EDRLRPGV 
QGCNELWWCHCTPAWATEQNP 


7883 


21784 


A 


7943 


3 


225 


EHGSLYPPTPGLKQSSYLSLLSR+DYR* 
SWTQAVLLPPKVLGLIiV 


7884 


21785 


A 


7944 


2 


232 


TLLPDNLS QTI YPNIQCKKKKKKKKKKKK 
FFGGRVLKKGGGKKPGEPTN 


7885 

i 

* 


21786 

• 


A 


7945 


394 


4 * 


WCRWLETWGAGGSGAVPPPPFFPPHSSS 
WDIjaTjnQSGAS S P FS PGAVawafMPSTI. 

GGO^WIT*VQEFETSLANMMKPHLFSK i 

KGPVTAVAHPAKAHRANWCIPTHMHTQH 

VLHVL 


7886 


21787 


A 


7946 


405 


100 


FFFSPRFSSSLFLFLSSFFFFFFFFFFF 
FFFFFSTGLFVYWS PTH*SLPVPRI FFF 
S YLWQTEVE TMLGTE 


7887 


21788 


A 


7947 


391 


212 


KEMQIMPAMR*HLTPVRMVITKKSKAKC 
WQGCEEKGTLAHSWWKCKGTAETGKLAG 
F 


7888 

# www 


21789 


A 


7948 


106 


301 


LQLPITQLWHSIYFMKKIRYRLGVVAH 
TYNPSTLGGQGGRIA*GQVFKTSLSNTA 
RQKKKKKKG 


7889 


21790 


A 


7949 


342 


208 


GLGAGTHTYN P STLDN * GGW I TSGQE FD 
TSLANMVKHHVVQTGQF . 


7890 


21791 


A 


7950 


391 


21 


RQGFPKLPRL I SNS * P KGSAHLSLPKCTJ 
EYRSBPPSIATWYVTLSRHTCSTLLLGQ 
GRSKFHMKWESRLEGLLAKVSSLRSSYC 
S SLRILGLE I LLS PLLFFLETGSGS VSQ 
AGVQPLCPGLK 



1034 



WO 01/64835 PCT/US01/04927 

• ■ 



SEQID 
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OO 
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26 
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location 
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inn fift4 

jng w iirsi 
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peptide • 
sequence 


Predict- 
ed end 

nil j*1 

otide 
location 
correspon 
ding to 
last amino 

HC1Q 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
kj v? luuiiiiii /vciU) jt— jt Deny ibim nine, 
G=Glycine, H^Histidine, I«Isoleudne, 
K=Lysine, l^Leucine, M=Metbionine, . 
ii— Aspar&giiie, r^rroiiiie, 
Q=GI uta mine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
w— l rypiupnan, i — tyrosine, 
X=Unknown, *«Stop codon, /-possible 
nucleotide deletion, V=possibie 
nucleotide insertion 


loyl 


21792 


A 


7931 


1 AC 


ice 


LGWAHACNPiS PL * GQGRRIT * AMEF* T 
SLDNIARPCLYQK 


7892 


21793 


A 


7952 


289 


327 


KGI PPLPPRLKDKGPFFFFFFLKRESLF 
LP * FWKNGCRGPPLFFFFFLRDSVSLLP 
RJj V5 NS WP * V 1 JjJjP QFPKVXiKIjyA 


7893 


,21794 


A 


7953 


861 


586 


DR VR S VAQARLQ WHDLGS LQP P PPGF*Q~ 
FSCLSLLSSWDYRRMHTMPFVF*WRWGF 
TCVAQAGLDLLSSDNPPALASQSARIIG 
ESCCVliLiK. 


7894 

• 


21795 

* 

1 


A 

■ 


7954 

• 


402 


52 

■ 


PRKVSIS* PHDLPASASQNAGITGVSHR 
TRPSFNFSYIETAITETRKSLQPSPGMN 
♦SLPSGMVGFKPHS FIHS ANFCEALAVQ 
QTLGEVPGHAPLSPPRQBPRVCPKKVCL 
KTAJjK 


7895 


21796 

■ 


A 

- 


7955 


259 


3 


YNTKVLS FPILL*TFLKHLWIKJAVARAF 
NPSTiAiGGGGRIA*GQBrKTSIX2»JIBRC 
RLYKNLKLFLESGLFLMAWRGAKRVRTR 


7896 


21797 


A 


7956 


171 


386 


LP I RLFS LVFTT IE KSRPGMVAHTCNLS 
TLEG* sgwit* AQEFETSLGNMAKLSLP 

VTOTPPR Cr.T r P'DCOTkTV 
Kj. yis bAo Li X X KoyW Iv 


7897 


21798 


A 


7957 


397 


283 


PQPLPPGLKQFSCLSLPKC*DYRRDSAR 
PAKKNPTFNK 


7898 


21799 


A 


7958 


169 


324 


PTUD^n?MPor\tin' pin 7"7\U7\ pnnn dt yvrtT7 


7899 


21800 


A 


7959 


290 


397 


DCSSCCSPLKGR*RG*AQWLTPVIPALW 
EAEAGGS 


7900 


21801 


A 


7960 


496 


298 


LS KTGSL YVAQS VVYWLFTGT I MVHDSL 
TTFQPDVYRL 


7901 


21802 


A 


7961 


404 


174 


PGPQKGSFF*KK1QCJCKKBIDHILSHKAN 
QLLKETLFLEMKKYTIRKLS 


7902 


21803 


A 


7962 


25 


99 


PGLFLSS*HQVILLPWPPKVLGLQL 


7903 

■ 


21804 

. 


A 


7963 

■ 


436 


106 


GGGKS P P PGNWAP PGNPGE PLFF* KKKK 
KN PWjGWarRr r 1? P P PGGRKKKG VYPGGG 
GFTLTWFPPPPPPWGKKKNFFFKKKKKK 

inrtrW/lD T a TPVT TT .PTTC TTJT?VtraT7T ■ 


7904 


21805 


A 


7964 . 


212 


395 


ERERKERDRDGEREREREEREEGRKKHG 
GR*GGREGERHRPRGAEDPDRAGRATDK 

KXVvuJ 


7905 


21806 


A 


7965 


344 


345 


FSDCYICJISSCASWTWYTRIjAHHQPPARP 

PRPAPGATGRPSRVPKSTAFTQMLTQPR 
DPHLPLPIPPLSQPPP9P*PAPLVMR 


7906 


21807 


A 


7966 


397 


298 


RGFTMLPRLVLNSRTQVI *LPWPPTVLG . 
SLSIF 


700*7 




A 

n 






Zoo 


PTILGGC^RKIT*GQDLKTTLVNMGKPH 
FY 


7908 


21809 


A 


7968 


257 


107 


RTHTHTHTHTHTHTHTHTHTHTCENSKT 
TKWIA]^CS*LESQRNNGKCTYAKF 


7909 


21810 


A 


7969 


413 


12 

i 


PPPPPLLTPWPCFFPQIL*PVGHSAPPL 
FK*LKI PRPFFFPPPPKGIGAPPPKIFF 
PPGFSLPPFVKLPPGEKILFPPPKKKKY 
PPPPPQS*SF*PPPPPPFFFFFFSTSAK 
KS KKNFLFI FS PFFNPKYYNT 
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NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 


■M 

eth 

od 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=GIycine, H=Histidine, Wsoleucine, 
K^Lysine, tRLeucine, M 8 Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=tfnknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


7910 


21811 


A 


7970 


399 


212 


TGVQWSNHSSLQSQTPGLKRLFLLILLS 
IWDYGCMPP YLS FKN * FCTKGGSC YVAE 
ADVNIL 


7911 


21812 




7971 


584 


424 


RPRRENRLNPGGGGCSEPRSRYCTPAWV 
TE +NS VSNRNKTNQNP I FFS CKQFD 


7912 


21813 


A 


7972 


105 


1 


PSMPQLSHL * NG I LGRVRWLTP VI PALW 
EAETGRS 


7913 


21814 


A 


7973 


363 

• 


28 


SSSLGGHPQLW FHTLKLRPVTVSYA*NK 
LCDKIEKKLFFT IALHNKFLRKKLTRDV 
KIIiFNNK*IKkEGPNEF*NILCLWIERI ! 
NINKVPYPSKVMYRSNTLPIBVAVFKSQ 


7914 


21815 


A 


7?74 


404 


250 


FFFFW*R*GFTMLPRLVLNSWTQAIHPP 
ALASQSAGITGMSHHAQPLS I I F [ 


7915 


21816 


A 


7975 


2 


60 


FSCLGLPFCWDYRHAPPHLANFCRDGVL 
NY* PQVIHLPQLPKVLGL*ACTTTPG . 


7916 


21817 


A 


7976 


325 


81 


KT IHS LLFGQ FFLLQPLLPS P I PHAAP P 
TPHLI FLLLLFCF* FFK*RQGKQGLTTL 
PKL I SNS YSQVTLLPQP P KVLG FQA 


7917 


21818 


A 


7977 


406 


282 


QWRDLGSLQPPTPWFK* FSSLSLPNSWE 
TKAGRSREPGGGD 


7918 


21819 


A 


7978 

9^9 V 


382 


221 


DCI I SAS YLQKNFFFFLGDKVLLCHPGW 
SAVA*S * LTVTSKR^HFYSSEISIiY 


7919 


21820 


A 


7979 


423 


29 


FLW * RRGFTSLPRIiVSNFWAQG ICLPWL 
PKVLGLQA 


7920 


21821 


A 


7980 


3 


238 


SLAFFVETGSHFVA* AGLELLS SSNS PA 
IASQSAEITGVSHHTQPE *GYSHM* PQY 
PYQLRFNSDTI ILSNVLFVPNF 


7921 


21822 


A 


7981 


324 

* 


128 


WLSVS P YTFELSLLVGYEFVIHDVI * TL 
IFVVVWVSLFCFLWFFLVCMVFWFFC 
CCFLSVFCI 


7922 


21823 


A 


7982 


1 

1 


86 


NLTMLPRLVSNS WAQAICLS W PP KVLGL 
*AQAICIiSWPPKVLGL 


7923 


21824 


A 


7983 

• 


1 


137 

mm* 9 

• 


RPANFCIFFVETEFCHVAQAGLELLGSG 
DP PTSASQGAGTTGMSH* D * PANFC I FF 
VETEFCHVAQAGLELLGSGDPPTSASQG 
AGTTGMSH 


7924 


21825 


A 


7984 


2 


277 


PRVRS VQR VI YQYV* NLQ IHVP FDSTNL 
G I YLTD I LPHWNDNMYKVFAAVS FV IA 
KY*NPCLTRRQCNKLWPIHAMEYYVTIK 
KNEDGRA 


7925 


21826 


A 


7985 


211 


42 


GPQ KW PRG PGENLGGFFFFF FF CRDDGL 
TMLPRLALNS*VHAILPPQPRKVPGLWV 


7926 


21827 


A 


7986 

9 s w 


409 


183 


LLRLK* SSHLSLPKWWDYRCEPPCPAI F 
FKRKQILKESFPGENTCCHLFNSLEVLK 
NKRRTCYNFPNTLAVLKNM 


7927 


21828 


A 


7987 


2 


194 


RFFETQSRIiQPPPPQIiK* SSHLSIJQSS 
WDYRHVTPCLANFFIiFWYFLQGQGFAFL 
LKLKFLFK 


7928 


21829 




7988 


483 


209 


SSPSLGNFCIPSSGGVSPCWPGWFQTPD 
LVIWPPGPPSLVFFFFFFRDEGLSMLLR 
LTLNF*AQVILLPQIiPE*LGLQADRQVP 
LYPAEVF 


7929 


21830 


A 


7989 


164 


29 


KFWFWLGTVAHACNPSILGSQGRRIA*A 
QELEPSLGNEGRPCLYK 


7930 


21831 


A 


7990 


3 


474 


PTRAPLRAPFETIVQEEEFSKHHFGFEA 
AA* YWHFVDW*LFLYVSIY* *GSPPPP 
PKILIKKNGAETKKFFF 


7931 


21832 


A 


7991 


469 


323 


VETGLHHVAQDGLELPTSGDS PASASQN 
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Amino acid sequence (A=Alanine . 
OCysteine, D=A5partic Acid, 
E=Glutemic Acid, ^Phenylalanine, 
G=GIycine, H^Histidine, I»Isoleutine, 
K-Lysine, L=Leucine, M^Metatonloe, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IGITGVSHHA* PDFTFIiBQP 


7932 


21833 

• 


A 


7992 


469 


13 


KGDTVSSSVSSPSCGGLFPVGAS+LLCL 
FTQALAMVGARRPFSMLPCSLISDCCAS 
NERGS VGMG PS KPGVT3YSLLVCRWLRP L 
EKRT I KVGVTRFSRCRLSQLPLARKGNS 
LTPCVFWVRHCFGSHSVGCTHCPAPAVR 
QAPVK 


7933 


21834 

■ 


A 


7993 


.15 


439 


IASGRPFSIKFFLVAITFLLFDLEIALL 
LPLP*ALQTTNLPLIVMSSLLI»IIILAL 
SLAYE* LQKGLD * TEPNKKKKNKKKKQE 
KKKKKKKTKKT KGGGLLKKILGGAP ILW 
GGKNMFFFFWGGEKKTPGGFLGENLFLG 
GGK 


7934 


21835 


A 


7994 


392 


186 


YF Y I INNFFFFKKI FI * * 1 1 F FFLKKFL 
*YSPKKVFFLFF*IFFFFFPSSSLFLFF 
SSSSPFFFFFFFFFFFFFFFFFLRG 


7935 


21836 

■ 


A 


7995 


11 


386 


TWEVEVAVS*DHAIALQPGQQE*NAISK 
KKMTGRMFIEVSFIIASSMKQIiKCS*TG 
E *MKSLWYTHI I E YYLT I KENKVLEHKV 
SRLTLIDTGRERSKTKEYTLYNPIYTNL 
RIAKLSGCGGSHL 


7936 


21837 . 


A 


7996 


3 


186 


DSFCF*RQSCSITQAGVQWGDRGSL*PQ 
SPGLKRSSCFSLPKHWDHRDEPLHPAGV 
AVLN 


7937 


21838 


A 


7997 


316 


649 


RDHSFFFEMESRSVAQAGVQWHDLGSLQ 
ALSSGFMPFSCLSLPSSWDYRHPPPCPA 
NFL YFL VEMG FQHVGQNGLDLLTS * S TR 
LGLPKWLGITGVSHWHPAQDGGFHHVG 


7938 

1 S m/ V 


21839 


A 


7998 


468 

~ WW 


336 


RRGFTMLARMVS IS* PSDM PASASQSAG 
TTGVNHHIRIHRSLL 


7939 


21 840 


A 


7999 


464 


283 


TLCGGIRPTNI MKGNLLYS KS IGXjNVSIj 
LPKNPRRNIQNNV* PNTWALQPSHVDI 


7940 


21841 


X 

-* 


8000 


46 

~w 


487 


RQGRLS LQ KFLL P FVQLCPAHRGGVYRG 
RQASLS CGGLHPVRVSQPLCLPT * VSAM 
AAAPHPASLLPCSLISDCCASSEQGSVG 
VGPS KPGAGYNIiLVCHLLRLLEKRSIRV 
GVSQFSRCHLSWLPLARKGNSRTPCTFR 
VRRCLAL 


7941 

• 


21842 


A 


8001 


295 


1 


TQPWGTKRKLPLKKKKKNSGDREWRWLC 
NS VT I LNATTMYTS BWLRPGTVTHTCNP 
KTSGG*GGEIA*TQEFKASLGNIA*SWL 
YKKILKI S PLWWQT 


7942 


21843 


A 


8002 


144 

■ 


317 


ELFYI*KEMV*RPGPVAYACFPSTLGDRG 
GRI T * AQEFRTDLGNI ARPCVYLRKKKK 
RG 


7943 


21844 


A 


8003 


69 


291 


HCVWTMI F I VI KYFYLNPQKNSMRYYYS 
HFMDKETDVQSD * MTC * RLGAVAHACNP 
STLGGRIA*GQBFKTSL 


7944 


21845 


A 


8004 


3 


176 


IVAHSSLTPGLKQFSCFSLPSS*VYRNM 
LPRLANY*FPGTNRVSLSCQGWS*TLAS 
LLGS SS S PAS ACQVAXS TGTCYHAWLI I 
DFLVQTGSRCLAKAGHBLCS CL 


7945 


21846 


A 


8005 


419 


178 


NPPPKKLFLSSSSSSPLPHLNWGTPGFF 
PPPPF*NPPPEFNFGAPKKKKKLSPPRA 
EKLVPFKGPPPFFFFFFF 


7946 


21847 


A 


8006 


429 


225 


QPLPPRFK* FFCLSLPRR*GHRHRPPHP 
FNFFFF * YFLRWS FVLVAQAGVQCMDLS 
I YLNE FWLLDF 
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SEQ ID 
NO: of 
nucleotide 

■ M m^m m^g$ ■ ^mr w ^mm mur 

sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

mm ^mm mp m mw ym" mtm mmm mmm 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sentience 

t ■ 


Predict- 
ed end 
nude- - 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutamic Acid, ^Phenylalanine, 
G«Glycine, H=Histidine, Msoleucine, 
K^Lysfne, L= Leucine, M=Methlonlne, 
N=Asparagine, P=Proline, 
Q=Glu famine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known. *— Stoo codon. /=Dossible 
nucleotide deletion, ^possible 
nucleotide insertion 


7947 


21848 


A 


8007 


756 


224 


PPRHQPQHVPPPLLHFSTPSPAPPKSHA 
AAPNSNPPQAHAPPAPPAPRAHSPSPPP 
RGATPPPPPPPFS FLAAVGRAPI FPREP 

m mmYmtmrnm * mm mm mm mm m* mmm mm- mm mm maw mm 4at mm my m ^m* *m mmi Ammm mmt mi mm m Jmmmm mm 

LLPSSSSPPPPP*IGEPPGFSPPPLLKT 
RPPK*N*GPPKKKNFFPPPRPKNWFL*K 
GPPPFFFFPFFFLRQSFTLVAQAGVHKT 
RLHLKKKSN. 


7948 


21849 


A 


8008 


370 


559 


SSHLSPQSSWDYRSGMVAHTCNPSMLGG 
QDGRLT * AQKFQTS LGNI VRCHFYFKKY 
FKNKIFFYSPFINTNPKKKKKKGGAVLK 
NOS LRPRAGKVLVFIiWQHLIOFRGPR FK 
NGGTGKPGGLPNLIAPR 


7949 


21850 


A 


8009 


1 

• 


662 

• 

■ 


RTRRTRS WWYMP VX.PLSHYGEAGESLGP 
*KWMVD*APIAPLHSTLGDKMTLPDIKK 
KNG ETVANTS P L S T TAKPAKLFL KAKKD 

LKI I WRRHGI WPLKNIGPTEDVQASAHG 
GVEENMTS D I E I PEAKHDHRPTEDVOVS 

W UUilk 1 A ^m* mm** mm U <A f ImmU 11/ AliVf A m-iMmm*' V V 

AHGGVEENITSDJE I S EAKHDHHLVEDL 
SESLSVCLEDFMTIGSQWKPICLS 


*7Q5ft 
lyDK) 




a 


Rftlft 
oUlu 


Oil 




NFFFFFPPFWGFFFFlJGPGFFKVFF*IF 
PFFFPRNPPFF* FFESVPLFFKK* IFFF 
PFSIRFCFFPKKFFFFLILFFFF*FPFF 
FFFVYVFDFFFS F I LYI FFFVYF3LP FT 
VLI I SLI YHFAPA 

V mtmrmm mW* m^^^mr mm mm m m,m- m> 4* * *» 


7951 


21852 


A 


8011 


1 


363 


PTRPLF* *RGCFTMLPRLIIiNSWPQVIC 
PPRPPKVLGLQT 


7952 


21853 


A 


8012 


278 


39 


FXXX FFLS PPXFL I FFFFFFFFFFNFFF 
FFFFPPFFFFFFFFFFFFFFFFFFFFFF 
FH*DQ* F3NFDFTPFTFTLISNI I 


/7JJ 




a 


OUl J 




4.17 


MNE CI LGKGGKDGGF FEO CENLGLC * FL 
KVP * DS D I WLGAVAHAYNP STLEGQGER 
ST*AQEFKTRLGNMTRPCLYQKHTHKKK 
TEOOCPHPGLGWGAWGPTYLGGVGTKIT 


7954 


21855 


A 


8014 


211 


19 


FGNIHNFFLIGSRSVTQAGGQWCNHSSL 
* PQTPVLKQSSHLSLPSSWBIEAAVSQV 
RITALQPGRQS KTLPGNSGPVLRADARR 
SI 


7955 

* 


21856 


A 

* * 


8015 


i 

M 


130 


ARI VS IS* PCDIi PASASQSAG I TG XSHK 

^ mm -m mmm mj ^P* • * * ^ ' — ■ ' mm a 1 p ■ ' ' gi^L p ' ^ *~ » r >■ ■ mmm mi t> mm— mm*9 m+ m*m~ m> 

TQLAHNTLKASPTLF 


7956 

- 


21857 


A 

■ 


8016 


4 

■ 


417 

■ 


RLMPPHLANFLFF * RHGFTYVAQVGLKL 
LGSNDPSASTSQSTGITDVSHCTQPHLL 

WmW ^mw ^mm*^m ■ mmr *mm m^m— *m mmm^mr mm ^mmw •mmm' m\\\w wmm ^mpr m^m mmm mww ^ mmw mm m ^mmw mmm mmmr mmm mm mmmmmw mmmmr 

KSSHSTFPLKTLDSMATNCSWDKVHYLP 
TACRALLPCPLPSACFPSLSQVPSVFTC 
CVPATLTITLLPLLRAFEHVTIiCPLF 


7957 


21858 


A 


8017 


388 


158 


CVTWASNCINWI CFPLS* INSHLCNEI I 
VI,I*GElYNVCISIYLCIYMCVCiyiSV 
CVCVCIYVH*TPYIiIHKCLI 


7958 


21859 


A 


8018 


154 


382 


GIKRPGAVAHNCNPSTIjGG*GKWIG*AQ 
EFKTSPGNMVKPQIYLKKKKKKGGAVLK 
DPSLRPRAGKVIVFFWGPLI 


7959 


21860 


A 


8019 


396 


237 


FFFFFFW*G*GFTMLPRLVSNSWAQVIH 
LPWPPKALGLQVRTMVPGLRLANF 


7960 


21861 


A 


8020 


412 


168 


GNLLGSPLPGNVCKNKPFILF*KKK*SL 
AVLPKLVSNFWP*MILPPWPPKI*GVQA 
RAPPPSQI LKKKKSSHFYNSHVNSL 


7961 


21862 


A 


8021 


1 


124 


TRFLHVS QAGL * LLTSGDLPASASQSAD 
YIFIISPAKIIFL 
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SEQID ' 
NO: of 
nucleotide 
sequence 1 

'i 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 

■ 


SEQID 
NO: in 
USSN. 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AJanine 
OCysteine, D=Aspartic Acid, 
Ec=Glutamic Acid, ^Phenylalanine, 
0=GIycine f H=Histtdine, Msoleucine, 
K=Lysine, L= Leu cine, M = Methionlne, . 
N=Asparagf ne, P=Proline, 
Q=Glutamme, R=Arginine, S=Serine, 
T=Threonine, V«Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, **=Stop codon, /=po$sible 
nucleotide deletion, Y=posslble 
nucleotide insertion 


7962 


21863 


A 


8022 


3 


200 


HLSP YFYFFKBRVALCHPGWSAVVQ* P I 
VTSIPWSSC+KLTSWRRWRPSWLTR*N 
PVSTKNTKN 


7963 


21864 1 


A 


8023 

• 


3 


132 


FLFFFQF* FMRLGLAMLPMIjGMNS WA* V 
ILLPQPPEWLGLQA 


7964 


21865 


A 


8024 


3 


187 


VGQAGLEPPTSGDLPASASQSAE ITRMS 
HRAQTK*DFIGSFSEEFS 


7965 


21866 


A 


8025 


405 


231 


SDKWIKKMRYIHTTEYY*ALKRKEILTH 
ATTWLNLENILLSEISQSPKHRYYVIPL 
T 


7966 . 


21867 

i 

1 


A 


8026 

■ 


175 


382 

- 


GKRIFPPYPPQEGGKTGPPQTPGLFFFF 
LKKK*PGGLGGARFSPLLGGVGRKNSFT 
LKGGGAINLKGPPSLPPGGKRWAPPQKK 
KKKGQAWWLMPVI PALWEAEAGRfi 


7967 


21868 


A 


8027 


83 


374 


GDQVWWLMKACNPSTLGGQOGRITWDRE 
FETSLANIFFPISTKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKAS S GGGVS * K 
KNFSPRAGKEIFF 


7968 


21869 


A 


8028 


382 


148 


SPHCSLDLVGLKQSSHLSLPSTtDHRCV 
PPRI*AI FS FAFLFFWEATTTTTKATMLP 
RLVLNS * AQAI LLPQPLKVLRL 


7969 


21870 . 


A 


8029 


88 


208 


SGSV*SNNPLSWAWWLQSIIPATQEAEV 
GRLLBPRSSRSA 


7970 


21871 

• 


A 

• 


8030 

■ 

■ 


I 

• 

• 


283 

• 


NKSRIKKAI KMTEKMKBKI EMMKLMNQL 
PNGEHHAMIQMMMKI EMKGRMKIKE * DK 
ESDKDDRKDEREDRDDETDEPTPERRTS 
RDDSDDDEDRDEGKNEDKRKDDSKDDDE 
ADEDTDQDDYDPMDAHKAADEDDDEDED 
EVAEQMTKKKKKKKKKKKKKK* * KDDRK 
DEREDRDDETDE PTPERRTSRDDSDDDE 
DRDEGKNEDKRKDDSKDDDEADEDTDQD 
DYDPMDAHKAADEDDDEDEDEVAE 


7971 


21872 


A 


8031 


2 


139 


LQTLLPLI AF * + LLASLANLALPPTINL 
LGELSAQFAVIGMY2XY 1 


7972 


21873 


A 


8032 


80 


324 


YLHF YFFVR * KLMGLDTVAHT YNPSTLG 
DPBRRIA*A*EFKTSLGNMVKPCIYKKK 
KICKKKKKKKKKKKKKKKKKKKKKKK 


7973 


21874 


A 


8033 


299 


158 


GNQPGASAHTCNPSTLVGQGGRII*AQE 
FKAS LGNMVKH YL YQ KHKN 


7974 


21875 


A 


8034 


416 


60 


AQWLTS 1 1 P PL * EAEVGGFFE PRSLS PA 
WATE 


7975 


21876 


A 


8035 


398 


271 


FYFL*DEVSLLPRLVSNSWTPAILPPWP 
PKVPGLRHEPPRPA 


7^74 


21877 


A . 


8036 


385 


151 


FLYFL*RLGLTLLPQLVSNISSSDPPT* 
ASESAGITGVSNHAHLRICILTRLPGNS 
SH*NVRRAGLGDTQSSALIPPG 


7977 


21878 


A 


8037 

*r*rmr w 

m 


3 


406 

• *m^m • 


PAS AS * VAGITGTHHHAQLI FVFLVETG 
FLFFFFWAVFLFFKTKNGPPGGQLFFFA 
APPALGGGDFGGQGATFSPKGLGVLGBT 
RGGAPAPKKLGAKKEPSHLLGGGAQNLP 
KPRGQKGFGFYFYFLARDFFIiG 


7978 


21879 


A 


8038 

• 


428 


20 

• 


LANQ*WKPPSNPWETKISPKKKKGPPGQ 
GPPPIiTPPLWEAPKARKLGSPWLPR*NP 
LFPQKPKKFFGFGGGPPYSPLPQKLSPK 
NGVTPEVGPSHKPKFPPSPPPLGQK*TP 
FPQKKKPKRLKRGMFLHGSTFQSL 


7979 


21880 


A 


8039 


3 


194 


QNKI SQVWWYTPVIIATWEAEGGESVBP 
GRQRLW * AKTAPLHYS QDEKYLKAS VDS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 

r r **** 

sequence 


M 
eth 

oV 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, 1— Isoleucine, 
K^Lysine, L=Leucioe, M=Methionine, 
N=Asparaglne, P^Proline, 
Q=Glutamine, R=Argtnlne, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /^possible ' 
nucleotide deletion, V=possible 
nucleotide insertion 














YCLISCH 


7980 

/ /UV 


21881 


A 


8040 


415 


96 


LEPL*KYFPILDAIKNICDSWEEVKIST 
LTGV* KKLI PMLMDNFEAFKTSMEKVTA 
D WE IGELELEVEPAD VTELLQS CAKLE 
GMRSCFLWMSKBSGFLRWIYSW - 


7981 

i jO a. 




A 


8041 


413 


254 


PGFKGFS CLGFPKR * D YRHXPPCPASKY 

mt *mmi mm ^m I m. Am *«*\ mm* mm m\J. j^a^x. m. mW 3m A mmm* mm, 

FKINYLRNIYRSLILKN 


7982 


21883 


A 


8042 


409 


102 


GFFPFGPFFF*PPPPGVFPPPPPPNFWF 
♦GFGPPPPPLF* FFFRGGPKKI WFFPFF 
LWPPIiFFFFF^FWPKKTPPKKARSLKIF 
FFLGPKKI PNPFFFFFFF 


7983 

r 


21884 

i 


A 


8Q43 


414 

* 


153 


GRGWATNFYF FL X I LK* FF F I FKKFF FF 
HLFFFFFFFFFFFFFFFFFFFFFFYKIF 
LFTLVLTCPQTS CREAE P FDHKVCS VLK 
ML 


HQQ.A 
1 7 OH 


ZIooD 


TV 








MVT . P Pfl *■ W FTFTW PrWIP FP K" P T V Y Q P P P f 

FFFFFFFFPSFPPFIOTPPPQKGKFPPK 
KYFFNP P3j PPPIiFFI iKPPP Ij FF P * GP 

KNL I FHP PS KKI CP FKRGP P * FFFFFFF 
F 


7985 


21886 

. 


A 


8045 


370 


14 

• 


tppptktpfffl yp ppwrekregff* ka 
nmsppvwkkkipplp£yiyi**kggsryl 

mm** mm Am V mm IMMVdh 4b mm mmm* Am Am mm mmm mA, mmmf A^^mwl ^m* m m MA.^m mW mm§ 

AKLS LNLKGS I YLS FC FP PLP P SLPLS L 
FER WGLT I LPKL VLNS WAQMI LLP WLPK 
VLGLKV 


7986 


21887 


A 


8046 


395 


268 


PFFF F * ETGSCS I VQVGVQWLFTGT ILV 
LISVGVLTCSISDL 

mmm mmm- mm W ^m ▼ mmm mm ^mmmmr mmm mmw mmr mmm 


7987 


21888 


A 


8047 


379 


98 


GPAHNSPPLGGESETPLGYPGKPRFF* K 
NKKINPARGPGP WP P PPQG * AGKRPLP 
PRRGFQLTQNGP PPPFPGGKKKPPFQKK 

Mm mt m Mm ^mT mmm mm m* ■ ^mm9 mm mm Mm Mm mm mm UwAUMU m* mm, ^fmimmm 

KKKREPAIC 


7988 


21889 

dm, 1 XJSm* 


A 


8048 


369 


177 

M I i 


RNTRGGPTFFFFFFFFK*NLARLSRLEC 
HGTI S AHCNFRLLGSSDSLPLTSKDIGQ 
INKSLFA 


7989 


21890 


A 


8049 


1 


128 


ALGLVAHAYNPS TLGSQRGKIA* GREFG 
TSLGHRARPCXYKK 


7990 


21891 


A 


8050 


166 


163 


PVHQGEQTTQDKCLR* STHLGLPKC*DY 
RREPPCQASKLVPTGILADSKHSKCQTM 
DSCFSNQLYKQR 


7991 


21892 

mm m ****** mm 


A 


8051 


402 


245 

mm M mm 


YFKSVCQAQWLNTYNPSTLGG*DGWITC 
GQEFRLPWPSG ITGASHHAQPHFF 


1991 

t j m* mm 


21893 

Mm m 


A 


8052 


404 


268 


QQLWRLRQRDCLNLGCGGCSEPRSCYCT 

•tom^ mmmr t t mm m-Mmm* mr m mj^ m •mtmrnm' ^^m* mm^mmm m mmmm *^m* ^mw mmw ^*Mm mmw mmW mm mmmmm ^mW mm ^mmi mm 

PA* ATE PDPVSNKQINK 


7993 

« ^ mr mj 


21894 


A 


8053 


1 

A 


154 


GFLRVGQAGLELLTLRDLPTS ASHSAG I 

mm mmm^m m w ^mW ^m^mp ^™ ^mw m^^— ^mm> ^mmw^mmm mm mmm mt ^m mmw mm mm mm* m\ mmmm* m mvmmw mf mmmmw mmm 

AGVIYRAWPGEAYLFSG*WLRFT 


7994 


21895 


A 


8054 


406 


150 


PKKKKNPCPPPEVKFDS LKRAPLFFFFL 
EESCFVTQAGLECSGAILAHCNLCLVGS 
SDSPLSL*RWVGNSHFIFTSLLSVSPRS 
N 


7995 


21896 


A 


8055 


1 


125 


TAVI LIIFMI* EAFAS KRKVLI VEEPS I 
NLE*LYGCPPPTH 


7996 


21897 


A 


8056 


400 


260 


HMSPHLAFFFSFLFL*RLDMLPRLVSNS 
WPQAFLQLLLPQVLGLQV 


7997 


21898 


A 


8057 


396 


263 


FLQPGTVAHACNPSTFRGGLIT*AQEFK 
TSLGNKVRPCLYKKQY 


7998 


21899 


A 


8058 


402 


211 


FSCWWCTKPKAAG* FPAP WGPPFLNNPP 
GAGTNFPGPPRSGGFGSPPPPWPGPRRP 
CPNGIFFC 
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SEQU) 1 
NO: of 
nucleotide 
sequence 

■ 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding, to. 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine . 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Phenylalanine, 
G=Glycine, BNHUstidine, I-Isoleutine, 
K«=Lysine, LHLeucine, M-Metfaionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
Ts=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X«Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=pos3ible 
nucleotide insertion 


7999 


21900 


A 


8059 


1 


279 


VLL YHPGWSAWQS WLMQ* TPSNADI WR 
IARGFKRSSHFS LLS S WDHRHVTP CLAN 
FFLIL* IWGLAMLSRLLPLRLRKPVLLD 
★APSPCGLI 


8000 


21901 


A 


8060 


408 


315 


SLTMLPRIVFNS * AQAILLPWPPKVLGL 
QV 


8001 


21902 


A 


8061 


198 


2 


RIQHVE SNCHLE VEGG WAQWLTPSTLGG 
RGRMIS *AQEFETRLGNTVRPCFYKNLK 
SQPGMWRI 


8002 

* 


21903 

* 


A 


8062 . 


3 


294 


GLS PTAHP ITAFTSQPLPSGLSVCRLFL 
WAMCEQCPHLP IRPCCPR I IGPECALYT 
TSCLLHQLKKREGF*KKKKKKKKKKKKK 
KKKIF*WGGGKK 


8003 


21904 


A 


8063 


197 


184 


IHDFLFFF*GRVSLCCPGWSAAAQSRLT 
AL* PGELKETFYL* IRRSWDHKHAPPLP 
APF*LFWVTLIITPSTHEVSSAVCSLLY 
VKSEQKCQ 


8004 


21905 


A 


8064 


3 


373 


TSGSLAKRPADCLSAQPTTPSSLCGCSP 
VRVCT TVPRCTHARAHTHTHTHTHTHTH 

SNHG * APGAL Y I QELQRGKVSHVPEKE K 
SDPETA* TKGG * HVTNPHGRQRSGEQE E 
HRICVGRCCSG 


OV/VJ 




A 


QUO J 


J J 4. 


IVJO 


ENKPIHNLSINKYIGKILRDKALLYCPG 

wravmqs*lpvasnswapaillcpgfls 
twdykrvppqlvkifskkkknkptnilr 

PKCHTRNQD I YKRGKVG FL CS YCP PALL 
LLLSLLLLLLLLFSFSTRDSASGAGCSS 
LP 


8006 

■ 


21907 

■ 


A 

■ 


8066 


411 


130 


PHAQLI FLRR * S LTVFPR WSNS WPQAI 
LPPWPPKVLGPQARAMVPSPKLVLESIL 
RIPTSVIFFPLNLCSNIKLAEIKEGCSA 
AFLFNQRFAN 


8007 


21908 


A 


8067 


3 


32 


DAWADAWGSLTDEWVEKI WYIHAMQYYS 
SFEKREIL*YVRTCMNLEDTMVSEIRQS 
QRDDYCMI PHI * GRTRGVH 


8008 


21909 


A 


8068 


410 - 


116 


STFNIQTWPGTVAHTYNPNTLEGQAGRI 
T*GQELETSLGNI TRPNFLLFCQACPGQ 
GLSTVRMCGSLHYFVLPLRPLLKCLDNR 
KSPFYHPVILPCYE 


8009 


21910 


A 


8069 


158 


289 


DLCSSLRNIVRPQLYKK* KISQVRKKKK 
KKKKKKKKKKKKKKKKKKKK 


8010 


21911 

• 


A 

■ 


8070 


388 

■ 


1 


APTPP CFLFKP FVLGGP P FGKAFFQKFP 
GGGFFPSISGPKPPPPRNPSLISKGKRV 
PSPQSPIMVFPGAF*RPPPFPLMVG*KI 
KIFLPPPPPFKTPPALSPFFFFEAEFRS 
CWPG WNAVMRS RLSATS 


8011 

. 


21912 


A 


8071 


15 


336 


KLDKRYDRTGS PGTTHASG YL I E PLVAE 
ASYELILSLAFFFFERRGLALLPRLBGS 
G VI VAHCTLKLLGSGGPPASAS * VAGTA 
GMNHHEGHEPGLFEKQNSGFIF 


8012 


21913 


A 


8072 


396 


274 


PHLHTWESSL*RQGLAMLSRLVSNSWPQ 
VILSPWPPKWVL 


8013 


21914 


A 


8073 


189 


54 


NKPLGPGMWQACNPTPLGGQGQWIP*A 
PEFKI SLGNLAKPCFY . 


8014 


21915 


A 


8074 


3 


257 


HAFVLLFETRSCLSLRLESSGAMIAYCR 
LELLGSGD PLTS ASQ VAGT KGM * DDAWL 
RQVSNSWPQVILRLSHPHMFLFQFSLET 


"SoTs ™ 


21916 


A 


8075 


1 


119 


TLPATWEAGAGGLLEPKSLRV* CVFTAT 
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M 
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nucleotide 
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peptide 

sequence 
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Predict- 
ed end 
nucle- 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H»Histidine, I=Isoleucine, 
K=LysIne, L=Leucine, M-Methionine, 
N^Asparagine, P^Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W«Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, possible 
nucleotide insertion 














VNS SRTPAWAA 


8016 


21917 


A 


8076 


232 


266 


LPYCSGQWCLGLPIIQGVQWHDHGSLQP 
KLPGLNQSSCIiSLPKCWDYGREP*YLA* 
YQARYHGSLARQNVFNIRPGIMVHARSP 
STLGG 


8017 


21918 


A 


8077 


390 


143 


LGGFLVFCP PAKKGFFPNP INLGTPRFF 
PPPPF*KPAPEINFGGPKKKKIPSPPPG 
VKFDP LKRP P PFFFFF FFFFF IRLLV 


8018 


21919 


A 


8078 


424 


144 


LKNYI FFTFL *NRDRVSLCCPRLVLNFW 
AQE I LLPWP P KVLGLQA 


8019 


21920 


A 


8079 


182 


45 


VHKAGMVAHAYNPSTLRGRGGWIT*GQE 
FETSLASLDNMVKPRLC 


8020 


21921 

i 


A 


8080 


405 


163 


YFLEEMG FHHADQAGLELLT IGDPPGGM 
SHCTWLNI4NWLIQKHPHRNIQNNV* PH 
ISGPVKLTHEGNHHNWHMSTQKHSP 


8021 


21922 


A 


8081 


39 


240 


QS KTVS KKKRKI I FCRYRVS * S PGFKQS 
SHLGLPKGWNYRREPPHPASTFSQSFPV 
KDHFCSGVSS 


8022 


21923 


A 


8082 


2 

mm 


263 


DCCVS SB *GSVGVGLS BPGAGYNLLVFR 
LliRPLEKRSIRVGGSRFSRYYLSWLPLA 
RKGNS PTPCTSQVRQCPTLLLGLHPLCD 
KPQ ^ 


8023 


21924 

* 


A 


8083 


1 

* 


248 


SVGVGPS EPGA* YNLLVCCLLRPLEQHS 
IRGGVSQFSKYRQSWLPLARKGNSPTP* 
ASCGMRGPALLWLTLLGLHPLSNKPQ 


8024 


21925 


A 


8084 


3 


255 


HLSLPSSWDYKRMPPHLANFFFFFLGKK 
GVLP*GQTGLEL*N*KNPPA*PPKGAGV 
KGVTHRPKP *NGGKGRFBKDTPGGPFL 


8025 


21926 


A 


8085 


409 


145 


LRGPYKKNFSFQAPGRKIGSFKKAPPFF 
FFFFF* GQSLCCPVGLSTGVTIAHCSLE 
I-LTSSDPASSASRV7VGTTGACHCS*VLQ 
TFTF 


8026 


21927 


A 


8086 


199 


26 


GFPPPPNKTEIF*RPPDFLNFFFLVEMN 
LTSLSRI*VSNPWAQAVLLPWPPKALGPQ 
A 


8027 


21928 


A 


8087 


222 


3 


SVGITGVSHHVLWYFTLKGFLGRGAHSC 
NNSALGGRGGWIT*AQEFETSLGSVARH 
HLYKKYKNYPVAWRL 


8028 


21929 


A 


8088 


415 


197 


TILCFYHQNNSVFSSNPFLSL* IFIF*D 
RVSFCEPGWSAWQSWLTVFPILCATVK 
YFTCQQYI IMI FALNS 


8029 


21930 


k 


8089 


297 


71 


FIFPENKNRPE WAHACNTLRGRGGRI T 
♦GQEFEKSLSNIARPHLYKTKTQNILSK 
YIQYLGFQLTLQIDLANIY 


8030 


21931 


A 


8090 


185 


415 


PFGLKQSPHLTLLSS*DYKHTPPCRVNF 
FFFLGGFGFFPKAGFNLGN*RNFLASPS 
KRAGMAG INNLAGP I FRFFKK 


8031 


21932 


A 


8091 


2 


125 


RKQAS KKERKE *KK* RKKERKKERKKER 
KKERKEKERKEEP 


8032 


21933 


A 


8092 


8 


412 


KGPQG7TLTCIFDLSYSSSLSHWSQSLC 
VAFPLPECIFPHKPLCPACHFFFFQSWG 
LALLPKREYRGKIGVNGNLELLG*NNPP 
SGTSKIAEPAELFHHTHARPFNWIIKTD 
FPYIAQTGF*LRPSRNPSTLEV 


8033 


21934 


A 


8093 


249 


143 


NIFAATWMRLBT I ILSEVTQG* KTKHQM 
FSLIOGS 


8034 


21935 


A 


8094 


3 


280 


PERWDYRHEPPGPASFT*LIiLFVFNL*R 
KGL YVG QRGRQLLAS SSL PAPAS LS AG I 
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Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid. (^Phenylalanine. 
(^Glycine, H-Histidine, l=IsoJeudne, 
K—Lvslne. Lr^Leuclne. M— Methionine. 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Argtnine, S^Serine, 
T=Threonine, V=Valine, 
W«Tryptophan, Y=Tyrosine, 
X«Unknown, *=Stop codon, /-possible 

niip|pn+irip riplpfinn \=nnsdhle 

nucleotide insertion 














IRMSHCTPAWVTVRPCY* IKK* INGKDT 
IRQNLKKM 


8035 


21936 


A 


8095 

* 


404 


16 

- ■ 


QMGFCLFFLPSSKEVFFPPXFFFYRESF 
IFFLFIRVSPYI FFFFFFFFFFPFFFFF 

FFFPPFFPFPFFPPFFFPPFFPFPPFPP 

FF FFFFFFF FFFFFFFFFFFFFWGSMEL 
LSKAHQCISSHGSG 


8036 


21937 


A 


8096 


358 


200 


KS YF I FSLI I FSHYFFLLFLKHLLATYY 

RSCL*TFH*RLYFSFPIFNVQKLF*LSD 
XCFCCI YLFIYLFIYraTVFCSCCLGWS 
MMA* FNGMFKDRI 


8037 


21938 


A 


8097 


2 


132 


KEEKKERKE * KKRKKERKKEKREKKRKR 


8038 


21939 


A 


8098 


48 


331 


PGTWHACLGGBD ILKWQTKGRGRRAGSH 
RTSWIEPLETVFLELEAGGRx^SPGKNY 
SRLGAVAHICIASTLGGRGRQIT*GQEF 


8039 


21940 

■ 


A 


8099 


7 


373 


NGRLRRGFAMLARL VLNF * PRDPPT * AS 
QSAG I TGMSHHARPTI ES F IHYCWE CKM 

P W dt. r< VT . T T .VPUT . 1? T PTf DnTTTyKTUM^trci 

S LLQP PTPGLQRS FRLTP PSGWDYRRGP 
PCPANI*LII 






a 


o i r\r\ 


1 lo 




vt ,dnt. p vnpnB p c pa t t .ktt jap q a t.* p qu 

x i-trCViljr iyKy/iT oirVlXX/l\XJP , Jl^>iXi w iSoxl 

TVAQAGVQWHNLGSLQPLPPRFKRFSCL 
SLPSSNIFAPLSYFFRNSSFTOISPSEI 
GPQITDPLFIKKKI KVSVMVE 


8041 


21942 


A 


8101 


361 


138 


FNLLGRKKKNFGGLKEGES RRPPNPGPQ 

PPPPPT.PPPPP <5n* PUnT.TMT.PD T .\7T.NQ 

WAQVIIiLPWPPKSLGLEV 


8042 


21943 


A 


8102 


1 


199 


PTRP FVFLMEME FYHIGQTALE LLTSGN 

T .P7X Q 31. Q n Q 2Vf3 T TfiVQ nP * TA+ VT .WPTTTTR 
uriuAoudnui ivjvowK'iiH" xxjc rrrv& 

LFCYFALLLI 


Qf\A1 
OU*fJ 




n 

A 








<3 P P P P PP P V T f?P KT3PP WPrtn PPPP P 

SPFFFRGF*GSPFPPK*KVLLVGFIiKPK 
PPFPDGWGNPPPPPVGGFSPPQSIiQFFF 
F KKVFGWVFF FPLF F FFFFFS PFFLFFS 
ETESHHIAQA 


8044 


21945 


A 


8104 


192 


2 


EETELSFFIIKYYLPRNQKVCKRL*RRL 
GPVAHACTPS PLGGQREW ITRSGVQDQP 
GODGETP 


8045 


21946 


A 


8105 


373 


183 


GGPF* FQYILILFDNFFFFSLSFPFSQD 
FFFLSS FFFFFFFFFFI INYFYFF 




91047 


A. 


oIUO 


3 /J 


9ft 1 


S PKTAP P FFFFPPW * RLSIiTML PtJMVSN" 
SWAQWLLPQPPKVLRL*ATQRKKHFRQ 
EQI 


o\JHt 


01 OAR 

• 


a 


olli/ 


* i 


114 
1 14 


pt .t fvh t n pwwanAfiT .pt iT j» ^nnpp at . 
ASQCAGITGMNHHARPYYLL*TLNIPLP 
YDPEIFLLGIYPR*MKTYVHTKACT*FF 
ITPLFVI AKTWKQPK* PSVLGLQA 


8048 


21949 


A 


8108 


416 


243 


PGVFPPPPF*NPPPEIIFWAPKKKKYFP 
PPPPLNFVFFKGPPPFFFFFF FFFFFFF 
L 


8049 


21950 


A 


8109 


2 


211 


S VTQAGVQWHVS LLQ PPJjPGLEQS S CLG 
L**OT)CX3HEPWHJ^VxTSKHTESCSL 
LCI PTATFLGLVT 


8050 


21951 


A 


8110 


172 


28 


HWCFITSFFGFYFL*RQGLTKSPRLVP 



1043 



PCTYUS01/04927 

• 



SEQID 
NO: of 
nucleotide 
sequence 

i 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

cth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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peptide 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Seriner 
T=Threonine, V=Valine, 
W-Tryptophan, Y^Tyrosine, . 
X=Un known, *=£top codon, /^possible 
nucleotide deletion, V=poss!ble 
nucleotide insertion 














NSWV*VILLPWPPKVLGLQA 


8051 


21952 


A 


8111 


2 


225 


QKSAKNFYHLS I YLS I YLS I YPS IYHLS 
IYLSIIYIiSSTYE*VIILEQKWNSIKKG 
CL*CTIQTNCL*QLKE*VIRLIPKL*YK 
I KLPTCW SVHLNWFTLCNI S WYLKYLAM 
* * RKLVCFA* FWYNSNKLLI T I KRIGH 


8052 


21953 


A 


8112 


392 


221 


PFPKRESPPDKVNPPPPDIFGGGKTKKK 
TLANN* FPPKEKEFFVFFFKKEDFF*NI 
VGNFKTPSKI*AVCF*NPVFKKKKGAPK 
KKSSFKNF 


8053 


21954 

1 


A 


8113 

1 


369 


112 

m M mm 


I FFSTI YYFXHPFFFPPPLFFTPPPLFF 
FFLKKKKFFFFFFFXXXFFFFFXXXXFF 
FFFFFFFFFFFFFFFFFFFFFLAF*HVY 
SF 


8054 


21955 


A 


8114 


418 


248 


KYYLPTPGY*NIiILLN3PPFFFFFFF*D 
RVSPCCPRLP*TPRIiTQSSCLSLPRSWD 
HRCMPLRLVTF * FTFKLSHQS LRTLLTY 
TLAGTTLG 


8055 


21956 


A 


8115 


229 


78 


SFKGAAPFFFFF* * RQNLTMS PKL VSNS 
RPQAVLP P WPPKVLGLHRGGW I T 


8056 


21957 


A 


8116 


356 


66 


PLFPFFYIKNFKKKFFFFFQYFFFFPPF 
PKHYSHKDI FFFFI flCFFPSSSFSFPFF 
FALSSFFFFFFFFFFFFFFFFHA*FPVF 
FLYKIPMLYPQL 


8057 


21958 


A 


8117 


212 


81 


ERLSHCS LNLPDS SNP PASAS * VAGTSG 
THQLVMWHAGSPRF 


8058 


21959 


A 


8118 


367 


2 


GVFFFFFSPAKKGVFFHPFFFLGPRFFP 
PPPFFFPPLWFYFWGP * KKY* FPHPRGY 
EFFFF*GGPPHFFFFFFFFFFFFFFFFF 
FFFYAEGIGVSPCYIMIiGYNFSSFPCGT 
ISPRVRPRV 


8059 


21960 


A ~ 


8119 


38 


215 


FFKSAFHHTRLI FNFFFFLEKTGSHYVA 
QAGVE LLG * SSPTTSAFE IGG I TGLFLS 
FKN 


8060 


21961 


A 


8120 


266 


2 


PQ I FLFI FE TRSHS I TQ VGVQ WCDHSSL 
QPQNPGLKQSSLSS * DYRHTLTAPG* FC 
FL*RCGLAMFPMLVLSSWPQAVLKIHPC 
CGMC 


8061 

• 


21962 


A 


8121 


3 


332 


DAWADAWGHVPPCPANFVFLV * TGFLHI 
GQAGLBLPTLSDPPASASQSVFIYFLBT 
ESHS PAWETERDSVSQKKKKRGGRFKGS 
NFTSAGLQRFI FFMGPPKLI SRAGV 


8062 


21963 


X 


8122 


378 


66 


LFIPLPIREEVFPCIPG*NWAPPGVLYN 
GRPFFFFLLSTVFLHAGQAGLBLLGSSD 
LPVSASQSVGITGMSHRVQPKLRIFVNI 
SFPFEIFCEIMLLVPLAQP 


8063 


21964 


A 


8123 


2 

• 


316 


ISMTtTiATiliIiRIITFRLPQIiNGYVEEFTP 
YECGFNPISPGRAPLFIEHFLVDITFLL 
FDVQIALLLPLP * ALQTGNLPLI VMPSL 
LLII ILALRLDYERLQNGSD 


8064 


21965 


A 


8124 


331 


14 


ITSILLIIITF*LPQLNGYI*KSTPYEC 
GFDPI S PARVPFS I KFFLVAITFLLFDL 
E IALLLP L P * ALQTTNXjPLIGMS SLLLI 
1 1 IALSLAYE* LQKGLD* AE 


8065 


21966 


A 


8125 


2 


228 


IIITF*LPQIiNGYIEKSTPYECGFDPIS 
PARVPFS X KFFLVAI TFLLFDLE IALLL 
PLP*ALQTTNLPLIVMSSL 


8066 


21967 


A 


8126 


451 


200 


KA* WNRFAIIiNVTKYTCESQEBVKI PTF 
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E=Giutamic Acid, ^Phenylalanine* 
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K=Lysine, L=Leucine, M-Methionlne, . 
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W==Tryptophan, Y=Tyrosine, 
X=Un known, *«Stop codoa, /"possible 
nucleotide deletion, ^possible 
nucleotide insertion 














AGVWKKLI PAHTDDFVHPKTSVQEVATD 
VAKIASLLELEVESEDVTE*LQSQDKT 


8067 

w v v r 


21968 


A 


8127 

\ 


2 

mm 


396 

w r w 


LI VI INTLLALLLI I ITF*LPQLSGYIE 
KSTP YE CGFDP I 3 PAR VPFS I KFFLVAI 
TFLLSDLE I ALLLPLP * ALQTTNLPLIO 
MSSLLLI I ILALSLAYE*LQKGLD*TBS 
GWEKLVKKE KADKLKGGRP 


8068 

www 


21969 


A 


8128 


1 


350 


GTRGLQTLLPLIAF* *LLASLANLALPP 
TTNLLGELSVLVTTFS * SNI TLLLTGLN 
ILVTALYSLYIFTTTQ*GSLTHHINNIK 
PSFTRENTLMFIHLSPI LLLSLNPDI IT 
GFSS . 


8069 


21970 


A 


8129 


375 

* 


1 


LTCFS EHRPPPRANFFFSRKTRAP FFLP 
GGFQI PGPNFFAPPIFPNGGPIPGLDPQ 
PGPLNFF * KSFFPDPKFFFLFFPPPI *N 
SGVFLPRNPVFPKTPNFLFFFSPFFPGD 

w w ^m^m* » ^ m ™ «• " » Mi mi m ■ M> mmm> w m> mv Mi M» m Mp ^m mm* 

GVLLCHPGRLPRA 


8070 


21971 


A 


8130 . 


32 


211 


G I RHGFAMLPRLVLNSWPQAILLPWPP K 
LLG*QAGVELLASSNPPALASQTAGMTG 
VSTRPPVLLPTLGTHLCLDACLGS SL 

W ^™ ^m^^m W MMMM *r Ml ™* Mi • »-M^ v^MM MM^^ M^ Mr MM 


8071 


21972 

■ 


A 


8131 


396 


5 


KYRDGPPPPGKFFFFFF*DGFSLCCPGW 
SQTSCPK* SSCLPEG*DYRHEPPHMTLS 

m» Mr ^r M- mm Mr M» m^mm mi MM Mr M> M w mmm m» • vmv v mm Mr 

SS Y * CS QMQKCINKNCLGSCWAHACNL 
SPIjGGRRITWGKKFETSIiDNIVKNRLKK 
KKKKIYQGGGDHPCIFTN 


8072 


21973 


A 


8132 

W- m> wr mm 


347 


34 


KAGPVI PRGPQEF F FFFFPQDGI LLCFP 
GWRAVAKS * VTATSNS WAQAITGMLPP C 
PANF* KI S *RQS I SMFPRLVPNS WPQVI 
LPPQPPK*LGLHVCLVPSLA 


8073 


21974 


A 


8133 


102 


270 


DYRRTPPHLANFFYFL* RQILTVLPMLV 
SNSWVQAILLLQDFFSVQLKMGFLSVP 


8074 


21975 

■ 


A 

- 


8134 


338 


47 


FLSTQKES S PLHTHTHTHTHTHSHLLTH 
HHGRGQERGKRGR*GG*WRTGTHSPHGP 
HCVRLRKSRPSGBLGAAQGGCSLIQPKI 
EDPGQSGGPREVP 


8075 

■ 


21976 

■ 


A 


8135 

• 


68 


467 


SLLNRIPFCLCLNSFDCVFLLGLLFEAG 
FCS VAQAGVWWYEHGSLQP * PRGLK* SF 
HLSLP KFWDYRNE PPHPPFS CFKALNFK 
DYCDSGTVPCPFMKELSMLFIHALTESF 
VLFIVMSKLITTKHLTI ISNV 


8076 


21977 


A 


8136 


500 


355 


FHHVGQAGIQLLTSGDP PA*ASQSAKIT 
GVSHRARPQTSFLNKLPSIiR 


8077 


21978 


A 


8137 


58 


222 


I YG YEGRGS VAGS LS SL * SATTDS DLD Y 
DYLQNWGPRFKKLADLYGSKDTFDDDS 


8078 


21979 




8138 


345 


196 


KGDQAQ WLRPGMVAH I CNLSTLEGQGGR 
IT*AQKFETSLSNTLRPPISFK 


8079 


21980 


A 


8139 


500 


355 


FHHVGQAG IQLLTSGDP PA*ASQSAKIT 
GVSHRARPQTSFLNKLPSIiR 


8080 


21981 


.A 


8140 


3 . 


231 


HASADAWGRNEWPCAVAHAHNPTTSGG* 
GGWIA*AQEFEFETSLCKIVKPHLSKKK 
KKKKNFSPGWGSPPCTPLF 


8081 


21982 


A 


8141 


416 


26 


AR I FPNNS FW PGQGGFPF P PRPLEGPGG 
VIP*CQGVQARPAPPGEPRLFPKPQKIV 
GGGGE PPLFP PPERAGPQ KCRPFGEGGF 
N* PKPRPCPPGWGAQGGFVSKKKKKRNE 
DSKIPLIGKYPVGIRNMA 


8082 


21983 


A 


8142 


413 


35 


RGAGGGG SPPGPRYPS QQHLLRART PT P 
FARSRGSFPGVRGSGWRGGLGG* SRERP 
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sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F^Pbenyialanine, 
G=Glycme, H=Histidine, I~lso leucine, 
K=Lysine, L»Leucine, MMVfetbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LPSLPPRALPRSGASSGPLDGGAQGPGR 
RERWAFLGGIRADARGCRAVTSPEPYRL 
PGAGAARARR ' 


8083 


21984 


A 


8143 


166 


368 


ENILWAAMVVLDCHPSTLGG*GGRIT*G 
QEFKTNLANIERPHLFFFFFFKRDSLFA 
PRAEGGGPILF 


8084 


21985 

• 


A 

- 


8144 

• 


1700 


621 

■ 


DLRDHSVLQQGPACLTRCGBPILSHSEG 
SKIGSERESGEHSRHPBAPPPSVEVGSR 
RS * GQDS *G FGPGNGQDTRGPLSHAAPN 
WTPDPAAAVAAGGTSLPPRGGGGI *RGR 
SHTARS GARVGS ANAHARGG3G FSRHSG 
SHGCLRNQS PRLGKAGALCGSAWVSGL * 
GSGLTS PRGRRERLRMPG VAKGRPR *GN 
PRRGRG KQHG I P ASGL PGRCWLGI KMS T 
IjQPRTRAPEELFVPOESSGSTPBKMLVS 

fhgss lrneatprysqqeeagngrwqqs 
XjS lerwp p wtshplgt p plmpvavarc c 

ILPGLWPLLSPPSGSASPVSQGPGCL3L 
WPNAFKKDDFPVRQGDTSWRSVS 


8085 


21986 

♦ 


A 


8145 


390 


38 

■ - 


NTTTFNIYFFYFFFFSLFIYFFFFFFFF 
FFFFFFFFFIFILFIYSLLVFGVWVLGL 
GLAIiALLAKLFL VNS LCRRYRG * SLLYY 
AWL* FFI FPLRYYI YCARFQFLSPILYL 
GKWFG 


8086 


21987 


A 


8146 


20 


187 


KLTTS TGRNPGRPTRPNF * VF * RDGDL I 
MLPRLVSNSVfPLVNLLPWPPKILGLQM 


8087 


21988 


A 


8147 


417 


64 


VRPR VRPRVRPR I RKKKKNKKKKKKKKK 
KKKKKWGGGF * D FWGGEKGPP PLQKNDF 
LLFWGFFLKS F FFFCGGLFFWGGPFFFP 
PPPNFLGVGGKKKTFRLLCGPFSAFFGG 
EKN 


8088 


21989 


A 


8148 


351 


187 


RIX5TF YLSNI LRFYLS ITHGP I P YIGRL 
TSQNLPIiLLLDLT*LEKLLPKKSQTYN 


8089 


21990 


A ' 


8149 


465 


60 


PPHFWGNPKKKNSQFFWAKKKKKIFLGP 
* KMVAKKG WF WGRKKG FQRGGKKG * KG 
GQLF YPVLSQVLGGGQNHGP PGKAGFGE 
NFFFFGQGLGWAI SRWLRTPLFLAGGGE 
DPKKNFKPPPKIKTLGAGVKKK 


8090 


21991 


A 


8150 


I 


175 


WDYRHVP P YLANVFFVI FRGDKVSLCCP 
V*GPTTEIiKRSSCIiGF , P*GWDYRREPPH 
LA 


8091 


21992 


A 


8151 


59 


35 


RWNSRPRRPGLKFLTSGDLPASASGSAV 
ITDVSHHAWPETFFR*RA 


8092 


21993 


A 


8152 


269 


159 


ARVEVSKKYIYIHTHTHTHTHTHIHTHT 
HTYTTKKLPS * L FQVCKME KFGRL WQ 


8093 

■ 


21994 

- 


A 


8153 


1 


256 


PGPPPRFFFFFFFFFFFFFFFFF*TLKI 
RFI WRKQ KTHNPKDG I LHQD I SQVXSLQ 

ITNHFHRRYFCTNFTCIQERDINQSLFL 
F 


8094 


21995 


A 


8154 


252 


41 


LIIPVLICIVFIVLQFKXFSVISLNKGL 
FRNHFFIF**RWGLTMLPRLVSNSKAQV 
ILPPWPPKVLGLQV 


8095 


21996 


A 


8155 


21 


443 

# 


HELSLI * FISTLAETNRTPFDLVDGESE - 
LVSGFSIEYAAGPFALFFIGEYTOIIII 
NTLTTT I FLGTTYDAL S PEL YTTYFVTK 
TLLLTCLVL* IRTAYPRFRYDQLIHLL* 
KNFLPLTLALLI^YVSXPITiSSIPPQT 


8096 


21997 


A 


8156 


2 


205 


LLHLMKHFNLEIQKAQ*TRIN1 J KRSISR 
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SEQ ID 
NO: of 

DUCIC0 11 Uc 

sequence 

• 


SEQ ID 
NO: of 

sequence 


M 
eth 

rr 

OQ 

• 


SEQ ID 
NO: in 

09/515,1. 


Predicted 
beginning 

n ti oIaa^mIa 
11 U ItCOlluc 

location 

cui rcapuiiu 

ing to first 
amino acid 

peptide 
sequence 

* 


Predict- 
ed end 
nucio 
otide 

1UCHIIUI1 

correspon 
ding to 

1 Ho I aillluU 

add 

residue of 

pepiiae 

sequence 


Amino acid sequence (A=AI a nine 
OCysteine, D=Aspartic Acid, 

fir- uiumnnt /\v»iu, r r uciij mianine, 
G=Glycine, H^Histidine, l=Isoleutine, 

KsT .train a 1 ±sY.Aiif*inA IVT ot h Innina 
rv— xjysiiie, jLr~jjcuuuc, ivi — iviciuiufijnct 

N»Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

T=ThrAfminp V=VolinA 
M ~- 1 nrvuniiic, Y~~YHnntJj 

W«Tryptophan, Y=»Tyrosine, . 
X^Un known, *=Stop codon, /^possible 
iiucicouoc Ocicuon, \— pubaioie 
nucleotide insertion 














HVI INLSKAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN 








ol57 


*> 


2U.) 


HVI INLSKAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN 


8098 


21999 


A 


8158 


2 


222 


VYTWGTGKARVFPGEF* LPVW I PTRHLK 

CVfcTD "D TOF\7\TJX'C A C A PTPMPrtO C T TUBA 

DEQNGDIRRTDKVTIHQ 


8099 


22000 


A 


8159 


1 


297 


GFHHVGQSGLELLTSGDPLASTS *SAGI 

t 

mpMO till A TVTDT VPO VT*T\C OCMT VHtlDT rrrrf 

ACWWEEGWRKLLMENEQGREAHQKLBSV 


8100 


22001 


A 


8160 


138 


437 


NEYDHFS I IKERRDFI VCVFFFFFSKKS 

t p rxrr , T<*>nr«T>cvT t?ddt?t rwr t? c t , c»t 
i_i 1 V Vj i Wvi<j rb I iiilj F e Jr iAsW J\JSi r o<JJj 1 r 4j 

GSWDNGGATTRLG* FFVFFRKKGVSPCG 
PGWS *TPDLRKTPPSA 


8101 


22002 


A 


8161 


330 


148 


QQSKTPFFFFFFKKKKKYIYICMVVHAY 
NPSYSGG*GGRIT*AQKFKTSLNNTVRP 
ftLYiUt 


8102 

* 


22003 


A 


8162 


391 


255 


RPAPADF* FFFFYRLGLAVLPRLVWNSW 
PQATLLPQPLRVLGLQI 


8103 


22004 


A 


8163 


I3l 


3 


HKVSLCYPG* S ALMQS * LTVASNS WAQA 
ILQP* PPE*LGLQA 


8104 


22005 


A 


8164 


97 


440 

i 


NKGSLCPPGGRPGALSHPTARSRGSPGP 
TPQKTGNNG.PGJP PTPVNLB FFEKRGFTP 
VGQGGLKLRT * RS P P PS APQKAG ITGGS 
RRPGPIQKKFLKTLLGETKWMAKLIKTY 
LK 


8105 


22006 


A 


8165 


541 


295 


FSQQKKKI FISTNLSHQES KI YI CKEME 

ptrertrTDT t o/t*/"*^ Tiurvn-i/T" t T7T r»C5 c?c» tvt* 
oHSVIRLLSQSGAinNHCSIiKLPSSSDP 

PASVS * SAGI IGVSQHLANYTFKKIF 


8106 


22007 


A 


8166 


440 


259 


PPQKFFFFFFFVKTGFRHW*AGFRLLD 
SSDPPPLAFQSVEITGMSPSARPMPGVF 
HSLR 


8107 


220Q8 


A 


8167 


23 


405 


FRICHLLRVHLLVYFI PSFTDNHLLSTS 
AMFLYSNDWIAQNVSGPFLGI I FRTFF 

PGLKGFSGLTPPGGGNSGGGPPNKPNFW 
IFKKGGGSTLWPRII 


8108 


22009 


A 


8168 


23 


388 


P YFS CALPRHHP PTTNLGPHLGMGAPWG 
y fo VAntl 1 ijur r r r r bJvKJJr v-r Vr'KViilt 
LWLEESSLNPPLLGLKBFSCLTLWRTGN 
NGGPPPPPVIFCF* KKGGVYPWGPGGGE 
TPALKETPGL 


QIAO 


Tim n 


A 


oioy 


012 




JN ±tit olio r oillnKxlTUl 1 tilxtl ciAittrJL XI 
RTP*DNVPQ 


8110 


22011 

i 


A 


8170 


2 


288 


VNLTFFPQHFLGLSGMPRRYSDYPDAYT 
T*NILSSVGSFISLTAVILIIFMI*EAF 
AS KRKVL I VEEPS INLE * LY I KKKKR3CK 

KKKKKKKKKKN 


8111 

• 


22012 


A 


8171 


381 


101 


QKSLQQPSNIATILEEVQVIFLGSPYHL 
QRWEMRLTKWLRGLGAVAHACNPSTLGG 
MGGWIA* AQELETSGDP P PSSNRQLFTG 
EHNNDNKLI 


8112 


22013 


A 


8172 


136 


360 


TKKHFGLLK* *TKKHFGLLKKKKKKKKK 
KKKKKKKKKKKGGGAFKKKKFF PRGGGK 
I FFFFGAPKKKFGGGVLKTGGGKKPGVT 
KK 
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SEQID 1 
NO: of 
nucleotide 
sequence 1 

"i 


SEQID 
NO: of 
peptide 
sequence 


IVI 

eth 

oa 

* 


SEQID 
NO: in 

09/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 
location 
correspond 
Ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucie- 
otide 
location 

correspon 

jj Ag- 
oing to 

last amino 

AM* n 

aciu 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 

17— Pluto ihia A >i<l 17nDnAnirln1na!a» 

Er-vviuiamic aciq, r=r nenyiaianiQe, 
G=Glycine, H^Histidine, Msoleutine, 
K=Lysine, Lf=Leucine, M=Methk>nlne t 
N=Asparagine, P^ProIine, 
Vf=oiutamine, K—Argimne, o^aertne, 
^Threonine, V-Valine, 
w— i ryptopnan, 1=* tyrosine, 
X«=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


8113 


22014 


A 


8173 


358 


58 


EKVSVFFLKMKCSGIILLPCRLTLPGST 
DS PASVCRMPAI AG PRRHA* LVLVETGF 
RCVGRAGLQ PLTAS D PPTSAS PGAAIAD 
uvtsiriyk-awvFKiKvu 


8114 


22015 


A 


8174 


254 


370 


GLTVLPGLVSNS WAQVTLLS WP PE * LRP 
QAYATGAGFLL 


8115 


22016 

i 


A 


8175 


48 


269 


FYVILCLRDRVLLCCIiGWS*TLGLKLSS 
YL S LLS S WDLiQHVP PHSAS uLTF* VKAC 
FFLLKYLKRLKIYAKIM . . 


8116 

■ 


22017 

• 


A 


8176 


1 


338 

• 


PTRPINTLLALLLI IITF*LPQIjNQYIE 
KSTP YECGFDP I S PARVPFS I KF FLVAI 
TFIiLFDLE IALLLPLP*ALQTTNLPLIA 


8117 


22018 


A • 


8177 


426 


130 


QSLDPPGNMVTLSLQN*KNPPP*GGAVF 

PRQP PS P KKKKADS EFS I FNLQPALLCK 
QYCIVLHALPNKILF 


8118 


22019 


A 


8178 


396 


244 


KXHLLY I CMC I YI YVISRHY I LR * HTHA 
HTYTHTHTHTHTHT P KNT KYTLW 


8119 


22020 


it 
A 


8179 


2 


119 


TRPGNW PGVVAHFYDPSTIjGA* GELIT* 
GQEFETSSSLY 


8120 


22021 . 


A 


8180 


2 


80 


FLPLTLALLI * HVS I PIT ISS I PPQT 


8121 


22022 


A 


8181 


438 


333 


SPLGRLRHYNCLNWGGKGCN* PKSCPCP 
PAWGTK 


8122 


22023 


A 


8182 


393 


19 

• 


IFFFFFPPPSPAKFFFIFKKKRFPPLWG 
GVF* S PPPGFQKTPPPQKVGFSRVFPPP 
PPKKFFFGGGKYFFFKEPPPPPPPIL* K 
GGAKISPPPPPGKKKNPFFLKGGKKKKT 
PFIFFFPPKKIKN 


8123 


22024 


A 


8183 


24 


269 


GSQQVLGFCFRDRVLLCHPG WSAVM*S * 

T fTTT TTi flfYlftMJl An TT/IAT T f*MA Hill >TlllirtT> 

LTVASTS WA*R I I^CSIiGSVrEYRHTPP 
CL*NRNPQLTVKWRVKDHNNKKSHR i 


8124 


22025 


A 


8184 


154 


361 


WHNHDPLQPQTHGLKRSSHLGLPSIWHY 
RHVP PRLANFLNLS FRQGLAMLPT PS VC 
HLLVFLS * H I YWTFG YLS * S VCS HLLNF 
L**WHNHDPLQPQTHGLKRSSHLGLPSI 
WHYRHVPPRLANFLNLSFRQGLAMLPTP 
SVCHLLVFLSETLGFK 


8125 


22026 


A 


8185 


AAA 

228 


337 


RPGMVAHACX PG I IjGGRGGW IT * GEKFE 
TSLANMVK 


8126 


22027 


A 


8186 


314 


127 


DIQPKKEGEGEEEQELWLGMTAHAYTPS 
TLGD*GGRIA*VQEFETCLGSIiARPPSL 
QKNFKN 


8127 

■ 


22028 


A 

> 


8187 


10 

* 


420 


TQEAEVAVNWDHATALQPGRQSETPSPE 
KKKKKRG P PRWGENF FPTPGG F * KDGGN 
KKTGFPP FKxDGP P P PKNWTPWVFQKRG 
APPEP PPLGGALGRE PLGGFFGKGGVET 
PFFP I Jr lUS^NPPEXlCPHRBrxi FFAF 

■ 








8188 


398 


i 
j 


IjKKKRPOTPFSPICIOF * KFFPPFPFKKS 
PFPKNLLKKKRAPGGPIFFFFFYPGLFK 
NPSPL*MGSPPPPLFNPKN*KKPPLLKK 
GPGF*WPFFQKKKPPFFFFFFWRQGFTV 
LPRHTPS CLVNFCI FCRDGI 


8129 


22030 


A 


8189 


30 


153 


AW YVRRE DHVS LGG * GCGKP WLCHGT PA 
WVTEENPKGKIK 


8130 


22031 


A 


8190 


206 


45 


HPSQNSLSKN*ERVSVCCPGWSAVAGSQ 
LTAASTSQAQVILLPQPPE*LGLQA 


8131 


22032 


A 


8191 3 


228 


TCVCLCVCVYICDVCVGCVCVYIDEYIQ | 
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SEQ ID 
NO: of 
nucleotide 
sequence 

♦ ■ 

» 


SEQ ID 
NO: of 
peptide 
sequence 

• 

■ 


M 

etb 

od 

• 


SEQ ID 
NO; in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict* 
ed end 
nude* - 
otide 
location 
correspon 
ding tfw. 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid; 
E=Clutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucfne, M=Methlonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
TMThreonine, V^Valine, 
W=Tryptophan, Y^T^rosine, 
X=Unknown, *=Stop codon, A=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














IYRYIHTYIDI YTHI * IYIHTYIDTHMW 
VCVYIYIYIYRCMGVRET 


8132 

. ■ 


22033 


A 


8192 


41 


416 


I CR VKE IASRPLLE THSLSLVELS * FCL 
TAQ * RNDFLALKHLTAVLYNL I LHS SGF 
WSSLKSHFIFLAIi* YYL* *FI *SN*RRD 
LAVLPRLVSNSWPQVILPPQPPKVIiEIR 
YEPPSLALILFPN 


8133 


22034 


A 


8193 


2 


183 


krls i wqkkvegk6fqgmdvgpvs vaha 
cvssslegqggkia*gqefetilsnivr 

PHLY 


8134 


22035 


A 


8194 

• 


2 

■ 


465 


AGVQWHDLGSLQSPP PGFK* FSSLSLPS 
SWDYRSAPPCPADFCIFRRDWAGWGVGC 
G FHRVARAGLELLTS GDP PAS AS PGAG I 
AGMSHRTQRGQDNFASWRRGGPQMSPS P 
PSCXFLC2NTFSLKPSWVAPYPHASALGG 
GE YGGP PS PQPHLE 


8135 


22036 


A 


8195 


397 


149 


L I CLTDLP I RFLSLLALVFTVFNTKVKV 
KDLIKLGWAHTCNLNTLGG*GGRIA*A 
HEFETSLGNI VRAVI PVOCFS VILSR 


8136 


22037 


A 


8196 


355 


95 


KKIMPTLCLKIPS* YRCEEI IQLSLQSS 
CDHRHMPPHPTNPF*RQHLSMLRSLVLN 
SRPQAVLPHGPPKVXrGLQAPATAPGILS 
LF 


8137 


22038 


A 


8197 


382 


112 

■ 


KTDFGKKGSPSSPPPFFFFKKSFAPSPR 
GEKKAPLKSQRGLCPPKPKDPPPPGSG* 
TQDKGPPPPGQP I FFFFLKKGVCLFSRG 
GSKTFI 


8138 


22039 


A 


8198 


32 


361 


ASR IGFI LQEFL P VR VWTTNPS AGADG Y 
NFLEKQKGTEKTS CS VAQAGAQ WCHRNL 
L* P * TSELKQSS CLSLLSSWDYRMYFVL 
LGKQVTLLIGKRALSVYPRDWSLC 


8139 


22040 


A 


8199 


1 


314 


INTLLAT.T.TiIIITF*LPQLSGYIEKSTP 
YEOGFDPISPARVPFSIKFFLAAITFLL 
FDLKNALLLPLP * ALQTTNLPL I GMPSL 
LLIIII*ALSLAYE*LQKGLY 


8140 


22041 


A 


8200 


3 


109 


FTSKHHFGFEAAA* YWHFVDW* LFLYV 
SIY**GS 


8141 


22042 


A 


8201 


3 


216 


DAWAAVGHHHTRLI F* I FVEIGSLCVSQ 
AGLEILDSSDPPSSASQSAGIRGMSHGA 
GQEHLFHRPIGKIK 


8142 


22043 

- 


A 

■ 


8202 


1 

! 


267 

- 


RIHTGEKPYKCEECGTGFKGPSTLTTHK 
FFVYCRE VAVLLKNCYSHLYPH * IIVNG 
SEADKREMCUjCIFFHPLAEEQENVENK 
RKYNP 


8143 


22044 


A 


8203 


422 


92 


PLVFALPPKIGFAPTTPSSSPPPPPLFF 
FGPPVFPPPPFFNPPPFFFFSPPPKKKF 
PSPPPRGKIFPF*TPPPPFFFFFFFFFF 
FFFFFFFLFLYYLFILC 


8144 


22045 


A 


8204 


445 


260 


VCSPPLFIWVAPGFSPPPLFKNPPPNFF 
FGAPKKKNFFPPPPA*NFFFLKGPPLFF 
FFFFF 


8145 


22046 


A 


8205 


103 


248 


GPG WAHACN PSSLGG * GQWIA* AHE FK 
TSLSNAVRP I SAKKKKGGPF 


8146 


22047 


A 


8206 


407 


197 


QL I FKFL * EQAS H YVAQAG LELLG SGNP 

PASTSQSVKVTGMSHGAWLLAPTLRSTS 
LNYNSLSVLLLPRP 


8147 


22048 


A 


8207 


I 


142 


S CS VAQARV* FRDLGS LQP P PPRFTPFS 
CTPAWE TVRLHLRKKKKKN 
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SEQID i 
NO: of 
nucleotide 
sequence i 

* 


SEQID 
NO: of 
peptide 
sequence 

> 


M 

eth 

od 

• 

• 


SEQID 
NO: in 
USSN 

09/5154 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, ^Phenylalanine, 
G=Glycine, H^Histidine, I=Isokucine, 
K=Lysine, LHLeucine, M-Metbionine, { 
N-Asparagine, P^Proline, 
Q=<3lutamine, R=Argioine, S=Serine, 
T=*Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, * a Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


8148 


22049 


A 


8208 


2 


370 


PBVFDYPN * FKNL*LHKTCTQ*L* AALF 
VITKTWNQGOPLTGEWINKLEYtHTMDY 
LAI KINELANHKMM*RTLKCI LLSERS * 
SRKATYYMI S I IRHPRKGKTIBIVQNSS 
CQGLGERQKGL 


8149 


22050 


A 


8209 


363 


108 


KKPRVE ACLS PGVGRERAKRER* KRERE 
RERERERERENEL*KLSLADLCIKNG*R 
EFFKEKRYVKRYAKILRVSGRKQKYGNV 


8150 

• 


22051 

t 


A 


8210 

• 


386 


254 


LNPGGGVCSEPRLCPCTPAWVIR*DSVS 
EKGKKKRKGPEHS LTATNF* ALSCNLDY 
liRCSMLKMQRQRKRKERTEVRFWQLHHN 
LTLGFV*LIVYQRE*RRTGWFVAKYMLY 
TPS * PGSFP RLRKCVEHVFGYEPACS PL 
FPL IHNQLHKAKC 


8151 


22052 


A 


8211 


3 


131 


AHLFIAWFTEYFKPTVITYCSKKKIPFK 
MLLL I D SAP S YPRALMEMYKGMNWFMH 
DNLTFILWPMDQ*VTFTS*SYYLRN*FC 
TIAAIDSDSSDGSGQSNLKIF*RGFTTL 
DAI KNI CDS *EL 


8152 


22053 


A 


8212 


86 


286 

• 


WS I WE CTVI S VQLFHKPKTS FKNTS YWS 
GVMAYTNNLSTLGSQDPQIT*GQEFETS 
LANMVKSCLY 


8153 


22054 


A 


8213 


393 


344 


SFPFSLPSKWGFKNLVPS PGYFLFFLNK 
GFRYFGQGGFKI LNSNYPPPLASQKGGI 
SGISPWPRALVFF*WRNLGPN*RTIVSP 
NYGLK 


8154 


22055 


A 

- 


8214 


3 


313 


QGLVLV?HSHGLLQP*PPGLRRSSCFSLP 
CSWEHRCTPPHLANFLILCVCVCVCVET 
GSHFVAQAGLELLGSSDPPLS ISQKSWD 
YKV*AI I PRAKQNYLMLFY 


8155 


22056 


A 


8215 


165 


348 


RPCPL FPEDLLTFSREGPGTMAHACNLS 
TLGGQGGGIA*GQEFETTLGNIVRPHLY 
FYLKK 


8156 


22057 


A 


8216 


379 


218 


VFLVETGFLHVGQAGLKLPTSGDPPASA 
SVSAGITGVSQKCRF*YNVFIFQKSSIiL 
NALPICSLTMLLGNLFSMFMTPKFFTCS 


8157 


22058 


A 


8217 


32 


276 


RYLPI IAALFTIAKI *KQPKSSLLDKWI 
KKLWC I YTME YYSALKVILTVVTTWMNL 
VGFIPGDINHIQKAKYCMISLICGI 


8158 


22059 


A 


8218 


171 


2 


KGVFGLGRLFFFLDRVWLCLPEWSTWQ 
SRLSATSAS*IQAILPPQPPEELGPLCP 


8159 


22060 


A 

■ 


8219 


530 

■ 


73 


TKPTVGSNSRSYINQTQYFPSTLTFGGL 
MRLTIYHFFFLLVYYNPRVMVTPAYM*L 
LIL FYS S S IRDLL I AKQ P CL PNEGFS PH 
AC* TNATLLFP YS I * YLGLPSFHLIFFV 
CFSFYFL*R*GLTMPPMLVLSSWPQVIL 
PPQPPKVLGLQA 


8160 

• 


22061 


A 


8220 


350 


135 


FFFFSQTGVAQNRVQ*RDHNSLQP*PPG 
LRQ S S CLTLPKCWD YRCKP LCLAS W I YF 
KDKVYTSTPVHTQEY 


8161 


22062 


A 


8221 


185 


331 


PGMVAHAYNPNTLGG* GGKLT*AQEFBT 
SLGNINSVPKKRRKRRKEMM 


8162 


22063 


A 


8222 | 


73 


232 


YCS FKF I TLTCLS WLGTVAHTCNPSTVE 
GGGGRIT*GWEFTTSLGNIVRSCLY 


8163 


22064 


A 


8223 


418 


211 


AMPSLLT YLLTF I FCRKKKI FFYWGEFT 
MLS*LVLNSWTQVILPPWPAKVLVLPRL 
ALFYS LLVLFPS F 


8164 


22065 


A 


8224 


227 


33 


ESHSTTQAGVQWHDHGSLQ PQP PGMSL Y | 
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SEQID 
NO: of 
nucleotide 
sequence 

1 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

etfa 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acio% 
E=Glutnraic Acid, {^Phenylalanine, 
0*Glycine, H=Histidine, I=Isoleucine, 
KHLysine, l/=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=l)n known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


- 


• 




■ 






CH*CLRFWSWRPAEKPGWCSSLRFMELE 
SWSGASDAF 


8165 


22066 


A 


8225 


244 


287 


TDS PTSAFRSAG I TS ISHHTLPKS TF * I 
FCRAGGLAMLPRLVLNSWPQVILPSQPP 
KALGSQA 


8166 


22067 


A 


8226 


53 


235 


NNKWQGWAHTCNPTTLGG*GSWIT*GQ 
EFETSPANMAKI PPIiVMFAFCFBETDNK 
SHTK 


8167 


22068 


A 


8227 


275 


25 


GVLMFNFKFCI *QNSPFMLPGWIiNSWI 
PAILPPQRPKVLGLQAWTILYKWYYGKB 
MWHKE I CEGTHDVLSKLLTTRRFKRHL 


8168 

■ 


22069 


A 


8228 


386 


211 


QLKLPPLKGNKLRPP* *KFFFFFFFQRQ 
GLT I LPRLVSS S WHQM I FPPQPPEVLGL 
QA 


8169 


22070 


A 


8229 


I 


117 


VFI L FYF* DGLAMLPRLATNS WAQAI LL 
PSAPKVQGLQA 


8170 


22071 


A 


8230 


356 


26 


WHVHGSQQP * PPGIiKQS FCFSLPSS WDE 
RCAP PHP INYFQ I CRDRVYKRWANMVT S 
PNPIiVLISQSAGITSKiPCPGSBISTIF 
IFSHQLLLVRGVYPDMLVLQLGKTCI 


8171 


22072 


A 

■ 


8231 


3 


188 


FPSLRPA* IFFYIYFFFLKNKVFLCHPG 
WTQLLDPNWTQAAGL'KQSSHLSLLSSLN 
FRGSA 


8172 


22073 


A 


8232 


2 


253 


KLDYINCGTNLAQKCLFCIjFGSLFFQR* 
GLTLLLRLFMNS WPQAI LLP * PLTVLGL 
QATMLGSQVPVFDLGITLCRCRFGKLE 


8173 


22074 


A 


8233 


373 


65 


PE KAGS CLELAFGLQMQHP P WAS TLPAA 
LTDFGLATVPNHVSQFLKSHTHTHTHTH 
THTHTCTRTLLVLFL*EALTTTPLLRSI 
CKHRESACSLSIiTWPLCC 


8174 


22075 


A 


8234 


390 


46 


RVFKPG *NPPLVKAPAPPLFFGGV*GRS 
PKKKNPQGPGGGGP P PGFHPFFLTPFFP 
IFLRREPPPFFFRGGLGPPPPPFKTPFF 
FFFWVKGGKFLPQKKKKKFFQS ISTRSV 
IHK 


8175 


22076 


A 


8235 


387 


216 


QKLGVGLVPPGRVIFFFFFFFCC*GGGIi 
TMLPRLVSNS W PLAI LLPQPPKVLGLQV 


8176 


22077 


A 


8236 


1 

• 


201 


RFFFT I P PQCFPQ FI LS F * RRGLAVLPR 
LVLNPWPQAILPPQPPELXjGLQAHPTTL 
GPVFYSYCVRA 


8177 


22078 


A 


8237 


399 


229 


SVGGGGLTMLPSWSQTPGLK*SSHLSLI 
KCWDYRPEPPHLAHFSLLNLQLYCHTKL 
S 


8178 


22079 


A 


8238 


192 


335 


SHTLPHSSNFS YFW * RQGFAML PRLVLN 
SWARAVCLPQSPKVLGLHA 


8179 


22080 


A 


8239 


442 


254 


GRDRVLLFCPGWSQTLGLKRSSCLGLPK 
HWD YRHE P PHPAEN I KFCVWPTL YFYWP 
ALVQ I NL P * QLQGTECVACMC 


8180 


22081 


A 


8240 


381 


180 


VLKLFFCSHRGPTMLPRLVSNSGLK*SS 
HLGLPNCWD*RCEPWHPASLCLLTSTFW 
PPTLICNIHK 


8181 


22082 


A 


8241 


388 


260 


NAIPFYITKCCNPSTLGG*GG* IT*GQE 
FENCLANMVKPCLY 


8182 


22083 


A 


8242 


1 


145 


GFTILVRLVLNS*PLDPPALASQGAGIT 
GVSHCAQIilYVFQRSLQLLW 


8183 


22084 


A 


8243 


1 


173 


GI/TVSQAGVQWCDHSSL+ SQTJjGLKQSS 
SLSLPSSWDYKCKPQHQADRFFLMEFFQ 
M 
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SEQID ' 
NO: of 
nucleotide 
sequence 1 

■ * 


SEQID 
NO: of 
d eo tide 
sequence 


M 

etb 

od 

- 


SEQID 
NO: in . 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
dingto 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=Giutnmic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, t=Isoleucjne, 
KNLysfne, LFLeucine, M=Methlonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, £=Seriner"- 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8184 


22085 


A 


8244 


202 


327 


TLFI TFI FF * R*GI*AVLPRL VLNSWPQV 
ILLLWPPRVLCLHV 


8185 


22086 


A 


8245 


42 


289 


KE KRS XILFI FFCRDGVLLWLPRLDLNS 
WPKTIFPPWPPK*LAIiQAHTTIPRGGGS 
IILIC IRDVKRERKQ YAKSKNPEGET 


8186 


22087 


A 


8246 

• 


592 


485 

• 


NIVSQAGVQWHNLGSLQPLPLGLKWFSC 
LSLLSS * DYRHVPPCPANFCISVETRFC 
HVGQAGLELLASNNPPAPASQSTGITGV 
S PCTRPRVYVFLLW I LANGPS *RAETLV 
NLATNQEDADS SIAPGLGSGHCF 


8187 


22088 

i 


A 


8247 


307 


12 


MCSPVFS PKQKNPIiFLNFFKKEFLFFFP 
GGRQGGHFGFLAP PLPGLNKFLVLPFPG 
NGDTGKGPPARGNFFFFFFFGI FSRDFV 
SPC*PGWSLIPDPR 


8188 


22089 


A 


8248 


392 


237 


HYGQA3 LAPLTSGDLPASAPQSAEVTGV 
SHRAWPTVLI*SFS*KFNWV 


m fiQ 








l JO 




FFPPGWPKPSLLEPPPPWLKGFPPPTIjIj 
GGGD * KRPPPPPGNFCFFKKNGVSPLWE 
GWF * TS NLGNPLP S P PKRVGI T . 


8i on 








JO I 


I DO 


IET ILDAIKNIHDSWEEVKISTLTRVWK 
N* I PTPMDNFEGFKTSLEEVTADVMGIA 
RELEVEPEHVIBLLHDIS 














LFAI VKR* NHPKCLL I YAR INKMWHI S I 
GWNI I SLI SKGMKFS YMFLTWMNLEVIM 
LSEISHSQKVT*YMVPLA*GHLEKLNCL 
PIKQQTPIPSSSSP 


8192 


22093 


A 


8252 


379 


201 


PGANYACN PSTLGGRS GW I A* AQELETS 
LSNMTKPQPCSTPQCFLVNQLLIiQNLYP 
YFP 


8193 


22094 


A 


8253 


78 


311 


LTLYFHTRKKKGLK* I FYFI * S ILSQKD 
LKVGTGPGAGAHTYNPRTWGGWKGLGAG 
KRI T * AQ E F ETS LDNTVKTCL 


8194 


22095 


A 


8254 


303 


81 


TPCAENFKLRKRFHHI ISIjQRKMNYLIS 
FLYF* +SHSLTILPRLVSNSWPQAILLP 
LPPKVPGLOVSATLPGL 


8195 


22096 


A 


8255 


2 


376 


MGYNLVCHLLRPLEKRSS IRVGVT* FSG 
CPPSPLPLARKGYSLTPCASQVRQCLAL 
LWLTLGGLHPLSCPHCPTSPSVMNPVPQ 
LEMQKS PVFCVSHAGSCRLELFLFGHLG 
TQSSLDSFFKKVI 


8196 


22097 


A 


8256 


70 

1 


356 


LI I AYNLTYDLSWKMFYVYLRKI YTLLH 
WVECS I YTCWV* FVYSMVSWNRIDSPGK 
KKTNIYDQLIFNKGAKSTKLRKNSLFNK 
WGQDKISTYKK 


8197 


22098 


A 

•> • 


8257 


403 


85 


PFCPWFPPVFIPPPLRKSGTPPGPGPEF 
PGPGFSNGGNPRPLKKS PKFS PGWGRGP 
LFPPSRGV*AGNFLGPGRWRGPWAKFVP 
LPSCLGHKARFFFPKKKRRKIK 


8198 


22099 


A 


8258 


249 


2 


YCEGDLGSYRPGVSKCMEFSPRRDIRIM 
P WPGAVCYAYNLS TLGGRDGWI A* A* EF 
GTSLAYIVKPHLYTHTHTHTHTHTMY 


8199 


22100 


A 


8259 


2 


189 


IQCVCVTKSHFCHPGWSAVAQSWLTAAS 
TS*A*VDFSHLSLLNSWD*RRVTPRLVN 
LILFPS 


8200 


22101 


A 


8260 


453 


128 


FFFFFFLI ET VSHDVP * AGLELLSS SHP 
IASAFQSAEITGVSHHAQPSSVSWLIHS 
S 


8201 


22102 


A 


8261 


420 


79 


KNPTPFFFFFFLIETVSHDVP*AGIjEIjL 
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SEQW 
NO: of 
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SEQIB 
NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A la nine 
C=Cysteine, B=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, J=Isoleucine, 
K=Lysine, L=Lcucine, M=IMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Tbreonine, V^Valinej 
W«Tryptophan, Y=»Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


• 






■ 






SSSHPLASAFQSAEITGVSHHAQPSSVS 
WLIHSS 


8202 


22103 


A 


8262 


1 


262 


NSRTERLGSWQSHFCHKEMEAHFVVQSG 
VQWCDLGSLRPIjPPGFK*FSCPILLSGW 
DYSQCRPCRTCSLIFMHSRSPSQPSEKK 
WTC 


8203 


22104 


A 


8263 


321 


100 


SATKNYLLSRQSPIGK*NNGLGAVAHAY 
NSSTSRGLGGQIA+A*EFBTS*GNMMRP, 
HCYRKYKNLARHSTYSPR 


8204 


22105 


A 


8264 


145 


275 


KPRDYSFFF*WKRDLAMLPRIiASNSWTH 
AILPPWPPKVLRLQG 


8205 


22106 

i 


A 


8265 


73 

* 


351 


SLCYRS FARCDRPKKGQHFS KS YLQDAQ 
WARTLLSriAQNNSABHHLKTTDW*GAV 
AHACNPSTPQGRGGRIA*VQBIETSQGN 
TGRPHLYKK 


8206 


22107 


A 


8266 


382 


232 


LVETGFCHVGQAGLKIiLTSGDPPASASQ 
SAGITGVSHCA5fIi*T* YS *DFQ 


8207 


22108 


A 


8267 


397 


130 


IGQAGLKLLTSGDIiPTSASQTAGITGMS 
HRAWPTFSLIAR * NAS FQVGHFS PKYQV 
GTLAKS LKCHQ VRS DTLE F I LCCR 1 1 YE 
TLGKF 


8208 


22109 


A 


8268 


309 


97 


DTWYRTRRKDHCT I TETR IQSVRPLS PR 
♦AITSRVTCTYTSR* PEANEEPQKITFR 
HCDLFLPTLTDQLTL 


8209 


22110 


A 


8269 


2 


224 


DSSGI PGS PTRPPTRPKKQKLPCDPTVS 
LLGICI YDQRQ* SVFQRDI CHSHMFITA 
LFTIVNIWKQT*HPIiMDE 


8210 


22111 


A 


8270 

lib i\f 


374 

■ 


2 


TSWESARKPPPIHSHTLKGFFFFIjFMFF i 
FF*NKI CKDGSLTMLPRWLNSWAEGIL 
lprs PKWGIQELATQRRAS ILLCNPTY 
AOGI FQHWVNVKLLMKQCPFCPAAPTK 
TQNFFPRV 


8211 


22112 


A 


8271 


1 


379 


PTSASQVAGTTGTHHHIVLFFFFGIFEK 
KGVFPFSPGGF*TPKPKPPPGFGPKRGG 
KKGGGP PP PARGMGSGVFYP PPQKPFFV 
LIGGQILFKGRTLDFKKRAPPFPPFSLF 
LPKHTPSFWIKWDP 


8212 


22113 


A 


8272 


80 


401 


AWGMRFCljLtLYS CQYLISD IAHFS KAFF 
FWKREPPPAPQVKGQGPNLS * WNPWLQG 
♦RGFSGLTPKGGGNYGPPPPPPLIFFFL 
RKNGFFPGWPGWSKTPDliRKPPC 


8213 


22114 


A 


8273 


405 


95 


PPKR FWGS S KGFPQREKKLGS PTFPKKK 
KDFIYSYLSKSLKKE* INRPSAAAHTCN 
PSTLGGQGGRIT*GQRFKISLSNIGRPH 
LYEKIKNVKKLKIKKPNKL 


8214 


22115 


A 


8274 


167 


52 


DREQRPGAMAHTCNPSTLGG*GGWITRF 
LANMVKLRLY 


8215 


22116 


A 


8275 


276 


23 


KYVSSRPGTVAHSCKPS PLDGQGEWIA* 
VRKFKTIIXSN^KPHLLFYKKLTFKNVK 
CI FKKYCATESR YVTKEKQKDRKKGTKB 


8216 


22117 


A 


8276 


1 


114 


FTMLARMVWIS*PRDPPASASQSAGTTG 
VSHHARPGLS j 


8217 


22118 


A 


8277 


2 


87 


TMLLRLFIiNS* AQAVI LPWPPAVLGLQG 


8218 


22119 


A 


8278 


357 


347 


S EGLAL YQLEAFTVCDLRERG FCRGRSL 
WPLITWAWRGGVFLLVQFQEVCOUiALG 
SIiSPDPVLLPESGGPTSASQSAGI IGVR 
DRAQPNCVFEIGSYSVTQAGVQ*HNHGS 
L*P*P 
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1 SEQID ' 
NO: of 

nnrlf fkfiflc 

UUIIEUUUV 

sequence 

i 

'i 


SEQ ID 
NO: of 

|il>|fUUv 

sequence 


M 
eth 

• 


SEQID 
NO: in . 
USSN 
09/515,1 
26 

■ 


Predicted 
beginning 

11 UVlwvllUv 

location 
correspond 

Ino tn first 

amino add 
residue of 
peptide 


Predict- 
ed end 

nil rl**- 

otide 
location 

rniTMnnn * 

ding to 
last amino 
acid 

rcdiuuc ui 

peptide 
sequence 


Amino acid sequence (A^AIanlne 
C=Cysteine, D=Aspartic Acid, 

F^=f»liitamic Acid. P=Phfcnvlfllflnin«_ 

G=Glycine, H-Histidine, I=Isoleucioe, 
K=Lysine, Lr=Leucine, M=Methionine, . 

N=Acnnrfloinp P=PrnIinPL 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W»Tryptophan, Y=Tyrosine, 

VslTnLnnum *=Sfnn rnHnn /snfrcclhlA 

A UI1IU1UTTII| ~OlU|J tUUUIIf / ^URUIUre 

nucleotide deletion, V=possible 
nucleotide insertion 


8219 


22120 


A 


8279 


371 


125 


FFPKKNEPPG*PPPKK*RKPGKFILAPS 
PP PPGGGE'PYtajKRGVLKOKPGPPGVP JC 

ILKTPPRLGKNKPISPPPGQKKQNFF 


8220 


22121 


A 


8280 


377 


63 


PPVFPPPWGGQGGGFP *ARGLRTPWPTR 
ENPVFF*NPKN* PGGGGPF* IPPPKPG* 

TiT .VPPT i P DPfSfiVPT SVK N T P PPPP WQWTTP 
rUJAT r litre* KVjvj v <rU JL Ail XIrrJrirtXJC OflAJt 

RVFFPKNLKKKKNPCLIESL 


8221 


22122 


A 


8281 


2 


284 


LPK*VKNFVHAKICT*MFVAALFVITSS 
WKQ PRY YSS I EKWINKLWH I S AME YYS S 

J-iI\K CjJJoorlxjrv.1 IMMJr i\uvjoJjoej\oyoc» 
KDTWCLIPVI 


8222 


22123 , 


A 


8282 


318 


40 


APRIiS LWPMWFI KRLDS S PGSR I TOT 

lNlv_r cLDJrlljL. VL V LiJKr JuJuL.yAKc.yWK VJu\» 

SPQSPSPKFKRFSCLSLPSSWDSRCICrP 
*PANFCGLC 


8223 


22124 


A 


8283 


30 


343 


CYQ KASHATE KS FMR* RVNPCS KLCCCY 
KILPQPPQPSAVTTLISQQ1WCKKKKKKK 

GGR I FF F FFWAKKNNLGGGL 


8224 


22125 

* 


A 


8284 


163 


392 


FWLYNFS VI YS I S DP CWY KEQLCKS VF 
SFLSS* SLLDEPNPNSPANSQAAQLYQE 

JNJ\KJi I nixK VaAl Vaya WWJJo 


8225 


22126 


A 


8285 


406 


63 


KKKLILAKPGVVNLDSLKRAPPFFFFFF 
RHGVPLCCPGWSKTPEPK*SSCLSLWSS 

CRDSQES YDP I IMMMVI I ITEGGMICXJV 
IM 


oZib 


2J.M1 


TV 
t\ 


oZoo 


i 
J 


szz 


TKFPT T.JIT.T.T.PT T*rT?+T.DOT.TCmV\7T7lf QTO 

j,IM XJUuiUjJUuJr A A It? ^ijJryijLWvji VrjJVol Jr 
YECGLDP I S PARVPFS I KFFLGAI TFLL 
FDLEIALLLPLP*ALQTTNLPLIGMSAL 
LLIIILALRLAYE*LLKGLD*AE 


oJJ.J 




2V 


ozo/ 




1 D1 

lol 


lJft!CT.lJP^T , OriTYV"\T?CDT\OT.PCCW<'2VPUV 

PPCLANF* I FCRGGVSLCCLG S VEGLE C 
QARLPVLVRAELGLD 






<ri 


OOQQ 

oZoo 


J 


Z4o 


EKTPFCFVPQGGFNLWA*TSPLNPPPWA 
SKGAKITGGTPGMGLINLANGRPP 






■rx. 




loo 




r .r <; vpppfpr nprwpn jaGGnwnwT ,ri * 
WNPWFRGLKNFPGLTPPRSWNNGGGPPP 
PLIFCFFLKKKRVS PRGRGRANFLDLGT 
PPPGPPKERE 


8230 


22131 


A 


8290 


141 


375 


RTFFFFFKTEGYFVAQAKGQGGDHGSLQ 

IT V^O It VJi-UVVJ J J U Li rLn.rm J-\J ortrrL.IT J J. J f 

F* I F * KNGASLCGPNNSKTGG 


8231 


22132 


A 


8291 


407 


209 


IGGPPGFSPPPVFKNRPPNLIFGPPKKK 
TCNPPPPGGKNWPF* KGPPPFFFPPPPP1? 

FFFFFYYTL 


OLD 2. 






ozyz 


60 


All 


it tr AJtrrtXvlj X HiAwvirvT l\X\rLT f xrlrr r Ki3KVK 

FFFFGPQKKTPAPGQNPGGGGKPWGSPL 
FSPGEKNXPPTHPXGGPVLGAPPPPPGI 
VLI * KKTGLPLFVGRAGTNPNLGGPPAP 
PPKG 


8233 


22134 


A 


8293 


147 


342 


LVKFEYTVLPKNRYRLGWAHAYNSSSL 
GGQGGRI T * A* EFKTS IjGNILRSHLYYK 
KRKEKKLN 


8234 


22135 


A 


8294 


6 


152 


CCGHAP PHPANF YS L * RRSIAMLPRLVS 
NS WTQAI LLP * PLKVLALQV 


8235 


22136 


A 


8295 1 


18 


205 1 


IPTRPPTRPIPGRLNHFTFTATRPGVYY 
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sequence 


Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
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peptide 

seauence 


Amino acid sequence (A-AIanine 
C=Cystein e, D-Aspnrtic A cid, 
EXilutamic Acid, ^Phenylalanine, 
0=Glycine, H»Histidine, Hlsoleucine, 
K=Lvsine* L=Leucine. M = Methionlne. 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, . 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 








• 


■ 




GQCSEI*GANHSFMPIVLELIPLKIFEI 
GPVFTL 


8236 


22137 


A 


8296 


1 


199 


YLYI YLS ILLS I YLSI YLS I YIASYLAT 
*LAI*KEIYYEEYVLVMTEAEIFHILPS 
PSWRPRKVMV 


R217 
OX J / 


*4 J JO 


A 


0997 


i 

i 


lis 

J to 


PTRPPTRIiLiSRPIKIOIALHKFSLCPEY 
LQ*ADYLF* SLWIRPDAWPHACNLS IL 
GG*GGRIA*AQEPEISLGNIVRPCLYCI 
IKNKIKKKKPLQI IAFFLCFKD 


8238 


22139 


A 


8298 


416 


304 


GFLHVGQAGLKLLI SGHLPA* ASQNVG I 
TGMSHHARL 




99 1 Aft 


a 








EMILKAAREKK* I PYNEPLICLTVDFLV 
ETLQARREWYDIFKVLKKKNKTKNPPYT 
KIEYLVKLSFKYEKVTKTLLDP*Q*LRD 
FINTRHANRNTSNRRKRPI IKHK 


8240 


22141 


A 


8300 


382 


266 


RWDFVMLPKLVSNS * AQAI LHVGLPKCW 
DYRREPPHLA 


8241 


22142 


A 


8301 


2 


273 


PLTRGPSSASDALSSGGPYHPSECCFTY 

TTVTfTDTJnRTMnVYB"rMQnrC7DflTU*V 
X X X AAlrltUKAFlX/X HilWoVUoIUrvsl V 

VHTHHTGGRGSQQGLLEGSREWWWNGDP 
QRTSQV 


8242 


22143 


A 


8302 


77 


430 

> 


GMGLSMHRTHFQYL^IHPCVHVHTHTH 
THTHTHTHTHTLCM*HTEIHKSVRYGTE 

THTHTHSVYVTHKVCEETCLSFFYPESG 
TISHAQE I LKRCLENKTHLGVWC FTS LL 
SVANIVFFSYKROVAAARGGLFFSLSOA 
E 


8243 


22144 


A 


8303 


137 


396 


S S VKTMNYLKP VVHMVADRC * LFQCLPS 
KKKKKKKKKKKKKKGGPLKKKKI LTRGG 

+V1 ^iv*V*VXVl V* \4u\l\l\XVxV\JW & JJ i\AuuVX m X (\W \J 

GGIFFFFGAPKKISPGRV* KKKKKKKKK 
KKKKGGPLKKKKNFNPGGGGNFFFFWGP 
KKNIAGARLKKPGGGKNPRSP 


8244 


22145 


A 


8304 


329 


144 


NGPPFFFFFF*DGVSLCCPGWTAVAR£R 
LTFAVFFTLNCS WG IHSELOHWFODYL 

mmw *m urn 4% Mr w +h mm mm M*+S* « Ww • ▼ rfb4Aw4#w^MfT ^ ^^A^ A> M 

KLKTF 


8245 


22146 


A 


8305 


404 


157 


RTYNPSTLRGRSGQIT*GQDFDTSU3DR 
VGPHLRKRKRNYYSLVECFYRIKEKYPE 
SHKKAMKMLFQLKPDFLHI FOHIATN 


8246 

* 


22147 

■ 


A 

• 


8306 


394 

• 


294 


SLCFLTPPRDFNLGALIKNFYSPTPGWV 
SCVLSKGPPFFFFFFFKDGVLLSCPH*S 
ONPFVFALEWSGTPELK*SSHPGLPKAW 
DYTYE P PHQAQVBLLR Y* RS FP VLLLDG 
ICIFHS FFSPRGSLI P 


8247 


22148 

• 


A 


8307 


280 


2 


WDSHVTFFTSVITVMLFS IQYVKQLEN 
I KSWPG I VARACNP SALGGRDG* AQEFD 
SSPGNTAKPCLPQKNKTKQKTKNKVGGT 
LETRSWRL 


8248 


22149 


A 


8308 


136 


330 

W* WT W 


MS YQRKS VNKYKSNNACEEVTTWARHSG 
SCIiRLYNP STLVGSGGKI P *TQE FETSL 
SNIARPYLY 


8249 


22150 


A [ 


8309 


3 


118 


I F WLVQMGFHHVGQAS LEL * PQ VIHPP 
QPPKVLGLPA 


8250 


22151 


A 


8310 


2 


366 


TRVDPRVRVRSTHRNLCLPGS SHASTS A 
S * VAGTTSACHHARLIFSFSFFFLGFSE 
KTGFNFFAQGGFEPLGEREPPPFGPPGF 
WGGGGGGTFKRPRRWKFQKD 1 1 P P L VSN 
RGDQPEPFF 
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SEQ ID * 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQ ID 
NO: in 
USSN . 
09/515,1 
26 

• 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

r r 

sequence 


Amino acid sequence (A=Alanine 
OCy stein e, D=Aspartic Acid, 
E=GIutaraic Acid, F=Phenyialanine, 

— * 

G=Glycine, H=Histidine, I^lsoleucine, 
K=l»ysine, L=Leucine, M=Methronine, 
N=Asparagine, P=Proline, 
Q=Glufcirauie> R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X^Un known, *«Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


8251 


22152 


A 


8311 


253 , 


446 


SKTFQARGGVTEMGFRHVGQTGLEILTS 
GDPPASACQAAGITGVSHHARLRI * YFI 
RIFIIMTC 


8252 


22153 

• 


A 


8312 


430 


88 


GVGRIQISWSPWLPKPKNSPGSPPQ*VG 
APGGPPPPGPYFFFFFQKKGFPLFSGRV 
*GSGSKNFFVFPPPKNWGTSPPPPPEGR 
GFFFFFFFLSRDGASPCCPPRPPKVLGL 

QV ] 


8253 


22154 


A 


8313 


400 


323 


HRLMPPHPIDFFLFFVETGSCYVAQGAC 
ELLGSGNL PGSAS *NAR ITDVNHHAQ P * 
KSSSGPGTQQC 


8254 


22155 

i 


A 


8314 


4 


421 


GGRIARTQEAKVAVSQQCTTALQPGRHS 
KTPSQKTKKKKSPGGPPGPPPGRFFFFF 
KT*KKGPFLKKPPQRGGDPRGKKKRGGG 
IiWKKPGGKLGGPTKKKTRGGGANPPPLW 
GKGGGPKGAPRAFPFTKKKPPPSGRNQ 


8255 


22156 


A 


8315 


213 


32 


RCPGTVGHGCHPSPLGGQRGRIP*GQBF 
ETSLANMVKPYFFLNS KKKTKNHPTKKN 
SQAW 


8256 


22157 ^ 


A 


8316 


422 


153 


AATLYTHKTPHCIYFFLDRNLLCCPGWS 
QTPGLK*AFCLSLPSSWDYRHKPPRPVT 
SPIFRIKSYDVIPPPYLE*NPMMLYLPS 
YCHHI 


8257 


22158 


A 


8317 


1. 


284 


ERQDWESRLEAMECAFHLEKSVNQSIiLE 
LHQIiAMEKGDPQLCDFLESHFLNQQVKA 
I KKLGD YIiSNLCKT * APEAGLAE YLFDK 
LTLGGSEEDT 


8258 


22159 


A 


8318 


187 


37 


TRDFFWPGWAHA*NCSTLGGQGRRIA* 
AQKFESNLGNRVSHCLYKKFKN 


8259 


22160 


A 


8319 


2 


250 


KYEQSLQEVWDCVKRPNLRIIGVPEEEE 
KSKSLENI FGGII EENFPGLARDIoDIQI 
REAQRTPEKSIAKRSSSRHIVIRLSK*E 
VWDCVKRPNLRI IGVPEEEEKSKSLENI 
FGG I I EENFPGLARDLD IQ I REAQRTPE 
KSIAKRSSSRHIVIRLSK 


8260 


22161 


A 


8320 


371 


196 


NNFS ILFFFFETRSH* VAQAGLVLLGSS 
NPPCSASRVAGVTGVHQRYQQDDFKNFR 
HN 


8261 


22162 


A 


8321 


3 


235 


FISVSFKYSKHIPNNMHLKFLVLFIOTF 
FRDG F WLRSG * S WAILRCGHS S LQPLT 
GLKQS SHLS PQGS WDYRRAPH 


8262 


22163 

- 


A 


8322 


1 


395 


LEVPLLNTSLLLASGVS IT*AHHRLIES 
NRNQI IQALLITILLGLYFTLLQASEYF 
ESPFTISDGIYGSTFFVATGFHGIiHGII 
GSTFLTICFIRQLIFHFTSKHHFGFEAA 
A*YWHFVDED*LFLYVSIY 


8263 


22164 


A' 


8323 


322 


83 


MHQS FLVAKVKKGKHGRWFYS CLM I KSA 
PVLSVTHNTXiLCWPGAVVHACNPSTLGG 
QGRWMA* AQEFETS LDNVGKPHP 


8264 


22165 


A 


8324 


12 


249 


GFFLPPPPGRGEKRGPPPPPLFFWFFFK 
KGEF P LGGGNFLTLE PPRLAP P KGGK*G 
GDPPPPGFFFFLKKKPKPLLKG 


8265 


22166 


A 


8325 


139 

■ 


335 


ENILWAATWLDCHPSTLGG*GGRIT*G 
QEFKTNLANIERPHLFFFFFLKRNFFFA 
PRVEGRGPV 


8266 


22167 


A 


8326 


1 


246 , 

• • 


FRRGIiAVAPRVLLCYPGWRAMVQS * LTA 
ASNS WAQAIHLSL P KCWNYKHT P P YPTC 
♦NSACVTDESSFPLI.VSAMEVCGQPHS 
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SEQ ID 
NO: of 
nucleotide 
sequence 

■ 

> 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 

09/5154 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 

* * 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystei a e, INAspart ic Acid, 
E-Glutamic Acid, F=PhcnyIalanine. 
(XJIycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methlonine, 
N-Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptopban, Y-Tyrosine, 
X= Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=posslble 
nucleotide insertion 


8267 


22168 


A 


8327 


422 


107 


IYYVRQQRKKRGLFFPFFRFPLPPGGGQ 
WGHFWFLQPPPRGV*RFFFLNLPRNRPW 
DFKGLPPPPAYFAFFLKKGVFPPGQGDF 
* ILAPKGPPTSTSQKGWGFKD 


8268 


22169 


A 


8328 


1 


161 


FRG I RWVCHVL FFYIiS FFLRQGLAVL PR 
LILNSWAQATLLPQLPE * LGLQTSS 


8269 


22170 


A 


8329 


422 


103 


PPYSPPFGGPGGGVPKVRESGPPGPPQG 
NP FFFKKPKNNPGGGGGPL I P PSSGG * A 
KGMGLSPRPRLPLS*NWPPPPPPGGQNK 
LPFPKKKKKERKKEKFNFEHRG 


8270 


22171 

• 


A 


8330 

! 


46 


422 


STPFPIiASQSAGITGVSHRARRSSSFQC 
TQIPLFPETLPNLGFGSRHGRGFSSQVL 
PAQSSSFDLDLISVLGY*MLSHISI*GPN 1 
RALGF I PPHLS KWENGSTLRS IVLSQIE 
GLSSTLSFPHPVS 


8271 


22172 


A 


8331 


429 

• 


112 


FSHLGSSPRQGFTMLRRLWNS*PCDPP 
TSASQSAGITGVSHHAQPHAVFYIRLPD 
SQVLCHQMP YWPTAHYPHRW I FSLVLQK 
NLSVPPPLWPIQSQPKAS ILLN 


8272 


22173 


A 


8332 


291 


19 

* 


KTPFPPKI*KFPPNFLKFTPGCGGPPPP 
FLKKFVS PGNPP PI&CIFFFFFPPP PPFF 
FFFFFLRRGSLIS iAQDGIQWHYHGSLQ 
PQLPGLR 


8273 


22174 


A 


8333 


307 


123 


KYFRGGGGGGR* FPPLGGVKQKKRGNPG 
GGGSKKPKLGPSPPTRGKKKNFFSPKKK 
KKKKK 


8274 


22175 


A 


8334 


3 


169 


HASADMCHQAQLLLLLLVETGLCHDGRA 
GFGS * S QE IHP PRPPKVLGLQHFGL I Y 


8275 


22176 


A 


8335 


1 


208 


LCVCVCVCVCVCVCVLFC * WRKGLALS P 
RGECSGTNMAHCSLNFSGSSNPSVSPSH 
VABTTGVRPILG 


8276 


22177 

AAA / / 


A" 


8336 


1 


352 


ILFYFI FYYFS FCFEMGS RS VSQAGVQC 
PDLSSLQPLPPRFKRFFCLTFPSSRDYR 
CVPPHLAKFCI I RRR*AAI GQAGIELLT 
LSDPPASAS*SAROTGVSHCSGPSFINF 
LNWQI 


8277 

> > 


22178 


A 


8337 


420 

- * 


272 


KAPCQ FSGEKTVFPKHCWN S WVP FRKKQ 
QQKKNLCPCLVLYTS INSQ*TMGLNI KA 
RTVNIS * KYKED P * TGKK* INKS CPWTG 
KKKKFLARTQKTIAIWDTLINWTSPKLK 
MSTKQRHHLKMNR* PKWLLFFVS 


8278 


22179 


A 


8338 


427 


202 


LI I YFCHQSVQKKGVLPLIiQEREGWAEA 
SLKISNISLSifTHTHTHACAHTHTHTHT 
HTSS*NGIKTWTTIKSFA 


8279 


22180 


A 


8339 


2 


164 


TPGHGISLCRPGWSAVAPSRLTATSASW 
VQTI LZiPWPPE * LGLQESRLLRDHLC 


8280 

* 


22181 


A 


8340 


422 


1 


PPGPPGGKTKRAPPPSFFKGPNPLLKGK 
GPGAKPPPKNGVFFFPPKKVKGPQKNFG 
GTQI S QPNKRGA* KFTPFFT** KPPQKKK 
GPPPGBKIFKGPKF* KFPSPGFFFFFLK 
QGLTLSPRLERSGANTAHCSLKLLSSSD 


8281 


22182 


A 


8341 


1 


456 


RTRGAVSQGHAIALQPGQQSETPSQKKK 
KKNPAPGGKGKIGGTAKKKGRNFXiDANI 
LGPNSRPGGAFFGG * TAPPGKKRGS PGP 
IFGGGKGGPKKKILSF*KAGGGAKFFPS 
P VGTPFFRGF* KGKILLKKKKFS KPGGG 
NPENLS PTNFFF 


8282 


22183 


A 


8342 


407 2 RQFILGAPGVFPPPGF*KPPPKriFWGP 
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SEQID ' 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
dine to-*— ■ 
last amino 
acid 

residue of 

peptide 

sequence 


1 Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Add, 
E=Glutanjic Acid. ^Phenylalanine. 
OGIycine, H=Histidine, I»Isoleudne, 
K=Lysine> L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
-Q=GIutamine, R=Areinine, S=Serine+ 
T=Threonine, V=Valine, 
W=Tryptophan, Y=^Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 




* 




• 


- 




KKKNFFPPPRGEKFFSFKGAPPLFFFFF 
FFFFFFSPF^FGALFFFFFTILFKTKFG 
VFSCKGI PPKFFKRSRPPNGAGRLPLLG 
VGPLPGVPQGGEAERKISQGSFQ 


8283 


22184 


A 

■ 


8343 


441 


1 


PPPYILGGPRVFSPPPFFKTPPQIFFLG 
PQKKKNSPPPPGKKIFFF*RAPPPFFFF 
FFFFFFFPPFFFGALFFFFFTILFKTKF 
VVISWKGIQPKLFKRSRPPNGGGRLPLL 
GGGPPPGVPRGGEAERKFSRGSFQRTQG 
SAPEFRA 




79 IRS 

i 


A 


OJ*H 


WO 




FGVAGTONPGRNPRVPS FRGGPPPGVPO 
GRNPPPPWFFPLNPPKHFLTP*VFPFLE 
PP IEWVGPGPP * PSYGFSSPVARK 


R7»5 


79 IRK 


A 




6S7 




LF FW FFLROS FTLVAQAGVO * CDLGSLO 
PPPPGFK*FSRLNI»PSSLDDRCPAPCPA 
NFVLLVETGFHHVGQAGLELLTSGDPPA 
SAS OSAGITGVS I SAOPAO 


8286 


22187 


A 


8346 


3 


212 


DCSRGP PPHLANFGLLEKTGSNHVDQAG 
LKPLS LRDP PAS AS * SPG I TGGSHRAQF 
* F I FKRGMAPKGF 


8287 


22188 

< 


A , 


8347 


9 


255 


IFGPTHPSDFFLYBIDLPFKN*K3FPPP 
PRGGGKNPPFPPPPFFFS* PPWKKKITF 
QTPLTGVFGAPKNFNFWGGFKKIGF 




99 IRQ 


A 


R148 


70 


119 


NAAGFE VGI S KA* AKETLO I ITREGHSN 
KPRYAVTFESHEMGTVLHLVEQEHTVGY 
ITASCSALKC* IESGGFLNDYI 


R7RQ 




A 

* * 




1 
1 




FFFFLKRDRVSPLSSRVKYKWHDYSSLO 
LQTPGLQ*SSCLRFIiSSQDYRHIPPCPT 
NF * KCFVEMGS CCVTQGWSQTPGAQAVL 
PP WAS QRHWGLQGVRPLCS LLLSNCPHS 
RLVWSWLFSGC 


8290 


22191 


A 


8350 


434 


324 


LP I LS P VWWHMP WPGTREAEA* EQ * S L . 
EPGRRRLQ 


8291, 


22192 


A . 


8351 


3 


253 


YMCSKFFFLECSGAISAHCHLELPGSSD 
PPPPT*NF*VIGTTGTCHHEWLIiFELFG 
KMKPHY I AQVGFKLLERNFLKTKKVFP 


8292 

• 


22193 

• 

- 


A 


8352 


3 

• 


1431 


SPRGGXPSLKQVYSSLTtrNSR^SQHL^* 
EEALQRAREEBEKRKEITSHFQSTLTDI 
QGQI EQQS ERNMKLCQENT ELAEKLKS X 
IDQYELREEHLDKI FKHRELQQKLVDAK 
LEQAQEMMKEAEERHKREKEYLLNQAAE 
WKLQAKVLKEQETVXxQAQLTLYSGRFBE 
FQSTLTKSNEVFATFKQEMDKTTKKMKK 
riEKDTATWKARFENCNKAljLDM TEEKAT» 

RAKBYECFVMKIGRLENLCRALQEERNE 
LHKKIRDAEISEKDDQSQHNSDEEPESN 
VSVDQE IDAEEVNSVQTAVKNLATAFMI 
IHHPESTPHOSKETOPEIGSSOESADAA 
LKEPEQPPLIPSRDSESPLPPLTPQAEA 
EGGSDAEPPSKASNSPAGLGAETQCEGL 
PVGAQADQAS WKP EAEASGQAPQAPTEA 
SLQKMEADVPAPACAAEEHVAAMVPACE 
PSRQPPRAAAEELPVGASAGPQPRNVA 


8293 


22194 . 


A 


8353 


376 


65 


IHCGDHVQEVPKLKVCAIjHMSSWACSQI 
PKAGDKILTFDQIiTIiDTLKGCGTILIiSG 
PHKGQEVYWHFSKALGTQHSHTKPCVHS 
RDQKFKHIRG*WASQGYKN 


8294 


22195 


A ! 8354 459 


85 


' LEEVESLERVLEKKMIESEELKSKQTRF 
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NO: of 

m ■■*»l»/k4lfltt 

uucieouuc 
sequence 

■ 

• 


SEQ ID 
NO: of 
pepuae 
sequence 

■ 


M 

e r th 

oa 


SEQH) 
NO: in 

09/515,1 
26 


Predicted 
beginning 

■mcicuuue 

location 
correspond 
ing to first 
aniino aciu 
residue of 
peptide 
sequence 

< 


Predict- 
ed end 

DUCJC* 

otide 
location 
correspon 
uing id 
last amino 
acid 

residue oi 
peptide 

Sequence 


Amino acid sequence (A=Alanine 
C^Cysteine, D^Aspartic Acid, 
c/^vviuia rule /vciOf r m ncnyiauinjne, 
G=Glyrine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
v^— vviuiamine, k— Arginine, a— oerine, 
^Threonine, V=Valine; 
W«Tryptophan, Y-Tyrosine, 
A*-unKnown, — oiop couon, r^possioie 
nucleotide deletion, V=possible 

n 1 1 |aa^i/}a • ■ ■ ri ji ■» _ri n 

nucieoiiue insertion 


• 


• 


■ 


• 


- 




LEEIKNQDKLNKSLKEEAMLQKQSCEEL 
KSDLNTKWELLKQ KT I ELTRACQKQ YEL 
E*ELAFYKIDAKFEPLNYYPSEYAEIDK 


8295 


22196 


A 


8355 


231 


158 


PFFI FYL YQSNDIARGLERGLEPEKVIG 
ATDSCGDLMFIjMKW*ESYAPMTFSGSRP 
LSKPRAISLLWYK 


8296 


22197 


A 


8356 


2 


164 


IHKOVJbiNQMAGEQGWDJ^VovxWvGAVA 
HACNPNTLGGQGRWIT*GQEFETNLP 


8297 


22198 


A 


8357 


23 


291 


CTHTRAQAHAHTHHIjFYY FLY I FNFFBE 
TGSH* IVQACT >KLPASSDP PAS FSQSAG 
I TGMSHHAWPTSC ILKVKMALCS INCS A 
ACFLY 


8298 


22199 


A 


8358 


1 


307 


FFFFEIESCSFAQDGVQWCDLGSLQPPP 
PGFKQFSCLSLLS SWDYRHVPPRIjANFC 
I FSRDGFHHVGQAGLQLLTSNDP PT*AS 
QSAGITGVSPRAQPCPAF 


8299 


22200 


A 


8359 


75 


266 


FVKTRNLFIjTVLEAGKSNIKARAG*VSG 
EEHGLCLQDGALLPLEGRNAVSSHPRGQ 
NMMRAKC 


8300 


22201 ( 


A 


8360 


1 


295 


NTWVFCFVIiLFRDRVS IjCCS VRS TVAS S 
♦LIVNMNIiSFPSS * AYRHVSSKLDFFFF 
FKIQRIIiILPRLISNFWPQEILLWLPQG 
LGLQPRPQVLVGNS 


OTA « 

8301 


2^202 


A 


8361 


376 


^r\*> 

293 


MGFHKVS QAGLKLLTS GDLt PAS AS QSAR 
I TGAGHHAQPDF*HSGDSWQaP PE PRDQ 
G 


8302 


22203 


A 


8362. 


1 1 c 
375 


142 


Qi^-CCCT p TTiTV fill— if T/~1 PVO IV WM' U T C*Ti rof £S 

PQEEFLRFATDVGEYRAVTELERPVSES 
WNTQ KDLL * QKRGRVS TYCRHTYGVGES 
FSVQRRGEHVGGRGLGPCELEI 


8303 


22204 


A 


8363 


36 


363 


QLSFNPLIGKKVKLRTVFIGKQFLFLLG 

V KKACjAH b (JAyA 1 VKWriN Lib bHAKKP 

SCLSLPSCDHRHLLPHPANFCIFWTO*I 
PPCRSGWVQTPELKQFTHLGLPKR 


8304 


22205 


A 


8364 


455 


235 


GAQARAP PRGI 1 Ft FFW * RKGF CMLPRP 
VSKFWG*WPLPWPPKRVGLDSWAPPPR 
DFIFFSFFVKKRVCILW 


8305 

i 


22206 


A 


8365 


432 


175 


S TDQVS VAMGPS * PGAG YNULiVCRWLRP 

SLTPCMRVRRCTVMLRIiMLRGIiHPLSD 
KP 


8306 


22207 


A 


8366 


383 


83 


RRIiIVFICYFSKK*GIiPLLPRLVSNFWP 
YNPSTLGGQGTRIT ! 


8307 


22208 


A 


8367 


27 


304 


FFGF ILTLKKSSN* CTLRFCFLS IGKVR 
XnTXiGNI LSRRGYUcRCSIjPHCIjWgS IC 
LPVQSN*ALHKKVQWHIPFDQAIPSRGI 
LPTDTGHK 


OJUO 




a 


OjOO 




AU*f 


OMT P T AV<3 FFPFPL.1TR V«5 PPPPO* P anSCS 

QSLFIGALTFQV*GILIPYPPGNWELRG 
PPPNLTNFF*FLNLRFLLGPINLHYVHD 
KTSPLLYLFFFFF 


8309 


22210 


A 


8369 


400 


201 


KERPLPFRLIGSPFQGTWPRFFSKGEVK 
NSNFSKY* PPPFINPAPRVCFEGIEVIT 
KFFQVRPFFP 


8310 


22211 


a 


8370 


21 


408 


L I P LST YVC VRVCE CVCVCVCVNI +GYA 
CNTLLIACNILQDVPFTSPTKDKIFSGD 
FLTLVGKK 
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SEQU) * 
NO: of 
nucleotide 
sequence t 


SEQU) 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon . 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^ Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleudne, 
KHLysine, L=Leucine, M=Metbionine, 
N^Asparagine, PHProline, 
Q*=G!utamine, R=Arginine, S=Scrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, possible 
nucleotide deletion, ^possible 
nucleotide insertion 


8311 


22212 


A 


8371 


1 


94 


NTLGHEN * LNSG FGGCS E PRS CHCTPTW 
ATE 


8312 

• 


22213 . 

* 


A 


8372 


546 


972 


PIHSLYGR* PS FCLGLALFKTLQVMHFL 
KDSVRNLKSSNLKVGLNPRSPQIQATIS 
. PPLQPKTKTYEEDP rCSKPKKEEKNMEVN 
MKEWILRYAEQQNEEBKNBNSKSLEBEE 
KTOPNERYliHLAAKLLDAKEQAATFKLE 
KK 


8313 


22214 


A 


8373 


502 


178 


QHFWRLRQ KDS FALGG* KQPGQKGENP F ; 
FQKKAKNILG* WDQ PWPTTQK6GGGES 
L*FGRTKSPPCTSGWGKKQNPVLKKKKN 
PQKTNCIPLCIYHIPLINRKTNTV 


8314 


22215 


A 


8374 


362 


112 


YRMSYNPQVRAKRVSQRTNTPPNFFF*D 
RVLLCRPG* STWQSQLTVAS I S WAKAI 
LLPP QPRQAKYFLRIiVKCS SEKGER I E 


8315 


22216 


A 


8375 


411 


21 


VLGIYPTLVSPSPGLKMTRVKLPSGSKK 
VI S S SNR P VFGWAAGG * FDKP I LKAGR 
AS S KYKAKRN CW P RVRGVAMNPVEHP FG 
GG ILQR IGKPS AI R I YAPAGRKVGL IAV 
RRSGRLRGANTVQEKEN 


8316 


22217 


A 


8376 


450 


108 


AEGGRGPRK* *LRSCRPVDGSAGQSPGP 
LRTESLETPGCSPQGWQELEPPPRRPQ 
WWPGPPGHRDRRQHRRPGARSRQQGPES 
SH*QRSGESRPPAVPRPPCPLPWKRGVL 
E 


8317 


22218 


A 


8377 




306 


EQQPPPVAVHQSGPPGAPPEAS PSHPGS 
SBKPQPVPPPRAPLVPQPKP*LSPPHHP 
AAGLIAVHALAGGPGTVQRGLPGRAARA 
VSVRVPAARC 


8318 


22219 


A 


8378 


376 


149 


PAHCSLLIPGFRGSSCLSLPCNWYPGRA 
PPRVGNFFFFF * EMES CYVAMLPSLVLT 
SWALAILPVLPPKLPGLQA 


8319 


22220 


A 


8379 


98 


374 


S VPPP VGT YEGDAAPTGGGGLPALGR * G 
AAGGHVQQGHPGPHRGHAAPKPSAQGGP 
GHSRECLEPSGGAAGGEREPPRAGGGQS 
WMRAPSMT | 


8320 

• 


22221 

■ 


A 

• 

V 


8380 


2 


807 


RPQCCVRAPRAGAVSGVPKLWSVP*SGR 
SAPHPTRS *VRSWSRWLRTQSWQTSRTR 
RRSESKVRVSYQAWTASPPPAASASARP 
A*RTSSRSYSSSSTCCSWLWPSSWSTGP 
SQTFVRNSS TLSCLCLTRKWIAMMPQVL 
PCTPVRPSCSAVSTITRSFLL*QALASR 
VT*IAPPRGSTTRTPSPIRL*NLP*LSR 
GPGK*KSGSWSSSSSA*TRWRTSAPIiI 
T3SSLLSRS3CKAQTG*ASCRPVRVPIP 
AGSSLGASAPGSRCHW 


8321 

■ 


22222 


A 


8381 


104 

t 


327 


CIAFPEMliGIKYEVGKCPSGLKSLKIIiK 
NTHTHTHTHTHTHTHTHTHTHALYLIRE 
GRKK*CVCMRKTDSTCVS 


8322 


22223 


A 


8382 


400 


270 


KPKLBVRASGQVQWLSPIIPALEEAKAG 
G*IiEPRSLRHEMVCN 


8323 


22224 


A 


8383 


178 


588 


ADIjLSGAPGPPGGSQDNANS SEHSRACA 
PAAPGP* WLQPWLCP * WTQSPIiEARGS * 
NPGVSHSSRIPGIPGAAGTSRPSGALPP 
Q PAGHGG S S G ELHVARG CACGRPL WSHR 
PGCX5QLLRV0AADSHRCASRGAAPM 


8324 


22225 


A 


8384 


423 


78 


QGQELAELTGVTVES PEEPQRLLVS*GG 
TVPCLACS LHRTL* TVGHGSAS S FIiAQG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQD) 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KpLysine, L^Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=GIutamine, R=Arglnine, S=Serine, « 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *»Stop codon, A=possible 
nucleotide deletion, \=pos5ible 
nucleotide insertion 










» 




DMEP LGDRGLQG TEQLH * GGDRAS LQD I 
CFLLFVLSLLCPRSLDLHRHRVGVTGFT 
ED 


8325 


22226 


A 


8385 


434 


176 


FADDSRFDFLLLV*CAMLMLIREQLLEG 
HFTVNMRLMQDYPITDVCQILQKAKELQ 
DSK* PGGKRPMFGREASRPCAVAPGTHR 
NL 


8326 

* 


22227 


A 


8386 


2 


305 


RLWEDPPGRNTPPGAR* PGPCGEGGHI I 
RBEQACGWPRGHSGPWRHPQAGPLQLGQ 
WRGKGAGQGARRS PQGSLESRPPTLSAA 
PPAIHICSVGGVRIiHCY 


8327 


22228 


A 


8387 


331 

* 


91 


EMRSHSVTQAGVQWRDYGSLQTRPPGLS 
FNVNWDWVGL * GDPS S PPMLLPS S * ARP 
GGGVIGPGGEGHVS KEAAGRLQGPP 


8328 


22229 


A 


8388 


2 


250 


IHVRLWSQLFRRLRWEDLLSTGGGSCSE 
P * S CHCTVTWAFFLES CLRCAHWYVTL 
LTALLG SLP CANPRQTAVS LPQAQLV 


8329 


22210 


A 


8389 

OJD7 


388 

J (JO 


176 


YWDSQ * CMPS FAVP VESACI CAIGHNFS 
KSWSVXAVC7IX3SSHKYVFTHDGRCHR 
EAFDVYRDI CDDDA 


8330 


22231 


A 


8390 


445 


330 


PPRFTPFSCLSLRSSWDYR* PPPRPDNF 
LSLYSVLHET " 


8331 


22232 


A 


8391 


559 


142 


LRTLPVTLGKS LALSVPS FP YVGGNW I T 
NRE PSAKGRGSGPNRGPGHAASCPARTH 
SLRHS PAGGVMLTQPRP I LIIiPETQHP S 
ES AVARTDI S KARRWLHSRS PWPH* DET 
SAGAERPGIRCYIGHPAT 


8332 


22233 


A 


8392 


104 


400 


YYEIFKKEMYSYYHILHFFFFFSFFFFE 
TKPNSVNGPEGKGPILG*LKPWLLG*KH 
FCLTLPSGGNYRHVLPPPVNGFFFFFFF 
KKKKVFPFVAPRGV 


8333 


22234 


A 


8393 


408 


169 


DQVWI KDWNIGS LRPR* KGPQTI ILTTP 
TAVKI EGIPAWI QHSQVKPAAPETWEVR 
PS LDNPCKVTLKKMTS PAPVTLRS 


8334 


22235. 


A 


8394 


1 


269 


YCCPLFSSKALTQENSPYSSFRLVNPPG 
LSLHPEGEGGKW INERGRELGPS AGPLL 
LFLHFAEAGRRQPPDWADSEADLQQVRH 
KLGQP 


8335 


22236 


A 


8395 


2 


204 


KDCKVWKEVERVLREFHQAGKPIGCDAS 
TSLPAPAPWWSWNAALCCPGLSCVWCWQ 
GESRTCVGEHQ 


8336 

W v v W 


22237 


A 


8396 


3 


285 


KVTVKYDRKELRKRLNLBE W I LEQLTRL 
YD CQEEE I PELE I DVDELLDMESDDARA 
ARVKELLVDCYKPTEAFI SGLLDKI RGM 
QKLSTPQKK 


8337 


22238 


A 


8397 


1 


192 


ETGFHHAGQDGLELLTSGDCPS LAS QSA 
RITGVSHCTRAQLLTEQEYFKNQNHLPS 
APFYKRAF 


8338 


22239 


A 


8398 


8 


241 


LNPFLYAFLDPRFRQACTSMLCCGQSRS 
SGTSHSIRGEKSASYSSGHSQGPGPNMG 
KGGBQMLEKSI PYSQETLWD 


8339 


22240 


A 


8399 


2 


240 


S CLNP FLYAFLE PRFRQACTSMLCCGQS 
RSAGTLHSSRGEKSASYSSGHSQGPGPN 
MGKGGEQMHEKS I PYSQETLWD 


8340 


22241 


A 


8400 


10 


272 


CNWQKAYLTWRCAQDCEDYFAERLYRS 
MKGAGTDEETLIRI IVTRAEVDLLGIKP 
KFHQKYQKSLSDMVRSDTCGDFRKLLVA 
LLH 
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SEQID \ 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

■- 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon . 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartfc Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K-Lysine, JLr=Leucine, M-Methfontae, 
N=Asparagine, PHProIine, 
Q=Glutam_ine, R=Arginine, S^^Serine, 
•^Threonine, V=Valine, 
W=Tryptoph an, Y=Tyroslne> 
X«=Un known, *=Stop codon, /=po$sible 
nucleotide deletion, V=possible 
nucleotide insertion 


8341 


22242 


A 


8401 


47 


248 


TRGIFFFFLNTRSHSVAQAKGLGHDHSS 
LKPQPPRLKQSFNLSLLSTWNYGGLLPH 
SALWKAEAAKS 


8342 


22243 


A 


8403 

» 


497 


303 


TVEAS VS YTjL YVAMVMQL P WGKS P ASGA 
ESHRQGCGGSWPGGGMEEGRGAERGGGC 
LLPQALLY 


8343 


22244 


A 


8404 


3 


235 


LTLLMCRMMAAAHMGGPPMMPMMGPNT 
PGMMPVGPAPGMRPPLGGHMPMIPGYPM 
MRSPARLMMVPSQPRMTRPDR 


8344 


22245 


A 


8405 


3 


246 


LVFSVDVGLLAHTRQRGLKRSSGMSSLL 
GKXGAKKQKMSTLEKSKLDW6SFKEEBG 
I GEBLAI HNRGKEG WI KGAQPLGQ 


8345 

* 


22246 

• 


A 

■ 


8406 

'■ 


1739 


804 

■ 


WEPDVSGSAAWLASAAFGMATFSGPAGP 
ILSLNPQEDVEFQKEVAQVRKRITQRKK 
QEQLTPGWYVRHLPNLLDETQIFSyFS 
QFGTVTRFRLSRSKRTGNSKGYAFVEFE 
SEDVAKIVAETMNNYLFGERIiLECHFMP 
PEKVHKELFKDWNIPFKQPSYPSVKRYN 
RNRTLTQKLRMEERFKKKERIiLRKKLAK 
KGIDYDFPSLILQKTESISKTNRQTSTK 
GQVLRRKKKKVSGTIiDTPBKTVDSQGPT 
PVCTPTFLERRKSQVAELNDDDKDDEIV 
FKQPISCVKEEIQETQTPTHSRKKRRRS 
SNQ . 


8346 


22247 


A 


8407 


2 


241 


CLR FPS S PAMGLLRSGTKLI FRRRPKQK 
EAGLSQSHDDLSNATATPSVRMKAQSFS 
RRLIKRFYFKSKPKANGNPSPQli 


8347 


22248 


A 


8408 


285 


186 


KQGLTSLPRLVSNSWAQAFLLLWPPKVL 
RLQA 


8348 


22249 


A . 


8409 


1 


246 


RHENREE LQV I ADLC I KYDTLC I SDEEF 
MNGVEY I WKGPCFCLTTFTLYLKGDIFP 
RPPSGCLNTjWIVLNPMYPVIHLrTSC 


8349 


22250 


A 


8410 


1 


102 


AGS YTLH 1 1 KGDDGTRGVTGRFTFTLHR 
IQTSEE 


8350 


22251 


A 


8411 


877 

■ 


647 


FFFETESHSVTQAGVKWGDLGSLQSLAP 
GPKKFSCLNLPSSRDYRSPPPPLANFCT 
FSKNGVSPCWPGRSSTPDRR 


8351 


22252 


A 


8412 


1 


130 


RMLNDKTLRTDI GGNF PKNGWAAXATHS . 
FE FAQFDNFLEEATR 


8352 


22253 


A 


8413 


97 

■ 


401 

• 


CNPENGLTASALGRRMCLATCKAPRTLP 
DSGDTASCRFPAVPRP PHSRRSSGSGHL 
PGRPRCPALPGLEWSNPPGPPTSGYLFP 
TFSTPAAHSHQKTLLG I 


8353 


22254 


A 


8414 


348 


516 


VSAYGF I TEGHERFSDHYYDTS WKRLI F 
YINHDFKLEREVWKRLHDEGI IRLYQRL 


8354 


22255 


A 

* 


8415 


714 


410 

• 


LGVFRSAliHGSLWLLLRSFPQKSPNPLA 
LLTjFIjQCNTAYQCLLIADQHCRTRKYFL 
CLASGI PCVSHVWVHDSCHANQLQNYRN 
YLLPAGYSLEEQRILDW 


8355 


22256 


A 


8416 


2 


102 


GHDPQDRLLAQD S EHNHS DRNW QRNRPW 
PKDSY 


8356 


22257 


A 


8417 


1 


309 


F YFENALS KSNKP IHT 1 1 LN PHVHLVGD 
DAACIAYIRLTQYMDGSGMPKTMQSEET 
RVWHRRDGKWQNVHFHRSGSPTVPIKPP 
CIPNGKENFSGGTSLWQNI 


8357 


22258 


A 


8418 


3 


242 


ARALTNAASHVDDMPNALSALNDLHAHK 
LRLDPVNLKLLSHCLLVTLVDHLPDEFT 
PVEHACLDNVLAS VS TVLTFK YR 
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SEQID 
NO: of 

mii aIiia^iiia 

nucieoiiue 
sequence 

• 


SEQID 
NO: of 

pep Hue 

sequence 


M 
#h 

OO 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 
n ucicouue 
location 
correspond 
ing to first 
amino acid- 
resioue oi 
peptide 
sequence 


Predict- 
ed end 
nucie- 
otide 
location 
correspon 
ding to 
last amino 
acid 

restaue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
jo^vjri u l<j in it, /\ciu, i 1 ^* ncnj'iaianinc, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leu cine, M=Meth ion ine, 
N=Asparagtne, P=Proline, 
Q=Glutnmine, R=Arginine, S=Serine, 
i— i nreonine, r^vaiine, 
W«Tryptophan, Y«=Tyrosine, 
A = \jnKnown, — oiop coaon, /— possiDie 
nucleotide deletion, V=possible 
nucleotide insertion 


8358 

■ 


22259 

- 


A 

■ 


8419 

■ 


711 

• 


296 


NVANS DGIiI AS LWKE YGKADARWVYFDP 

TTVQirPTT.TUST.TVSCT 2H*.T?T.TVIi 

i J. vo VfiliJi V/U*JLA»oJjnUf IlJ, XnJL V AJSa . 

YYRHFLQ I TLCVCELYX3CWMTFLPEWLT 
RSPNLNTSNWLYCWLYLFFFNGVWVLIP 

(ZX .T J .WO<! WT iT7T ilfTrMWmrRT^ C^TVtrCTi 
BUI UjHyJ rr iJIL JJiVEVl'iJl^xvCi 1 OO V JVIYJ7 v 


.8359 


22260 


A 


8420 


3 


97 


FWSSLCNGLIAAQLLFYWNAKPPHKQK ! 
KAQ 


8360 


22261 


- 


8421 


1 

■ 


279 


r»fY" , Y7\TM 7\ UOJ\ T T?KTf /"kC»T T "PTkTT T T Trim 

pjiVT PQT.&PMT .T.nPKrTf*TY3T.T/lT E*WTf PIT 
Lnl J. xr Oil/if DIliJjJJKJN JXlVJlJlAj J. T rVIW_f\l 

I VERKS P YVAVCCILMAFS ILLIQKLVK 

MPDPTfTTWT 
nry a win x 


8361 


22262 

i 


A 


8422 


1 


228 


RHEVFIELNHIKKCNTVRGVFVL3EFVP 
EIKBWSHKYKTPMAHEICYSVLCLFSY 

ViWVtldafifiiUilKi flrKir VSO 


8362 


22263 


A 


8423 


3 


182 


S EDTGEEQ WTAE FX NRGE YE IDIAGYR 
HGK 


8363 


22264 


A 


8424 


1 


70 


VQFVFDAVTDVI I KNNLKDCGLF 


8364 


22265 


A 


8425 


2 


151 


ETTASSCTPASLESRRCCAPCRMPRTGF 
FGSSPLWRPSGSRSLKPGFQQ 


8365 


22266 


A 


8426 


1 


187 


RGRVGPGGERLVPGVPGAEAQQPAGDGV 
RAGP LQARP PAP VG VSQGRCQAAGAAAG 
PPRPDG 


8366 


22267 


A 


8427 


520 


423 


GKYQLQSQENFEAFMKAIGEC WTERQS C 
GSHQ 


8367 

■ 


22268 


A 


8428 


1 


627 


GTSGTRGVTGYFTFTLYLETPKPSISSS 
NLNPREAMETVILTCDPETPDTSYQWWM 
NGQSL PMTHRFQLS ETNRTLFLFGVTKY 
TAGP YECE I RNSGS ASRSDPVTLNLIjHG 
PDLPRIHPS YTNYRSGDNLYLS CFANSN | 
PPAQYSWTINGKFQQSGQNLFIPQITTK 
HSGLYVCb VI^bAlXjIsiibblbljTVK^ 
STRIGLLPLLNPT 


836$ 


22269 


A 


8429 


3 


390 


ILGCNILRVBYSLVICVSVPGSKKVILD 
LPLVIGSRSGLSSRTSSMASRTSSEMSW 
VDLNIPDTPEAPPCYMDVIPEDHRLESP 
TT PLLDDMDGS QD S P I FMYAPE FKFMPP 
PTYTEVDPC I LNNNVQ 


8369 


22270 


A 


8430 


3 


AAA 

208 


NGTHVI I JjJjLJjK J. tAj i V VN VANDM1 VAb 
NLGTGVPNQTPVSSGDLLIRSNGLIjIPG 

1 U£»X IKJjx i loci 


8370 


22271 

■ 


A 


843! 


3 


563 


LPTSRVDPRVRIiDRMKKDQEBEKDQGPP 
CPRLSRELPEWEPEDLQDSLDRWYSTP 
FSYPELPDSCQPYGSCFYSLEEEHVGFS 
LDVDEIEKYQEGEEI^KPPCPRLNEVIjM 
EAEEPEVLQDSLDRCYSTTSTYFQLHAS 
FQQYRSAFYS FEEQDVSLALDVDNRFFT 
LTVXRHHLAFQMGVI FPH 


a J / 1 








o 




WRCSERVAPSA 


8372 


22273 


A 


8433 

• 


192 


1285 


AGVLSIIEETDSEGLQTKWENQTYDER 
L6 1 NDS EE VAS I YTP T PRHQGLPRS AHL 
PNKAMADNS SDECE EENNKEKKKTSQLT 
PQRGFS ENEBDDDDDDDS SETDS DSDDD 
DEEHGAPLEGAYDPADYEHLPVSAEIKE 
LFQVNQLRCVPTDVLDHKLKPFIPDFIP 
AVGDIDAFLKVPRPDGKPJDNLGLLVLDE 
PS TKQSDP TVLSLWLTENS KQHN I TQHM 
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SEQID 1 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 

nentlde 

sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

mm mm VI^V WUw 

location 
correspond 
ing to first 
amino acid 

residue of 

peptide . 
sequence 


Predict- 
ed end 

nurJe- 

IIUww 

otide 
location 
correspon 
ding to 

1 a c+ aminn 
moi miaiiiv 

acid 

residue of 

|JC|liluc 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
&=Glutamic Acid. F=Phenvlalnnine 
G=Glycine, H=Histidine, I^Isoleudne, 
K=Lysine, L=Leuclne, M s Metfak>nfne, 
N=Asparagine t P^Proline, 
Q=GJutamine, R=Arginine, S s Serine, 

* 1 III CUIIII1&) T lOllllOj 

W«Tryptophan, Y^Tyroslne, 
X^Unknown, *=Stop codon, /^possible 

IIUI.IVVll.IUc UClCllUll) \~-*UU3SIUIC 

nucleotide insertion 


. 


- 






• 

■ 




KVKS LEDAEKNP KAI DTW X ES X S BLHRS 
KPPATVHYTRpM PD T DTLMOEWS PEPR1? 

LLGKVSLPTAEIDCSLAEYIDMICAILD 
IPVYKSRIQSLHLLFSLYSEFKNSQHFK 
ALAEGKKAFTPSSNSTSOAGDMETT/TFS 


8373 


22274 


A 


8434 


3 


251 


TLQADHFNTRLSCGDAAQTLWARTGYLG 
FVRRTELTAATGERHDALYVVGALDETL 
ELRGLRYHPI DIETS VSRIHRS IAEW 










i 




AIEEESEDESNEEEDEDDEEYHEWKRTO 
ft r /TRNM PT i w t WYjaaav TV 


8375 


22216 


A 


8436 


95 


340 


CGCGIAGLAMSAIFNFQSLLTVILLLIC 
TCAYIPvSIiAPSLLDRNKTGLLGl FWKCA 
RIGERKSPYVAVCCIVMAPSILPIQ 


8376 


22277 


A 


8437 


1 


317 


Q PKPLVRTS RE PGKDVTTSG YS SVS TAC 
PTSSVDGGLGALPQPTSVLSIiDSDSHTQ 
PCHHQARKSCLQWRPPSPPESTVSQQQV 
KRINItCIHSEEEDMNLGLVRL 


8377 


22278 


A 


8438 


80 


213 


PHLS FNAG I TT I KVN I RNANS XjGGGFHC 
WTCDARRRGTLQS YIiD 


8378 

• 


22279 

* 


A 

• 


8439 


3 

■ 


167 


HEDNIKQLKEMKFTYLINYIQDEINTIF 

NDYI pyvfkllkenlclnlhkfnefiqn 

ELQEASQELQQIHQYIMALREEYFDPSI 
VGWTVKYiEIjEEKI VSLIKiujIjVAIjKDF 

hseyivsasnftsqlssqveqflhrniq 

ksqaiatkkiisdyhqqfryxlqdfscq 
lsd yye kf i aes krl xdls i qnyhtfl i 
yitellkki^sttvmnpymklapgelti 

IL 


8379 


22280 


A 


8440 


103 


354 


NGCECDFLFLFLFYFFETESRSVAQAGV 
GWIMMSGVQDQPGQHGETPFLMQAGLKT 


8380 


22281 


A 


8441 


3 


160 


KLYPLKIVFGMNGRWVKGKTVQQTLIL 
ANVXE ACELMTLDQRI LMF X LAES 


OJOl 






■ 

- 


45y 


3 


AWLPPCSLISDCCASNQQDSVGVGPSEP 

f3ZVR YKT .T .VP P PT »<? TCKT? Q T P VP VTR T7Q 
RCHLS P1»S LTRKGNSLTPCTSRVROCLA 
LLPO^GAI^LSCAHCLALPSEMTPVP 
OMENAE ITRLL 


8382 


22283 


A 


8443 


2 


318 


RQGNMTAALQATLLNPPINTKSQAVNDR 
ARGIVLNVLISFKANDIEKAVQSIiDKNG 
VDLLMNYMYKGFES PSDNSNAMLLQWHE 
KALAAEGVTS IVR VSTALIPA 


8383 




A 


8444 


2 


109 


VT PFPMS CDLQGDCACRNPQAOEHSRKD 
LRGYSHG 




8384 


22285 


A 


8445 


'1 


69 


LVLNSWPQVIRRPWPPKVVGLQV 


8385 


22286 


A 


8446 


1 


163 


P S EKHN I W VG VTQFS RCRLS PLSLTRKG | 
NSLI PCAS RVRQCLTLLRLVHGACTH 


8386 


22287 


A 


8447 


1 


147 


GQAGLELLMSGDRPTSASQSAG ITGVSH 
RARPSSISFILELRGSVRKKF 


8387 


22288 


A 


8448 


914 


729 


GLTMDTQKDVQPPKQQPMIYICGECHTE 
NE I KSPJ3 P IRCREOGYRIMYKKRTKPXV 
VFDAR 


8388 


22289 


A 


8449 


134 


392 


TCPPPPEPPSPLTCAVLVPQTRRWRGLG i 
SLFRRACCVALPLOLliLUiFLLIiLFLLP 
IREEDRS CTIiANNFARSFTLMLR YNGPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 

- 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G)ycine, H^Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glu famine, R=Arginine, S=Serine, 
^Threonine, V=V aline, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /"possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PT 


8389 


22290 


A 


8450 


37 


248 


AKITPLLQPGRQEQDSISNKKILRCCLT 
MLPRLASDSSGFKRSSHFSLPTSWDYRH 
SPPPLASVPPLLMC 


8390 


22291 


A 


8451 


367 


165 


NFFVFLVETGFCHAGQARLQLIiASCDLP 
VLALQSAGITGVSHCARPISTLKDTIKT 
SHFVPLKISGK 


8391 


22292 


A 


8452 


2 


282 


GKRMAGGPEMMQLRLDGNRLYITTSMFS 
AWDKHLYPDLIREGSAMLHVDVDTALGG 
LKLNP YFL VDFWKE PLG PSLAHELR YPG 
GDCSSDIWI 


8392 


22293 


A 


8453 

i 


4 


163 


HHD FHAQSLLAN I ERIAI LVFAACEAAA 
VLSLLVS ISNTYGIiDYLHNLKLLQC 


8393 


22294 


A 


8454 


374 


197 


TPEPKQSSHLGLSKCWDYRHEPPHQARN 
STFDSNHSSLAAEDKLLGYFGLSQWQPG 
YQP 


8394 


22295 


A 


8455 


343 


161 


MGMGPSVPGTGYNLLVCRLLRPLEKCS I 
RVGVSQFSKYHLSRLPLPRBGNSPTPCA 
PWVR 


8395 


22296 


A 


8456 


1 


193 


MGFtjHVGQAGLKLPTSGDPPTSASQSAG 
ITGMSHRVQP YYVLS ICQTLHSVCICIiL 
IILLTTVL 


8396 


22297 


A 


8457 


342 


263 


WLRPWPALWEAEAGGSLEIiRSSRPA 


8397 


22298 


A 


8458 


101 


303 


KAAKYLFQRLHFCFLNLKKDNMLLFNHR 
MEN F KEL FGPGTVAHTCNP S TLGGRGGW 
ITRSADRDHPG 


8398 


22299 


A 


8459 


3 


140 


YLKHLCMYVCKYLRQGLLMLPRLVLNSW 
AQRILLPWLPKVLGLQV 


8399 


22300 


A 


8460 


327 


248 


FPRLVLNSWPKAILMPWLCKVLGLQA 


8400 


22301 


A 


8461 


327 


79 


SIPFILAFFIYLSFFFKETGSQSVTHAG 
VPGII IAQOGLQLLGSGDPPDSASLVVG 
TTGTYPTPILSVFPNMPSFFILLCMLF 


8401 


22302 


A 


8462 


365 


273 


MGFQHVGQVGIiRWPQVICLPWPIjKVLGL 
QA 


8402 


22303 


A 


8463 


375 


217 


QAGLKLLTSDDLP ASASQS AG I TGVSHH 
AQPRHHFFKTRFIYIFFSSPSSVLS 


8403 


22304 


A 


8464 


238 


354 


RGKCFVQAFLYKECSGGQARWLTPVIPE 
LWEAEVGRWIT 


8404 


22305 


A 


8465 


367 


248 


FQMRIHKWLIDLHSPSEIKQITSISIEP 
GVEVEVTIADI 


8405 


22306 


A 


8466 


341 


153 


NMILGWAQWVTPVI PAIGEAHWGGLLYP 
RS S KPAWATQ QNLL FTKKKKNI YI YI Y I 
YIYBRY 


8406 


22307 


A 


8467 


368 


101 


VEMGFRHVGQAGLELLTSSDPPASVSQS 
AGS S TS LPTADLLTP GWVGVYGL VDKNP 
VTL IAMQRLPAVLEKKS VARKNS ENKI V 
LKRGQ 


8407 


22308 


A"' 


8468 


362 


248 


HVGQAGLALLASGDPPTSASQSARIAGV 
SHRARPEAPL 


8408 


22309 


A 


8469 


384 


196 


SARhGh PKFWD YRRE P P CPADDML I LMV 
GILETFSLIFTATMESRNYCVYFKMRNK 
HIISKTS 


8409 


22310 


A 


8470 


386 


300 


LAMLPRLVPNSWSQWLSPLPPNMLGLQ 
V 


8410 


22311 


A 


8471 


380 


178 


PKKNKFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF FFFFFFFFMY I YI F 
AIVYQTLACNNKFIjQMERVTRKLRQC 
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SEQID » 
NO: of 
nucleotide 
sequence • 

^ ■ 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• ■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
G=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
C^GIycine, H 8 Histidine, Msoleuclne, 
K=Lysine, LHLeu cine, M=Meth ion Ine, 
N=Asparagine, P=Proline, 
Q=G1utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophon, Y=Tyrosine, 
X=Un known, *«Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide Insertion 


8411 


22312 


A 


8472 


3 


154 


FL VDMGIHKVS QAGLKLLTSGDLPAS AS 
QSPIPFFLFFFFEKQILAQAGO 


8412 


22313 


A 


8473 


3 


162 


GLYHVGQAGLKPIiTSGDPIASASQSVRI 
TGVSHRTRPKI IFKLIFFLFFLVA 


8413 


22314 


A 


8474 


266 


107 


LCLFL I EMRFP YWQAGVKLLSSSHI»PS 
SASQSDRITGVSHCFWPEVFKRGIF 


8414 


22315 

• 




8475 


361 


169 


RLPGPSNPPISASLAArii*TdAWlHA6Li 
FVFLVEMGLCQAGKHNFEKYSRVLIYSP 
IVSFTHFI 


8415 


22316 


A 


8476 


1 


154 


KKLIPTLMDHGAGFKSSVKEITTAWEM 
ARE VELEVE PBDGNELLQSRGKT 


8416 . 


22317 


A 


8477 


3 


193 


SHYQSLISSNHGHKHCGRPQGPLPRKTI 
DLCSLQYQLTFPPLLTHDPVKSPSVRNT 
QBLSLIP 


8417 

* 


22318 


A 


8478 


408 


103 


TGSRLTLSSRVECSGVITAHCNfLELPGS 
SDPLASASQVAETTGMHHHVWLIFLGIF 
LVKMASHYVAQDDLKLIiGSTDPPVSASQ 
SVGITGISHHAWATICYK 


8418 


22319 


A 


8479 


59 


166 


NVNTCLCLHTHTHTHTHTHTHTHNYFKE 
TPIHQYI 


8419 


22320 


A 


8480 


394 


253 


FLLLLLESGFCHVAQAGLKLL3SCDLPA 
LASQSAEISGVSHCAQPLA 


8420 


22321 


A 


8481 


379 


184 


CFSLPKCWDFRGVPISPGDFWNFIIKLG 
VCPFCQGGFELLAPNDVAASGFQNAGIS 
GINFWAWAL . 


8421 


22322 


A 


8482 


367 


170 


CGLSSGLGGRSGRESERKKERERERDRE 
KKKEGEGERKRE RERES VS S Y KDTNS MG 
SGPHPYSLI 


8422 


22323 


A 


8483 


324 


136 


QRSSCLSLPSS WDYRRMPPHLANFCIFC 
RDGVLPCCPSWSQMTTSFLSLNFLICLV 
GIILGRG 


8423 


22324 


A 


8484 


1 


57 


GE RR YGTC I YQGRLWAFCC 


8424 


22325 


A 


8485 


35 


339 


GWLLRFFWWQWKNGNNLNVSLIDGWLSY 
DTIiGYNGILNCHMYILTBGDSQKKKKKK 
KKKKKKKKKKKIDPAGNSKIYNRKIFKT 
PRGFFKWALGPLLILFF 


8425 


22326 


A 


8486 


330 


66 


RPGVSRFNTPWPLIFFLRRVFLCRPGWNA 
WQSLLTKISASQVQAIFLGGGGCSEPR 
SCHCTPAWATTAKLCFKKKNGKNFTDIM 
FSK 


8426 


22327 


A 


8487 

• 


3 

■ 


209 


CLSLLRSWDAQAPPPHPANFFLMFNFCR 
VGVLLCCPGLASSDPPPLVSQSARITGM 
SHCTWTIFLSFK 


8427 


22328 


A 


8488 


3 


216 


GQS ITVSCTATNSDVGSSNLVS WYQSSN 
LVSWYQQHPGQAPKLIIYEDNKRLSSEK 
KKKKKKKKKKKKKK 


8428 

• 


22329 


A 


8489 


326 

. ■ 


62 

■ 


PAYFS IGS FFLFLFFFKTECHS VTRLEC 
S GA I S AHC KLRFPGSSNS PAS GGGGRSE 
P K£ HHFTLAWTTRAKLHLNKNKI KTVKE 
FLSS 


8429 


22330 


A 


8490 


2 


156 


SSAG IGRTGCF I ATR IGCQQLKARGEVD 
ILGIVCQLRLDRWVCGCKQGQQR 


8430 


22331 


A 


8491 


399 


254 


KNNKRNKT INNKI TNKLI I KLKN I SFL Y 
FFFFFFYFFFFFFFFFFFFF 


8431 


22332 


A 


8492 


2 


130 


FRRVGQAGLELLTSGDLPASASQGAQIT 
GVSHRAQPTFLKSC 


8432 


22333 


A 


8493 


3 


94 


HLPAEFTPAVHASLDKFLASVSTVLTSK 
YR 
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SEQDD 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

• 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to •* 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F^Phenylalanine, 
G=Glycine, H=Hbtidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionlne, 
N=Asparagine, P=Proline, 
>Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine; 
W=Tryptophan, Y«Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


8433 

* 


22334 


A 


8494 


327 


70 


KSGFSNVGQAGLKI LTSSDLPVSASQSA 
GITGMSHCTWPLYP YFGLGS RS FHS I EE . 
HCWYQLAPFPSCRNPFVIVPFFVHRFYL 
TF 


8434 


22335 


A 


8495 


97 


219 


PGTVAHACNPGVLGGQGGWMTSGQEFQT 
S LAS AAKPRSTRP 


8435 


22336 


A 


8496 


3 


191 


GIRRGEILEVIEFTSNEEMLCRDPKGKY 

GYVPRTALLPLETEVYDDVDFCDPLENQ 
PLPLGR 


8436 


22337 




8497 


144 


304 


S YFYEE ILFSTVI KKNAIVS FTATWVEL 
EAI ILSELTQEQKTKYHMFSLVSGS 


8437 


22338 


A 


8498 


2 


90 


LTILVLAMWKVGFFKRNRPPLEEDDEEG 
E 


8438 


22339 


A 


8499 


1 


156 


PSSQDYMHAPPHLANFLNKFFVKWCLAM 
LPRLMSNWPQTILLSPPKALGLHV 


8439 


22340 


A 


8500 


67 


319 


KLMS FGPTLLS ARCVS TGQ I GMNELNLN 
IDAMSVAATINNEIMSFARTWMKLKAIT 
LI ILTLEHKTKYCMCSLI SGSLRMTTQG 


8440 


22341 


A 


8501 


3 

* 


299 


ETGSHCVDQGGVQWHISAHCSLDLLGSS 
DPPASASRWGITSAHHPLLWSISSLPP 
LLLHPHNLIiNLLLLLLPSLS IVLNLHIL 
PPLLPPLRLLCSPC " 


8441 


22342 


A 


8502 


56 


142 


NNSAKKKKKKKKKKKKXKKKKKKKKKGG 
AAF 


8442 


22343 


A 


8503 


486 


348 


TGFHRVAQAGLELLS SGNP PALASQNGG 
ITGVSHHAQPAEYNFKK 


8443 


22344 


A 


8504 


158 


403 


RI FF FFETVGFVAQDG VQGRGAI LQPR1* 
FELKRFSCLSLPSSWDYKRAQPNPNDFF 
KRQMAFDPKIQRI SLGSKKKKNHPIB 


8444 


22345 


A 


8505 


3 


78 


DWHIAYVLLYGPRRVEIMEBESEQ 


8445 


22346 


A 


8506 


164 


291 


TRFCCCWLFWWFCVLCVCVAVFPPWCI 
LFFIWLFFFFFFF 


8446 


22347 


A 


8507 


372 


69 


CAPLPPLKFFFPPRGFIFRGGGSPIFPP 
PKKGVLPKNPPKGFFPPPPFLKPAPGGA 
FNYPPFGGKI FFS PPRGKWAP PGGFLKG 
APPFFFFFFFFFFLFF 


8447 


22348 


A 


8508 


282 


1 


SFQHVAQAGLKLLGSRDPYTLAAQSAGI 
TGVSHHARP VL YFS S VLFSMEFLSTMSA 
VILPCICSFCVSLAVPSSCILQQPLPPT 
CTYIKTETCI 


8448 


22349 


A 


8509 


400 


45 

• 


TPPVSSLFFLPLKEIIFFCPPFRQSLPP 
PFFFFPPPLFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFKYLSLPFTIi 
SKEKNK 


8449 


22350 


A 


8510 


241 


419 


TPLGGWEKKKKKKKKKKKKKKKKKKKKK 
KKKKNKKXKKKGGGFIjKKPLGGANFIGA 
GNN 


8450 


22351 


A 


8511 


317 


28 


QSLSPSPQTVYFI YLI I YAVFFTTLTI I 
LIFSPSFRMELKFLLRCYFFAPFYAFLF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFSQPGAQTTAGRAGV 


8451 


22352 


A 


8512 


124 


292 


MCCYIMSLLKVICTIIKEFSTKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKIKK 
KGGVPL 


8452 


22353 


A 


8513 


1247 


910 


QAQLGDIGTSCYTKSGMILCRNDYIRLF 
GNSGAAGACGQSI PASELVMRAQGNVYH 
LKCFTCSTCRNRLVPGDRFHYINGSLFC 
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SEQID 1 
NO: of 
nucleotide 
sequence 1 

■ 

■» 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I-Uoleudne, 
K=Lysine, JL=Leucine, M=Methionlne, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^erine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














EHDRPTALINGHLNSLQSNPLLPDQKVC 


8453 


22354 


A 


8514 


99 


262 


^KRFLI^KKKKKKIQGQCKKKKKKKK^ 
PFKKNPGGAQ I FPGGEKKI FFLKDGL 


8454 


22355 


A 


8515 


85 


2 


G WGWAQWLTPVI P VLWEAKAGQSRGQE 


8455 


22356 


A 


. 8516 


244 


423 


VFAVFNFLGWVMDMLINIiAWI ILQCAC 
ISNHHWHLEY IQFLFANYTS I KLGKKP 
SKSV • 


8456 


22357 


A 


8517 


3 


114 

* 4 ~ 


KINGEDKQKI LDKCNEI INWLDKNQVCV 
FFFFFFPI 


8457 


22358 


A 


8518 


413 


91 


GGVRGKISFRGGGFLKKKKKKIiLKKKKK 
WGGGPKEKKKRGGFFFFFLLNFFFWGOV 


• 












KKKKKKKKKKKKKKKKKK^ 
GRKEGREGRRKEGEGRQEINMTA 


8458 


22359 


A 


8519 


1 


163 


SRLPGRSRRRSRGRRRRRRRRRRRRRRR 
S RVL I FP WDQNLDLtNTVY 1 1 ED VDAA 


8459 


22360 


A 


8520 


I 191 


21 


RFTTHVDAWATVTVFLI ES VFLHVGQAC 
L ELL PS DDL, PAS AS ENAR VTGVS HRAWL 


8460 


22361 


A 


8521 


2 


144 


4WATSQAWWCMPWPATWBAEVG4^SLKP 
GRLQLPLTMIALVHLSYAI 


8461 


22362 


A 


8522 


90 


8 


KPTPFLPTLIARTTLLLPISPLILIIL 


8462 


22363 


A 


8523 


305 


146 


VFR I HSCGI RGS VDVKI TDGLLVTRR I E 
NVPPGPNNKNKNPYAIFOSSS I ESQ 


8463 


22364 


A 


8524 


.17*. 


2 


LAWPFLFLPKCWDYRHEPLQPASLTTFV 
REWDKIGWAWWLTP VI PAL WEAEVGRS Q 
GO 


8464 


22365 


A 


8525 


2 


200 


CLI PSS WD YRHMPPRP ANFCI FFFFLGD 
HVSOVGLKIJjTSKDPLAWTSORAGITGF 
SHHTKPQGFF 


8465 


22366 


A 


8526 


2 


217 


SFSLLFPPSFSLLLPPLASLLLPPLPSF 
SLLFPPSFSLLSPPSFSLLLPPSFSILL 
HPTFSHLYPPSPSFI 


8466 


22367 


A 

41 m 

f 


8527 


2 


186 

1 ou 


IRGLRPLEKRGIRVGVSRFSRYHLSRIiS 
FARKGNS PTP CAS R VRRCP SLLHGLHPI* 
SDKPQ 


8467 


22368 


A 


8528 


3 


153 


CLGFLSGWDCRCI PPCIiANLKKLFVEMG 
SLFWPQWILPPWPLKVLGLQV 


8468 


22369 

• 


A 

• 


8529 


2 


388 


RTAVHTF ILVLQVCVCRWHLSliAALTPA 
HLVCSSHRILS YLS IRGSWDHRCMTTTP 
GLVI S I FFVEMKYRYAVHAGH ILLGSS Y 
PASAYQSAEITGVSHWAWPTDSLWQKPH 
MVIWCISCFLNVLIGTM 


8469 


22370 


A 


8530 


293 


64 


YRISIKVSKWFKGWVSSFWIFFFFVFFF 
SFFFVFFCFFMFFLKQYLKFI INIIIIL 
TQALY I CTTS KI CYS FLYFR 


8470 


22371 


A 

■ 


8531 


2 

4*» 


250 


SCLGLPS S WDYRGTLPCLANFCI FNRDG 
VSLCCQGWSQTPGSQVIDEAVYSIYAYY 
ILALLIVYVAQWEVWLKHFCCILICYT 


8471 


22372 


A 


8532 


2 


201 


FHHVGQTGLELLSSGDLPASASQCAGIT j 

GVSHCTWPSILYR>3NFKNLSAVFIAWVI 

EGNYTNKTCI 


8472 


22373 


A 


8533 


319 


200 


TFTFFLQRWGLTMLPRLVSKSWAQAICL 
LQPPKVI.ELWA 


8473 


22374 


A 


8534 


160 


297 


KTETLALVAHACNPSTLGGIGRQVSSAH 
EFKTILGNMTKPILYTKL 


8474 


22375 


A 


8535 


333 


228 


PNRGHY I AI VKSHDFWLLFDDD I VEVSS 
FLISYF 


8475 


22376 


A 


8536 . 


3 


325 


PPSSLLPCSWILDCCASNERDSVGVGPS 
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SEQID 
NO: of 

UUvlbU HUG 

sequence 


SEQ 10 
NO: of 

sequence 

- 

- 

• 


M 

nri 
uu 

■ 


SEQ ID 
NO: in 
1JSSN 

09/515,1- 
26 


Predicted 
beginning 

nucleotide 

location 
correspond 

ina tO first 

amino acid 
residue of 
net) tide 
sequence 


Predict- 
ed end 

nude- 

■ 1 U«IV 

otide 
location 
corresDon 
ding to 
last amino 
acid 

residue of 

nentide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 

E=Gluttimic Acid F==Phenvlfl limine 

G=Glycine, H=Histidine, I=Isoleucloe, 
K=Lysine, D=Leucfne» M=Methionlne, 

Q=Glutamine, R=Arginine, S=Serine> 
T=Threonine, V^Valine, 
W°=TrvntODhan. Y^Tvrosine. 

X=Unknown, *=5>top codon, /=possible 
nucleotide deletion Wnnwlhle 

nucleotide insertion 














EPGAG YNLLVCC FLSLLE BCRS IRVGVTR 
FSRCRLSPLSLTRKGNSLTPCASQVRRC 
LALLRIiAHSALHPLSCSYCLALP 


8476 


22377 


A 


8537 


216 


5 


KLG FLVNLVE FQ VKFF FLFFFLE I IL I E 
MRSHYVAOAGLELIjGSGDLPASASOSAE 
ITGMSHHALPNYSF 


8477 


22378 


A 


8538 


136 


22 


RPRRRLVLNPWPOAJLLPWPPKVLGSQV 
WATCGRRPAS 


0*r /O 






RS10 


uo 


99 


RPRRRL VIJJP WPOAI LLP WPPKVLGSO V 
WATCGRREAS 


8479 


22380 


A 


8540 


8 


319 


YIYMSMYMLIYVCIYVSIYICVCIYINS 
ISFNTYMI IYMLIYVYVYMCVYICVCVY 
IYTNSISFNTHITPIKHSDVHTVSTTRC 
NIjGGRGFCHTWPLPHLAGW 


8480 


22381 


A 


8541 


331 


156 


SVNGLLSLPACLPVCLPACLPSFLPSFL 
PSFLSFFLLLLLLLQKPVNSLIFSQSLS 
IS 


8481 


22382 


A 


8542 


72 


1 


KIGQARWLAPVI PALWEAEAGGSP 


8482 


22383 


A 


8543 


2 


83 


QAGVQWRDLGSLQALPPGFMPFSYLGG 


QAQ1 


*>01 Oil 




8344 




2 


lfT\n»mVT ATMS DT?\Tf? THB TVQrtPTTVCMV 

QETCTRIFIAILFI I^PNWKQRQDTMNP 
GNKENGKVATLDSDGFLVRFQS LMRS NC 
PT iYP FN1CJ?P T LT.TC 


8484 

* 


22385 


A 


8545 


314 


122 


LVETGFLHVGRAGLELPTWSDPPTSASQ 
GAGIAAVSHRGQPVDFKNNI STQIQGRP 

ncamcTF 








0540 






GAGI AWSHRGQPVDFKNS I STQIQGRP 
IICNYKTF 


8486 


22387 


A 


8547 


3 


247 


GIMATERLANYTGGIYAEYQDTTYINHV 
FHFLKVI PRKKTS W YSSRLNGS VYV 


o*to/ 






0345 






HRRR VFFTTWrJPP T FT?Pf?PT7f!irDnTT?VQ , l7 
yiu\cAr r imnvjit xr x r r wjfrur ryxrvor 

PFFFPKIGVFFVFFFRFSLVLKGDFFSN 

FLKIKfCFFFFFFFFFFFFFFFFFFFI KN 

RPCL 


8488 


22389 


A 


8549 


289 


2 


KKKIGETLCKLKLCKKFLDVTPKAQFIK 
QKKEKLINWQFIKIKNFRSPKALLRRGK 
GKPGTGRKRNANHI 3DKGPLCRIOKKNS 
RGRAQWLTLW 




99 "TOO 

XZJ7VJ 


A 








ROENPLNLGGGGCS EPRWCHCTAAWVTE 
QDSINK 


8490 


22391 


A 


8551 


293 


183 


RDEGRGISYMLPRLVLNSWPQAILLPQP 
PKVLELQA 




* 








R7 
Of 


MTiPT»r)FFT«GPSTinFr*PPPPR«;G<?RGTKV 

PESTTPRASAFLPPGAANLHHILQLVGA 
PRVPPGFHHVGQDGLDLVDLVIRLPRPP 
KVLGLOGG 


8492 


22393 


A 


8553 


57 


225 


KWQKWTPWAGGVAHACNLSTLGGRGGKI 
ACAEEFKTILENIVRPCLSKNKWISFS 


8493 


22394 


A 


8554 


326 


2 


KSKRGYSEPLVFSICSARAPRSHSVSQA 
TVYGHAHASTQPFPPRLNRSSYLSLLSS 
GDYRTOPPPLAYICISVIGNDVEHLFLC 
LLSTCISSVNRSHVYSTLVPIFGP 


8494 


22395 


A 


8555 


372 


148 


YICFLRSSVRQILFTPDCSDCSMWVPCN 
IRLPGSTHWCASASQGAGTTGARHHARL 
I FFWVFFC I FS I FSRENH 


8495 


22396 


A 


8556 


422 


269 


AADQBRLHT YQIjNY YHFCKR WGLTTFPR 
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SEQID 1 
NO: of 
nucleotide 
sequence 


SEQIB 
NO: of 
peptide 
sequence 


M 
. eth 
od 

- 


SEQ ID 
NO: in 
USSN . 
09/515,1 
26 

• 

■ 


Predicted 
beginning 
nucleotide 
location 
correspond 
Ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
£>=Glutamic Acid, ^Phenylalanine, 
G=G)ycine, H=Histidine, MsotoicJne, 
K-Lysine, L=Leucine, M=Methtonlnc, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, ■ 
^Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X«Vnknown, *«Stop codon, /^possible 
nucleotide deletion, V=pos$ible 
nucleotide insertion 














LVSSSWVQAILPTWPHKVLRIiQA 


8496 


22397 


A 

■ 


8557 


669 


407 


AGVKWRNLSTLQPPPRRFKRFSHLSLPS 
SWDYRCPLPCPANYCIFLVEKEFRHVQK 
AGLELLTSGDPPAWV3LIA0ITGVSHRA 
HPD 


8497 


22398 


A 


8558 


195 


352 

■ 


LGI^GrTASGYSTVTLGGMLKPRSWAW 
WPWPATREAEVGGSLEPGRSKLQ 


8498 


22399 




8559 


345 


219 


MLARM I S I SLPCDP PASAS QS AGITGMS 
HHGWLNLFKS IHL 


8499 


22400 


A 


8560 


137 


2 


VKYCL PTRSLI KTVRPLSPSQATASPVT 
CTYTPRWPEVTEESQKN 


8500 


22401 


A 


8561 


324 


193 


RSSGNLPASAAQSARITGVSHRTRPKCI 
ISKEYVAENNHLKSTS 


8501 


22402 


A 


8562 


326 


121 


HmTHTHTKimKKHQQPQHTHTYHTQTD 
IYVLQAASQKSLILISSLAV 


8502 


22403 


A 


8563 


3 


85 


HEHTHTHTHGHTHTHRHTHTHTIiCARM 


8503 


22404 


A 


8564 


2 


153 


ARDRHI PPCLTNFS VFWRDGGLAMLPRL 
VS KLWAQAVHP P WLLKVLRLQA 


8504 


22405 


A 


8565 


2 


177 


ARVGFHYVGQAGLGRLTSGDP PAS ASQS 
AGITGMSHRARPMS3LTLHILQDTWRAY 
AG 


8505 


22406 


A 


8566 


353 


212 


AGFRHVGRAGLELVTQDGPRTSACQSSG 
ITGMSHCAHPEKSTAFILF 


8506 


22407 


A 


8567 


76 


191 


ELIFCRDGGLTVFPKLISNTWPYAVLPL 
WPPKVLGLQV 


8507 


22408 


A 


8568 


1 


142 


GTRGFliHVGQAGLELPTSGDPPALASQG 
AQMTGV S HCASQKKE FLEW 


8508 


22409 


A 


8569 


250 


95 


SASRAPLPIiPPSHLPLRAAGLSPLCPPR 
LVS S YRPHVILLPLPPKVLGLQV 


8509 


22410 


A 


8570 


3 


226 


HEHTHTHTHTHTHTHTHGHTHTHTHTLV 
LVYSLCEHIWRFIVISRARVCCIYVWF 
FECGYADRRVADKWLWIH 


8510 


22411 


A 


8571 


1 


216 


GTRGLGVGVSHLVS KNTHTHTHTHTHTH 
THTHTATASQLSVFKIWAFQRSLDKGLT 
YISNSYCSSKFYLYNH 


8511 


22412 


A 


8572 


356 


268 


LTMLARLVLNFRPQVIPQPWPPKMLGLQ 
A 


8512 


22413 


A 


8573 


12 


175 


RWGFHPVGQAGLELLTSSDIiPTLASQKA 
EITGMSHCARPSFFSFFLSFFFWKHL 


8513 


22414 


A 


8574 . 


I 


163 


G TRTG FHHVGQAD LELLTS GD P P VSAS Q 
SVGITGVSHRAWPVGLCIIALISNEY 


8514 


22415 


A 


8575 


344 


176 


EFLVETGFRHIGQAGFEIiLTLGDQSTSA 
S QNTE I TGVSHP AP GLAI I FKS GNQAFF 


8515 


22416 


A 


8576 


2 


179 


ARG I LCHSGTFKLDSRPSMVAHTCNPST 
U3SQGGRI AOGQE IDTSLLKVARPRLYG 
QIP 


8516 


22417 


A 


8577 


1 


100 


GTRGFTMLPRWSDSWAQVILLPWPPKV . 
LELQM 


8517 


22418 


A 


8578 


3 


114 


HERHEILSIYLTIYLSIYLSIYLSIYLI 
YLPTITRI 


8518 


22419 




8579 


1 


264 


GTRPCRIELYRWESLAKAQETSGEEIS 
KFYLPNCNKNGFYHSRQCETSMDGEAQL 
CWRVYPWNGKR I PGS PE ITGDPNCHI YF 
NVQN 


8519 


22420 


A 


8580' 


1 


245 

* 


GTSGTSGIIiGFYLEREDRLLQIRTDTIT 
LSHGYSTFS LNRCDSRHHMNRMFQMLYQ 
SPFLGYIQLSHEVIAFGISYIYNTE. 
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SEQ ID 
NO: of 
nucleotide 
sequence 

- 


SEQ ID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

(•• 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F»Pbeny!alanine, 
G=Glycine, H«Histidine, Mso leu cine, 
K=Lysine, L= Leucine, M=Metbionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


S520 


22421 


A 


8581 


1531 


1457 


QARNETIARRLWDVSCDLLGLPID 


8521 


22422 


A 


8582 


191 


2 


TITSYFSPTPAPGSHYATFCFYKFDYSR 
YF I KVELWEAETGGSRGQE I ETILANTV 
KPRLSRA 


8522 


22423 


A 


8584 


2 


212 


ARAGIYPKEYKSFYYKDTCMRMFIAGLF 
TIPKTWTQSKCPSWIDWIKKMYRQGPIL 
SPMLECSGEVSGFK 


8523 


22424 


A 


8585 


2 


192 


ARGNS P ASS SRVAG ITGAHHHARL I FVF 
FLFVRFFFSFLKTGFKLVAHSNFQPLTL 
EAPLHFK 


8524 


22425 


A 


8586 

i 


277 


66 

■ 


PLDFRLGHKRRLP FPKKKKKESKFVCVC 
VCVCVCVCFKRQCLVTLPRLVSNSWAQA ! 
ILLSWPPKILALQV 


8525 


22426 


A 


8587 


395 


286 


PHLGLPKCWDYRREPPCIAEWLKKYVW 
VTYFSFRR 


8526 


22427 


A 


8588 


307 


177 


RYPSTINYS ILNRDKIMPKLDRMVYKAR 
PRVMCLPWPPKVLGL 


8527 


22428 


A 


8589 


1*7 


349 

• 


RNLRELGEFFFFFFWFFFFFFFFVGKFS 
VFLLVGNLFLFFGGGIFGLSRVMCFKEF 
RCFS . 


8528 


22429 


A 


8590 


375 


240 


LLS RFS WFCFVFRQGLTMLPRLVLKCWA 
RGILLPWSPKVLGLQA 


8529 


22430 


A 


8591 


347 


215 


ETGFPHVGQAGLKLLASSDPPASASQSA 
QITGMSRRAQPSAKRF 


8530 


22431 

• 


A 


8592 


340 


114 


HRTAHCS LYLSGSSDPATSSSQVTGTTG 
KDQHTWLLFKYFCRARVSKINKYKS INK 
PYFLNTQTTPSEYGKASPG 


8531 


22432 


A 


8593 


86 


253 


LRLCCGLSCSTAKKKKKKKKKKPKKKKI 
KKKNLHPKRGGFEE ILFFFPKGGKNLF 


8532 


22433 


A 


8594 


3 


135 


GFCHVGQVGLELLTSGDPPALAS QNAE I 
TGVSHRAWPPVLFFF 


8533 


22434 . 


A 


8595 


2 


274 


PRVRT I SLLG ILVYRSHL I S SLLCLEG I 
ILSLFIIATLITLNTHSLLANIVPIAIL 
VFAACEAAVGLALLVSISNTYGLDYVHN 
LNLLQC 


8534 


22435 


A 


8596 


3 


316 


SIPYSWGEKEGIPCMAPPQIQGTSRLND 
FTALSLHLNTHTHTHTHTHTHEIQHLPP 
QG I TAL I LNS LLRHYCPS LAFP S LS P KQ 
NLTVRHCI LS RDTWLATAS K 


8535 


22436 


A 


8597 


385 


291 


KAPPLFFFFFFFFFFFFFFFFFFFFKPV 
KKILGV 


8536 


22437 


A 


8598 


275 


369 


G VQ VLKLLTS GDLPALASQSAE I TGVSH 
RAR 


8537 


22438 


A 


8599 


98 




VHYEEFMCVCVCVCVCVCVCWPRVRGVD 
MNPVEHPFGGGTHQHIGKPCTIRRNASA 
GRKVRLMAAL WTVR IGCTKTVTIiRKT 


8538 


22439 


A 


8600 


328 


195 


THVPPSLAKFVFLVKMGFLHIGQAGLDL 
STSGDPPALASHVFKR 


8539 


22440 


A 


8601 


2 


214 


HVD WMGTFAI LS E LHWDMLHVDPENLR 
LLDSVLNCELANHFGTDFTPPVQAAYQK 
WADVANALAHMYH 


8540 


22441 


A 


8602 


89 


181 


GLRHENRLNSGGGGCS EPKLCHCTPAWA 
TE 


8541 


22442 


A 


8603 


340 


57 


KEGHQEMEQAEDGGHGRKAPGTSPWTIiG 
ELP KSARGHTGLFVS YRGCRTQFRFKKV 
LPXSGDRNRGLAMLPRLVLNSWPQEIIiLP 
QPSKVQGLQK 
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SEQID 1 
NO: of 
nucleotide 
sequence 

'» 


SEQID 
NO: of 
peptide 
sequence 


M 

etti 

od 


SEQID 
NO: in 
USSN 
(09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
. peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanfne 
C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, BNHistidine, I=lsoIeucine, 
K=Lysine, LrLeucine, M = Metblonine, . 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V^Valine, 
W-Tryptopban, Y=Tyrosine, 
X=Un known, *«Stop codon,/=possible 
nucleotide deletion, \=possfble 
nucleotide insertion 


8542 


22443 


A 


8604 

• 


330 


132 


TGFLHVGQSGLKLLTSGDLPTSAFQNAG 1 

ITGGSHCTRPFSLFHSYLVLFYGCSSSQ 

VSLSLLARM 


8543 


22444 


A 


8605 


546 


279 


RPG PTVS PRLEC I CMIMAHCSLDFPGSG 
VSPTSASRVARTTSTHHHAWLIFYSSVE 
'PGS HHVAQ AG IiELLQSSNPP I SASQS AG 
ITAL 


8544 


22445 


A 


8606 


3 

. 


249 


NR FNLGS GGCSE PRSHH CT PS WATEQET . 
CITYLEYCFPPVSHTNIiVYPKPLNVTIR 
SDYC I LGLHAS S IGGSD3L I I DCWD 


8545 


22446 

■ 


A 


8607 


159 


324 


VI Q YRTN ILHLHLCEVLVFWFL PYFFLK 
KTYTGIGQMRWLTPVIPALWEAEAGGS 


8546 


22447 


A 


8608 


334 


179 


TGFPHVGQSGLKLLTSGDLPTSGSQSAG 
ITGVSHCTWLEVIYFXNISIYPS . 


8547 


22448 


A 


8609 


278 


202 


QQPSTLRQDPPPAKRLHLSEGSDGH j 


8548 


22449 


A 


8610 


236 


66 


AAS TFFLQKI KRHLIaSGTVAHACNPS TL 
GGRG I R I SRGRE FKTSLGNMVRPRVYKK 


8549 


22450 


A 


8611 


29 


177 


GKQVMAiaCFI FFHFFFRRSLAMLPRLV 
SNSWVQEILPPWPLKVLEIiQA 


8550 


22451 


A 


8612 


1 


215 


HTSRTLFVHLRLKNFNKYL IEWGLSMLP 
RLVLNSWAQAVLPPWPSKVLGITGMSHC 
TWLYLHIiSTDLLKLC 


8551 


22452 


A 


8613 


3 


91 


LRHGNCLDPGEGGCSEPRS CHCTPAWVT 
E 


8552 


22453 


A 


8614 


2 


167 


FTISLLGILVDRSLTHHINNIKPSFTRE 
NTLMFIHLSPILSTRKQLAIQHLSRLE 


8553 


22454 


A 


8615 


2 


281 


FC I LVETGFHCVAHAGLQLXiSSGNP PAS 
ASQS VG I TGMSH CTS PNLS LLTQS KS FC 
LLIQNHNPYAFINSQLTFS ILFLLS YFI 
FYDSLFLLF 


8554 


22455 


A 


8616 


282 


182 


GQAWWLTPVIPALWEAEVGGLLEPRSSR | 
PAWST 


8555 


22456 


A 


8617 


379 


207 


SNQTTKKYINFFFVEMRSHYVAQAGLKF 
LGSSNPPTSASQSASITGISHYVWPISP 1 
S 


8556 


22457 


A 


8618 


705 


483 


DKSFALVARAGVQWHHLGSLQPLPPGFK 
QFSYLSLPSSWDYRHAPPRPANFCILSR 
DGVSPCWPGWSRTPDLR 


8557 


22458 


A 


8619 


359 


220 


RQSLTMLPRLVSNSWAQAILPPWLSRLL 
GLQALAP I PGKS YNEKTP 


8558 


22459 


A 


8620 

■ 


3 . 


186 


HEVS WVKRKQDBW I EFDDDTVS I VAPED 
I LRLSAGGDCH I AYVLLYG PRRVQILBE 
ESEQ 


8559 


22460 


A 


8621 


1 


116 


GTS FCRDGGLTVLAQLGSS FWPQRI LP P 
WPPKWELQA 


8560 


22461 


A 


8622 


369 


188 


MYSLEMGFIGNGHAGLELLASGDLSALA 
SESAG ITCMSLRYRTRFSFKRATOGLDL 
SLQS 


8561 


22462 


A 


8623 


202 


22 


YILGFYFMPVICYQERKKDKKNWEVTTI 
MSFVGICMELGAIILSECLMQEQKTKYEI 
FSQV 


8562 


22463 


A 


8624 


387 


315 


SNLGGGGCSELRLCHCTPAWATE 


8563 


22464 


A 


8625 


2 


135 


TRDLCSLVYLLTF P PLLSHDPAKS PSVR 
NTQELS I KKKKKGRPF " 


8564 


22465 


A 


8626 


3 


232 


HASAFEE P VY IKSRQKRKESNP PKLVS S 
QPHGLKKKKKKKKKKKKKRGRGLKKKKK 
KHNKKFKXGQKKYNFGRGAF 
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SEQW 
NO: of 

ilUilwVUUv 

sequence 


SEQID 
NO: of 

nentf ffo 

sequence 


M 
eth 


SEQID 
NO: in 
TJSSN 
09/515,1 
26 


Predicted 
beginning 

n ii clenHffo 

location 
correspond 
ing to first 

flminn arirt 
aiiiiuu aviu 

residue of 
peptide 

cpniiAtirp 

1 


Predict- 
ed end 

nil/1 \tKm 

otide 

location 

correspon 

Hi no 

last amino 
acid 

rcMUUV ui 

peptide 
sequence 


Amino acid sequence (A=A(anine 
C=Cysteine, D=Aspartic Acid, 

Ps/^|i|lnn||i> A Vfferl 14^=PIt AtlY/lalnnSnn 

MLr~Kfiuiuniic ACiu, ** rncnviamlliue, 

G=Glycine, fiNHtstidine, I~Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProJine, 

■ 1 ■ / |i ■ 4o rrt ■ n A TJ- — A rfiln ina ' C^— Carina 

\£ — vy iu turn iiic, iv— Arguunc, a— oerioe, 
T=Threonine, V^Valine, 
W=Tryptophan, Y^OTyrostne, 

Ycsllnl/nnun A— C4An niulAn /cnrtcclKlo 

A B uiiKnonD ? — oiop couon, / = possiDie 
nucleotide deletion, ^possible 
nucleotide insertion 


OJOJ 




a 




07 




OHGFT *YY Y Y Y Y Y YY Y YY TfGGPT .f MY PT/2 

GPNFTGDGKKKFFPKKGGI KKP PGE FLK 

KiSn^FLGGKKNGKNPPKKLNPIiGEKKNFK 
GKTCG1TNPP 


8566 


22467 


A 


8628 


2 


188 


RYGCPPPYHTFEEPVYIKSRQKRKEWNP 
PKL VS S Q PHGLHDFFKKKKKKGAAVLKD 
PSGGPS 


OJO/ 






&A7Q 

0O47 




Oo 


gf rgp tow GRPwrGGGns rp v rgrrur a© 

RTGRGEGGGGEAKGRPRSARRRGRGSRG 
SKSQTPSRHLPAHIITN 


8568 


22469 


A 


8630 


364 


136 


VSLLLGIiEYSGT ITAHYSLNFLLFIVET 

lyoti 1 V A\i I LxL lU-d-rtjia bJM F ir 1 0 JJ 0 yx FB X 

TGVSHGAQPKVQFWSKNLDT 


8569 


22470 


A 


8631 


355 


70 


LFP PRFF P FFSPLS PLKFFFS P KGFNFF 
RBFFF I FSPPKKRVIiSKNSPGGFYKPPIi 

p*P*Ti F F F FFIiVI 


8570 


22471 


A 


8632 


2 


266 


AFTISLLGILVYRSHLISSLLCLEdliL 
SLVI I ATVI TLNTHS LLANI VP IAI LVF 
AAC6AAVGLALLGS I SNTYGIJD YGHNLN 


8571 


22472 


A 


8633 


1 


186 


LTHHIlWIKPSFTRENTLMFIHLSPIIiI* 
LKKKKKKKKKKKKKKKKKKKKGGALKKK 
PGGGQN 


8572 


22473 


A 


8634 


2 


80 


VLE I AVAL I QAYVFTLL VS L YLRDNT 


8573 

■ 


22474 


A 


8635 


713 


333 


EGPPPPRSKKKGTQGGGKDPPS PPEPiCT 
PGNPQGRKTGPPPPGRPYTGPRPPGSGP 
TRGGGSRSSSSNTNAPGEKIFFSKNPGR 
KIFPPPAILVFFSPFPLKNFFFSLRLLI 
r LAiviCA-r F r PF Pit 


8574 

- 


22475 

■ 


A 


8636 

* 


389 


1 


Fl FSFLFLI IFFFLNINFFFYIFSSIiS F 

nn/DT WDXTT T T VBTDDTTT BMtfOT pny nri 

c rr J^riNijijiji< t HIi > i J llijb rr £>JjSP±FF 

FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
rrirrrrrrrrrcrrrrrrrrrrjr cur 
FFFFFFFFFFFFFFFFFFFFFGTKKELY 


8575 


22476 


A 


8637 


2 


192 


VRTLGLRTDA I PGRL YQTT FTATR PGVY 
YGQWYEI CGANHS FMP r VX^LLALTIFE 
IGPEFTL 


8576 


22477 


A 


8638 


, 


87 


PTRPLEIAVALIQAYVFxliLVSVYLHDN 

X 


8577 


22478 


A 


8639 


2 


132 


INIiASTIiX I FTILI I1LTII1E IAVAJblQA 

* wV XIII IV Jill XinX/iV X 


8578 


22479 


A 


8640 


2 


113 


IiFFIAE YTNI 1 1 INTLTTTI FLGTTYDA 
LSPELYTTY 


SS7Q 
QJty 






OCA] 

otrH 


0 / J 


£2 

• 

• 


t?TT?TCDT.SVT ,PPT T5'C"DTrCT7TT5T5T.r'T7n r C , TP 

rir is fiofiliir r xj.fr Jrr r Jr X r FijL.r rr r 
I PLFFLFLS FLFS L I FFLFSLI L I FPPS 

FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFCWLPHR VMIAKVGI S WSKKI 


8580 


22481 


A 


8642 


180 


34 


WF I S DSLWLGMVAHTCN PSTLVGQGGW I 
TCGRSRP S VPKRTPSQTCAPA 


8581 


22482 


A 


8643 


338 


133 


NLPLGLFYSLiQKKKKKPFLRPGTLGAPK 
GNFQGAPPLFGKFAFFFFFFFFPFFFFF 
FFFYGFFFFTjFK 


8582 


22483 


A 


8644 


156 


287 


LTYPI KPPI^CSQKKKKXKKKKKKKKKK 
KKKKKGGAFKKNPWGAQI 


[8583 


22484 


A 


8645 


I 


325 


LHHCHTHNFVNKVVCFIRKKKKKGKKKK 
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SEQID > 
NO: of 
nucleotide 
sequence 1 

'i 


SEQID 
NO: of 
oeDttde 
sequence 


M 

etK 

od 

- 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ins to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corresDon ■ 

apra^ P P ^^^^ Bf ■ ■ 

ding to 
last amino 
add 

residue of 

nentide 

sequence 


Amino acid sequence (A^AIan in e 
OCystetae, D=Aspartic Acid, 
E^Glutamic Acid, {^Phenylalanine. 

4Ppp p pa Ph iA Baa a pm 4 k p^pr pp. m p« ■ * pp p p J Vlp w 4vp?pvpj p»4"PV 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
vQ=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan t Y=Tyrosine, 
X^Un known, *=Stop codon, /=possible 
nucleotide deletion. \=D0Ssible 
nucleotide insertion 














KKREKKKKKKKKKKKGGGPLKKPPGGPK 
FSGGGKKKIFPQRGGYKKPPGGFLGKTL 
FLGGE KMGENP PKKI KPLGEKKI F 


■ 


• 




• 




% 


P VRAS RLL CL P KOAWAMAGAP P PA^L P P 
CSLISDCCASNQRDSVGAGPSEPGAGYN 
LVMHCFLSPSEKHS IWVGVTRFSRCCPS 

W P PA A & ta» Pfe Pb^ AA *•» * * V ^P V ■» ■P^»^» P*P* A * 'p* ^PJ* PW lv 

PLSLTRKGNS LTPCASQVRQCLALLRLV 
OGAGTHRTRG 


8585 


22486 

i 


A 


8647 


341 


88 


KKKKIFSPPIKRGPPSVYFICPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFPFFP 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFRMGCDRWHGELYNKI 
PIWHKPCGDL 


8586 


22487 


A 


8648 


1 


108 


PTRPRTRGVASVLYFTT I LI LI PTISLI 
ENKILKWA 


8587 


22488 


A 


8649 


3 


96 


EICGANHSFMPIVLELIPLKIFEIGPVF 
TL 


8588 


22489 


A 


8650 


457 


297 


SPSFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLFYEALF 
FIYLI IKDNYMFQC 






IV 

■n 


OOJl 


M- 




FFWGGLItiTLLGIVLySPLEALKlSPSF 

*» pp »»»j*%pT*41 * AjJ pV Jm«V p*> w «pT*p p««pwppjpp% A pjp* pp %p* p> 

GFLBK 


R590 




A 

** 


OUJi 


2 


1 1 1 
111 


NNI KPSFTRENTLMFIHLSP ILLLSLNP 
DIITGFSS 


8591 


22492 


A 


8653 


1 


305 


RQQQQQQQLRNLRDFLLVYNRMTELCFQ 
RCVP S LHHRALDAEEEACLES CAG KL I H 
SNHRLMAAYVQLMPALVQRR I ADYEAAS 
AVPGVAAEQPGVSPSGS 


8592 


22493 


A 


8654 


129 


268 


LSEVLYLFPPKKKRLIILCLFFRNPGSS 
NLQKI TKEP I IDYFDVQD 

* « mjm^£ * * Pa pa A VW pp Pb> -P«k ^ppr M> P- V 


8593 


22494 


A 


8655 


281 


117 


CI I TLS F INTLQAY 1 1 WLWFSEVCL YV 
SLSI YLS IYLSI YVFEYRYIKISFHL 


8594 


22495 


A 


8656 


3 


177 


DLHAHKLGVDPGNLNLLSHCLLETLAGH 
LPGEFTPAGHAFLDKFLDFVSTVLTYKY 
R 


8595 


22496 


A 


8657 


3 


270 


RRRGRAHCSLDLLGSRNPPASASRVAVT 
TGTRHLAWL I WGGFLFFVLRDVLVRFH 
ATDKDIPKTGQFTKERGLMDLQFHMAGK 
ASQS 


. 8596 


22497 


A 


8658 


107 


11 


QFEHTKPTPFLPTL IALTTLIiLP IS PFI 
LIIL 


8597 


22498 


A . 


8659 


21 


136 


VPIEHDVPVPNSRAELLKMFG IDRDAIA 
QAVRGLITKA 


8598 

- 


22499 


A 

■ 


8660 


1 


395 


LVTALYSLYIFTTTQWGSLTHHISNtiEK 
KKKGKGKKKKRGGAL FKGI FGGSHFPGV 
WELLFFFLKGGIKKTILGFLGKTLFFGG 
GLLGAPLPRKIKGLREKKNFKGELGVKN 
RVFFFFGNFSSLGVYLKKY 




22500 


A 


8661 


I 


255 


RTRGRTRGRTRGLTRGKKKKKKIMKKKK 
KKKKKKKKGGAFLKNPWGGPIFWGLPKF 
YFLPNKGSFFNLIGDFLKRPFFWGGAYF 
G 


8600 


22501 


A 


8662 


136 


358 


FHIVKGVSIIKFKCVILKKKKKKKKKKK 
KKKKKKKKKKGKKL KKKKGG KKKKGGGK 
NKIFFWVGFFYKRVWGYF 


"8*61 


22502 


A 


86^3 '"" 


198 


368 


LINLVQFTC I LGTLLGiLFI YLFliRkAl 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

#h 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
-amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
EKJIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, i=Isoieucine, 
K=Lysine, l>=Leucine, M=Methionine, 
N=Asparagine, P*=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonme, V=Valme; 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *«=*Stop codon, A=possibJe 
nucleotide deletion, \=possible 
nucleotide insertion 














TMLARLVLNS W PQAVLLPQP PKVLGLQA 


S602 


22503 


A 


8664 

• 


407 

■ 


82 


KFLPHLGFPPFKRFFIKKEPFQWFFFLP 
GFFF FRRS FFLLPRVQCNGAISGPCNLC 
LNVKVRLSFKKKKKKCWDYTFEPLCPAL 
HFSFINKTEDCLFVCETVYWPHGF 


8603 


22504 


A 


8665 


166 


293 


KI I AKHNIARTNPYTFCIMKKKKKKKKK 
KKKKKKKKKKKKKKGGAL 


8604 


22505 


A 


8666 


1 


164 


RAKAKRRNTTVS CRMRHLKI VYRRFRYR 
LYVGI I IGPGGLNETLVFTCKKITVP 


8605 


22506 


A 


8667 


295 


85 


YIYMSHI CTLSLSLS I YKRVKCFCFFKT 
WSHSVAQAGVQWRDLGSLQPPPPKWILC 
ISVFFTVLFVMKKN 


8606 


22507 


A 


8668 


2 


131 


INLPSTLIIFTILILLTILEIAVALIQA 
YVFTLLVS LYLHDNT 


8607 


22508 


A 


8669 


423 


107 


HCALPKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKK 


8608 


22509 


A 


8670 


432 


301 


SQILGRFSQKQHLSLGGGGCSHPRS CHC 
TPAWARGRLRLKKFIK 


8609 


22510 


A 


8671 


1 


106 


LKLLTSSDLPAS ASQVAB ITGMSHCARP 
LSRIYMC 


8610 


22511 


A 


8672 


383 


252 


CPANFVFSVEKGFIiHyGQAGliELPTSGD 
PPGCWFCFLQQQLHW 


8611 


22512 


A 


8673 


560 


135 


SSWGMDS IPTSSNMEETQQKSNLELLHI 
SLLLIESRLEPVRFLRSTFTNNLVYDTS 
DSDEYHLLKDLEEGIQMLMGRVEDGSHL 
TGQTLKQTYSKFDTNSHNHDALLKNYGL 
LPCFRKDMDKVETFLRMVQFRS VEGS CG 
F 


8612 

■ 


22513 


A 


8674 

• 


2 


326 


SLLMCVHRCECVCMRACLCAGVCMCIAS 
CLGLPMNWECYTWRVLVFHQFQDEELH 
DTVDLETI PLBRQPRDVQHPVSTRILYL 
HVYFVAVTLTL I R ILQLWTEAFSP 


8613 


22514 


A 


8675 


171 


272 


TQCCLLKAEKKKPKKKPKKKKKKKKKKK 
KGGPF 


8614 


22515 


A 


8676 


60 


409 


LVWLBSHEDVGPLI KDSKQEKKKKKKKK 
KKKKKKKKKKKRGGAFKKKPGGGQNLRG 
GEKKNFFFLRGEKKNPRGNFGKKTFFWG 
GENWAKPPPKKKSPEGKKKILKGKGGKK 
YLNFLGE 


8615 


22516 


A 


8677 


106 


286 


LFWNKEGNNWKKKKKKKKKKKKKKKKKK 
KKKKKKKWGGALLKISGGGPIFPGGGKN 
FFFFFGGGF 


8616 


22517 


A 


8678 


1 


97 


IIGQVASVLYFTTILILIPTISLIENKI 
LKWA 


8617 


22518 


A 


8679 


1 


141 


NEHQ PGQHGEMPS LLKIQKI S WAWWSMP 
VTPATQEAEMEKESQISIM 


8618 


22519 


A 


8680 


499 


140 


GVFFFSRRKRCGVSPPPPPKPVFFPLPG 
FFFWWGGCPSAPPPPGGGFPPNPPRGPF 
Y PPPI>KGNr if c PPPAWaGPPKGr FWAPP 
PPFFFFFFFFFPFFFFFFFFFFFFFFPF 
FPFFFF3 


8619 


22520 


A 


8681 


60 


224 


KQTKYIKE ITKHI CTWQGRYHDHEGGFP 
RVKL I HCTPDMLT PVI SPDVGNSTTV 


8620 


22521 


A 


8682 


252 


383 


SQXKKKKKKKKKKKKKKKKKKKXXGGGL 
LKNFWGGPNLTGGGKI 


8621 


22522 


A 


8683 


405 


145 


KFFFSGGCF FFWG PPGKNVP P P KKMFPF 
LIPPPKKKKPPPPKKIFFPPPGVFFPPP 
PPFFFAPPPFFFFFFFFFFFFFFFFFFF 
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peptide 
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Amino acid sequence (A«Alantne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamlc Acid, F=Phenylalanlne, 
OGIyclne, H=Histidine, I=Isoleucine, 
K=Lysine, D=Leucine, M=Methionfne, 
N=Asparagine, P=Proline, 
Q=Glutamtne, R=Arginine, S^Serine, 
t=Threonine,V«VaIine, 
W^Tryptopnan, Y=Tyrosine> 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, \ppossible 
nucleotide insertion 














FFF 


8622 


22523 

• 


A 

• 


8684 


1 

• 


406 


LYYFLTKSTTTYLAVPQPFPPTPSTPSR 
AKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKRGGG PKKKTRKGGGGQKKKKGR 
GPEKNRGGGETTPGEKKKKKKHGGGGKK 
KKKKSGGGKKNGGKPGRGGGGGR 


8623 


22524 


A 


8685 


391 


265 


GRPFFFFLDGVSLLLPRLECSGVILAHC 
NLCLPGPSDSPSSA 


8624 


22525 


A 


8686 


2 


177 


DTALYS LY I FTTTQWGSLTHHINNI KPS 
FTRENTLMCIHLSPILLLSLNPDIITGF 
S3 ' 


8625 


22526 


A 


8687 


479 


366 


GLPKCWDYRREPPHPAANFCIFSRDGVS 
PCWPGWSCL 


8626 


22527 


A 


8688 


35 


161 


SNPPSSAPQVAGTTGVRHHDQLTFFFFF 
FLGGTGQILCRGGD 


8627 

• 


22528 


A 


8689 


1 


122 

• 


THHINNIKPSFTRENTLMFIHLSPILLL 
SLNPDIITGFSS 


8628 


22529 


A 


8690 


2 


152 


NIILAFTISLLGILVYRSHLISSLLCLE 
GIILSLFIIATLIYPTPSFSCY 


8629 


22530 

■ 


A 


8691 


541 

- 


85 

- 


IFLRQFLSPRLEYSGVIKAHCSLNLLGS 
SDPPASAFKVAGTTGIRHHAWLIFVFFF 
AETGFHYVAQGGLELLNLRNLPATAFQS 
AGIIGISHHDLLGIQITTWLLQKLSYLQ 
KKICKDQETGLLHAFHLHPYPYPMPSSP 
QLLQGPDILTNG 


8630 


22531 


A 


8692 


122 


38 


GRVDHS FMPIVLELI PLKI FEIGPVFTL 


8631 


22532 


A 


8693 


276 


77 


GVS PCEPVFFF I LGRCNP PAWGVPKGRD 
FFVLSFFFWFYILLFFFFFFFFYFFLF 
SFYCFNGSNK 


8632 


22533 


A 


8694 


3 


70 


FMP I VLELI PLKI FEIGPVFTL 


8633 


22534 


A 


8695 


230 


3 


G£GIPSHPGHPSLPYHLEREWLAKTGIR 
DTSNSVHTiIjKLCLRSQRHGRARWLTPVI 
PALWEAEAGGSRRQEIEPI 


8634 


22535 


A 


8696 


401 


240 


PPPPPPPCXSSEPRPPHPSLSPPGSREMF 
VLALSQESQMTFFFFQLRKKLHLYL . 


8635 


22536 


A 


8697 


103 


313 


EKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKGGGGPKKKKFEAIFGKTP 
GGVSQGGGPPPLNFFFLNPGPPFF 


8636 


22537 


A 


8698 


354 


110 


KKGTLFEVSES F YTTEGVSAKINRIRS I 

* -W mm ^»*mmm t*K m"m W M«t IHHib mm V ■ mm H mm*. ■ ■ ■ ■ 

VS VNKVARCIGE I FTDAVQS AFQKEVGG 
VSDSMVHKYEYAGCVDVKAVCSHGL 


8637 


22538 


A 


8699 


2 


164 


FLVETGFLHDGQAGLKLL I SGDPPAS AS 
QSAEITGMSPHACNSSTLGGRGGRIT 


8638 


22539 


A 


8700 


191 


371 


PPDSRWAIQGGHLELTPTTRQDLAQAPR 
RKLALLEGSFQGQAQWLTP VI PALWEAE 
ASGS 


8639 


22540 


A 


8701 


2 


170 


ARAGLYHVGQAGLKPLTSGDPLASA3QS 
VRITGVSHRTRPKI IFKLI FFLFFFFFF 


8640 


22541 


A 


8702 


31 


229 


MEKYNVHPHSGILRSHEKEOAALFTIAK 
RWRQPS YPS IDEW INIMWSSHTVEYYTA 
MKRNKQLYSQ 


8641 


22542 


A 


8703 


3 

« 


126 


LQELRDPTLTFRLLGSPRPVWETRPVD 
DPTAPSNLYIQE 


8642 


22543 


A 


8704 

■ 


438 


31 


SEPCAGYNLLVCRFLSPLE KHNIRVGVT 
RFSRCHLS PLSLTRKGNSLTPCTSRVRQ 
CLALLWLAHSALHPLSCMHCLALPSEMN 
PVPQMEMQKSPIFCIAHAGSCTPELFLF 
GHLGSTPLTHLLSLRINVFLRTSE 
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SEQ ID 
NO: of 
nucleotide 
sequence 

* 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

cth 

od 

• 


SEQ ED 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
^Glutamic Acid, F=Phenylalanine, 
G=G)ycine, H=Histidine, I=Jsoleucine, 
K=Lyslne, L=Leuclne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W»Tryptopban, Y-Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibte 
nucleotide insertion 


8643 

* 


22544 


A 


8705 


429 


319 


PGAI IAHRSLNLPGSSDPPTSASQVAGN 
TGVSYCARP 


8644 


22545 


A 


8706 


426 


245 


RAATPSPIKLTKKKKKKKKKKKKKKKKK 
KKKKKKKKKKRGAPLKKTLGGPQ INRGK 
KKIFFFF 


8645 


22546 


A 


8707 


1 


108 


KHPRGGS FGYCFALAWVAFPLALVSGI I ! 
YIHLRKRE 


8646 


22547 


A 


8708 


3 


146 


QI EGHTI SALGDGAAWFVQGLIRHFRPE 
LEERMQRFAQQHQARQAAS 


8647 


22548 


A 


8709 


2 


83 


LVLEIAVALIQAYVFTLLVSLYIiHDNT 


8648 


22549 


A 


8710 


1 


81 


RPLS KTVRFNVLKVTKAAGTKKQFQKF 


8649 


22550 


A 


8711 


156 


4 


RPLS FHPGGKKRLFLKKKKKKLCAVAHA 
CNPS TLGGRGGR I MRSGDRDHPG 


8650 


22551 


A 


8712 


440 


233 


TPSYFVLLVERGFSMLPRLVSIjTSGDPP 
ASAFRSAG ITGMSHHAQPRTGRSKAYLP 
FLEEPDLKHIFPF 


8651 


22552 


A 


8713 


92 


302 


amhptmgpqdqghtgglyahtplhphsh 
thantlihshiyahahshtipatcpkgp 
tqpylcllsphiel 


8652 


22553 


A 


8714 


2 


166 


KKINS FFSQGLTQAKVHGGRPS S LGPQT 
PVLKRSS CLSFPSS WDYRHEPLYPAN 


8653 


22554 


A 


8715 


1 


129 


GFRHVGQADLELLTSGDLPASASQIVRI 
TGVS PCAQHEFFKSN 


8654 


22555 


A 


8716 


381 


105 


RRSLLHS VLNGAQAGVQWRDLGS LQP PP 
PSSr.PWPPKVPRLQPLPGRHPVWEVRSV 
SARPP IVWDVRSPSAWLPSLESEERLCL 
AAIPSGK 


8655 


22556 


A 


8717 


385 


125 


FFFFLPPAFLLSGXFWFFFFYI FSFFFF 
FFFLGS P PPP I FFFFF FFLNKI FFFFFF 
FFFFFNNLFFFFFFFFFFF YFFFFFFFF 
FFF 


8656 


22557 


A 


8718 


409 


264 


KTDQRWSSTSSSKIMSQSQVSKGVDFES 
SEDDDDDP FMNPSSLRRNRR 


8657 


22558 


A 


8719 


3 


269 


KKHVKRVLSHLKASCPEELLHFSSWPQL 
LEI^LMGGGSHTHMDQHTHTHMHPHIYT 
HAHTHTVVAKLNELP WVEMNLLS LGAFQ 
QRLP 


S65S 


22559 


A 


8720 


430 

* 


141 


LFFFLSFFEDGVLIiCHPGWSWAQSQLT 
ATSTSKKKKKKKATP PGI PKFL IGKSGK 
PPRVLLIGHWAPFKFFFFNPRIAFF 


8659 


22560 


A 


8721 


386 


290 


RWGLTMLPRLVLNYWLQAILLLWPPKVS 
GLQA 


8660 


22561 


A 


8722 


3 


157 


HE VFLVETVFLH VGQAGLELPTSGDIiPT 
SALWBAEWGGLLEDRSSGPAWET 


8661 


22562 


-A 


8723 


493 


186 


FSPQGGEKRAILGLRNLCPPGVKDFSAR 
PPKEVGNEGRVPKTKLFFVFLKKKGFPL 
IGRGGFKSRPWESPPRPPQKVGVQRLNP 
PPGPFFFFDLIWPNRNGS 


8662 


22563 


A 


8724 


3 


186 


GGGTPPFPGVWGKGNGGGTPPAKTLPPF 
PS PLFFFS PCPS PGEGGSREPLL S TPLA 
CICI 


8663 


22564 


A 


8725 


370 


67 


FAPPKNFYKGAPPIFFIFLFFILFIYFF 
FFFF I FFKFLVFLLETG FHHVSQ EDLDL 
VIHLPQPPKWGLQAWATTPGQFFVFLI 
ETGLH P VNQDGHNLLTL 


8664 


22565 

• 


A 


8726 


966 


652 


LGSLKPPPTGFKRFSCLSLPSSWWYRHA 
PPCPANFVFLEETGFLHVGQVSLELLTS 
GDPPTLASQSAGI TGMSHRDWLGQHLI V 
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SEQID * 
NO: of 
nucleotide 
sequence i 

'i 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
co rrespon . 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCystetne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, 1=1 so leucine, 
K=Lysine t L=Leucine, M=Methionine, 
N«Asparaginc, P^Proline, 
Q=G]utamine f ,R=ArginiDe, S=Serine, 
T=Threonlne, V=Valine f 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NHC YLFYQGH I KTTRSLLP P P 


8665 


22566 


A 

* 


8727. 


85 


425 


HMHSFAHELHTFLHTHTNSHTNSQNQSP 
TYTSK3HTHSYMRFTDSHDSQMQQTHRH 
: SLLHMTQLYTCTHTHTHTRTVTPLHLHS 
VKGWLNTKVALGGRTSHGRBSHZAGRLL 
A 


. 8666 


22567 

> 


A 


8728 


393 


124 


LGSGEPQLFQSPPSGGPPGPKSHWNSWG 
NANPPVPPYT IESAPPPPHFPFGAQSKY 
SGGQAARLCPCPPFFLKKKKKKAPGSGT 
GTFSFQ 


8667 


22568 

• 


A 


8729 


25 


217 


AVQP IRVQWRRS LQS PQEQ I I LAPSLAK 
VDMEMTQLTQBNADFATRDRYHHSSLVN 
REQLMPHY 


8668 

* 


22569 

• 


A 


8730 


1 

- 


611 


PGIFYSALLSLDTSILNQLCFIMHRYRK 
NLTAAKKNE LVQ KT KS EFN FS S KTYQE F 
NHYLTSMVGCLWTSKPFGKGIYIDPBIL 
E KTGVAEYKNSLNVVHHPS FLS YAVSFL 
UYTSSSSSSSSSSSSSBSSSSSSSSSSS 
SSSSSPSPEERTVNVSSIRGKKWSWYLD 
YLFSQGLQGLKLF IRS S VHHS S I PRAEG 
INCNNQY 


8669 

• 


22570 

• 


A 


8731 


120 


378 


VEKTQAS I LGCKCSTPRAALGFRERTLP j 
LRRCPCCGP PEAQMELEKRALGS QAQWL 
MPVFPALWEAEVGESIaEARfiFETSLGNI 
A 


8670 


22571 


A 


8732 


2 


68 


VYVCVCVCVCVCMCVCRAKVGM 


8671 


22572 


A 


8733 


382 


258 


NYHSSLHLETPGLKQSSCLSLPKSWDYR 
HESPCPALIFNSL 


8672 


22573 


A 


8734 


479 


326 


PPLYGFFFFFFFWGGGFFFFFFFYFFFF 
FFFFFFFFFFFFFFFFFFFQTTF 


8673 


22574 


A 


8735 


422 


2 


GRRPPGIiYFFFFPPGKKNFQGRGGFLFF 
FPPKGFFFFFFPIGFFFFSSELGKDWPP 
PKKKVFSQRFPFFFFPPPLFLFLFFFFF 
FFFFFFFFFFFFFFFS FFFFFFFFFFFF 
FFFFFFFNKQVFIERLLCARHCFRPPAS 


8674 


22575 


A 


8736 


7 


75 


IAVALIQAYVFTLLVSLYLHDNT 


8675 


22576 


A 


8737 


2 


176 


N FGLiiAE TG FL YVGHAG LEVRS S GDLPA 
SASQRAGI TG VS TAPGFNR YFYKQTI YK 
YG 


8676 


22577 


A 


8738 


408 

■ 


111 


GSSPPPRAGGENFLKKNAGGKNFPGGEG 
GGGFYPLYPKKFFFSPKGFIFWGGGGGK 
WPPPKKGGFSKKPQKVFFTPPQKKKKFF 
WPPGGNWGPPKNF 


8677 


22578 

■ 


A 


8739 

• 


5 


249 


YEGLKKLHMQASPFQRGHPVNHKKKKKK 
KKKKKKKKKKKKKKIGCKKIOCKKKKKKKK 
EGALGGKFLAQTS FKGEKQKVFI KY 


8678 


22579 


A 


8740 


1 


226 


FFLRQSLSVTQAGVQWCDLSSLQLLAFQ 
GSSNSPTSASQVAGITGVHHHIQLIFIS 
LVETGFHHVAQAGLELNLQ 


8679 


22580 


A 


8741 


4 


96 


DGLIMLPR1.VSNSWPQGILPPWPSKMLG 
LQV 


8680 


22581 


A 


8742 


19 


112 


FGRI LLVRE KKKKKKKKKKKKKKKKKKK 
GGGPL 


8681 


22582 


A 


8743 


362 


234 


KMDQKCLSEVLQRWFPCCCCCCCCCCCC 
CCFSLPFIFEPSRS 


8682 


22583 


A 


8744 " 


3 


419 


LTLRRFQLNLTALTKKKKKKKKNLKKKK 
KKKKKKKKKKKKGGPLFKNTSLGGPIFPG 
GGKFIFFFFWGGFLKPPGDFLKKTFFLG 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

• 


SEQID 
NO: of 
peptide 
sequence 

• 

- 


M 

ejh 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1. 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 

• 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
actd 

residue of 
pen tide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteinc, D=Aspartic Acid, 
E^Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, J=Isoleucine, 
K=Lysine, L=Leuclne, M=Meth!onine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
•^Threonine, V=VaIine, 
W»Tryptophan, Y-Tyrosine, . 
X=Unknown, *=5frop codon,/=possible 
nucleotide deletion, V^possible 
nucleotide insertion 














GE I FGP P PPKNLTPLGKKKI FKGVGGKN 
PPLFLRRKKFFSGGFFKKIFSPGLGF 


8683 


22584 


A 


8745 


444 

- 


1 


KKPSKEILYPENSRFFSPLSPLKFPFSP 
KGFNFWGGGGPLCPPPKERFFSKNPPGG 
FFSPPLKEKIFFFPPPVNLGPPRDLFKG 
APPFFFFFFFFFLWEMFYWLSLSSGSQR 
LPQQVPTVEPSELGGMGAYFVSKRSTYL 
QRGWRPER 


8684 


22585 


A 


8746 


104 


222 


NKTFCUiKKKKKKKKKKKKKKKKKKKNK 
KKKKKNWGGAL . 


OUOJ 


*xJOv) 


A 


87d7 

1 






MEKYNWQPHSGILYSHBKEQAAIjFP IA 
KRWRQPS CLS IDEWRNIMWSSLTVE YYT 
AMKRNRQLYSQ 


8686 


22587 


A 


8748 


1 


90 


RTRGAVYAALERMGLDGCVEDLRSRLQR 
GP 


OwO/ 




A 


K74Q 
o r **y 


AAA 

• 


OJ 


S PP P PGL FFFFFFPKKKTS P PPTKKGFF 
SPPPPQKFFFFLKPFFFLGGLGPNFPPP 
KKNFFS KNPPRFFFFPPFKKKNFFFPPP 
FFFAPPKFFFLTPPPLFFFFFFFFFFFS 
FFFFFFFL 


8688 


22589 


A 


8750 


2 

mm 


309 


EFFPPYWEF1/KINACMFSPEKKKKKKKK 
KKKKKKKKKRGPPLKKTPGGPQIPRGGK 
KKIPPLKGGQKKPPRGFLEKNPLFGGGP 
FGPPPPPKINPPEKKKNF 


8689 


22590 


A 


8751 


3 


76 


TASVSEGGGLQGITMKDSDEEEEG 


8690 


22591 


A 


8752 


433 


145 


SFFFFFFFFFSFFFFGKKSSFFTPL 


8691 


22592 


A 


8753 


2 


124 


GHLLMHL I GSATLAI ST INLPSTLI I FT 
ILIKKKKKGRPF 


8692 


22593 


A 


8754 


80 


201 


F I NANS KKK3CKKKKKKXKKitKKKKKKKK 
KKKKGGGLLKKTPGGAQF 


8693 


22594 


A 


8755 


415 


i 

M 


IYFPTPEKFGPPKETLKKGAPFFFFFKQ 
KFPFFGPGGKQKGGFGSLQTLPPGKKKI 
SRPNP PRKRGFKDAPP PPGKFVFLKKKG 
VFPGGQGGFKTPTPRDLAP PAPLNPGAL 
FFFFLRQSLSLSLRLECSGTIIiAHAS 


8694 


22595 . 


A 


8756 


3 


136 


FTATRPGVYYGQCSEICGANHSFMPIVL 
ELIPLKIFEIGPVFTL 


8695 


22596 


& 


8757 


80 


326 


KKFSLGSQGGRAGVFPRPPLGKLQTTAL 
NSGAKGTPPFLFPEPRKRGGPPPAPGWE 
GI FCPHFFPMGQ KGEGGLWHTGEGPH 


8696 


22597 


A 


8758. 


1 


164 


SLISSTQGHKQCRRPQGPLPRKTRDPCS 
HVYLLTFPPLLS YDPAKS PS LRNTQE 


8697 


22598 . 


A 


8759 


431 


79. 


FFFFF I FLKKVSTQSPPPQFYFFFFliKT 
FFLFQ PLLKEGGSWGGRGDKPPPIaAPFK 
KWGVFFFFFSFFCCFVFFFFPLFFFFFL 
FFFFFFFFFFFFFFFKFFFFFFFFPFFF 
FFFFF FPRCVLTRL 


8698 


22599 


A 


8760 


60 


194 


ARPTC PAT AVTQKKKKKKKKKKKKKKKK 
KKKKKKGGGVLKKKQGGRKY 


8699 


22600 


A 


8761 


383 


38 


CASMTFPKKKKKKKKKKKKKKKKKKKKK 
KKKK 


8700 


22601 


A 


8762 


399 


164 


HITVKSLLVPMDDPPKKKKKKKKKKKKK 
KKKKGGGFKKNLWGGQKLTGBKKKIFFF 
LKGGKKKPLGIF 


8701 


22602 


A 


8763 


353 


48 


NFFFKVFFFLKDFSHKGAPPKKGGPPKK 
TP PRKKFFFLKIKPLFFI AFFFFWS LFG 
FFLI FSL FFFFFFFFFFFFFFFFFFFFF 
FFIVFTITLMHFKIYLH 
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SEQID ' 
NO: of 
nucleotide 
sequence 

■| 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/5154 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


1 Amino acid sequence (A^AIanbie 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
(XJIycine, KHBtstidine, Msoleucine, 
K=Lysine, L= Leucine, M=Methionlne, . 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=^Threonine, V=Valine, 
\V»Tryptophan,Y==Tyrosine, 
X=Un known. *=Stop cod on, /"possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8702 


22603 


A 


8764 


2 


93 


. FAMLPGLVSNSWPEAICLPWPPKLLGIiQ 
MM 


8703 


22604 


•A 


8765 

* 


398 


63 


PPP P VFPFPSGG F FFWGGGGQKS PPRER 
CFFFKMHPGFFYIPPFVGKKVFFVLEGV 
VCPLGFFLSGGAFFFFFFFFFFFFFFFF 
FFFFFFGFFFFFFFFFFFFGFPSVSHL 


8704 


,22605 


A 


8766 


3 


83 


ILEIAGALIQAYVFTLLVSLYLHDNT 


8705 


22606 


A 

• 


8767 


450 


306 


QKKKKKRRNKPEGLGTVAHACNPSTLGG 
RGRWITQPGQQEGNFISKY . . 


8706 


22607 

• 


A 


8768 


1 


111 


WSI*ALVAQAGAPGFKQFSCLGLPK£WDY 
RHEPPCPGL 


8707 


22608 


A 


8769 


385 


261 


FIiAEMGFCRVGQAGFKLLNSGDPPASAS 
QS AG I TGVIAP VL 


i 8708 

1 


22609 


A 


8770 


3 


293 


SLGS KPLGLLS LS P VKWFR I FGKERNKV 
WGKDGGTDRNQS SS AFKHLQRGDSDPKQ 
NKI KACS SKFYLRRCVKRS FLLI KKKKK 
KKKKKKKKKKKKKKK 


8709 


22610 


A 


8771 

O / / 1 


7 




O/CVCVO^VYIYIVYVYICl^TtfMfi^/Y 
MYICIYIHVYIHVYTCVYVYIRIHMYIC 
ICVCIYTCIYTYIYTHTYKHSVYYLHNF 
YINPKLLQS 


8710 


22611 


A 


8772 


295 


161 


THTKTHTHTHTHTHTHTHTHTTCSLPPS 
LAQLLGNCC I KAAI S ILYIL 


8711 


22612 


A 


8773 


1 


318 


FFFFVRWSFTLVAQARVQWRDLGSLQPL 
PPGFKQFSCLSIiLSSWDFRHTPPCLANF 
VFLVETGFLHVGQAGLELPTSGDLPALA 
SRGAG ITG VSHHPQPPLCFLFL 


8712 


22613 


A 


8774 


3 


70 


FMP I VLELIPLKI FEIGPVFTL 


8713 


22614 


A 


8775 


369 


191 


TV KINKIGWAW WCAPLVPAAWKAEVEGL 
LEPGSLRPASVIQQDPHLKKGTYSRVLT 
THL 


8714 


22615 


A 


8776 


2 


124 


GFRHVGRAGLE FliTSGDPLAS AFQSAG I 
TGMNHHVQPIVE 


8715 


22616 


A 


8777 


1 


162 


LKYYTADENGKTSRLLPQRPSDECGAGV 
FMASHFDRHYCGKCCLTYCFIKPEDQ 


8716 


22617 


A 


8778 


346 


138 


YNS PPYKEKT I PLQARVNFGP PRDSLKR 
PPFFFFFFKRQSLTMLPRLVLNSWAHGI 
LLPWPPKVQVLQA 


8717 


22618 


A 


8779 


2 


117 


LNLDTTAVQVRNYPRIRESYKVSFLSAL 
EEYTKKLNTQ 


8718 


22619 


A 


8780 


2 


97 


WDLTMLPRLVSNSWAQVILPSWPPKVLG 
LLA 


8739 


22620 


A 


8781 

• 


403 


276 


IISTVFQRGSCPIPSRKBVCSERPRRLQ 
KDSLTGFPHTVYKH 


8720 


22621 


A 


8782 


2 


119 


QAGLELLT S S DLPASATQS AG I TGMS QR 
ARPHFQCKNVP 


8721 


22622 


A 


8783 


245 


101 


ELQFKMRSGWRRSQPISWGLTLVPRLIS . 
NSWAQVILLTWPPKVLGIQA 


8722 


22623 


A 


8784 


2 


124 


AKLE KKKKKTGRAKRRMQ YNRRFVNVVP 
TFGKKKGPNANS 


8723 


22624 


A 


8785 


3 


355 


VGPSEPGAGYNLLVCRLLRPLEKHGIRV 
GVTGFSRCHLSWLPLARKGNSPTPCASW 
VRHHLALLRLTVCGLHPLSCTHCLTSPS 
EMNPVPQLEMQKSPVFCVAHAGSCRLBL 
FLFGH 


8724 


22625 


A 


8786 


238 


9 


PVKLGAG P PQAPFWLRKWAPLI KPFFKG 
PIGSLQIPLFFFLRQSLTLPPRLECSAT 
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SEQ ID 
NO: of 

n iidpfltfffc 

U ULKVUUv 
KU U 6111* 
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SEQ ID 
NO: off 

■ 


M 
eth 

uu 

- 

• 

* 


SEQ ID 
NO: in 
USSN 
09/515 1 
26 


Predicted 
beginning 

n it /*lpfhfirfo 

■ 1 UklCvLlUC 

lnmthn 
corresDond 
ing to first 
amino acid 
residue of 
oeotide 
sequence 

* 


Predict- 
ed end 

location 
correspon 
din? to 
last amino 
acid 

residue of 
peptide 
sea ue nee 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 

F,=^»lutamlc Acid F=PhenvIflfflnlna 
R=Glvcine H=HkHriin*» fcsTcnlpiirin* 
K—Lvsine. L^Leucine. M=Methionine 
N=Asparagine, P=ProIine, 
0 = Glutamine. R^Arcinine. S S5 Serine. 
T«Threonine, V=Valine, 
W^TryDtoohaD. Y=Tvrosine. 
X=Unknown, *»Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WLTESSTSWVHAILLPQPPE 


8725 


22626 


A 


8787 


336 


10 


CLSLPSSWDHRRLPPHPASFCIS 


8726 


22627 


A 


8788 . 


1 


147 


I FYLLETGFHYVGQAGLELLTSGDSPAS 
ASOSAGITGVNHHALPEIHVF 


8727 


22628 


A 


8789 


3 


130 


DLEEGIQTLMGRLEDGSRRTGQILKLDH 
SSEFSKTPJSLYPVF 


8728 


22629 


A 


8790 


206 


3 


VPPSGPIKKGDKKKNLYLIFLIKNVPNK 

irT.PTrriTrGPT.aTPPippgppvf pt t tttv^ituxt 
HJjKAur V Z>ru£\l rccc r r r V i? IujEi Iv^rxUl - 

VGQAGLKIiLT3 


8729 

i 


22630 


A 


8791 


124 


390 


RAGAQSNLME PP PS GVKAI F FPNFPKKW 
GNGTPPPAPLIFGGLKKRGVSPCGPGGS 

DKFLN 


8730 


22631 


A 


8793 


2 


301 

* 


HE ERERERERERERERERER ERERERER 
ERBRERDALFAFFPPRGARPPPEIERGV 
FVGAJbSVREi YPTLiS I FFFSPPAHRGRE 
LRSRCVSHSLHLISRA 


8731 


22632 


A 


8794 


1 


285 


ARGERERERERERERERERERERERERE 

PLFFLGRGGALRAHAISPPPFLGAPPYI 
YI FFHMRARP PHRYI^ CAQRETRPAL 


8732 


22633 


A 


8795 


I 


321 


AR<j£*RERERfcKERER&KfiREKEKERVSR 
RI PRGDLRDQ FPHPLARSRDS FCGSADH 
FGRGVFNKVS WTD P PTHRVTSSLGGGV 


8733 


22634 


A 


8796 


2 


140 


HEERERERERERERERERERERERERER 
E 


O fi*f. 




- — 


C7CV7 

<s/y/ 




» 


oHRll rCCijtH K-d KDKrv KluvCrXvlS JKJI» K.CiK V o xC 

RNLSRGGGGVPPPLQNVRIHSGGPARGT 
LHHI KKKT SLTDVGLAQ 


OIJJ 








i 
1 


1 /O 


&Pf3T?P PP PPT7PT?P P*PT7PT?0 WOlPPWIf T G1? 

LGGGRGTQFAIERGVIQFCGRRYTTGRV 
RN 


8736 


22637 


A 


8799 


335 • 


27 


P PTKF FF FFFLVEMGFCCIAQGGLELLS 


8737 


22638 


A 


8800 


342 


191 


STSLSLPKCWDYRREPPCPANLSYFFKD 
P FS KCNPRIiR YWGLGIiQHRNFG 


Of JO 




a 


ooni 
ooUl 


1U4 




a\7PT.Twn7Tf THPT .W IfPl/KTO ROTMlf Q IfT .WT .P 
HV IrJJl WVJ\J.mrXJWJ\-KV WitgJIPIftJJVJLl&Jjxr 

CDPAI SLLSM YPKEMKSLCQRDVCTPRL 
S TGPLTTAGMWNPP KRSSMD 


8739 


22640 


A 


8802 


563 


258 


FFLNLETRSWYVAQARVQWLFTGTVTVH 

LTLPFLKLGLLGLGGGGVGYPSYFGIiEG 
FHNDCSRVRVLCMWSYO 


8740 


22641 


A 


8803 


2 


199 


ARGLLVRRFLSPSEKRSIRVGVTRFSRd 
RPSPLSLTRKGNSLTPCASQVRQCLALL 
RIiAHGACTH 


8741 


22642 


A 


8804 


266 


108 


KLWSKCVALPPSFSSSCSSHIGCACFP 
FTFHYDCKFPQASQAMLPEQAVEP 


8742 


22643 


A 


8805 


1 


140 


KWWFKRPGVYYGQCSEICGANHSFMPI 
VLELI PLKIFEIGPVFTL ' 


8743 


22644 


A 


8806 


2 


90 


SSAAAEENNDEKKEEAEKTEDDMEFGLF 
D 


8744 


22645 

• 


A 


8807 


229 


34 


KKKIFFFFPGFFWPPQKFFFKAPPPFFF 
FFFFFFFFFFFFFFLDLFI YLI ILSYTK 
LISIRAAAPI 


8745 


22646 


A 


8808 


43 


173 


NADSGHAQWLTPGMPALWEAEAGGSQDQ 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQIB 
NO: of 
peptide 
sequence 
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M 

eth' 

od 

- 


SEQID 
NO: in 

vsstt . 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

* 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leuclne, M=Methionlne, 
N=Asparagine, P=Proline>, 
Q=GIutamine, R=Arginine, S^Serine, 
^Threonine, V=Valine, 
W«Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














EIETILAGAGKPRLY 


8746 


22647 


A 


8809 


259 


134 


I YM I FKNKFFNRDRGLPMLHRLVLNWAQ 
VHiLSWPPKVLGL 


8747 


22648 


A 


8810 

• 


391 


193 


VLPKI PG I QFPP PCRFKKRPRPRFKKPP 
PKRKKISFSNPPKIWPPQGYFKRGPPRL 
FYFFFFWIL 


8748 


,22649 


A 


8811 


304 


3 


TTPPTNIFFCFLFFIFIFLWYFIFIFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
QCFIGHDDQKMMVHTHTQTSSHLLQGGS 
NPQGKGQSPGWVLWCA 


8749 


22650 


A 


8812 


395 


268 


FLVE TG FHHVGQAGLELP ASSDL PALTS 
QSAG I TG VSHHGWS 


8750 

* 


22651 


A 


8813 


782 


1174 

• 


LSRLFYFCVLFCLYMKTTQLPYFRGLVC 
LFVLRQG LTL S S RLE CSGM IAACCSLNV 
PGS TDS PTS ASRWRTTGVRHHTQLI FV 
CFVEMWFHYYAQAGLEPLGSSS P PALS S 
QGAGITDVSHHTPLELCF 


8751 


22652 


A 


8814 


95 


236 


ATMPGLKNIYFLKNKDKGLTMLPSLVLK 
SWARVILLPWPPKVLGLQT 


8752 


22653 

1 


A 


8815 


381 


58 


SCLLFPPFWGARGGGPPRAGGSNPPGPP 
GLTPFPPKKPKNYWARGGGPFIPPPRBG 
WAGG FFLPRRGRVPIiAPGSNKLEIiGPL P . 
S P PG YQ KKTP FS KKKKKE KKKRXR 


8753 


22654 


A 


8817 


441 


158 


FFF FLRTDGYLTMLPRLVSNS WPQAI LL 
PQPPKMLGL 


8754 


22655 


A 


8818 


415 


342 


RLVLDSWAQEILLPWPPKVLGLQV 


8755 


22656 


A 


8819 


400 


171 


NFF F P PGVKFIX3GGGPQFP PPQKRGF FQ 
KTPGGFFKP PP KKKKKFFP PP PKIGP PQ 
GIFKKAPPFFFFFFFFFFFF 


8756 


22657 

■ 


A 


8820 


2 

• 


268 


INIIIJ^TISLLGILVYRSHLMSSLLCL j 
EGI ILSLFI IATLITLNTHSLIiANIVPI 
AI LVFAACE KKKKGGAVLKDP WGGQSLR 
VLAR 


8757 


22658 


A 


8821 


2 


258 


LCLPNQAWAMAGSPPPASLLPCSLISDC 
CASNQRDSVGVGPS EPGVGYS LWRRFL 
SPSEKRSIRVGVTRFSSWVRMLRTVIPA 
T 


8758 


22659 


A 


8822 


250 


456 


GGGDKFGL I ETF PPGLKPFFIjLNLLSGW 
ELGPLAPPPFKFCFFKGRGFPFIiPRFVF 
VANLLLTCCKKRD 


8759 


22660 


A 


8823 


492 


376 


QENCLNPGGRICSELRSCLCTPAWATER 
ACLKKQNKTNQ 


8760 


2266] 


A 


8824 


1 


116 


LTPLPPSAPPSVDDNLKTPPEWVCSLPF 
HPQRMIISRN 


8761 


22662 


A 


8825 


2 


191 


FLVETGFHHFGRAGLKLLNSGGALTSAF 
QRAG I TGVT TGVSHCFE VNRVWTGTQYF 
LCYILKS 


8762 


22663 


A 


8826 


.393 


75 

■ 


PRFFPPPPPKIFFFPPPPFFFLGGFPPI 
PPPPKNFFFPKPPPGFFFPPPLKKKIFF 
PPPPFFSPPPNFFFPPPPPFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFSI 


8763 


22664 


A 


8827 


81 


380 


KI ITKHNIARTNPYTFCIMNYLKKKKKK 
KKKKXKRKKKKKKKKKKKKKKGGGPFLK 
NPGGGQNNPGGKKNFFFFFGGGKKNPPG 
FFKKKPFFGGGKFGAPPPPKN 


8764 


22665 


A 


8828 


354 


13 


TKPKTPPLTKGPRRGFIiAPRVPGGGGP I 
TPPPQNFIiGE KGTIiARKKVFP PGRPAPK 
RPPRKKNGFEMRPLKKNGPGPWKFLGFF 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

n nc left tide 

location 

cnrresnojitf] 

ing to first 
amino add 
residue of 
peptide 
seauence 

1 


Predict- 
ed end 

nude- ' 

otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F»PhenYlfllanine 1 
G=Grycine, H-Histidine, I=Iso leu cine, 
lC—\ jVsfnfL Ij=I.,eu cine.' M=lYIeth ion Ink. 

Am JtJjiJlllW, U^UVIJIW) IT* J T iVill 11/11 111 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y^Tyrosine, . 
X— Unknown. *=Stoo codon. /—possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LANP PGFGPKKKKPENQTHKKKCQRRKP 
RP 


8765 


22666 


A 


8829 


1 


215 


KHSTTHSHCPRTIKLKKKKKKKKKKKKK 
KKXKKKKKKKKKKKGGPPFKKTLGGPPF 
SPGGKKKFFFFLGGL 

mm/ mm ^mW^mJ 4}m*4*^^ % mm m* wr mm mtmf 


8766 


22667 


A 


8830 


1 


127 


ILFFWQRRGLALIiHRLVSNSWAQVILPP 
RPPRVLGLQVI YFK 


O/O/ 


ZZOOO 




OOJ 1 






FFFXFFXPFFFFFFPFIiFVFFFFFFFFF 

X> X L. mix. XT L <Vr Xr ft ft ft ft I r f Vftftftftftftft ft ft . 

FFFFFFFFFFFFFFFFFFFFFFFWCS CS 
YVLI F I OPFI KS FH 


K7A8 


i 


A 


R5517 

i 




OJ 


PPO KKFFFPOAPKI SGGGGPO IAP POKK 
GSFQKTPGGFFI PPQKKKKNYFPPPGKM 
GPPOGFFKRPPPLFFFFFFFFFFFFFFH 

^mVmm mm yg^J mm Am Mt^mM,^ mm *» A -1. ^ A. fc^fct*-*-**.** 

NRHL FKVKHLQNS FHLVQ LRL 


StlAQ 

o/oy 






OojJ 


J.7 


zoo 


NCLYRTKKKKKKKKKKPPYKKKTLRGPN 
FPPAGPPAPLPLSGGEKKPSRGLLRRPP 
TLGGAAKGPPPPPKLTPLRKKKIF 

mm. mmm m mtm mat mm- Mm mm mm 4m m> mmmmm mm mm Mtv<*4>Akikft 


8770 


22671 


A 


8834 


1 


146 


INIILAFTISLLGILVYRSHLISSLLCIi 
BGI ILSLFIIATLITPSTLC 

*JV mm* mm* mmmtmmT 4mmf4m mmm 'mm m\ mt 1m mmm imm> mm 4m mmW A 


8771 


22672 


A 


8835 


2 


79 


EDPKTSPKPKI IQTRRPGLPP3VSN 


o I /Z 




A 


oojO 


i 




PTR PT 1 1 TP I IiLTIi FL ITOLKI LNTNYH 
LP PS PKP I KKKKKKKKKKKKGGGP 


8773 


22674 


A 


8837 


152 


292 


AKTFNFYKVEFINVFFNGLCLIjIIKNTL 
PKKKKKKKKKKKKKKKKKKK 




*>7 |C7 < 


A 


OojO 


i 
i 


• 177 


VASNS IHPKAKENTAPHTHTHTHTHTHT i 

VmTm)imZf4m a > w X IVUUJil X^LXTXX x n X XX X XX X XX X XX X 

HTHTHTHLFMAI FK 


$775. 


22676 


A 


8839 


2 


106 


HVGQAGLELLTSGDLPTSASQSVGITGV 
SHRAWP 


o7 /o. 




n 


QQAfi 

oo4U 


- .„ , . 

J04 




APT »rY5RHT CT)CT) PT YP Q PHTV THTHTHT 

HTHTHTHTHS 


8777 


22678 


A 


8841 


266 


1 


NS LS VE FLILFP P YNYS P KQC I CSHQNC 

AHACDPNTFGGGGRWITRSVDRDHPGQH 
NETP 


8778 


22679 


A 


8842 


343 


52 


KTGFRHVGQPGELLASSDLPASASQSAG 
I TGVSHR VR PGLH I LDNS S FLDTS FAD I ! 

mmm X VJ V 0XX£% ▼ X% XtVJ JLIuXl » /Xl O wXT JJi/XO XTX^XV X> | 

FSVLYLKAGIASLLHIYIHFLLPLRDLL 
LVSSLTVSFPTAV 


8779 

* ♦ 


22680 


A 


8843 


256 


35 


LFIFSLSLQYIFCVCGCVCVFMSLCFLV J 

CAWGHVOLHVCSCVRMCVCENDVCVCGW 

VQMCFCIKNFQKEVYQI 


O/OU 


ZxOa 1 


A 


Oirrt 


j j j 




PPIKKKAPPPKGRAFFFFFLKKKKGGP P 
PPQKKTGGGGPKKRGGVKKPPPGFFPGF 
FGPLFFWGPPFLPPFFFFFFFFFFFFFF 
ETFFFFFFFFFFFFNFISTKHIiF ! 

ftAftftA&ftftHftft* mm 4m m,m 4*. mm* immf mm *«*»A4fc 


8781 


22682 


A 


8845 


15 


288 


RCGLTVLPMLVSDCWAQVILLAQPPRVL 
GLQAGAS AST P I TSVNCSQGAIYPEVGT 
TGSTI IAAASSGGEFDSVLQNDICMCFL 

mm ^m* mmm ^ ^ Mmf mmm mmmm' mmf ^m* mmtt 4m mmmrmMmV W Mf^AIA/ * ^» mm mmm 

TQQFHF 


8782 


22683 


A 


8846 


1 


397 


ECAHHTQLIFLFLIESSLHHVGQAGLKL 
PISSDPPTSASQSAGTTAVSHHA 


8783 


22684 


A 


8847 


1 


179 


ARERERERERERERERERERERERERER 
EREREREREREREREREREREKKKLGGG 
GGGPQFPIGGGLTQGGGGG 


8784 


22685 


A 


8848 


2 

» 


233 


HEERERERDRERQRERERERERDRERER 
ERETFFSSSGGGPKETTLFFETGAPKEG 
YQNMHARLI PSAVPFKKRGRG 


8785 


22686 


A 


8849 


2 


180 


HEERERERERERE RERERERERE RERER | 
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SEQ ID 
NO: of 

n nr l<»nH Hp 

■J UllvUUUv 

sequence 

■' 


SEQ ID 
NO: of 

npntiri^ 

sequence 


M 
eth 

■ 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 
beginning 

n n f*l p a tifl ^ 
II u wicuimc 

location 

co ires no nd 

ing to first 

amino acid 
residue of 
peptide 

wijiiwiiiiC 


Predict- 
ed end 

nn r*l 

II UL4w 

otide 

location 

correspon 

dint? to 
last amino 
acid 

peptide 
spam price 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 

IT=/"Z lii to m ■ I s Ariti PWPhpnvlfilanitiA 
E^^xjr iuuiiii ii. m.iU) r * ucii^ra uidiiic, 

G^Glycine, H^Hfatidine, l=Iso leucine, 

K'—l.vsinp I .purine ]V1= p# h 1 An i n p 

N=Asparagine, P=Pro!ine, 
0=Glutamine. R=Arainine SsSerine. 

T=Threonine, V=Valine, 
W=Tryptopban, Y^Tyrosine, 

V=2lln knnwn *sStnn rnrlnn /=zaa««IH)p 

nucleotide deletion, \=possible 

niirlpotirip insertion 














. ERERERERERERERERDFVFYRWGBGHI 
DRQ YCS PLCP6WCHCPM 


O fO\J 






8850 

OOJ\J 


■ & 


79 


PEERERERERERERERERERERERERER 
ERERE 


8787 


.22688 

* 

■ 


A 


8851 


1 


178 


ARGERERERERERERERERERERERERE 
RERERBRERERERERERERARLFARGEK 
KKHTGGGPPLFYYLYI 


8788 


22689 


A 


8852 


3 


76 


TRRE RERE RERERERERERERERERE S P 
A 


o/oy 


t*i jean 
226VU 


A 


00*7 


< 

5 


2Z7 


EERERERERE RERERERERERERERERE 
RERDFFFFFLEKKYLFSPIGELKGRGGG 
SLTPLLTGKWNPCPPPPH 


8790 


22691 


A 

• 


8854 


1 


189 


ARGERERERERERERERERERERi 
RERERERERERERERERERERERI 
RHSRLFPRESVCVARAGGASSPPI 


2RERE 
5RERE 
IQ 


8791 


22692 


A 


8855 


2 


Aft 

271 


HEERERERERERERERERERERBRERER 
EREREEPPRGEKYCIRVPPQKILSPACD 
SLLYVTHNLCVSHSTPRGERYVALFNSR 
ETRCF 


8792 


22693 


A 


8856 


2 


309 


HEQQQQQQQLRNLRDFLLVYNRMTELCF 
QRCVPSLHHRALDAEEEACLHSCAGKLI 
HSNHRLMAA YVQLMPALVQRR IANYEAA 
SAVPRVAAEQPGVSPSGS 


8793 


22694 


A 


8857 


44 


203 . 


GYSTCVGMHAHTH I HTDTHTHAHTY IHQ 
CLIQIGLCTFATCTSQVNKSSMASL 






a 






182 


IIFVCLVERGLHYVGQAGPELKASDDLP 
ALASQSAGTSGVSHCVSSKMS 




22o9o 


A 


oo5y 


1 


175 


GLTLPPRLGCHATITAHYSLNLPGSSNP 
PTSASQVAGTTHSRPCPEITQTALQRGP 
HD 


8796 

• 


22697 


A 


8860 

- 


3 


350 


HEAIGLRSNIKNGLDHFLPLGTPTPL I P 
ILAMI ETIIiLLVQP IALAVRLTANITAG 
HIiLMRLNGSATLAISTISLPSTLIIFTI 
LILLT I LE IAGALIQAYVFTLLVGLYLR 
DNA 


X7Q7 


77£QR 


a 


ooDI 


toy 




TFKEADIKGGGPPHLASFCIFGRI 
CWPGWSRTPDLKRSTRA 


JGVSP 


8798 




A 

• * 


RR62 






S PTPPPSSKPSS I PRKSSVDLNQVSMLS 
PAALSPASSSQRHES 


8799 


22700 


A 


8863 


346 


219 


RLVP PClaDNFLNFCRYSCLTMLPGLVSN 
SWAQVIFLPQPPKD 


8800 


22701 


A 


8864 


1 


147 


GTRFF FERG IDHVGQAGIQLLTLGDP PT | 
PASQRAEITGVSHHTWPNFIF 


OOvl 




A 




967 




KKKFWPFFFFFLRDGVSLLLRLVSNCKA 
QEILLPWPPKLLGLQV 


8802 


22703 


A 

* 


8866 


343 


215 


LAQGLKTP I PKRVPPHTQKKKFCS ERPT 
RIQTDRSCWVSPLS 


8803 


22704 


A 


8867 


1 


242 


GTS KKI DKLD Y I BLKCFCTAKETI S RVK 
RP VEWE KI FANS S P E KGL I S R I NKEAKK 
LNSSAIAAHNCNPSTLGGQVRRIT 


8804 


22705 


A 


8868 


1 


219 


GTRRPTWATWQNPASTKKKKNSFFFETE 
SRTVPQAGGQGGNLGSLQSLPPGLKRFS 
CLSMPGAPKGQSGSLMG 


8805 


22706 


A 


8869 


1 


181 


ATGG I LLLLD WS LAYE S KHLLEGAKSE 
SAEELKKRAQELBGKLNFLTKIHEMLQP 
GQDQ 


8806 


22707 


A 


8870 


1 


255 


AAAPPNAPGGPPGPQPAPSAAAPPPPAH 
ALGGMDAEL I DEEALTS LELELGLHRVR 
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SEQ n> 
NO: of 
nucleotide 
sequence 

■ 

• 


SEQ ID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H«Histidine, Wso leucine, 
KpLysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argin!ne, S=Serine, 
T^Threonine, V=VaIine, 
W«Tryptophan, Y^Tyrosine, 
X<=*Unknown, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide Insertion 














ELPELFLGQSEFDCFSDLGSAPPAGSVS 
C 


8807 


22708 


A 


8871 


383 


233 


ATTFPNPGFLFVRLFVLVQWPDHBPPGF 
KQSSHFSLPSSWDYRHPPPPPG 


8808 


22709 


A 


8872 


211 


294 


AXKKKKKKKKKKKKKKKKKKRKNI I WGG 


8809 


22710 


A 


8873 


398 


100 


QKERPPGFPPFKPGPPQKRGGKGAPPKD 
GEHYFFWVLEKSTGFPPKTKRVYYPSPV 
YPHPPAPQKEGFPGGSPPPRPIASPFGG . 
KKKEKKSYIEKENLF 


8810 


22711 


A 


8874 


3 


214 


DAWVLGILVYRSHLISSLLCLEGIILSIj 
FI IATL I TLNTHSLLEKKKKKKKKKKKK 
KGGAFKKNPGGAQI 


8811 


22712 

i 


A 


8875 


2 


254 


PRVRPEGRNRTPQSWFQANPMACMTFSK 
KK2CKKKKKKKKKKIGCKKKKRGGGFI KKM 
GGGKNFREKEKKNFLLIRGKIKKTGGDF 


8812 


22713 


A 


8876 


2 


135 


TINLPSTLIIFTILILLTILEIAVALIQ 
AYVFTLLVSLYLHDNT 


8813 


22714 


A 


8877 


2 


296 


IiARGAEVLG YGS HSRGRVPGALVGQGAG 
RLFTEHPGSSPATLAIYPTPEGTSVAVS 
ISAPPKARSRPYPPSRSCHNLSIAGSSV 
LVPPGPVHRWVGRP ' 


8814 


22715 


A 


8878 


164 


29 


RKIGPARWLTPVI PALWETEAGGSRGQE 
IEI IIANTVKPGACEVIi 


8815 


22716 


A 


8879 


381 


219 


GEKMTE E EVHMLVAGHEDSNGC INYEGK 

R 


8816 


22717 


A 


8880 


2 


156 


ANGNS FATRLSNI FVIGNGNKPW ISLPR 
GKGIRLTVAEERDKRIATKQ3SG 


8817 


22718 


A 


8881 

• 


3 


402 


SGFNIEYAAGPFALFFIAEYTNI HINT 
LTTTI FLGTTYDALSPELYTTYFKKKKK 
KGGRPFFFFFFLKKKKIPPPKKKNFGKK 
GETLKGRGGAP I FQPKKILS P PKKKKKK 
RGGAFKKSLGGAKFNGGGRN 


8818 


22719 


A 

• 


8882 


1 


406 


RYSTPSEGEVGERYSTPPGETLERYSTP 
PGETLER YS TP PGETLERYS TP PGETLB 
RYSTPPGETLERYSTPPGEALERYSIPT 
GGPNPTGTFKTYPS KKKKGGGRFKEPLG 
GPRFTGVGRVKVFSLWGGVLNAR 


8819 


22720 


A 


8883 


484 


307 


KKILNRRVR WLTP VI PTLWETEAGSSRG 
QEMETILGNTVKPPASASQSAEIKGMRI 
IIP 


8820 


22721 


A 


8884 


1 


131 


GS LTHHI NNIKPS FTRENTLMVI HLSP I 
VLLSLNPDI ITGFSS 


8821 


22722 

* 


A 


8885 


406 


106 


KGGAGPKIAPPKKKAFSPIPPPFFWPPP 
VFLKGPPPPSPFNFFPPPVFFFRGPLFF 
FFFFFFFCETWLPFRPSNLVFTLPHVF 1 
AWLIPIDQYLFISHSC 


8822 


22723 


A 


8886 


425 


325 


SSTHTHTHTHTHTHTERSHLLAFEPSAG 
ELWMAM 


8823 


22724 


A 


8887 


394 


278 


VCVC VCMCVY IC I YPCWLPCWHMCLELY 
KVFKGIMKGV 


8824 


22725 


A 


8888 


2 


86 


Y SRTRAAAL I QA YVFTL LVSL YLHDSAY 


882S 


22726 


A 


8889 


400 


62 


FLYLKSFP P PPERGVFS PLPPQKFFFSP 
KALNFLGGGG PKFP PPKKRFFFKYPPRV 
FLYPPHKKKNYIFPPPDDFGPPRVFFKA 
P P PHFFFFFFFFFFF FFFFFFTRLEWVQ 


8826 


22727 


A 


8890 


3 


100 


SIaEPGTSGLKGFSCIiTLPSSEDYRHEPP 
CPAN 
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1 SEQID 1 
NO: of 

n ii r I^Atift *» 

U UUvuUUC 

sequence 


SEQID 
NO: of 

pcpuuc 

sequence 

* 


M 
eth 

ad 

• 


SEQID 
NO: in 
USSN 
(B9/S15,l 
26 

- 


Predicted 
beginning 

location 

corrpinnnd 

ing to first 
amino acid 
residue of 
. peptide 


Predict- 
ed end 

11 U VI v" 

ottde 

lUVoUUIl 

correspon 
ding to 
last amino 
acid 

rociffiio (it 

peptide 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 

F^Glutarmc Acid F=PhenvlfllnnfiM» 

G=Glycine, H=Histidine, I=Isoleudne, 

K=I*v«in< k 1 ;=I<£iicln£. M=Meth inning 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V«Valine, 
W-Tryptophan, Y«=Tyrosine, 

ydfnlfnnum ^sC^An rni)fln ^sfwiaeffifo 

nucleotide deletion, \=posslble 

nnrlpotirfp inCPi-Hnn i 


8827 


22728 


A 


8891 


32 


142 


. VRPTRPGQVASVLYFTTILILIPTISLI 
ENKILKWA 


8828 

■ 


22729 


A 


8892 


3 


382 


TPHNP LSRP S YE P KPASTPSRAPKKKKJC 

LGGPKLPGGKKKNFFFFRGGKKKPPGDF 
LKKTLFLGGGNLAP PP PKKKNP WGKICKI 
FWGEGGKKNFFFFG 


RR7Q 

OOX7 




A 


OD7J 


000 


100 

* 77 


KKKRI S P PSRKRAPOGBFLRPTPP I KNL 
YYYLSFFFLILYIFFFFFFFFFFFIFFF 


ODjU 


i-L I Ji 1 


A 


0O7H 


•j 
j 


1JO 


TINLPSTLI IFTTLILOTILEIAGALIO 
AYVFTLLVSLYLHDNT 


8831 


22732 . 


A 


8895 


1 


144 


IATINLPSTLI IFTILILLTILEIAGAL 
IQAYGFTLLVSLYLHDYPYN 


8832 


22733 


A 


8896 


153 


375 


PQKKKKKKKKKKKKKKKKRIIKKKHGGGF 
RGFFGKNPFFWGGKKKKN 


8833 


22734 


A 


8897 


332 


400 


WLTPVI PXLWDAEAGGS FEVRS S 


8834 


22735 


A 


8898 


215 . 


359 


TELVPNAVLFC P L PCSSGHHLLCAIHAK 

KV1 IMFJMJXyj-iAKKXKUbiKA 


8835 


22736 

< 


A 


8899 

> 


414 


73 


PRLLCFHKKAWAMAGALPPALLPPYSLI 
SDCCASNQRDS VGVGPS E PGVGYNLWR 
RFLSLSEKRSIRVGVTRFSRCCPSPLSL 
TRKGNSLTPCTSRI1RQCLPLLWI1AHGAR 

•TO 

In 


ooJo 


111 J / 




oVW 




00 
00 


PQKRVLPQKPPGGFFSPPLKKKKFSFPP 

PVKT/lPPTffS^FITfiPPPPFFFFPRFFPfiT, 

DFMYTGSSNVW 


8837 . 

- 


22738 


A 

• 


8901 


433 


56 


PEGPPPPGGFYPGKRGFFPTPLWFGGP 
KRD P PPRG KKNPfi WRSGGKOTPPJRPRDP« 

LLKAFFPGGPPQRLGKAQGPFWFFFGPK 
KGKTOTGPPFFFKKLGDPWGPKKKPPPG 
GAPRLGPPKKKFL 


OOJO 




A 

*•* 


OQfl? 


3 


ID 


IAVAL I OAYVFTLLVSL YLHDNT 


8839 


22740 


A 


8903 


3 


149 


PNLSYI IGKDTWVEHWPBEDECQDEENQ 
KOCODLGAFTE S MWFGCPN 


8840 


22741 


A 


8904 


351 


137 


FPPEFFAAPKKSGILFFSKTWMEVEVLF 
LSQLMLGQKTQYCMFSLINGRIil I KAPG 
SIiKGDHHPPLGIiLEG 


8841 


22742 


A 


8905 


142 


5 


KI FLQKKKKKKGIjGTVAHACNPS TLGGQ 
GGWIMRSRDQDHPGQOW 


8842 


22743 


A 


8906 


367 


244 


HVPQARAKFFC I FSRDGVS PYWPGWSRT 
PDLMI CLLFIVBL 


8843 


22744 


A 


8907 


146 


248 


SQILGRLRQENCLNLGNGGCSEPRWCHF 
IaAWAKE 


8844 


22745 


A . 


8908 


2 


197 


AAASTHHARLFLCVCLFAYLLVEMEFHH 
VAOAGTELLDS SNPPPLASQNAG I IGMS 
HHTQPQYIC 


8845 


22746 


A 


8909 


1 


127 


KKTRLERAQ WLTPVI PAFWNAEAGTLLE 
ARSLRRLPWPPNVL 


8846 


22747 


A 


8910 


368 


280 


LTMLLRLVLNS WAQAI LLTWP P KVLGLQ 
A 


8847 


22748 


A 


8911 


355 


211 


QQQNLHLFWPGTVAHTRNSSTLGGQ®4R 
I TGAYE FETTLGNTARPHL Y 


8848 


22749 


A 


8912 


325 


190 

r 


KKKKKKGGGGPQKKKKNPPPPPTSKKKK 
KKKKKKKKKKKKKKKKKKKKK 


8849 


22750 


A 


8913 


1 


411 J VNVPAPRGGAYRGRQASFS SGGLHP VPA 
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SEQ ID 
NO: of 
nucleotide 
sequence 

■ 

* 


SEQ ID 
NO: of 
DeDtide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


[ Predicted 
beginning ' 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
seau en ce 

1 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
dins to 
last amino 
acid 

residue of 

W ^0**r *t H OT • 

peptide 
sequence 


Amino acid sequence (A-AJaoine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. ^Phenylalanine. 
(^Glycine, H=Hlstidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=Glutamine* R— Arsinine. S=Serine. 
^Threonine, V»Valine, 
W^ryptophan, Y=Tyrosine, 
X=U n kn own . * ss StoD codon./=oossible 
nucleotide deletion, V=possib)e 
nucleotide insertion 


• 






• 

• 






YRPLCLPNQTTNSAMAGAPPLASLPPCS 
L I S DCCASNKRDS MGVGPS ETGTGYNLL 
VC2iPLSPLEKCSIRVGVTRP3RC3ai3PL 
SLT1UCGNSLTPCTSRVRRCIALLRL 


8850 


22751 


A 


8914 


258 


9 


GAIPCLHCBTNPSHISSTQELQTPEPQL 
PGVPLBLPPPGS CFKCRKSGHWAKECPQ 
PGIPPKPCPICAGPHWKLDCPTGPRL 


8851 

* 


22752 


A 


8915 


326 


42 


EKIHRKYFCYS FFVFLVEMGFYFVGQAS. 
LKLIiASSDLPASPSQSARVKAMSHCVRP 
VLVI FKNKILQNTENSTKI YVYPPPNNQ 
ILTFCHFCFK 


8852 


22753 


A 


8917 


j 


106 


GLELPTSGDLPASASQNAGI TGISHHAR 
PIMTYS 


8853 


22754 


A 


8918 

0710 


320 


171 


AASTTDGSYKCLCliPGYVPSDKPNYCTP 
LNTALNLEKCPFGLPHLSGSS 


R8S4 


— x / <j.j 


A 

x * 


R919 


1 

1 


201 

XU 1 


VKPSDRYHLMP I FTAAYPHQNS TYNVP V 
STRMVMAEE FKQGKRLS LCSDL YRKDLR 
TIVDPWSCAT 


OOJJ 


2975A 

XX / JO 


A 

• * 


8920 


*l 

<7 


182 


LLRPLEKRG I RVGVSRFSR YHLSRLP FA 
RKQNS PTPCASRVRRCPS liLHQIiHPLS D 


8RS6 

OO JO 


29757 
XX / J / 


A 


8921 


142 

1 *rX 


1 
1 


SCH PGWSAVVRPOIjLRRLRYENCLS LGG 
SGCSELGSCHCTPAWTTEG 


8857 


22758 


A 


8922 


1 


117 


LFFNYAWGWSTTLIiSRLSLNSWVHVILT | 
PWPP KELGLQA 


OOJO 

• 




A 

Xk 


809 3 


3-S7 
j j / 


191 

1X1 


LGAFSDGLAHLDNLKGTFATLSELHCDK 
LEVDLENFRLLGNVL VCVTiAHH FG KE FT 
PPVQAAYQKVVAGVANALAHKYH 


RRSQ 


XX /ou 


A 


8094 

o7a^ 


343 


133 


LFFNFNFLETRLRE WL I FVFSVETGFP I 
KRAGLKLQASSDPPARGSQSAEISGMSH 
YAWPCKSNRAFKCS 


8860 


22761 


A 


8925 


321 


44 


PDSGGS PASASQVAGTTGACHDAWLMPI 
FNMDININIPFSAKRDKLSMDKINEI IF 
SLLINRISSFQIMYFLCSSRIESRKRWL 
KQTGRKKE 


8861 


22762 


A 


8926 


335 


106 


HFVFLIETG FHR VGHAGLELLTS TDPPA 
LAYKSSGITGVSYRSFTKLHLCACVCLC 
IYTYLGLVLSFTPGSWEKKT 


8862 

* 


22763 


A 


8927 


2 


160 


XjSRVPPRSANLS IS vfaekrchftmlpr 
PISNSWAQGILLPSQAPKVLGLQA 


8863 


22764 


A 


8928 


3 


140 


RVITEEEKNFKAFASLRMARANARLFGI 
RAKRAKE AAEQDVE KKK 


8864 


22765 


A 


8929 


126 


59 


NSGGGGCNELRS CHCTPAWATE 


8865 


22766 


A 


8930 


3 


104 


KNMAC YCRIPAC IAGERR YGTC I YQGRL 
WAFCC 


8866 


22767 


A 


8931 


311 


112 


MENYLSKMQQELEKNITRELKEAAAELE 
SGS IASPLGSTDESNLNQDLVWKASREY 
VQVLKKNYMI 


8867 


22768 


A 


8932 


345 


134 


GGGSSPWFPPFWRPRRADSLRVGVIiNPP 
GPGGGAPFFLKNPLIGWGGGARLWFQLI 
LLRVRPENSFDPGR 


8868 


22769 


A 


8933 


262 


39 


RPRRPHPGNFFFFFNSGSHYVAQAGLEL 
LDSSDPTLPSQSAGTTGMSYQAWPLDHT 
FENRDCVLLHCSSPVSNT 


8869 


22770 


A 


8934 


84 


22 


TPAWVWWLTPVI PALWEAKAG 


8870 


22771 


A 


8935 


277 


111 


ERI iNHAiVGSHGVSG I FMK^DLSSLiMvf - 
VIEBHMPFWQPFVRLCGIVGGIFSSTG 
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SEQID 
NO: of 
nucleotide 
sequence 

. ■ 
* 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 

. 


SEQID 
NO: in 

09/515,1 

* 

■ 


Predicted 

beginning 

nucleotide 

location 

correspono 

ing to first 

amino acid 

resiaue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
locauon 
correspon 
-ding to 
laST ammo 
acid 

resiaue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=>Aspartic Acid, 
Ci— oiutamic Acid, r— r nenyiaianlne, 
G=Glycine, H=HIstidine, Msoleucloe, 
IV— Liysine, iv—ijcucine, ivi— ivieuiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i = i nreonine, v^vaiine, 

W=Tryptophan, Y=Tyrosine, 
a— un Known, — oiop couon t / = possioie 
nucleotide deletion, ^possible 
nucieoiiGc insertion 


8871 


22772 




8936 


2 


104 


. DS AI PRRliECLKEDVQRQQEREKELQHR 
YAEFVF . 


8872 


22773 


TV 


8937 

■ 


1 


193 


KKGSISGFTMVLAGYMLFIYCLSYKELK 
HERLCKYH 


9911 






QQ1Q 




Tin " " 


VGESLEVRNI I YILQNMSNIGNIHLI I K 


8874 


22775 


A 


8939 


329 


177 


RWNIAMLPRLVLNS WAQAI CPSWPLKML 
GLEAJE P S CPARTNFKVTS PRJjH 


8875 


22776 

i 


A - 


8940 


358 


120 


HPSNFFVFLVEMGFHHVGQPGIiELFPAR 
YVPTLAFRSGGITGISHCAWPKEWALPR 
KASPGLSGRRGSPHCATCSSKAW 


8876 


22777 


A 


8941 


333 


104 


PHVYSNGHICLSILTEDWSPALSVQSVC 
LSI ISMLSSCKEKRRPPDNSFYVRTCNK 
NPKKTKWWYHGKQFEFVITF 


8877 


22778 


A 


8942 . 


1 


241 


APLVAFTQVNLEDKGGLSKLVEAIRINF 
NDRYDE I CHHWGGNVLGPKSGTRIAKLK 
KAKARELATNW X KCTLLS FLY I KR 


8878 


22779 


A 


8943 


1 


240 


AP LVAF TQVNL EDKGGLS KLVEAIR I NY 
NDRYDE I CHHWGGNVLGPKS VTRIAKLK 
KAKAKEIATNWIKCTLLSFLYIKR 


8879 


22780 


A 


8944 


306 


150 


AASTLC PRLECS GAI MAHCS LNLLGSGD 
PPASASQVGGARGSLEPGRWRXiQ 


8880 

i > 


22781 


A 

• 


8945. 


284 

■ 


2 


GKSRRS PLDLCTSGFTGSTHFTLI ICWV 
PRYVATCPPIGLNFVFIKTWWSHYIVQA 
DLKLMVSSNPPVS AS QSAGNTGMSHHTW 
LENRLLTTKK 


8881 


22782 


A 


8946 


1 


226 


WSFALFAQAGLEFLTSGDPPASASLSAG 
XTGMSHUAQPAVSLNSIFSASTFCATSS 
YCAKMELKELKFML I VDAA 


8882 


22783 


A 


8947 


363 


96 


NTAPGYMPLLS I PFLVLSKAVFLFLLGM 
FTAALLI IAQRWEQPKCASTDEWINKMW 
HMHTMEHYSTCCGVHAAVNISIQISEWN 
CWEW 


8883 


22784 


A 


8948 


1 


208 


GRVGQPGLRLLTSGDLPASASQSAGVAG 
VSHRAQPTPALNCVLTCWVLQHYIKHNH 
GKRSEYFTIFNDM 


8884 


22785 


A 


8949 


69 


298 


SRGHQRVADAALLSFPDVI CLGLMLWGS 
SSIVCILHRHKRRVQHIHRTSVSPTSSP 
EdGATKTI uTiEQERTRSPKS 


8885 


22786 


A 


8951 


2 


201 


LYDLLNMRSISLPLPSLVCIHSHTHEHT 
YTHIHTHAHTYTHTHSHWVLSLKQKKMV 

VTJT5T1T5T OITTaTT 


8886 


22787 


A ■ 

* 


8952 


1 


285 


GASPPASLPPCSLISDCCASNQRDSVGV 
GPSBPGAGYNLVVRRVLSPSEKRSIRVG 

V I Kr bKU<r br JjoJj 1 KJUiNoLt L Pt-AoK V 

RQCLTLLWLVH 


8887 


22788 


. . .. 

A 


8953 


A 1 

341 


256 


LiLjiAstjvj vJb fc, fKWrUiK 1 W\W V j, hKUr J. o 


8888 


22789 


A 


8954 


227 


43 


I LFF IGKHTHTHTHTHTHTHTRAKCLFP 
PARFPTSPNSTTNCTFPTEPAANPRGLV 
RAGLG 


8889 


22790 


A 


8955 


310 


79 


RPRRPGLMERVNVF X FS ILRES F YKXR I 
TNQAQCCMPIVPVTQETEAGRSLEPKSL 
RLAWAIQQNS I SKKNNNNNFF 


8890 


22791 


A 


8956 


374 


198 


FFFFLPNRHGLAMLPRMVLHWVQAVLPQ 
PPKVLGLQG 


8891 . 


22792 


A 


8957 


138 


10 


RPRRPRLCQAQWLMPI I PAL WES VAGGL 
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SEQ ID 
NO: of 

n iiflf»ntl rip 
sen nen ce 

• 


seq n> 

NO: of 

|#C|/UUC 

- 


M 
eth 

uu 

■ 

• 


SEQ ID 

NO: in 
I ISSN 

09/515 1 


Predicted 
beginning 

n 11 r IpAtifto 

U U HCUUUC 

location 

■VVAIIVII 

correspond 

int? to first 

'■■ft *V III tfV 

amino acid 
residue of 
peptide 
seauence 

» 


Predict- 
ed end 

nitf*lfk_ 

IIUUC 

otide 
location 

ding to 
. last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
0=Cysteine, D=*Aspartic Acid, 

E/~\JIU lallllL /\.CIU| r — i IIClljrMf liUIIRc, 

C5=Glvelne H=Histidine I=I«nIenrin* 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T«Threonine, V«Valine, 
W=Tryptophan, Y=Tyrosine, . 
X=Unknqwn, *=S>top codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LEARSLRPAWFFFNT 


8892 


22793 


A 


8958 


354 


216 


AGPRVFPGGEGGFRPPPPGGS PPQQ 
GGESGGKPPPPAHWVFF 


PPK 
GGlT" 


007J 




A 


RQSQ 

07J7 




79^ 


LQAARS KKKKKKKKKKKKKKKKKRG 


8894 


22795 


A 


8960 


259 


11 


FEALKDPSNNLQSLFSFIVTVLTVAHVY 
EKLSTLTSLAARRGLAMLPRLASNSRTQ 
T ILLPWPS KVLRLQARATE PGLGLLL 


8895 


22796 


A 


8961 


334 


213 


AA3TKNFERMEWGLTTLPMLVLNS WPQV • 
ILLPWPPESMRL 


8896 

• 


22797 


A 

* ' 


8962 

1 


260 


3 

; 


KGGFFPIPFIGIKPGFFFSPALIFPKVF 
IFSQNFFVFLKKIFFFFFVETVSSHWK 
AGLELLASSNPPAFTSQSAGMTGMSHHA 


RRQT 
OOy / 


OTTGR 
ZZ/70 






4ZD 


Z«HS 


ADEWIHKMRHIHMIEYYLAIKRNEMLIH 
ATAGMHLENILSEKDHMYDS I YMKCLAW 
ANL 


8898 


22799 


A 


8964 


1 


118 


GKLMELHGEGSSSGKATGDETGAKVKRA 
DGYEPPVQESV 


8899 


22800 


A 


8965 


2 


130 


GRVGPRMALNSWAQVIHLPWPPKVLGLQ 
VAMKRKIMILSTSY , 


oyUU 


ZzoUl 


a 


oyoo 


1 in 

1 iy 




LGWYLIEKIQICCSPpLGADPFQQMISP 
LRDGSLTVLPKLVLNSWPQVFLPPQTRK 
VLGLQAEVTAPGHI I FMI IMINTFK 


R901 




A 


RQA7 


73R 
ZJo 




TRCLRLLPRLVLNS WTQALLLPS PPKVL 
GLQV 


8902 


22803 


A 


8968 


297 


169 


NPP P PHI YKFFF I FN I FFF FF FF FFFFF 
FFFFFFFFMSKILIF 




77Rfid 




oyoy 


z 


z iz 


RVRPRDRPLVRVRVGRNKDGAETTPSPG 
LLPAHLTFPLDYHLNQPFIFGLRDTDTG 
ALLF I GKNMDPRGP 


8904 


22805 


A 

• 


8970 


242 


357 


TLDIIKSFKKKKKKKKKKKKKKiCKK 


KKK 


8905 


22806 


A 


8971 


191 


265 








90Q/Y7 

ZZOV f 


A 


RQ77 


1QR 


07 

y/ 


MFLYNRILWFFSPHPPLRFFFSPRATIF 
liGGGAPHFPPPKKGFFSKKPPGVFLSPP 
QKEKIFSFPPPLFLAPPKDFFKSPPPFF 
FFFFFFFFVSTGSSVS 


8907 


22808 


A 


8973 


370 


265 


YNSSLPSSWDYRHVPPCPDNFCI FSRGW 
SQTPDLR 


8908 


22809 


A 


8974 


3 


119 


GLKLLTSSDPPASASKTAGITGVSHHTQ 
TIFVFSSHKS 


8909 


22810 


A 


8975 


1 


110 


VAQAGFKLVSSANLPTIiASQSARVTGVS 
HRARSMKQ 




77R1 1 
ZZOl 1 


A 


oy /0 


j 


1 3Z 


WNQPKYPSTGEWMEKMWYIYTKLVQPMA 
HGPHVAQDGFEGSPTQIHKLS 


RQl 1 




n 


ROT? 
07 II 


J / J 


lyj 


FQDLPLGSTSNISFFKRWSPAMVPRLIS 
NSWPQElLLPCft'PKVLGITGMSHCTWPG 
GHISI 


8912 


22813 


A 


8978 


2 


395 


RAALPTQASAMAGAPPPASLPPCSLSSD ; 
C CASNERE S VGVG P S E PGAG YNLLVCYL 
LRLLEKRS I RVGVTRFSRCHLSPLSLTR 
KGNSLTPCASRVRQCLALLWLTTiSALYP 
LS CTHCQTLPS EMNLVPQL 


8913 


22814 


A 


8979 


120 


19 


ARVQWHDHGS TATLNS WPQVI LLP WP PK 
VIjGLQA 


8914 


22815 


A 


8980 


204 


48 


LNRDKGLTMLARFVRNSWPQAIPLPWPP 
KVLGLQALATAPCQMS FLRAPLS 
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SEQID 
NO: of 

nucleotide 

IIUvlVvUUV 

sequence 

■ 


SEQID 
NO: of 

nentidp 

sequence 


M 
eth 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

location 
corresoond 
ing to first 
amino acid 
residue of 
. peptide 
sequence 


Predict- 
ed end 

UUUv 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

npntirie 
sequence 


Amino acid sequence (A»AIanine 
OCysteine, D»Aspartic Acid, 

F^Glutamic Acid. F^Phenvlalnninp 
G=Glycine, H=Histidine, I=lso leucine, 
K—Lvsine. L^Leucine. M— Methionine. 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine. V^Valine. 
W=Tryptophan, Y=*Tyrosine, 
X=Un known, *=Stop codon t /=possible 
nucleotide deletion. \=nft*tsihle 

HUWIVUilUV U^ICLIUII| ^UUd3llflv 

nucleotide insertion 


8915 


22816 


A 


8981 


335 


221 


FFFLFLFFCMITSHCSVNVLOASDS PTS 
VSHIAGTIGT ' 


8916 

071U 

- 


22817 


A 


8982 

0706 






LSHMPLLFVSLYPDHRAFMQVI PFFPET 
ESPSVTRIiECSGAILAHCNVCLPGSSDS 
PASASYAGYSFYL 


8917 

071 / 


29R1R 

t 


A 


0707 


r 


116 


GQGRGVETES CS VTQAGVQWCDHHSLLS 
LELLGS SDPS TS AS P VTGTTGGHYHTS 8 
HFLFSVSSSYFPTPVCLAYR 


8918 


22819 


A 


8984 


350 


263 


LPMLPRLVSNAWPQAI FLLWPPKVLELQ 
A 


8919 


22820 

i 


A 


8985 


3 


159 


feEYNLLVCRLLSPLSKCS^ GWWOySfe ' 
CHVS PLSLTRKGNSLTPCAS WVR 


8920 


22821 


A 


8986 


2 


253 


KHWGELKNDLS I CGTLHS VDRYLNI KL 
YMQLPADEVHKQLLQDAARKEPLQQKQ 


8921 


22822 


A 


8987 


2 


115 


GARIVGHLTHALKQGEYGLASICNGGGG 


8922 


22823 


A 


8988 


3 


159 


GFYHIGQAGLKLLMSGDLPASASQGAGI 


8923 


22824 


A 


8989 


1 


123 


SHYVAQAGPKLLGSSNPPASASRSAGIT 
GySHHAWPPSLFL 


8924 


22825 


A 


8990 


462 


303 


IoARTPSRPTRPPTRPPTRPPTRSPTRPi>~ 


8925 


22826 


A 


8991 


382 


178 


PPFFFYFFFFSFFFFFFFFFQKGFKGFF 
PQKFFFFFFGGKPVKNFQKFFLRAKGKG 
GEKKPPQNKKVF 


8926 


22827 


A 


8992 


532 


432 


RDAGLTMLPRLVSNSWAQVLLIiPGPPKV 
LTLQV 


8927 


22828 


A 


8993 


418 


172 


KKGKKKKKNIFLIFFNYFFFLIFFFIFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFGGAWSLFRGGDSGEVKGTDSMS 


8928 

■ 


22829 

• 


A 


8994 


2 

■ 


722 


AVRLN I S YP PQNLTMTVFQGDGTASTTIi 

KTiuoiUikJ V IJiLLitJoi^xllJ V v^nViJdiMtrr'iuCxJ 

SWTWGSLTLSPSQSSNLGVLELPRVHVZ 
DEGEFTCRAQNPLGSQHISLSLSLQNEY 
TGKMRP I SGVMLGAFGGAGATALVFLS F 
CI I FWVRS CRKKS ARPAVGVGDTGMED 
ANAVRGS AS QGPL I E S P ADDS P PHHAP P 
AIATPSPEEGRIOYA<5ti53FHKARPnYPO ' 

EQEAIGYEYSEINIPK 


892Q 

07*7 




A 




498 " 


323 


RNAKTGLLPGSFVENI 


07JU 


ZZoj 1 


A 




21 


SI 1 

■ 


III T DGVEAYAXiNASGVVNI TVFDPTffWA 

LFRS FKAVKSKLDTRRGSNS ELETAVKD \ 

LGKAVSYKGMYGDVAIWYSGQYVENGV 

KKNFLPDNTMVLGNTQARGLRTYGCIQD 

ADAQREGINASARYPKNWVTTGDPAREF 

TMIQSAPLMl*LADPDEFVSVQIiA 


8931 


22832 


A 


8997 


178 


306 


GAINAINWPPFNNVGMGKTLGFPNLMAL 
KQ I PFS P VGEKTKRR 


8932 


22833 


A 


8998 


351 


211 


S I QSLRMQP VPCY I SGRHHHYHSHHH JH 
HHHYSSYFLKYPQEYLRPL 


8933 


22834 


A 


8999 


1 


381 


FRLFKFALMELCS I EG I SDQPLFFVFGF 
FFFGKEASFGPQGGREGGKFKLMEPLAK 
GVKGISCPKPPHPCLNPPKPKRGGESSP 
PEGPKKTGAPGPAKKGLFFLTKGKNPWP 
GPGKTRKKAVFPKNF 
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SEQID 
NO: of 
nucleotide 
sequence 


S£Q ID 
NO: of 
peptide 
sequence 


M 

etb 

**' 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

tag to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I s lsoleucine, 
K=Lysine, L=Leucine, M=Methionlne > 
N=As para sine. P=Proline» 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Voline, 
W^ryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /"possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8934 


22835 


A 


9000 


173 


399 


I KSE WAAKKKKKKKKKKKKKKKKKKKKK 
KKI FFFGGGRFFFFWGGDYFWGGGGVKK 
KGGGKKNPGPQKKNGGGKK 


8935 


22836 


A 


9001 


2 


81 


E FT P AVHAS LDKFLAS VSTVLTS KY R 


07 JO 

• 

* 




JT3L 

■ 


• 

! 


•a 
j 

• 


OjJ 


ELRAWEQTERSRKLAEQELIETSERVQ 

EAVQECRNAEEKAKKAITDAAMMAEELK 

KE0DTSAHLERMKKNME0TIKDLOHRLD 

EAEQIALKGGKKQLQKLEARVPRTiKNBL 

EAEQKRHAKSVKGMRKSERRX KELTYQT , 

EEDRKNLLRLQDIiVDKLQLKVKAYKRQA 

EEAEEQANTNLS KFRKVQHELDEAEERA 

DIAESQVNKLRAXSRDIGTKGLNEE 


8937 


22838 


A 


9003 


429 


280 


RGFRGHKFLBPPGFPGFPPIMGPPSPPF * 
GS P PQKNFPLQPKPPLGFKPPP 


8938 


22839 


A 


9004 


2 


108 


LSA YQGTPLPAN I LiDWQALNYEIRG YVI 
IKPLWV 


8939 


22840 


A 


9005 


3 


158 


SSTQGHKHCGRPQGPLPRKTRDLCSPVY 
LLTFS PLLS YD P VTS PS PRNTOR 


8940 


22841 


A 


9006 


3 


370 


LLLTGLNRLTTDLYSLYI FTTTQWGSLT 
HHINNIRPS PTRENT^LMFIHIiCPTritiT.S 

IjSPDI ITGIFS YTRHYVNSSTYLETNER 
GGS YKNRLGGPESKGLRI KDDSLCWGPL 
HDTSAYYENK 


8941 


22842 


A 


9007 

- 


363 


% 


KLFNVGGGGLDKPPPPKKFFFFKKNGLF 
FFYPLKKKKNFFFPRELLWGPPRFFI KT 
PPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFPFFFFFFFVMIS 


8942 


22843 


A 


9008 


1 


87 


KMLKKPKFELGKLMELHGEGSSSGKATG 
D 


8943 


22844 


A 


9009 


420 


3 


RFYFPKPRKRGGGGVFFFFPPKKKFFFL 
NPPPFFFFPPPKKKKNIFFPPRKGGRGS 
KVQTMGPPLFFFFFFFFFFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLRETSSNNV 
LAYQ I ENSLLKLiDLQRQFCLPLPGRV 


8944 


22845 


A 


9010 


2 


74 


LHRLGGPEAGIiGEYLFERLTLKHD 


8945 


22846 


A 


9011 


33 


132 


I DP RCRNS ARAPLS LAGPH PGMGD AS NH 
MGQM 


8946 


22847 


A 


9012 

■ 

■ 


83 


607 


RRSGTPCGYNVTSQNGTIY3PGFPDEYP 
ILKDCIWLITVPPGHGVYINFTLLOTEA 
VITOYIAVWDGPDQNSPQLGVFSGNTALE 
TAYSSTNQVLLKFHSDFSNGGFFVLNFH 
AFQLKKCQPPPAVPQAEMLTEDDDFEIA 
ITGEGKPTRMPOSRICKSCHI ISVLISAK 
LNRAFL 


8947 


22848 


A 


9013 


483 


364 


PPPPPPFIiNPAPGIFFPPPLGGIGARPP 
PPSFFFFFFFFF 


8948 


22849 


A 


9014 


74 


180 


LLSTYVGRXjSARPKLKAFLASPEYVNLP 
INGNGKQ 


8949 


22850 


A 


9015 


488 


317 


HVGQAGLVLLTSGDPPTSASQSTGITGV 
SHRAQPLRSFSFVLSLLQKRIVLC 


8950 


22851 


A 


9016 


149 


13 


DERGGLQAVAHTCNPSTIiGGRGGWITRL 
GDGDHPSCRILAGDASQ 


8951 


22852 


A 


9017 


20 


107 


FbStTADSDQVNPiOGLASKWDYjBXJfeWK 
K 


8952 


22853 


A 


9018 


448 


239 


VDHQQVXWNRER3SNSQNGIVKEIKGAD 
TFIPGHTPAVKPLKFANQMYIDTGAVFC 
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SEQID ' 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

i 


M 

eth' 

od 


SEQID 
NO: in 
USSN . 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
fng to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
dine to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D-Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M-Methtonine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S^Serine, 
^Threonine, V^Valine, 
W-Tryptophan, Y=»Tyrosfne, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














GNLTLIQVQGEGA 


8953 

■ 


22854 


A 


9019 


3 


209 


CP P LS PG CTNS ARAE PSVHALPDALNNL 
RTHEGSGDGPSSSVDWNRPBDVYPQGIY 
VISAPSIYARBA 


8954 

• 


22855 

\ 


A 


9020 

* , 

■ 


504 

• 


20 


NYAQMS YSTARASANE SWAYFMGRRKFV 
ASRQAS QM FL CWLEEATVRRWTLPS KA 
RFSFQEARSAWGNCDWIGSGRMAIDGLK 
EVQEAVMLIEAGLSTYEKECAKRGDDYQ 
EI FAQQ VR ETMBRRAAGL KPPAWAAAAF 
ESGLRQSTEEEKSDSRPAAGS 


8955 


22856 


A 


9021 


3 


168 


LPRLEAAARIRHEERERERBRERERERE 
RERERERERERERERERAPPPTLCET 


8956 


22857 


A 


9022 


2 


89 


LCSLVYIiLTFPPLLSCDPAKSPSARNTQ 
E 


8957 


22858 


A 




508 


371 


FFCRDKVIiATLPRLLiANAWPQVIFLPQP 
PKVLRLQPCTTPSLKDL 


8958 


2285Q 


A 

mm 


9024 


2 


170 


RTHSAGR P YQCNQCEKAFRHSSSLTVHK 
RTHVGRETIRNGSLPLSMSHPYCGPLAN 


8959 


22860 


A 


9025 


3 


185 


P PAATPQGPSP INS PP!TKKKAKNKNRTK 
KTKKKKKPPPSKKKGPQKTPKKTMGGPL 
KTTT 


8960 


22861 


A 


9026 


104 


319 


LVFLSKYTPPLLVSLWIHGLSLLSFLPS 
LPS FLPS LLTYLL PS FLPS FLTS VTNPL 
PSLFFLVFKLSQFWT 


8961 


22862 


A 


9027 


476 . 


364 


SLKRLTSGDLPASASQSAVITGVSHRAR 
PIIYFKYVQ 


8962 


22863 


A 


9028 


36 


154 


FLVGQAGLKLLTSGDLPASGS QSARXTA 
VSHRTWRTHYF 


fiO£3 

■ 




TV 






■ 


PYWCRQCPEYRRQAAQPPHCPAPEGEP 
GAPQALGDAPSTS VS LTTAVQDYVCPLQ 
GSHALCTCCFQPMPDRRAEREQDPRVAP 
QQCAVCLQPFCHLYWGCTRTGCYGCLAP 
FCELNLGDKCLDGVLNNNSYESDIIjKNY 

mw **mm+mm>v M V< W A** ■ + v m-**^ m* mm^mJSmr A A 

IiATRGLTWKNMLTES IiMALQRGVFLLSD 
YRVTGDTVLCYCCGLRSFRELTYQYRQN 
I PASE LPVAVTS RPDCYWGRNCRTQVKA 
HHAMKFNHICEQTRFKN 


8964 


22865 


A 

• 


9030 

■ 


15 


389 


IVDHYPCGGIPTSGSVTQAGAQWHNLGP 
LQPRP LGLEGSS CLS L ICS WGHRHAPPH 
IARGHFSSEVEQLRSRLQHGAVFVRQRK 
GTPEDPRQGLRTLISSCIiIiHLVAIALWF 
SLPFHGPRIHMQ 


8965 


22866 




9031 

m* ^0mw m 


3 


243 


RHSLAFURFSCLSLPSNWDCRHPPSCPA 
KFCTFVEMEFHHVGQAGLELIiTSGDLPT 
LASQSAGITGVSHHAWTRCCCCF 


8966 

- . 


22867 


A 


9032 


1 


245 


PTRPAG I AI CI CM CMKNHRATR VG I LRT 
THINTVSSYPGPPPYGHDHEMEYCADLP 
PP YSPTPQGPAQRS PPPPYPGHARK 


8967 


22868 


A 


9033 


46 


124 


QELMTHGAXSPDGTRVHNAFIiFVTTV 


8968 


22869 


A 


9034 


235 


435 


KKKKKKKKKKKKKKKRRPFKLKGPFKPP 
LKPQGGLFEAPPFGGPPPPGFFFAGGGA 
PPRGFFKKKGK 


8969 


22870 


A 


9035 


333 


70 


CPIYQIPILKDQVNLSSISSPFSHLIKD 
LWKTLFIS YITELI FVCFVEMMFHHVAQ 
AGLELLGSSNPPTSTFQSARITGISHHA 
CLNI 


8970 


22871 


A 


9036 


297^ 


411 


NKTMS FAATWMQLEVI LSKLMQEQKTKY 
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SEQID 
NO: of 
nucleotide 
sea uen ce 

• 


SEQ ID 
NO: of 
Deotide 
sequence 

• 


M 
od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide. 

sequence 


Amino acid sequence (A a Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. ^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine> l^Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine; 
W«Tryptophan, Y»Tyrosine, 
X=Un known. *=Stop codon, /^possible 

m mm ~mmr mm m mx^ m mm* m w mmm mmm mm-^p 9** *m* - mmr ■ Wl ^mW^mmmw m mm m^ mmm 

nucleotide deletion, ^possible 
nucleotide insertion 














CMSSLISGS 


8971 

* 
• 


22872 


A 


9037 


461 


167 


NKNYPLLGGPPFGSPSPENLNGGFPLTP . 
GGGGPINPNSPPPPPPGGRRPPFLKKKK 
KKKKEKEXNTYILQDTCDRCSAVLPIMQ 
VBCIVMICLIMHRLF 


8972 


22873 

MO / ml 


A 


9038 


466 


292 


NGCLLEPAS WNSLANI FIFCREEGLTML 
PRPQVILPPLPPKVLRLLPPPLSS 




79R74 


A 


9039 


34 


124 

1 *m~T 


GIRNIKFSSLSTFIAHKKYYCSSHAAEH 

tmm mm ^mw^ *m% ^^mm mfw %mw ^m'-mw mm mmm ^mmmt \mmm*mm mmmm mm mm mmrm^ mw m mmm mm mmmwm>Mt 

VK 


8974 


22875 


A 


9040 


54 


239 


RRRRKLRRWGLAMLPRLVSNS WHQAI LW 
PPKVLRLQACATTPGLTRRWLHGQIWSP 

m- ~mw m" mmm-m^m »*— * ■ m m» mm mm.w -m> ■— » W w *-^m> A fTW •» 

GAGSV 


8975 


22876 


A 


9041 


469 


223 


PWARVCVCVCVSVCVCACVRVCARASES 
OTCAAVPS LGPAGAGRPALGCCAVCEDE 
VGSCPHARPLPRAAVDKAAGRRGLCF 


8976 


22877 


A 


9042 


491 


380 


S PGWAHS YN PSTLDGRGSHITRVRKFM 
"TCTiATEGAV 


8977 


22878 


A 


9043 


144 


360 


IHSHWGYWLDVSLIAAHLAINPDCFMYV 
YIFSOCTDKLTEKEFIEGTLANKEILRIi 
IQFEPQKVKEKMKNA 


R07ft 

oy /o 








AO 


ZD J 


RWNSDGKGSIjRPAWATWPDPVS iknton 
THTTHPTPHSPHTPHGQQPFLSPTPFPW 
PVSSPSPFPPFSHTCIiSGLLPPGA 

m mm** m^mW mm mmmw Sm mm mr mr mm* »-* mm* m^mM Wmmw **mmmmm»m~mm»m mm mm *mm* m mm 


8979 


22880 


A 


9045 


461 


287 


PPSLANIYLFIYLFIGRDGVSLFPRLVS 
NCWAQAX LL PWLPKVLGLQGBPPNLALK 
SL 


8980 


22881 


A 


9046 


485 


6 


NNKNSWAWWRVPVLPATREAGAGESLES 
GRRRLR 


8981 


22882 


A 


9047 


475 


307 


LFLRDGSLTVLPRLVSNSWPQAI LLPWL 
LKGLGLOARATLHMACSSCINSTQNILQ 


8982 


22883 


A 


9048 


483 


338 


LREGERQREQERERERERERERDRERAH 
TTAFTOLLFQYVAI ILLPCY 

4V w %r iM#t» m^m m mm mmfmm m^m ^h^" 


8983 

070J 

r 


22884 


A 


9049 


680 


303 


NDLLFGS LYVKQFAASGTGRVTL PE PSE 
EQWCLRCFQDLFGDDHNKNGFKMFDSS 
NYHGQDLLFKDATVRAVPVGEKTTYRGW 
LGLDYVAALEGMSSQQCSGAGRTGPPCR 
LPWCYSDPFTPEP 


8984 


22885 


A 


9050 


26 


165 


AFGYHRVGQAGLELLTSGDPFASASQSA 
GITGVSHHARSQYFFVYV 


8985 


22886 


A 


9051 


461 

■ mj A* 


83 


HKIIWJPSHKKKKKKKKKKKKKKKKKKKK 
K 


8986 • 


22887 


A 


9052 


475 


209 


MAAPPKKKKKKKKKKKKKKKKKKKKI 


8987 


22888 


A 


9053 


531 


404 


SVTQAAMQWQKHNSLQGSTLEFKQSSLL 
SLPTIWDYKHADLS 


8988 


22889 


A 


9054 


66 


416 


KKKTNKI KMGGGPLPFKISPPFLKKTLS 
WKKGAPKTPPLEMAPFFFPQGGKKNSPF 
KGGKIFFFFFPKKTFIKGPPGGEHTPPL 
GGGPQIFSLPPPKGVPQKNPPPLLRGGL 
FFTI 


8989 


22890 


A 


9055 


1 


104 


ILAVNGETLEGVTHEQAVAILKHQRGTV 
TLTVLS 


8990 


22891 


A 


9056 


3 


106 


LKRWGIAMLTGLVLNSWAQVILPSWPPE 
VLGLQV 


8991 


22892 


A 


9057 


1 


299 


PTRPINI ILAFTISLLGILVYRSHLISS 
LLCLEGIILSLFIIATRIYPIAPPLPPL 
EPEK3CKKKKKKKKKKKKKKKKHKKNTKK 
RKIKAKRKNLGGGGL 
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SEQ ID ' 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 

nentide 

sequence 

■ 


M 

cth 

od 

- 


SEQ 10 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

mm m* myyy^ 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, 0=Aspartic Add, 
E"=Glu tannic Acid. F=Phenvlfllflnln^ 
Gr=Glvcine. H=Histidine» I~lso!eudne. 
K=Lysine, L=Leucine, M=Methiontae, . 
N-Asparagine, P=ProJine, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V^Valine, 

^ m mm ™ mm m$ mw m^ If i | ■ W V V w M W ^» V 

W«Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


8992 

V/ J m++ 


22893 


A 


9058 


482 


146 


LTSSKKKKKKKKKKKKKKKKKKKKKK 


8993 


22894 


A 


9059 


2 


290 


YFLFAYTILRSVPNKLGQVIiAT.T,T.SILK 
KKKRGGPPKRTLGGPKFNROGQGNIPFL 
MGGGLKPNLGILGRNLSLGGGKNWPNLP 
PKI KG FRENKNF 


8994 


22895 


A 


9060 


413 


48 


PQKKGFFQKNPKGFIxKPPPKltfttCKKFPP 
PGKIWPPQKIFKKPPPPFFFFFFFFFFF 
PJCPFPFFPFPFFPPPFS TRNPPSSVTirW 

fa 4Vi m> fa fa fa far fa fa7 fa7 far fa fa7 far fa/ w AaVLv* fa^ a> a ivn 

LRSLDVCHLLPPITPQVPGPTPVLLFST 
LPDCNHSLL 


8995 


22896 


A 


9061 


2 


274 


FTISLLGILVYRSHLISSLLCLiEGIILS 
LFI I ATL I TLNTHS LLAN I VP I AI LVFA 

PQINPG 


8996 


22897 


A 


9062 


42 


255 


QEFRTRRERERERERERBRERESSGWR 
AKFRSNFSAGAIAHTIRVMLPPSGI YSN 

TMITfVJRTXT.T.'PT? 

r». rv.-i l mt\i '^ppi " ip p 


8997 


22898 


A 


9063 


20 


413 


YALRNSARAGDPHGQLGRRDGAVGGRGW 
LMPREALHPWWANVQAWAGS PGLFLGAP 
WPQGGVWLPWLCVSMSASSSSIKOLPPQ 

WGGLKKFGGRFLKRGGGBN 


8998 


22899 


A 


9064 


6 


306 


ECEHSTTYSHFAGGGGPVMVGGDADARS 

{VlUiJw VUv UuVJ A on X V XJ V XJXSlrXX x nu X tr JX 

S PS E LQAAGW VGWQ EVSAAFDPNS FYNL 
CLTSLS SQQQQRTLD 


RQQQ 




n 






1 /JO 


AHVOT.WTncvnnTR cjcvp t cirr.avp'nn 

nmyiiv x l wjvv vaj j Rjaim iom rri T "Vf*. 

WIVTCPGCQNHHI IADNLGWFSDLNGK 
RNXEE I LTARGEQVHRVAGEGALELVLE 
AAGAPTSTAAPEAGEDEGPPS PGKTE PS 


9000 


22901 


A 


9066 


23 


244 


SCHAGIRHEERERERERERERERERERE 
RERERE I YFLALFSQKGRVHGFFKTPRG 
GQKRELLSNRVNTLCTE 


9001 


22902 


A 


9068 


396 


143 


KDIQNFGQAWWI/TPVI PALWEAEAGGSR 
GQE I E TI LVNMVKPHLS VLLAKNS SSHC 

Irnv»VYXXnXVLJ\^n X X/U UDC l4f\X X X V7X X XJaLXrUT 


9002 

■ 


22903 

- 


A 

• 

- 


9069 

r 


828 

- 


99 


VDREPVDREPWCHPDLEERLQAWPAEL 
PDEFFELTVDDVRRRLAQLKSERKRLEE 
APLVTKAFREAQI KEKLER YPKVA£«RVL 
FPDRYVIXJGFFRPSETVGDLRDFVRSHL 
GNPELSFi^FITPPinVTjDDHTOTTjFOG 

EPLPGRSGCNLGAEEPAGVYLEPGTiTiKH 
AISPSAADVxjVARYMSRAAGS PSPIiPAP 

dpapksepaaeegalvppepipgtaqpv 
krslgkvpkwlklpaskr 


9003 


22904 


A 


9070 


20 


427 


DGTRRERERERERERERERERXjYIjTEDT 
IiHSLSLEGARSGASVCVQSVYPTGASAP 
HS I INGHKRVLVLCVETEHTFPLAEKDR 
AFLGD TS VFVGAAP VGGANTGAS AGGAP 
PSRGEPKEESGESHEDMGFGLFV 


9004 


22905 


A 


9071 


297 


126 


CSIYDKRIPTFGLGEGVSERQRLTLLLP 
CL PPALTE TPTS S DWQ S TD ATPTLTNS S 


9005 


22906 


A 


9072 


117 


286 


WLRGSEKMFQVQEINEQYLQCILIKGI 
FLVKKKKKKKKKKKKKKKKKKPRPEYIK 


9006 


22907 


A 


9073 


3 


302 


PPMETLRQWTTIVHSQEEEEEHFEVIKD 
EVKWARKHGQPGTP VAIATQLP PRTSA 
AFPAQQQP1*QPLAP TTVLSVSADWNALL 
QRPNKYI WGLQ YLEM 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=*Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H«Histidine, Wsoleucine, 
K=Lysine, L^Leucine, M=M ethionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X D Unknown, *«Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


9007 

* 

• 


22908 


A 

■ 


9074 


437 


79 


NQKREEEEERRKKRRRRRRKRRRRKKKK 
KILRQKKKKRRKA6SREKKKILRHLPBK 
NKRSKKHYELINALSRLQDTRLIYQKMH 
FYILAKNNWKIKLKIQYHLQALENKVPR 
HKWETYA 


9008 


22909 


A 


9075 


51 


185 


CRADFMFFHDKI IVYLCSLVYLLTFPPL 
LSYDPAKSPSARNTQE 


9009 


22910 


A 


9076 


452 


310 


QFS RCKLS SLS LDRKGNSLTP CT S WVRR 
CLAL LRL TLGALHL LS DNPQ 


9010 


22911 


A 


9077 


438 


326 


LLHCWWDCKLVQPLWKSVWRFLRDLELE 
IPFDPTLHP 


9011 


22912 


A 


90,78 


37 


169 


GIPGFRHAGQRRLQLLITGDLPTSASHS 
AGITGTSDRARPPSTF 


9012 


22913 


A 


9079 


1 


408 

■ 


RKGRLS KE E I E RMVQEAE KY KAEDEVQR 
ERVS AKNALE S YAFNMKSAVEDEGLK6K 
I SEADKKKVLDKCQEVISWLDANTIiAEK 
DBFEHKRKELEQVCNPIISGLYQGAGGP 
GPGGFGAQGPKGGSGSGPTIEEVD 


9013 


22914 


A 


9080 


31 


168 


VRRNPRLEHTVTI TAYCSLELLGSSNLP 
ASASQVARTTGLQLLNLQ 


9014 


22915 


A 


9081 


460 


319 


YFFVEMGSHYI AWAVLELLGS SNPPASA 
SQSAEFTDAHKPTDdkPLR 


9015 


22916 


A 


9082 


465 


326 


PCVAEFCTHTHTHTHTHTHTFKCVGSKL 
HTNND 


9016 


22917 


A 


9083 


33 


269 


PGNSRPISSVRGIHPPCLANFCFLNVFV 
EMGSHYVVLAGL I LGS SDLS ALAVQSGR 
VTGMSHCAWPEIFFLLENWIIL 


9017 


22918 


A 


9084 


47 


300 


DLITSLFLLKQSSLATMNIQTPFKNIYF 
DLIIKTLFLRFFI FCFI YFYFLFFSYGG 
F FS FFFF FLKGS S FLP PRLNGGAKI WVN 


9018 


22919 


A 


9085 


437 

i 


220 


NIAIKSLSLSLSLSLCVCMCVCVSACVC 
MCLSLSLFSPSSLSHSLFPFSTPNM3IS 
HLPILSTQENMLLLLQ 


9019 


22920 


A 


9086 


227 


23 


DCPKANILGLTPRLVSNSWPQVILTPWP 
PKGQS LLCRQGEGDRI WTWHP WWLLHPP 
PQARFPPRTHFH 


9020 


22921 


A 


9087 


206 


326 


PHWAQWPTPVIPALWEAEAGKSQGQEFE 
TSLANTVKPRLY 


9021 


22922 


A 


9088 


364 


123 


SRWLPLKAPGEDTSLPLPASGGSRCSL 
AWGSIPPISASIPTWPSVLCPCEIFPLL 
FLLYGDCNLGPTLIPDDLIMRSLP 


9022 


22923 


A 


9089 


373 


205 


MPPRLANFCYFSRDWVS PY WSGWSQTPC 
WD YRRE PPF PPCVRTS YS VQYCACYAGL 


9023 


22924 


A 


9090 


283 

• 


2 


REFS FSAWLAFKI FWKLFSPPFWRVRP 
EDWF I PGGCGPNE PNFP PCP PGVGKKGE 
PRFKKKKKKNFISSQAQWLWPVIPAFWE 
AKAGGSLEP 


9024 


22925 


A 


9091 


16 


261 


CPEDLLVGQNVKVEDSDMGDPDRRPMCM 
FACFYLLVYRRCLAMLPRLYLNSWAQVI 
LLPQP PAS LELLQAHATTHPALCLS F 


9025 


22926 


A 


9092 


17 


109 


RQGLAMLPRLISNSWTQANLLPWPPKEL 
GL 1 


9026 


22927 


A 


9093 


282 


114 


KKGGKGALLVFLKTGRKPFFFFIiRQGLA 
MLLRLVLNS WAQAI LLP WPPKVLGLQA 


9027 


22928 


A 


9094 


205 


337 


NANLHGRMQRLTPCLPE I WEAETGGSQG 
QEIETILANMVKPCLY 


9028 


22929 

i 


A 


9095 


62 


287 


EHSCAKLFCFAFESGPYSVAQAGVQWHD 
HSLXjQSQTPGLKQSSCLSFPASAGFYKB 
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SEQID 
NO: of 
nucleotide 
sequence 

* 


SEQ ID 
NO: of 
peptide 
sequence 

• 

• 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 

■ 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
C=€ysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leu cin e, M=Meth ionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X«Un known, *= s Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SPSFYKDKSCHHLGKHSGA 


9029 


22930 


A 


9096 


170 


9 


AAS TGYCIGRFCTKLKI QSGGQVRWLS P 
WPKREAEVGGLLEARSSRP I WATW 


9030 


22931 


A 


9097 


2 


104 


RRPGGRAEPRARVRGDRDPPVLLRHYPH 
IFBECG 


9031 


22932 


A 


9098 


350 


142 


LCLSCPKGGGYRAMAPWRGNFCSFGKNG 
GFPMLGRVVLKWPQMGPLPQPPKRVGFQ 
GLSPPPGPPPFLV 


9032 


22933 . 


A 


9099 


494 


342 


YFYFYFVEMVSHHVTQAGLELLASSNPP 
ALISQHAVITSGSHHTQPKLYF 


9033 


22934 


A 


9100 

i 


30 


232 


GGIMLCHGRCACFPFAFCHDCKFPKVFQ 
PCFLYSLQNCS FLYKI PSLRSCRLQLAE 
LLS I X YXiENPL 


9034 


22935 


A 


9101 


26 


171 


RNSRRLRHKNRLSLGGGGCS EPRS CQCT 
PAWQTARLSLKNKKQQQKKP 


9035 


22936 


A 


9102 


486 


239 


HHTCPHQVCVCVCVCVYRVCVCVCVCVY 
IYMYMCIYIYIYIYICIYICRSRERBRK 
REMWFHHVAQAGLELLGSRDLPAPAS 


9036 


22937 


A 


9103 


487 


344 


TGFHHVGQAGI ELLTSSDLPAS ASQSAR 
ITGVSHHAQPLFMI FCAKC 


9037 


22938 


A 


9104 


470 


221 


TPGLKRYTCLSLPKCWDYRCEPHARQGA 
FLCAFSPLSSQQPwdKRPHFHFSYETNK 
VERGKRLPLRSPSTEQGFDFRCVLFQT 


9038 


22939 


A 

• 


9105 


468 


259 

* 


YVFF CRDGGLT I S S RLVLNS CPQQS P CL 
SLPKCWDYRHEPPCPSCCSNLUjMFHFT 
NVFL I DSECSHFTP 


9039 


22940 


A 


9106 


466 


329 


RVAGTTGTHHHAWLINIPSVETGFHHVG 
PQVIQPPRPPKALGLQA 


9040 


22941 


A 


9107 


409 


306 


AG FELLASSDLPASASQ SAE I TG VSHHA 
RPSLIF 


9041 


22942 


A 


9108 


417 


269 


MWAQADM ES S KE PGLFNW I INDSLVQ 
AYPELKEALYEEIKKAQRROA 


9042 


22943 


A 


9109 


533 


399 


RGSLRHETRLNAEARRCSELNSRQCTPA 
WATBRDAVSKKTKQKA 


9043 


22944 


A 


9110 


16 


239 


LVHFEVRWNSHEVSLTADSRKSHLIFPQ 
PNYADTLINQESYEKSEPLLITQDLLBT 
KGBPRQLQVSFFPPKREE 


9044 


22945 


A 


9111 


68 


219 


EPPHIiANFLQRLVFTMLLTLILNSWPGV 
ILPPGPPTVLGLQDKLPVLLPS 


9045 


22946 


A 


9112 


192 


57 


DRVPLCPPDWSAVARSWLTAASVSQVKG 
ILSPQPTECAPPGNST 


9046 


22947 


A 


9113 


21 


201 


TXVSWEFMNLGGGTCGEPRWCHCTPDW 
GTERDSVSGEKKEDKDVARRHWVGSFYN 
GPQ 


9047 


22948 


A 


9114 


2 


144 


TFTIDRCAKDLFVAKQVGTKLSVNEPLS 
FSVESILKRPSSAITRVSQ 


9048 


22949 


A 


9115 


156 


316 


MILNKKKKKKKKKKKKKKKKKKKKKKKK 
KGGGPFKKTLGGAKITPGWKNYFFFFMG 
GV 


9049 


22950 


A 


9116 


1 


84 


TKPTPFLPTLITLTTLIjLPISPFILIIL 


9050 


22951 


A 


9117 


1 




HLNLGGGV CSEPRS CHCTPVWATEILSQ 
KKKKKNGFKFWEPRPQKEGGGGRKKPPG 
VP 


9051 


22952 


A 


9118 


1 


184 


AAAGYPGQQQNFHSVREMFESQRIGLNN 
S PVNGNSS CQMAFPS SQSLYRTSGAFVY 
DCSKF 


9052 


22953 


A 


9119 


69 


374 


"i IFSKKKKKKKKKKKKKKKKGGPPKKKP 
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SEQID 1 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E»Glutamic Acid, ^Phenylalanine, 
G K Glycine, H-Histidine, I^Isoleucioe, 
K=Lysine, L=Leucine, M=Methtonlne* 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
"^Threonine, V=Valine, 
W»Tryptophan, Y«Tyrosine, 

X^XJn known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 




• 






■ 




WGGP KI FVLRRVLS FFIRGBKKNTF6D P 
LKKTFFWGGEKLGKPPPKKKTPWGKKKI 
FKGERGKKPPFLGGKKK 


9053 


22954 


A 


9120 


405 


93 


EFSPLLLTTNQKRGFFPPSPPKIFFFPR 
RIYFLGGGGPHFPPPQKRGFPQKNQKGF 
FFSPPKEKKYNLPPPGKFCPPQGFFFPP 
PPLFFFS FFPFFFFRCQIK 


9054 


22955 




9121 


2 

* 


380 


WDYRCPPPHPTNFCI FSGDGVSPHWPGW 
SWPQVICIiPQPPKVLLTPWVRDRTRDLE 
GGDLWYGACLGACLHLGPQAASFGHGIL 
GPASLLCCIPKHVHMDTLSPGVKLQSCD 
TGVALRVGEKREGS 


9055 


22956 

■ 

• 


A 

■ 


9122 


2 

• 


534 


DMIREQKIYHKYLAQRREEBKAQRKEFB 
RILEEDKAKKLAEKDKELRLEKEARRQL 
VDEVMCTRKLQVQEKLQREAKBQEBRAM 
EQKHINESLKELNCEEKENFARRQRLAQ 
EYRKQLQMQIAYQQQSQEABKEEKRRBF 
EAG VAAN KMCLDKVQEVLS THQ VLAQN I 
HPMPQGMPQ 


9056 


22957 


A 


9123 

■ 


389 


46 


GPNFLPPP IFGFFSPFSPLKFFFSPKTL 
IFLGGFFPIFPPPKKSFFPKNSPGVFFC 
PPFKKKNFLFPSPLNFAPPRVFFKPPPP 
FFFFFFFFFFFFNFYRLYFI WVNGLAKV 
VW 


9057 


22958 . 


A 


9124 


422 


59 


AQQKKKKKKKKKKKKKKKKKKKKKKKKK 
K 


9058 


22959 


A 


9125 


333 


168 


KGPPCIVVCMVEFHKLEAGSHSVAQAGA 
QWCDGNLLQTLTLGUCRSPHFSLPSR 


9059 


22960 


A 


9126 


175 


89 


KPPPHFFFFFFFFFFFFYFFFFFFFIFC 


9060 


22961 


A 


9127 


368 


250 


CFNYLSRDRGLPLIiPGLVLKSWPQAILL 
PQPPKMVGVQG 


9061 

■ 


22962 


A 


9128 


392 


157 


FFSPKRLNFGGGGGPKMPPPKKKVSPQK 
PHPGFFSPPQKKKKKFFPPPVKMGPPKS 
FLKSPPPFFFFFLISSIRGLFL 


9062 


22963 

• 


A 


9129 

■ 


396 


13 


FPPPPPPIFFFSPRRFFFRGGGSVALSP 
PPKVFFLLNPPPVFFFSPPFKEKTSFFP 
PPFFCPPPQILFLAPPPYFFFFFFFFLF 
VFFFLFFFFLLPLLFLGKTSFLLLLFLF 
IYLETGSCSVAWARVQ 


9063 


22964 


A 


9130 


55 


180 


IiL YNKL I FTLKKKKKKKKKKKKKKKKKK 
KKKKKE KYE KTKA Y 


9064 


22965 


A 


9131 


2 


95 


SFVHMEHEKAVLLLKSFQNTVDLGIQRB 
LTV 


9065 


22966 


A 


9132 


172 


404 


KKKKKKNKKKKKKKKKKKKKKKNWGGGF 
FKNS WGGPKKRGGGKI FFFS LLGGKKKT 
LGVFLE KPLFLGGRKMAPPP P 


9066 


22967 


A 


9133 


1 


125 


QPVSYPFTI IGQVAYEIiYFTTIIiILIPT 
MSLIENKILKWAY 


9067 


22968 


A 


9134 


1 


120 


QPVSYPFTI IGQVASVLYFTTILILIPT 
ISLIENKILKWA 


9068 


22969 


A 


9135 


7 


269 


lATGQPRLQKLTIiDPTLLLKRFLLTKKK 
KXKKKKKKKKKKKKKKKKKKKWGGGGHK 
KKPPGGPKTYPVKKKIFFFFKGKIKKPP 
WGI 


9069 


22970 


A 


9136 


1 


64 


LVSISNTYGLDYVHNLNLtiQC 


9070 


22971 


A 


9137 


400 


259 


SQPLRRLRHKNILNPGGGHYSELGSCHC 
TPAWDTEQDSIHSWLLSSV 


907J 


22972 


A 


9138 


3 


310 


ECFPNS PNMNGHCNLLVQSTSPS I STMP 
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SEQU) 
NO: of 

n u t ivw uuc 

sequence 

— - 

■ 


SEQID 
NO: of 

sequence 

■ 


M 
#h 

■ 


SEQID 
NO: in 
USSN 
09/5154 
26 


Predicted- 
beginning 
nucleotide 
location 
correspond 

inp to first 
amino acid 
residue of 
peptide 
sentience 

1 


Predict- 
ed end 
nude- - 
otide 
location 
corresnon 
ding to 
last amino 

■ 11 1111*1 V 

acid 

residue of 

■ rntHJ m \a mm m* \w m 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCystcine, D=Aspartic Acid, 
E— Glutamic Acid. F=Phenvlalanine. 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L*=Leucine, M=Methionine, 
N=AsDa rapine. P=ProIine. 
Q=Gtutamine, R-Arginine, S=Serine f 
T=Threonine. V— Valine. 
W«Tryptophan, Y«Tyrosine, . 
X = Un known. *=Ston codon. /=nossible 
nucleotide deletion, \= possible 
nucleotide insertion 

■ ■ mm) m?W mp mw m m mm- mr 1 W Itrwl va^Bj 


■ 


■ 










CPKYVALEKVFLLNVLWFFYLKYRDNRP 
GSVAHTCNP STLGGRGGW ITGAQEFETS 
LGNIARPSLI KRKKYKLF 


9072 


22973 . 


A 


9139 


34 


142 


NCKKLNTHTHTLTHSHTHMLTHSDTFTH 
THTHTKYK 


9073 


22974 


A 


9140 


146 


2 


PPPPPPEGVSLCGPGWSAVARFRLTASS 
ASRVHAILPPQPPQQLPTRP 


y\j /*t 


99 07 5 


a 

at* 


0141 




109 


PPAS ASOS AQVTGMSHHAOP IEPLRCS S 
HRTT 


9075 

■ 


22976 


A 


9142 

i 


245 


51 


NREGDLLCCPRWSQTFGLKQSCDGFPKC 
WE YRRG PPHLAYI MSMRXNIL IOMI PYY 
NNPCLLYS 


OA9£ 


99Q79 

i 




QtAI I 
7l e fD 


o 
Z 




GAKCVSDRI KHAFLiIKEOKI I VKVLKAO 
AGSQKAK 


9077 


22978 


A 


9144 


3 


200 


MGFLHVGQAGLKLPTSGDLPASASQSAG 
ITGVSHRAWPVSGFFCSEAKECFWEEQL 


9078 


22979 


A 


9145 


405 


227 


WFFLGFPLKGKLGPPKMFFRAPRVYSPF 
PP P PFPWPP P PR I T PPPGPFKFFVFLAG 

PIW 


9079 


22980 


A 


9146 


387 


220 


PLFFVGCFFFFFFFFyYFFFFFFFFFFF 


9080 

* 

• 


22981 


A 

• 


9147 


539 


220 


VIKFKHILIKTVIQSLFFFERGSCLPPR 
I^C^SMIMAHCSLNLLGSSHPPASASQV 
AGTTGAYOHTOL I F I FFAETGS CYVAOA 
GLOLLGSSNPPPWPPEVLGIiOV 


9081 


22982 


A 


9148 


3 


141 


GGGGCCE PRWSHYT PAWATSAKLCLKKK 
KKKRFLEKS CLTPLLKA 


one*!. 






QIAO 


•11 j 




FI FCRDGGLTMLPGLTSNTWSOVILPPO 
PPKVLGLQAYHADELPHLATWGVLYGLT 
V 


9083 


22984 


A 


9150 


395 


193 


LLFGPCLLNLITQFVSCHLLDIKLQMTL 
SEGYFPLNIQESPFYRAPLDDPSVRHAR 
GEILPISLGPG 


9084 


22985 


A 


9151 


1 


77 


LVAAIVLGGTGWGFKNIRRGRSGGA 






a^» 


01^9 




181 

JOi 


IVDRPDPCRDADAOREVFTLSARYPKIW 
VPPGDPAREFPFIQSAPVMLIAAPDELV 
SAQIiA 


9086 


22987 


A 


9153 


373 


251 


FKKI F FCRDRALETMS PRLVSNSRTQAI L 
PPQPLKVLRLQA 


Q0R1 


990RR 


A 


0154 


204 


95 


DYHWSQPGMVAHACNPSTIiRGRGRWITR 
SGDRDHPG 


QORR 
7wOO 


99080 


A 


01 Vi 


44 


980 

LOU 


AELFGTLMREDGLS LGGGGCKEPKSCHC 
NPAWATEPDCLRKKRKESPYLLWTPPD 
SS YS PFFFLLLPAPDNNDYFLS 


OOKQ 




A 

* * 


01 Sf% 


9 


117 

1 X I 


ERLRHENCLNLKGRGCSEPGWCHCTPAW 
ETKQDSXSKK 


9090 


22991 


A 


9157 


3 


207 


HASAHASADAWDYRRVPPCPANFCILYV 
SHVGKAGLNLLTSSDLPTSASQSAGITG 
VSHHARTPTYV 


9091 


22992 


A 


9158 


117 


267 


NRVIXCCPDWPQTPGLKGSSLSLLSSWD 
YRHI PPRLAARSS YLGPGFYH 


9092 


22993 


A 


9159 


415 


167 


LFFFPIKNXFFFSPPLFFFFFFFPPPPX 
XXFFFFFFPFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFliELRQNF 


9093 


22994 


A 


9160 


217 


41 


SSQIHRDRNRMVRQGAVAHACNPSTFGG 
KGEQITRAQEFKTSLANMVKPRLYYKPC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
neotide 
sequence 


M 

eth 

od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 

mm mm VIVVMIJV 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenvlalanine. 
G=Glycine, H— Histidine, IHteoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serioe, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X^Un known, *— Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LY ^~ 


9094 


22995 


A 


9161 


341 


164 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFPFFFYVCF 
VSVELKKWQNYS FDVRI ILRVEP 


9095 

*S \J J +J 


22996 


A 


9162 


190 


402 


I HTGKHPKCYRNS YSVLFLAS KKKKKKK 
KKKKKKKGGG FKKKKNLNRGGGE I IFFF 
WGPKKIFGGGGLKKG 


9096 


22997 


A 


9163 


108 


209 


LSVKRCAKNPKKGKVAPGIKEIRTRSHQ 
DSEDI 


9097 


22998 


A 


9164 


2 


145 


KHCGRPRPQGPL PRKTRDLCSLVYLLTF 
PPLLSYDPAKSPSARNTQE 


9098 


22999 . 


A 


9165 


170 


402 


ISRCHFPSLIIiSYVKITKTIKAPKKSKK 
KKKKKKKKKKKKKKKKKWGGGPPKKKNP 
LGGGGGE I FFFLGAPKKKRGG 


9099 

■ 


23000 


A 


9166 


399 


113 


WRLRWEDCIiSLGGRGCSEPGSCY CTPMH 
VTEODOS SGYEGFORP SSLSKARI PSVA 
AERSMCQPACVCNLS VEHIiSLYS CLLAQ 
QGKALYKLLNK 


9100 




a 

mm 


Q167 
710/ 






S WITGVCHYAWL I FAFLVE TG FHRVGOA 
DLELVTSGDATTLRSH 


9101 


23002 


A 


9168 


134 


301 


EKLTQSHSVTQAGVQRLNVHSIiRPPPPR 
FKRFSCLSLPSSWDYRNLETVLOEETH 


9102 


23003 


A 


9169 


201 


I 


GATILRFKTKKKNVFFKKKKKKKKRKKK 
GKGWSGTESRWNLSGEGSSGLRRRKLQA 

WW A W &\M^ M>v>p7 W WW W W4pU\pl\pft\pl\pU^pTl 

PAGPQEFEKLA 


9103 


23004 


A 


9170 


2 


191 


FVPQPLPIiGFKRFSCLSLLSGWDYRCTP 
PHLANFCGFFFLKFFFFFFKKKNFFSPR 
GRGGALI 


9104 


23005 


A 

** 


91 71 


9 


191 

171 


GGTES CFVAQAAVOGCDLGS LOS P PTG P 
SETASQTVDPDKVKDPGKLRTMFVAELP 
KTIVTDS 


9105 


23006 


A 


9172 


3 


143 


SWFLYVGQAGLELLTSGDPPALASHSAG 
ITOVSHRARLQTGIFRAV 


9106 


23007 


A 


9173 


361 


174 


VLGI I LSMS ETE I FNTKCLDGS GTVAHT 
YNPSTLGGQGGRVlRAQEFETSIiGNTGR 
PHLFKI 


9107 


23008 


A 


9174 


493 


351 


LVETGFNHVGQAGLE LLTSSDP PAS ASQ 
SAGITGVSHLALVLISRPCSC 


9108 


23009 


A 


9175 


1 


115 


FRKIWYIYTMEYCIAVKKKEMLSFTITW 
MNLEDFMLVR 


9109 


23010 


A 


9176 


286 


96 


PLFFFNRWGLA I CLGWS AVAIHRRDHSA 
LQSQTPGLKRSSCLSLLSSWDYRRVPPC 
LANESHY 


9110 


23011 


A 


9177 


399 


237 


RNTKWLFFLRRSLTLSTGLiECSGAISAH 
CNtiRLPGS SDSLANG I FKDIALLDM 


9111 


23012 


A 


9178 


427 

• 


101 


GRTGRQGRGAPHI SHDGRPGRDAPRFPD 
VMAAGKRRS SLPR WDGGRAETLLTFQTG 
QPGRGAPH I PDDGRPGGDAPHFPDGWA 
GQRLQSQHFGRPRQAAGRWRL 


9112 


23013 


A 


9179 


193 


41 


IVFLFIYLFCFFVGTGPCYVAQAGIiKliL 
VS SGPLASAPQHVG ITGMSHHT 


9113 


23014 


A 


9180 


292 


119 


KITKINKQILNLSYTLDQMDLTDICRIP 
HPTATBYRFFSRAHGTFFRSGTKNSAWH 
I 


9114 


23015 


A 


9181 


3 


279 


ALS PP YS SECCKLGS FN RELS S PS PGGQ 
KSEIKVSAGPRPLPRVQGDPSPPLPASG 
GPRCSWACGQITAVSASGVIHPSPPLm 
VCCVCLI 
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SEQ ID 
NO: of 

nUCieOuuc 

sequence 

• 

• 

• 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

* 


SEQID 
NO: in 
ftssn 

09/515,1 
26 


Predicted 
beginning 

II UklCUHUC 

location 

rnrrMnnni) 

ing to first 
amino acid 

rp<! id hp c\f 

1UIUUC VJ 

peptide 

i 


Predict- 
ed end 

nil 

III* Wv- 

otide 

IapoKaii 
IUCoLIUU 

correspon 
ding to 

Ia4t* n m inn 
last u in nil/ 

acid 

l ColUUc UI 

peptide 


. Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 

G=Glycine, H=Histld!ne, I=Isoleucine, 

1^— ¥ .vcinp T^Y.Piioinp 1VI~ I^fpfh inn i n P 

JV *-ijr J 1 U Cj JLT^JUCULlll Cj 1TJI iTACLU IUIJI UCj 

N=Asparagine, P=Proline, 
Q=CIutamine, R=Arginine, S=Serine, 

W=Tryptophan, Y=Tyrosine, , 

"V— ¥ 1 n If n nwn *~x¥An rnHnn /snAcnihlp ' 

nucleotide deletion, \=possible 

niidpniide inQprtinn 


9115 


23016 


A 


9182 


402 


238 


QMQWHNLWPLQSLPPBVTPLFCLNLPNN 
WGYRWPPPPPTNFFFLKGVSLGRPGSV 


qua 

7110 


OQrtl "7 


a 


Ol 


J 


ZU/ 


TGVSHRAQQACRFLHFFEGTCFIiSQELS 
YLKLGRDT TIC 


711/ 






Ql OA 

yi of 




JVO 


YPHYVKS IVASTFI ISLFPTTI FMCLDQ 
EVIISN 


yi io 


T3A1Q 

2.0V iy 






i in 


9A51 


GVLVGSPIjRTRFHSOIjOARGAEEIGVVT 
WMGDGVLPVCWLEVDV 


9119 


23020 


A 


9186 


925 


1132 


KCPSAWAAHPTSEIRISQGGSQAAVVLV 
ETSLANMAKPRLY 


9120 


23021 

♦ 


A 


9187 


399 


292 


ASQVAGTTGAHHHARLIFFVFIiVETGFH 
HVSQDENF 


9121 


23022 


A 


ni do 

9188 


7 


82 




9122 


23023 


A 


9189 


1 


1.17 


LPISIiPSFLPSFLPSLLPSFLPSSFLPF 

TPCDCOXnnDVT V 
C r roo V 1 Jr XJ-uv 


9123 


23024 


A 


9190 


238 


46 


NKAFRIKAFVSPSNKKFSSFFFFLNEMK 

OUVtmA»r«T.I?TT OCnWDQTT.XCnxrRVT'P 

3nXVI^AuXjaijUoo(jXNirSJ.JjAbQr4AA X 
GVSHHVRP 


yi24 


23025 




9191 


2 

■ 


OPT 


KVLRFTGVSHHTRPELYFQRLLAPGTLL 
EVWGTSGLR 


9125 


23026 


A 


9192 


44 


136 


CrXVO^CVCVCVCVSCKQTLMCCFLENF 
DV 


9126 


23U27 




9193 


469 


347 


DLPNSDQKFYVN 


"127 


23UZ8 




9194 


427 


Z3o 


PPZiP DTI 1? P V TP! P 1PVT?M\7PP WntTAfTr .XT 

VL I SGDL CAS AS PGVG ITGVS HHAQPK 


9128 


23029 


A 


9195 


34 


178 


SLVHWEFERGRAEWLTPVI PALWEAEA 


9129 


23030 


A 


9196 


22 


338 


CGPYSMWWNS I KPJSPJCEGKKRKKKRPJOC"" 

P^VTTP PP VT?P ITPP P Pfi/IPP^TTP'DnPP T?n 

KKRKKNKRKKVGLIGSPCIVFTWQAAYC 
EESGGVWPGGETBHTOEGIiVO 


9130 


23031 


A 


9197 


3 


91 


QGLSLGGGGCSEPGSCHCTPAWATERDS 


9131 


23032 


A 


9198 


1 


203 


RTRGNKS I DDTS WFDDFPES D ILQPVPN 
TTEPD YKS KD WVFIiNYTYKRFEGLTQRG 
SIPTYMKAGKL 






a 

n 


Q1 GO 
7177 


I 




FTCCKNOFENHSGCLPROI 


9133 


23034 


A 


9200 


1 


99 


YAAKEL VHDRYRPT I EME RRRGLW WLVP 
RLSLE 


7lJ*> 




a 




J 


1 AQ 
If 7 


LNAVfSPMP WOT jKVOEIJDPAGHKB T iPVMR 
VPSFYVPMVDVNDDFVLLPA 






a i 

** 


0902 


984 


91 


KGPPLFFF FFFLRQGLVLWHRLE YSGMT 
TAHCNLELLVSSDP PTLSSQNAETTGVS 
HCAQPASLII^KPYIGFLHQIQKSCLPV 
TFL 


9136 


23037 


A 


9203 • 


1 


77 


LRLVI^ T SWRQAILLLWPPKVLGLQA 


9137 


23038 


A 


9204 


529 


340 


GVLLWRQAEAGELSLGGGGCSELKVRHC 
TP AWATRAET PSQKINELI KS MAVELTD 
DFIT 


9138 


23039 


A 


9205 


413 


241 


HPPPPELKLFSCISLISRWCHRHRPPCP 
YVFVIXAETGFHVGQAGHKHTYCCVSIiP 
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SEQID 
NO: of 
nucleotide 

SCUIICUtC 


SEQID 
NO: of 
pepiiue 

9CUIICU LC 


M 
etb 

ou 

■ 


SEQID 
NO: in 

09/515 1 i 
26 

■ 


Predicted 
beginning 

n it i* 1 AntS H ft 

Inmtinn 

correspond 

\na fn first 

■ Klg Ell Ob 

amino acid 
residue of 
peptide 


Predict- 
ed end 

Illicit - 

vuuc 

location 
r cures do n ' 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaninc 
OCysteine, D=Aspartic Acid, 

F^=T^lntflmir Arid F=Phpnvfnlanltift 
G=Glvcine. H— Hiftirtine. I=Iso leucine. 
K=Lysine, L=Leucine, M=Methionine, 
N^AsDaracine. P=Proiine» 
Q=G I uta mine, R=Argintne, S= a Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Un known. *=Stoo cod on. /=DO&sibLe 
nucleotide deletion, V= possible 
nucleotide insertion 














T 


9139 


23040 


A 


9206 


33 


152 


QLQGLESWGMERCALPGYPVVFTNIiCKY 
RSWIEETMRDK 


01 An 


iJU41 

i 


a 




7A 


4UZ 


WVHTRTGAHTHMHTHTHTCTHCRS PRW 
VLNLGCTFELPGALKNARAYPHPNATKP 
MTSE SVG I RPTRQVFKAPRAEQHCSTA 


m At 

9141 


23042 






4o 




dm PPA AVR RHR <? WT tPnVLSRLAAARPP 
RRGTRRGRGRAGGAGGGRDRVGSRRBPA 
AARTAAEGERAVRGKC j 


9142 


23043 

* 


A 


9209 


15 


206 


PIEPLSERTNSAGAPPPASLPPCSLISD 

rtnn ewnPTl Q Vnun D CP DHFlQ hMOTQT Xti A 

KVPVSKD 


9143. 


23044 


A 


9210 


475 


110 


NLLAPHLLVSLEKHS I RMGVAQ FS RCHL 
SQLPLARKENSLTP CASGVRRCXA£iLiRL 

Q KS P VF CVAHAG S CRLELFLFGHLPL 


9144 


23045 


A 


9211 


420 


132 


PPPPNYFGGAGPFPPPPFFLRPPPPFFF 
LPPKKKKIFPPRGPKKIFFLKAPPPSFF 
irr urlfrE sec cccccS r cs rsr re ccct: 
LVRSHFILTIM 


9145 


23046 


A 


9212 


353 


-K= — 

87 


r iff aaUaf Fo.br rr aireAA.c&c XrMf w 
YLKYPLLPPSFFLFFFTPPPPXFFFFFF 
FFFLFPPPFFFFFFFFFFFFFFFFFFFF 

PFFP1") 
r c c c u 


9146 


23047 


A 


9213 


2 


276 


FIQIiLKRRIiCELGYEVLPHPSYSHDLLP 

TWYH FFJCWT ifJUPT^R Tf FFHNOODAFQ A P 
l iM inr c ivnjjuj.Nf xi^xuvc xxl\ rtn>n.c 

QEFAESQSMEFYATGINTLISPWQKCVD 

CSVSYFD 


A1 A1 

914/ , . 


ZjU4o 


a 


5/Z14 


.4 AO 




T ,ci(^. Q T. p Q Q WnVP C?P P PKPG YFFYPFIj F 
LVKTGFPHLAQAG 


9148 


23049 


A 


9215 


386 


37 


GFKKRVCS PFFFFFFFFFFFFFFFGREG 
FFLOKKKVTCNNPSSNECIiLTLGIjOGKKE 

* 1* XJ^4\A\i\i V X will t wUli 1 A ^J>iW 1 l\fwm\LI 

IQPALMKKNSISQS 


9149 


23050 


A 


921 4 


1 


142 


KDFR VQ VWYNHS SLQPQPPGFKQSSRLS 
LPKY WD YRHEYLHC I QLAP 


Ol <A 






yzi / 




lol 


FFFRXFFF FXXFFXXXXFFF F FXXXXXX 
XXXXXXXXXXFFFFXFFFFFFFFFFFXM 
FOOEKLLFI FYVNIWTF 


0151 


23052 


A 

• •> 


9218 


2 


87 


SGSLI fgli YICMCVCVCVCVCVCVYPC 


9152 


23053 


A 


9219 


420 


271 


YYCI FRETGFHHVGQAGLELLTSGEPPA 
SASQSAG ITATAPS PFLTFLLV 


9153 


23054 


A 


9220 


139 


413 


SGYHS KTLFQKKKKKKKXKKKKKKKKKK 
KK3CKKKKKKKKKKKKKKKKRGGGAFKKK 
I FFRGGGGNFFFFLGPQKNIGGGGLKKR 
GGGKKPPPPQIFFFVG 






A 


go 91 


134 


WW 


LHRWGPAMLPRI ILNTWPGVILLPRPPK 
VLELQA 


9155 


23056 


A 


9222 


364 


280 


STHLSLPKYWDYRHEPPCliAKKDAQSFS 


9156 


23057 


A 


9223 


1 


147 


TLLPMLESSGTIMAHCSLNYLGSSDPST 
SASQSARITSVSHCTQPPLPF 


9157 


23058 


A 


9224 


1 


208 


AGAPPPASIiPPCSLISDCCASNQRDSVG 
VGPS E PGAGYHLWRRLSQS EKRS IRVG 
VTRFSSIiLLSSHA 


9158 


23059 


A 


9225 


1 


145 


FQHVGQAGPKLLTSGDLPASASQSAGIT 
VSNRVRP KTKNF YTGNTRPN 


9159 


23060 


A 


9226 


396 


210 . 


KCWD YRHE P P YPALTNLLFI EMGS YGVA 
RAGLKLLGS SNPPI SASQSAGVPGLSHH 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

t» 

od 

■ 

• 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A =A la nine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F»PhenyIalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=L€ucine, M=Methlonlne, 
N=Asparagine, P^Proline, 
Q^GlutaminejR^Arginine, S=Serine, 
T=Threonine, V=Valioe, 
W=Tryptophan, Y«Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQPiF 


9160 


23061 


A 


9227 


2 


154 


FHHFGQAGLE LLT S S DLPTS ASQSAB I T 
GM SHHTQPRMVF FSNFS STAQC 


916J 


23062 


A 


9228 


1 


238 


FFItXtALGIFILHAIYIiSIYLSIHFPSMH 
PSIiSIYLSIYLSIYIiSIYLSISLSLSLS 
I YLS I YLSIYLS I YLSDFLSSAR 


9162 


23063 


A 


9229 


3 


65 


RGDLITEPFLPKLLTKVKGS 


9163 


23064 


A 


9230 


25 


188 


YFRTAWNQAWWYVP I VSATWEABAGGLL. 
EPTSSRLRCAVIAPVNSYCTSTWAVS 


9164 


23065 


A 


9231 


44 


232 


YDLKFNNHWFYFLFETEFHSLSKVBCN 
GAISAHHNLCLCDGARLHHRRGGKKTKJC 
QKTLPS 


9165 


23066 

* 


A 


9232 


3 


195 


FF VGLGS P YWQTGLELLCSGNPPAS AS 
LNARITOMSHQVQPNTGRLSFYLGSSIY 
GSIHAMI 


9166 


23067 


A 


9233 


380 


244 


SVPLAEMGFHHVGQASPELLPPVIHSSQ 
PPKAI GLQARATAP S LH 


9167 


23068 


A 


9234 


11 


101 


LLLINTLMFIHLSPILLLSLNPDIITGF 
EG 


9168 


23069 


A 


9235 


411 


226 


FFFFFATNSQGGPTSFWGPRNNAQVAGP 
LRNKI FTPVGSPKKIFPSWETPVF 


9169 


23070 


A 


9236 


404 


262 


QGLAMLPRLVSNSWTQAILLPWPPNQSA 
GGSGVSHHTQP I F I S EN IH 


9170 


23071 


A 


9237 


369 


152 

i 


HAFFVS IWRSFHVSNSYMTVTVQMYHNL 
SIPLPDPHFFFRGGGLAMLPRLVSNPWP 
QAILPPQPPKVIiGLQA 


9171 


23072 


A 


9238 


389 


190 


RPKKKKKFFPPPGGKGVFFKGAPPPYFL 
NFIFFFFFFFFFFFFFFFLIYTHNFLIL 
FLNCDKLYIT 


9172 

• 


23073 

• 


A 


9239 


330 

* 


3 


GGMEPPFFPPTGAQWGYFGSLQNSPPEF 
KQFLCLRLPNVLEYKSIPPCPTNFCSFY 
KNGI FPCWPGWPFKFQKKKKISWVWWCA 
LAEAGGSLEPRWRWRLQSAKMAPL 


9173 


23074 


A 


9240 


122 


248 


KKKKKRGGLTPVI PPLWEAKGGGSRGQK 
IQTILAGGGKAPLY 


9174 


23075 


A 


9241 


2 


286 


FFVQTE FHHVAQAALQLVS SRDP PTSAS 
QSVDITGVSHCAQPVIVSCDDQVTLVQR 
RQVTVTSLIAFCQVAYPKGAKSQWPQSF 
FFSSPFLFSL 


9175 


23076 


A 


9242 


2 


95 


HMKKCS S S LAIREML I KTTMRFFFFKL I 
TCY 


9176 


23077 


A 


9243 


377 


293 


PLSPRLVLSSWAQMILLPLPPKVLGSQA 


9177 


23078 


A 


9244 


422 


147 


KKGGTQGFS PPRVFKRGPRNFFFGPPKK 
KKNPPPPRGKIFFFLKGPPSFFFFFFFF 
YFFFFFFFFFFTHKYFLLGDLLLIYHLS 
KKNYSFF 


9178 


23079 


A 


9245 


402 


208 


TIGRCHHTQLNLKFFVEMGSHYWQAGL 
KLLDS TNPPSSAFQSAGISGVNPYAWTE 
IMFLYSIPI 


9179 


23080 


A 


9246 


314 


168 


E P YHLHFLP PHHYYYCRWGLI VLPMLVS 
NSWAQVILLPWPPKVQGLQV 


9180 


23081 


A 


9247 


3 


320 


WQAGLKLLSSCDQPASASQSAVITGVN 
HHARPL I FLYEMD YGSRTVL I IRKWDDN 
EQSNRTI FLKKDBLTGNRSACLKYTPMD 
INVNSGLGWRYRFGMHHQCRW 


9181 


23082 


A 


9248 


22 


303 


KGS YFS FT YE AQTES CS IARLECRGAI S 
AHCNLRLLDSSNS PASASRVAETTDHKT 
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SEQ ID 
NO: of 
nucl&fttide 

sequence 

• • 

• 


SEQ ID 
NO: of 
nentide 

sequence 


M 

ctti 

od 

- 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

** ■■■■■■ X m» B W 

residue of 
oentide - 

• r r 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
dins to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=*Aspartic-Acid, 
E=Glutaroic Acid. F=Pbenvlalanine. 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lvsine. L= Leu cine. M=Metnlonlne. 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Areinine, S^Serine. 
T^Tbreonine, V=Valine, 
\V«Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=posslble 
nucleotide insertion 




■ 






• 




. LVPERS CFTPKERNVIQKGQES SEKMSN 
HLFTWLPILH" 


9182 


23083 


A 


9249 


13 


174 


LGLKVRHRLWLIFVLSVETEFHHLGQAG 
LELLTSGNPSAG I TGVSHCARPTSHS 


9183 


23084 


A 


,9250 


3 


141 


HVGQTVLRLLTSGDPPTSASESAQITGV 
SHRPRQIFISLFTICIR 


01 fid 
"Ion 




A 






262 

XUX 


LRI PLT FGG WKGEEEDRAVMHLRLCGD 
DS PMSES LHF I KL PRTVLVDEDARPBKK 
KKKKKKKKKKKKK 


Ol ft* 
If loJ 


X JUoO 








X fj 


PADLFI FIWEDGGLTMS PRLVMNSWSQA 
TLQPWTPKVLGLQV 


9186 


23087 . 


A 


9253 


411 


258 


PKVFKYRCDPPRIiAWFLFFVQSLLPRLG 
LICSWAOAFLP PWPPRVLGLOA 


9187 


23088 


A 


9255 


391 


167 


GLKILTSGDLPASASQSAGNTGMSHCAR 
PLMSFKIRYFLFTIKMKLKLLWPVWSA 

RVHPP WF^ P P T VT TMKPR I 
u vnrrnr o it a. v j_ i. i* ivt x\ a 


9188 


23089 


A 


9256 


9 


255 


FFCRYGGLAVLPRLVSNSWTQAILSPWP 
LKVLGLQACTHHF FLMVNSNAARIKT I E 
MLLSEDVNFFVALLCPSFTISHFHL . 


9189 


23090 

♦ 


A 


9257 


487 


42 


RYF I FLF YF I FKTETGS YYVAQARLRLL 
YTFLFTHIP 


9190 


23091 


A 


9258 


1 


243 


ILSSGGSFISLTAVILIIFMIGEAFASK 
RKVT.TVHEPS INLEWLYGCPPPYHTFEE 
PQPKKSEIYSKCFFFSVFVHFGGWG 


9191 

* 


23092 


A 


9259 


411 


2 


FFFFSPPVSLSLCVLBSSPPPFFFFSPP 

PK fffTRF <5 PP PFF F <S PR VFPP PVFFFP P P 

PFFFFPPLKKNFSPPPPGKI FFFFFRPP 
PFFFFFFFLFFFFFRPFFFFSFFFLRWE ! 
FA/LYFIMCHHISDAYPPNVDMNKE 

4,-J V XJ A X> JU* X^^iAii* W4>A *T X7 Xj^ • x-# ^ UV^\U 


9192 


23093 


A 


9260 


417 


219 


SVEMGFLHVGQAS I KLPTSGDPSASGSQ 
SAGITGLSHCARPHSQLS IVCCYSVPVQ 
CLHSLAARY 


9193 


23094 


A 


9261 


408 


172 


CPNHFRKWGHRGCPP PRVF FFVGRDGVY 
VAQAGLELLDLSNSPASASQSNGTTGMS 
HLGOPLLFILNDVI AVI PLQGSF 


9194 


23095 


A 


9262 


401 

< 


132 


HQDNKPAPPGFFFFSPPPKKGFFSKPFY 
FVSPRFFPFPPFLNPAPLFYFWGPKKKI 
FFFPPPAIKFFFFKRAPPLFFFFFFFFF 






A 






XXV/ 


LF FLNK I PFFP PGGS KGGD FGPHTNPP P 
GVKPIFRPHPPEEGGPKGPPPRPREFFIi 
GF 


9196 


23097 


A 

* 


9264 


206 


438 


FPIL KTHFRLGTVAHTYNPTTLGGPGGR 
NTCS KFKTSLGTWGD P FFH KS ENCIjGWG 
VPNCGPCFWGAQQKSAWSEGL 


9197 

* 


23098 


A 


9265 


333 


48 

• 


AIxLAGPTPFFFLGKKKTRFGNRALFFFF 
FFFLNEVLLCNPGW IMWAIHRCDHGTL 
QPPTPGLKRSSCLSLSSSWNYKLKPLHP 
AGVRFLDFVS 


9198 


23099 


A 


9266 


3 


275 


TGFHH IGQAGLKLLTLGDP PLSASHSAG 
I TGVSHHAWPMHLES YWRVQYPFMPQVL 
YFWAPSGAQHCAQ I IENAIPTLLSRPQE 
EPASEA 


9199 


23100 


A 


9267 


49 

« 


226 


HGPEPCMDADAQREGINVSARYPKNWES 
TVDPAF^FTMIYSAPLMLLADPDEFASV 
QLA 


9200 


23101 


A 


9268 


159 


236 . 


PRLVLISWPQAILPPWPIKWGLQA 


9201 


23102 


A 


9269 


1 


208 


IFNYCVFSSGVALAYGIYKQDLPALKEK 
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SEQ IB 
NO: of 
nucleotide 
sequence 

* 


SEQID 
NO: of 
peptide 
sequence 


■M 
od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K~Ey$ine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=$erine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=po$$ible 
nucleotide deletion, \=possible 
nucleotide insertion 








• 






PRGWFVDMGHSAYQVSACAFNRGQLKV 
S I YFFSRRLLWW 


9202 


23103 


A 


9270 


1 


209 


IFNFCVFSSVVALAYGIYKQDLGALEEK 
PRNWFVDMGHSAYQVSVCAFNRGKLKV 
SIYFFSRRLiLWW 


9203 


23104 


A 


9271 


379 


263 


ESCSVGRLECSGAVPAHC^LCLCLPDSS 
GSPASSSRLC 


9204 


23105 


A 


9272 


312 


3 


FRREGLAALQPWWPNKVSSKKKRKKERE 
RKKERKKERKKERKKEDGATADGISEAE 
VIRVHPYPVGFGVLSEGDIWTEDMRAGR 
RPHAGESGDTVMLPQAQE 


9205 


23106 


A 


9273 


405 


246 


PTKKNKYLHAPPLFHFFFIIPNPNFFFF 
FFFFFFNFFFFFKTKGCEFLYFSY 






A 


Q77d 


7 


977 


LVETGFRHIAOAGLOEiLSSSYPPTSASO 
SVGITGVSHQPGPSFHFERKSLGLKTLL 
LCVKVGVRACFALFFQGSS 


9207 


ij IVQ 


A 

* • 


9275 


70 


304 


NL FLKAST YPSEHL YHLP PTP IVY IRKR 
LDAMAHACNPGTLGDRGGW I ARTPDLKC 
5 AYRGLS KCWDYRHE PPRPVLV 


9208 


23109 


A 


9276 


2 


260 


TVFYHVGQAALELLTSGGPPALASESAG 
I TGVSHHAQPFLLI L VT SLFPCNQS FHI 
SPYRFSFTQLiraGFNTVLYPLVINISEI 
FF 


9209 


23110 


A 


9277 


406 


330 


PRLVLNS WTQTI LLPWPPKVLGLDV 


9210 


23111 


A 


9278 


2 


160 


ELLTSSDPPASASQSARITGMSHHTRLK 
YIVYRVS RKS KNI PLKHIG IQS FP 


9211 

« 


23112 


A 


9279 


1 


91 


GLT VF PRLVWNS WAQM I LP PWP PKMLGL 
QV 


9212 


23113 


A 


9280 


3 


107 


LGETGFHHVGRAGLELLNKVMHPPWAPR 
VLGLQA 


9213 


23114 


A 


9281 


336 


234 


AHYKLLLKI KLPGQAQ WLTPVI PTLWEA 
EAGRS 


9214 


23115 


A 


9282 


3 


241 


DRPEPCWDADAQREGINASARYPKNWVT 
TGDPAREFTMIHTAPLMLIiADPDEIESV 
QLAQSWPFGAIASLGRSPCRKRN 


9215 


23116 


A 


9283 


404 


277 


I FFVETRSHHIAQAGLELLGSSDPPYFF 
QVFTEMSLSQRALF 


9216 


23117 


A 


9284 


399 


231 


HYVAQASLKLLYSGDPPASAFQSAEIAG 
VSHCTWPTGSLNKIQNNIHSRISSM 


9217 


23118 


A 


9286 


394 


231 


AFFFFFFFGERGFHHVARTVLKLLDSSN 
PPAWSQSAGITDVSHDFLISLYRIY 


9218 


23119 


A 


9287 


420 


311 


LNF CRDGLAML P RL VLNS WAQAVLS PQ P 
PKVMGLHA 


9219 


23120 


A 


9288 


398 

■ 


173 


LLSFFYLFFYFFFFILFFLFFFFFFFFF 
LMFKALLFS CHYFPAMVTFKFSLS I SGV 
VRTSGYIMLFSYLCLLKTM 


9220 


23121 


A 


9289 


270 


23 


TP EQCRG I KLFI YS L IL YVRDGVLLCCS 
GWPWAIHKCDHSTLQHQIPSLKQASCL 
ShLSS WE YRYAPLHLAS ANFLfCVTW 


9221 


23122 


A 


9290 


273 


46 


HLLNMPLFTEIWTERLFLFYILCFVFLV 
ETGFFYI TQAGLKLLTS SDPPAYAS QSA 
EIIGTSHCTRPICYVNVTOT 


9222 


23123 


A 


9291 


424 


230 


CLPFTHSLSLSLSLSLSHTHTHTHTHTH 
TIYMNK 


9223 


23124 


A 


9292 


2 


146 


NLFIiVBTRSHHVARASFKLLSSSNPPTS 
ASQSSEITGASHRTWPEVFF 


9224 


23125 


A 


9293 


393 


202 


GRERERKREREREKERKKERXKERKPER 
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SEQID ' 
NO: of 

n it r IpaH H a 

II UtlCVUUC 

sequence 1 


SEQID 
NO: of 

npntiHp 

sequence 

* 


M 

eth 

od 

• 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 

location 
corresnond 
ins to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 

otide 
location 

corresoon ■ 
ding<to 
last amino 
acid 

residue of 

neotide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

P^OIutflmic Acid F , =Phpnvln Inning 

G=Glycine, H=Histidine, I=Iso leucine, 

IC = LiVSine. I ,=I,p urine. M==IVfettiinnine- 

N=AsDaraeine. P=Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine. V=Valine» 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *«Stop codon, /^possible 
nucleotide deletion. V^oossible 
nucleotide insertion 




• 






■ 




PSGSLTYCVREGITAANLGKRRHKASKT 
LISHKNKN 


9225 


23126 


A 


9294 


252 


193 


GFCCCCCCCYCCCSDKRHPG 




71 177 


A 


y&yj 


400 

HUy 




SOYFGRLRRVDCIiS SGVODOPGOOGETP 
SLLKYKIYIHISWAGWHV 


9227 


23128 


A 


9296 

• 


2 


213 


FCCGLLIHLIYCKTEYSGQSTVAIHRCN 
PLCPAYIFNLSIGY 


9228 


23129 

• • 


A 


9297 


169 


408 


RFARHMNSPHEERLPPPAGIRVRPPPQP 

2X J^RT PP M A P Q T <5 1?T3 POMT P WTTHNTiP 
rvrtO J-JV3 ir ir Ivl IttJT Olo IvTVC ^i*J J. tr X X. IVXWX/X? 

AGQAQWLMPWPALWEAEAGGSSB 


QOTQ 

yJu,y • 




7V 




1O0 


CI 


T .WP P PP <JT .Q T .CT. QHTHTHTPRHTUTHTH 
. xxa.tr tr tr tr o jj o n x n j. xi x no n x *i x n x n 

TLWLGRPGPAQDWLVWGNLT 


9230 

i < 


23131 


A 


9299 


1 


176 


MGSHSLAQARMQWCNQSSLQPQPPGLKQ 
ror r siiroonu i iui v rrv>iind x r rulUin 
YH 










412 




ITGVSffiiNWPHCGFNLVFLMPDDVKHLP 
MCIIJVICVSSL 


9232 


23133 


A 


9301 


195 


347 


KIKGPRPGATAHAGNPSAxjGHQGORTAI 

ZVWT?T7 r PT e MQHM/1P PrT .VP WTTM 


9233 


23134 


A 


9302 


386 


190 


GFLFFFPPPKKEVFPHPFILGFPGFFPP 

P PTTlTTf P A PT7 T?T?T?WnPTfIfirVKrQ PPPP HT .W 

FFFFKGAPP 




ZJ 1 jj 






£i 




P DQHGKTLS LL KKYKI S RAW 


9235 


23136 


A 


9304 


374 


121 


QWCDHGSLQ PQT PGLKQS SCFSLP KRWD 

VP RF P Pff PTQT PV T VAWPPTT iPP PPT . «I 

AL WKHL IHQKCDGVMVXiQVPGRTHNVTL 
T 






A 




7ft 

1 




I TGVSHHAQ P I SHF 


9237 


23138 


A 


9306 


378 


256 


QKWVS I CCPGWS QTFGLQES SCLGDPKS 


9238 


23139 


A 


9307 


206 


3 


PCHIjGWSAWQSWIjTAASNSWPHAILPP 
KILNKEIKRGR 


Q73Q 
y^jy 


XJ AtV 


a 




J JO 


101 


HFKKKI YI YI YVYI HTYI YI YTHIHI YT 
HTYTYIYRYIHIYIHrYIYMYVYIYIPS 
LPF 


9240 


23141 


A 


9309 


503 


288 


FFLNLDKSQEWWLMPVIPTLREAEAGGS 
LEVRSSRPASSDPPTSASQSAGUGMRH 
HARP VVLRENGR I ON 


9241 


23142 


A 


9310 


418 


309 


FFCKNGISICCLPKLVSNSWPQAILQPQ 
PPKVQGTL 




23 14"? 


A 




449 


77*. 


VSPPPI/5GGFLS PPFKP PPFFFNLKGGP 
PFPTIPFLIIRGKKFCPPPPFFFPGLF 


9243 


23144 


A 


9312 


2 

•* 


95 


HENRLSLGGGGCSELRLCHCTPAWATER 
DPI 


9244 


23145 


A 


9313 


318 


154 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFI 
FILFIYSLLYWCVGVLGS 


9245 


23146 


A 


9314 


388 


199 


EQPKCVHRIKQIWHICALEYYAAFKKKK 
VLPYAVIQIKLEDTMLSEINQSWGKYCM 
IPLIGGI 


9246 


23147 


A 


9315 


2 


97 


STPGLKRSSHISLPKYWDYRRGPPGPAS 
IIiF 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
oeDtide 
sequence 

- 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ins to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nude- ■ 

otide 
location 
corresDon 
ding to 
last amino 
acid 

residue of 

Deotide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenvlalanine. 
O-Glycine, HHHistidine, I-Isoleucine, 
K m Lysine, LHLeucine, M=Methionine, 
N=AsDaraeine, P=Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
■p=Tbreonine, V=Valine* 
W=TrvDtoDban, Y=Tvrosine. 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion. \=DOssible 
nucleotide insertion 


9247 


23148 


A 


9316 


3 


235 


TMWNAEGI P I RTTLDNSTTVQYACLLH 
HLTMKAKSTVRDIDPQNDLTPLRIRSKK 
HEIMVAPDKEYLLIVTQNPCE 


9248 


23149 


A 


9317 


253 


71 


LFFFFLFFVDTKFHHLAQDGFQLLSSSN 
SPTSASQSGGITGRSHGAWPYVHLLTLQ 
KLNY 


9249 


23150 


A 


9318 


425 


95 


CLLFC PKKKKKKKKKKKKKKKKKKKKKK - 
KKKGGP IKKKKKQP 


9250 


23151 


A 


9319 


25 


214 


VR I DDRPE PCMDADAQREG INSS ARYPK 
NWVTTGDPARECGMIHSAPLMLLADPDE 
FVSVOLA 


9251 


23152 


A 


93,20 


50 


231 


GHRPEPGRDADAQREGIIASARYPKNWV 
TTGDPARECTRIQSAPLMLIADPDEFVS 1 
VQLA j 






a 




Af\1 


97 £ 


LAATTGTRHHAQP S KFRNLKGWLCAKRA 
RT 


9253 


23154 


A 


9322 


410 


159 


LGSLQLPPPGSKRFSRLSLPSSWDYRGC 
SRLRLRHCfTPAWVTEODSVPPKICKKOI P 
RPFPFSYGESLGVGLRHLCFSKPSRRF 


0954 




a 






95 


KWGGLFFFS PPKKRGFFKOVFFCVPPVP 
FFPPFFKTGPGDIFI^PLKKKI I FPPPG 
GKKFFFKRGPPPFFFFFFFFFFFFFFFF 
FSEVECVCLCR 


9255 


23156 


A 


9324 


406 


289 


PGFYHAGQAGLKLLTSGNP S AS ASQSAR 
ITSISQRTRPH 


0956 ' 


931 57 


A 




i 
i 




LHSVLLPFFYFCVCVCVCVCVCVCVKHK 
YVRAGGAPGR 


9257 

• 


23158 


A 


9326 


2 


158 


FIFLFTVGGLTGIVLANSSLDIVLHDTY 
YWAHFHYVLS IGAVFAIIGGFIH 


0958 


9315Q 


A 


Q397 


* * 


IV / 


PGCS E PRTjRHCTPAWVTEGDCVS KKKKK 
VPSAWG 


0950 




A 




957 


9 


GFF SN I SAS VI I P P P I SRKNL FFPFR VY 
FGPPPKTFKGPPPFFFFFFLS PRLECSG 
VIMAQCSLTLPGS SDLPLSAS CVAMTV S 
I 


9260 


23161 


A 


9329 


66 

■ 


384 

■ 


GNLCGWMKSMAWDNPWRKADWGQIRES 
LE CO PNTGMLKKJOCKKKKKKKKKKKKKK 
KRGGPFLKISWGGPILGGRGKIFFFFFG 
GG I LTPRGVFWKS P FFWGGKKK 


9261 


23162 


A 


9330 . 


1 


98 


IIGQVASVLYFTTIIjILIPTISLIENKI 
LKKA 


9262 


23163 


A 


9331 


33 


412 


NLF YVMAE IIxHTHIQLCCQVKIiHLSVPR 
PSLPLLCCWNFSLVLHEHSQPASEKKKK 














KKKKNKKKKKKKKKKRGGVFKKKPGGAP 
FCGGGENFFFFFFGGGLKKPRGVPGKNT 
FFWGGEKRPPPPPY 


9263 


23164 


A 


9332 


401 


311 


PTRPVDLLVDDSVKESLIGYGPTEDSKE 
FS 


9264 


23165 


A 


9333 


5730 


452 


TRSAPRAASWLEDPREVRSVCLSATFFT 
LSLQKPRLLLFS PSWHLGVPLS VGVQL 
QDVPRGQWKGSVFLRNPSRNNVPCSP K 
VD FT LSSERD FALLS LQVPLKDAKSCGL 
HQLLRGPEVQLVAHSPWLKDSI^RTTNI 
QGINLLFSSRRGHLFLQTDQP I YNPGQR 
VRYRVFALDQKMRPS TDTITVMVENSHG 
LRVRKKEVYMPSS IFQDDFVIPDISEPG 
TWKISARFSDGLBSNSSTQFEVKKYVLP 
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SEQID ' 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine . 
OCysteine, D=Aspartic Acid, 
E=GIutaroic Acid, ^Phenylalanine, 
(^Glycine, H=Histidine, ^Isoleudne, 
K=Lysine, L=Leucine, M=Metbionine, 
N-Asparagine, P=Proline, 
Q^GIutaraine, R=Arginine, S=Serine, 
T»Threonine, V=Valine, 
W=»Tryptophan, Y^Tyrosine, 
X»Unknown, *=Stop codoD,/=po$slble 
nucleotide deletion, \=posslble 
nucleotide insertion 


• 

* ■ 

■ 


♦ 

■ 

* 

• 


• 

■ 




• 


* 

* 


NFEVKITPGKP YILTVPGHLDEMQLDIQ 
ARYIYGKPVQGVAYVRFGLLDEDGKKTF 
FRGLESQTKLVNGQSHISLSKAEFQDAL 
EKLNMG I TDLQGLRL YVAAAI I EYPGGE 
MEEAELTS WYFVSS PFS LDLSKTKRHL V 
PGAP FLLQ AL VRE MS GS PAS G I PVKVSA 
TVSSPGSVPEVQDIQQNTDGSOQVSIPI 
I I PQT I S ELQLS VS AGS PHPAIARLTVA 
APPSGGPGFLSIERPDSRPPRVGDTLNL 
NLRAVGSGATFSHYYYMILSRGQIVFMN 
RE PKRTLTSVSVFVDHHLAPS FYFVAFY 
YHGDHP VANS LR VD VQAGACEG KL ELS V \ 
DGAKQYRNGESVKLHLETDSIiALVALGA 
LDTALYAAGSKSHKPLNMGKVFEAMNSY 
DLGCGPGGGDSALQVFQAAGLAFSDGDQ 
WTLSRKRLSCPKEKTTRKKRNVNFQKAI 
NEKLGQ YAS PTAKRCOQDGVTRLPMMRS 
CEQRAARVQQLDCREPFLSCCQFAESLR 
KKS RD KGQ AGLQRAL E I LQEBDLiIDE DD 
IPVRSFFPENWLWRVETVDRFQILTLVfL 
PDSLTTWEIHGLSLSKTKGLCVATPVQL 
RVFREFHLHLRLPMSVRRFEQLELRPVL 
YNYLDKNLTVS VHVS PVEGLCLAGGGGL 
AQQ VLVPAGS ARP VAFS WPTAAAAVS L 
KWARGS FEFPVGDAVS KVLQ IEKEGAX 
HRE EL VYELNPLDHRGR TLE I PGNSDPN 
MIPDGDFNSYVRVTASDPLDTLGSEGAL 
SPGGVAS LLRLPRGCGEQTMI YLiAPTLA 
ASRYLDKTEQWSTLP PETKDHAVDLIQK 
GYMRIQQFRKADGSYAAWLSRDSSTWLT 
AFVLKVLSLAQEQVGGSPEKLQETSNWL 
LSQQQADGS FQDPCPVLDRSMQGGLVGN 
D ETVAIjTAFVT I ALHHGLAVFQ DEGAE p 
LKQRVEAS IS KANS FLGEKAS AGLLGAH 
AAAI TAYALTLTKAP VDLLG VAHNNLMA 
MAQETGDNLYWGSVTGSQSNAVSPTPAP 
RNPSDPMPQAPALWIETTAYAT,IjHLT.T,H 
EGKAEMADQAAAWLTRQGS FQGGFRS TQ 
DTVIALDALSAYWIASHTTEERGLNVTL 
S STGRNG FKSHALQLNNRQ IRGLEEELQ 
FSLGS KI NVKVGGNS KGTLKVLRTYNVL 
DMKNTT CQDLQ I EVTVKGHVE YTMEANE 
DYEDYEYDELPAKDDPDAPLQPVTPLQL 
FEGRRNRRRREAPKVVEEQES RVHYTVC 
IWRNGKVGLSGMAIADVTLLSGFHALRA 
DLEKLTS LS DRYVSHFETEGPHVLLYFD 
SVPTSRECVGFEAVQEVPVGLVQPASAT 
LYDYYNPERRCSVFYGAPSKSRLLATLC 
SAEVCQCAEGKCPRQRRALERGLQDEDG 
YRMKFAC YYPRVE YG FQVKVLREDSRAA 
FRLFETKITQVLHFTKDVKAAANQMRNF 
LVRAS CRLRLE PGKE YLIMGMGLDGAT Y 
DLEGHPQYLLDSNSWIEEMPSERLCRST 
RQRAACAQLNDFLQEYGTQGCQV . 


9265 


23166 


A 


9334 


46 


324 


SKYLPPVITFQHEIWVGTNIQTISDWVL 
ICCPGWSAVTQSRLAAASTSWAQAMSSS 
S CLGLS SS CYYRHTPPCLANFVAFVETK 
LYTMLPGLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 

• 


SEQ ID 
NO: of 
peptide 

r r 

sequence 

■ 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nude- - 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Asparf ic Acid, 
E>=G)uta mic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine> L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine,-- 
^Threonine, V=Valine^ 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


9266 


23167 


A 


9335 


338 


178 


EVDIAVSHRRKQAFNLCLLKCKPHNNCL 
NLEGGGSSBLKLCYCIPAWATDQSL 


9267 


23168 


A 


9336 


396 


254 


KLSTDCTSKEI INRHPTEWEKI FANYAP 
NKGPTSTTHSSCLMFPYLF 


9268 


23169 


A 


9337 


454 


359 


S SWD YRHAP P C PAN FVFL VET I FIQGEI 
GLL 


9269 


23170 


A 


9338 


540 


328 


IKKRGAPRRGVLFFFTRKKKFFSYYKKS 
TKERVSFLFVAPGCIQGGKKKKPPTKKK 
KKKKKKKKKKKKKKK 


9270 


23171 


A 


9339 


21 


209 

* 


AEEFPDFKLYYKANQNTYRGQWNRTETS 
EITPDIYIjGIQLVSNSWPGVILIjPWPPK: 
VLGLQA 


9271 


23172 


A 


9340 


345 


2 


KIFFFIiIYILFPFNFFFFFFFFFFFFFF 
FF^FFLENRWFLMENFIKGRGHVKGPIjG 
QARDHDI SGRRWWRREQCRASQL PGVRR 
S LLOVQ AG GPVH I R I SGLQDARQGPRYA 
GA 


9272 


23173 


A 


9341 


15 


252 


LERVIRQSAALTDSFFNFILLILFIYLF 
FETGS PCVSQAGLKLIjGSRDPPVSASQS 
ARI IGVSHRAQPALTDFPS ISF 


9^73 


23174 


A 


9342 


484 


276 


RHAPPCQANFFVQTRFCHVAQSGLKL IR 
SRYLFASAFQSVRITCMSHCARPLYFLM 
FKRLQ YLKS PTKS 


9274 


23175 


A 


9343 


464 


326 


LVETGPHHVGQAGPELLAPCDPPAPASQ 
SAETTGMSHRARPPANFV 


9275 


23176 


A 


9344 


486 


280 


S FSHSHS CS LPS S WD YRCTS SHLANFFF 
KRWGFT I LPRL I LNS W PQVI LLPQSHRH 
EPLCLETLKTLNV 


9276 


23177 


A 


9345 


276 


179 


DRDFTMLPRLVLNSWPKTILLPLPPKIL 
RLQS 


9277 


23178 


A 


9346 


45. 


143 


RWNSRQENCLNLGGGDCSEPSLCHCPPA 
WATE 


9278 


23179 


A 


9347 


229 


73 


PPPHMI FFFFFFLFFFYFFFFFFFFFFF 
FFFFFFFFSRLQVSVYCSSVLNKNVD 


9279 


23180 


A 


9348 


391 


139 


IPSLKRSSHLSIAKCWDYRHKPPCQVTF 
S FLNVRI YATKFFLS TPLAAWS KYWYL V 
FLLS FS S KYYLTP I S FLFLTHNVS SS I F 


9280 


23181 


A 


9349 


53 


191 


IYVTYIWYSEVSHAYNSSTLGGQGGRIT 
GAQEFKTSLVNIAKPLLY 


9281 


23182 


A 


9350 


373 


235 


EEGELLEPGVQGCSELSWCHCTPAWGTE 
QDHCVSYNNSNTHNNVNY 


9282 


23183 


A 


9351 

■ 

■ 


476 


82 


YLATYLPTYLS I YLPTYIiPI YLP I YLYL 
GSHY I AQAGLKFLGWVGHNGSCL 


9283 


23184 


A 


9352 


286 


142 


ELQFKMRFGWRHSQTISWGLSLLPRLIS 
NSWAQVILLTWPPKVLGIQA 


9284 


23185 


A 


9353 


3 


102 


FFETESHSVARLEYSGAISAHCTPVIKL 
YLYT 


9285 


23186 


A 


9354 


3 


270 


TAGAPPRPANFCIFSRDGVSPCWPGWSR 
TPGLRCDPPASASQLLGRLRHENCLNLG 
GEGCTGAIIiAHCTSTLRAQAILQPQPSC 
PAPQ 


9286 


23187 


A 


9355 


1 


112 


GEGSAHCNLCLPGSSNS PASTS QVAGIK 
GVSHRARPN 


9287 


23188 


A 


9356 


1 


63 


VAIj IQAYVFTLLVS LYLHDNP 


9288 


23189 


A 


9357 


244 


140 


QRRHTHTHSHTYTHTHTHIPNKKI PILT 
ILKCIA 


9289 


23190 


A 


9358 


464 


181 


LGAGGMTG I LFS1X5AS MVLGGVAQMLAP 
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SEQID 
NO: of 

nnrlAfitffip 
I1ULICUIIUC 

sequence 


SEQU) 
NO: of 

sequence 


M 

cth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 

location 
correspond 
inp to first 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

ii uviv — 

otide 
location 

ding to 
last amino 
acid 

residue of 
peptide 

pan n on fa 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Add, 

E=filutflmic Acid P=Phpnv1«l»nine. 

G=G!ytine, H=Histidine, I=lso leucine, 
K=Lysine, L= Leu cine, M=Methlonine, 

N=A<;narfi<yine P=ProIin<»_ 

rKJ 1/cl 1 aKIIIC| A A 1 VllllV| 

Q=Glutamine, R=Arginine, S=Serine, 
T^hreonine, V«Valine, 
W^Tryptophan, Y«=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, N-posslble 

IiUClcuiiiiC iiiacruuu 




• 






• 




, KARTRRI QTTDNGKQNTY PSSLDNMVAQ 
GNVLPVL YGEI^IRVGSRWSOE I STADEG 
DGGQVWIGR 


9290 

• 


23191 

i 


A 


9359 

• 


11 


469 


IDLRCRFRHEERERERERERERERERER 
ERERERERERERBREGRAPPPPLFFGGG 

RP PttP AH A T . Q PPWT/5P PPT iVTYFl?<5VUP 
AP PTE R S ARTQREARRALKS VCVVRERE 
THSLPRARCVCSHPTGCX3GAPGARVSLS 
OS AHf PIiCVnGAG 


9291 


23192 


A 


9360 


22 


172 


GLSHEERERERERERERERBRERERDTV 
CPLFCPTGGPILREARVKSGCLCF 


9292 


23193 , 


A 


9361 


15 


345 


PPNRP PAE FRTRHL I S S LLCLEG 1 1 LSL 
F I IATLJ TLNTHS L LANI VP IAI LVPAA 
CEAAVGLALLVS ISNTYGLDYVHNQKKK 


9293 


23194 


A 


9362 


61 


298 


S CDSPLQECCIHTHTHTHTHTUTHTHTH 
LFQGTLI FGHTLFWDKPLWQKEF 


92S/4 




TV 






2 


wiftor Jr ir Jrv^W iv*-**^-* v irtrlrX LorKvflbf 

SLCSDFPSPKFYKEKRKKKGNKTPKRKG 
KTLLPTETI VLHVS PG CES KPGAGSADR 
RAPGVRDGGSDASSSGTRASELGDGPVR 

TS 


9295 


23196 


A 


9364 


432 


291 


GITGLRHHSQLIFLFLVETGFHHVGQAG 


9296 


23197 


A 


9365 


343 


507 ' 


GQGLTLVPQVGVQ WGNLSSLQLLP PGLK 
QSTCLSLPKCWDYKCGLPPPASFCFVL 










a n 




rlV? W ri ir \3ItV3nJr frUUEi I'lnnDU K fuS o uon V 

G S QAEGL LTS HTKGG P AETLLTS QTGWR 
LGRGCNIiGSLEGQGRRLGGGGCNEPSSR 
HCTPAWAPLSTE 


9298 


23199 


A 


9367 


500 


285 


SLPRSWDYRRAPPRPANFLFLVEMGFHH 

VRK'AVTiFTiT»T e 3nVT3T.T.T c JfTnr J PASA<5nfl 

AGITGVSHRAWQIRE 


9299 


23200 


A 


9368 


1 


229 


I VDLTT WG I PTHT I HKYTHH I HKY I HTH 
AVTHP7TWPHKHT YT Y T HTDMVWLS PHPN 

LI LNFS SHNPHVS WEGPNGR 






A 






00 

* 


KOTMT YTCI YMYVYI YTCT YTYVYI YTR 

TYTCIWCVYTCIYVCVYTCIYVCVYTC 
IYVCVYTCIYVCVYIRVYTCVCIYVYIR 
VCVYTCIYVYVYIRVYTYICIYTCISTY 
IOCIYTHIYKVTKCSG 


9301 


23202 


A 

* 


9370 


1 

• 


208 


ADETIEQRGGIPKFVHDNYVIRRTAVDR 
G I PLLTNFQ ETKLF AEAVQKS RKVDS KS 
LFHYROYSAGKAA 


9302 


23203 


A 


9371 


354 


202 


PAAQREG I TAS AR YS KN WVTPGDP ARE F 


9303 


23204 


A 


9372 


1 


336" 


KS L I KYKGYQVAPAELES ILLQHPNI FD 
AGVAGL PDD DAG EL PAAVWLEHG KTMT 
EKE I VD YVAS QVTTAKKLRGGVVFVDEV 
PKGLTGKLDARKIRE ILI KAKKGGKSKL 


9304 


23205 


A 


9373 


2 


187 


NI LVTALYSLY IFTTTQWGSLTHHINNI 
KPS FTOENTIiMFIHLS P IL LLSLNPD XI 
TGPSS 


9305 


23206 


A 


9374 


96 


240 


AC WLMPV I PALSEAEAGRLVQ PRRLRAA 
WATWQDLI S TKKKKKGGPF 


9306 


23207 


A 


9375 


3 


293 


STISAFTTSTTSATTTSTTSATTTSTIS 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

am in oxacid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucrne, 
K=hysine y JL^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threoninc, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 




■ 










APTTSTTLS PTTSTTSTTITSTTSAPI S 
SmObHKKKKKKKKQKKKKKRKNDDTKQD 
TITKIHSGGGSIr 


9307 


23208 


A 


9376 


69 


160 


EHTHTHTHTHSHTHTHSHLPERFFCVLN 
SL 


9308 


23209 


A 


9377 


336 


199 


EGITASARYSKNWVTTGDPAREFPMIQS 
APLMLLADPDEFVSVQLA 


9309 


23210 


A 


9378 


321 


128 


EPGFTMWAS IiAPTS S LQVIKRLGL PKCW 
DHRRBPPHPAYHSLYESGPSVQNGEEKG 
INPIIKMG 


9310 


23211 


A 


9379 


440 


286 


SSDPSASAS KSAGFTGMSHHTQLLTS YS 
EGECLSYLVFLVLV 


9311 


23212 


A 


9380 


392 


154 


KGSSPLSPPISLAGGKPPPPRKFFLNFF 
FFFNRDRVS PCL PRLVSNS WPQAI LP PQ 
PHKALGLQAYATTAS FI FLIVTV 


9312 


23213 


A 


9381 


403 


178 


GRRHEPCRDADAHLEGIIFSAFYPKNWE 
TTGDGAREWTMSNSAPLMLLADPDEFVS 
VQLA 


9313 


23214 


A 


9382 


303 


206 


DPVSTKICSLQPPPPGFKRFSCLSLPSS 
WDHR 


9314 


23215 


A 


9383 


215 


116 


SNNFWCCCCCCCCCCLRWSLTLSPRLH 
SCGTM 


9315 


23216 


A 


9384 


2 


285 


ARHLLMQLS GGATLAI ST IDL PSTLI I F 
TLLILLTILEIAVALIQAYDITLLGSLY 
LGDNTYKKKKRGAAFEETKFTSAGMQSK 
NFFLGTLNWR 


9316 


23217 


A 


9385 


12 


123 


GYENFLLPTLHTQTHAHTHTHTHTHTHI 
HSKDPMEY 


9317 


23218 


A 


9386 


3 


230 


KCWGYRCEPLRLAPLRCFFTPGGISLTS 
GPALLSNAS ALAEVRG QQLCPPTP FLP T 
LIALTTLLLPISPFILIIL 


9318 


23219 

• 


A 


9387 


590 


396 


ARVQWPDLGSLQPPPPGFKRFSCLSLLN 
SWDYKCAPPCRADFCIFSRDGVLSCWPG 
WSQTPYLK 


9319 


23220 


A 


9388 


224 


79 


GLGPVGHACPTLWEGEAGGSRGQEIEAL 
PANTVKPR FLKRKKKG KKKG 


9320 


23221 


A 


9389 


249 


72 


LVYLITVKKILFPVHIRTFNKIDHMIGH 
KTCLNKLQMTEILQSVFSDYNRIKLEAS 
NN 


9321 


23222 


A 


9390 


400 


254 


LRQPGSTNS PVSASRVAGITGTHHHTQI 
I FVFLVEMWHACNPS YSGG 


9322 


23223 


A 


9391 


3 


64 


PPQKSDPPYKLVPVHLDSIL 


9323 


23224 


A 


9392 


266 


106 


LTANFLFLVETGF PHVTRVGLELIiAS SD 
LPTIASQRAGITGVNHLQNFWPGNF 


9324 


23225 


A 


9393 


430 


348 


FGKPQPPPKGKSPPPRGNFPPQKAPKQ 


9325 


23226 


A 


9394 


481 


311 


LSIiSEKRS IRVGVTRFSRCCPSPLSIiTC 
KGNSLTPGVSQB IHS PPWFG IAYS VCTH 


9326 


23227 


A 


9395 


3 


136 


VF P RL VLN S W AQM I LLC W P P KVLGLQAC 
TTRPSHFLPLLNSKYK 


9327 


23228 


A 


9396 


143 


347 


NF VFLVE TRFHHVGQAOLTLLTS SDPAS 
QSAGITRVRPEILTLKRYHFPSGI LGTE 
GMVIEMAYLLKV 


9328 


23229 


A 


9397 


2 


231 


LELLTSSDLPASASQSAGITRGSHHARP 
SFTFRCVIHLEFNFMRGAREERLHFYCR 
WVI RRLTT FGRGYPS PAASD 


9329 


23230 


A 


9398 


383 


200 


QH YLNRRLS LAQW WMP WPATQEAEAGG 
LLEPRSS SLQCAMMLP VNSHCTLAWAra 
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SEQID < 
NO: of 
nucleotide 
sequence 1 

■ 


SEQID 
NO: of 
peptide 
sequence 

— 


M 

eth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=A$partic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, ENHistidine, Msoteudne, 
K=JLysine, LHLeucine, M»Metbionioe, 
N=Asparagine, P=Proline, 
Q=Glutaroine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X«=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 














REPHL 


9330 

- 


23231 

• 


A 


9399 


3 


416 

• • 


6FLHVGQAGLKLLTSGDLP7SASQSA0I " 
IGVSHRAGCLKS I YLTPPTRKPFVQMS I 
CDFKVEIEGKIGFISVPISLQSILHIVN 
NFF PFLETGS F P PGWRAWBKFGLIAPIiA 
SGVKGI FLPLPRENLGWRPSP IPPA 


9331 


23232 

■ 


A 


9400 


400 


151 


FGYPQVFPLPPFLKPGPGIFFLGPLKKN 
FP F PPPGLKI FS FKRAPPFFFFVRKGIiI 
MLLRLVSNSWAQVT I PPPPPEVLGLQA 


9332 


23233 


A 


9401 


324 


154 


FFFFFFKXXXXXLLFFFFFFFLFYIFFF 
FFFXFXFXFFFFFFFFFFFFFFFFFFFF 

P. - 


9333 


23234 

i 


A 


9402 


413 


312 


PKLVLKSWLQALLLPRPPKVIjGIiQARAT 
ITQPK 


9334 


23235 


A 

■ 


9403 


363 


3 


FKKKNFCLGGAGWFFFFYDPPPKKNIF 
YFPS KKI YFLS PASSS FFFFIiCFPFFI F 
FFFFFFF FFFLENRWFLMENFI KGRGHV 
KGPLGQARDHDI SGRRWWRREQCRASQL 
PGVRPRV 


9335 

■ 


23236 

■ 




9404 

* 


401 


82 


SELtoPiNPTi^yiypk^M^VCFTtil 

CTPMFTTALFSIAKIWNQPKCLSKRITI 
FMFLVTYSQLFCTHYPILPYPFFPAPLS 
SSHNYHYTLYLYENILLDSTYE: 


9336 


23237 


A 


9405 


382 


219 


WARWHGFLVPTS PGGEAGGSLEPGKRRV 
QLTQISPLPSSLGGKVGLPLKXXKNN 


9337 


23238 


A 


9406 


3 


195 


YLYSLVETGFHHVGQAGLELLISRDPSA 
LASBSAEITGMSHSIQPKSAFYIQIPKT 
FWQPAFK 


9338 


23239 

■ 


A 

■ 


9407 

■ 


371 


1 


FRRLRL KNRLNLGNGGCS E PRLCHCI PA 
WETRAKLGRKKKEERKREKGTKMYNKMY 
KMYLELNLTCPE PAS P KQMTRQLEVLS S 
NPSPGTFQFICKKQFKFSCCFALRVS I Y 
LLMSVFEYCSC 


9339 


23240 


A 


9408 


326 


229 


RQGLTMLPRLVSNSWAQVILPPWPPKVL 
GXjQA 


9340 


23241 


A 


9409 


&1 


231 


LESAS FPPG KVAHTCNPNTLRGCAGWI A 
WAQKFKTSLGNMAKPCLYKIK 


9341 


23242 


A 


9410 


367 


256 


HPGWEWWLVSVI PTLREAKVGRSLEPRS 
LRLAWDTWG 


9342 


23243 


A 


9411 

• 


2 


192 


VFLVEMG FLHVGQAGLELP I SGDP PAS Q 
SAGITGVSHHAQALTFLNIPINPSHPPK 
ACCSKQL 


9343 


23244 


A 


9412 


3 


83 


KNCLNLGGGGCS ELRLCHCTLTWATE 


9344 


23245 


k , 


9413 


371 


301 


EDHPRMVILRLLRYIVRLVWRMH 


9345 


23246 


A 


9414 


486 


337 


AGLELLGSSNLPALTSHIAEITGMCHHT 
WLAYI FVC^FGWVKLLDLCLAF 


9346 


23247 


A 


9415 


378 


254 


IVAHCSLDLLGSSDPLTSASRVAGPTGM 
CHHVWLIWFFVC 


9347 


23248 


A 


9416 


160 


309 


WRYSSNEFLLFLLFYLFKSNFTVFCCC 
YCCCCCCWRQSLTLPPKLECGV 


9348 


23249 


A 


9417 


162 


339 


RPSS FCE FLHDS VPELALPMKARFEKS R 
RSQVS WLMPVI PTFWEAKVGRYQARSL I 
PA 


9349 


23250 


A 


9418 


1 


147 


GGGGCS E PRSCHCTPAWATRVKLGLFKK 
KKKINFLFVPPFIFLKSKTGV 


9350 


23251 


A 


9419 


283 


126 


KMMYKGQAQWL I P 1 1 PAFWEAKERGLLE 
VRSLR PAWATEQDS TATKNKNKNF 
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SEQ n> 
NO: of 
nucleotide 
sequence 

■ 

• 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

#h 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

tng to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysieine, D=Aspartic Acid, 
E=Glutamic Acid. F=Pben vlalanine. 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine» 
N=Asparagine, P=Proline, 
Q=G I utara in e, R=Argi nine, S=Serrae, 
T=Threonine, V^Valine, 
W^Tryptophan, Y=Tyrosine, . 
X=Un known, *=£>top codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9351 


23252 


A 


9420 


133 


3 


ELM FS WAWWI MP VI PALWEAEAGGSRGQ 
EFKTSLPNMVKPRLN 


9352 


23253 


A 


9421 


3 


133 


LRG LY S GAG 1 1 RFEAKKWNALALWAWD I 
AGDNCAICRNHIMDL 


9353 


23254 


A 


9422 


409 


291 


ILILGGGSCSELRSCHCTPAWATRAKHR 
LKNKSPINVY 


9354 


23255 


A 


9423 


1 


210 


RFFHLGLSKHWDYRCESPHAGTRNEVLI 
YATIWMNSENI VPNEI IQTQRTNIVSFY . 
LYQVGRIGKF IDRR 


9355 


23256 


A 


9424 


479 


306 


GVPTNQPWSNRVKTPSPTKNIKISWPWW 
CMPCNPSCSEGQGRRIALNPGRWRLQ 


y j do 




1 A 




1 
■ 




GFFLFOGGGHHAGOAGLELLTSGDLPAS 
ASQSAGITGAGPNQGIFVALESPMESAA 
LLHLHWLS S PCS 

AA>AAAaA*A-A V W MWwfc 


9357 


23258 


A 


9426 


628 


433 


RQGVAHIiPGMECSGTITTHCSLELLGSS 
DPPASASOSSRLAWSHCAOPSIiLFSVL 
CLKPIFCCA 


9358 


23259 


A 


9427 


417 


125 


FLRQSLTSSPRLEYNGAIPAHCNLCLPG 
SSDSSILASQSIRITGVNHCTWPDAAVL 
RQNFF FS EKPQFYSEGFQQTH I IKSNLL 
YLKSTKCTCYPYL 

A# A AVW Ai m* £ A aJ 


9359 


23260 


A 


9428 


366 


256 


SFFYRDRSLATLPRLGSNAWSQAILPPG 
PPKVLRLLA 


9360 


23261 


A 


9429 


3 


140 


GFCHVGQAALKLLISSDLPTSPSQSAGI 
TGVSHCARLTRLFS SHE 

At V Imt A A ^ri>S 4p«M v> A«Ah A * AAA 


9361 


23262 


A 


9430 

■ 


368 


3 


VRASQLLCLP TQASAMAGTP P PASLP P C 
SS I SDCCASSEOGSMGVGPSEPGAGYNL 

mm* mm* mm m*» mm* m*M * > * ™» ^* * » %rf *» mmr ^ ■ AAm^ ^ XJ 

WCSSLRTLEKCSIRVGVPQFSRCHLSQ 
LPLARKGNSPTPCASWVRRCLTIiLQLML 
RGLYPLSCT 


9362 


23263 


"X 


9431 


3 


131 


LTMLPRLVSNS WAQGS SHLS P P KCWD YR 
CEPLHSGLKDIFLE 


9363 


23264 


A 


9432 


384 


274 


L S FAM LP RLVLNS WAQES AHLGLP KCWD 
YRHEPLGLA 


9364 


23265 


A 


9433 


431 


273 


PPPLGYFFIVKMEFPPWQDGFKLLNSS 
DCSTSAS PSGRI SGVNHSTRPRI F 


9365 


23266 


A 


9434 


211 


473 


CVCTFFS FLSMHI FFLFLVGEAESHCVT 
EAGVQWHDLGSWQPPPPRLKRFSCLSLP 
SSWDNRRVPPYPANFC I FSRTRVWSCWP 
GWS 


9366 

■ 


23267 

»f A»W f 

* 


A 


9435 


391 


188 


LPGSSDS PASAS RVAGTTGGCRHTVLLG 
FE FKDNT RWI KLVEKAHTQYGNI YI CVI 
KYMPLFYPYHH 


9367 


23268 


A 


9436 


331 


115 


S RD RG FAML PRLVLNS WTQAI CS LLDS Q 
SAHLSLPKCWNYRREPLCPASGEILSFI 
KPSSSKS FHNSTKKRQ 


9368 


23269 


A 


9437 


3 


109 

• 


IYGSYSLPACFFPSFLPSFLPSFLPSFP 
PFFLSFF 


9369 


23270 


A 


9438 


2 


167 


ECNGAI SGHCNLCLSGS SDS PAPAWATR 
VKU^IKKKKKKKNQILFFYPPRKTLF 


9370 


23271 


A 


9439 


648 


310 


VCPCILRHPAFLFAKSGNPLESSLLDHS 
SPTFKLTDRENLRHVKCTSHSQSLPTSG 
FQPGLPDSTLCAWYPTRSPKLSPAGWVG 
G YRAPVGRRGPGS RNS WRLHWTP I TSVG 
D 


9371 


23272 


A 


9440 


1 


170 


YEIWFLDVGQAGLELLTSGDLSASGSQS 
AGTTGMSHHARPLKLFTISVSSNKKVKC 


9372 


23273 


A 


9441 


133 


244 


L^6V0^RGIJU^KLDSNSWPOAVtt>P0 
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SEQID 
NO: of 
nucleotide 
sequence 

'i 

• 


SEQID 
NO: of 
peptide 
sequence 


M, 
eth 
od 

• 


SEQID 
NO: in 
USSN 
□9/515,1 
26 

- 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

* • • 

amino acid 
residue of 
peptide - 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
corrcspon 
ding to 
last amino 
acid • 
residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartlc Acid, 
E=Glutamic Acid, (^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Lcucine, MNMetbionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R-Argimne, S=Serine, 
T=0Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosme, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PPKVLGLLA 


9373 

• 


23274 


A 


9442 


365 


176 


RGSMDMVPSESCTGYNLLMCPLLRPLEK 
RS I R VG VTRFS RCHLSQLPLARKGNS LT 
PCASQVR 


9374 


23275 


A 


9443 


368 


231 


VEAGFCHIAQAGLTFLSSTDPSALASRS 
ARITGVSHHARPI FLKE 


9375 


23276 


A 


9444 


88 


194 


KMLEAKKKIEQQGGFTFBNKGVLSAFNF 
GTVPSNN 


9376 

■ 


23277 

- 

* 


A 

- 


9445 

■ 


3 


349 


IRSGSYACYFCYPREOTALFHCYSMIVS 
YGPYYFMNTDQRHESRLSTIYIFKFBIG 
SKHVAQAGLELLCSSSPPAFTSQSAGIT 
S VSHHTRPQKI FNVI YSCDLH FHPG I EG 
PQR 


9377 


23278 


A 


9446 


3 


141 


HAECSSMMIAHCELELLGSTDPPASASQ 
SAETTGISYHAQPYSTF 


9378 


23279 
• 


A 


9447 

- 


2 


282 


ICPTRSRFPPRVLKNGRKKDRLECSGAI 
SANCSLCLPGSSDS PASASRVAGSTVAE 
EDSTRRAVRHQGSSRSAPSGRGSWAGKQ 
ERADRAQQS 


9379 


23280 


A 


9448 


307 


430 


GLAW WLTPGI P I LWEAEVGGSLEVEFET 
SLASMEKPSLYYK 


9380 

- 


23281 


A 


9449 


2 

■ 


253 


IQCVCVCVCVCVCVCLSVCLHFIAILGC 
SPWGSRLPCSVSS PSGRGPGGEDLSPLP 
PTMSVSPEADFPPLAAAAPGNIMTAIT 


9381 


23282 


A 


9450 


1 


166 


NTGGGGEEIRVFSSCFAEDRLQKGKDRP 
GAVAHACNRSTLGGRGGWITRSQIFFC 


9382 


23283 


A 


9451 


1 


123 


NTRGLVMLPRLVSNSWAQVILQPLPPKV 
LGLQASVTVHINN 


9383 


23284 


A 


9452 


396 


291 


SLCIKRVCACACVCVCVCVSGAKLiRKLV 
LDFMRS 


9384 


23285 


A 


9453 


1 


147 


FAIRARFSNTYTKIGTIHTYIYIYTYIY 
TYIYTHTHIYILSQVLMEYEF 


9385 


23286 


A 


9454 


394 


235 


TRSCYIAQS WQWLFTGTLI IHYNLQLL 
SSRDTPVSASQYLGLQAHTTTPDSC 


9386 


23287 


A 


9455 


1 


85 


TNPVATLLE WQDALCF I PVGS AAPQGS P 


9387 


23288 


A 


9456 


339 


165 


KTHWWGKKS FLKN P FF FFFFFETKSRS V 
TRLECSGVILAPKQKTKKQNDKTTINKT 
I 


9388 


23289 


A 


9457 


3 


209 


LLI LPS I KMRNRPDS PALLS TQL P P I F I 
YIHTHI YLYI YTH I YI FI YI YLYI YLFA 
YIHIYVSLFPLK 


9389 


23290 


A 


9458 


374 


156 


QAGLELLTSGDP P I S ASQSARI TGLSQR 

* A« *m ma MM fMMM»f|*% W JMMijhb ^» VWaMMMv V 

AQLSELTLTV Ki lRHCTCLCAHFLYYTV 
FLEGRNDDEFISIPST 


9390 


23291 


A 


9459 


287 


3 


EAGLL WRKATHKYAHTHTHTH 1 HTHTH I 
RGLQNMINCKYITYQE IMTLAQAKNKGV 
I PDSFP FTSS PPLTYPPALS FHFQKEH P 

NAIL 1 1 I rHjfHU 


9391 


23292 


A 


9460 


3 


72 


KTELKMKFTNILESSFLMNNKS 1 


9392 


23293 


A 


9461 


327 


92 


ISLFLVWGALFFPKKI I YPPKKKSPLFF 
PPPPPQENFNPPSFLAPPVFPPQENPKF 
AGGRKI FCPPPPVFFFYFFFFF 


9393 


23294 

* 


A 


9462 


3 


173 


ACRGDSGGPLVTRDLKDTWYLIGIVSWG 
DNCGQKT)KPGVYTQVTYYRNWIASKTGI 


9394 


23295 


A 


9463 


258 


39 


LKVTLPSDPSSIFVCLFLENESRSVTQA 
GVQWHDLGFKRFSCLSLLSSWDYRRVPP 
PPAKFCIFRKHGVSPC 
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SEQID 
NO: of 
nuciconae 
sequence 


SEQID 
NO: of 
pepiiae 
sequence 

• 


M 

OQ 

« 


SEQID 
NO: in 

TTQ.QN 

09/515,1 
26 


Predicted 

beginning 

nucteonae 

location 

correspond 

iiig co iirsT 

ominn ociH 

residue of 
peptide 

bam n an 4*A 
SCqUCDCC 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

UIIIK 11/ 

last amino 
acid 

residue oi 
peptide 

ocfJUCllLw 


Amino acid sequence (A~Alanine 
OCysteine, D=>Aspartic Acid, 
jc^Hjiuiaraic Acia, J 1 — Jroenyiaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
n— Asparagine, Jr=Jrroiine, 

T=Threonine, V^Valine, 
W«Tryptophan, Y^Tyrosine, 
A = unKnown, — oiop cooon,/— possiDie 
nucleotide deletion, \=possible 

nurlAAfl/lA t n fior4iA n 

iiULici/uui? inscruuu 


9395 


23296 


A 


9464 


104 


336 


NHVS QCEEl FLFDEYYI FS RAVASMFWV 
DAEIiGSDIYLDGKLKNSFLLSFLGFFFF 
FFLGKKVFFFSQNGGLGRPGQ 


9396 


23297 . 


A 


9465 


410 


0 


LNLGGGGCSEPRLCPCTPAWATE 


9397 


23298 


A 


9464 


254 


61 


KKKKDAFFTI PLYPSSQNLFAFTQMDPD 
TNISQQLTWTVLPQGFRHSPPTFLDKPF 
KQTSINFA . 


9398 


23299 


A 


9467 


403 


255 


PAPQLGGPPMMPMMAPPSSWMMPMGCAP. 
AMRLPIGGDI IWLCVPTQISP 


9399 

* 


23300 

i 

• 


A 


9468 

! 


1 

• 

■ 


1428 


P ARF KD LNLAGTA E VGLAG Y FMDHTVAF 
RDLP VRMVCS STC YRAETNTGQEPRGL Y 
RVHHFTKVEMFGVTGPGLEQSSQLLEEF 
LSLQMEILTELGLHFRVLDMPTQBLGLP 
AYRKFDI EAWMPGRGRFGE VTSASNCTD 
PQS RRIiHIMFQTEAGELQFAHTVNATAC 
AVPRIiLIALLESNQQKDGSVIiVPPALQS 
YLGTDRITAPTHVPLQYIGPNQPRKPGL 
PGQPAGPRVGYKALQPWPAQRLGPVGRP 
PCRPEGQPLS CCCCGCS CWGRLTRCLAE 
WTEVLTVGQVQRGPCTALLHKELCGTE 
PKGCASTEEKGLLLGDFKKQEAGKMRSS 
QEVRDEEEEEVAERTHKSEVQEQAIRMQ 
GHRQLHQEEDEEEEKEERKRGPMETFED 
L WQRHL ENGGDLQ KRVAE KAS D KE TAQ F 
QAEE KGVRVLGGDRSL WQG AE RGGGERR 
EDLPHHHHHHHQ PE AE PRQE KEEAS ERE 


9400 


23301 


A 


9469 


265 


170 


GToSHSVTQAGGQWIjNXjSSJjQPijPPGFK 
Ufa 


y**\J J ■ 






V4/U 




III) 


1?r\T.PT . A Q C*T i Q PT? D G I?T .15 C T?P O Q ITT .D CS TTT . 

KUJjlSiijnO r JJo r JP ror Jjxror rrorxjlror Ju 

PSFLSFSFLISFFPLPSFFSFISFSLSL 
FPB 


9402 


23303 


A 


9471 


67 


210 


TVTTGQALTHAYNPSTLGGRGGQITRGQ 
DFETTLVPGDLNFSRVLASS 






a 




*> 


loO 


lOUnKU/UUUM V£<JVX\.XL'XVX A E» V XJIVXIllJi-llJ 

VCTQ KS CS FLKNKNKAWLGKQ ANAPARK 
VEIRC 


9404 


23305 


A 


9473 


417 


265 


FLSSFLFFFFFFFFFFFFFFFFFTV 












ion 




9406 


23307 


A 


9475 


1 


150 


PTRPWQAGFKVLSSGNPPALVSQCFRI 

TT3\7CH'PaPDWT.T.'MT.T7ZiD7*JT>PVT. i 
1U V Oxlxvnlt trriXjlwl XJivrlxriNiJxr V JJ 


9407 


23308 


A 


9476 


404 

4 


109 


PPPLFFNFGPFFLVEARLHHVGQAGLKL 
PTTSDQPDSASQSAGFTGVSTRAQPTLI 

TT.V 


9408 


23309 


A 


9477 


391 


235 


PPFFFFCRDRGLTMLPRLISNSWAQVIL 
LPWS PKMLGLQARATVSGLNKRYS 




233 1 0 


a 
A 


9475 


/DO 

438 


277 


ivKAJr r r r r L.KJJK0I1 1 raLiirKJjl 0N0WAU V 
ILLPWSPKMLGLQARATVSGLNNRYS 


9410 


23311 


A 


9479 


399 


279 


PPPPVFKNPPPKKNKGAPKKKNISPPPG 

KINJ\ X C C V XV.V7XT XT 


9411 


23312 


A 


9480 


247 


26 


LKAGDDSRTVTjGKVHQSPCFSLIiLFFIQ 
VSCFJ)EGLTVXPRLVIjNSRPQAIIjIiPQL 

pklldhrcepphlanhg 


9412 


23313 


A 


9481 


2 


176 


happhpanflffeekescyvtqvsikll 
ds snp paltsqsag i tgenqhghpmncl 

SG 


9413 


23314 


A 


9482 


2 


178 


HAPPHPANFLFFVEKESCYVTQVSIKLIj 
DS SNP PALTSQSAG I TGVSHHGQPMNCL 
SG 
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SEQID 
NO: of 

nuucuuuc 
sequence 

■ ' 

- 


SEQID 
NO: of 

sequence 


M 
eth 

vll 

- 


SEQID 
NO: in 
USSN 
09/515,1 
26 

. 


Predicted 

beginning 
nucleotide 

location 
corresnond 

ing to first 
amino acid 

u ii a ■ w ***** w 

residue of 
oeotide - 
sequence 


Predict- 
ed end 

otide 
location 

correspon 
din? to 

\M III v V 

last amino 
field 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
F-=niutamic Acid. F*=PhenvIaIanine* 
OGtydne, H=HSstidine, I°Isoteuclne, 
IC=Lvsine. L= Leu cine. M=Methionine. 
N=Asparagine, P=Proline, 
0=Glutamine. R=Areinine. S E3 Serine, 
^Threonine, V-Valine, 
W=TrvDtoohan. Y=Tvrosine. 
X=Un known, *=Stop cod on, /= possible 
nucleotide deletion, \= possible 
nucleotide insertion 


9414 


23315 


A 


9483 


2 


151 


, GRVGBGPELVSLLVLDQHPVGTSDHRDA 
TQLGPQPLMSbLGSTGSTEOA 


0/11 S 


Zj JIO 




QAM 




JZJ 


FFVFLLETGFHHVSODGLDLVICPPRPP 
NTFV 


y*t jo 


zjjl I 






aha 


104 


LFFFLFFFFFFFFFFFFWNPPKTFFKKV 
FGLNFPTLLKLKFNPPPPGPPKKKKKBL 




i ZJjlO 






11/ 


ZOU 


SSTNOANNCFFFFKKRGLAJI1PRGAI1NP 
WAKG I LLPWP PKGLKLRGGAI GPGQK 


9418 

-• 


23319 

* 


A 


9487 


76 


353 

• 


VLWNTLEFCLI YLDLLFLLFFNKBAL I R 

AGLFLGDTI I YSRGAVLKNSGGETPRHT 
PSGGLWPR 


9419 


23320 


A 


9488 


3 


122 


HASEDNVFLHCSHSDLGYVUiAGTASLQ 
DFSFFQKSPPP 


9420 

• 


ziJzl 






l 




DTT7 DTVP QniMT.flH fWT .Pfl Q QR^T ,&fi 
tr jl Xvlr J, V UOunl'lilnnUi LILilgwaO Bo 1 iriO 

ASQVAGTTACTSTPDPFFFFFSL 


V4Z1 


OHIO 
ZJJzz 


a 




j 


Z1Z 


GITGMCHCVWPQFAYFCLLLVYLDHLKF 
TIIMLVLCLLFYD 


9422 


23323 


A 


9491 


2 


163 


ETLFRHVGQVS FEPLTSGDLPASASQSA 


9423 


23324 


A 


9492 


474 


296 


SQPAGTADCLLPLPPGSCFPTI PCPAAL 
VPLPSFPSARLWPPQSLHQLLPIiPGTCS 
SCT 


9424 


23325 


A 


9493 


7 


i 

151 


APVJTASSSEKGRDDGMGPRA 


9425 


23326 




9494 


52 


iki " 
Jo* 


1 IJUroiJO vr iJr J. e luCiiif V lwW vuv X Lin 

TS FI S FFLFF FS FLEKDFS FS PAGUPWP 
MFCL T O P LAS GVGG I S RPNP PS R WELHA 

LPP P PGGLGF FFWKKGGPPG 


y4zo 


ZjJZ / 


a 




18.1 
JO J 


IIS 
1 15 


mvdd PWVPPGAGVOTPPGPOGETPLFIjK 
KKKKRGRGGPLFGP PS FGGGGGE IPLTP . 
APKGOE I YFPP PP PPPGGKKKSLFOKKK 
KRIR 


9427 


23328 


A 


9496 


554 


419 


FFFETKPCSVSQAGVLiWRDFGSLQPPPS 
OVORDSPCPQASOVAWD 


9428 


23329 


A 


9497 


428 


281 


GSYVQLTFKYFS I YI YVHTHLHVYLYTH 
IYVYVHICIYKEGYITGSVLA 


9429 


23330 


A 


9498 


144 


16 


KVKKKVSQCPGIVAHNCNPSTLGGRGGR 
IAWGOEFETLSLOKN 


9430 


2333 1 


A 


9499 


2 


155 


NIALLGSSDPPVSASPVSGIRGVHHHAQ 
LIFFFFFGKRAWFNPPGGRKGEE 


9431 


23332 


A 


9500 


2 


184 


FIYGRDSIjTMLPRIjVLNSWPQAILPPRP 
PKVLGFRCEPPHLASLILFYRDILFTLG 

•UivO V 


9432 

• 


23333 


A 


9501 


465 


153 


PHKVFLNKGPFETPQGELSRGEWKTRPP 
PPRERSRPFHRKGGFGSPPKLRQGPPFV 

PPQMPPWfPPPKK'GARPninTfGPPVPPPP 

F FNPPFFFFFCLFF I FE IV 


9433 


23334 


A 


9502 


429 


11 

+ 


CRTLU^CVO/CVCVCVCVCVKAFPPI 
SYSCHLVPSDLPLLGVQFKCWVHFGLSF 
LTCPFPSPPWPGGAQPGSCYWGLGCLCL 
FACKLGARLTEQTLGIPASGPGTWATPL 
GSCASVIJUILPGGSERDTDLKVSVLSG 


9434 


23335 


A 


9503 


436 


202 

* 


LPGGYI FQGEGVEKTKFSPLLRENNFFS 
NFFP PKKNNGGGPR I P PPPPP PLFKKGV 
FFIPKFPAPPRPKKKFPKKKKK 


9435 


23336 


A 


9504 


3 


112 


S FDLLS S S DP PALASQSAG I TGVSHSHG | 
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SEQ CD 
NO: of 
nucleotide 
sequence 

• 

• 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

U %mt mmr M W w W 10 VP 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
seauence 


Predict- 
ed end 
nude- ' 
otide 
location 
correspon 
dins to 

F*> 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E^Glutaraic Acid. F=PhenYlalan!ne. 
G=Glycine, H=Histidlne, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine. R=Areinine, S=Serine. 
T=Threonine, V=Valine, 
W=Tryptophan, Y=^Tyrosine f . 
X—Un known. *=&tOD codon. /=oossible ' 
nucleotide deletion, V=possible 
nucleotide insertion 














QKVYICNM 


9436 


23337 


A 


9505 


426 


208 


VGHNLLVPRFLS PSEKRS IQVGVTRSSR 
C30^SPLSLTRKGNSLTPFASRVRQCLTL 
LRLARAPTDLRPLSGTP 


9437 


23338 


A 


9506 


441 


189 


CPANFPVPLREMGPHHVGQAGLLTSSDP 
PAS ASQSAGI TGLS HCARPTVGSDVLS L 
PSGQCTSALTGFMLGSTTVHSNAAARA 


9438 


23339 


A 


9507 


117 


278 


NKALLCHPGWNH I KKVNGRLGMBTHTCN . 
PTTLGGQGQEFENSLGNIGRPHLYK 

mmt ^m\ mm mm^B m^mr m^mr m^mr mw^ mm mmmMm ™ mmw ^mm^tm'mm mm* mm 9 mwmpmM mt .m*mrmm mm A % 


9439 


23340 


A 


9508 


3 


126 


FFVEMGFRCVGRAGLRLLTSGGLPVLAS 
QGAGITGVSHCI 


OA At) 


1 


ATX 








FLVERRSHHAAOAS IJ3 I I.S S S DS PTS Aft 

QCAGITOMSHCARPKYTSFFRLHRHMPM 
ATPTKLPPTLAS AS PQVKVCL YPLLPST 
S KGE I FTLOIiY 

W m\\J U mm- ± Jft Ji 


9441 


23342 


A 


9510 


2 


200 


PQPPTFYPSFLPSTPEPPAFYPRLPPST 
PASRIjLPOPPAFYPSACHLLPOPPTFYP 
SLGNRARLHL 


9442 


23343 


A 


9511 


414 


204 


ERDRKKERKKEGRKERERERKKEKKRKK 
KKERRKEGRKEGRKEASKGQRVWIMVD 
T 


9443 


23344 


A 


9512 


1 


211 


FHHVG WAGLGLLTS GDL PAS ASQSAR I T 
GMSHHARHRHGFDNLSLCALAPSPTPLW 
RRHRRRGHFTVPEK 


9444 


23345 


A 


9513 


390 


265 


VFLVEMGFHHVGQAGLELLTSGDLPPLE 
S OS AG I TGVSOO FS 


9445 

■ 


23346 


A 


9514 


438 


180 


IGFRHVGHAGLDFLPLGDLPSLASQNAR 
ITGVSHRAWPAMAFVNCHGAGGSVAMRT 
TRGHS CRHIjG I GG FWP AS LLQ P VLSAG S 
L 


9446 


23347 


A 


9515 


126 


11 


RNLDFFLSLGLAMLSRLVNSWPQVIFQA 
WPPKALRFQV 


9447 


23348 


A 


9516 


2 

mm 


249 


FLHVGQAGFVLPTSDDFRPTHTPQSSGI 

mmMMi m\ W m M mm w W mmw ^ m> ^mw mmwmmr *mt mt » mm mt mtmr mm mm mj^mmmrvmm 7 ^mW 

TGVSHRTRPLRCYFLRLNNI PLYVYAYY 
ILFIHSVXSGYVGGCrAHYCHPRKW 


9448 


23349 

■ » ******** 


A 


9517 

m* mf m\ 4 


403 


57 


PEGALVPQKKKKKKKKKKKKKKKKKKKK 

mm mmw^^Fm* w mm jH^m* ™™ mt^mmmm^m-mmm mmmm mtmm-mmmmt^ h w «b 

NIKI 


9449 


23350 


A 


9518 


406 


140 


PHYNHTFVYNGVRLEDLQHMCLEEiTVWD 
REPLASNDFLVGVRLGVGTGE I PSPILT 
CLKASCLFYSYYHSRPPTQSSSVMTWF 
FPFS 


9450 


23351 


A 


9519 


3 


75 


GIEWMVNCWRNVHRPPRQRDIT 


9451 


23352 


A 

* • 


9520 


183 

1 O J 




LLTHLILTSALIGE IMVAP ILQKNSGSE 

* " J mmt Jm A W «A> 4 Wi mmmtrnt mmm ~*m* mm* mm* fj + If 4> mmm m%, WWAi 

RECRLPRLVLNSWPQGILLPWPPKVLGS 
QV 


9452 


23353 


A 


9521 


89 


339 


LQCXiQKSGKLE PGMT YTKL I DADVNVGN 
I TSVQ F I WKKHLFEDS QNKLGAEMVTNT 
SGKYGYKSTFCSQDIMGPNILQNIiKPC 


9453 


23354 

mm9mmW -J w 


A 


9522 

^ **** 


3 

mm 


329 


PSLMFPWLKLSFPGRPRGRQTYSRYQTL 
ELKKE FLFNPYWTRKRRME VFHALGLTE 
RQGKIWFQNRRMKGKKENTRDNLPEAQN 
EEKVEEKGI EEKEKEEEEKEENRD 


9454 


23355 


A 


9523 


1 


156 


AKTSMNVNEIFMAIAKKLPKNEPQNPC3A 
NS ARGRGVDLTE PTQPTRNQCCSN 


9455 


23356 


A 


9524 


1 


272 


AKNSARALLGILGYRSHLISSUjCLEGI 
ILSLVI IATLITLNTHSLLANIGPIAIL 
VFAACEAAVGLALL VS ISNTYGLDYVHN 
LNLLQC 
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SEQID ' 
NO: of 
nucleotide 
seauence 


SEQID 
NO: of 
nentide 
sequence 


IVI 

eth' 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
. peptide . 
sequence 


Predict- 
ed end 

nil cl fe- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteinc, D^Aspartic Acid, 
&=Glutamic Acid. F^Pbenvlalfltiine. 
G—Glvcine* H=Histidine, I=IsoIeucine» 
K=Lysine, L=Leucine, M=Metniontne, . 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, *~ 
T==Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \= possible 
nucleotide insertion 


9456 


23357 


A 


9525 


1 


62 


GNDAWVCTNDMFRLALHNF6 - 


9457 


23358 


A 


9526 


1 


Ill 


GDI IHNENMRQGRDKYKTLRQIRQGNTK 
QRIDEFEAL 






A 


Q577 






I P IGNIMSCDSHSHHELVMSAOLNAI FH 
D 


9459 


23360 


A 


9528 

■ 


439 

• 


27 


PRRTDKWLTVEMIQDGHQVSLLSGELTV 
DORAS I TORPRDGKEKVLI PTWVCARGT 
DVKQVTIVVTFDLPVKQGEEPDYETYLR 
RIGRTGR PGKKGLAFNMI EVDELPS PMK 
IQALFDGRNKQLNAEVPCI QAAMED 


9460 


23361 

! i 


A . 


9529 


1 


236 


REVLITTNVCARGI DVKQVT I WNPDLP 
VKOGEE PDYET YLHS IGRTGRFGKKGLA 
FNMIEVDELPSLMKIQDHFRKS 












Ln\J 


OOVSSARPEPEAGEVSPPVGAGVNSNSW 
AFKYGPGNPKQSGPEPKKQTQVSFLLRR 
KGEASQPRQ 


9462 


23363 


A 


9531 


413 


260 


LFDQ WKVAS P S GLRTS TVR YGDPEKLK 
LEES LRQQAEKEGS ALS VR I SNV 










All 




GQIYDNISPRPPQGIVAPPGGRGNIPNL 

G, 


9464 


23365 


A 


9533 


2 


134 


IHGVCVHMCVCACVCVYACVCVGGYDGR 

IT IiHllOO X V roll VW ±l v J 


9465 

■ 


23366 


A 

■ 


9534 


419 


53 


LLLLTNGDRQTQREKIEACACQSYFDAV 
WGGEQR E EKPAPS I FYYCCNLLGVQPG 

KNG I VPL KSS P VPHYMVSS VLBLPALLQ 

STDCKVRMST ■ 


9466 


23367 


A 


9535 


439 


109 


NCI VI IDSTP YRQWYESH YALPLGRKKG 
AKVTPEEEEILYKNRSKKIQKKYDERKK 
NAKI S SLVEEOFOOGKVLAC IASRPGOC 
APADAYVT EGKELEFSLRKI KARKGK 




7T*fiR 




7JJU 


1 
i 




NTTS SHHFTI P CIiVL YS CTSTHTHIHTH 
THTH THTHVRVFS Y I PS QDVKEGCIPAP 
SVFSW 


9468 


23369 


"XT" 


9537 


168 

A Vw 


280 


NPNS IPiTSCCCC^CCCCWKSPYNVYNR 
LLRTGFTFP 


9469 


23370 


A 


9538 


142 


324 


KIVKMGFPHAGQAGLELLTSGDT POSTS 
Q SAG I TG VSHRS RARS I FLYP YLPSVYL 
PWRRV 


9470 


23371 


A 


9539 


3 


139 


YIGGMGQAWWLMPGKPRQNDYLS PGVQD 
QPGQHSETPSLQKGFKN 


9471 


23372 


A 


9540 


383 


318 


MSTQNNIQSYPIGEDEESESD 


9472 


23373 


A . 


9541 


372 


207 


YGLNMCRQCFRQ YAKDIGFI KVGVCRRT 
PCCIiGREGCGGYFFREROTDRSSLEW 

AT ywUwmiVWSJw AAA 4VU4\U A> AV A\ * ** *J AAA^ t t 


9473 


23374 


A 


9542 


2 

- 


360 


ARGDDDHRSSDRKDGVLEEQ I ERLQE KV 
ES AHSEQKNLFLVI FQRVIMILTKHLVR 
CETDGTS VliTPWYKNCIERIiQQ I FLQHH 
QI IQQYMVTLENLLFTAELDPHILAVFQ 
QFCALQA 


9474 


23375 


A 


9543 


379 


75 


TNLQQFAMSHGSAQVKGHGKKVADALTN 
A VAHVDDMPNTLSALS DLHAHKLRVDP V 
NFKLISHCLLVTItAAHLPAEFTPAVHAS 
LDEFLASVSTVLTSKYR 


9475 


23376 


A 


9544 


380 


56 

* 


RDLS M VRM KSMFAIG FCFTALMGMFNS I 
FDGRVVAKLPFTPLSYIQGLSHRNIiLGD 
DTTDCSFIFLYILCTMSIRQNIQKIIiGL 
APSRAATKQAGGFLGPPPPSGKFS 
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SEQ n> 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 

* 


M 

eth 

od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

4 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysin e, L=Leu cine, M=Meth ion ine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophfln, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9476 

• 


23377 

• 


A 

■ 


9545 


304 


92 

■ 


S KS RVTNNTLRS TRKDTMI TLNTHSLLA 
NIVPIAIIVFTACEAAVGIALLVSISNP 
YGLDYVHNLNLLQC 


9477 


23378 


A 


9546 


394 


108 


KEIDFS LVDVTADDAGNYS CMYYQTKS P 
FWASEPSDQLEILVTVPPGTTSSNYSLG 
NFVRLGLAAVIWIMGAFLVEAWYSRNV 
SPGESEAFKPE 


9478 


23379 


A 


9547 


1 

■ 


335 


GNAVGLNLHRINFDKYHPGYFEKDGMKH 
YHLHRNQSFCPTVNLDKLRTLFSEQTRL 
NAAKNKTGTAP 1 1 DAARS VY YKVLGKGK 
LPKQPVIVKAKFFGSLIFCFRIWILID 


9479 


23380 


A 


9548 


326 


188 


NFLRMGTVAHASNPS ILGGLGGRIAGAQ 
EFETSLGNITRPCLYFKK 


9480 


23381 


A 


9549 


381 


152 


VTL I S LS LLlLQ WI GVLLWIARLNLNE 
VEKQWRLNQLNNAATI LVFFTWTNVFI 
TAFG AHKTG FLAARASRNP L 


9481 


23382. 


A 


9550 


170 


65 


I HRS KNKWKFHLKDGIMNLNGRDYI FS K 
AIGDAEW 


9482 


23383 


A 


9551 


544 


361 


RRLRQENRLNPEGGGCSKPRSCHCTPAW 
ATEQDSSSKEKKERraSILKGNTVKSVPLP 
LTSHA „ 


9483 


23384 


A 


9552 


369 


181 


GVLGATSGDMQIHTFGSMI IGSITGMVS 
ELGYKFLTVSTADVMIQTLNLHLFSHMS 
HQQKKSY 


9484 


23385 

■ 


A 


9553 

• 


1 

• 


516 


S LVHWE FGQAIARRLKPFGVQRFLYTG 
RQPRPEEAAEFQAEFVSTPEIiAAQSDFI 
WACSLTPATEGLCNKDFFQKMKETAVF 
INT S RGDWNQDDLYQALASGKI AAAGL 
DVTSPEPLPTNHPLLTLKNCVILPHIGS 
ATHRTRNTMSLLAANNLLAGLRGBPMPS 
ELKL 


9485 


23386 


A 


9554 


386 


56 


LFPACVLRWQ FAS DGGD IG FGVFLKTKM 
GEQQSAREMTEVLPSQRYKAHMVPE3X3S 
LTCLQAGVYVLRFDNTYSRMHAKKLSYT 
VEVLLPDKASE ETLQS LKAMRP S PTQ 


9486 


23387 


A 


9555 


375 


150 


I WGKTSFHVS IPLSLFFPGDVLYDLLQ 
HILKQSKPRILFS P FFHPGNS IHTQPEV 
IIjHQTHEEGTGRGFSFLA 


9487 

* 


23388 

• 


A 


9556 


3 

■ 


354 


clptqasamvdtpppallppcslisdcc . 
asneqgsvgigpsepgagynflvcrllr 
plekhs irvgvtqfsrcclsppsltrkg 
nsltpctsrvrrclallqltlselphcp 

VPSV 


9488 

■ 


23389 


A 


9557 


463 


40 


PIKVEDPEPVKKPPKEQRS1KEMPFITC 
DEFNGVPS YMKS RI/T YNQINDVT KE INK 
AVISKYKILHQPKKSMNSVTRNLYHRFI 
DEETKDTKGRYFIVEADIKEFTTLKADK 
KFHVLLNILRHCRRLSEVRGGGLTRYVI 
T 


9489 


23390 


A 


9558 


338 


104 


NFKKKKKRRTKKKEI RTEEKGRGANKKT 
FRGPKKKETRGSNRSPEGKKNHGAKKEG 
ERRPRiCKKKKKKKIiARYDFTCL 


9490 


23391 


A 


9559 


1 


152 


NTCVCVCVYTGFCHVAQVAHELSSSNLP 
SSASQSAG I TGI SHHAWPGFGF 


9491 


23392 


A 


9560 


517 


238 


SGPGESVPHPWSAVRTRPAKKPGSQCLL 
QS AFPLWCS SRRTTWAEERKLNTETFGV 
SGRFLRGRS SRGGFRGGRGNGTPRRNPT 
SHRAGTGRV 
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SEQID 
NO: of 
nucieoiiae 
sequence 

• 


SEQID 
NO: of 

n a n 4"i a & 

pepuae 
sequence 


M 

etb 

oa 


SEQID 
NO: in 

(09/515,1 . 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
. peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

nrXA 

acta 

residue of 

pepttae 

sequence 


Amino acid sequence (A«Atanine 
OCysteine, D=Aspartic Acid, 
sir- ijiu laniic A.CIQ, r— * nenyiaianine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leu cine, M=Metb ion i ne, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S-Serine, 
T=Threonine, V^Valine, 
♦v— iryptopnan, r— .tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucieoiiae deletion, v— possiore 
nucleotide insertion 


9492 

• 


23393 


A 


9561 

• 


709 


280 

• 


. NTS PALTAPCGRPRG I RPMQCWARVPAP 

OoA^JrAnUJairoljuljXS X V JrAFHJr SJrAAti 

QPCE LSQHHRGGLE PALPGTRQRLCSRT 
WHRCVRFLPGAQEAGELGPAACATGRLP 

ATH 


9493 


, 23394 


A 


9562 


3 


130 


YRGFCHVGQAGLELLTSGDRITSASQSA 
t» J. 1 li V onKAU Jrun£> 


9494 


23395 


A 


9563 


409 


160 


NPGVKRS PCLSLSSSGGYWSMPPCLAYY 
SHFGLPQSWDYRHEPLHIAQFFLFSLL 


9495 


23396 . 


A 


9564 


1 


241 


GICLGSNIGSGWMGTSSLCNPYSNTLNQ 
NLLSTTKPFPVPSVGTNCGIEPWDSGVT 


9496 


23397 


A 


9565 


2 


215 


IHWI^GWMDGWMIXSWMGSCVRAVTGPGT 
SSPHTSTCGHAVCVCVCVCVYTCAHVPL 


9497 


23398 


A 


9566 


244 


117 


DHQHI.IWSVFISFSFKVIKARKNKKGIL 
NJrDb £?Flb 1 bPJLJr r J? 


9498 


23399 

► 


A 


9567 


414 


77 


FGGVPPGTliVPPTWLPPVRVPPPPAFCL 
GELS PPPFrWRKGIjAPFIiljNFFFSPGwR 
GFPS PGGKGPRPLPRPAGFPGFVAKPPF 
RPPGFSSTPSDCGGLPFFFFFFFKKKVF 


r\ a f\r\ . 

9499 


23400 


A 


9568 


418 


198 


ft* V bii 1 £i V J. VoJS^JtiJKAtiJu Vr WJMMfc'valJUE* 

HPEGDNNYMEFIEVLTEALERVLLVRGG 
GSEVITI YSYPTLNARA 


9500 


23401 


A 


9569 


549 


A 1 AT 

415 


RQENRLN PGGKGC 5 B PRSCCC XAAWATE 
GDCLKKINNNKPKNEQK 


9501 


23402 


A 


9570 


546 


1 


VT CTDPS T r P7\'LT/" , CT ."KIT TVT5/T»T5T3T , G RTT^ 

AAGTTGTCH 


9502 


23403 


A 


9571 


415 


215 


SRLRKKIYIYIYMCTCVYVYICVYICIY 
VCTYICIYVCICVYICr^ICVYMYIY 
MHEFITFFSI . 


9503 


23404 


A 




374 


<<lo 

138 


IiAASAAI Qfi IjFKK I £> HiUFTPMF RKKArij 
HWYTGEGMDEME FAEAESNMNGLVSEYQ 
QYQDATAEEEGE FEEEAEEEVA 


9504 


23405 


A 


9573 


370 


233 


GSPPVRSVHQSGVSLSGTQVDEGVRSGS 
ItKMVAr Pw?RllVi l olio 


9505 


23406 


A 


9574 


3 


399 


YSSPGPIALALRDPCAHGGPSPNSSLSP 
AQGHPARAQQGPQPQQGPPSHLGSDIiRP 

nvAH 1 KvaALirKa if AJr Ai/Ao fAir Wjo V 

AASMCSAPRPGRAGGTLGSVLTAHHPAS 
AGCAHRDAYVCKAPGRAVF 


9506 


HA JAM 

23407 


A . 

* 


9575 


314 


48 


REARAWHNLTVIxATELDSSAQASRVQVA 
I QTLDENDNAPQIiAE P YDTFVCDSAAPG 
QVSN 


9507 


23408 


A 


9576 


505 


264 


IFWEMEFHHFGQAGLRLLTSGDMPASA 
SOSAGVTDMSHCTRPDTPFNFKELPGAV 
VTFLLCGMQVGRDELEDDWMGAWS 


9508 


23409 


A 


9577 

• 


508 


314 


PATWKGCLSNFSS YLTKPADFI EHQVLS 
WEQVPDGFFIFNERFKSFTWVLNNVAE 
FVCKYKLL 


9509 


23410 

* 


A 


9578 


512 


1 

4 


RMREAI FS PGRLPRGAAPLRGPAGTIjVP 
WMPTPGECWLSIiSVSAAGEKPYKCSVCE 
SAFNRKDKLKRHMLIHEPFKKYKCPFST 
HTGCSKEFNRPDKLKAHILSHSGMKLHK 
CALCSKSFSRRAHLAEHQRAHTGNYKFR 
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NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 

r r 

sequence 

* 


M 
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od 
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SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

• residue of 
peptide 
sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G«GIycine, H=Histidine, Msoleucine, 
K=Lysine, L^Leucine, M=Methionine, 

if w W w 

N=Asparagine, P=Proline, 
Q=Glutamine 7 R=Arginine, S=Serine, 
T^Tbreonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














CAGCAKGFSRHKYLKDHRCRLGPQKDKD 
PV 


9510 


23411 


A 


9579 


3 


487 


WDLIYVGRKRMQVEHPEKAVPRVRNLVB 
ADYSYWTLAYVISLQGARKLLAAEPLSK 
MLPVDEFLPVMFDKHPVSEYKAHFSLRN 
LHAFSVEPLLIYPTHYTGDDGYVSDTBT \ 
SWWNKEHVKTDWDRAKSQKMREQQALS 
REAKNS DVLQS PLDS AARDEL 


Of 1 1 
yj 1 1 

t 

• 


> 


A 


7JOU 

I 


i 

* 

■ 




S KYLYDEGEEEEVFNDEWGAAGGVAVPT 
LNALERG FLSAMDWHLYTDPRE I FEVLS 
WLESCVAEQOGRWRGWYTYTDLCVLLEO 
PTWQLAIjGSLCQRLVKIjSCLIiAVAYVSS 
VALAVASVAVIHQSLGtiSCTPTPGPPDL 
GLTSRCLLEPCIPSVPQCLPSLANVSSC 
LEGSMGLRS LWGS LLASLTPP PLPPPDP 
PAPPTLLHNCHLCQKLQRDSPTCHACLH 
PNRTVPTALSSPWYHTYGLAPPWPWSPV 
PLSLPQPQQCSLFSVMEIARLKSFVFPG 






A 


7JOI 


o 

it 


l JO 


IHAGFI S VS VLDN 1 1 Y I CMCVCVCVCVC 
AHMCACISEHITHAPRbPEVFYTP 


9513 


23414 


A 


9582 


3 


144 


YTAGVISVS VLDNI I YICMCVCVCVCVC 
AHMSACFSEHITHAPMLP 


9514 


23415 


A 


9583 


503 


3 


GKCV CLSAVWGS LLS AFRGS LS VCCX/3Q 
FVCLSAVRGSFSVCCPGQRVCLSAVRGS 
VSVCTiSAVQCSNVSVCLSAVRGSLFVCLL 
SGAVCLSFCLLSGAVOIIiC3iAPTPCLSR 
SLAHRRCSVNVCEMTQLISVSYFFCWGW 
WRHRNNIHLQKCRLRILMAYTGQGPP 


9515 

- 

i 


23416 

• 


A 


9584 


2 

■ 


1358 

- 


AGLDS LHKFQVKI EKEKVYVRAS KQALQ 
LQRRTKVMAKCISPSAGYSSSTNVLIVG 
AETLRQEGFSDRIVLCTLDRHLSYDRPK 
IjS KS LDTQP EQLALRPKE FFRAYG I EVL 
TEAQWTVDVRTKKWFKDGFKLEYSKIi 
LIAPGSSPKTLSCKGKEVENVFTIRTPE 
DANRWRLARGRKVWVGAGFLGMEVAA 
YLTEKAHSVSVVELEETPFRRFLGERVG 
RALMKM FENNRVKF YMQTE VS E LRGQEG 
KLKEWLKSS KWRAD VCWG IGAVPAT 
GFLRQSGIGLDSRGF I PVNKMMQTNVPG 
VFAAGDAVT FPLAWRNNRKYN I PHWQMA 
HAQGRVAAQNMLAQEAEMSTVPYLWTAM 
FGKSLRYAGYGEGFDDVI IQGDLEELKF 
VAFYTKGDEVTAVASMNYDPIVSKVAEV 
LASGRAIRKREVEL FVLHS KTGDMSWLT 
GKGS 


9516 


23417 


A 


9585 


360 


239 


Q CRKGLGKL VS FGGLGI WS LI DVLL IGV 
GYVGPVHGSSYV 


9517 


23418 


A 


9586 


160 


1156 


MPRLTFAPKGWPHPPTSLHPGQVTDQTT 
WWLFQELPTPSENSMPPGLSTPTASQEG 
AGPVPDPSQPTRRQIRLSS PERQRLSSL 
NLTPDPEMEPPPKPPRSCSALARHALES 
S FVGWGLPVQSPQALVAME KEEKE S PFS 
SEEEEEDVPLDSDVEQALQTFAKTSGTM 
NNYPTWRRTLLRRAKEEEMKRFCKAQTI 
QRRLNE I EAALRELEAEG VKLEIALRRQ 
SSSPEQQKKLWVGQLLQLVDKKNSLVAE 
EAELMITVQEXjNLEEKQWQLDQELRGYM 
NREENLKTAADRQAEDQVLRKLVDLWQ 
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peptide 
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Amino acid sequence (A=Alanine 

OCysteine, D-Aspartic Acid, 

Ej oiuiiiiijit. /\ciu, i , *~*jrneiijriai2i[]iiiCf 

CI— r^lvrinp W=Wicf iriinp IssTcAlotiAitiA 
vJUJ 1.IIJC, Xl—TIIMIUIIIC) J"I2tUICUClIICy 

K=Lysine, L=Leucine, M^Methionine, 

N=Asparagine, P=Proline, 

Q=Clu famine, R=Arginine, S=Serine, 

TnsTh r AAn i n A V=Vn lino 

i & i ii reunincj ▼ — valine, 

W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















9518 


23419 


A 


9587 


419 


130 


RRDPHLPRIMGRTRLNRKNAETRINAQL 
PLTDKGRMARHVLDNSGE WS VTKRQVI L 

LYMLTHYLLPYA 


9519 


23420 


— * 

A 


9588 


2 


399 


EVTLGEENRKKLQKIQRDQEKERVMRAA 
CALLNSGGGVIRMAKKVBHPVEMGLDLE 
QSLRELIQSSDLQAFFBTKQQGRCFYIF 
VKSWSSGPFPEDRSVKPRHV 


9520 


23421 


A 


9589 

! 


19 


205 


LYCLVEWHDLCS LQ PLS PGFRQ FSCLNL 
LSSWDYRHMPPRPTIKJjSSYSSIKLIYY ; 
LFMLVL 


9521 


. 23422 


A 


9590 


500 


407 


Nil 


9522 

• 


23423 


A 

• 


9591 


727 


236 

• 


RRSRGIiEGREALALCPGDGSHLLCRRTD 
SS FS FMAFFFTFMAQLVIS I IQAVGI PG . 
WG V GGW I AT I S FFGTN I GS AWML I PTV 
MFTVMAVFSFIALSMVHKFYRGSGGSFS 
KAQEEWTTGAWKNPHVQQAAQNAAMGAA 
QUAMNQPQTQYSA1 PNYTYSNEM 


9523 


23424 


A 


9592 


381 


278 


HP S VYKVASGLKEGLSLFG IIiNRCHCKW 
GEKXiJjR 


9524 


23425 


7T ' 


9593 


412 


156 


SQRC^SEiG^EHIAJiAYAllICDNGPSS- 
LLGPPLPGWIYDIAQKYDFSFYIOGLLY 
MIG I L FLLIQPC IRX IEQSRRKYMDGAH 
V 


9525 


23426 


A 


9594 


263 


50 


KHAAP PAS LS LS LLLHHGQ KRACFP FAF 

CRDCQLLEGSPAMIjPVQpAKIjIjVI^E 

CLLCALLIPPSGSRI 


9526 


23427 


A 


9595 


432 


309 


GT FSERGPPLE PRSQTVTVD FCQEMTDK 
CTTDEQPRKDYT 


9527 


23428 


A 


9596 


457 


29 


TLLPGWITAQASEGEIGASLPSSLPPLIi 
YLPSWRCSPSGAIiTHPAACPTGQRSSSA 
LLPSIjPQAPPGFCGMTGLRPLPS psvgc 
CSAPSPQLLCPTQLPPAPESSEGGCSES 
RCVANVKYTRDLGDFLEKCTNGEASALE 
YP 


9528 


23429 


A 


AaAH 

9597 


864 


407 


PbKtUjor iUKi irL^ooAAUAPPPrKijiSK 
RSRTSRPP WGP PHS ASARTQRACTTWYS 
RSWACSCSWITSRPSWMTTKRTCMIFST 
X PTTPRTALWRGLS LWKDAWLLTRLKLA 
PSSPTSMPPTTTCTACSSTWMTCGSPAR 


9529 


23430 


A 


9598 


421 


145 


LTQHNGDAAAS LTVAEQ YVSAFS KLAKD 
PNTILLPSNPGDVTSMVAQAMGVYGALT 

ELDRVKMS 






a 






ou** 


PGRPTRPDlCLIiLER^EQVKtirVMKTFR 
EELYNIEKAFEVERQELLASNKKKWEQA 
LQAHNAKELEYLNNRMKKVEDYEKQLNR 
QRIWDCEEYNMIKIKLEQDVQILEQQLQ 
QRKAIYQLNQEKLEYNLAGAEEER 


9531 


23432 


A 


9600 


409 


191 


HHVGQAGLELMTSSDIiPSLASQGAGi'Kr" 
VSHCARTVYGLS IVFFSSHRGFASVRTP 
PSNENFPDRPISIAHP 


9532 


23433 


A 


9601 


367 


1 


PPRYSPPLGGVRGGVPPGPGVLAPPFPK 
GKP P F FFKNPNLPRPGGGAPYS PFFGGV 
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Predicted 

beginning 

nucleotide 

location 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

ding to 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A s Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Pbenvlalanine, 
OGIycine, H=Histidine, I^Isoteudne, 
K=Lysine, U=Leucine, M-Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S= a Serine t 
T^Threonine, V=Valine, 
W=*Tryptophan, Y=TyrosIne, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GRGISFNPGGQGSPGQKKPPGPPRRGTK 
GFPSKKKKKESTKLPVTSHTTHCWFLPE 
ALDLELDPLY 


.9533 

- 


23434 

» 

• 


A 


9602 

• 

• 


1722 


394 

• 


GWNGSWNDNLVDTSPLKRDPLQDICRRY 
MEDLpCICFYRELNSKTTLKFVHTSFHG 
VGHDYVQLAFKVFGFKPP IPVPEQKDPD 
PDFSTVKCPNPEEGESVLELSLRLAEKB 
NARWLATDPDADRLAAAELQENGCWKV 
FTGNE LAALFG W WMFDCWKKN KSRNADV 
KTCVYOTiATTVSSKTkKATATilTFfiFtfFRE 

All V 1 11 1 ' JL Vwi9I\JLlJiU\l/UJJ\Ci\JCn££tD 

TLPGFKWIGSRI IDLLENGKEVLFAFEE 
SIGFLCGTSVLDKDGVSAAVWAEMASY 
LE TMN I TLKQQ L VKVYB KYG YH I SKTS Y 
FLCYEPPTIKSIFERLRNFDSPKEYPKF 

VSKNSQMITFTFQNGCVATLRTSGTEPK 
IKYYAEMCASPDQSDTALLEBEIiKKLID 
AL I ENFLOPS KNGTGSGRSC LG VP PNTV 
MTLCGAYGNRATRRNCHTLEPCG 


9534 


23435 


A 

t 


9603 


14 


356 


DFVERTQ YTHTQTHTHTHSHAVRLS PRP 
VQSRPEASQTGPTRTKPALGPAHPSPRG 
APCPDDPGSLRMLGYGSRASHNSRRGRW 
GHGRCKDORAMG PHSHMWBAALPPGALS 
SP 






A 




477 


964 


LVETGFLCVGOAGLELLTSGDPPTSASO 
SAG I TGVSHRTWAALST FTVLCNYHYLF 
LEVFHHPKLKLS PH 


9536 


23437 


A 


9605 


1 


378 


EGINFSDNLRRCWMVGMPFPNIRSAEL 
QEKMAYLDQTIiPRAPGQAPPGKALVENL 
CMKAVNQS IGRAI RHQKDFAS IVLLDQR 
YARPPVLAKLPAWIRARVEVKATFGPAI 
AAVQKFHREKS AS S 


9537 


23438 


"A"" 


9606 


417 


2^4 


MVS LTQE LCPVAMRVAEGHNKMLSNVAE 
RVTVPRNFIRGALLEQAGQDIQNKLE 


9538 


23439 


A 


9607 


404 


272 


PPKFIiIiESGPIRWDWWSIiGALMYDMLT 
GAVGAQLKAACI IGLC 


9539 


23440 


A 


9608 


409 


205 


HMNGSLGSGDIDGLPKNSLNKI SGISNP 
PGTPRDDGELGGNFLHS FQNDNYSPSSL 
PCSWSLFYSLPF 


9540 


23441 


A 


9609 


449 


263 


VWICPCDPPASASQSAGITGLSHHLPKC 
WDYRLEPPRPAHCFCFYSFTMRNRNLLS 
LVKYSR 


9541 


23442 


A 

* 


9610 


366 


75 


TPGLKRSSCLSLMSRWGYRIEKKRTIIR 
SWEAIKEQDGREVDWEYFYGLLFTSEN 
LNLVHIVCHKKTTHKLTCDSSS I YYPQT 
RLKRKQPVRKRQ 






A 


701 1 

* 


j 




PGYCGSWVFTCGALROLSGGRDLSPjGAR 
MGNSALRAHVETAQKTGVFQLKDRGLTE 
FPADLQKLTSNLRTIDLSNNKIESLPPL 
LIGKFTLLKSLSLNNNKLTVLPDEICNL 
KKLETLSLNNNHLRELPSTFGQLSALKT 
IiSLSGNQLGALPPQLCSLRHLDVMDLSK 
NQIRSIPDSVGELQVIELNLNQNQISQI 
SVKISCCPRLKILRLEENCLEUSMLPQS 
ILSDSQICLLAVEGNLFEIKKLRELEGY 
DKYMBRFTATJiUOCFA 


9543 


23444 


A 


9612 


179 


329 


KGLAFCPPPGRGGGPPPPLWPPPPSYKK 
IFGLKTPKSGALSPPPPPPGNF 
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seouence 
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location 
correspond 
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peptide 
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otide 
location 
cor res pan 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


G^Glvcine. H=Histidine I=lso leucine 
K-Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vallne; 
W=Tryptophan, Y=Tyrosine, . 
X=Un known, *=$top codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9544 


23445 


A 


9613 


410 


298 


PRFTPFSCLSLRSSWDYRCPPPRPANFI, 














SLSSVLHGT 


9545 


23446 


A 


9614 


1 


130 


NPGSHCVAQAGLQLLSSGNPSASASQSA 
RITGVSHCAWPNTTI 










i 
i 


l*rO 


NTWRLHMVPRLVSNFWAPG 1 1,1,53 WPPK T 
LGLQKRATTMPGPEIYTSIN 


9547 

■ 

• 


23448 


A 

* 


9616 

i 


552 


26 


RLS IRNLLTHAFFAEDTGLRVELAEEDD 
CSNSSLAIRLWVEDPKKLKGKRKDNEAI , 
EFSFNLETDTPEEVAYEMVKSGFFHESD 

CKAVAKS IRDRVTP I KKTRBKKPAGCLK 
ERRD SQCKSMGNVFPGPONTALPLAPAO 
QTGAECEATEVDQHVRQQL IQRE PQQHC 
SSVTGDL 


9548 


23449 


A 


9617 


420 


48 


FSQILRASYEFDSPFWDDISESAKDFIR 














HLLERDPQKRFTCQQALRHLWISGDTAF 
DRD I LGS VS EQ I RKNFARTHWKRAFNAT 
S FLRHIRKLGQ I PEGEGASEQGMARHSH 
SGLRAGQPPKW 


9549 


23450 


A 


9618 


190 


285 


RQGLTMLPWLVSNSWPQWLLPWPLKVL 
GDQV 


9550 


23451 


A 


9619 


392 


308 


SPIKPKIPLSAPRKNTNSVKYRLKFRFG 


9551 


23452 


A 


9620 


166 


23 


FCNPIPSSLPSSLPSFISPFLPCFLTPF 
LPCFLPPFLPFLSFMTLTF 


9552 


23453 


A 


9621 


49 


374 


DRRGIRIMAAALFVPLGFSLLGTHGPSG 
AAGTVFTTA*YLGSKILLTCSLNDSATO 
VTGHRWIiKGRWLKEDS\LPGRKPEFNV 
GPHPPNGDKSSFVFPPPSPCPRLTF 


9553 


23454 


A 


9622 


85 


472 


SHVFPPLRLTLTLHSMAVPETRPNHTIY 
INNLND KI KKD E LKKS LYAI FS QFGHNL 
DILVSRILNMMGQAFVIFKEVSSATNSL 
RSMQGFPFYDKPMRIGYAKTYSYI IAKN 
KGTFLSC/DRKLIK*TPQ 


9554 


23455 


A 


9623 


170 


2 


I FFFFCFERGSCSVA*AGVQWRS YSLL* 
S *TPG\ SSNTASATS VAGTTGL\HHHAB 


9555 


23456 


A 


9624 


2 


469 


RRLCS DRGLQRS LGS MS AAVTAGKLARA 
PAD PG KAGVPGVAAPGAPAAAP PAKE IP 
EVLEDPRSRRRYVRGRFLGKGGFAKCFE 
ILDADTKEEFAGKIVPKSLLLKPYHR*K 
MFMEIFIHRSIjGHQHVIj*FHGYF*EHDF 
MS EELEL\ CRPRS FLF 


9556 

r 


23457 


A 


9625 


661 


1393 


ASPPPSQTTPAAPGCSAGEVSLA\GIjCL 
QERRLVDVAGFS I FI PSRYLDHPQPSKA 
EQDAS I PPGTHEALLQTALSPPPPPTRP 
VSPPOKAkEAPNTQAOPISDDEAS /RWG 
RNPG*CS*SGSPEQWPI*PHLP*GRL*A* 
PSPCCPG*VG/FVHSPQARRRPITTSIS 
QRSWKFWPKPTVFRETSGGPWAMPRPSM 
PSRAS I SLS PRTRSRJCQPRPLTLGCCVW 
HWHTDGERATCGDVRRAHHSPR 


9557 


23458 


A 


9626 


1 


286 


PTMAKLIALTLMGMELALFWNHQASYQT 
RLNALRE VQPA* LPNC3JLVKGF* TGSED 
F\EILPNGLTFISSGLENPWI*RLDP\N 
SPGKMLLMDLNE 


9558 


23459 


A 


9627 


173 


491 


EGPLPLESSSNWGQADLDKKSRELLWKT ■ 
VYLL*LNIQLPYNPVNPILGINPRETHT 
C\HQEMCTRI FIATLFTKAKA* K* FRCP 
*AREW/I K* / I WNIHI IEHY 


9559 1 


23460 


A 


9628 


351 


1 


VLGDAIQSRRGSSRKAGSYLLSRSP/CS 
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Amino acid sequence (ApAlanine 
OCysteine, D=Aspartic Add, 
E=^lutainfc Acid. F=Phen via lo nine. 
G=G lycine, KNHistidine, I=Isoteucine, 
K=Lysine, Lr=Leucine, M=Meth ion ine, 
N=Asparagine, P=Proline, 
OssfwItitfltTiine. R=Arpininc S=Serlne. 

T=Threonine, V=Valine, 
W=Tryptophan, Y=TVrosine, 
X-Unknown, *=Stop codon, /=possibte 

mirlentiHa deletion. Wnnuihle 
nucleotide insertion 














. EGGRGGCWAWNLGTMSSVASGHGAWVPR 
EIiTS * AWPEPC-IPAGOREGRPPPTPCPPT 
CAHGPPAQP *PRGKAAYPG/CI/3LQWPD 
♦SAFNP 




ZjhO 1 

i 


a. 




jyy 


1 1 


"KMMR* *H* AKSOIRNAI PFTIAHK/RDIi 

XVi J YJ aK 17X JTTL±\jJ\t J- JTVA J, C W A«L/UUV/ S^m\m4m\m\ 

KSLKIQLTKEAKDIYNBNYKIPL/NBIR 
DVTNKWKNT P\CS+ IGYLYSTYKYKCIC 
LVNIYL 


9561 


23462 


A 


9630 


226 


1 


KCDNLGYSTFFFFFFPFFFFEIGSYFVA 
QVGV* WHN YS NS / D AS AFQVAGTTGVCH 
HAQLI FNLFVEVGS HYVAQA 


9562 


23463 . 


A 


9631 


241 


1 


VSGCPLPRG*RPPPQPQSPPPSTPGTSD 
HPCTPCR/PALRR*SP**LPCIiDLPPAS 
GPPSTSGNSTGTMQPT*PKLNSSSS 


9563 


23464 

• 


A 


9632 

'■ 


201 

■ 


1027 

i 

- 


MTLTPAPGQGQQSSQCIiWRRPL/PIiPPG 
PDSEEPGSGAAG/ PGSSL* SPDLPPAAP 
PKP * TS S * EMNTVG S TQ EGSGLAQRTTQ 
HCPVPPST/RVDILGTCLQPFPALCLTS 
APWLVRMPPSQRSSREIP*RLEAF/SLR 

PPP*PPKPTHQTIiAGQAPRSGQGIiARPT 
IjPYPPSPIiRVTPRICRVHV*GHLRRPRG 
P P PSSARRPTWDWRVGIiPWEDIiEWLCR 

PF 


9564 


23465 


A 


9633 


103 


379 


YHCITI I VLKCIFFFFPLKPCLILSPRL 
EARGIILGH*NLCLT\3QLLRRLIRKSP 
LTLEGGVCS DPRS PHCS PAWPTKGNFFS 
KGKKKILD 


9565 


23466 


A 


9634 


155 


405 


AEMQGLDIGNGQPPCPKLSSFA*ASQVM 
GTTGPRH/HC* ALI FLVRTGFPHVAKTW 
F 


9566 


23467 


A 


9635 


4 


423 


GAMRGDRCRGRGGRFGSRGGPRGRFRPF 
VPHIPLDFYIiCEMAFPRVKPAPDKTSFS 
EAIiLNRNQDLDPNSGEQAS I LS LVTKMT 

GTMTT*RH/SVADLTVILQILTTWESVA 
S 


9567 

• 


23468 

■ 


A 

■ 


9637 

- 


2 

• 


492 


GSRAIGFGLAHELERHLIKIWXiAS I VR* 
DLA* LMNWSG\SHKDLAGKYRP I LEKAI 

UJjduoDU^W"^ JVriX uriopi VHmi vojjv ic 

RQMLTDFCTHLPNLPDSTVKEI YHFTLE 
KIQPIVI SFEEQVAS IRQHFAS I YEKBE 
DWRNAAQVLVGIPL* TGQKQYNVDYKLE 
TYI1KXARLYI.EDDD PVPAY 


9568 


23469 


A 


9638 


1 


417 


WQGLGLVMGSPSRRLQTKPVI TCFKS V 
LLIYTFI FWITGVILLADGIWGKVSLEN 
YFSLLKDKATIVPFI \LIASGSAI IIiPG 

mm 4m* mmm' mm^mmmml mm ^mmmr mnmm^i m» mmt mmm mr ^m> mm ^mw m mmmw mmmmr mmmmw ^mrmp mp^m mmm ap mm ^mw 

I LLCX FSV* HYPV* LLL VTFXJ * LLVFL 
KKWWISPHFLILTFNYLLFDII IDNSS j 


9569 


23470 


A 


9639 


125 


451 


IYRKPLKSIRRVGGLHLNKIFFFFLKQI 
WGWARGLMPC* HFGRLKKGDCLTLG IGD 
/SRG/RYDHDPALQPRQQS KTIiSLKINK 
IGWAQWLMPGIPAPWEG*AGRSFGAGI 


9570 


23471 


A 


9640 


670 


927 


GHVLL P RRAGCAQ PGTG PARAAP AAVQR 
LPGGGVQPSGPGGRAGQGQGLRGT*AGG 
NPGTGF/ GGPHASALLPPSEAPGSLLPS 
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SEQID 
NO: of 

nuclcntide 

IIUvlCullUv 

seauencc 

• 


SEQID 
NO: of 

- 


M 
eth 

o H d 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nuclerttirlp 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

nuclp— 
otide 

location 
correspon 
din? to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

ur-wiui.ttiiiii. /Villi, JP — X Ucll J lUIUIl IUc T 

G^dvcine. H=HiQtiriint k IsTqnlpiirin^ 

K«=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
ry=r^lutarninp R=Arcrininp> ft=Sprinp' 

T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=»Stop codon,/=possible 
nucleotide deletion, \=possib)e 
nucleotide insertion 














npp 


9571 


23472 


A 


9641 


115 


460 


ISWIFLIICVGFFGGGGGARVPLCHLSP 
RLECSSAIRARCGFDFPGSSDP*ASAS* 
VAGTT/G/SCHHVQLIF 




2J4AJ 






1 


At 1 

4J1 


WVL 1 1 VDVPDNEQANAS 1 1 VKLTDS FTE 

QADQLTAEVGKLLGE* KVDAI LCVAGGW 

AWGNAKFKSLFKNCDLMRKQS I WTWTI F 
cut etmut war*! t.tt . aTrAMA^rr TV""PW« 

MIGD 


9573 


23474 


A 


9643 

! 


154 


408 


I IMNAQS VEEDS Ilil I PTPDEEEKI LR V 
KTj* EDSDGKYGSKI PWNHLPEPENFRLL 
FKHVGYIiD*SGLY\EILTKF*ILCVIiCY • 

Y 


9574 

> 


23475 


A 


9644 


3 


403 


YSQFFGGPKLEDPIRPGVFSQPGEHRGH 
PFQIF* KVGLGNGARPG FP VLGKVKCKD 
NLKLGGQGCSRVEMHPRTSAWAIERDPV 
LKKKKRQR\EHQCSNEQKHTN 


9575 


23476 


A 


9646 


3 


399 


ARAM VLS PAD KTNGKAAWGKVGAHAGE Y 
GAEALERMFLCFLTTKTYFPHFDLSHGS 
AQVKGHGKKGAD S LTNAVAHVDDMPS AL 
VAI^DliHAHKIj£VGPVNCKIiLRH*LLVT 
LGG\HFPAWLTPGG IAFLEKF 


9576 


23477 

* 


A 


9647 


22 


407 


APS AWGMCHFTEED KAT ITS LWGKV\ NV ; 
EDAGGETLGRIiLWYPWTQRFSDSFGNL 
SCASAIMGIPKVKAHGKKALTSLGDAI * 

ttt TMIT *pfPT?»ftT CUT lJr*TWl ITT 7TM3 OTC V 

LLGNVLVTVLAIHFGKE 


9577 


23478 


A 


9648 


243 


2 


Q IATTTLGPGS EGVGKRDLGS SS RHHSQ 
VESVKH\PRDLAQDA*GHGSAPRPLCPP 
PAQS WHCLRG P CLAPHHPTLCEPSR 


9578 


23479 


A 


9649 


283 

■ 


196 


O VTC T ■* T T 'l u l»l 'TJ CT T" \ TCTTT1T T DDMrt T >T i 

S iF I * 1 1 11 i HS L I \ 1 S 1 1 PJjJj FFNQ IN 
NNLFSCSPTFSSDPLTTPLLILTT*LLP 
LTI MAS QRHLS SE P LSRKKL YLS IX I S L 
QISLIITFTATELIIFYIFFETTLIPTL 
AIITR*GKPTPLIQ 




23480 

► 


A 


9650 


1 A 

10 


354 


yiaKVl. IKIji WWW* 1 bA*bRbv oi P*AH 

\ SSDALHL * RELTEQRFI VSVNCAIAHC 
RHQSAS E DHVKLRNEVTE FAQTWDADES . 
AE^CDKSLHTLFGDKICTVATXjRETYGE 

rlrUJ 


9580 


23481 


A 


9651 


2 


404 


AFGTTKWVTYISVLFLCSSAYSRGVLRL 
DAHKSEAAHRFKDLGEENFKALALIAFA 
QYLQQCPF\EDHVDLVNEVTEFAKTCVA 
DESAENWDKI \LHTLFGY * LCAVACFRE 
T \ YGBTAAC* AKHEP VEDE \ CFLR 




2J4o2 

• 


4* 




83 


396 


W le*v» VvjUvjOVFI l^Ia V WKxtJ Ibnl %iK*KE 

PYIQSLRSGNWSERMSHMVAR*RQRHHP 
YYT \ TjTiO AOT .f! MA \ SDPANVMMD9VTTA 

GVLVA/SRRARRISKLGHRI 


9582 


23483 


A 


9653 


67 


954 


REGNHNTERNCKRPPQDTGPTQ/RPGPP 
PPEVPWQDASSAPTPAAAPVGKLLPS*G 
PPGSASCQGR*TLGSVHWL*GSPLPSPS 
AGGRALPYGPAQGQSQGHWENTGRGTPL 
PGSRET* GHELTTGVEHQRHKAPREPQA 
GQPRLCPWVQSLPPT\ARSPSWSQGERP 


9583 

- 


23484 


A 


9654 


1 


252 


RPPQLLASDSLPPSRPPSFPPSLLLSLR 
PPLSRSIiCIiSHYIiCPPLSLIQQLTSFPD 



1125 
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SEQ 10 
NO: of 

mi f* Ipnf - 1 Hp 
II u ticuuuv 

sequence 


SEQ ID 
NO: of 
npntide 

sequence 

• 


M 

etti 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 
amino acid 

HIEJIIIV ALIU 

residue of 

peptide 

sequence 


Predict- 
ed end 

n ii rip- 
otide 
location 
correspon 

U 1 1 1 IU 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
E=dutflmic Arid ir=Phfcn vidian Itte 

AJ VIUInlllll> \*i (J , A A IIUIJUUIIIIUI^ 

G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L= Leucine, M=Methionine, . 
N=Asparagine, P=Proline, 

O s= Olutamine R« A raining S=S*rinp 

T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 

iiuL-icULiuc mscriiun 


■ 


■ 




■ 


■ 




TFSPPKSTTSGPPQPBDSPSS*PPTPSP 
P\PPSLxjSSLPPSLSSAPPF/YVHSVSP 
TISAPIiYP * YNS * PHFPI PFPPRKVOHT. 

ARPSPKTARPPWTliRRI 


9584 


23485 


A 


9655 


1 


394 


SLRIYCFVIEVICFLYLL/FIMYSIS*G 
I *LLMNPTILCIGy*LLQYFFPVWKSP 
YFNFIRIPFILFLYLICLLVF*LIFFDL 
SYP/MCLRLI I FNLFY* LFLLYS I CL WN 
nvLHiiiioiujf * r " r i»u<\f ajjxu 


9585 


23486 


A 


9656 


122 


371 

• 


SPYLLLiLQGGVSWIVWLLNVESTNSSFTi 
F PETGS HCAVQGG VQWCGRGSLQS * TPQ 


9586 


23487 


A 


9657 


425 


3 


KTSMMNSIVPHISIPTLNVNGLNVPLKR 
YRIAE* IKIHQPSXCCLQETHPTHKDSH 
KLKVKGWE * I FHANG\QGVAILISDK/T 
Ur J\AX I Viv/ KJJKJSVatlX XJXJwljVyySNV 
TOLNLCAPNTGALKFI K \ QLLLDLRNE I 
NANT 


93 of 


234oo 


A 


VOJO 


179 


1*71 

515 


MKGNLLESK3TSLSVNLIQKHPHRNIQN 
V/AQNIWVSWNS 


9588 

- 


23489 . 


A 

> 


9659 


103 

• 


373 


LKHVLRLCLQSERFdiHYLFNIDQWGKFY 
SQGLALSPRLDCSDAX LAHCILEFLGSS 


9589 


23490 


A 


9660 


216 


424 


TDINVTTKTIKPLDENIGITLYDLGLGS 
GFIiDMT*T/AQVANEKIG*WDFIRNLKL 
TCIGHYHEVKKTNPW 


9590 


23491 


A 


9661 


276 


2 


PRIGIOjPTNMIEPIiIIKEMIiINIiAMRFH j 

T.QDVOM&T /TITT WCKTDPBJ*r!r , OT?VriT\ 
iOrVKnfU./ J.rtXI\.l\-oINKV_r» w k3X.iCtSI\Jjx \ 

IYHYWMECKLVQPL/WKSSLK/FIK*LP 
IDLSFVPAIPLL 


9591 


23492 


A 


9662 


239 


356 


ICIKRRK/WSGTVAHTCNPSSIiGGQGGR 
in v yjctc yru^y 


9592 


23493 


A 


9663 


1 


340 


RHBLMTPHAFGAMKRVTFISLI1FLFNI1A 
YSRGVT SRDAHR S EDAHP I KDLV* ET FK 
ALVMIAFAQHLQQRPFEDHV*LUJE\VT 
EFAKTCVAEESAENCDKSLHTLFGDK* C 
TV 

xv 


9593 


23494 


A 


9664 


759 


1039 


KRI P FGRPRRVDHLRSGVRDQ PGLHDET 
PS\LKIQKLAGQGGICL*SQLIiRRLR*E 
NHLNPGSRGCSEPRSCHCTPAWATE*DS 
ISKKKNTKIS 


Q<0/t 
7J7t 

1 


A5**yj 




yoco 

» 


i 




T TTTT . VCMITHPn + \7TYT .AT TT CT SXCl KYTWRO 
X X U±j X oris fix *j V UUt\ x x 1 & Jj Ti \a AJoIN 5^ 

KAGR* TLERIxljDVYP WTHRFFDS PGNLT 
SASAIMGNPIVKAHGKKVLTSLGDAIMH 
liDD/LKGTFAQLSELHCDTLHVDPENFH 
ENPCDGS 










1CA 


■i 
J 


APNT P PS VTR I PALLAFRS OCLCxaSCPlj 
PLATEPVGLLLSMS FQAAPCFTWWVTBA 
WLTG I SWPPDFIilSLLCl/HINYNAVI F 
RKYRPRVGAVAHICNPSTLGG*GGWIA* 
AQEFETSLGN 


9596 


23497 


A 


9667 


1 


357 

4 


P I SNAMRHLGQE ETATI TS LWGKVNVBD 
AG* ETLGRLLVI YP WTHRLFDS FGNLS S 
AS AI MGKP * V \ KAHVKKALTS LGDAI KH 
LDDLKGTFAQL I ELHCDKL FVDPENFKL 
LGNVLAT 
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SEQID 
NO: of 

nnrlprttirlp 

sequence 


SEQID 
NO: of 

npntirip 

sequence 


M 

#h 
nd 


SEQ ID 

NO: in 
USSN 

09/515,1 
26 


Predicted 
beginning 

n 1 1 flporirfp 

11 UtKVUUC 

location 
correspond 
ing to first 
amino acid - 

rPciflttA nf 
I Colli UC ill 

peptide 
sequence 

1 


Predict- 
ed end 

UUUV"* 

otide 
location 
correspon 
ding to 

IdSl BIIIIUU 

add 

residue of 

peptide 

sequence 


Amino acid sequence (A»A!anine 
C=Cysteine, D=Aspartic Acid, 

fisf^liitamlf* A pM IT^Phonvf alanine 
es V7 iu ua iiiiK. r — x UCIiy lalallinc, 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMetbionine, 
N=Asparagine, P=ProIine, 
Q=G!utamine, R=Arginine, S»Serine, 
i — i iir cu ii i iic, v— Valine, 

W=*Tryptopban, Y=Tyroslne, . 
X=Un known, *=sftop codon,/=possible 
nucleotide deletion, V=possibie 
nucleotide insertion 






a 
j\ 


QAAQ 
5rOOO 


1A 


loy 


DAGQBTXiARLLOVYPWTQRL FDS FGNLS 
SASAIMGNPKVKAHGKKVLTSLGDAI *H 
LDDLKGTFAQ\LSELH*DKLHVDPENFK 
LLGK 


Q<QQ 

yjyo 




A 


9669 


i *> 

12 


365 


ijliljMJJoJtiKis.^ rut, I U&/U?CbDAMjNTVEM 
TTKNLE+SINLVDKAVA/SGFERIGFNF 
EKSSTLGKMPSNS IACCRE I FHERKS /S , 
MWQTSFLSYFKKLRQA/PPAATTLNDHQ 
PSALRQDPSP' 


9599 


23500 


X 

■ 


9670 

! 


346 

• 


2 

• 


YPYDPWSLINNMFFLFKRFQNFFLKEGT 
Lir P FKGFRGDItS IiFFWPP YKLFNLKS F 
L/CKFLE I * RYFFG * I IFFFPFFFFFFL 
RQSLTLSPRLECNGTISAHCNLCLPD* S 

\TOT3 


9600 


23501 


A 


9671 


121 


364 


HKKKTAGRVQWPLLVIPALWEAKEADCL 
SSGVQGQPGQHSKTPSLP* PSKSWDYRC 
X JVPHxiA&r UNrrbr r c^RIajVXiIjCCPGW 
P *TPELKQS A/ CLGLPKCWDY / RARATA 

Or!T.T.t?T?\ T7MT . WT7T .17T^t> AT 7"M T A.XTAKT* I/TT 

LSEPDVKNEICFV 


9601 


23502 


A 


9672 


437 


223 


EME/ FSLLLPGLECNGTCIiAH*NIiRLLG 
S SD F PAS AS PVAG I TGVRLHAQLELYFL 

XTT.T/217T?irT TCT \7\7VT.G 
rilMJJV Jc J? X X &JL» V V Z Xio 


9602 


23503 


A 


9673 


100 


.338 


KSRPS AVANLTPVI PALWEAEAGGSPEL 
KQFSLSLLSSRDHRFATPC/RG* FFIFF 

VPMPPttf , VACM ,, Y5PPT .T.HQCCD ryp q a 


9603 


23504 


A 


9674 


161 


1 


SFLWKLCIiRGAPSCMRCQLA\IiL*DVSQ 
LGY/SGVRDPLEEAVCPFSDLKLRAGR 


9604 


23505 

■ 


A 


9675 


304 


1 


VPPLASQLGDKRRIFFQKKKKKKKNKDK 
IHIIISIILKKFDKI*YSLIIK\TL*KL 

uMa" 1 IliWXXJVVX Xi/KlrXAoXXXioUCiivLi 

KSFPLKSGR*QECPIiL 


9605 


23506 


A 


9676 


Ul 


346 . 


S KEGS KEGS LEI WGFLVLAFP FP FP FPF 
S FFFF \ * RQAGRPRLECRSMI TAHCNLE 
LLGSSDPPASAS\QVAGTTGPTSP 


1 




TV 
it 


9677 


3 


349 


iutJUsKX JL fJJijWijVjr J? r cc t tr irKKl fJLi 

FTRDKGGGKLQTAKKNNPHPVKREKKKP 

KTFGGGAHPPDPTLG/PSPQS*KRPFGK 
RGPPP 


9607 


23508 


A 


9678 


38 


368 


QASLLKESESERKSPTSLTLSQKLGMIR 
T.n?wnM«;if aprzin*PT^3T T.w/nvcfYr uwp 

KBKLFKEI KSAISVOTLMIRK/ RNSLTA 
DMDKWVAWIEDLSSQNNPLSQSLIHSK 


9608 


23509 


A 


9679 


187 


345 


PFILFFETESYSVA*AEMQWCDLGSLQT 
PPPG\SSNPPVSALWEARAGGSRSQ 


9609 


23510 


* * 


7O0V 


199 


•two 


ERTSLFTIHLLKLLTITNSAAMSNHV*V 
> FV*T*IFLSLGQMPKSVTAGSNGQSIiFR 
FFK\NCQTLFHGGCAMLH/SHTQLNAIP 
LFRNLWQHLS 


9610 


23511 


A 


9681 


141 


367 


GSWAASPCSCTMDDFAKESFTVVDYVLL 
ENCPNMGDYWAPQFMTDNYVRVTQLNW 
DGVGTQ*KDYI/SSERNL 


9611 


23512 


A 


9682 


193 


3 


IFGERR* ILKI YKLTGHVGLRL*SQILR 
RI/RLNLGGGGCSEPRSCHCTPVWATER 
DSISEKKEK 
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SEQID 
NO: of 

nnrlpntifte 

IIUUCVUUv 

sequence 


SEQID 
NO: of 
nentirie 

sequence 

< 


M 

etb' 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

nnrle- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine,,D=Aspartic Add, 
K=Glutflmic Acid {^Phenylalanine. 

G=Glycine, H=Histidine, I=Isoteucine, 
K^Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 

nuLicunuc iiiacruuil 


9612 


23513 

• 


A 


9683 


1900 

* 


1988 


, NLVH FE EHFPSLFTP S P SLCHI F / CSRD 
RVSPY*PGWSRSPDIi/RYLP/CI*DLPKC ' 
WDYRRE S P CPAS S RFLMNFLYF *NTTLW 
fj * KT T PTjTjF* QGDC! /S* PP * VWPT JVHT«3 

STMVNSGISRNSRPVFKLADKAPSHSPL 
SIIAFGVEFF/NLGSLQPQPLKLK*ASH 
LNCLSSWDNRHAPPCPDNFFFNIL*RWG 
FPTTiSRMVUJSW /PLCSI.9TS ASOSSG / 

ITGVSHCAGPILVF 


9613 


23514 


A 


9684 


69 


358 


ILilVKVTFPHTQKPNFVLRIiSLQSIQQM 
LLPERKWTHAQKTCVSSENTERCHEKQV 

TTQ TJWT .fiMVATTCi njp C TT ftftftftfttfTT * ft 

QVFET/5LANTVKP 


9614 


23515 

* 


A 


9685 

• 


66 


344 


IGEKLIiLTDKHRNWVLETE * APGKDSVN 
IDEMT/TR/DLEYYIN\TDKAVTGFERT 
DFNFKSS IRGQSL* YSII/CAKGSQQKQ 


9615 


23516 


A 


9686 


269 


1 


PKIQNGHCKTLPKNPSQSDTKTFPLFTS 
GSSPQPHKIiSFCFSAGPTLPSKTQLKIH 
L\WQGTVPHAYNPSTLGSRGGRiT*GQE 

xrclXXy 


9616 

< * 


23517 . 


A 


9687 


229 


361 


dry y ykshfkpgaeahac* lviias ffmc 
yraimink*hlyhrkl/vyqvirh*kyr 

n / T * TWVTT .TTTT. * TIP WW <3 WPff Dft AF 

AHACNPSNLGG * GG * 1 1 *GQE FKTS LAN 
MA 


701/ 












AU XrA 1 IK L/cmjVt X V^lVCi JL XJJWJJX ^ X XIxXXVX X 

KRLIQHD* VEFI PEME/NWFNI * /RKS I 
NVTHYIHSLWEEKNYMIISLDVADVSDK 
IKLTFLIF 


9618 

■ 


23519 


A 


9689 


168 


3 


GNSNTMIFFTILYLFFLYFF/LFFIFDM 

p 


9619 


23520 


A 


9690 


192 


3 


WHKASLSNPQPAGCMQPRMAT.HF.AQHTF 
IT*GQEFK 


9620 


23521 


A 


9691 


388 


1 


CRSAGVCWRSTPDPVCLGITSSGCRTAE 

\R* TLL * GVSQSGGTGVRDPLKEAVCPL 
AELENGARRSTALFRASRQDSLSLLKLR 
PQLPLLPGALSQIEGI 


Of: 91 
>Dx 1 




A 








WFLIjPOVKTjATMTPRLiYPPNIiPPPPFF* 

WW JL JJxJLT \£ w l\xxnj. I^i A xiViJ X X7 xTLvAJc * * 47 I? K 

DRV/CTVAjQAGVQWRDLSSLLIiSLLGSS 
SSPTSASRLTGTIILQHHAE 


9622 


23523 


A 


9693 


305 


I 


NHAITTVN* FGL I RHLVTKAAFNSGEVD 
TV/SIKDPLTJ^TjNYTVYMFHHDSTHGKF 
RGTVK 


9623 


23524 


A 


9694 


320 


1 


PRDP PAS ASQSAGI TGI SHQAWPDM IL Y 
IENPKDSSKNPLGLINXYSKVAGYKINT 
QKSAAFL*TNNYLKN*P/MRTIPFTIAT 
KiCKYLETYLTMlWIU)LYTENYKM 


9624 


23525 


A 


9695 


263 


1 


WILLFSESl^SPRLECSGVISAHCTL 
CLPGSRGALSIiLSRvDY/Y*WLVKAGF 
PJW/GQAGLEFLSSGDPTVPSLPDFWDY 
RCKPR 


9625 


23526 


A 

-- 


9696 


262 

- 


359 


IRKIHKIWSGVVAHTCNPSTLGGR*TKA 
HEGSLYCLRVGKI VSNKVGTRS * FFRTQ 
K* V/HLFFS * vyhingsh*srtsl* IRK 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 

- 

• 


M 

eth 

od 

■ 


SEQID 
NO: in 

09/515,1 
Zo 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E—Glutamlc Acid, F=rnenylalajnne, 
G=Glycine, H=Histidine, I=Isoleucine, 
K— lysine, L»— leucine, vfk— Methionine, 
N=Asparagine, P=Proline, 
Q=G lutamine, R=Arginine, S=*Serine, 
T=Threonine, V=Valine, 
w=i ryptopn an, y== tyrosine, • 
X= Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IHKI WSGWAHTCNPSTLGQRVQWI T * Q 

Q 


9626 


23527 


A 


9697 


166 


473 


TQDTSQPWKNIWISFGSLKVFENVDIHK 
LYLELDNRHRSWMTDILLFFYF*DGSFT 
VT\RLECNGAILAHCNLHLPGSSNSPAS 
AS * VAEVGGTLE P RRLRLQ 


9627 


23528 


A 


9698 


456 

• 


59 


LFLWKLHLRGAPGYVRC3UiA\LIjGGASQ 
LGYSGVRDPLEBAVCPFSDLKLHAGRTT . 
TLFKAVRYGHLCLQKFLLPFVRLCPAPT 
GGVFTVRQASMSCDGLLIV* \ VSERH* L 
PKPQQFWE L V PKFVRFQLKSL 


9628 


23529 


A 


9699 


4 


375 


VFCS S VTI VSE I AFLMWIiSAQLLL VYRN 
ASDFCMSSLYPETSIiTKVTQGQAL/H*N ' 
G*LT * * EQYWVKTDQKAVI / CQQSTGSS 
GQS S QARE I KG PGAVAHTCNP S TTiGGRG 
RWIT*GQEFETSLA 


9629 


23530 


A 


9700 


239 


361 


RNDTFFFFETESNSVA\RLQCTGVTLAH 
♦NLCLPGSSNSPA 


9630 


23531 


A 


9701 


169 


1 


IFYWKAI FFETESHS VAYAGVQ* SNIX3S 
IjQLPSS\GSSDSPISASQEAGTTDAHH 


9631 


23532 


A 


9702 


37 


348 


HAKNFDVSLLPYTKT^EHFVKEDIWIAN 
KHVKRCS ATLVTKE IPPQS /TLKWLKH* 
PPC*QGNGRTGSPIP/C/WWECKIVQLW 
KIV*PFPKKLSIYIi/PY/DPPILUjGI 


9632 


23533 


A 


9703 


16 


339 


ARLNTFAMAAKC+SERKS/RN/SLTLNH 
KLQMIKLIEEGMLKAVNGQKLRLLYQ/T 
SQFANAKDELLMD I ISATPATV*MIG * Q 

_ _ *m* mm mm* w> mw mJmi Mk« Wmt • ■ - « J * mm-m* m « w * _ 

NSLIADLEKVRW*LDDQ/TSHNIHLSH 


9633 


23534 


A 


9704 


276 


2 


G I SRGGLS KPP FFP * DFPLD YLGQSP FC 
KVKRNS/RFRTTKSPLDFPIPPLFFFFF 
EKvSQAISAHCKIiHLPG* CHSPASASRV 
AGTTGTRH 


9634 


23535 


A 


9705 


352 


2 


CNSRSDFFQLMMNCFLRWAVSPRVECS/ 
GVTTAHCNLNLPDS VDPPTSTS * VAGTT 
GHTP \ HTWLI III I IVAGIFLFLFFLFS 
FFGRDGVSPCCPSWSQTSEFRQSACLSL 
PKCWDY 


9635 


23536 


A 


9706 

■ 


6 


351 


I WNSRPRRP LGGQGGR I QDWP KLHGKTR 
SL/LKKIjQKLA^GRASLWSQVIiKRIiRL 
twvdclslgg* gcsqp * sh/ 1 hcs paw 


9636 


23537 


A 


9707 


76 


359 


CFLFVC3jDFVIFLAYFCN*YLFPFLISH 
tfffflkkglifapgvnlrdqnkfyftl 
FDVFFNIYYC*IFFIF/CLVIiLFFIFFF 
FFYCFVtmFIVL 


9637 

■ 


23538 


A 


9708 


342 


3 


GEELLLMDEQSKWFJjE\TPGENATNIVD 
LTTKDLEYS ISWDKAAAGFBRIGSN*N 
STLELKTVN/SVPHAR* IFCKRLQSVWQ 
TSLLPILRN\LP*SPHPLVSAMLIIQQS 


9638 


23539 


A 


9709 


211 


350 


RQDIALLPRLE*SGTITTH/CDPQLSGS 
SEPPTSASLVAGTTGACHYV 


9639 


23540 


A 


9710 


253 


3 


VKCFSREHMIiIRRSAIGGKKHIEKGTLI 
MSFWWYVSQSEH/NLCNHYAY\SHIKK 
VWIiGTVADVYNPS TLGGQGGRI T * GQEF 
E 


9640 


23541 


A 


9711 


80 

• 


324 


LI TLTMASMCSNE R KSHTSLTS NQKLEM 
SKPTEEGMLKAKTS * KLGLFHQT / SQLV 
KTKKKFIjEE I KSTTPLNAPLMRK* NSL 
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SEQID 
NO: of 
nucleotide 

sequence 

■| 


SEQID 
NO: of 
nentide 

sequence 


M 

eth 

od 

If V 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 . 
26 

- 


Predicted 

beginning 

nucleotide 

location 

correspond 

jne to first 

amino acid 

residue of 

peptide 

sequence 

• 


Predict- 
ed end 
nude- 

IlilVIV 

otide 
location 
co rr es Don 
ding to 
last amino 
acid . 
residue of 
nentide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. ^Phenylalanine. 
G=Glycine, H=Histidine, l=Iso!eudne, 
K=Lysine, L=Leucine, M=M ethionlne, 
N=Asparaeine, P=Proline, 
Q=Glutamine, R=Arginine; S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion. \=nossible 
nucleotide insertion 


9641 


23542 


A 


9712 


246 


3 


, QLAKHGLLFFLFFEMGSH* LRLECSL \ T 
ALLSKI SAHCNLNLPGP\SDPPTSASQV 
AGTTGAGLLAQLI FKNFWETGSHYVA 






a 


y i ij 


7 

it 




AAASTT I KEQDMLKFYFLFYFFETESHP 
VTQVGVQWCNLCSIiQPPPP\GPSDSPAW 
VTBQDSVSKB* KRiCKLIKINVYLQVTTH 
P*HSPQHIVRVLSVLN*LNEKINRRVTK 
SSSKEKDR 


9643 


23544 


A 


9714 


278 


53 


KLFRGLNNHRKFQTRCGGLFFFLFIYF* 
DGA/YALSLLECSGTIMAHHSIiDIIiGSS 
HPPISASLAA\GWRTCSPSC 


9644 


23545 . 


A 


9715 


166 


402 


TLSLKKNENISRAWWHVPWSGVWKAEA 
P+ST,VPRSIiRlA HCTPAWA 


9645 


23546 


A 


9716 


168 


1 


VDC I XMRRS IAKS PRRKCRGT 1 3 ARCKL 

RLSG\SGHSPASAS *eagvtovghdpr 


9646 


23547 


A . 


9717 


1 


553 


TS IRLFFLLS I FFFSRARNRILFKI YCN 
GVRAKGRGGGWGTCRGHRNTMTWPGPQL 

PPPPGSTSRPWTPRSPWPLRLHLFSASG 
PPGSSSCSP*LFPLTDSQVGRCSGPDPN 
PPYHLMKVQPLPALtiliPIiLKPHRFPSLG 


9647 


23548 


A 


9718 


13 


364 


PPNR*RNPLIELTNHSLIYLPTPS\GIS 
A* *NFGSLLGACXILQITTGLFLAMHYS 
PIRLOLAFSS IAHITRDVNYG* IXRYLH 
ANGASIFFICLFLHIGRGLYYGSFLYSQ 
T*NIGI ILLLAT 

♦ 


9648 


23549 


A 


9719 


82 


351 


GEALEPELPMPAXiSH/HAATLIiAPLPTT 
PFVRTKVTTVROAO W /HOAL* SOHFERP 

RRVDYLHLFSRNGQNPISTKNTRISWAW 

WRVPVIPA 


9649 


23550 


A 


9720 


179 


365 


FSLF LAS LKAQ II DS KPFFSNTLENI * K 
YEVINFFL YI \ I KT iHFMNLTS LC FYCHS 

V V»> X X 1 1 1 i 


9650 


23551 


A 


9721 


2 


320 


PI PPGNE S SP P TAQEDMQAANKH / IRRY 
STSLAIRET/OL/KTTTREPYAPITMAT 
TBTSAOTTCWRJBYGESGSYCWVGCKTLR 
PLCKTVWQFLKKPSM*LAYDPAITLF 


9651 


23552 


A 


9722 


1 


344 


PLPTQNWPQGQSFIiHYPVLIiPAVaVLWS 
S FFFFFWKQILLLPPRLEG/NGQNSG* W 
KFPLPGPSLFSCPS FQTSGNYGPPQQAR 
AIFWKFKI KTGFHGVTRBGIiNFLTSGSA 
PLGS 


9652 


23553 . 


A 


9723 


156 


3 


VTEAYFILFYFETESHSV*WHDLGSIiKS 
PPPG\SSNSPASASQVAGTTGAIiH 


■ 

* 


• 




y Iv* 


10 

• 


jj / 

• 


YLS I YLS I YLAI YVS I I YLP I YHPS IS I 

x i hi x x J x x uni x v w x x aajc «k ^ x 

YVSS I YS INHLPLYYFSTHLS STHSS I Y 
LLSISILSINYXjLLSISZjS I YLSI*I*S I 
YDSSIHLSIHLSHLYIYLI*YLSIYLSI 
YLAI YVSI \TI YLS I IHLSLSMYHLFIL 
S IIYLCI I FLPI Y/LSTHSS IYLL3ISI 
LS INY/HTS I YLS I YLFVY/HLYI YHPT 


9654 


23555 


A 


9725 


249 


436 


ELAG* IQDHP/GQHHKTLSLQQIKNLAR 
RGGRHL* SELFRLRQEDHLTPGVRGCSB 
L*SHHSTP 


9655 


23556 


A 


9726 


177 


3 


HLN**FSNLIFFETESPSVT\RGIISAH 
RNPRLPGSSDSPTSASRVAGTTDT 


9656 


23557 


A 


sin 


272 


233 


RKNQRI YQIARKRLNEMARI S PLRSMI I 
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nucleotide 
sequence 
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SEQID 
NO: of 
pepuoe 
sequence 

• 


M 
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SEQ ID 
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09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

lug iv iirsi 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucie- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue or 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
Hi— vriuianiic Acid, r— rnenyiaiflnine, 
G=Glycfne, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
n— •Asparagine ) r^rroiinB) 
-Q=Glutamine, R«Arginine, S^Serine, 
T=Threonine, V=Valine, 
Ws^Tryptophan, Y«Tyroslne, 
a— Unknown, *=stop codon,/=possioie 
nucleotide deletion, ^possible 
nucleotide insertion 














LNVSGLNFPLKRCRLAE WTE KKK\ D P 1 1 
CCL/ QKTHFTG/ KDI YRLKI KGWKKI FH 
ANGSQ*Q 


9657 


23558 


A 


9728 


368 


380 


RKGQRISEIAIKRUJKMARISPLISI II 
LISrvSGLNFPLKRYRRAEwTEKKK\D 
CCL/ QKTHFTG/ KDI YRLKI KGWKKI FH 
TNGSQ*RR ! 


9658 

- 


23559 


A 

■ 


9729 


294 

■ 


2 


KGNLS PKKKKNLKNFKVTPQNFGLKDQK 
TQGKPTLKV SQNQKKP KFK* NLKGKRPIi 
KLFPKKKKLNPSRSWFYEKKKK\INKLD 
GTLVRQ INKKRKNQ 


9659 


23560 


A 


9730 


7 


284 


SQGLGRIiRRBNYL* FETBSHSVARTGVQ 
WRNLSSLQS PPPG\SSNS PASASQVPVI 
T 


9660 


23561 


A 


9731 


1 


322 

i 


RSLKKKKRRBKKGKNQPGQHGKTLFLPK 
TPKI * KI KKI YPG / RGGTHLNSQLLRNL 
RKENHLNSGGKGCNDPKLGHC I PAQMTK 
TPFKKKGGAKQNGKKLLQLIYGIK 


9661 


23562 


A 


9732 


168 


2 


GFYRKSLSCLSVWICFKDFFFFFEMVSR 
SVA*AGVQWHDLGSMRPPPPG\SGDCPA 


9662 


23563 


A 


9733 


43 


317 


IiKLQPRDTHCIWLST?SPFIYFFFETES 
CSVAQA*MQWHDLGSLQPPSPG\SGDSP 
ASAS CTGMHHHTWIiMKCINS fdvpdmpl 
GLGLNSHI 


9663 

■ 


23564 


A 


9734 


8 


296 


NWSACSLNGLVKGRLS CLTLNQI L * MI 
TLTTERFLKAERGQKLGLLHQTA/ QWN 
/AKEI FLNEVS SATPVYEGI I RKLNSL I 
ATME KACWW I EDQT 


9664 

■ 


23565 


A 


9735 


3 

• 


300 


TRPSNS PAALSKCSSRRKT/R/ TCLTLN 
QKLEM I KLSKEG I LKAERGQ KLGLLHQT 
A/ QWN/AKEKFLKEVRSATP VYEGI IR 
KQNSJjIANMEKA*VVWIEDQ 


9665 


23566 


A 


9736 


352 


1 


PFFYLLNFPTFFKGPPLIHFFLESFLEN 
FGFQGAFFKLQNFFAFLFSLLNGPFLEN 
PFYFKFPINC\KFLGONTFYCPYLGFF* 
Rii If J*' b'fifb'lf E'X&a KP VA \ RLECSGAI S 
AHCKLG 


9666 


23567 


A 


9737 


130 


3 


KI FFFFFFFFLRQS LS VAQAGVQWC/N* 
LGSLQPPPAGLKRFLV 


9667 


23568 


■it 
A 


9738 


167 


*> 

3 


I SRS VFC FFFETES CS VTQ V * EQ WRNIG 
SRRPQPP\GS SNYCASACRVARFTGLV 


9668 


23569 


A 


9739 


140 


419 


INVKYLSRHFFKGNIQSPNKHRKAIREI 
QIS \ * TDNNKCW * GCGEVQ YF IHCCMCE 
MVHSLWKXVW * LLTGL/HI ELSDDPAI L 
LGGI PPEEMKT 


9669 


23570 


A 


9740 


93 


371 


KLCNGYYQESKREPTERN* IFANHKTKG 
CISKTYKELQLNNKNS/NPTQKWAKDLD 
ISPQTMKQTAKtTQVKRCSTSLVIREMQI 


9670 


23571 


A 


9741 


1 


288 


GTRDHLRSGVRDQ \ PGQHGETRSLLKIQ 
N*GCGHGGSCLSSQ\LTWRLRQVNCLS 


9671 


23572 


A 


9742 


296 


563 


FIKHLNSYMLGQNLATKLSBICKLPFYI 
EK* IQAGYGGTCL* SQLLRRLRHDNCLN 
LGGGGCS E PKI \ HC CTPAWATDGGSVS K 
KKKSRR 


9672 


23573 


A 


9743 


32 


400 


DALVPHWSS YS P ICLDLNVLGIYLFSDR 
VSLLVPRLECSDAIMAQHSLDLPRLR*S 
SHLSLPRNWLMFVILVBTGFCHVA*DGR 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
DeDtide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN • 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamlc Acid. F=Phenvlalanlne. 
G=Glycine, H=Histidine, I-Isoleucine, 
K^Lysine* L=Leurine, M=Methkmine, . 
N=Asparagine, P=Proline, 
Q=Glutamine, R^-Arginine, S-Serlne, 
T=Threonine, V=Vaiine, 
W=Tryptophan, Y=Tyrosinc 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \ppossible * 
nucleotide insertion 




■ 










. KLLGSSNPPALA/ FLKC/WRNMRKSQL* 
PLLLOLLSADHSD 


9673 


23574 


A . 


9744 


60 


356 


HSEVDSI I IPILQGRKQISEVRPQRMCE 
REGGLF I S ETLL FLCPAPS CS /HFLTPA 
Q* SYF* KNVSWLGAVGDTCNPSTLGGRG 
GWI \RGQE IKTIIiANMV 


9674 


, 23575 


A 


9745 


2 


446 


PMVNKHVKRCS AS LV I RAE Q I KT IMRSH 
CTPNS * M* KTDNTKSW / *GCGATGTP IH 
C / WMKYKMVQ PLW KKAQQFL\ KH IHLP Y 
DPINLLLGIS*EK*NHLC/ YKKTYVRI F 
IAALF 


9675 


23576 • 

i 


A 


9746 


257 


1 


VWLRLSFPSFLLPILSPSSQGDLVCCFL 
MVF*RDGVSLPIPAGVQWLDRSSL*P/P 
S PG S S S P PAS AS * VAGTTGMHLHAEP YA 

sc 


9676 


23577 


A . 


9747 


148 


3 


IKLCEQFHK*TIHVLNHK/I»/WPGMVAH 
AYNPSTLGGRGBWIT*GQEFDSC 


you 

■ 


23578 


A 


9748 


769 

• 


I 


* ENKKWVLNHKGATLQ/ EGKGNFGDTFK 
VTLKDKXAVAVKTRQERLPQELKLKFLIi 
EAKILKQYNHPSNVKSIGVHTQRRPIYV 
I KELVPGGDFLS FQRKKNELKL \ VKFS L 

NNVLKI SDFGMSRQEDGGVYSSSDLKQ I 
P I KWTAP E ALNYGRYHSESDARS FGILL 
WETFGIjG\ VCP * PGMTNQQPQEQVERRY 
WMSVSWQC 


Q£7R 




A 








S FS S S S S PEF I S YGCDLLPMRDPHVFI T 
HPAFNLPLFLALGMFFSLFSPSPLFIYF 
LPPPPRPPC*HSPP\PPPPLTNSLFVFL 
PPPPGVPSSPPNTSPPHLPVPRPPPSPP 
TSTHLSSPPPPSPTH 






A 


y / j\j 


j»j 


JJ7 


AR*GLSCSG*S*TPGFMGSSCLNIiPKCW 
DYRC/RATVPSLCFVF 


9680 


23581 


A 


9751 


213 


453 


NLAVNVDE INLFFFPLFLI VTLASKFCS 
/HLLVGR* FIQS CHYFLFF I FDTESRS V 
T\RAEMSATVLAHYNLCLPGSPDSSA 


9681 


23582 


A 


9752 


368 


3 


PLSSPRLPFPLFDPSYIFLISISSIRFL j 
SPPCYFLCVFLS ISWGFSLVSQDGSDLL 
SS*SAFLGLPKCWDYRI.*PPHPAWRELS 
FFFS /S FFYLR*GSLSPSLBCSGMI IGH 
CSLDILVSSARA 


9682 


23583 


A 


9753 


2 


335 


ARAGFSGGDWNVGMQSTASGLSLFHCFI 
S*GELFIILYILLKCYITL*IN*LMFGS 
FYFRDLQHFRLP / TS I VGKKRPGTVAHA 
YNPNTLRGRGGWITQGQEFETTLANMVK 


9683 


23584 


A 


9754 


467 


120 


SQLFGRPRQTDHLKSGV*DQPG\QHCET 
PSPLKIQKLAGRGWV/CAPIGPRLLGRP 
R* ENGLNSGGEVCS E PRSRHCTPAWVRE 
RDCLKNQNKTKSGVLFHSPHQCFLISNT 
SRNLL 


9684 


23585 


A 


9755 


137 


368 


DGVYLWTHRPYCGLGSLNFGSVI IVLP * 

VKAYGWMVIiTSLGDAI QPLADPECS F\G 
QLRELRCDMLHVDPEDFRLLGK 


9685 


23586 


A 


9756 


172 


267 

i 


SKSGNKP1^YHLMNGSVRWCIRTV*QLL 
KRLTTELPYGPAVAFLGVYAREMKAYVH 
MKTYSLQKMCrr*IFIAALFGIVKKWK*A 
K* LSPDEW I3K/M/WCI RTVE YYLT I RB 
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SEQ ID 
NO: of 

UUIICUUUC 

sequence 

• 

* 


SEQ H> 
NO: of 

sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nurlpntklf* 

location 
correspond 
ing to first 

Amino Arid 

residue of 
peptide 


Predict- 
ed end 

mi rips. 

otide 

location 

correspon 

last amino 
acid 

ICOIUUC Ul 

peptide 
sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 

R=Glutamic Arid F=Phenvlfllflnine 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MhMethionine, 
N=Asparagine, P=Proline, 

Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 

nucleotide deletion, \=possible 
nucleotide insertion 


7Q0\J 


1.3 JO t 


A 


y t j f 






OFWQFS FFFPFPCPTDTETCS VSOAGFVO 
WHNQSWLTA\P*SDPPAS\AS*VAETIG 
MCHHDQLI FFFFKQALCPRGWSAMAQSQ 
LTSSS 


9687 


23588 


A 


9758 


17? 


1 


KDTDRVSRGNTQDTLKPPPPG\SSDSPT 
SAS * VARI I GVHHNTWL I FC I LVEMRFH 
PRA 


9688 


23589 


A 


9759 


219 


208 


NFCNFQMPLYQKILFWVRVSLLSPRIjEC 
SGTI ITYCSI*/RTPKLK*SS*LSLGVQG 
CSKL 


9689 


23590 


A 


9760 

( 


133 


371 


AMAYQLYRNTTLGNSLQESLDELIQXLR 
NRVNFRGSLNTYRFCDNVRTFVI^DVEF 
RBVTELI * VDKVKI VSL\DGKQTOF 


9690 

• 


23591 

* * 


A 


9761 


813 


3 

■ 


CGEGWAAGQDQGSRMGSCLPVPRftGRQL 
GSLFPAFQVYCSS CGTQRSSQSQWHKTE 
RSLPGNNPAP QDS / ASAPXYQTPGPVLG 
VQSPLNQQSCHTALSPEASSSAFFHTTP 
ATiilmQARTJjSIiTQQSRPTTw PvLIiAMP 
A*RPRESRH\GR* EWTKRSQARSQEAIN 
ARPRKALGLSDPAPSLHPGSLEDSPKFS 
iXiiVriS-Ar' JrjjJrAo o Jr X 1 yyorlJLoyyi' 
TATGFLGARLCPSPQHICQRGGFPGVTP 
IiLLLGEWGVGEHS FPDVLV 






41 




. 

yo 


511 


LEXtLTS \FKEFLGTYNKLTET* FIiDCAK 
DFTTRERKSEETTIiSEHCLPKYSTMT\H 

rislt\ftdnhi 








y/oj . 


I04 


700 

jyy 


NIVEMTTGDLE * CINLVDKVAARLERTE 
TNFE/RCSTVRQ/MLSKSIACCTE 


9693 


23594 


k 


9764 


225 


25 


KCKITMNST* IE/TVFKDYYEHLYVCKL 
KNj^KTDKVLTIHNFLRLNQ*EPEVLTR 

a TnVTVTT T VNT1 T P 
nJ\n I Iv X IviUlL 


9694 


23595 


A 


9765 


128 


218 


FFKDLPDCFPKWSHCFTLLliAVYBGFTF 
ST*LVIREMQIKTP\I*YHFSPNRMVIM 

*PLWKTVWQILKKLIIG 


9695 

i 


23596 


A 


9766 


194 




KNFFFLEME F/ SVLLPRLECNGVISAHR 
RLRLPLSS YS PAS S S QVAGD YRACTTTA 
G*ILYF* *RQGFHHVGQAGLELPTSGDP 

KGLTGVTS 


9696 


23597 


A 


9767 


238 


3 


SFLWKLCPRGAPACI^CLSA\LIiGGVS * 
SGYMGVPJDPLEEAVCPFSBTjEiniAPJlTT 

AJjFPJ^VPQGCLSLQKLSAAFCSC 








7/OS 




i 
i 


R (TAHHSRGDC VP FTRFCBP T P VTJJWP TTP 

LFPFFV*KSPIERSSLLGPPLKRSKNRF 
PFLR/VPRFFKNHPEFFFKKI FLFPPFF 
FFFEMESHSI^\RLESHGAILAHCNLSL 
PGSC 


9698 


23599 


A 


9769 


191 


2 


FRINH*TSH/LRGQKKKKKTNSKAIGRK 
QITKIRA* LNETQPPESIQR INET*S * P 
SGKIKLI 


9699 


23600 


A 


9770 


165 


2 


GKPPNCFYILNFKKKKGQDVFNRT/WLG 
WAHACNASTLGS QGGW IT * AQECKTS 


9700 


23601 


A 


9771 


97 


912 


VIIiSTGCSSGPLPGWQSVLHHSLTRCPF 
SFFLSSPTPEME/FSLLLTRLECNGEIS 
AHCNLLLLGSSNSPASASGVAGNTGI/C 
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SEQID 1 
NO: of 

D UClCUlluC 

sequence ' 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

nil 


SEQID 
NO: in 

UdOll , 

09/515,1 
26 


Predicted 
beginning 
nucieotiue 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
r-< v» i u ui m i c ACiQf r^r uenyiaianinej 
G=Glycine, H=Histidine, Msoleudne, 
K=Lysine, LHLeucine, M=Metb tonlne, . 
NsAsparagine, P= Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan t Y=Tyrosine, 
A— unknown, —atop coaon,/=possloie 
nucleotide deletion, \=posslble 
nucleotide insertion 


• 


• 




« 






GLHAC*AS*FLYFHSVETGFRlitf&£GLG ™ 
LELLTSDD\RPASASQGAGITGVTPRV* 
PLuNfar oRHGIibCF FKBTRVIjSCCPSWT* 
TPGLK* SSCLNS * E/.C*GYRHTPAHPAQ 
DMVFIFITY/CPFFVFPLL/VYIPRFIiF 
LRWSFTLIVQAGVQWRNLGSLQPPPG\H 


9701 


23602 


A 


9772 


3 


363 


HELRSSANEHVFAS FIGPTI LGLPGVLL 
IILYPPLLIRTSIYLISN\RL\VTTQQ* 
LIKLTSKLMITIHNSIGRS*SLILGSLI 
1 1 IATTNLLGLLP YS FTPTTQLS INLAM 
AIPL*AGAV 


9702 


23603 - 


A 


9773 


38 


385 


YLI ILDSPEKGLICGCLCNCMHSSLLDH 
LNLT*SS*MCLYANIHIYVAISCIYFI* ' 
F*S I / YI FI I * YFHYSIAYVYI Y/I YI Y 
MYTHAHIHMCI 


9703 


23604 


A 


9774 

■ 


13 

■ 


1350 


DRVSLLLLRLECNGS ISAHRNLGIiLGSS 
DYPASAS QVAGT IGVHHHTRPTFALFLV 
ETGFHHVGQAGLELPTSGDPPPL\ASQS 
AGITGVSHCTWPHLSTTGKILSSGTICP 
GI WGQRWWAFL I L \ S LP WL * LWHPCLS S 
IMCLSLFFFLCQ/RCKPLISD 


9704 


23605 


A 


9775 


376 


1 


KYFELMYIPVIAICLFNSBVFLNK*QFF 
NLTSSSGIQNLNRLIISNEVBSIIKSLP 
TvKNIXSPDGIAATF \ TYKEQIjTL I LKLF 
QKHEEARIIiSNLTSETSITIjILRKQQPK 
KVPNEHTVKILV 


Anne 

9705 


23606 


A 


9776 


192 


2 


NVx QP PP VFL YP PGVAGGV* PF * S PPVF 
RVK/LGF*GSLFFFFFFFETKSRSVTRL 
ECSGVILA 


9706 


23607 


TV 


9777 


77 


350 


WJjKKFGSRNSGKXfTIiYIjNVYYIiQTSr 
FFFKIGV* LSPTLECRGPIWVN*NLCFP 
G*RDFPTLTS*VL\GPRVILEFGFLEKT 
GFSHVPQL 


9707 


23608 


A 


9778 


133 


325 


PRWVR* PYIjGDCTPRVSEMIjGLiQGVVPS 

sspffvcfet*scsvaqagvq/wwcnt < g 
slqplplg 


yvuo 

* 


» 


TV 
A 


9779 


143 


362 


VFEMESLSVS\RLECSGAISAHCNLCTP 
AWTTERVDAAANSRSMN 


9709 


23610 


A 


9780 


260 


1 


QKQNQEKKRKKI FRNI KHQRDYNEQ* YG | 

/ vi i/ t rNXTPPrMmKrcrp^ Lui'Lu/m* /mitt itadd 

IGMLNRPVTNKVL*LVIKIPLTKKSPEP 
DGFV 


y/iu 


2J6J 1 


TV 


0*701 

9781 


230 


1 


d o Do IV X Lr £> o V V JLt VACKF F F FJj ETGSGS V 
TQAEVQWHDHSSLQPPPPQPPK*/PGTT 
GA*STIAS *NAEITGVSHHAR 


9711 


23612 


A 


9782 


20 


322 


SQHFGNPKGNPKI FRNLEINFFRLRNAD 

TERIFLNLI KAS * KKPTANI I LNG * RLN 
I \ FPKTGETKHIC 


9712 


23613 


A 


9783 


158 


379 


LVKI I CNS FQYIS F F FFFFLETKFCFFP 
QVEVQGGNFG*LNLPLPGLNHFS\ASAS 
REPEIPGPPHHPG* ILVF 


9713 


23614 


A 


9785 


54 


312 

* 


KVDKS TKIGRNQS KMAKNS KNQNAS S PP 
KDHNSSPARKQNWTENELDELTEVGFRR 
WVITNSS KLKEHV* PN/GKETKNLEKRL 
DQ 
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SEQ1D 
NO: of 

niirlpnf irtp 

II UtICVUUv 

sequence 


SEQID 
NO: of 
nentirie 

sequence 


M 

efli 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 

amino acid 

tl III 111 V dlflll 

residue of 

peptide 

sequence 


Predict- 
ed end 
nncle- 

■ 1 Umv 

otide 

location 

correspon 

{finer to 

VI 111 EL IV 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, D==Aspartic Acid, 
E=Glutamic Acid. F=PhenvlaIanine. 
0=Giycine, H=Histidine, I=Isoleucine t 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, • 
0=("»lutflminerR— Arpinine. S=Scrine. 

\£ MIUIhIJIIIIm Am* Cllllllft^ ^ Vvl lUVj 

^Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9714 


23615 


A 


9786 


13 


613 


PGAGWARQHHGV*DQPGQHG\KRPSLLK 
IBKLAGHGGRCLLSPAIGGGLKQGESLE 
TGGRDCSEPRIX!HC\ MPAWGNFBNKTTjSO 

/QNKTR 


9715 


23616 


A 


9787 


794 


164 


FS FLFCFFFLRWS LTLSPSLECS CVTS P 
HCNLQLPGLSHS PATVS * VAGTIGTCHY 
TKLS*FF\FDDSL*APFLGWPTBTYYP 

U f c *d o XSA3U ur Jtio vjy nu^un c tr ai o luuvxi 

SC*VLPHLRS/STSLGQP/RIW*IiSKVB 
V*RGFTFSPTQNTPTNNNCSIAPGRVD* 
SLLID*LQWCDHSSIiQPQTPVLK\HPPT 
LASQSAGITGISHHTRPD 


9716 


23617 

i 


TV . 

A 


9/55 


256 


380 


VTODrtitnWT DTDr 1 AT.CTJT7 7\\7T'T?TV7 T T'D *TT 
X Xc KyAM^laJjlr k UUJF KriAVv»ISrJ If w V 

FAC/PKNVLFYVHP*LFG* I * I YQLKIT 
F*NYEGMVT/FVFRFLFETRSHIiSPRIiE 
CSGTITTHCSLELVGSTNSSISAS*VDG 


9717 


23618 


A 


9789 


22 


226 


TKINSKGNKELNVRAKTI KLLEENIDRN 
LCDLGLGNCFLD*YCIL*TKINSKGNKE 

LDMTPNAQTTKGR\tDKLDFIKI*NFWL 
DT 


9718 


23619 


A 


9790 


288 


2 


QVWQE * AETES AIHCWWERKAIQLL* KM 
RWQFLKMTNV\EIiPYGSAI PLVGIHGRE 
luLa"oi Tixpir lArur vxuA/ a in xjc x\.vi 
PSPYEQITKMHSC 


9719 


23620 


A 


9791 


142 


361 


PPGEGKRVPFTTEPSPRIPS KAKS VSQG 
DTCTPMFTAALFT IARMWNRSKCPSFEE 


9720 


23621 


A 


9792 


305 

• 


346 


TYEYTDYGGLIL* *LITFYDHAVLIIFIi 
ICFLFL/YALFLTLTTELTKTTGGDAQE 
IETV*TILPGIILGLIALPSLRRLYITD 

D APDAfi I/T T KS I GHOWY*TYE YTD YGCL 

ILNS 


9721 


23622 


A 


9793 


2 


317 


SRJDRPRVRDR*LFSTNHKDIGTRYLLFF 

A*AfiVT.(TPATJ3TiT»TPAl?T/^PfHOT 1 TfilTO 

HI YYVI VTAIiAFVI I FFIDLP ILI \SSF 
G\SVYVLLILGVPCTAVFLHSS 


9722 


23623 


A 


9794 


384 


1182 


RIGKI KG/ LCFLFV* WFLKFFFKMEFLL 
PRLECNGKI \HCNLLLMG3SNSPTSASQ 
VAGT TGMC * F\ VFL T EMRFFHVGOAGLK 
LLTLGD/PPRPPKVLGI 


9723 


23624 


A 


9795 


205 


1 


GI^IKCTMRYHFAPTKMAITYFFLKKGK 
G\NNKCW*GS /GTEIGILIHCWWECGMV 
QL LWKTADAWADAW 


9724 

■ 


23625 


A 


9796 

■ 


307 

• 


348 


QSARL* EAVCPFSDLQLRTGRTTALFKA 
VRnfiWT.<! T /IPTjTjTjS F\ Vr*T iYPAPPGGAY 

RGRQASLSCGGLHPVRASRLLCLPKQAW 
AM VGAPTPASLPPCSS ISHCCASJSTQRDS 
VGYDPSSP 


9725 


23626 


A 


9797 


193 


381 


ILLI*IHILLSMISSPFFFETGSHSS/V 
SRLECSGTASAHCSLDliPGSGGSPTSAF 
*VAGTTGA 


9726 


23627 


A 


9799 


153 


359 


FGTYMCFADSYQLSYLFWGTFSWLEEE . 
GV*WCICGSIiQPQPPRVK* /FLPSLSLL 
SSWDYRCAPSC 


9727 


23628 


A 


9800 


2 


396 


ARAARAARELEEL I KI FFFFFFFWGQIL 
ALMPKGGGQGGIIiTYPNPPLPG *NNFPG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

A * » 

peptide 
sequence 


M 
eth 

* 

od 

• 


SEQBD 
NO: in 
USSN 
69/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid - 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
-ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=Phenylalanine, 
G=G)yclne t H=Histidine, I=Isoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyroslne, 
X=Unknown, *=Stopcodon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 








■ 






ltpprtginglgppgrvnfgifkkkggf 
ppgargv*np6prgtpppgfpkqr\dica 

PHPGPL* TFEKVPRPPNFPQ 


9728 


23629 


A 


9801 

* 


256 


347 


LY I LLLNSLK* LVEKLR*NWLASLI IR 
YKVTTKRF* * FSKPAQSEELQLLTSAPG 
LNF*RYIISFFFWIQSFALVA*VGVQWC 
DQGSPQPLPPGFKRFS/CRSWDYRHBPP 
R 


9729 


23630 

i 


A 

■ 

* 


9802 

• 


109 


401 


HLRRPCPTPPFALRTSGDQPLSPEARPD 
SGIiLHSPLTSPRGLNGQCPPRGSTLNFN 
P *AS I PASPFPSGQPQASQEPPPVASEH 
PPDT+GDPAQPPP\RPQDFWOPATVPRS 
QAGQWPSPLPSDFSKGAQWFVPPTR/PP 
PSTSTQSRWT 


9730 


23631 


A 


9803 


371 


2 


PGFPL FS FPEGNGP S KRQTGD * IRCIiF* 
DGKVWECSPPKKXKKTKKRKKAVIF/CV 
PVQTKCI WEGGEETLVGDV* V* P* \GS 
FKHVVAMFPEK/DCLCTLYEASFKTKES 
RRVDGFVCVRVGT 


9731 


23632 


A 


9804 

• 


188 


1 


FLFFFFS ET* SQS VTHAGVQGLEQS LPP 
G\SGNSRASASQVAGITSRRKHSWI I FV 
FFVETG 


9732 


23633 


A 


9805 


112 


383 


VFINIRVFRLFTFSFFFKRGFNIjSPRVK 
CNG* TKGHCI PDLPGVR* S FPFSLLTNW 
DYRCAPPRPNKFFF/SFETGSHSVTQA 


9733 


23634 


A 

• 


9806 


599 


237 


FRDRS IiTLIiP RLHCHCVITAHCS LKLLG 
SSELPQASPLSSWDYRHSPPCLANFLFF 
VETRSHY/ASRNSLGSSNPT*A/FPKCW 
DWQV* ATAPS LAYWFS EAE KLI LRLLAL 
K VS DENI CTN F 


9734 


23635 


A 


9807 


168 


2 


PQQDVFFFFLETGSYSVTKAEVQWCDHG 
SVQL* PPG\STDPPTSASQEATGTHHRA 


.9735 


23636 


A 


9808 


225 


3 


GPLGYI*PPYFPQPLISRG*NPSDELKIS 
RVFYGT/HYKFEFLFLFFFETESRSYA\ 
RLECSGVISAHCKLCLPGS 


9736 


23637 




9809 


541 . 


3 - 


RQLTGINSGRRQFQPQMSNYVI PYMMDM 
I ILKTKKSNNKYWQGCEKTELLIHCLWE 
CKMVQLVWKTV* QFLNRSDIEFPHDPGI 
PPL/GYKRKKMKTCLPKKLCVLVEP/RM 
FTAALFE YAK * * KQ/PQTP ITR* INTMW 
RIYTMEYYSAIPRKTPLIHPIA*LDTES 
ri*TEKIQSQETTYY 


9737 


23638 


A 

k 


9810 


213 


1 


FLCVDF VPCNFAELKLALVGL FVCLLFG 
SVTQAGVQWCNL\GSL*PPPPGI*SDLPT 
SAF * VAGTTGVSHH 


9738 


23639 


A 


9811 


54 


388 


PARPLPRQ / WDER PNQQPTKKKKKRGRN 
LPTKKKTQNYSHQNPVAQRNPGGKTNQK 
KPKTNPKKKKGGAL * KKTQRGPKQTGGR 
K* KIS PNKGGK*NP CGKLLEKNP FSGGE 
K 


9739 


23640 


A 


9812 


429 


24 


RS KGAE F P I CPCRS S KFS VLQQRQVR I V 
FQHPBLGESVAR/CRYSQLLGRLRQ*SH 
LNWGGRGSSEPRS CHCI PAWATRASS I F 
CNFQAS S VEVRRS ARKKLFS D I LKRHNT 
INWRVSGLLLVDSYFGRIATPVRTQ 


9740 


23641 


A 


9813 


149 


350 


CHFPCFHTTL/CE* EPGEDVTGPIGTKF 
FPAPGRTIX5HTHTHTHTHTHTHTHSDPH 
HLCVWNTLIHFV 
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SEQ ID 
NO: of 
nucleotide 
sequence 

> 


SEQID 
NO: of 
peptide 
sequence 


•M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

• ■ *m *•* w **r *•» »> « *«f 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine t I=lsoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R»Arginine> S^Serine, 
T=Threonine. V=Valine* 
W=Tryptophan, Y=Tyrosine, . 
X=Unknown, *=$top codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


074 1 
y /*rl 




A 

* • 


70Jt 


66 


401 


PENKKT VYRKP I ATI I FNCERLNALPLK 
LETRP * YIiL Y * LFNIMLEVLAS WRQEK 
EINLSIFT*DIVIK/MKIPKNLENKLLK 
LLSPFSKITGYKISIQKLTSSFLYYGFT 
V 


9742 


23643 


A 


9815 


82 


1 


APPPARFFFFFFF*EKFSSFPPGEVQRG 
VFCPCNT/CFP*G*KIFFL\KPIiKKVGF 
WPPPPGPKNFFYF*KKGGFPFWARGFFI 
PTPFVPPLHPFKKWGVKRG*APPPARFF 
FFF FFL* GRVSLCHPGWNA 


9743 


23644 

i 


A 


9816 


256 


2 


QHFP*LPIPHIPIQT*QHFPQTRSPFQG 
PPLCQKPFSSPWGSKWFILLSFFFFF* 
DRVSLCHPGWNAVM* SQLTAASNTW \ VK 
*SSHLSLLSS *DYRCVLS 


9744 


23645 


A 


9817 


53 


411 


TI YCSNYVYNFVSQHFRSLYS * KNCGLC 
L* FHTj F VTjOS * NFMOFHM * FCLKI * VDP 

U £7 ALU* V JJ^fhJ XV & 11^1) All! & UUI\4 Viz* 

I ITFEV* FTYI * FF*TYSS VSFTDAYNF 
VKPHR/ VYRGPGAVAHI CNPS TLGGQGG 
WIT*GOEFK 

f l«b A ft-# ft\ » » 


9745 


23646 


A 


9818 


100 


417 


DEHSEKQDEPLQRPRSKLLPFPGTPPSS 
LLLLNHIGQESGLTHVRLASCLRKRGFC 
FDMKERI1CLLPMLE6RGMISDHCSLI/R 
PG*DKPPASAYQVGETTGTYHRA 

Mm MmW ^» ft> mm * ft ft> *^» *» ft" ft> ^fc **** ™P ft ftftAftft 41 


9746 


23647 


A 


9819 


136 


2 


I YMGMQPGAVAHACNPS TLGGQGG/ WWI 
T*GQE FKTRLANMVKPC 


9747 


23648 


A 


9820 


257 


426 


I VPGLFLGTGS CS FTHAGVQ* SNCSSVQ 
S*TPGP \ SDP PASASRVAGTTGAHHKAW 
L 


974-0 






yoZl 






ILLLLEYSIDSLC* SVLCCYKEIPEAG* 
F I KKRGL / WLGVVAHACHLS TLGGRGG W 
ITRSGN 


9749 


23650 

• 


A 


9822 

■ 


50 


513 


RGDPRVRPRVRI IKLSEEGMWKAEIGRK 
T ifJT JiHHT I SOWNS KEKFIiKEMKSATS V 
NTRMIRE*NSLIADMEKVLW*DQTSHN 
IFLVQNI XKSKGIGLFKLMRAKRDEES/ 
AEGKLDASKGRFVRLK/EKRSHLHNMKV 
ODEASVD I ETTAS YLKV 


9750 


23651 


A 


9823 


48 


406 


LI I VVYTLTTRW IGHSAVS YPLLRPPYS 
LRHNIEIRPINNPTMASKCSSERKSLTS 
LTFHLKIKMIKLSEEGVSKAKTG*KLGL 
L /R/QTVSQVVNAMKKYLKE I KSATPVN 
T*MIRKQNNL 


9751 


23652 


A 


9824 


143 


AAA 


WNKKNNKNELADMILWGCGDKTIFMRKL 
TRDBTKYGI PQLALRGHSHFVRDGVl\N 
LDGQVDLS\G*WDGTLHLWDLTSGSTTR 
RFVNK/TKDVLSADFS 


y JjI 










9 


HRSPTKETKIIAS I KKKTMITKDLNROF 

* AXVftw* C A AVftT ft> *\ft A^tift-* •*> 4VlvS\*i'l4* A i\ft*4ftVl»\yrA7 

S KQDKHLTSE YMKRCPTS / LREM* AKT I 
MRYHLTP IRRAP I KNKNK/NKSKCW * GC 
GETEILVHCW-ECKMVP 


9753 


23654 


A 


9826 


175 


400 


GGKGGKNFSLKGGE KKTNLG I FG KKP I F 
GGGTNGANPPPKIKGSKEKKNF*VFFPP 
FPFKNFFFP*NL*FLGGGWPHLSPPQK* , 
V/CFPKIPKLVFFSPPLREKFFPPLPP* 
NLGPPGF/SFWGPPLFFFFFFFF*RDKF 
S FYHPGW KAVI AAHFSLELLGS SNPPTS 


9754 


23655 


A 


9827 


64 


622 


MFSFFLFDLILNHLFFCFV*MFSFFLFD 
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seq n> 1 

NO: of 

nucleotide 

sequence 

• 


SEQ ID 
NO: of 
peptiae 

sequence 


M 
eth 

00 


SEQID 
NO: in 

lie cm 

09/515,1 
26 


Predicted 

beginning 

nucieouue 

location 

correspond 

ing to first 1 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

nil oltt 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A B AIanine 
OCysteine, D=Aspartic Acid, 
Cj — vjmu in in i l /\ciq, jr~Jrncnyia iii nine, 
G=Glycine, H=Histidine, I=Isoleuctne, 
K=Lysine, L=Leucine, M=Metbionine, 
N-Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S^Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y^yrosine, 
a— unknown, *=otop codon, / =r possiDie 
nucleotide deletion, V=possible 
nucleotide insertion 


■ 






• 






. LHjNHLFFCFVIRQQFSFLFF/ ISFPFS 
rx JtiCWVN I cauCurSVCinSDNSrRclia, 
LF/NF * HMTLC I SE LR* TLKQS CYS YTK 
IC**FYSVQLSFLTILY**CKS*QALSI 
ALQNEFLS V\ QYHKNMNYI TLTI EPS RN 
KFLSVTFLLFSIFSHNFGISASISIFIL 
H V x 1 K£iN tf ibrr K. 


9755 


23656 


A 


9828 


298 


1 


MRNCLTDDERKWFP EMASTPGEDAVNTV 
EMTAKDLE/Y* * YINLVDKAASDFEMID 
SNFERCS TV/ NKMLSNSTACYRENFHER 
★GQSVQQTFVRVGERV 


9756 


23657 , 


A 


9829 


361 


1 . 


FLTG TQ WG K/ DS P FNTWC* DH/ WNIHRN 
LDLYLI P YIKINLKQLTGPNLRAKTIKL 
PEQNIGENIiCDL* LSRERYSTKSITRIR 
KL / DTLG F I KI KNKC IS KDT IRKR* ATD 
WEKI FANHVLKG 


9757 


23658 


A 


9830 


402 


2 


RFHHRFS F VLFGP FAKVAFP IGAKNF I S 
NHYWAGVFPFWEKKNKIDSFLPPFSKNP 
ILGELKA* F* NFLF * GFQ IFF* S LFF CP 
fggk* ipgvfsffffflvffeig3hsvp 
\RLKCSGTITAHCSLDUjGSSN 


9758 


23659 


A 


9831 


2749 


3215 


FCQ*IKMYVMCICIYNFRRVYLCIYTYIi 
HTHT FTHTHKHTPTQ I PEKDS / QCSLSD 
LKGHSL 


9759 


23660 


A 


9832 


3 


386 


KLRLGQLNTSAMAAKC* SBRKS / R/ THL 
TLNQKLEMIKLSEEGMLKAVIGQKLGLL 
YQ/TSQWNAKEELIiKDIKSATPVTA*M 
IGKQNSLIVDLEKV*VV*IBDQ/TSHNI 
SLSQSLIQN 


9760 


23661 

• 


A 


9833 


1 


370 


RRCRWP PDPSRTVGRQIGKLVTHRPTVF 
QERGCPFPLTRQAGSHHGGGAFAQVISP 
TKS I S PCUKbGSRJj* bQHr G-RPMQVDHJj 
R/LGVQDQ/RWPICGQYGETPA\LLKIiP 


9761 


23662 


A 


9834 


125 


409 


GQENRETWKIVHLV*VLYT*HIKRIjNCF 
SHFI * CYQPTASQAHVHHDSNDSSTHV\ 
N*NSRWPGTVAHACNPSILEG*GGWIT* 


9762 


23663 


A 


9835 


223 


1 


PKPQKMGTF+PPMEGS*MGPPHWEKMGP 
FUCKKEGHPFFFFFFEMESRSVT\RLEC 
SGA/TISAHCNLCLPGSSNS 


9763 


23664 


A 


9836 


377 


515 


F ILFLRQVLTLLPRLE YSGAI VAHCSL \ 
AF*G*SDPPASASRV 


9764 


23665 


A 


9837 


274 


1 


AGEWHDQ I C VFKI PIjMKNGIjHGARVGQE 
SWKERNQQV*DRCQKQNLSDS* IQT*R/ 
M/WPGTVAHAYN/PSTi£GQGEWlT*GQ 
EFETSLANMVK 


9765 


23666 


A 


9838 


553 


114 


GAGVKTHPGHKGKTLFFFKFFFQKLPGV 

QSSPPS FPG/ WGKKGGGLPFQKKKKKEK 
RKKGKKNGVAPSB PPLKYSN/ WQATWG* 
RKLLNDTMRVQPTKSRMQKILQDKGPVN 
GIFTKTGRL 


9766 


23667 


A 


9839 


36 


434 


LPFPQCVTEFI XVLI S WCYIREDACKNL 
KHVAITI INVCI YI CVCVCPSTLYVYMY 
MLP /HLS DTLLTIiN* VTLVEMLS S FQCL 
MFWYHGHI *K*HELDWAHACNPSTMGG 
QGGWIT* CQEYKTSLANMVKPR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptiae 
sequence 

• 


M 
eth 

OQ 


SEQID 
NO: in 

09/515,1 . 
26 


Predicted 

beginning 

nucieouoe 

location 

correspond 

ing to first 

nnnno aciu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding 10 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Add, 
jii— v»iumiTiic acio, r — r nenyiaianine, 
G=Glycine, BMHistidine, I=Isoleuctne, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, ^Proline, 

\£~~vyiuiaiiiiiic, xv — ATginioc, o^ocrincy 

T=Threonine, V=Va!ine, 
W«Tryptophan,Y=Tyroslne, 
X=Unknown, *=5>top codon, /^possible 
nucleotide deletion, ^possible 
nucieonoe insertion 


9767 


23668 


A 


9840 


268 


1 


RFPPFFCFLTILFWAPQWFSNFLGVRA 
PQPP*IFFPI*KKPSPLIFFFFFFEKES 
RS VS \ RLE YSGT I SAHCKLCVSGSQHS P 
ASAS 


9768 


23669 


A 


no A 1 

9841 


463 


86 


CFDDKVYGLDYGNTS QVFT * NSKL I KL Y 
TL WYS FLCINQTTI KPSEM I QATKAN I 
DK* NYMKLENFCTAKETVNKMKRQLKKW . 
EKVFSIHIPQEVNI 


9769 


23670 

K 


A 


#%n a\ n 

9842 

i 


37 


A f\f\ 

400 


VHSP P FKFWSLYVS VLKFP FCSLYFKFL 
ALYS IFLQMOQHLLV*AFFF* KRGLNSV 

T\RLBCRGAISPNCNLCL/SGSSDNWAS 
LPRKAEAGNSFT/CR* SQVAGNPI PS IS 
TRIjNQRKTP FES 


9770 


23671 


A 


9843 


42 


407 


NCLS FLITFFFFFLFENKVSFCPQG*G/ 
RGGPFWPHGTLPPRG*GNNPP*LPGRGE 
* RGAP P PPG YFWLFGKKRGFPIi \ GQGGS 
KPPALKEPPPLGPPKGG\NYKRNPPPPP 
GNFF * LP PYQW 


9771 


23672 


A 


9844 


113 


379 


IjGP WATSLRGRFLGYQLQQS KKH*NYP 
TVR*GCGEMRTSVHCW* ECKTA* PL*KT 
VWQFLKKLNTGLPFDAETPL / IiAETPKE 
LKA 


9772 


23673 


A 


9845 


284 


528 


PLPKTME IMLDKKQ I QTI FLFKFKMGHK 
I AETTRNI KNAFGPGTANE \ KCTAQWWF 
K_K± CKA*J5 \oLEDKEA\ *GxiPS£jV£JjTiJ 


9773 


23674 


A 


9846 


888 


1295 


LEQGCNLFLHKHS * LGE VFF FVCFRRS F 
TLYAQAGVKWRDLGS L /HKLPSS LSLP S 
S WD YRP PLP RLA/ NF FVFLVEMGF \ TVL 
ARMVS IS *PRDPPTSASQSAGI IGVSHR 
AG PVAG I LMFCRWNT SNKPKGAVFKKK 


9774 


23675 


A 


9847 


88 


387 


AYRMKI IDRXSLSLYTAALWVTLFLFLF 
LFFFFKTEFCFAPQAGGQ/WGQFKLMDP 
KPPPPG* K\DFLVSHPRDLGIKGAPPQC 
GQNFVF YKE KGGWLLTAT 


9775 


23676 


A 


9848 


298 


438 


KX PRGAPNFPGGGKX FS PPXjGGPXK.TFK. 
GALEKKTLF*VRGPWPGFVKKESPRPKK 
LG*QIFWPPGKKPNPPMGG/SLDPPPPL 
I LSRPDPP* KKKKKKKKKKKGGGPLKNP 
PGGPKFPRGGKNFFPPFROAYKNTPGGS 
+EKNPFLGGGKKRKQP 


9776 


23677 


A 


no Af\ 

9849 


251 


498 


AX X KKMhis * KH13JS lis 1 Xi 1 H v_ 1 War 
KMV*PFWKTV*QFLK/D/LNMGLPFDSA 
FQLPD I CLGSLKTYVHTENCTQMFMAAL 
F 


9777 


23678 


A 


9850 


478 


37 


ESRNKHSHLWSINFQKGLR*LNRGKQPL 
KK* CRHNSISTCKKMKIjDPYIjTPYXKX \ 
KSKOTKDLNVRAKTIKSLEENIEVNLHD 
LG * GNDFLDMTPKA.OTTK*NID*IiDI IK 
I *NFCGSKDTINKVRRQSTE*EKI FANH 
YADKLK 


9778 


23679 


A 


9851 


2 


378 


RLEGLFLCALFCSIAICMFFFFFFFF*K 
KRGPFGFQGGDPGGNFRQLDPLPPGIKQ 
FWGP \ P P RKAGKTGGG P P PGANLGFWGK 
KRVPHGGQKGFKP /RNPRGS PRPGP PKG 
GVAFVFRPKPPGLEQ 


9779 


23680 


A 


9852 


229 


3 


FGPYKI FFKKKGAPCPLENPP VF\HQKL 
GLGFFFFSFFFF*DGVSLFLLRLKCSAA 
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SEQID 1 
NO: of 
nucleotide 
sequence 1 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

■ 

• 


SEQID 
NO: in . 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

seauence 


Amino acid sequence (A=Alanine . 
OCysteine, D^Aspartic Acid, 
E^GIutamic Acid. F s Phenvlalanine. 
G=Glycine, H=Histid ine, I=lso leucine* 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
-Q=GIutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cod on, /-possible 
nucleotide deletion, ^possible 
nucleotide insertion 














ILAHYKFLLPG* SDSAASAS 


9780 


23681 


A 


9853 


108 


420' 


GPRVCVRLSLRDDSDLLATEDSLRHTEG 
DPVTAS SHS QAP SLS VHPOAALBGVCWG 
TGAIAGTPEPOKPPOLTPGGRPA\PlAP 
GLTWEPPSHPPPQ*RSPPPPP 


9781 


23682 

• 


A 


9854 


2 


256 


FFIFPLRYYIYCARFQFLSPILYLGK*K 
MD* *RRVFOEKWEOAYFFVEV\SPMCLI - 
CNQTLS VSKE YN* R CHYETNHGBNFD * F 
TEKMHDENLLHLTTRQP 


9782 


23683 

i 


A 


9855 


323 


3 


IIQMANKHRKRH\QSQ* * VRVMQKKI IM 
IHRYIPQWLKTSKSDMIKCCSGYGTTEI 
FTKYTiWRnKTiTiiniTjWPMVWTjV TT.T.lfPTTT 

DTPHDPAIPLLGTYPAECMHMYPRA 


9783 


23684 


A 


9856 


28 


385 


DGVTQAGTQ WGS KFTAA* TA* YQGS I PN 
AWSFFSPPFVLKTPPPPQKK/YPPPKKK 

ICTPPPKlfTC 
err iviyxn. 


9784 


23685 


A 


9857 


436 


518 


AANRLENIYRHLIYDKD/EHYRIVGKDD 
LFNKWCWVNWI/ASNKEKNLDSYLIPHT 
K/LNSR* I IQLDVKD* 1 1 KLLEDNLYLH 
DLGDRQKFLGRI / PVFTIKKKIGKFDPM 
LKLSTFVHQKTPC*RLNSA 


m 




a 






Aon 


AVSQAESSSSSAGVSLVLFRLIADWMRP 
IPIKEENKL\SQSTDLNFNFIQKHCHRN 
TQNSV* PGGVGPPGTCGPFRRQSSP 




ZJOO/ 


n 




z 


41V 


TTnirr/^vgHiyTfQ f rv , QTJT7QTf'g'pn i H''i"i'KrTrn 
± j. vj jxxj\i van j\xxo x x orxf oaajt iiu jlinjcvu 

MKRYSPSL/AIREMQSKTTG\MRYHFTP 

KYG\NNKCWLGCGETETIiI YGW * ECKMV 

QPLWKAVWHFLK*IiNIES/PIY**NYS+ 

YIS YS WRKTC I TTYDPVTTiTiTiG IYPREV 

KTH 


9787 


23688 


A 


9860 


88 


419 


TFFFFFGFWFLKTKPLFVPQLFEGPGPI 
LG* WNPPPRD* KNFS/GPNPPGGGE*RA 
QPPPPGYFLFFKKK\GVSPWGGGGSKPP 
TPGNPPPGPPKIRVIRGGPPPPGGRKHF 


9788 


23689 


A 


9861 


301 


401 


KRA*GGQEPIKVCPFFYNRKVGNGSPPK 

WVGKVGEP IT/DP FKEFGGR\LPEKDGP 
LV 

* 


Q7RQ 
y soy 




A 

** 




990 




KIMGGAO I FRGGGG F FF FLEGWE KNNFG 
VSFRRKFFPGGGVFLPPPP*HKKNYFSS 
QRQYI SLGGGGRKTPPPKKNFIjLKDTPK 
LFFSHPS KKKKNPP PPRKI WAPPMI F * I 
PPPPIFFFFFFFFFFFFFFFFFKN*P/S 
FFFNFKKPIFKTFLSPPFKVFPPPKKKK 
KKNP P I S YRRWPLAX 


9790 


23691 


A 


9863 


116 


366 


GQE FETS PAMFCFETESCS I A* awqwh 
DLSSLQPLPPGXSGDSPASASRAAGEIiL 
EPRIRRLQSVEITPLHSSIiGNRVRLHL 


9791 


23692 


A 


9864 


619 


295 


FFFEMEFSLLLPRLECNGAISAHRNLRL 
PGSSDSPASA^LL*SQVMITRLRHHD 
♦LILY/ FLVEMRFHRVG * AGLE LLTSGD 
P PS SAS QGAG I TGMSHSAGHYGKI F 


9792 

■ 


23693 


A 


9865 


12 


432 

• 


I ADRRLF FTNHKD I GTLYLLFGA*AGVIi 
STALSLLIRAEIiGQPGNLLGNDHIYNGI 
GTAHAFG I ILVI VI P II IGGFGN*LVTL 
IIGAPDMAFPRIIWrSEX3LI*i?TSLrJXL 
TS AI LEARAGTC * TVYPCLA\GGYSHLG 
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SEQID 
NO: of 
nucleotide 

seauence 


SEQCD 
NO: of 
nentide 
seauence 

■ 


M 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
correspond 
ing to first 
^amino acid 
residue of 
□entitle 

UvUIIUV 

sequence 


Predict- 
ed end 

nude— 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

neotide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=PhenvIatanfne. 
G=Glycine, H=Histidioe, I=lso leu cine. 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
\V=TrvntODhan. Y=Tvrosfne. 
X=Unknown, *=£top codon, /^possible 
nucleotide deletion. \=DOssible 
nucleotide insertion 














A 


9793 

• 

• 


23694 


A 


9866 


30 


380 


LPSTNRRGIGPLYLLFGA* SGVLGPALT 
LLI *AELGQPGYIjLGDDHTYNDIDTAHA 
FVI ILPI VI P 1 1 IGGFGD* LVPLI IGAP 
DMAFPRIN /NI S p* LT. PAS LLLUASG I 
REGRSK 


9794 


23695 


A 


9867 


256 


395 


EVLRKSEKSFETVCIINNKC/WPGMVAH 
ACNPMTLGGRGGRIT*GQEF 


9795 

* 


23696 


A 


9868 


458 


2 

■ 


PKKRFFSQKPPRGFYSAPLKGKNYIFPP 

TTtTNTiYlDDirn'pWvnDDT T?l?Trii , T7I?T?l71717T7'D 
tr vri r Kaffr^xjC 1? iSJLafirJUr x? r J? see V C CP if 

RFFFFFLEKG*KF*NFFFT*NMVFFYIS 
/CSKKFPFQFVLTPIPF*KVPKPVWFFW 
DFP I FFKKI F FFFFFCDRVS LCHPGWS A 

V A P P T .TZVfi P P V 
VAR 3 X\XJ X flO XT Xt V 


9796 


23697 


A 


9869 


144 


425 


IPLCSRIYSFGPC*FSLISSSSCLSPFI* 
SIPLCVLS**T* *YLFFYISMREIYKTR 
TKGIYPGAQRNSLYTH\FSIEVQSTIKA 

EFE 


9797 


23698 


A 


9870 


138 


401 


DLRLKLNLS KN 1 1 RLQGTVES S VLT I KE 
MQ I KAKVR YNL I P VKI KR / SGNDRWW *G 
CGERGTLIRC* *ECifLVQPLWKTVWSFL 

IVIVXVJVJV 


9798 


23699 


A 


9871 


179 


441 


PSGKGGIKGPPPPPKKIFGNFFFFLKKK 
GVPPGPPGGPKPRPLETPPPNPPKGGEY 

ROfl P \ WPPP P T P * PFJC OP FITTf K flP PWTM 

RGNP 


9799 


23700 


A 


9872 


57 


435 


ftqmrknlknnsgnmkk*gs itplkdhs 
nfpavdpnqnemfeipgkefkk/sdy*v 
tqgdtrerckpt * vlkti ldmdekcs ke 

MD T T .NITTWnQH T.T .F TTKTYTFP FT .niTOMP <2 P 

NNRLDQVEERITELE 


9800 


23701 


A 


9873 


228 


443 


FSSKFILISVFIjFLFLETGSCSVAQAGV 

ql*dysslqpqspg\sgnppisasqiat 
tagaclhall i fvffv 


yoUI 












LKGFNFFRGFFPNFPPPKKKFFSKNSQL 
VFFLPPLKKKIF/CFPTPVKFGP/SQRF 
FLKGPPLFFFFFFFFLDRVWLCYPGWSA 
VARSRIS 


9802 


23703 


A 


9875 


405 


2 


KKASRI^ElWRKKKKLEMIKl^EEGMiX^ 

IPLNTQMIRMQNSLIADMENII,/ WWI / 
EDPTNYNIPLSQSLIQSNALT\FNSMKT 
ERGEEAA*EKFDAIS3* FMTFKERS 


9803 


23704 


A 


9876 


355 


394 


THP YYSHQEYQS P * P/LTGALSALLKTS 

OT.ZiM * WPH^ T TT J ■ T r /5TiTiTMTT .TT Yrt * 
v?JLini*i uiir iwi x LtXMXi^ASixLtxn x ux x ly 

WRDVTR* SAYQGHHTPPVQKGLRYGI IL 
FITSEVFFFAGLF*AFYHSSLAPTPQLG 
GHWPPTGITPT^PLEDPLLNTSVLLASG 
VSIT 


9804 


23705 


A 


9877 


47 


85 

• 


TIYIHICVRIYACECVCVCAYIHICGVP 
AHLTEQNL\KPARTQSPPAPQKNPELPV 
DLVS I SLDF^QDVELNHYFJjGNTCRTD 
VLPKVWTLCLRL/NFFPCLPILEELQEL 
P*LENI PLFPQPIDYIYTYMRAYICV 


9805 


23706 


A 


9878 


2 


389 


GRGQF P P EFF Y F F WGGS LVP S TPNKKKP 
LGE VLGGG WSNRAP KGWGGGFPP PILAF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 

W W 

sequence 


M 

etb 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

* 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 

» m ~ 

G=Glycine, H=Histidine, I=Isoleucine, 
. K=JLysine, L=Leuctne, M=Methiooine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R»Arginine, S»Serine, 
T=Threonine, V=Valine^ 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


■ 






• 


■ 




. KPKGPFGPKFVLKFSNVPPLLFFPKMRO 
PLPKGGGFLP^FFFFETKSRSVTQPGVQ 
WCNFGSL*PPP\TGSSNS 


9806 


23707 


A 


9879 

• 


351 


1 


FLGPSKNFNPFPPQKWPFAFFPKSHRAR 
VNE I FS /RLSVINF*GLTKPQRLFFSLL 
NFSS\PIPPFQFPSRLIKBIRFSTFLTP 
ENSKI KKPLFFF* FFFF*DRVTLCCPGW 
SAWQSR 


9807 


23708 

i 


A 

• 


9880 


123 


387 

* 


SFLWKLRPSGAPTCMRCLLA\LLGGVSQ 
LGYTGVRDPLEEAVCP FSELQRHAGKTT 
ALFRAVRQGCFFFAYF* IELFPCY /CSG 
RVGG 


9808 

j uuu 


23709 

» 


A 


9881 


341 


659 


S FLGR VQ WLHAC KS QH FGRLRQ VDHIiRS 
GV*DLPGQHDETLSLLKMPK/QKLARHG 
GMCL* SQLLRRLRQENHLNLGGRGCNEL 
RSQHCTPAWATEQDSISKTKTKK 




23710 


A 

mm 


Q8R2 




1 
t 


PKFFWVLRDSQI FKPGFGPQGFF /GGFG 
NPQGKKKPGFWGLG* KKKGFFFFFFFFF 
LRDGISFCCPGWSIQW 


9810 


23711 


A 


9883 

* 


2 


244 


GRVGAVGRREGENFIKCDELPPPPPEGK 
QQNLCKFLTGFS LPCQDKKP \ CFFV* KK 
KKKKKKXKKKKKKKKKKKEKGGGGA 


9811 


23712 


A 


9884 


310 


2 


I FLQTLFPNTFENSLSE YCI I FS E YSF I 
**KQ\TNFILSSILLYYI*PSGSYISNL 
* Y* ELFSQNLS YQHFLWI * KKWGLGAVA 
HTYNPSTLGGQGRWIAW3PG 


Q812 


23713 


A 

m m 


700J 


749 


358 


KTVWLYMNI*MANNHME*CSLPLVIREM 
*IKTPVTVH\TVGMAAI*NKK*NITR/C 
W * ACRETAVLVYCWGE FYTQPL * KTVWL 
YMNIE / 1 PCDPAI PLLGMYLKE * RPGTV 
AHTCNPR 


9813 


23714 


A 


9886 


360 


2 


LLKLG F P FWAKKG KKKRL WVEG P LG KP P 
GGFCQKLFFSPGGGPKGFSPSIIIIiPKG 
* PFVWGPFREVF* KKRFKLKKKFRPTFS 
/Y/LFFFFETESHSVA\RMECSGAILAH 
CNLCRLGSSD 


9814 


23715 


A 


9887 

J w \J 9 


236 


347 


IKGFFFF*TESHSVT\RLECTST/SAHC 
NLCLPGSSNSP 


9815 


23716 


A 


9888 


310 


3 


NFFFFFETGSCSVTQARVQWSDQ/GSLQ 
PRSPGL\SDPPTSAS*VAG»HiKFIiSGLC 
SSL* SLEIS*DTKKSQVQITDDLCYTW 
LCLFFEMESHSWQAGVQWH 


9816 


23717 


A 


9889 


119 


391 


APAYWNPHARDSPFFFF*TNFPFFPQVG 
GQWPNSGSPQAPPPG/SPPP/YSASASK 
X SGAPGAP P PPGPNFFFSFFSS PTFFSR 
DGVSPYEP 


9817 


23718 


A 


£890 


195 

. ■ 


3 

• 


DGVSLLLPRLECNGTILTHCCLSPV/SW 
DYRRLP*RL/V* FFLFLVEMGFHHVAQA 
GLELLTSGDHP 


9818 


23719 


A 


9891 


3 


378 


RDGERNMLALGTASAKALKYSLALSLPR 
LECSGTI ISHSSLNPP/GLKWSSCLSST 
SSWDFRPAPPHLAKKTKQQ/HATFF* EG 
ES PYVAQAGFKLLAS SS PP AFS LPKCWD 
YKL\DRHAW . 


9819 


23720 


A 


9892 


231 


1 

* 


PKPFFPKFFSP*N* KPFFQISPAKIQKI 
RVYQS \HKKFFSFFFFFHGVSLLXiPRLE 
CNGTISAQCNLCLSGSSDSP 


9820 


23721 


A 


9893 


2 


347 


APARQENWCVYVCVCMCVCIVVLHRHC 
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Amino acid sequence (A=Alaninc 
C= Cysteine, D-Aspartic Acid, 
E = Glutamic Acid. F=Pfaenvlalanine. 
G=Glycine, H=Histidine, I=Isoleucine, 
K = L»Ysine» L= Leucine. M=Methionine. 
N=Asparagine, P=Proline, 
Q=Glu famine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X = Unknown *=Stnn end on. /=nossible ' 
nucleotide deletion, \=possible 
nucleotide insertion 














EVLCFYKMIX3KKRSIYAY*LVY*YIYIH 
MCVYIYI / YI YV/YMYI YTYVCI YTH 


9821 


23722 


A 


9894 


28 


382 


TAHLEAGATGYRTFLLPAIGLLCLPWVL 
DDGAGOTAO * SMLLDHAMLIAHRAHELF 
I /DTYYEFEETYIPKDQKYSFLHDSQTS 
FCFSDSIATPSNMEETQQKSNLELLRIS 
LLLIESW 


9822 


23723 


A 


9895 


93 


386 


FWKI I SHYYFKYFS CS I LIS PSDLPITC 
ICMFHLLKLCQSSEFLHLLLFVWLVFET 
OSHS I KOPECS ** LSAA/CHLPGS SDPP 


9823 


23724 


A 


9896 


174 


1 


TLDWGPRKKNPFFFFFETRSRSVTSAGV 
Q*HNHGPLQPQPAG\SSDSPTSASQADA 
W 


9824 


23725 


A 


9897 


251 


386 


LQRLECSGTI/SAHCNLCLLGSSNPLAS 


9825 


23726 


A 


9898 


127 


393 


GHMGTLTGS PATLS ILALFFWGGRISTQ 
LDGINKS \FSLFLYFLRHCSLSLRLE*S 
GAI ITYCRLELLGS GDTPAS ASQETGTA 
GTCHT 


Vo2o 


23727 


n 
A 


QOQQ 

yoyy 


3 


11 A 


TLNQKLEMI KLREEGMLKAVTGQKLGLL 
YQ/TSQFVNAKEELLKDIKSATPVTA*M 

FLS 


9827 


23728 


A 


9900 


356 


1 


GRKP FSHLP KATLLPMGPVLGGRRALMG 
PDSRPGPVPSCSLVLLTPLAPLPLTARE 
SLCPCPPS*TPOPSVNP /H/GKLARHSP 
CWSGRQSLP * AB I VPLHAP PALGDRDB 
TPSQKKKK 


9828 


23729 


A 


9901 


118 


343 


I P YAKEKKKQENVLRFIHVNLCISNWFW 
FETGSHFVTOG*VHWCNLSSLOP /PNLO 

i UXwiJIll V X Vl'sJ V XX TO Wily ft / iiiliJ^ 

GSGDPPTSASRAAGATGVQH 


9829 


23730 


A 


9902 


321 


50 


SLRTRV*RPPSAPG*NPFCLKPQKIFQG 
GG\ESPLSQILKRVKQENSYNLGGKGFN 
* PKLPPCPIjTWATKKTS PPKKKKKKNYR 
TW2TIS VS 


9830 

i 


23731 


A 


9903 


317 


8 


!JCVD^GlVl)PLt,6i:t ) lPKSM\ t r*VRT 
KTCT * ML I VALC 1 1 TKKWK* S KCLSTDK 
QVNKI *YIHIMEYYSPIKGRKY*HTLQH 
/WMNLBNIS *KRLDIKKPHII 


9831 


23732 


A 


9904 


287 


3 


RGALNGRGGPRGVKS PGFAPFKLQLKPP 
GTFUJIPK\QFPFFPNPPF*KKGAFFHFF 
LPLF*SFRWTEFYSVT\KLECGGAISAH 
CKTLCLSCSS DSRA 


9832 


23733 


A 


9905 


2 


406 


PRVRTSSRSRATALFFFFFFFVFFWGKG 
EIWAPPLKNC/ IPPEKFCYPGGGAPGAG 
LPPPPGVIFFSLCRVVKKKKKKGAPPGG 
VLGKKKKTPFTPKGGTFLTLGGF* KKSL 
FGKKTLLWVGGPLLLKNFF* EKFR 


9833 


23734 


A 


9906 


1 


296 


I WVGATEC I FKNNPTI WGHQETHFTCKD 
TYRLKV/KGWKKIFHTFHTNGNQK*AGI 
AIVISDKTDFKSKTIKR*RGHY/ IMTNG 
S IQQGDILDWYKSNCSF 


9834 


23735 

4 


A 


9907 


474 


41 


FMEYLTLSFIHVLLQ*FIHCFVHLFIHS 
FFYALMKSLIQ*FIHLYSCF/ICIDLLI 
SSCIYSFIYALVKSLANSLTHSFIHTFL 
Q* FVHLFTPLLICSFTHSSIH*PSHSLT 
FIQLHLFFHALVSGFIWSFIYLFCKINV 
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£r=GIutamic Acid, *-~rnenylaianine, 
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X=Un known, *=Stop codon, Impossible 
nucleotide deletion, V- possible 
nucleotide insertion 














FLIDK 

• 


9835 

* 


23736 


A 


■ #^ a a 

9908 

• 


286 


84 


LKP P PP FF FFFQANFLCF I RD/ GFHHVG 
QDGLPLLTS*SASHGLPKCWDSRCEALR 
PASPLISGPHQTI 


9836 

• 


23737 


A 


9909 

■ 


316 


63 


DQLLVRSIGFEALMSPIFFFQMBF/SLS 
LPKLECNQAISAHRNFCLLaSSDPPASA 
AV* / PMANLLKLCAS VSYCI BLVLNBLH 
WIK 


9837 


23738 


A 


9910 


370 


461 


F*F* FLFSETESHSVA\QLECSAMISGS~ 

CNLRLLGSGDSPATREAGAQESLEPRRQ 

SEPGLCHCTAAW 


9838 


23739 


A 


9911 


294 


28 


PGKNSTVQKEPGKWFCEMKFTPGENAVN 
IV* PTTKDLEY/WINI /DKTVAGCBRTD 
SNFERSSrMGKML\SNSIAWYTEIFRGR 
KSQLMGI 


9839 


23740 


A 


9912 


307 


463 


CKL*TLSNNDVIVGSLIVT/TCTrLVGV 
LIMGD/RLCMCGEKEYLGNLGTFLSIL 


9840 


23741 

■ 

* 


A 


9913 


377 . 


28 


REMLTVKPLAPSFFRWSRAYRAIFFITP 
TWVSKSGKVTGPLFFLHIFFPGP/CPGL 
PIFFFIKTKGTRFFFGWFFFFERVSLCC 
PG * ME CNGAI SAHHKLRLPGSR YSPASA 
SEFHR 


9841 


23742 


A 


9914 


384, 


725 


HLTAIRMTVI KDTENSKCERGC \ KTLMH 
S CWE YERVQPL W KTR W+ F/ LQQVB/LPS 
PCDPSIP/LLRMYPK*tiNTD/CKRMC/C 
I PTS IAALFPVTK/ SWKQ 


9842 


23743 


A 


9915 


187 


3 


QPHKHLGLDNI YI FFEIGFCS VAQfTGVQ 
*RDHGSLQPRPP\GSRDPPTSGSRVAGR 
PRQENG 


9843 


23744 


A 


9916 


15 


167 


DGVSLSLPRLECNGAI LAHCNLR/LPGL 

T? O r»T .PT ,PQ Q DT) VPT >PD DUD &HV 


9844 


23745 


A 


9917 


345 


477 


YCSRTSLPL/ TLKKGSA* PGAVAHAYNP 
SIIX3GRGRWIT* GQEFK 


9845 


23746 


A 


9918 


460 


42 


HQRP KVDKTTKMGE KRSRKTGNS KNQSA 
S/APPKERSSSPAMEQSWTENDFDELRQ 
EGFR * SNYSELKEEVRTHGKEVKNLEKK 
LDEWLTRI /SNAEKSLRDLMELTPKA*Q 
LPFERTNLSS * FNQLEERVSVMEDEINK 
MK 


9846 


23747 


A . 


9919 


282 


21 


AAG^LQ^SEFWLkSLGSRRGLlPtfeVFPV 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
Threonine, V=Valine, 
WKTryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 














RGRQAWLSCDGYHPVRPSRLLCLPTQA 


9851 


23752 


A 


9924 


254 


3 


PGGFYFPGFLPPPLGFPKGFL*RLWPPC 
PATFFFLYQI SKFPFFFFGKYQKNFFFF 
FFFFPESKSRSVT\RLECNGAISAHYNL 


9852 

♦ , 


23753 


A 


9925 


309 


1 


YTPTIMLIiGIYSNELKTNVHMKT* I *MF 
TATLFLIAETWKQPRCSSLSEWl/QQLW 
YIQTI/BYYSWRRNIK/LMKTW/MKLK 
YI LQSEKTQSEKTTYI * FQLYDI 


9853 


23754 


A 


9926 


237 


2 


RWNFNFTQLXLKCI FLRTKFFS *VS K*N 
FKYFLGFWFFFFLRQGLTPVT\RLECSG 
TISAQCNLRLPGSTDFSASASQE 


9854 


23755 


A 


9927 


244 


1 


KPQCLWRK*D+WHNTQKLDVEKGKPLIH 
YWWBRKLV*PLWKTVWRLMR/NLKIELP 
YD P VIP LWGNYS KEMK* LCQREDSRA 


9855 


23756 


A 


9928 


2 


644 


IVQVDQKIRLNIHCLQETHFKCKYTYRIi 
KVNR*R*VYHVNINQEKALVAIDGAYFR 
ARKITRDKERHY/IMIMGSMLQKD 


9856 


23757 


A 


9929 


1 


364 


GTSGTRPINNTTVFSKCSSERKT/R/TH 
LTLNQKLE M I KL SGGG I S KTDMG * NLGL 
LHQTVSQ WNTKEM FL KE I KS ATP LNT * 
MIRHQNSLIAAIGKyLWWIDQ/TQHNT 
IiLSQRLIWNKALT 


9857 


23758 


A 


9930 


88 


374 

> 


ILKACGPCPGNPHPWGPL*GQGGLGLNP 
SKSPPHPHPRGDPTWGGTPIGDYGPKPP 
KLPHGAGLPKKPCSSGGQNN/PKISGP 


9858 

■ 


23759 


A 


9931 


384 


2 


LCSPRFFYRKIWGGGQGSRLLPPPFWKP 
GGKDCFS PGVLDPPGEQRGAPCFP * KKI 
RWG*GAAP WPPPQEG* / VGRAPLTPKI 
PAPVS FGCS P ALPGEGQSETP FFLKKKK 
KEKSEAMIPPPPGPRA 


9859 


23760 


A 


9932 


32 

* 


359 


IFFFLFFF/C*KGGPPPVPQAGGQGPKT | 
PPWPPIWAKKTKPLSKKKKKKKKOAGPP | 
PPPKGGPPPLKKGEGPLLKRKFLKKNGG 
ENFKKKKIWAGGGGNPREPPPFGGAR 


9860 


23761 


A 


9933 


138 


1 


TGFFCCC* SFALVAKGGVQWCDLGSLQP 
PPPG\SSDSPTYASCSC 


9861 


23762 


A 


9934 


103 


355 


LAGQACCS * S TS ANRCVNHA\ PG YANGR - 
CDTITCILVFIRKSWLITPIALITMFCY 
HVG*VRPQEQAPGNRVSLSDLIiL\PPVT 
C 


9862 


23763 


A 


9935 


29 


363 


AWSHNAFLLFFFKRELAFVPQRGERENN 
FVSIiKPPPPG*SPFSAPNPPKG<3*KGPP 
PPPQLIFFYF* / GEKGFSPYNPEGPKPP 
TLGKPGLF*NTKKICPQPRGLKKGGQHI 
W 


9863 

• 


23764 


A 


9936 


173 


363 


PKKRPI ISLKRAFNSNPGD*QKFLKRGL 
PPMGSQFKRGNYPYDKNLEKLCPGGGNK 
GSG/YHYGL 


9864 


23765 


A 


9937 


360 


3 


NQKMRKNQS SKTGI CKNQSACPP PKERS 
SSLAMEQSWMENVFDELREEGSR* *NYS 
ELKEEVRTHGKEVKNLEK/NLDEWLSRI 
S /N * EKSLKDLMELKP KAQELHGECPSL 
SSQFNQLARA 


9865 


23766 


A 


9938 


194 


370 


VSNFFFFFETEFCS FAQVGAQGQNFR* L 
KAPPP\NCCIiFPT*ASRWGTTGTCHHS 
WLI 


9866 


23767 


A 


9939 


426 


1 


S QEFKVAVS YDLTMHSNLSNS ETLSQKE 
KTKQTKTKQKKT Y* EGRKH I KRCPTPL I 
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X-Unluiown, *««Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


• 


■ 


• 








IREIQIQTTVRYHFTHIRI/VLLPQKQN 
KCWCRVEKLKPLVRC*WECBTVQPLWIT 
VWW/FLKILNIKLPPDLATPVyCIYQME 
FSC 


9867 


23768 


A 

* 


9940 


255 


16 


RVIYNIYIMYILCIYTPSRLLVEIGFHH 
VGQDGLDLLTS + S I RLSLP K\ CWDYRYE 
PPCVNIYIYLHIYIiIYMSDGVKY. 


9868 


23769 


A 


9941 


206 


355 


FLQGHTAI KNGRG/ W PGAVAHTCNPSTL 
GG*GGRI I *GQEFKTSLTNMVEP 


9869 


23770 

i 


A 


9942 

i 


3 


402 


HEELNPRSATSS PALVLHVTI AHVTIAF 
AEGCFQHDFCFSQPENRHIPLVFTTASK 
VRN I FYLLCRGRG I LRFFLVCLKIDPAI 
PLMGIYPK\KSIiYHKDTYIHMFIAAEFT 
VAKIRYQSKCPSTDD* IKKICV 


9870 


23771 


A 


9943 


79 


328 


CIKILLKFLCLFCILPPCLNFFLRQSCS 
VARAGVQW\SLQP*PPRLKHTPTSAS*S 
AETRGTHHHAHLIFSYNFL*R*VSHVA 


9871 


23772 


A 


9944 


321 


414 


VPSWAG\PVAHACNPSTLGGRGGRIT*G 
QEFE 


9872 


23773 


A 


9945 


142 


480 


QEAKSFDFSDSTTMEVMUDKKQ IRVIFL 
FEFK\MGRKAAETTQTIENAFAPGAAND 
* QMRWWLEKFCKGEESLEDEECSGRPSE 
VDDDQLRAI IEADPLTTIQEVAKELS I D 
H 


9873 


23774 


A 


9946 


191 


3 


FFLPRGGGDSVFKKPPKKKNGFPKIFPF ' 
CPGFPLAT/ INFPPQI FFFFFFF* DGVS 
LCHPGWNAV 


9874 


23775 


A 


9947 


451 

• 

• 


461 


K* INK* KVKGW/RKEHHANINQKKVDVT 
I FISDNMNFRKKNKI TGDREGRYI 1 1 KQ 
PIHQBNIAILNVYVPNNRVAKYVK* KL I 
ELKKE IHKFTI LVGD FNT ILS TI DRTTR 
*KVS 


9875 


23776 


A 


9948 


186 


405 


NHTLAI SLF * P YKP * AILKLG I YRKERK 
LVC*RDICALMFITELVTAAKLWNQPKC 
PSTDK/WKMWYICTMEYYS 


9876 


23777 


A 


9949 


18 


412 


PEFRITTILPEKKNLL*LTN*VSKFSGY 
NINI *TSVAFF\YSNNKISETGIKIVPF 
TIXiSNRMKYL^/TLTKNMKDLHTENYK 
MIiRKENEEYQMGNMGSWILKINIVKISI 
LHKVI YRVKFLSNFNAI LLKK 


9877 


23778 


A 


9950 


1 


396 


IJVFDGRAERLKTGFHRVTQDGLELLTS* 
SVRLGLLKCWDYWREP/PVPGLWD 


9878 


23779 


A 


9951 


307 


405 


GIRVRR* PQGLRPPNPDALVFKKMVWPP 
PTN\PRGWGGKGWEPPQGPQDPPPQAAP 
GFLPETWRGPWNSLPTRGAWPPNSNLPP 
WPPAPSRHPAQRAPGPSTAAPPPGGGGE 
GWDPPGRV 


9879 


23780 


A 


9952 


2 


402 


YGRPKEGGSLRFWLTTPGPKGETPFFL 
KTQKITGGGGGGP* FPLPRRVRQENSFY 
SRGGGFN* PKRAPCALTWGKKQKFFLKK 
KKNTPQKPKKLPGCRIQY*TQK/SSSLF 
MGGTPGKRQ 


9880 


23781 


A 


9953 


249 


1070 


S SFISRFYFICI YLFETRSHSVTKARVQ 
CCDYN*LQPQPPGAQEILPDQPP*SAG\ 
ITGVAPNTPQLIFYFFVE/SGGSTHVAP 
RLALNS WAQNI AL/ RLPKVLYFTF 


9881 


23782 


A 

• 


9954 


75 


396 


GFKGRKRGLPLPLNQKLKTLNLSEKGLS 
KANTGQKLGLL/RPVSQWNPNGKFLRE 
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Jb— i»iutamic Acid, ^Phenylalanine, 
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T=Threonine, V=Va!ine, 
W— 1 ry ptophan, Y= I yrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IKS AF / P I EK* VI * KQNSL I PDMBBVLG 
V/ WKTKKYSHNI PLGLSLTQKRVLTLF 


9882 


23783 


A 


9955 

« 


181 


458 


RLWNDGLYNI FDLkljQ/ S> i tiV* SLLLLV " 
ILEPESHSVS\RLECSGAVTTCCSLNIP 
GWSDSPPQAFQVAGTTGVCPNAWVPSAF 
L 


9883 


,23784 


A 


9956 


67 


430 


LAPRLI YSPI2CFQWS FMSVRQIQVQSIiA 
IGNFHNPRLPILICHCYAFHRILLFQMV 
RNFILYS ISISANKQESKCILKYILLY I 
NLRNWLGTV7UiAYN/PSTLGV*GGRIS* 
AHEFETSQGN 


9884 


23785 . 

i 


A 


9957 


77 


422 


LPLALWASCRPQLCLDTDFIHGPPTSGA 
ASQFLCFQTMQPPTCSSSRLYNL* P ALV 
LDHTTSNWCFRSP/ SFFFLLPLPLPPA 
PLP\FFFFFFFDRVLLCHPGWSAVARFQ 
LTAAS 


9885 


23786 

■ 


A 


9958 


3 


422 


LHSS L VTEQDS I S KKKKKKKI FLGGTKK 
GGPSLGGFFFGPPGFPKIVKKKRGALIG 
FPFFCPKXKPKKPLFFPKPLGKPPRKKG 
F* IGE P WKT/EGPG FPKKLNFVFFS PWG 
GNFLFPPQKPGGKVGKILLQPKKKPPGK 
G 


9886 


23787 


A 


9959 


370 


2 


PFSSWVANFREGDGEKCSVLQGIRFCMF 
CRY INVTPLVHFPFP I S KGTS FFPLYFF 
SPEVM/HFP*LPY+ FGWALLSRF* /MS L 
L*LFFSKFPFFPSFFFF*RQFHSSPRLK 
CNGAI SAHCNLCL P 


9887 


23788 


A 


9960 


256 


2 


AWEEETAPLHSHCLGNQARP WNHRHAP \ 
P * L I VFVFS VETGFHHVAQAVLELLASS 
DPPTRFNLLSSWDYRGLSPQPMSRKSAQ 
EG 


9888 


23789 


A 


9961 


269 


3 


HRQACGQVRSPIQCWWACKIAQPSLQIV 
RSSLKG* I *SPYDPVTPLLVI YPRELKT 
\SLHTKTCT*RFTAALFIIARRWKQLKC 
PSIDE 


9889 


23790 


A 


9962 


268 


324 


SGTLSH/WIFS*VGIEITMWLKVPDQTK 
KI SLRSAEAAIKYFLTQATAS 1 1 IRRAI 
LFNNRLSEQSSITNTTNQYSSLIIIMAI 
AI KVGMAPFHF* VPEVTQGS PVTSGLLV 
L 


9890 


23791 


A 


9963 


341 


2 


ITFLPTKI IRQSNFKTGKRKTDSNDLNR 
HFKREDTQMANKYM* F I *KMQI \KTTMR 
YYFI S KRTTISRKTSNIQC * DTWSPGTF 
INCLLBCKmvRLLwKMTWQFLTEVHIHL 
VY 


9891 


23792 


A 


9964 


250 


2 


YHPVNLYFKYISP I DV* FGCFI YI FDYP 
LCSCFSIiHPLVYLQ*L/LFFFFEAESHC 
VARLECSGWSAHCNLRLPGSSDS PAS 


no no 


23793 


TV 

A 


9965 


3 


334 


SAAIN IPC/ HS YV* THF* YS * MYRGRTA 
GSYAT/SMLNLLRNCSTLLHRNCTF*PF 
YH/WE*VPISPQSCQKLDFFIFLIIAIS 
EG 


9893 


23794 


A 


9966 


179 


352 


NQE* TE I IiNRPMSNQ I ES VI KYLPTRKS 
HRPHRFTAEL YQTY \ ILLKLFQKVEGEG 
LI* 


9894 


23795 


A 


9967 


2 


350 


THPSINSFIHVFIHASMNSLIHLPTHSF 
SIYLFTKSFIHLFIHFFSFLFFLRPSFT 
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SEQ ID 
NO: of 
nucleotide 
sequence 

* 


SEQID 
NO: of 
pepuae 
sequence 

• 


M 
eth 

Ou 

• 


SEQID 
NO: in 

1TCCM 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to iirst 

amino acid 

residue of 

peptide 

sequence 

< 


Predict- 
ed end 
nucie- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

■ a r> a"a ■ ■>■> ^h. am aF 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
Hr-vviuianiic acio, r— rnenyiaianine, 
OCIycine, H=Histidine, I«Iso!eucine, 
KpLysine, LMLeucine, M-Metfaionine, 
n— Asparagme, r^rroiine, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
A^un Known, "^atop cooon, /^possible 
nucleotide deletion, ^possible 
nucleotide Insertion 














DBFTHSFTHAFIQHLFIH* 1 1 / IFIYSF 
ISFLFF 


9895 


23796 


A 


9968 


228 


558 


CHQ JjKQEIiAI FTS rvl JUQL FSGnliDVYM 
QAWAQRPDKYEYDNK* FIEIKKIIQFTI* 
ISKRMK/YVGINLTR*VKDLHNENYKTL 
MKh I JScDTSB WKDI SCS WNGRINS V KT 


9896 


23797 


A 


9969 


196 


2 


DRKVF FRPLF 1 1 FGLKKKI FPLFWFFPS 
/IRGRG*IFFFFFFFFETESPSPRLECS 
ETISCHCTP 


9897 


23798 


A 


9970 

i 


245 


1 


TQCDMCVFIYTHIYISHICYMVYIYIHT 
RTHTGI LFTPKREKNSNTCTCATKDNP / 
CMTLC* / HKRRQTHTHTHTHTHTHTHT 


9898 


23799 


A 


9971 


163 


4% A affk 

380 


NHRRQKKNGR / HNGNKYKTVTKMINSNP 
P ITRNNLNVNGLKTPI KRQKLSDS I KKQ 
YPS I YCL* KT/ YLKYKD 


9899 


23800 


A 


9972 


328 


1 


AKCGFLKKLN IEL FNS I PGHS GKT * TQM 
FAAALYIISKKYKQPSCPSNDE*MNBIW 
HI LTIGYSAVKKI S WG \ RWMNLENI ARL 
SGRQRGHILQDS VDMKS PE * AI PQ 


9900 


23801 

* 


A 


9973 


200 


3 


MASKCSSERKSHR/^ITHNQKLEMIKLS 
EEGMSKAKIG*KLGFLCQ\VIHWTAKG 
KFLKEIKSATP 


9901 

■ 


23802 

_ 


A 

■ 


9974 

- 


111 

■ 


517 

• 


NLEQPLIKRTIANKPIKKIKTTILI ITI 
NVNGLNTLTK/ RQRWSD * I FKNQDPTMI 
/C* KKN/HCKYKDTNTLKI KG* KNIYDA 
NMLTQIKKKAGVAILI/DKIDFRA/TDI 
TRAKEGHFIMIKGSVHQEYVTILNVSAH 
AS 


9902 


23803 


A 


9975 


342 


3 


FIiATKKIFFFFPVFPPQGPPPVLVAPFF 
WAFGGGPPKWAPKKKFFGPRGPPFENFF 
F*GFFFFKTGGFFFIiRFSKKVFFFFFFF 
SETESCSVA\RLECSGTISAYCNLCIjPG 
SR 


9903 


23804 


A 


9976 


13 


326 


ILDHSNRPS SSTLKTPNAGKDVEQKQFL 
/LCCW*ECKIVNYU3R*L/WQFLTK3aiL 
L*PYDPEILLLGIYQKELKTCIHTVACT 
♦MFIEALFVIAKT* KQPRYSS VG 


9904 

• • 


23805 


A 


9977 


74 


396 


LAPRTQPLPCSPAPLRQCHTQSLLPPFL 
LRI* I F YF FCFLKTKAHS VAQAGAQ* RDH 
GS LOT * T PTLACT P ASA / VARTTGTHHH 
AWLiX vtrtrLi/ iCivVorloijVLjJKi 


9905 


23806 


A 


9978 


287 


2 


QPHNSSYTQIFTPKIIIKTPPPNLKQNN 
TTPIKKKTQ*FP/PSTNP/SYPSTHPPT 
HLFIYPLIHISPPPSPTHPRIHPPTYLS 

THDQTTTDQTUDDQP 
inro litUrOaUUrJriSMt 


9906 


23807 


A 


9979 


97 


332 


GNNDLFLYFFFFLETGSHSVAQARVQWC 
DHSSL* PQPPGSSGPPISTC/RLGIjQAC 


9907 


23808 


A 


9980 


168 


2 


STWLRWDYWKTFFKTSKS/WPGTVALTY 
NPYTIiGGQGRHIT*GQEFKTSLANTVKS 


9908 


23809 


A 


9981 


267 


1 


LKRERR*RRAN/ SKTSRRKEXIKIRABI 
TATENRKPIEK/ SNKTNG* FFEKTDK/ I 
DNPLERLRKKERD* ITKVRNGREDIL*N 
NDLIKVKKD 


9909 


23810 


A 


9982 


230 


412 


CHIYLC/D*PIDGIV*CHIYIYFGHKHT 
HTYTHTHTHTin^PTC*VSKILLKKKRR 
TCSVCA 
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SEQ ID 
NO: of 
nucleotide 
sequence 

'i 


SEQID 
NO: of 
peptide 
sequence 

■ 


eth 
od 

• 


SEQID 
NO: in 

09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence* 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
Jl— Glutamic Acid, F=rbenylalanine, 
G=Glycine, HHHistidine, Msoteudne, 
K«Lysine, L^Leucioe, MNMethionine, 

^WT^^_ Jm ak mm mm mm. mm* £ _ ^T^k ^f^fc — 1 J jM - 

IN-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, ^Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


ft ft t ft 

9910 


A<% Oil 

23811 




9983 


229 


415 


■ SYVFCHIRFCPLDSLNKEIVA/ SQVQWC ; 
VS WSTT* VAEAGGSLEPRS LRPDWASQ ! 
RDPISKN 


9911 


23812 


A 


9984 

» 


2 


288 


RG YDS TS S AYTVGHFT * BDKATI TRLS6 
KENVKHAGGBTLRRIiLDVYP * TQRI IDS 
VGNLNSA\SAMM\GPPRLPKVLGLQAWT 
TMPGP*KYTHIFF 


9912 


23813 


A 


9985 


246 


3 


FLLDYQYCALCCHCFLFNFANCGHFNI L 
LGGVKHTKFPKYS * KFS VL/ WLDAYAHS 
CNSSTLSGLGWITRGQEFETSLANMVK 


9913 

■ 


23814 

* 


A 


9986 


342 


1 


FLNFGLTGFFPKGPPKPLGLRDGALPLD 
PPQFLGVFKRGAFFLGTQ* FHF\LEGFK 
FWFWEPKAFFFFF/ CCETISAHCNLCLP 
GSSDSAASASWIAGITGAHHYARLSRRL 
RQQN 


9914 


23815 

* 


A 


9987 


330 


1 


NRPLNNLVMAS KCSSERKG 
/LRKMIKLSEESMIiTVTTG 
/ VQVNTKEKFLME I KSATP 
NTLTADREKV* VI WI KNKT 


YTSFVLSRK 
+KLGLLCQT 
GNKGTVTKS 
NHKIPLSQ 


9915 


23816 


A 


9988 

* 


209 


2 


IKGSHFPVLI*YRKIIIIFdGVlC6KG&F 
YSY\LFFFFEKGYHSVTQLKCSGMILAP 
CNLCLPGS*DSPTS 


9916 


23817 


A 


9989 


256 


357 


QHRWMNTEMLNYLC* 1 I YVHF/HTHTHT 
HTHTHTHTHTHTHT 


9917 


23818 


A 


9990 


263 


1 


IQ I RVFENSHLW PS YVHFPHGHS YLFCS 
L* *AFFRH\G*FVLSVHFLKIFLNLKKI 
FFETG \ S HSVAQACS GM I SAHCNLCLLG 
SSDR 


9918 


23819 


A 


9991 


3 


368 


SLDPRRSRLHLAKIVPPRSSLGDRARPC 
LLKKLELFFETGFCCVAQPGVRWCSHGP 
LQPRIPG\SSNPLTSAT*VAG\IAGMCY 
HTQLQLAF 


9919 


23820 


A 

* 


9992 


294 


391 


S GTVAHACNS S TLGGRGG W 
TSLTNM 


W*(3Q/ESQ 


9920 


23821 


A 


9993 


317 


3 


TDKELKLRDEQRKCFVEMEFSPGKHAMS 
NVDMTKDLEYS INLVDKAAVELERADSN 
FERSSTLGKMLSNSISCYKBIFLERKSQ 
LM*QIPLLLM/FKKLPQPP*PSA ■ ■ 


9921 


23822 


A 


9994 


193 

• 


395 


IFWITCCSSLSVLAASPCTFMLWRQLtS 
LNLMNQ*DHSLSQSIiIQSKSATLFNSVT . 
AERGDESAEEKLEASRSWFMRFKERSCL 
HNIKVQGEAARTDR/EAAASYPENLR/V 
KDEGGYTK* R I FWDEI AFCWKTMPSKT 
FIAREKS I PGFKVS KDRMTARCGRSCL 


9922 


23823 


A 


9995 


82 


420 


S FLW KL C PRGAAAYVR CVSA \ LL EDVSQ 
LGYTGVRDPIEEAVSPFSELKRCRERTT 
AVFTAVRQGHLSLQK\FLLPFVQLCPAH 
RGGV*RQ*ALLCCGGIiSPVQAPLPLCLP 
TQ 


9923 


23824 


A 


9996 


376 


1 


CGRTDAPASTS P YAMTVRFLR P PQ PCVL 
DGLHNCPARAPQ EQNSLGEVDKRG PREQ 
TR\ PATAAP PRPLGS PSWICPGDW* *CR 
NS LQGFSLAAS VARGRDWALAGLPATVS 
ARFQEQQKMNTV 


9924 


23825 


A 


9997 


495 


982 


VQKFLRPNLAP KKHQRKLAP NS LQGRLR 
SLPSPTVWCITaAPPTGVLSSLLLLVTIA 
VCLWRMHS WQKNHW * AS FKKTHD * TGLA 
EP\SCARKQCSEGRTYSNAVISPNLETT 
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SEQ ID 
NO: of 
nucleotide 
sequence 



SEQ ED 
NO: of 
peptide 
sequence 



M 

eth 

od 



SEQ ID 
NO: in 
USSN 
09/515,1 
26 



Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 



Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 



Amino acid sequence (A-Alanine 
OCysteine, D=*Aspartic Add, 
E=G!utamic Acid, ^Phenylalanine, 
G«Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methfonine, 
N=Asparagine, P=ProIine, 
Q=GI utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 



RIMRVSHTFPVVDCTAACCDLSSCDLAW 



/WFEGRCYLVSCP; 



tfCEPKKMG 



9925 



23826 



9998 



401 



341 



IA£KCSTERV/SRSHTLKQKLERFKL*E 
KGM/SKNEIGQK*ALLYQ/VSQVVNAKE 
KFLKB I KVD\ LHAQM I RK*NS LTADVKK 
VIiWKIDQTL/PLGHSVIQSKALTLFSS 
MKAQ*GEEAA*EKS 



DTTILKLRQLIIGAVATKHWSERKSLTS 
LNLNQKLGT I KLTEEGMLKAKKGQKRGL 
SCQTVGQVMN/SKEKPLKEIKHAIPGNT 
* MISK*NSL/I IADMEKVLVG/WSKVI F 

*DQTSHNIP*SQN 

EQRRLPVTPGVASLQTLCCFLPRRGC 
G * ESAATHPPGPELVLLLLQGH *AGYLG 
/ DFG/PRTATGS PG\ PAGVQPGPCLVPQ 
ItSRAAAAG * PDPT* * GAI FS YH CKALRA 
TAARHCPPGC*WPG\PAGSPPGTSRPLS 
HPSSPSPPWTGVSS FLIiNLVLCP * ASDA 
QGHAC Y PG P VPALAGQGHDS AGGRLGD P 
GHHWQQAALHG/DQGPPGHQAHAGTQPS 
AQGLGC*ATHPGEGG*VGYQGLPAG*GR 
SHPRFP I VTRPMGRS CP YATVRPG PGSQ 
GQTPPGLV\ PRGBAGQNPGGAAPGLSCI 
QEVTGRWWW*GKKPLGHYPPCGAVRCPS 
S*PHPDTMFKGTGREGCLPPTSPVGVTG 
QMS\EGSRPCEWTLTMSPWGE*SPRASC 
AMLHFC PGS RVGG YHGCPPLHHMGPQST 
AGQAGQPHTliD I KAS 



9926 



23827 



9999 



30 



393 



9927 



23828 



10000 



133 



1420 



9928 



23829 



10001 



1818 



6682 



9929 



23830 



10002 



439 



1135 



IKFPEAEEEEIQEVLVLLFRVAEEKENP 
SRVGNSQRE YENQVVL YSI CNQLR YRNN 
L/RHVKKDERGYYEELLNYSRDHLMLYP 
YRUJDIMVKGFSITPFSYYTGIMENIMN 
SGHNFTAADCLRLLGIGRNQYIDLMIQC 
RSSKKFFRRKTAGDLLPI KPVE IAIEAW 
WWQAGY I TEDD I KICTWPEKCATDKTV 
DSGPQLSGSLDYNWHSLYKKGFIYLDV 
PMSDDSCIAVAPLEGFVMNRVL/NFDYF 
ETLLYKI FVS VDEHTNVAELANVIJS I DL 
S LVKNAVS M YCRLG FAHKKGQ VI NIiDQL 
HSSWKNVPS INRLKSTLDP*KMLLSWGG 
GESRRPVQEASSATDTDTNSQEDPADTA 
SVRSLSLSAGHTKHIAFLFDSTLTAFLM 
MGNLS PVQS TGEGEAQR YFDHALTLRNT 
ILFLRHNKDLVAQTAQPDQPNYGFPLDL 
LRCESLLGLDPATCSRVLNKNYTLLVSM 
API/TNEIRPVSSCTPQHIGPAIPEVSSV 
WFKLYIYHVTGQGPPSLIiLSKGTRLRKL 
PDIFQSY\DRLLITSSG\HDPGWPTSH 
VLTMLNDALTHSAVLIQGHGLHGIGETV 
HVPFPFDETELQEDSC\NMGVHKALQIL 
RNRVDLQHLCGYVTMLNAS SQLANRKLS 
DASDERGKPDLASGSDV\NGSTESFEMV 
IEEATIDSATKQTSGATTEADWVPLELC 
FG I PLFS SEIiNRKVYRKI ATHGL\ * EKR 
AFKNLLHSSRKLSLQVLNFVHSFQEGAS 
TLDIHTEPS FSSLLSQSS FADMGVPLPA 

KKFN/SLKIVSY 

MAVDFYNFVTKLVVTTGYLRISFIiAYKF 
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SEQID 
NO: of 
nucleotide 
sequence 

■ * 


SEQID 
NO: of 
peptide 
sequence 


eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G^GIycine, H=Histidine, I=Isoleutine, 
K=Lysine, L=Leucine, M=Meth ion ine, 
N=Asparaglne, P=Proline, 
Q=Glu famine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *-Stop codon, /"possible 
nucleotide deletion, V=posslble 
nucleotide insertion 


• 


■ 


■ 


■ 






FSFPFLDSLS/LLSPRLECSGVIPASLQ 
PLLPGFKRFL/ CLSLPSS*GYRRAPPPC 
PALLYF*VETGFHHVGQAGLBLLTSGRL 
TRLGLPKCWDYR 


9930 


23831 

• 


A 


10003 


216 


572 


LTIiVS I FTAKQS VTLP 1 1 CMLEMRKWLL 
NDTSAF/PII * KVLFPKI FFFFFF*KGV 
LFCPPGGREGAQFGLTKPSPSGLKEIFL 
PNPPGGWKQRRVPP FAGYFCFFIKRRVS 
LLWRGLPN 


9931 


23832 

■ 


A 


10004 


3 


354 


YSNNPKDRKGETGINEKTNNKMVDLSSY 
TSVITLNF*ML*/D/KIN*KFVKSP*KK 
PTLCCLLETYFKYGIGGLKVKRWKKI *H 
ANTIGKKTLVAILITKQTSELS*YPNKE 
EHYILIKE 


9932 


23833 


A 


10005 


187 


455 


ASIHYSLGVRICIYIPGCIHPGEKSQKC 
NTCGKNSTQKPELVN/HR * LKGHRCMK* 
NEGGKTFTRVQSL/ ACHKRVYTBRKS YK 
C 


9933 


.23834 


A 


10006 


134 

■ 


456 

- 


I WNLP LGKPCAWKQVTSDL I S PDAI FIN 
AAIHSLSFFFFFFETKFLFVPQVGGPWH 
NLN*LKLLPPKL\SDFLVA*VSKKVGIT 
GPPHHPGLVIWGFKKKGGSPMLPR 


9934 


23835 


A 


10007 


210 


451 


CYYT I QTTEQPANDSMLG I KPHVS ILTL 
NVNNLNTP P KRLG VADW IK/ TQD PAFCC 
LQETSLTCNRTHRFKVKG*/WKKIYH 


9935 

• 


23836 


A 


10008 


169 

• 


462 


YPCWVDMAKNTS YNKCWQGCGE I RILPH 
CRWB YKMASPLWK/ SQFLNK*NMELP * D 
LPIPLLDIDPKEWKTGVQTKTCR/RMFI . 
AALFPTSRS 


9936 


23837 


A 


10009 


490 


724 


BMPP * FME KNFLAPGG \ VTP PGVNTTRV 
NPKTAFDPKRSSSPFLEKSQKPGLIEGL 
THLKTLILNPLKKDESGEIPVLF 


9937 


23838 


A 


10010 


306 


588 


KQFIRAFIEMSWWLTPVIPVLWDTKA5G" 
SGIRDQPSQHGETPSLLKIEKLAGHGGA 
GL* /SQLLERLRQENHLNPGGGGCSEPR 
SCYCIAAWVTE 


9938 


23839 

• 


A 


10011 


473 

• 


53 


ISLGQDSTDISTINIPRMAFKLSSKRKS 
HMS FS LNQK/L / EM I RLS / EEAC * KPRS 
QKLGLLSQS*PSCESKFLKETKSATPVN 
TQMI * K*NSLITNK\LKEVIiMVWIKDQI 
KHTISLRQSVI*CKALPLFNSMKAERGE 
ESAKC 


9939 


23840 


A 


10012 


184 


455 


FWLSLI KKPLSSVTQKKINRET\ ILLTT 
KKKKKDCNFLEGGIjL IGGSVCRLVYRHY 
ATLYS VFCVDSS KSERGSVDPTQVFVET 
*DKCFDY 


9940 


23841 


A 


10013 


203 


453 


KEELYISRERKSSTS LLLNQSQK* LIKL 
SEQAMPNANIGDRLKARPLSPNSHLIKT 
KQKFLKEV/ KCATP 


9941 


23842 


A 


10014 


237 


483 


TLIRYI ILQGLFS YI P PFFFETESRSVA 
QAGVQW* DLGSLVPGSRHSPS SASQVAG 
TTGFHY/HAWLI fclfcatgs 


9942 


23843 


A 


10015 


157 


875 


DDPVRGRGEESAMPSGGRRNRSVSSSWV 
GSMAGITTIEAVKRKIQVLQHQADDAEE 
* AEHIiQ * EAEGKR W AWEQ AEAE VAS VNG 
RIQLVEEELDCAQECLATALQKLEEAGK 
AADESERDTKVIE IWALKD/E/EKMELQ 
E I QLKEAKH I ADBADGKYEEVACKLVI I j 
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SEQID 
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ing to i irs t 

amino acid 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Add, 
u<— oiuuiijjic aciu, r— r nenyiaianine, 
OGIycine, H=Histidi ne, I=Isoleucirfe 7 
K=Lysine, L=Leucine, M=Methionine, 
ii— Aspuragine, x ^rroiiiie, 
Q=Glotamjne, R=Arginine, S=Serine, 
•^Threonine, V=VaIine, 
W=Tryptophan, Y==Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


* 


■ 










bGDMGClEERAEIiAES RC * EMDEQ I RLiM 
DQNLKCLSAAEEKYSQKEA* CEKERKIL 


9943 


23844 


A 


10016 


579 


30 


OGISTKKNKWMIKQNSSELSTCVHNQIT 
LHKIAMATQWG\KNSFFNK*C/W/EN*V 
ST/CKKIKLSPSVTCTI*KQYLKWIKYL 
NI KI * QTS * IS \NGGKYDI GLGTNTLLiQ 
KTKNLMS FS * L* NQMHKEQ 1 1 EKWDY I K . 
LQNFCTSKEKFSG Vi^SPKKW/ DE I FEN 
HI FDRS * FS QY INNV*NS TKS E 


9944 


23845 


A 


10017 

i 


44 


228 


EF VCR VRGLWW \ CVVLAS QI*L WEAEAGG 
SLEPKSLRLQ*AMIM\DCTPPW 


9945 


23846 


A 


10018 


471 

- 


40 


PGEGRFQQTRFP P FLPPWG P KGGPFFKK 
PKKKKRKEMIKLS * KPRQANQ*LGLLCQ 
/TSQWNAKEKFLKEIKNATPVNTQRVR 
KLNSLIADIETVLWGTEDQTSHNIPFG 
QSP IQINILAVFESMKAEQSKEATEEKF 
EARS 


9946 


23847 


A 


10019 


217 


486 


KKI S * PVYLSLYLS I YLSVCLS VYYLS I 
YLS I YLS I YLCI YLST I I YYYLS I YLPV 
YLSINIiIiSl/CLSP^YLLTYHLSSYLLS 
YLPTYLP 


9947 


23848 

■ 


A 


10020 

• 


327 


1245 

■ 


TENQQGVLALAALS KIRMMPTRLKWFLN 
ID YTKFCEAFHV FKKRKKKKINKNI KQL 
KMPLTKRI YLAFTE YSTQ* QQNTH\ FFL 
S ADETHGKIE / HVXjGPKTRHWKFKR 


9948 


23849 


A 


10021 


329 


490 


GES PTDNHCRS PKSQ VE KVI YTFFETE S 
RSVTPAGV*WCNLGSIjQPPPP\GTSD 


9949 


23850 


A 


10022 


3 

« 


394 


YRVLEVLGYMLNIRFWLGCSSCdQAAQ 
FREGSRCTTACRHPAHTLHAAKMSRRKV 
SSAKCR* RETPKKRSARWSAKPAP\QSE 
TKPKKAAGKDKSSTSS\FSSSSSLKAEK 
GGKEKQAEVADQETKDLSAE 


9950 


23851 


A 


10023 


448 


477 


SQVFETSLAS*MSKNIL/WAGTGAHTCN 
PSTLRGQGGQIT*SQVFETSLAS 


9951 


«t ***** 

23852 


A 


10024 


198 


455 

• 


S LKTNRRE I PED FHGQPCG1CLVHSEKPR 
VRMGQTESKYASYLSFIKVILRRGGVRA 
S /TRNLI MLFQT I EQFCLWFPE *GTLDL 

KUW 


9952 


23853 


A 


10025 


469 


26 


PPRAVQPPFLPSSSliHTCVPLCRTVFSA 
TSRPDSLLPALQVSAKMSPVL*SSLLLP 
SLGQGFPMASIAPRL/PHRSLRSCWAA 
AHbR/UiljUKHbAi ljQUbLoUWPKiSJDrJbJr 
S FFRDRVLLCHPGWSTAA* S * LTAASNS 
GAQVMEFQ 


9953 


23854 


A 


10026 


222 


489 


KKKKKKKKFCFIARVKSNGAISAPCNLC 
LSGSWYFPASTSI»VSGNTGACPIiARYNF 
FY*NL\ FLVKMGFHHIGQES I 


noc4 


23855 


i\ 


10027 


313 




LYYILQHSLNCNIIG*LQFESCYIKLAG 
DKIAFRFLTLKTNHQ LGLVAHAYNS STL 
GGRDGWIT*GQEFKPSLANM 


9955 


23856 


A 


10028 

« 


49 


388 


TPKIRGVFPFKGWFLGVFGAFPFPQKDI 
FYFPRTPMGTRLVSPGKKKKKEV*RVLY 
GLKKAKKI FVLKVKFSHKRGFFGEIQFK 
KLFKW/HLKRNPPPKKKKR 


9956 


23857 


A 


10029 


394 


231 


GRQ IASAQE FKTS WGNMAKLCLF I * RDR 
VSPCCPSWS* IPELKQSACLSLPKY*DY 
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peptide 
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Predict, 
ed end 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
OGlycine, H=Histidine, Hsoteudoe, 

•mm w * -W ■—• » m m "fc At ■ ft * W 

K=Lysine, L=Leucine, M=Meth ionise, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Argtnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosme, 
X«Unknown, *=Stop cod on, fc* possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RC/ RNTTPGL * HLL FRCLLI I LP SYL FF 
CKSFKLLYMDI . 


9957 

• 


23858 

- 


A 


10030 

• 


265 


2 


HDNMRGSDYFPASNGLAVPFKRQVFTIA 
SNFLEKSPPRLAFT*LQVD*ERKWAG\P 
VAHTCNPSTLGGQGGWITRGQEFKTSLA 
NMPKP 


9958 

• 


• 23859 


A 


10031 

• 


235 


413 

• 


KAGKLFFFTE ITFKKGKKAGGGGSFLKP 
. HLFGRPGGGNHRVR I L* KRGFYKMGRW 
LKSLPCDSPPLASQKGGVSRKSPPPRPF 
CPF*RLFP*KKKAFRLFIPKKV**ALV/ 
SLPFIGEGKTLFFFF*DRALLCRPGWNA 
VAPSGLKLSS CLS LliRS *DYKHVPPCLA 
MF*KFF/CRN 


9959 


23860 


A 


10032 


167 


424 


GREFDLRMGHMVKPPLSFFFFPFSETMS 
LSVTQAGVQW*DITALSASWTPVISDSP 
VSASRVAG\IGTTGVRHHTQLIFL 


9960 


23861 


A 


10033 


252 


3 


VEGCLSSGVQDQPEQHSETPSIiQKKIFF 
\ *LAGHGSMCL* SQLLGRLRWEDHSSPG 
G*SCSEP*SCHCSPAWATB*DPISKKKK 


9961 


23862 


A 


10034 


175 


492 


CPTCPFVWIPLCSEQPVQLYMWHTHTH 
THTHTHTHTHTHA/HLLFSLSFSFLRQG 
LSLSPTLWRSGMI/CGSTQSQSPRLKQF 
CSPRYSRG*GWRMA* /DPGGRGCSE 


9962 


23863 


A 


10035 


200 


448 


L YLHLVQLSP LDF PTPLFRVE CPVSIiSL 
CLSFS*FP*VCFFFLVSLGIiF*GRVLLC 
RPGWSAWQSQLTAD/ SDFPVLKHSSCL 


9963 


23864 


A 


10036 


229 


463 


MILGI S K I S FSLAI TLANFP PTLNLF I F 
LDTGSCSVAQADVQWHNQGSL* /SELLG . 
SSDPPTSAS 


9964 


23865 


A 


10037 


164 


422 


GLS SQLTDGHLLALIiRVGGSE VGSREES 
ERE EAGRE REG ERRDSRGG*REREREBS 
GAREEREKKEKRER/ERDRBEK 


9965 


23866 


A 


10038 


1 


490 


PNQIQNAVLIMI S FS ICFVS VI FFKSE I 
YNGQPKFKSFFFF*DESCSVAQAGVQ/W 
CG/LQSQQPRPPG\SSNP\PTSAS3VAG 
ST 


9966 


23867 


A 


10039 


255 


446 


FFLRQSCYVAQARVQ/YAIHRCSHSTLQ 
LPALASSNPSALLAPRVAGTAGMQHHNQ 
L*YFFRIHS 


9967 


23868 


A 


10040 


157 


390 


DHTCPPPSTADYRLS* FLYFLLLLFRRS 
LALPPMLE CSGAI SAHCTLC VQE TRETD 
AAHFKAARTIRAPHPAQLTIAFQVQTHV 
YQDGH/DGSWWCDLRTHPPRS 


9968 


23869 


A 

« 


10041 


290 


30 


TLKKRKEFSSVLCPRGYIVLLLLVHQFL 
FFQTGSCSVTQAGMQ/W/CDQSSLQP*T 
PG\ SNNPP VSASQVAGPTCMHHGFLHTE 
YESRGPP 


9969 


23870 


A 


10042 


190 


929 


NHR Y I S YLTRLVK I KNFANKS WLESLS P 

jB a mm mnr mm afm- t fm m fc_ jm# - ^ m -m* mm *» im m m^mmmgm mm mm mmm mwmp mm km V 

TIiKHS*/W*CK*V*SLWKTVWHYLVNL\ 
FCDDLTFDPAI P FLMI YLIEMCAQ VHPE I 
YARMFIAVLFEIFKHQ\NNPNKLWYIHT 
RESYKTIKIKE 


9970 


23871 

■ 


A 


10043 


457 


724 

■r 


HFGRPRQVDCLSSGVQDQPGQHGEVQSV 
LKL\ KKLAGHGGAHL * S QLLWRLRHENH 
LNLGGRGCSEPRWCHFTPAWTTEQDSV* 
NNNNNN 


9971 


23872 


A 


10044 


111 


429 . 


KPDEDTTGRENYRPVFFTNTDANLSKIK 
CMYPCIS VH/HVNM YGKIPEMMFT* Y* * 
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correspon 
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residue ot 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartic Acid, 

IT* f> l,r4. n milt A ~IA 1C?_T)l l __.,i«i_„:«._ 

H>=Olutaniic Acid, 1*— Jrnenylalanine, 
G=€lycine, H=Histidine, I=Isoleucine, 

T/' — T irfltnn T —I TV/I . 1 AM Saa A 

tv—Lrysme, Lr~ Leucine, iVi=Metnionine, 
N«Asparagine, paProline, 
Q=Glutamine, R=Arginine, S=Serine, 
i— J nreonme, v—vaiuie, 
W=Tiyptopnan, Y=Tyrosine, 
A— Unknown, w =atop codon, /=possibie 
nucleotide deletion, ^possible 
nucleotide insertion 














* LFLS FF F YEMB SGS VPRLECSGVI SAH 
CNLSLPASSNPPPGFPR 


9972 


23873 


A 


10045 


30 


220 


LGVLYKKNSVFNKCYWDN* IS I WKRM\G 

kuV I JjlFrlli\.lJMJ? ilWlKlJljfJl 1 X Kill It, 

CSWMKTKA 


9973 


23874 


A 


10046 


203 


361 


TYKLLVIFIVIDHF/LFSFLSFFFRDRV 
SFLLPRIjECSGAI STHRNL * LPGSSD 


9974 


23875 


A 


10047 


198 


358 


I QRKCFT FFFFFE V * S C FVAQAEVQY CG 
SPG\SSNLPF£ASEVSKTIGWILPS 


9975 


23876 


A 


10048 


270 


272 


RKNQRFFKI ARKRLNKMTR X S PL IS 1 1 1 
LNVSGLNFPLKRYRIAEWTKKKK\DPI I 
CCL/QKTHFAG/RDIYRLKIKGWKKIFH 
TNGSQ*QRSEFF 


9976 


23877 


A 


10050 


256 


1 


GVTTNLYFWKNFLNPPFLVKKCENCPIN 
KVFP * KPPLLKKTLFFSKKKFCGV/HPT 
EKFFFFFFESLALSPRLQCNGTILAHCN 
LR 


9977 - 


23878 


A 


10051 


112 


359 


SKVSEEPTENEVEHNLKV*SFILPLKQY 
SM* * QKKNIHIHN IS FYKK* ELTWPGMV 
AHACNPGRLGTEAGRSPEGQEFETSLA 


9978 


23879 


A 


10052 


346 


2 


RVSSYTLITLNLNRI^NVALKRYRRLNGF 
FFLKKP * PN\ I CC I QKTNL I YKSTYSLK 
VKGWKNIH\HANGLKKQAEVAI \ LFISD 
KKDFKSKIVKRDKEGHYVMTKGSIQQ*D 
RTIL 


9979 

• 


23880 


A 


10053 


376 


6 


TRPSQHPDFILLNKCLLFEATPFLVICY 
SGHRKLRPVAPP IRGS KSQI I PMPPAVL 
PRWHLSCLSFRIPSSFF*LRLPRLCKSR 
/SAWGPVS PHLPQ FLFFQAI S FFFFL * D 
G I LL CH PGWSAVA 


9980 


23881 


A 


10054 


102 

i 


347 


FLSFFVDGKLCLEAKIWVLAPAVFIVTR 
WiTJiThSTL/ RQVELRNIYLCSJJVHIHFH 
LYFC I YLS AYILQTMS LY*HL * YHSNV 


9981 


23882 


A 


10055 


251 


51 


I SLVNWLFI LENMKLDPYLS P YTKINIR 
/WQ*DLNVKNGTTKVLEEN*GTIYVMVG 
WKRPILACQQPKS 


9982 


23883 


A 


10056 


169 


342 


SQKQYSTCQNVFCFVLFLETRSHLV\TT 
LECSGAI MAYCRLDLPG *RHP PTSAS *V 
DG 


9983 


23884 


A 

• 


10057 


1 

• 


586 


AAARPAQGKARPGLLLRRGRWELRAGF 
LTAKGAFR WWLSQKHVKMS YP PQSR03 C 
GGMGAAGPPSLTVHQELCGAYS PDGTTB 
ALPLAS WPRARPS PAKAPAYDTAKLPAL 
IGOGSRRPPGVNPGASSLKPGACVSEGA 
GPTGTLESAGSR PPTPLPPPV/ CC PGPP 
ARADH*FCHVTPQCR* /PPPRPSP 


9984 


23885 


A 


10058 


393 


43 


HARPGAE C E RPAEG EAARADAFS GRHAC 
PHSLYCRL*RGFFVHKLHLNKA/RLIKR 

GLLEQTVFS FEIRSHCAAQAGVQRNNHG 
SLQEF 


9985 


23886 


A 


10059 


228 


466 


GLLHGPPIYDLPEPVSTTPSLYHPPIVP 
+ASPLGPSAILTLPFHSPPPA\PLFSPP 
HSVLGSSGPRSLPT/SPTHLPTLTPH 


9986 


23887 


A 


10060 


293 


9 


DILQFYTFSPIQ\CPVYIIiHL*YISLQR 
L*FHQKYLIGIYFIETWSCSYARLECSG 
RITPRCNLNFPGPTYPPTSASGVAGTTG 
VCDFLGLPWSR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 
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SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
il— {glutamic Acia, r— Jrnenyiaianine, 
G==Glycine, H=Histidine, I=Iso leucine, 
K»Lysine, L=Leucine, M^Methionine, 
IN— Asparagine, Jr^rroiine, 
Q=Glutamine, R-Arginlne, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X^Un known, *=Stop cod on, /"possible 
nucleotide deletion, ^possible 
nucleotide insertion 


9987 


23888 


A 


10061 


199 


428 


TAYCRTL PLKJ^SNPT I SGS FFFFLETGP 
RSVPG*RAGG\DLMAHCSFDLPGSRDSP 
ASPRQWGATGPCHHP WLI FB 


9988 

* 

■ ■ 


23889 

* 

■ 


A 

• 


10062 

* 

-p 

* 

■ 


191 

- 

• 


2281 

■ 


LQTRGKLFKYKEKGICTKLHRSE 1 1 IGK 
QWKEAVCPPTGQAQSQGGLDPKSREALR 
GGHGFVPGGGVCGHQEL*LHSDGLKAVH 
PAGS E PS D PGGVQAAACQDAATGGTPLQ 
PMPETARGC/ TATAS S RKAGPGMDP/ RE 
A/GPEAWQLGKARS S WGGS STSGGPAWV 
FIRDTPSWWGIPGTVSPPAGPRCPPPTR 
PQWBQLGSGRPG * MQTTAEBTNAAPRAG 
QERTPSRGPSADPGVSQHVLSHRRSCPK 
CERRS VP PNG \ AG PTL WTB LHP SDASVS 
ALT PL * GDVEVGP YG\ CBGYI KPGRRTH 
QHNWLDLDLRPAAS RTCRKMLE LPRV* F 
L LRL P ELAETYS S T P P ACAQRPRGTGGG 
/PPQRCVLMSGSHQGGAS*KGRDAPSLG 
SGLERPREGGPGWLS / RQPSPHAVQPCK 
HMLCLGPAF*RGRGARGVPVPVRASHFP 
FHGAAWW PGDHMSQP * WQMLSRPARGHR 
PSWQVRRQCTVLGALKPGLPDLKCPGPT 
AFLPG*GVS/EENGQIPNVSTQLYLQNL 
PLPREQN* RRFPRDLS SGGAGPGTG* CG 
GGGGHALSPTPSRRLRAHSKDNSWAQPL 
CCWMPAASGC\PGIGCNGVPPVAAS*QA 
AA*TPPWSEGSLPAG\*QPSAHRCVTRV 
PGGR I PHLLAQAVP PDDRAFPNCQASGQ 
LLGVHSRPSLSAAGCRQPIiAVSGIGCNG 
VPPVAAS 


9989 

• 

■ 


23890 

* 


A 


10063 


65 

■ 


453 


ISREFLFLETKSLYIMLKGS IQQEDI I F 
VNI YTLNTGAPRY/ 1 K* ILLELKRQIES 
DTVI PEDDNTPLSALNRPSRQK/ VDLIC 
TIEQMDLIDIYRTFHPTASEYLFFSAHG 
S FSKKYYMLGHKI CLETFLN 


9990 


23891 


A 


10064 


100 


466 


LGILPLQSRPSGPGGQCTPTLPFWVSLP 
LPVRGCFLHLCWCLPYSGALGSPCHTGL 
VCWHIPCF*CCX3VSP*R\SCLEVHLYTH 

HIT f H Iff If f iHUfliUfftW U>tTT JUf 4MD f P DT T3 V/ 'VU 

iiiJLxilrlXiixrlXiuf V JLrri w f1K 1 J^KJVJor£U<U 
VMRLCGGGGRA 


9991 


23892 


A 


10065 


226 


377 


GLAGSPCISGSVSFCSGCIY\RGHGIMF 
ICNDCKVFRFCKSK*NCFKEHNP 


9992 


23893 


A 


10066 


292 


458 


KVSS I CLKMSFLFLFFFRTGSHCH * KVS 
S I CLKMS FLFLFFFRTGSH/ SS PRLECS 
(jMl XAHCbJ-iK.ijijUbbiJLiTAaAb * VAQ 


9993 


23894 


A 


10067 


252 


36 


TGFNPLHFIYYYYYYYY\FESKSCSHAQ 
AGVQCCDHGSLKPQP\PSSSHPPTSAP* 
VAGS RNAHQANSTAW P MVS 


9994 


23895 


A 


10068 


56 


481 


GILKNCHNLMIKLERMRSCVLMGEQRK/ 

MTV* D"R QTPGTTOTVTJTVPTT TNDT.WVT? T 

nil UfiO a rTJCiVl V Am A V CiX a All L/IIb IC 1 

NLVEKTVA/ GVETVDSNFERGPTVGKM\ 
IVCYREIFHEKKSPSMQQL\YF+KFPQP 
LQPSAATTLVS*QPS\SKQDPLSARRLQ 
FT AGS D 


9995 


23896 


A 


10069 


263 


435 

r 


HFSFLSFFF * KTBSHSVT/RVEGRGGIS 
VDCKLCRPGSRDSFASASRVPGTKGISQ 
GRG 


9996 


23897 


A 


10070 


116 


474 


RGAGAWGVSGRSSPPLSALPIQPPPHLS 
TPPPLETP/PPVLH* PLKDLGGWGALLA 
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SEQID 
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Predict- 
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ft\ iTMn fk n 
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peptide 

sequence 


Amino acid sequence (A=Alanlne 
C=Cysteine, D=Aspartic Acid, 

IT— l»i+r» m 5 r» A <*ir1 onvlalanino 
Cr'UIUlaiuiv ^^CIU, X*~x IiclJj'lnlalllllv, 

G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L^Leucine, M=Metbtonine, 

N=A tnarfloiriA P— P»-«» lino 

Q=Glutamine, R=Arginine, S=Seritie, 
T=Threonine, V=Valine, 

WsTrvn^Anhon V^TvrMinA 
tT" JLijpiV|f llan, f— ' M. jiwSlDtJ, 

X=Un known, *=Stop co don, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PKTPVDPPDGPFLDPPLFGWPPPP\VFL 
PGERPPPL 


9997 


23898 


A 


10071 


159 


377 


KSHMPLTLNQKLEMI KLSBE/GLMSTAE 
XGQKL/GLLLPNSQWNAKEKFLKEMKS 


9998 


23899 


A 


10072 


180 


473 


AIEIQLLSRPLCLWPSGRFPDAQVANIR 
VRVAV * GNPLR * VGPAEPGEQRPSPWGF 
PSIS/WDISP?WSSSASPPGLSADCKFT 
CHYRCRALVCLDCCG 


9999 


23900 


A 


10073 


98 

• 


440 


GQLNKliSGPYPENVGYTLPVLINPLAQP 
VI YS T I F AGTIi I TAuSSn* F FT» VGLE I 
NMLAF I PVLTKK\ INPRSTEAAI KYFLT 
QATASIILLIAILFNNILSGQKKKKKGR 

PF * 


10000 


23901 


A 


10074 


358 


404 


PCLYTYHLHNSNST*SLRLVSIMXEIVT 
SNLHRTVPHSYTPPTQLSINLVMAIHLS 
AG AW I IS 1 PSMIKNAbVHFJjPQUSPTPIi 
IPILVI IETISLLIQ\PRPLAVRLTGNI 
TAGHLLMKL I GS ATI^iAI ST INLASTL 1 1 
FTILItiLT 


10001 


23902 


A 


10076 


3 


419 


KWRKKMWCIjQKMEYYSS * KGKKJjLjSHVT 
I WMDPEE VMLS / E I S /SVTEGQI LFNST 
YV 


10002 


23903 


A 


10077 


335 


414 


DKSPF/D*RFFLPPNSNGFLVKMPPSVK 

RLPFFFIFPFSFFFFETESRSVT\RLBC 
optpt lMinwr tjt t>/-» cone 


10003 

* 


23904 


A 


10078 


279 


1 


KDTQLPFIQFFFETGSCSVPEAGVQWCD 
HSSLQPPPP\GSSGSSPLSL*VAGTIGM 
LIFKIFSRNPLNFFFFLRWSLALSPDWS 
AVAKoWri 


10004 


23905 


A 


10079 


397 


3 


KSSS L FQNHL WANS ST YSHL I PTLGY I L 
NHGNQFDPDTL/ RDKM*NFFCTI VCPHY 

tjt T3CT.T?r\W /\7CTyiGT.KrVTVrTT.TaT»nT T?C*V 
iSLir'oiJlJjy H / Voi-iOoi-iW XLrl J.JjirijJJJ_ir L.JV 

RQGKWSE I PYAQDFMTLYQNLTICQT/ P 
QNPPPPPKESSKVELDIIDDPPH 


10005 


23906 


A 


10080 


281 


1 


IFLGEWGPFFPPQKKSFFPKIPQWVFFT 
PPYRKKI FFFLPR*NWA/HPKI FFKRPP 

DI7TTI7PT?PT?l?Rl?T7r^Ti , TrC , Mr r TrTT.T.OOiri?tTT 

rt r ccc rr ecrv\jc ruLlNur JLJblirr'ftXtfll. 
TQAPAWGFAG 


10006 


23907 


A 


10081 


207 


35 


QENRRC*RGRGES/GTLIHGWRECQTVQ 
FH 


10007 


23908 


A 

• 


10082 


2 


408 


IAPLHSSLGDRARIiRLKKKKKKGKIRPL 
JartJbr r ]SJ>*\2vtcor JVrJNorlor Ivje/VurLciir \lr 
PPHIPPFFFWKVQPPLFPFRGTIiEFPGP 
IGTCKLQTIjGSPG*WDPPCPLPHQSPGR 
GGtlPGRKGEDPPSFRGLPPPHF 


1fVW8 
lUUUo 




** 


lUUoJ 




7 
* 


ECKORPOIiEMVS FRRRSHEPFFFETGPR 
SVAQAGMQWH/DLSSLKPPAPG\SGDLP 
TSAS*VAGTI 


10009 


23910 


A 


10084 


219 


1 


RLKIKLLKIIITTTIDSGICSFKSCKL* 
H/ LKI QNMWLDXLAHAYNPS TSGGRGRQ 
I T* GQE FKTSLGNMVKP 


10010 


23911 


A 

* 


10085 


364 


413 


KKKGGEKGPLLKKER* KTNGQFLVHTNF 
/RLPGLKYFF/CPPPPSKWGLRAPPPKP 
GDFFFFFFFFLVFLVETGFHHVGQAGLE 
LLTS * P I PLGFPKCWD YRRE 
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SEQID 
NO: of 
nucleotide 
sequence 

"i 

- 


SEQID 
NO: of 
peptide 
sequence 


M , 
eth 
od 

■ 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
0=Glycine, H^Histidine, I-Isoleudne, 
KNLysine, L=Leucine, M=Methk>nine, 
N=Asparagi ne, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y°Tyrosine, 
X=Unknown, *=Stop codon, /"possible 
nucleotide deletion, \=possibte 
nucleotide insertion 


1001 1 


23912 


A 


10086 


200 


3 


INFILLLFI FVLLI FETGPCS IAQATVQ 
WRDHGTLQPQPLGLKQAILP/ASAP*VP 
RITDVHHHARLN 


10012 


23913 


A 


10087 


154 


329 


FSLSDFKTYYKAAGHRGSCL*YQHFGKP 
\RRTDPLRPAVQDQPDQRGBTPSLLKKK 
KKK 


10013 


23914 


A 


10088 


157 


390 


NSSSFFPPSCGAQ/KSEVKKSSGPHSSS 
SSG*RGK/ CLPLLVSGGSWHFLAYGSLT 
PVSACVFT*PFSMSLCVYIQISVS 


10014 


23915 

c 


A 


10089 


305 


1 


KKKRAQSLPGQKGGRGFSQP VTAGDRS F 
RPK IR KEALGLGQ LGLTDLCRT FP PTAT 
G*TFFSSAPFPG\YPPV*PTKKKNTNLN 
*FTRIKIISRIFSGHEK 


10015 


23916 


A 


10090 


329 


2 


RGFFQPIiGFWDFGP I SNIiGGPPKNLLLK 
AEIMGYNFCVPPFPPGGLPTTHFFFPPR 
NPPF/CFPFTPSLGFFSPNPPGFFFPFF 
FFLSRVSLCHLGWS AVA* SWLSAAS 


10016 


23917 


A . 


10091 


84 


408 

- 


EKKGENKLFWGFIiLLGTLFIGGYRFKI I 
FCLVGYFIFIYLFGEME/FSLLIjPGL*C 
NGTCSVHHNLPLPGASGSPASTSGEGGI 
TGMRRHPRLY* I ylgrws FALVARAVMQ 
WHVLGSPQPSPAWCKRFSCLNLRRGWYY 
RHAPAPPSLLNIQKIILGFWLAPVY 


10017 


23918 , 


A 


10092 


184 


359 


FQINS S VTRRS FVLAAQAGVQWRDLGS P 
LLPPP \GSSDSPASAS * VAGITGLHVGL 
SHK 


10018 


23919 


A 


10093 


298 


386 


KLCEK* * ITPIRMAI I * KKREGENS \KC 
W *GCGEIGTLVYCL* EYKMVQPLWKIVW 
QFLKILGIEL 


10019 

- 


23920 

■ 


A 


10094 


310 


1 


I CMNEM IL YKQDFGRTI I IRKIVIYFW I 
FN*FGH*IRIY*NLLRYKTCMIj*RG**I 
NLPQRS FKNLRLKI * * W/WPGTVAHAYN 
PNTSGGRGGRIT * DRE FETS 


10020 


23921 


A 


10095 


40 


407 


EHTHTTACE PGG I * IRPVD*MVLISWL* 
HGTITCKMVTLGETGQKPQGISLIIIFF 
FFG\TQSLFVPRNGVHCRGP ISAR* KLC 
LPGPPLS PT * AS * VAETTGACYCTWFEN 
NFFLRLPLHCY 


10021 


23922 


A 


10096 


189 


2 


P PFFFFSNLFKKS F YEQGLALS PRLKCR 
GIIIA/HCS/r**TPG/YK*SSCLSLP\S 
SWDDRCAPPRV 


10022 


23923 


A 


10097 


260 


401 


DHLRSGVQD* LGQHGETS S LL KVQKLDG 
RGGLQL I P \ RRLKHENHLN 


10023 


23924 


A 


10098 


187 

• 


464 


WYMTRCGSLGPSSAAHESHHPPPQQLMN 
P\PSPSSAAHESHHPPPQQLMNPTIPLL 
S S * I P PS PS SAHE S PGRVGRP EFTGGAH 
GYLHLSPP 


10024 


23925 


A 


10099 


397 


1 


FFGGAPCPPPRRKGKTPPPPRKIjFPPFP 
PKKPPLPKKNC*NKNPLIiGPKKKNKRPP 
PGRFFKKGFF*NPPKAPLTNFWEKKKKK 
\SPFFKGVFFPPGEPKKPPPFFFFFFFF 
FFFFPTRP PRVRPRVRPRV 


10025 


23926 


A 


10100 


253 


3 


NAINRNKMQ* TKTSEWKVYADHNATIKQ | 
KQKRKCQVL / WRG CGETETLVACWWEQQ j 
MEQPAPVEN/W*HFLENIKIQLPYDPAI 
HE 


10026 


23927 


A 


10101 


121 


404 


VLASWS IFFFFFFWKGGLIFSPRLEGG 
G/LILG*LKPPLSGLGPFSCLTLPEGWE 
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SEQID 
NO: of 

II uucvtiuc 

sequence 


SEQID 
NO: of 

npnfirlp 

sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 

VV4JJJ » 

09/515,1 
26 


Predicted 
beginning 
nucleotide 

MM mm wf m* \J mm mm mm* 

location 
corresnond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 
otide 
location 

• m* ^-a* m mr mm 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AJanine 
OOysteine, D=Aspartic Acid, 
F=Glutamic Acid. F=Phenvlalanine. 

M ■ m mm mmm 1 mm m mm mm\ ^ • KS % A A IIK*i j (lilHllflli^ 

G=Glycine, H=Histidine, I=Isoteucine, 
K^LYSine. L=Leucine. M=Methionine. 
N=Asparagine, P=Proline, j 
Q=Glutamine, R=Arginine, S^Serine, 
Threonine, V=Valine, 
W==Tryptophan, Y=Tyrosine, 
X=l)nknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














UCPPPPDPGFFFFF*GKTGFPRGGQBGL 
NFLTWEPPPLAL 


10027 

• 

* 


23928 

> 


A 


10102 

i 


468 

• 


1829 

• 


IRDTYTQNPDKAKGYFSCI FFE I EETTM 
SRKQNQKDSSGFI FDLQSNTVLAQGGAF 
ENMKEKINAVRAIVPNKSNNE I ILVLQH 
FDNCVDKTVQAFMEGSASBVLKEWTVTG 
KKKNKKKKNKPKPAAJBPSNGIPDSSKSV 
SIQBEQSAPSSBKGGMNGYHVNGAINDT. 

ESVDSLSEGLETLS IDARELEDPESAML . 

* 

niwrr T^T3'iv , CMTyTMr r \7Cri'irR'PVCT.TMUCTTi 
UniuJK i\7oITLLrtJJNvj V aUV C X JXoJjii*lrlo±Xl 

NSQQPRNAAKSLSRPTTETQFSNMGMED 
V/RPRHQ* KS * VP I FEKS VKDLQRCTVS 
LARYRVWKEEMDAS I KKMKQAF ABLET 

QKKAELLKKMTHVAVQMSEQQLVELRAD 

X L\SiE V O £• IvIV X UEtXJ XJ\71v V nT\ C X V Xa X XJXv 

KSIDSFGQVSHPKNSYSTRSRCSSVTSV 
SLSRPSDASAASSSTCASPPSHTSANKK 
NFAPGVY 


10U28 


Z39Z9 


a 

A 


tAtAJ 

IU103 




A 


FYFPP* IFPKIFFFFFFWFGTQSCPVT\ 
RLECSDRIPAHYNliriLQSSSHSRASVSH 
B 


10029 


23930 


TV 


1 A1 A/I 

10104 




A 1 Q 


WTPT.W*PTCP /r*IiV(^OW\7f2PPPPVTV"'T»Q^ 

GIGDQPEQHGEPHLY* KNAQSSQAWW 


10030 


23931 


A 


10105 


153 


439 


TTVT SLC I YYT I YY YFRLLSTHAYTHI Y 

TWTYV* TTLTT JfDPOAOPSEGP* JtKEP* * 
x m xv c\jmi x J. Ayr vAutwuvjf tuvor 

RRW/WVGTVAHTCNPSTLRCQGGWMT*G 
QEFETSLANMVKL 


10031 


23932 


A 


10106 


424 


157 


SSHLGRPRWVDHLSPGVPDQPWQHGETP 

QT./TiTTr.anoaPT.* ^OT.TRPM/PT»53<?t3GR 

GCSEP*SHHCIPVWETEQDPISKNKQTI 
FRMFKN 








1A1fV7 
lUlU/ 






NYKr P P T PNTjNTIiKAL YPKKIvI IjS pka* 
AIKEK1D*/IIKIKHFYASNIINRMKRQ 
LTEWKKI FANHVSDKG 


10033 


23934 


A 


10108 


300 


1 


TPKQRGKKGSPREKPFPVNLPGKTLF/Y 1 
EV* FMNPLK*OK!OCNFOGFFFFFLIOGT 
VYHQFTSK* ERGVPGTVAHTCNPSTLGG 1 
RGGWIT*DQEFETSLG 


10034 


23935 


A 


10109 


120 


421 


KKIKKKNKNHMI I \S IDAE+ AFDKIQHR 
FMIKTLNKLGPEGKHI / KIMKAI YGRGG 
ODGOLHTAKK/E PliPIiRENKI SGKP 


10035 


23936 


A 


10110 


102 


418 


TLVCVCVCVCVC VC VS LSVFLSLS LCLS 
QAGVQWCDHSSLQPRPSG\SGDSPTIAS 
VVGGITGVHCHIWPTFYFLFLIRLKSLM 
L**TSSEMALTKMGVHSHGLFN 


10036 


23937 


TV 


1 01 1 1 


382 


z 


VKTM2XTM D QVfQTPVQHMQT •TQN'nifT.PMT 

KLSEKGMLKAKIGLKLGLLC/QVSQVVN 
AKKKF*REI*NPTPVNIR/DMQKVLV/V 
KTEDQTSHNIPLNQILIQNKSLTLFNSI 
KAERS EEAA* EKCEDSRG 


10037 


23938 


A 


10112 


61 


427 


ENTI YQVQCS I KEE YLQRRTLKYASLFQ 
KHICGSLAFLHLL* PKLYHSRSNABSDW 
I IQLFS VQPNVKB I SKHKIGW * RGHACN 
/RQHFBRPRHKDHLRlJGvTU3HPCQHGET 
PSLLSLLKIHI 


10038 


23939 


A 


10113 


71 


399 


NVLGKEARSVGWEAVRGQ ISRVS IQMKW 
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SEQID 
NO: of 
nucleotide 
( sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M, 
etb 
od 

• 
■ 


SEQID 
NO: in 
USSN 
09/515,1 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G-Glycine, HMHUstidine, l=Isoleueine, 
K=Lysine, L=Leu cine, M=Methion ine, * 
N=Asparagioe, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /-possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














KGLIKGEKLKMEKSQACICIYFFFFETE 
TNFIAQAGGQGHDLGFMEPPSPG\*SDS 
PA* APHBQNGL/TGMHHNPRVI PV 


10039 


23940 


A 


10114 

i 


362 


1 


SKKKSWGSHSGSKCHTSFLSYPTKEQKS 
YLSLFYKCNLQSLTFKELLDHFRPVCLP 
TIY*NKNWP/RDLPIiKPSFKTLML*NQT 
L/ WPG WAHACNPS I LGGQGG WITRGQE 
LKTIKANTVK 


10040 


23941 

» 


A 


10115 


314 


3 


WRERRCGAHLMRCASNMAHDKPSRMLNL 
* KP KMAT/RNTDHDNFW* GHGBTE I LNH 
YWWDYKIM*PLWKTI*QFPKMPNIKLPY 
VSYIQLLGIYPRNLKINIHTKT 


10041 


2394? 


A 


10116 


44 


394 


PG FQRVSQDGLDLLTS * SAHLGLPKCWD 
YRHEPLRPAGIYSYPAVLFVLSTYRASN 
IT*MLMTPKPTSG\YSFLLSFRFSSFSY 
LRDISAYMSLSCLKLNVFKSKPIIFL3P 
LWLLV 


10042 


23943 

• 


A 


10117 


398 


3 


MESCCFMGIEFVLKEEVLEIGCRTM* I Y 
IiHY* SLHLRMVKMVTSLLCVFYHN*TI F 
KKSS VFLYAGNKQL* I * ILKNYI YVMI P 
FTITSRNMKRNNLTKDGKDMSTETYKTQ 
LKE ITD\ *NKWRDTTFMHQN 


10043 


23944 


A 


10118 

• 


104 

• 


386 


KVPVWATCGVKMHRLAGHGGWRLWTRLP 
WRLRRDNRLSSGSRGA\AAEILPLCHCT 
PTWVTEHDS I T YKY* KKGAP CWRASS * C 
TLSGHALFFFA 


10044 


23945 


A 


10119 


258 


401 


YNYPFLTCVMKEKIGAGMAAHACNPSTP 
GGRGG* \TRGQHLETSPTNMV 


10045 


23946 


A 


10120 


310 


384 


GVFPFLTFLYGGIKGGPPPPRFFLF* FA 
FLFFFF/ETGSHSVAQANCSGSVSAHRS 
FkLPGPGDPPTSAS * VTGTTG I CHHS 


10046 


23947 


A 


10121 


245 


479 


TSSLYRKIQKRFLYMIiALBVLKICWTYF 
MNTERFPLKLFGQFS/FRIISI*FFPFL 
FPKI YHYFLRDRV* LCHPGWS AV 


10047 


23948 


A 


10122 


268 


469 


NSVF WLGWAHA/ W * VNPSTLGGRGGW I 
TRGQELETTPANILPP 


10048 


23949 


A 


10123 


221 


423 


GQAGAVAHACNPNALGR PRREDSLRPGV 
GDQPGQHSKTPSL/LKSDNF* KMGWLGP 
WAWWLTPIIPALW 


10049 


23950 


A . 


10124 


73 


183 


AGGGWALTNTVINPT\PPSTPFTPCLSY 
SCTISTASAFRFPG\STSTAVLTHVRIL 
KSTPDAPHPPLKI PRARRNIQRDHTLSS 
NLFYYNHQLNVTEELTYYDLT* INPTAT 
IHAIHTMPLLFLHNLNCISI 


10050 

• 


23951 




10125 


342 

• 


2 

• 


PNLGPCP P PRGTKGGP FFPSFHPSI \IG 
PPNFFSF*RP*FF*MLKSLGVP/SPTGP 
I PKFPFFPGFKNYFSPPL\ PKGPLLKPP . 
RALFFFFFFNRHRVLL\CPGWSPTAGLK 
RSSC 


1 005 1 


23952 


A 


.10126 


85 • 


442 


KiCVINIKCW*RVWRTGSLIHCSWG*KMM 
PHWRAVWQS FR/ S LS MYLS YE PALPLPG 
SYPRALRTTCVCTHSRCFSLLELPQA/W 
NH 


10052 


23953 


A 


10127 


180 


484 

■r 


KSFFFFSQSAQP I KVI PAP*NFHFFGVK 
RSSCLSFLRKWGPRWVPPPPFFFFFFFL 
I FVEMRSHYAAQAGLKUASS \VPPPSG 
IPRTLD*SYPD 


10053 


23954 


A 


10128 


376 


1 


WS PGAQLLF IS CLYYNLWSLHTGRTPAK 
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SEQ n> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

* 


SEQ ID 
NO: In 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Ah*!h A m^ ■% ml mf& 

amino acid 
residue of 
peptide 
sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 

correspon 

j • Ad- 
ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
Ur-vviuiamic ACidf r— rnenyiaianine, 
G-Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
^^uriuiamine, iv— Argwine, o == oerine, 
•^Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /=possible i 
nucleotide deletion, \=p ossicle 
nucleotide insertion 




• 










PF *GWTLQKTTPLSCPAP* LPPPLPSPL 
QQIDLKEKLVFTAPACSL/ PHSPNSSPW 
PQRRWPSVS/P/PVAHPAAPRLPPPPVC 
ARPLWLPEAPLLGIPPH 


10054 


f\ fS **\ mm 

23955 


A 


10129 


140 


454 


CL I RS S INS VHNN I YIHSTPDRHIX3CFQ 
FLWTKNAAVSRLAYVP* * I * I PTYDHI 
HTHTPAGYI PRHGIAGSQLTNIFNFSS * 
LSSNSF\QVGCTSYFLFPSHI 


10055 


23956 


A 


10130 


340 


473 


AGARHSVLRPRPPLAGSSCPSS/ OGGHC 
VALGVSCPGGGCRAACVPGCCCLLCGLP 
SDVCVC/SVCWGGSMGSRLGGWAAPGF* 
GNSREGPGNCSSGRKAGLAGGGLGPAWA 
EQAPDILCCGPGPHWLGAPAQVPLAPPL 
SPPPCLYGCLYSDRGDL 


10056 


23957 


A 


10131 


280 

■ 


2 


DSLALLPRLEYNGVNVAHCKLKSSWIRR 
SSHLSLPKHLDYRSMSH\LPGIiEIF*YS 
YCK*CLLTSPYFEFLMNSLEDISSCFIP 
IRVMPKCRMK 


10057 


23958 


A 


10132 


244 


3 


1 1 STDL I S HSWTLYLI S SLVYVFLKQHS 
LAVIiPRliDCSGMI IAHC/KLQTPGLKR/ 
FL/CLSLPST*DYRij(TPPNRANFYYYYF 


10058 


23959 ■ 


A 


10133 


272 


3 


VGKPRVFPYRI AFNFGRVTS I LGPH I KK 
YHFQCRGGKHCFF* KAPPFFFF\FNETG 
FCSVPRLECSGTI IALCSLNLQGSCNSP 
TSVSQV 


10059 

* 

• 


23960 


A 


10134 


217 

• 


3 


FPRFRPLVS PALTE LGQF/ HFIiTRGL * K 
KKPFPFLFFFFSETESCSIA\RLWGSGM 
ISAHCNLCLPGSSRTRG 


10060 


23961 

■ 


A 


10135 


229 


3 


KNDYLKCWQGCGRTETQMLGCWDIK/LI 
QP/IiWKI IWQFLLKLNWHIjLCNPTLPLL 
GI YP * EMNVYVHTQTCICTDAW 


10061 


23962 


A 


10136 


84 


352 


RREFKTGLATWQNLVSTKKYKTTVRYHL 
TSTRMDI I KGKR* QVCW/RKC/ GDKGTL 
I^C/WMRMLVQPIWKIV*VWKFCRK*KL 
NLFFPPPPQ 


10062 


23963 


A 


10137 


280 


2 


KCVWPEERILREKSQHLGKIKQEGRLN 
AWLIQPGQHGETLSLQKI / QLAGHGGA 
FijRSQLinRRMRREDIlIjSJTOVRSRSEP* * 
LSLHWVTEQ 


10063 


23964 


A 


10138 


2 


386 


HIQEKQLPNKDSTLNPYFIiLISIIiKRKD 
EKIQMPSKIlMKRCSTSbVlRKMQIKIRS 
HFPH*VGKILRSANIKRW*MCRKRG\TL 
ITCW*TYK*TYKLVKTF*RI ILQHPLLK 
THQSYYPAIPHLKETQA 


10064 


23965 


A 


10139 


229 


1 


TRSYHESLTILHTNHHLNCSPFPQAITP 
* SRFPYSNRA\wI<GS VAHTYNPSTIasGR 
GRRNA*GQEFKTSIiGNLPK 


10065 


23966 


A 


10140 


261 


429 


LSHLFHVALI * LGLVW/ SFVF VFFETGS 
F 


10066 


23967 


A 


10141 


377 


34 


WVFGSSGPPPPGFKKFFSPPPPKNKNPR 
GPPPPPIFFFFFCKKGVSPLFLPPGGGP 
/HPFLSPPPPGVKPENFFDPGGGGFP*T 
KIFSRPPPPGGKQNFFFLKKKKKKKEHQ 
IKR 


10067 


23968 


A 


10142 


208 


424 


F FSLFMQT FH I HI RF Y FFLAGGGGAGVD 
RVS/LLSPRLECRDTISVHCYIHLPGSK 
QFSCLSLLSN*NYRHAP 
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WO 01/64835 
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SEQ1D 
NO: of 
nucleotide , 
sequence 

'i 


SEQ1D 
NO: of 
peptide 
sequence 


M. 
etb 
od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


j Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alantne 
C=Cysteine, D=Aspartic Acid, 
E«=Glutamic Acid, ^Phenylalanine, 
G«GIycine, H=Histidine, Msoleucine, 
K=Lysine, LHLeucine, M=Methlonlne, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X-Un known, *=Stop codon, /"possible 
nucleotide deletion, ^possible 
nucleotide insertion 


10068 


23969 


A 


10143 


1 


297 


PTRSPQVSNLNHKS FISFYLFIDLLSVC 
LSVSVYPETSLTLLPRLBCSGAITTHLS 
LLSS*GHRHAPPOQV\FFVETRFHNVSQ 
SQIELLSSSHLPTPA ' 


10069 


23970 


A 


10144 


185 


488 


SGFP RGTEL I GAVCVCVCVCACVCVC I K 
LGYI*NHKRIYI*LLYIYSTYYIFNLSL 
*LSVY\S YVNRNSM*DRSQS 1*1 FILPK 
LKTCLENTHTLKNTHEGG 


10070 


23971 


A 


10145 


283 


3 


LGWPGRKADGSGKLTIDYGRLDKLVSP\ 
I * SAI S DMAS TTKAVLQAQRDQ YS VLDL 
AHAFFS IR\NQSQFSPIRDSPQYAFTLP 
SBEHLNSLDGQ 


10071 


2397? 


A 


10146 


146 


380 


CFLWDNFFFFI FTEKTGSCS IAQAGAGT 
Q/WVGIISYCSLKLMGSRDPPTSLSCVA 
GTTGMPHHAQVI YFFI PL * RQGL 


10072 


23973 


A 


10147 


275 


3 


IKGMLKAKTG* KLGLLCQ/TSQVENAKE 
EFLKEXKSATPVNT/RMIRKRNCLIAEM . 
EKV* VTLIDQTGHNI PLSQSLPQSKCLT 
LFNSKKAER 


10073 


23974 

* 


A 


10148 


281 


2 


KI FLPQP PQEMVFQGFAPPGG* I FLVF* 
♦KQPFPPLGRFVFK/LPGGI PFFFFFFF 
FFFFFETESHAIT\RLECSGTISAHCHL 
RLSGSSDPTRP 


10074 


23975 


A 


10149 


413 


139 


S LR VWP I FGLLQANKKKFFW ILFIiS / IR 
PLFFFSRERGGTPLFPPFFLRNPGKKSF 
PPG* PPPPLKIQFPGKYFFTSRPAPFFP 
LFCQ PPGKNLNYWGLGPGFPKI FP PPPL 
FFFFF* DRVSLCRSGWSKVARS *LTWGK . 
AFFTWVS 


10075 


23976 


A 


10150 


100 


431 


GPRLTDHIiRSGVPNHPGQHGETS FLLI / 
HNYSGG*GRKIANRLNPKGGGCSEPKSC 
HCTPSWGKKRNFVSKKKKKKRE 


10076 


23977 


A 


10151 


134 


417 


GLAAPLVG/WGKRNPPP*PFG*GEKTGW 
GPHPGKIFLKKKLLFFFFF*KGGLSFPP 
GGRERAHFWLKKTPPSRKKEILPPPPPR 
RGGGGPPPPPFL 


10077 


23978 


A 


10152 


403 


2 


LAFPII*QEVPQCFCNLIiHSLRMYSSTGS 
KLFPLASPQPHSLLYHLTLARAPNILEE 
LQSPNTALLLKYL*EAALKKKSR*HITS 
SL* KVDLS FFLETKPRSI A\RLKCSGTI 
S AHCS LRLLGPSNS CATASQVAG 


10078 


23979 


A 


10153 


382 


297 


GLN* ESWSGEKGTL/LHCWWECKFVQPL 
WKMV*RL 


10079 


23980 


A 


10154 


209 


15 


S S PKF* FFFLSPKKI PLPFF* P I FFFTK 
TP/HFFF/SFFFFVCVSGSHSVPRLECS 
GAITAHCSSTS 


10080 


23981 


A 


10155 


153 


422 

• 


RTSGKNGNRGQIILVTMVTNWLFSFFFF 
P LKTGPHFVI QAGGQGRAHGSLQA* IPG 
\ S GDL LAL VS LQAGTAG FHHHAQLS FFF 
KKTGLF 


10081 


23982 


A 


10156 


257 


3 


RRLQNSKDCCLFFPLEAPRGATARCQPB 
LSCMRCLSA\LLGGVSQSGYMGVRDPHK 
EAV* PLAELKR* AGRSS APFKAVRQGCL 
S 


10082 


23983 


A 


10157 


37 


386 


DATRFRINSVGTAEGGLDRIEPNDLQ* Y 
INLVGTAVRRPDRIESNFESSTVGKKLS 
NNIVCYRSFFCERESQLTQQTSMLYYFK 
KLPQLPQPSAITTLH/SRQDPPSARRLQ 



1161 



WO 01/64835 PCT/US01/04927 

» 



SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 

A * W 

peptide 
sequence 

• 

- 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PnenyIalanine, 
G=Glycine, H=Histidinc, I=Isoleucine, 
K=Lysine, L«Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argmine, 5-Senne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *= s Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LGQ 


100S3 


23984 


A 


10158 


407 


272 


HKGGKRERKRKKKEKKEGGREKGQERRK 
ERE/RERKKGRKREMKKVNRYSKCTQ* * 
LLSN 


10084 


23985 


A 


10159 


248 


478 


VKVVHRLLLDPSVEDPWKKSLKGPLQP* 
/RL* I FFF*NSGLLCHPSWRAVAQSRFT 
GVSTSQAQAILPSFSLPSSWDR 


10085 


23986 


A 


10160 

■ 


458 


129 

» 


GFSVLNRLEDEDLRQENLLMPAV*DQPG 
QHSKTPISTKNNMNPAGHGGVYP* SQLL 
RTVRPEDHLY/ SNKPCSYLCAPAWATET 
LS LKNAFS CCSNHYGL S FQTS FHR I LMF 


10086 


23987 


A 


10161 


82 


291 


YSKTKSLFFAGHGGTC I YSQLFRRQVQV 
RQBDHANPGV*GCS /DTMIMPMNSHCTP 
AWAA* * DPVS KKKKK 


10087 


23988 


A 


10162. 


6 


409 


IjR INI LPE P I SLS PLTAPAVFMP WTNYS 
SCDTFSLASMTLLPLHFLSTSLALPWSL 
FIiAPSLFQISKHQLYLSPVLLTLSSCYF 
LNPCFRNHIIiMNLGRPRQANYLRSGV^D 
QPGQRGETPSL/LKNTKITWPWW 


10088 


23989 


A 


10163 


249 


415 


AP PECSLPLLVLLLLLLLLLFLRWEDHL 
GPGG*GCSEPKSYHCXSAWVTR\GDPVS 


10089 


23990 

■ 


A 

■ 


10164 

• 


1 

• 


405 


RSEWRLRQPERQSETVKKKKKGFFFFSR 
GGGGGKI FGLMEPS PSGLGQFFLFTPLE 
NGGYRPPPLGGPFFFFFKKGGFPLCGPG 
GFEFPALGTPPLLFPKGLNFRGGPTPPA 
RF FFFWGP VFS VFS PKN*NPPVFRGKNG 
KNRPPKKKKPGRG\WGPPRKFNPFGNKR 
GGVPRAGNS KPPGPQRGNP PFIiKKKKKG 
PPKGGGR* PPFSRGVNKKNCPKPEGEGS 
INPKIFPPPPPREKKKKPFFFFLTVSLC 
RSGWRNLHSL 




23991 


A 


10165 


175 


404 


AI KMVW I QAVPS QCLHG VGF YLPRS VCV 
KTP * NKRHRQGWAHGCNPS TLGGRGGW 
I \RGQEFETSLTNMVKP 


10091 


23992 

- 


A 


10166 


317 


2 


PFKNSPLLLNEMTNIHFLEPTCNYVNVS 
PFLK*HKFLFF*EMESCSVT\RLECSVT 
I IAHCCLKLXjGTSNP ptsas WAPGMI IG 
MRHCPAPKVCSHASAHASAHAS 


10092 


23993 


A 


10167 


325 


2 


KMGEKQGHF I KEE I R KANKHIERCS RLL 
AIEEM* IKATVTI/RKHQTKC*QG*RES 
GSLVHG* + KHKI VHPLWKILSVS YKTKH 
S 1 1 1 IGYSSVLLGTVLVHSGSDAW 


10093 


23994 


A 


10168 


245 


3 


GEPFLFCFPIKKPKVFFCPYFNQAVFFL 
ETHRGFFEFFFLF* KKKKBS/IiGMVAHT 
YNPSTLGSRGGQI A* AQEFKTI LGNMS 


10094 


23995 


A 


10169 


164 


1 


MNKMSQYTFPYKN* QWGG\SVAHPYNPS 
TLGGQAMQIT*AKEFETSLSNMAKPC 


10095 


23996 


A 


10170 


214 


419 

* • 


LS*PPPPFPGFKKFNRVLRFYGKSKPGP ; 
/ rMrVALFQJtGGFr XQNXlrrlr V JSJjiSt'rP 
PTILRVSFFFL*DGWLCHPGWNAVA*S 
QLTASSTSRVH 


10096 


23997 


A 


10171 


168 


1 


WTWWLFPGFS ITNNYCSHFSSLHLAIW/ 
LGTVAHACNPS TLGG * GARGQE FETSLT 


10097 


23998 


A 


10172 


139 


3 

i 


pah/ces*ppglkrssyvslpsg*dyr/ 
htpph\qlifvfwemgfhqpa/ sa/fp 
kc*dyrs\dhhkwpt*hlk/sfhfikk* 
n*lkrylacsrlwylfclircmv**nld 
dls ilylhlnkpqflfi ylffemes 



1162 



WO 01/64835 PCT/US01/04927 



! SEQID 
NO: of 

nucleotide 

sequence 

'i 


SEQ ID 
NO: of 

sequence 


M 

eth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 

- ***** 


Predicted 
beginning 

n nclentirie 

location 
correspond 
ing to first 

amino ncid 

residue of 

peptide 

sequence 


Predict- 
ed end 

llUVKr 

otide 

location 

correspon 

dinp to 

will EL W 

last amino 
acid . 
residue of 

sequence 


Amino acid sequence (A»Alanine 
OCysteine, D=Aspartic Acid, 
E^Glutamic Arid F"=Phfcnvl8lanln£_ 

G=Glycine, H=Histidine, I=1sofcudne, 
K=Lysine, L= Leucine, M=Metnionine, 
. N=Asparagine, P=Proline, 

0 = filutfl mini' RoA t*oininp Sr^iSfj'ifi^ 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

tiuv-icuuut: utficuuijj \-~g9vo3iuic 

nucleotide insertion 


10098 


23999 

• 


A 


10173 


253 


2 


FPLKICOTLPPQIFSTQRGLLSLPFTIF 
LLSLILLRPFFFFF*TESHSVPBCSGMI 
SAHCSLRLPGSSDSCASAS\QYAOKTCA 
R 


i nnoo 


t 


7V 




i no 


j 


KRVFFLAPSEKFVFFKNWPP*LSFLIFG 
EKNFFFFFFFFS EME S CS VA\RLE CNGM 


10100 


24001 


A 


10175 


117 


381 

* * 


E ILRY I VS IRIRRTNLKFQQEELRVNI K 
NN^SSLGNMVRPRL/ECSGVTTAPTAVS 
APGFK* SSHPSLPSS WDHRRAPLHLGDF 
FL I E TR SHHVAQAGQ VI FNVYSKFLLLK 

WX C DCflDVOWH T CfM"il?T .A Tf2DTltfiyR7DT .. 
r is. X o roUr z xuNN X ts W JJ r -LifvJL l»Jr l£in V f Lt 

QPYI VLCACRTKNNKI IDLDSEQSSRT 


10101 


24002 


A 


10176 


194 


412 


TLKNLItLSLHCLiS FVPTDLYQPERLS KP 

nnrij VTTWT7T ,TT. /ttrryiw^xmr^JDQ'PT.rs 

GRGGW I T* AHE FDINLTK 


10102 


24003 

* 


A 


10177 


100 


339 


VPVKSLLLCVCVCVIiLL*RISDHLPTLy 
ANLG * CFFS YLVI SYS \NSTY C ILQRHL 
LAIQKQP/CSFLRQyHLVGWLHKKKHTH 

NTTEFLLK 


10103 


24004 


A 


10178 


229 


2 


KEKY I KSKLHFEARHNGSHL * SQHFGRP 
RLMDCL\VQDQPGKOGENPYLQKNTKIS 
QAWWLVPWPVYSHRASKTLS 


JU104 


24UU5 




i c\t its 
10175/ 


ICQ 

368 


2 


OPOOTJCTODArDDr>T3T t~*T PCB VT\TT JWf 

obo o 1 o I o vaKir UJb 1 liuliJr onKl V JjUW-b 

ATIPLPLCFLLKI FHPLTLPPLLLLGGE 

FYV* AS VPEKI \FFFFFF*DTVSLCHPG 
WTTVAQSRLTA 


10105 

• 


24006 


A . 


10180 

- 


398 


1 


QNIML YTINTYNFI CQLKKKTI QWGENP 

TNSKWISELNL\KSLKVNRGGNLHDFQF 
GNGFLGDTKSKTK/AKISKLEFIKIKNF 


10106 


24007 


A 


10181 


252 


419 


GARFKGSNFTSAGGQGI I FFMGPPKLI S 


10107 


24008 


A 


10182 


101 


380 


FCWIYNSWLIGYLFFPFRTLNILAMCX3G 
SCL* SQYFGRQRQADHLSPGVRDQPGQY 
GETPS/LTKISWAWWHVPWPATQEAEV 
GGSLEPREVK 


lUlUo 






lUloj 






KLGGHGG / IR+ EDHLSLGGQGCKEPRL Y 
HCTPAWVTEQYLVSQKKKKKKKK 


IV Ivy 






1 A1RA 
IV Lot 


i Aft 




PASASQAGCCGTCP*SQIi 


10110 


24011 


A 


10185 


364 


3 


S VCPPRLGGVSQSGDMGVRDPFEEAV* P 
LAELERCAGRS TAFFRAI R* AHLSLLK\ 
FTRNHPFPHVPCPRVMSR* FGLLPFFOR 
CPAQRGEIWRRSLAAVALRSCX3GRLQVR 
TCWQLCLHR 


10111 


24012 


A 


10186 


2 


249 


S VRLNKNKENTQ I RNKKGDT INLTE I KR 
I IKE YSEQLFANKLH/SLDEMNKFIjBR* 
NPLRPGAEAQRLVS * RFWRATAI DHMRP 


10112 


24013 


A 


10187 


270 


2 


P WAQRGCVFKTNQKI FFGFFE I PHLD* P . 
CVFQRPPS\F*VRLIFIGGGFKPFFFFF 
FEMPCSCSVARLECSTISAHCNIiRIiPGSS 
DSPASA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid,-F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y«Tyrosine, 
X«Unknown, *=Stopcodon, /"possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10113 

» 


24014 


A 


10188 


428 


28 


PIFDKG/E*GN*WGKKSIFNKWCWEKCI 
C/KKLRDSSLTSY/TRINLKWIKDLNVR 
TK/STKLLEENIDIV/IYDLGLGIGFLD 
MTL KARATK / AKI E KLDK I KIGNFYAS K 














DPKKMKR*FIEFIEQGKIFTNHVSDDTS 
I 


10114 


24015 


A 


10189 


85 


243 


DRVS/ LLSPRLECSGMI SAHCSFDLPG* 
SDPFTSVPKSSWNYRCPPARKLSFT 


10115 

i 


24016 


A 


10190 

i 


96 


440 


U3SNHINTLNVNRLSSSMKRPRLSDBI F 
\KKHGPALCCL * EM YYKDKGTCTLKI KG 
WKNI YHANAKNNKT/ GMVLLSGKIRLE S 
N/TVIRVKEG* LVI / INGRLI /QEDIiIL 
LFILNVYAP 


10116 


24017 


' A 


10191 


137 


423 


AKDLDRQFSKEDIHMTNRYMKTC/SGRQ 
LIRETQIKTTMRYPLTPLDLRKSKDKKS 
* * * YEQKGTPAHC* *ECKLVQPLQKI I * 
TFPKLLKINL 


10117 


24018 


A 


10192 


221 


72 


AKVK* VWF FCFLFFF* DGVS / HWYAQ 
ECSGAISAHCNLCLPGSSDSVRVG 


10118 


24019 


A 


10193 


460 


121 

■ 


SDKTKYW*RCRAI KTLTHF * LKSKLVPL 
L*KTI *CYLAKI»KIHIVFDSAI PL*DTY 
FR/ E * FCI CAPKTYSNMVI EALFVPANQ 
TSKKKNWK*LSCPQIVDNFLNIHLMKYH 
LE 


10119 


24020 


A 


10194 


393 


2 


ATMPS YQRLLLTAVHELES PE * MDQFLD 
TYNLPRFNQEE I ENLNS P I MSNE IQS VI 
KCLPFGPNPGPNGFIIEFYHTYKEE/LP 
ILIi*LFQKIEG\ ILPNSFHEVSITLIRK 
TDKDAT * ETHKP I S LMNTDA 


10120 


24021 


A 


10195 


3 


353 


EAAPMPVKFFQIiLEPSGPPGLSMSCLGD 
CNGYACFS VSLHQSANQSIDQ/S I YQSA 
IHLLIY/VSIIY*SI\YLLIi*SI 1 I*PMY 
VPIYPLSCLSIYLSFMPLSCLSFNNLSI 
TRHGSMCL 


10121 


24022 


A 


10196 


118 


395 


dfnagllpsyqvlslyfcf*ngv\chct 
p/rlgdrprlr/ppkkkkkktpppplgg 

SPPGGEPYTPRPGVGPPPPPFEHPPRBN 
PGCFHPPPKAWP 


10122 


24023 


A 


10197 


437 


160 


FGYPRVFPLPALLKRG PGILF WG PNKKN 
YSS PARGSKICFFKRGPPFFFFFYFFF* 
QTESCFVAPVGVQWCDHGSLQPQPPG\S 
SGPPPSTS 


10123 


24024 


A 


10198 


259 


3 


RQGNHVTHVYS S RLQLILS PPASLAVF I 
QKIYVaLCTYIHIVNLYA*MCICVCICY 
TCIL\ ICI YGLVAVAHTYNPTTLGAKLG 
GS 


10124 


24025 


A 


10199 


1 


416 


I ISAHCNLHFPSSWDYSRTPPYPANFCI 
FSRDRVS PCWSGV* HHLEQHS KMLSKK* 
NKIDMHGGMCL*S *LLK/RAE*DVIiRQE 
NCLNPGGRGCGE PGSRHCTPAWATRAKT 
LSKASYASSSSSPKKKKKKSPS 


10125 


24026 


A 


10200 


3*9 


3 


TAPPFFFFCRDGLRTSLIIFPSRATPYS 
LVTSGSFLPAGAGGRGLSLEIRVLDLVQ 
DGCPCSQAWAAALGGPGWWAPFLCFFET 
ASCSVAQEGMQ*CNLGSLQPPSPG\SSD 
SPASWLWTSK 


10126 


24027 


A 


10201 


314 


2 


LiHHMRPEV* FTPSKLGVTGYMVSNISYP 
WRCRAHK* SQVLVS LRHKHLFPLEMESG 
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correspond 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
(^Glycine, H=Histidine, l^Isoleudne, 
K=Lysine, L=Leucine, M=Methwnine, 
N=Asparagine, P=Prollne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X^Un known, *«=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














sbvql\dtnrq:isancnfclpgssnspt 
sas * eagttdsrqpdawadaw 


10127 


24028 


A 


10202. 

i 


68 


413 


KIGILLKTH*PSIC/VSITNF*YNDTGE 
LKVKQ*KNIYYANINQK/KAGMVMFVLD 
ETDFRAKK/DYQGH* IMIBVSVHKE/E/ 
L VILNL YASNKRVTKYMK * KDTDIKGB I 
DKSTTIVGDL 


10128 


24029 

* 


A 


10203 


351 


1 


PKFPPLNPAWGTTIKI FFKKKNAHNWFS 
PRKEKALQKIHKKG/RLLAKEISFLGGF 
* TDYKTNPDSQS IRTLGEKKKKIRPGTV 
AYAYNPSTFGG*GRWIT*GQBLKTSLAN 
MVGRV 


10129 


24039 


A 


10204 


2 


392 


FLEGVLLLLPRLECNGAIPAHHNIjHLPS 
PRFKRLS CLSLP S S WDHRHAPTRPANFV 
FLVETGFHHVGQDGLDLLTL* S PCLGLP 
KCWDYRH\DHHAQPSF 


10130 


24031 


A 


10205 


264 


2 


PLSLSNIiFPI PLNS ILKI FFILK*L* IH 
GKLHKMYRKV/LV* ts PSPSQC*HLR* V 
*HN I QNKKLMGLGAVAHASNPSTLGGRG 
GWIT 


10131 


24032 


A 


10206 


281 


2 

i 


ISPPLFFLFSPPKKIFFSQKHIFLGPPL 
FYPPPLFFSPPP\PHFLWGFRPVPNPMP 
KIDPQTWAKNKDKGPNFPPPFFFILPPK 
KNIFFTKTYFFGPP P FLPP PP FFFSP PK 
KK/SYPFPLKKNSPNPPPS*K*AAISLP 
PPFFYFFFFFFFS FFLENTWKMADAW 


10132 

■ ■ 


24033 

• 


A 

■ 


10207 

■ 


340 

■ 


I 


STKKLVQKLGEHTQCWQGCEATGTLIHC 
RWECKLVQPLWK/SAWQILRQL*NFMHL 
PYNPTIQLQGSYSRE\QHTCPYKDLYVA 
ALFT I AKNC * QAKFLS TGE * INKL WC I H 
ITE 


10133 


24034 . 


A 


10208 


136 


3 


KNRGPPPRSGKFFPFIiPLFFFFFF*DRV 
\YPSHPGWSAMVPSQLT 


10134 


24035 


A 


10209 


327 


2 


INKKYYEQLYA*KFDNLDEIYQFPGRYN 
LPKLTQEKANNLNRPVSLRETKSTINNL 
PKRKVPDPDGFTGELYQTFKEE ITP\FS 
TEAKGILPNSFCEVTI IVRLFLLRQ 


10135 


24036 


A 


10210 


89 


485 


EIVPLLRFPLVIFIASPTTFYTYGFVSG 
EKSQ YKKRE KLCDKTLEQSRS LLGMHST 
CGIALHPKREGQMCTLCACFY* FFLKES 
HSAARAGVQWHNLSSLQLPP \PSSSNS 


10136 

< 


24037 


A 


10211 


361 


462 

• 


IQQIGSNTGNIPAMELALKRNSKNINRL 
AGWNGP * VFAC/ PKNVLFYLHP *LFG* I 
* IYQLKITF*NYEGMVP/FVFRFLFETR 
SLLSPRLECSGTITTHCSLELVGSINSS 
ISARAEFLQRGANL 


10137 

- 


24038 


A 


10212 


62 

. 


469 


SLLWKLRS SGASTSLRCLSA\LLGDVSQ 
SGYTGVRDPLEEAVCLLSELERGAGRTT 
ALFRAVRQGRLSLEKL\LLiPFVQICPAP 
GGGIYR/GQ*ALLSCGGLCPV*AS* */M 
CLPC/EA* TRLLKPPPGS 


10138 


24039 


A 


10213 


197 


1 


WTQPNEWCVTNKDAS FQQRCTKKI KT/ 

WPGMVAYTCNPSTLGGRDGWI T* GQQFK 
TSLANMVKPH 


10139 


24040 


A 


10214 


31 


213 


HASLDEG FHHVGQDGLNLLTS * STCIiGL 
P KCWD YRHE P / P APNWW KKFLS S KAFKR 
♦LGCY 


10140 


24041 


A 


10215 


442 


29 


PTINPTMACKCLSKRKSHTSFTLNQKLE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
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sequence 
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M 
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SEQID 
NO: in 

w rim 9 

USSN 

09/515,1 

26 


Predicted 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGlycine, HHHistidine, I-Isoleucine, 
K=0Lysine, I>Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GJutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valme, 
W=Tryptopban, y=Tyrosine, 
X-Un known, *«Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


• 


- 

« 








■ 


TIKLSEEGMLKAEAG*KLDHVH/QVSQI 
VNAKEKFLKE I KSATSLNI * MIRKQKRF 
I VNME /KKVLMFWTEDQTSHNI PLSQSL 
IRKRP+LFNSLKAERSEEAPEKLEASSG 
G 


10141 


24042 


A 


10216 


195 


533 


LVWFWLRVTYGVAAKMS SGTTI IGRLSW 
GWRLHF\KMAHL*GYWWEASVPLHMKLS 
SCHGTG\ S LRVKD PRERLRQKLQ CF YDL 
VLEVTYWHFY.CILFIRSKSLSPAHIQSK 
RI 


10142 


24043 


A 

■ 


10217 

i 


274 


494. 

• 


CVLKGLPFALPWFFFFF*DKVSPSPRLE 
CSYAI /SAHCSLNIiLGSSDPPASASQLA 
GPPGIiRR 


10143 


24044 


A 


10218 


142 


480 


VTIKIYLHPHDTTTPLRQGGSRLKEILV 
SY*QQKVETRPSHVIFFTVISTQFKKTV 
TTRSGAVAHTCNPSTLGGREGQI /T/RG 
QEFETSMTNPVK 


10144 


24045 


A 


10219 


163 


464 


AVYFS FWRVCEHDLASLYFENSGKRDNK 
FVFLVLQAQGPAWRQGEVHKHLLPCVYH 
FAFWLTQQPQS FF * DG/QSHSIAR/ LE 
CGGVISAHCNLYLPqSSNS 


10145 


24046 


A 


10220 


379 


469 


V* KVQM/ WPGMVAHTCHPSTLGG* GGWI 
T*DQEFETSLADVAKPC 


10146 


24047 


A 


1022! 


211 


473 

» 


VTWLIiLFFSIFRASSLSTDSVSYVIAFQ 
FYFYLFI YFYLFI FETESCSVA* AGAQW 
CDLCSPQPPPRGS \SEEPSTLQMTSGSS 
NL 


10147 


24048 


A 


10222 


142 


394 


GVCQSLLGSLCQLGGTGVRDQLEEAVCP 
IAELBPCAWRTTJJ.RICCSIiQSQQAGIF 
/RFC*SYIHICPFPQVLLSSEMGVLSIR 

P 


10148 


24049 


A 


10224 


77 


396 


KRKTKKS FWTPSGFGIiRPP I PKKKKFFN 
SRGKQGGFGRGSGSRGGGGSTSGRGGGY 
MGKVFGMGQ*NFTGDQGLAKGGFVFVFL 
GRTT IG I KGALKRLFA/NNEHDLR ■ 


10149 


24050 


A 


10225 


155 


443 


GQAREGTLKSQFPFPSPPGWPLILPGGV 
FSPKFFFFFFETRV/ CAVAQAGVQWCNI 
DSSNSCASSS*IARITGAHHH\TELIFV 
F*ISPPSQ 


10150 


24051 


A 


10226 


290 


485 


KRGEGPVQKiCKFFFFFFETESCTVAQAG \ 
VQWHNLSSL* PPP PG\SSDS PAEFPR | 


10151 


24052 


A 


10227 


228 


424 


KLYQNLHDTIiFGFELSHYEEVH-KMLKG 
SSWEFVMHSS/VSNKCILICSAKKKKKK 
KKKKKKKKKKKKK 


10152 


24053 


A 


10228 


269 


454 


DL> FAAS QGHAQLLE E LMATTLS LTHTHT 
HTHTHTHTPWSLF* SR/SALSLVC 


10153 


24054 


A 


10229 


255 


1 


FYIFSKFIiYVWI»NVSHLDSHICFCIQSV 

1 1 SCFGVFYYYYY\ ff*rqscswhtgv 

QWCDRSSMQPQLQGQnDPPTSASQVAGS 
A 


10154 


24055 


A 


10230 


18 


472 


GPEEFGTRRERDRERERETRDSRERVER 
VRERER* /RQRERERERERERER 


10155 


24056 


A 


10231 


52 


361 


WRKNYYFQIKITVYIiENXKDFIQKLHTK 
CW*G*KEIGRAIHCWWECKM/VIQPVWK 
R/VWQFLRKLNMLLPHNPAVELLDIYPN 
ELKTHVCTKALYEAGRGGSRL 


10156 


24057 


A 


10232 


3 


402 


APRLPLGGW I S FS S PMGAMGLNRATGIM 
GS FFFFFLKRKSPLG PQGEGRGQNLG*G 
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ino tn firei 

■ Ilg IV 111 «l 

amino add 
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peptide 
sequence 


Predict- 
ed end 

II IIHC"* 

otide 
location 

rnrmnon 

ding to 
last amino 
acid 

residue of 
peptide 

acl|UCllLc 


Amino acid sequence (A=AJantne 
C=Cy stein e, D-Aspartic Acid, 

fts^l^liitam ir* A<*iri ir=Ph^nvInlnninn 

Kj Vjf 1 U 111 III I C AUU) T JTIICli riniallUlV) 

G==Glycine, H-Histidine, I=Isoleucine, 
KNLysine, L=Leacine, M=Metbionine, 

Q=Glntamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyr0Sine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \r=p033ible 
nucjcDuoc iiiaernoii 














NPG PRG * THFPG * TP * KAGNNGEGPPPR 
GNFG FL KKKGVPQ KGQGGS KP / RDPRGT 
PRLGPPKGLE*RGGPPPPAPQK 


JU137 


2405 o 


A 


1UZ5J 


45 


4/U 


PGRRFTTAGLGKQEPAASGPRAPRPAQT 
PLKREAARPSLRHLPEGA* APRSSARS P 
LLIPAASLRS PRRRTCRRRLRHFGI ITR 
PFQPGVRLPDSGPGIiA/ RWACAAVPAAS 
RALPALRRPL*RRPGEGAPSSKSHGHSR 
SP 


10158 


24059 

* 


A . 


10234 


169 


446 


CPPGPAPSHLLI 
WWDRWREKTLFI 
\FEAESHSVTHS 
SNNPPTSAS 


2DRDD PQAGRQSS FYAG j 

jFLRQSLTLSPTTPFFF 

3GMPGVISAH*NLRLPG 


10159 

♦ 


24060 


A 


10235 


264 


2 


KLKP I FKMVFLKI MFMS YIMEVFPF I ET 
GSRS VTQAGVRWHDLDSLQP* PP\GSSD 
PPTS VS Q VGKWHKTWEAELAVSRDHATP 
RAKL 


10160 


24061 


A 


10236 


306 


3 


SQDMVKSPKKKFP*LV*GLTEI*KIRAL 
VFNKKFFFFFFFPETGSTGSCSVA\RLE 
CSGTIIAHCNLQLLGSSNPPASASRVIN 
S I KKLFANF I VSANPRPT 


10161 


24062 


A 


10237 


20 


469 


QCGAIPRRGDHLRSGV*DQI 
LLKIQKLSRHDGH/CRRLSE 
RG CS ELRS CH CT P AWATR \ G 

s 


>GQHGETPS 
[KNGLNPGG 
WSVSKKKK 


10162 

• 


24063 


A 


10238 


313 


3 


TRPRPTQGPPHSGGRACLC I FSPHGPPQ 
TRLSTPVS P SVRPLGFQVLLS ARHGFLL 
CQHGMF\ PSLFTMSNITLWPGAVAHTCN 
PNTLGGRGGWIT*GQEFQTS 


10163 


24064 


A 


10239 


274 


49 


KGNPFFFVETGSHSIAQDGVQWCNHSSL 
QP*PP\G\SSDPPTSASRVAGTTGVCPC 
EDKLSHILRRCYAQVFMLAG 


10164 


24065 


A 


10240 


144 


454 


VCKI KI I YL I VLLED Y IN* FVKSSL* LK 
S*K/LIQQMWAIISLPECLNSWCLGKV 
G* KRCSQQGTVAHTCNPSTLGGRGGMI P 
R 


10165 


24066 


A 


10241 


289 


178 


FN YS S I KNNTDG WVWWCLLP VI PALWE V 
EAIESF/CSRDE I PLSCPGWS PVPKLKR 
F YCLNL P * CWD YRQEAP PHPAI S 1 1 FDA 
TVIEIjFS 


1 A1 CC 

lUloo 


24Uo7 


a 


lUZ4Z 


JjZ 


ZV4 


PPRTTKMW*DQPGQHVEAVTLQKMQKL 
VERGGKHPQSQLC/REDH*SMGGGGCSE 
PRP*PCTPAWVTERDHVSKIK*NKIK 


1U107 






1 t\*>A1 

1UZ4J 


£?6 


i 
1 


VSLCRESSNSFIWQCPWSHIIALCFQR 
EKKRLDLP/S*MVGK*EIPFLVPLLFLP 
HWPLLLMK/YM* IFFFGNRVSLCHPGWS 
AVAQSQLTAASVPRR 


11/ too 








Z JO 


j - 


LERH I P VRP 1 1 1 CD YKWVWRF / HFLRPL 
HN+ FLFS FFS FFFFFFETGS YS VT\RIiE 
CSGTISAHCNIiCVHLIiL 


10169 


24070 


A 


10245 


185 


1 


KKQPQQTKNPLNWVKKEGKGQHFFFFLE 
T*SRSVAQAGVQWCDLGSLQPPPPG\SS 
DPPPS 


10170 

• 


24071 


A 


10246 


2 


265 

* 


AHKKKPAKHIAPSTFLYGETEARRSMPK 
AAQLKCGRAGI ILFVFKTES CSVAQDAV 
LWCNHGSLQP*PPG\SSDPADSACRVAR 
TTSI 


10171 


24072 


A 


10247 


2 


430 




iLTGRESGv 


QRRRDSMG 
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M 
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26 
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beginning 

nucleotide 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
C=Cysteine, D=Aspartic Acid, 
EsGlutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=*Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
l=Tnreonine, v^Valine, 
W=Tryptophan, Y=*Tyrosine, 
X=Un known, *=Stopcodon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


• 

• 




■ 


• 


■ 




m mm \f+TT\~y\. ^*T*1 f\ (Till T» ^% f J m 1 nf.mVH^TTT mrm * "Ft W T\ m 

CQGTAGRGTRAGKEGPWRPEHL»SRRPPA 
SSVLLSWAIHLAWVEGS * TGSLCDLEAP 
GPTLS P WCSWS PELLQLHV/ HHHTHTHT 
HTHS /HPNTHTDTQS LCHQPS PS VS PRA 
RTVL 


10172 


24073 


A 


10248 


201 . 


16 


TPPGGVFFLGFFFLFFFFETESCCVAQA 
GVQWCDLGSPHPPASRRPGNRSSI/AAA 
*GYHFK 


10173 


24074 


A 


10249 


167 


2 


GFDPFEGMLSYLEKEET/WPGTRAHTCN 
PSSLEG*GGWIT*GREFKTSLTNMEKP 


10174 . 


24075 


A 

■ 


10250 

i 


92 

- 


389 


FQPGQHGEILSLYE IKKI * / EIAGCI DM 
CLWSY /IiLRRLKQEDCSST* VQGCSEP * 
CLSCIPSWVTEQDPDSGRQI*KKKKISL 
GRRGGIFLFLKIWHVFFL 


10175 


24076 


A 


10251 


221 


415 


; KKKKRGGRFKGSNFTSPGWRGNIFFMGP 
PKLNS RAGV* QRRD / WEKPGGPQFNCFG 
SNPPFPPGGK 


10176 


24077 


A 


10252 


94 


383 


GKLNRQG I WLLGRCQRS P AVSNALWLHR 
VAPEEEAGEAWAGTLSLQIiSCKSKITPN 
KIVY* KKNKT/WLGTVAHVCNPSTLGGR 
GGWII*GREFETSL „ 


10177 


24078 , 


A 


10253 


249 


2 


YPTMI FKCSS KRKS PTSLTLNQKLEMMK 
LSEEGMSKAELS * KLGLLH/ QVSQWNA 
KEKFLKE I KS ATPANTGMI RKRNGLIA 


10178 


24079 


A 


10254 


132 


414 


NNTYFQGKI FI LNS QVKGML I CCWWECK 
* VE PLWKAVWRFLK\ DLEQNSKE YKL * T 
WNQPRYPAWDWI KKMW Y I YFME YHA\A 
IKKNEIMFFAA 


10179 


24080 


A 


10255 


359 

* 


3 


YSLRHNIEMRPVSNPTRASKCSHERRSV 
I SLTLNGKRE I IQEASKEGTLKANKPKA 
KLFVPVRQWNAKEKFLKBAKSAAPLN\ 
RMIRKQNSLTADTEQV*VIWIEY*TSHS 
IPLSQSL 


10180 


24081 

• 


A 


10256 


283 


2 


S LQ P WKL YVHVFLP KYATYP P FGTLP KT 
HFFHKILPSKAGSDLNS I YAMTYSHRHI 
TIFFETESHSVTRLECSGAI \SAH*NLH 
LPGSSNSPATK 


10181 


24082 


A 


10257 


216 


386 


tote mm* mm. mm mm*, mm mmw m. -w mm, m/-m, *mm m » ^taak V¥l _W «h Wm^k. 

PSKNPPPIjINPFFFFETDSCSFA*AGVQ 
WRNLSSLQTPPP\GSNNSCASAS/RIAG 
X 


10182 


24083 


A 


10258 

• 


263 


1 


^OwtlLPSPX^YSFLFLk^FAiXpkL 
ECSGEITA\PSSAS*VAEITGACHHABF 
LNFFVEMEFHHVVQA/UjKPLGSSEPPTP 
ATK 


10183 


24084 


A 


10259 


248 


47 


I PKSRVKRENRLNPGGKGCN* prsppcs 
P PWVAKQNSLS KK/NKVKI YQNLHTETQ 
TVHGDIRGQEK 


10184 


24085 


A 


10260 


159 


2 


KRDM S S LKLE Y * KKE / R I W PGAVAHTYS 
Jj b 1 JjuvjK e» W ± x w oyt. r cj J,o li 1 WM v K 


10185 


24086 


A 


10261 

4 


190 


405 


ASLHMFMYFFFFFFPKQIFFFVPQAENQ 
WGNLGKRNPPPPGLRDF\PASPPKEAGF 
RGGPH* PGLFLVF+GE 


10186 


24087 


A 


10262 


39 


293 


PSSSTSRIYPEDTPLITQKYICKILFIV 
ALFIFPKHWKLCKCPHIG/B* IKLQCI Y 
TWR YYVAVQ KNE ENL YQLMYND YQE ILV 
SD 


10187 


24088 


A 


10263 


282 


3 


AFYNR FFLKI FFKRGP RGKE KKGGKKEK 
KKWGKKKKKKFFFFS PGQI FFLGGVFFF 
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NO: of 
nucleotide 
sequence 


SKQW 
NO: of 
pepuue 
sequence 


eth 

Ovl 
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SEQH> 
NO: in 

09/515,1 
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beginning 

mi rlpntirlf* 

IIUVICVIIUC 

location 

corresnnnd 

ing to first 
amino acid 
residue of 

1 WIUUV V* 

peptide 
sequence 


Predict- 
ed end 

llUUIr 

otide 

location 

correspon 

ding to 
last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A a Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamlc Acid F=Phenvlatanine. 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lvsin e , L=Leu ci n e, M=Meth ton in a. 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S^Serine, 
T^Threonine, V=Valine, 
W«Tryptophan t Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














FLFFFEAESRSVT*AGVQWCDQ\SSSQP 
RPPRLKPFFQ ' 


10188 


24089 


A . 


10264 


196 


376 


KKGGGH IINFPPHKG YKPGPEKKKFFFF 
FFEMES CSVA\RL* CSGMI /SQAHCNLC 
LPGSSN 

VJ^J w*« 


10189 


24090 


A 


10265 


514 


157 


GLTLSFRLEYSGMMSL+P*LPG/ FR*FS 

RIjSLPSSWVFPLMYVYHYAQQSFCIFFV 

T?TRFWRVVnanijKVT/?IjRDSAHIjGtiPBC 
Diorxmv v^rtvjjjxvv uwuaywiMMwyrHSi 

W\DYRCEPL*LTKLICFTYGLR*RSKPP 
FFVPGIQLF 


10190 


24091 

• 


A 


10266 


57 


378 


RSRIW*DSVILCS*LVSV/VDQSYFY*V 
TEILBQIKLLGSKIMCFCILQLHFLCFL 

STLGGRGRWIT* DQEFETCLASMVK 


10191 


24092 


A 


10267 


1 


212 


LIQKDTNIYYIKBAL*IPNKHI/KKISS 
LYECRETRTLMHCW*GC 


10192 


24093 


A 


10268 


3 


iyy 


♦FIPIKWLVLKSKRESNKCWQGCGRTGI 
VIHFQRECKIVLLIi*KTV*QFIiKKI ) NIR 

"lgavahscmpstiigcltrprvodopgghg 
KTLSLLEIQKLAWHGLHLSPS 


10193 


24094 


A 


10269 


190 


386 


FSPFLPLVTILECDKAICRRGKSFDFFL 
LGPSFYFFKLLL/FIFYF*ERVLLOGPG 


10194 


24095 


A 


10270 


166 


327 


KTNSKTAEVCPLSVITLNINGLNSPIKR 
QR*TDF * NP I\ I CCLQETHFRFKGTN 


10195 


24096 


A 


10271 


180 


1 


I FFLFEVETCSVA* AGVQWNDLGSQQPP 
PP\GSSDFQLIFAFLVETGY* IMVPSSS 


10196 


24097 


A 


10272 


273 


3 


FLGLFSONPLKANKPVCHLTLP ILQRI F 
FFFESEFHS\VAQAGVQWCDLGSLKPPP 

VRLRLKKK 


10197 


24098 


A 


10273 


399 


?7 


YHFVPTRMAI IRRKNN * CW * TCGE IGNL 
VYCWWECK/LV*LLRKIVWQFLK/DVKQ 
NYHPVTLLLSIYRI *FDCLFLCKSHVEM 
* LISGVRVGAWWEVIGSREWFLMSGFTI 
SSCHQDCEFSQDLIV 


10198 


24099 


A 


10274 


275 


462 


KINLVLQLD FFFVEMESHS VTQAGVQWC 
HLSSPQPPPP/GSSGDSRASDS*VAGIT 
SVHQHAGL 


10199 


nil aa 

24100 


A 


1U2/D 


i 




RRPfiQRGT .(3 ^RTf 1 FHPGRGGE P I VYOHL 
F*FPGP\PENYILILPGFGIISH/IVTY 
YSGKKEPFGYIGMA*AMISIGFLGFIV* 
AHHIFTVGIDVDTRAYFTPPYHTFEEPG 
Y 1 1 SRRRRKTKGFLF 


10200 


24101 


A 


10276 


2 


Ann 
407 


FIDGSPTKPNRHSQAKAGYATVPSISII 
EATA/ PAP tTTC * QAELIALTRALT IAK 
GLCINI YTD / SHILHHHAVMG* K/GFLT 
TQGSS X I DVSLI ETLLKAFLLP KEAGA 


10201 


24102 


A 


10277 


203 


545 

V 


FLWRGQSLALLIjRLECSGS ILL/ ahcnl 
GPHRFKRFSLPQPPE*LSNW\DYRRLPP 
CPANFCTFSRDGFTTLGQAGLQLLTSSD 
LPTLASPKCW\DYRHEPLVHGLTFNRSF 
NFFF 


10202 


24103 


A 


10278 


250 


3 


AKWFLEMKSTPGEDAANI IEM/TKDLDY 
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peptide 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, HHHistidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methiomne, ( 
N=Asparagine, P=Proline, 
Q=Glutamine, R=ArginTne, S=Serlne, 
Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Un known, *=Stop codon, /"possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














DINLVDEAVAGFGRI DSNFERS CTWKM 
VSNSITCYRELFHERKSRLMCQM*LSYF 


10203 


24104 


A 


10279 


333. 


3 


SLILLLFQSMNHTHVRVTFIDQFCVCSN 
YSTDWLFPHLSPISLRTPYSLRYNIKIR 
PINHPTMAMASKCSSERESRISLTLNQK 
\LETIKL/SEEGMSKAQIG*KLGIiLCE 


10204 


24105 


A 


10280 


251 


3 


I TSKVI DS NPTTS 1 1 TLNLNGLNTS F * R 
QRLL/DWASSPK* DS I ICYLQEMNFKYK 
IFKKIGRRIYYANISQKKVGVAVLTPDK 
V 


10205 

■ 


24106 

i 


A 

■ 


10281 

i 


22 

■ 


226 

■ 


TKINSKGNKELNVRAKTIKLLEENIDRN 
LCDIiGLGNCFLD*YCIL*TKINSKGNKE 
IiNVRAKTIKLLEENIDRNLCDLGLGNCF 
IiDMTPNAQTTKGR\lDKLDFIKI*NFWL 
DT 


10206 


24107 


A 


10282 


238 


383 


DAKRKVS STEGDS EGVAHERRSAHLSAK 
PCPAEVE/AKPQKAAEKG* SSE 


10207 


24108 


A 


10283 


127 


377 


RIPGGSAtiGKEIRQCFGVKGQAEIGKIE 
MSQ PTQERE WEG I GS QKAAQVDHLRSG 
VQDQPGQYGETPTLLKIH*LAGHG/GRC 
I* 


10208 


24109 


A 


10284 


328 


370 


VGPGPGF* SSGRVSSCCPGWS \PSELK* 
SAHISLPKCWDYRCQPLRPATFFFKTAS 
HYVAQAKVQWCNYSSLQPGTIiRLKCCIF 
KKIQWQW 


10209 


24110 


A 

» 


10285 


341 


1 


KWCVLTAPLSGRFSVCLPFLGPPYFLQ 
HSDIE I KP INYPTVAS KCSNERRSHTSL 
TLN* KLET I KL I EEGGMSKAETGRKLS L 
L/QSVSQVVNAKEKFLKEIRSITPVNT* 
MI 


10210 

• 


24111 

■ 


A 


10286 


199 


329 


LFFSFLNCLSPHRIEESY*KEIKSAPPI 
TPGIPRK*TSLIAGKKNVFMDWKDQTNH 
NIP/LSQSKTITLFNSMRAETVEEAEEK 
L/ ERG C FMRFKERTHLHN I KMKGPGWA 
HTCNPNTLGGQGRQ I T 


10211 


24112 


A 


10287 


358 


387 


YTNAIiLLYV*DQPGHLMGKDSLFNKWC* 
KNEYSHAKKKQNH\ P * FSLYTKTNSKWS 
KDLNVRAKPIKILEKK 


10212 

■ > 


24113 


A 


10288 


111 


373 


SAQCQPDQHGKTPTLLRIQKtiAGPDGAC 
L*SQLLR\ *GOGE PRSRHCTPAWAT/EG 
KTPSQKKKKK 


10213 


24114 


A 


10289 


197 


375 


IRIGLCKFI I * FLETKLHTVAKVECSSA 
ISAYGTIiSLVDSGDSPASAS/RWGTTG 


10214 


24115 


A 


10290 


118 


371 


QCIiLMMS PGLKT* MTGRLRQENRLNPGG 
GGCGS PRS CNCT PAWATK\GNS AS KNKQ 
T\KNKQQQQQN 


10215 


24116 


A 


10291 


214 


368 


CGYHQKNKKTNPKNSKCWQGCGEIGTLM 
HYWKDHKQV/ Q/PIi*TIVQQFLKTRI 


10216 


24117 


A 


10292 


2 


376 


PFPLMAQRPTHLVPLSASSGTSLLMPST 
LLWLPVfLYKAHAVPTVPLSKL FDHTKPY 
AVSKNNLGALSI*SY+VIPVPPRSRF/S 
FIRDKPTS 


10217 


24118 


A 


10293 


I 


379 


NILHNLDFFKNTTLEPQSSRLQRAVIAP 
LHFSLGNIVRRCFKKKSGWAQWLTPVIS 
ALWEADVAGSLEFKSETTLGNMMRPCUf 
FKKKKKKAFLFGAPGFFFFMGGKILFGI 
AAPF*SSGRVSSCCPGWS\PFELK*SGH 
IS LPKC * D YRCQPLR PATFFFKTASHYV 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L-Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R«Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophao, Y=Tyrosfne, 
X= Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQAKVQWCNYSSIiQPGTLRLKCCIFKKI 
QWQDI 


10218 


24119 


A 


10294 


201 


384 


GEERLSVQCSAFK*LR*HINISLQTFFF 
LRRESHSVA\RLECSGVISAHCNLCLPG 
SSDSPA 


10219 


24120 


A 


10295 


161 


400 


VCVIESVKANHLVDEHALKSLSFHLSLL 
FGV*GVCVCVCVCVYMCF /CVCIYVC 


10220 


24121 


A 


10296 

■ 


1 


433 


GEGGGGCSGWSGVETAGSRPAPGAEQDL 
LPAPALGRWQGLNSVCYPAWPQPNRRQP 
LALLCAAGLGPGORLECSGMIMAHHSLD 
LPGSGDPPMSAG\S lAGTTGTCHHAVf \E 
FEHVAQGGUBLLGPSDWPPTVLGIiQM*A 
TTPGH 


10221 


24122 


A 


10297 


209 


1 


LTFELLYHHYTLYFF*ESGSCSVAQARM 
QWCDLGLLQPKALG\SGDPPASAS*DA* 
STGVD YYS WHTC I 


10222 


24123 


A 


10298 


225 


1 


RPGMPLSALPCTGHAPARQPHSAACSGP 
KCPWCQSPQPAVGWTV*VCPLPLSPCLC 
/FCVPQPWHGGLPLPCPPV 


10223 


24124 


A 


10299 


212 


367 


VTAG SGHMC YTWWE £RLEQSL * KTVWRF 
LK* LKVELRFDPAVP^LL/S I YPED j 


1 MIA 








0/7 


■ 


GEACS SRPFTT I C / EEK IHLVCALTEED 
* RLAAETTANTTD I P IG SAYR I LI KKLK 
PS KLSTC/YVAKQLSPGQLQTK j 


10225 


24126 


A 


10301 


3 


422 


YRSSSKTFLIIRRGFIPSSTFLLAHLST 
TLSTLSHFQKGWVKLFFFFFFKKGFPFW 
PPGGGGRAKP* F/HGNPPPPG* KNFPAL 
PRQGGGKKG\GPPPPGQFWFFKKKGVSP 
WGGGAKTPELGNPPPGPPKGGGITGGPP 
PP 


10226 


24127 

■ 


A 


10302 


330 

- 

■ 


1421 

• 


RTKGRPLLLALPVLLLLFLHLAFFSAPP 
GPSSLSSTEDLKWTLLSPAQSRKLQPQS 
LHRAGVRDLQ WQGRAG AELTAAC / W PAP 
PRPETAVLLQGLPC/QEGRLLLESLNGF 
ALWSAEGT I F YASATTVD YLGFHOTDV 
MHQNIYDY1HVDDRQDFCRQLHWAMDPP 
QWFGQPPPLBTGDDAILGRLLRAQEWG 
TGTPTE YSAFLTRCFI CRVRCLLDSTSG 
FLTNQFQGKLKFLFGQKKKAPSGAMLPP 
RLSLFC ZAAPVLLPS AAEMKMRSALXiRA 
KPRADTAATAD/SKMRS* LTLSESRLPN 
* FEIWDSAAPGTLPSLPFFGAFDGQHIF 
CIHTRALRGS TRTR QCIAAALEGAS FRN 
HVEL 


10227 


24128 


A 


10303 


172 


383 


KIENKVWQGYEE I ETLVYCWWECKMVQL 
LWKTTWWLKILKLGPP* \DSIIPLLGI 
NQGTEIRFCKRYSNT 


10228 


24129 


A 


10304 


205 


2 


NFYPA*KTDFFLVGSLFIYFE* ASHPIT 
\ RLE CS GTI VTHCS LD LLGS GNP PASAS 
HVSGTAGAHHHCI 


10229 


24130 


A 


10305 


423 


30 


LLEGKLTNRKD IRTKT PSVRRLHQSPKV 
DRTAKMGKKQSRKSGNSKNQSAYPPPKE 
HSS S PAMEQS WMENDE LRE EG FRQSS / M 
S KL KEEVRVNGK+ VKTRE KRLAKS LTRI 
TNAEKYLKDLMELKTSVLRPL 


10230 


24131 


A 


10306 


211 


387 


IFFFF* I KACS VAQAG GQG RHLGS LQ P P 
PPGLKQD9C/SPSRGVGTTGMTYHAQFI 
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nucleotide deletion, ^possible 
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FFGF 


10231 


24132 


A 


10307 


229 


382 


ERAWQPG* HGBT/ PSLKKL* KLSROGG 


10232 


24133 


A 


10308 


168 


417 


GSYCFFVRRNLLIVKYMRSVHLLFIMLL 
YRGD*WYVTTQYNFCILQKQEKLIYRIA 


10233 


24134 

i 


A 


10309 


253 


402 


KGGGGGVKTPARGNPPPGPPKGGEKRGG 
PPPPTQKGFF/CFFFKKGTRPPGGKTGE 
RAKPNEIPAPGKKEFFLPKPPEKGEIKG 

RPGGTPPPGPQRGGKNGGGPPPPPKKGF 
FGGKKKKKNPPR 


10234 


24135, 


A 


10310 


182 


400 


LHVTMI I FWLYLI KNMLLNFTYF/ CYI F 
S VATSTFKI VCV / WiAift V An 1 tA~r ii IXiU 
G*DGWIT*GQEFKTRLANM 


10235 


24136 


A 


10311 


371 


400 


r IWGNI *ASNKHIiUvl \£>lbijAJrKlUUV 
K\TTMRYLSPPTPIRMT/I/I*KSYSSK 
CW*GCEE IRSLIHCKMIQPI * KT/WQFP 
KI INIQLLYNPAIALLOJRTRG 


10236 


24137 

* 


A 


10312 


113 


A f\ A 

404 


QRCGKJG1 r XHCWWhYKPWQPljWKTUWH 
LLQRLN I ELPCDLAI PLLNLYPREMKTS 
IYICT*MFI AALFI I /AKWRQP/KCSSA 
DE/IGKMWHICIMK 


10237 


24138 


A 


10313 


146 


1 


QPPSP\GSSDSPASAF*VAG 


10238 


24139 


A 


10314 


157 


434 


FFFSEI FFFGPGVFS PPPFFIGGPGIFF 
r CFLiKKK. I Li P P PROKKJf if r r KKfcrP / HP r 
FFFFFF*EGVSLLLPRLECNGAILAYCK 
IHLPGSSDLP 


10239 


24140 


A 


10315 


277 


2 


FFFTAFFIiGVPGVFPPPRFFKPGPGFFF 
lAje'*l\JtWrr rirr'KtxK./ llUJrrJjJWar'r'olf 
FFFFFF* EGVSLLLPRLECNGAILAYCN 
LHLPGSSF 


10240 


*^ A "% A % 

24141 


A 


10316 


1 


too 

388 


VRMH I CVCVCVHI S I + MDHIVS YKCIH I 
\LCIY 


10241 


24142 


A 


10317 


134 




T/^r.'D/iriTvrijn^iTT'iriJDDTnrpaT.wcnNrz 

lAsrJuJrvjVjlJl 1 JL tM rr lr r CinnllNOUno 

GK\NP LLGP KTRPLG WGLCST P KFWEKP 
RVRWDLN* TTPCIiE KRAPWLKAKKTPA 
FEIRI 


10242 


24143 


A 


10318 


233 


3 


EFLFSLICAICL*CALKQFSSCFFCLAL 
/SFFPLFFGANYTYNKPFEYSLI 


10243 


24144 


A 


10319 


221 


8 


CDQPKS PPLSGVQMGD I LWLVFFL FtiF 
AF*R*GSHCVAOAGLELPGSSDP/PQAT 


10244 


24145 


A 


10320 


1 


407 


RSTPDPVYLGFTSGGCKTEKRAGCFFLW 
KLHT KGAPARCQ PE LS GMRC \ R * PLLGG 
V/ S OS RYMAVKDSLEEAGCPIiSELKRCA 
GRYTCLFKGGRLDHLNLLRIXIPQTTLCP 
GALSQEDGS FI YKPLTGAAAFLS YM 


10245 


24146 


A 


10321 


115 


398 


ADEELLLMNKQIKYLLEMESTPGKDDVN 
TVEMTKESEYSI/NLKAVAGFERIDSNF 
ERLSAVGKMIi*NS I TC YKE I FH/ RKSQL 
MSQTSLLSYFKLP 


10246 

* 


24147 


A 


10322 


137 


454 


FFSLYFQGFFFFFFLGEKFFFSPGGKGI 
CPKPPFGPPPPPGKGFSPPPPPGKAKKK 
GPPPPPP\GFLFF*KKPGGGFLGGFFFF 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=IsoIeucine, 
KHLysine, LHLeucine, M=Methionine, 
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LGGK I F FS P AGG KGGPQKP P F P 


10247 


24148 


A 


10323 


246 


372 


I S I KS YQLW \ LGGVAYACNS S TLGGRGK 
WIT*GQKFKTSMANM 


10248 * 

* . • 


24149 


A 


10324 

• 


415 

• 


1 


IWGAPRVFPPPP F*NPPPE KNFWGPKKK 
KKFPPPPG*KIFFFKGPPPLFFFFFFFF 
SETESCSVT\RLGCSGGISAHCNLRLPA 
SSNSPPPSSLRMPSTPVLRPHGDLLSPQ 
LSAWVLSWPFLPSLLLPPDLDFIVQ 


10249 

* 


24150 


A 


10325 

i 


3 


559 


GDGI TGLS PTI S KFLFF VFES I SLS PRL 
GSSGAIMAHCSLTSRREPPCRSINFFVS 
VITLNVNNLTNGKIGRMDFF*MM*YTIQ 
I CNLQNTHCRP / KDTNCFKMKRRKE I IH 
ANSNQKRAGVAI L I PDKIDFKPKNVTER 
HRMLVRES / I SQEDVTI INTY 


10250 


24151 


A 


10326 


24 


387 


NQGLGARNQKSQRVFLAGFFFFESSSCS 
VAQAGVHW*DHSSV/PGSSAPPASATW 
VGTTGVHNHPWLI F/VFFVETGSH 


10251 


24152 

• 


A 


10327 


145 


396 


CDYFGEMFSYLLLNIYLCDDPAIPLLPY \ 
VPYENENTDSLKDMTDAHSSFIHNSPKL 
ETKCPGTDEWI * MTKKSDYCKHW*GYRV 
TGTLLYYEGECKI V*,IiWGNV* LF\LLN 
IYLCDDPAIPLLPYVPYE\MKTRIHSKT 
* PMP I AALF I IAPNWK/ PKCPGTDEWIN 
TL * HTHAVQ YKVAVER 


10252 


24153 


A 


10328 


94 


639 


AS I S SS I KWDNI S FLVGLLGELNEIRNI 
CKAVNMKQALNKWKLILLFVIENNNVSA 
SSNNFKIFYAYYIAKAVLLIITEHITLS 
I VLI LKKQNNTRAVTKCW\ RGGREKBTL 
I WL VG * EC KLVQ P L WRRVWRFL I KVKI E 
LPYDPT/ILLGIYSKERKSVYRK\NICT 
PMF \ VAALFTTAKTWKH 


10253 


24154 


A 


10329 


143 


3 


SAWFFFQDRVLL * /SPRLECTVS ISNHC 
NLRVPGSSKNYCLSLISIWDR 


10254 


24155 


A 


10330 


153 


389 


RQGLTLFPGLVSNFWLPVLSSHLGLPKC 
WGYQCYHAWPS* CF*NRSI /WSGTVTHA 
YDPSTLGGQGKPIT*GQEFETSLA 


10255 


24156 


A 


10331 


382 


422 


R YKLDL * KLNTELP YDPTVP LLGNYPKY 
*T*AF\SKTCT*MFIAAYSQPPKGRSKC 
SSPNE* INKMWPI YIMR/YYSAIK 


10256 


24157 


A 


10332 


194 


1 


CKRQCNKEDRNI / CSK* *NPTILF*TTK 
RKG / WLVAVAHACNI.S TLGG * GGRAS *A 
QEFKTSRANM 


10257 


24158 


A 


10333 


348 


1 


NSGQS WIM YPLLKKGVGRVS PPDPDIVN 
SISMVWCGGRLFSNERFASLLTIKKKLA 
GHGGTCLQS LPLGRIiRREDRLSQGGQG/ 
C*GYSE*LCCRCIPAWAAEGDLWSWSPN 
SVAY 


10258 


24159 


A 


10334 


388 


2 


I PGS PKRGPGGEKKKTFFHYLFLRAAPP 
LPRFIFFPPKNPPLKGGGGFFFPPPPKL 
KTPRLRKKKKKENFCPWKKFFKKKQTPQ 
RGR FFFFFFLRQS FAVAQAG VQWCDLGS 
LQPSPP\G*SYHPARTRG 


10259 


24160 


A 


10335 


3 


396 


LLRPPCSLRHNDIEIMPHDNPMIASKCS 
SKKKSCISLTLSQKLDVIKLSGKGMLKA 
KTGQNL /GLGHLYRTVRHAVNVKE KCLK 
EIKNAALLNTOMTRK*NSLIADTEKM*V 
V* IQE * AS /HSMPLRQNLIQNK 


10260 


24161 


A 


10336 


165 | 442 


GQMELLIIARNFGII^LHFNLSIYYRRLF 
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nucleotide deletion, \ppossible 
nucleotide insertion 














FSIPRYIiFIFTYLFIYLFIYFESVAHSD 
AQVECSGTIMALT * KKNASLP / ASAPRA 
AGATG 


10261 

> 


24162 


A 


10337 


33 


430 


TGSTHASANFCVSLVWTQLISRSSGQGL 
ARLQLRCLPCC I L FFS F FSLV I YRLHS Y 
LRLPWRRICLQVPSQC*QNSFPCNFRIQ 
GSLLL*/MQQW 


10262 


24163 


A 


10338 


284 


3 . 


SQHFGRPRWVDHLRSGVRDQPGQHGETL 
SRBTP\SLLKIQKLSGHGSACL*SQLFG 
RLRQENCLNPRGGGCSBWRQHQCTPV*V | 
TEQDSVPKKK 


10263 

* 


24164 


A 

■ 


10339 

- 


414 


2 


ERAGP FSRKI FFS P PLAKKVKYPFPPTP 
RARAHKRP I FS RYVKKBRGNSPDLFWPP 
FFPSLW * KQ*GPSLGRI*FYPV/SWGI FP 
SKP*GFPGPRPKVFCFLG/LPPGPFGGP 
FFFF FFGDRVSLCHSRWSAVAQSQLTAR 


10264 


24165 


A 


10340 . 


211 


422 


ASLGNGGRPSPSLAAALQFDL* LRAGI I 
FALFKAARQGDLSL * RLLXiS F\ VCLCPA 
PRGGAYRGRQAFLS WGGLHPDRASRLFC 
LP KQ AWAMAGAP P PALLL PCS L IS YCRA 
SNQQDSGGIGP SEP S AG YKLLVPGFLLP 
AEK\RTI YGGVTRFFRGP 


10265 

* 


24166 


A 


10341 




432 

■ 


SRQRMPSTYHVQGSGDTKTGEKPCANGK 
KGCGSSILRSCLHRKRFSF*SLKALVWC 
GGACLEAQLLRRLRQEAHLSQGVPGCNE 
h* PG YCT PAWATE P \ DS VS KKK 


10266 


24167 


A 


10342 


103 


458 


SLNVCVLTTLPSSCSPISLPLRGPPYSL 
RHRNIEIRPINNPTMASKCSSKRKS * MS 
LTLNQKLE\MINLHEECML1CAKTG*NLG 
LLHQIVDQWNAKKKFLKKI KS AI PVNT 
QMVRKQN 


10267 


24168 

• 


A 


10343 


53 


408 


SFQFFCS VFS PS LW FYLLLVFDDGDVQM 
GFWCGYPFCLLVFLLTDKTLSCRSVGVP 
CRVRCQ CA\ LLGGAS QLG \ SRGSGVRD P 
LEEAVCLLSDLQLRAGRTTTIiF*AVRQG 
HLSLQRLL 


10268 


24169 

■ 


A 


10344 

■ 


265 


421 


NNFPPSPPRGGGNPGPPPPPRFFFFFLK , 

NPGLPPF*VIYLL/YFTIjTPSFC£jWCYT 

ISVYGCGFVFFFFSFFFFFFEKKIFFFP 

RVEPKNRIPVYPKILFPG*NNFPPSPPR 

GGGNPGPPPPPRFFFFFLKNPGL/HPF* 

KKGAKFPDI»INPPPWP 


10269 


24170 


A 


10345 


207 


420 


WQKGCPVTFMELGWGPLYPCFLHLWIQ 
PTSHRKYIHL /WLGMWHTCNPNTLGGQ 
GGWIN*GQEIETSLTD 


10270 


24171 


A 


10346 


115 


3 


APRGAVKKR/RPWMGAATHVCKPSTLGG 
RGG*IT*GQEY 


10271 


24172 




10347 


41 


421 . 


FTFVSVCVHFKGI FLSFLSFFFETEVPG 
SPLGPKRGGGGGNSGPWNPPPPGLRGSP 
PPPPQRTGKGGPYPPPPGNFFFKFFFRX 
RGV/PPL*HPGGPQSVT*GTPPPGPPKF 
WGAQSGGPVWGKNPIiF 


10272 


24173 

* ■ 


A 


10348 


228 


1 


S KQVKE KYDS FLHKI Q KQAT * NRTFFRV 
TKKHVEITKK/V^GHWAHAYNASTIjGGQ 
GGWITLGQEFETSLANMVKP 


10273 


24174 


A 


10349 


405 


1 


GGFLKRGPPFFFFLRQGLSHjPRIjECSG 
MTVAHCS PNXLDS SNLSLRRS WDYSHQA 
QLIFLVSF*VLGSHSVAQAGVELIiAS/P 



i 
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■ 












KCWDYRHEPIj/WPSQSQNIjNSPFLDGFS 
LLLSRLECNAAISAHCNIiRLIjGSS 


10274 


24175 


A 


10350 


301 


546 


RHIFMNLPFHNLVKIIFHCRGKTLFLDI 
S NP YLKKQ VS S S KPNCK/HLKE I *LFLE 
AKDYWLGVVPHAYNPSTLGGQGRSITG 


10275 


24176 


A 


10351 


873 


1258 


FCYQCYFHLLHHLHHHRPS P RLEPP P PG 
AALPPVPEGRIIPLPCPSSGPRQTTGPG 
KGRRHPAPHRTPAAS AASLPARRTLPAA . 
! P W* PLS FQNTTPGSA\ PGGGNQPASGAG 
GNSRT 


10276 


24177 


A 


10352 

t 


184 


576 


glsplshlkmaassseisemkgveespk 
vpgegpghseaetgppqvlagvpdqpea 
pqpgpnttaapvdsgpkaglapettetp 
ag\pqkqprpqtsa*aqegnqrptaapk 

THAQETVS KPEVS KEATAD 


10277 


24178 


A 


10353 


110 


384 


vpgsprktrspqqmervlapktakkgcp 
rpgdhrsrlqncrdtrari kwadsverp 
pwag\pyahacnpntlggrggwit*gqk 

FKTSLVNM 


10278 


24179 


A 


10354 


123 


422 


RKQ I ANDKS KLFI INKYLNVNELNS PI K 
/RERIiAGWIRKEKHDPSICHL*KTHLT\ 
KDTNSLKMEG*KKIYHANNY/QKKAGVA 
ILISDKIDFKFKKKWREK 


10279 


24180 


A 


10355 


36 


426 


PPERPQYSARVCVCVFYTECRGLAPKPE 
. CRGAI I AHCLLNVLGS Q *S SNLRVWSS * 
DYTQLIFKLFVQTGSHCIAQAGFELL/S 

skdpptla/ fpkhwdyrh\dhhsw 


10280 


24181 


A 


10356 


193 


3 


FQTRIFKISFFFKEFIiKFKKFFFFILI* 

ESRS iaqagvqwrdlgslqpppp\gsss 

PPASASLY 


10281 


24182 


A 


10357 


205 

* 


377 


SPWFQPYYRLTIPYQKC*LATVCI*VY 
* FFKLILYPAILLNS 1 1 IWINFTSDPL* 
FSSHITG * LSLI KNVWVGPGMLAHA\ *N 
PSTLGG/RGGWIT*GQEFETSLANMVKP 


10282 


24183 


A 


10358 


51 


376 


LCRPFQLCHGDRTDMWCPFPEEIiQDRCS 
CRSTSMRVAKGTTPCFLKVLQNIYTV*M 
FCRTFRKHGWPLATRMLV\ I yknsgpv 
TLQGRGTTYLSCHHGKAGRVCRATQHAA 
GVWTKQRACI 


10283 


24184 


A 


10359 


295 


376 


ASELTSQSPGSLFENQKKGRCIAFLFC* 
KCNSEILVHCWWGCKM\WKIVW*FLKTL 
\RLELVYDPASPLLAIYPRCI 


10284 


24185 


A 


10360 


451 


1 


CQDIKTILRNEQE VLVI PSVTKYKQVGK 
S FVSRRYL WSGCVLCYAFVTEAS KTWHH 
FP YRSPGFWGRSGGRQFDS* PKSQADFR 
APQGP PTPE PEGAERRE PE IAPPPTADP 
SPKPT/HSPPRSGAPPSAPFPLLGRAER 
GVG\PGQGRPRP 


< AAA m 

10285 


24186 


A 


10361 


2 


116 


FFFLRWSLALSPRLECSGAI SAHCKIjRIj 
AGSRHS PASASRVAGTTGARYHARLI /L 
/YLFLVETGFHHVSQDGLNLLTS * SACL 
GLPK\C\WIYRREPPRPAYFIFFLRRSL 
ALSPRLECVQWCDLRSLQAVPPG\LHHS 
PAS AS LVAGTTGAHHHTRIiVL I LL \ FYY 
FLRWSVTLS CRLE\AGVQWHNLGSLQAP 
PPGFTPFSCLSLPSSWDYRCPPPCLANP 
LYF**RRGFTMLARMVSIS*PRDPPPSA 
SQSAHDLFIVGQ 
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Predicted 
beginning 

f\ ■« «|aa tiff A 

location 
correspond 

■no tri ft mi 

ing w urn 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

mi /*Ia_ 

nut-ie- 

otide 
location 

vurrcojjuij 

ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartf c Acid, 

jc^vjiu laiiiii. >\ciUj v— k licny laiaaiiic, 

G^Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 

Q=Glutamine, R-Arginine, $=Serfne, 
T^Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10286 


24187 


A 


10362 


188 


365 


CPQLHTQSNLINMDASQRYQFELKIOAL 
/ WAGTVAHACfr PSTLGSRGGW I T*GLE Y 

IVTCT 


10287 


2418$ 


A 


10363 


146 


1036 


GWAVYQTLQAAWLGPRTESGVS FH6RFS 
LPHHPPQPTLHAWPAPPLQPEPCVPLRE 
utrr MJf&o W ^Lttrxro X rAiSUVSF w ECDr 3R 
RMMGKGSPVPTVAP PLQPAPEPFCVTAG 
CAPMHTPTHTQ\PPHCSIFLPKISFKSK 
nfi w Jjiir CKCJjo 5 5 V r JSLPES I IjLlLF IV 
E SVCQ PG E RCLALNMGWDG KGS GRRWAT 
KSQGVMDIAATQTPA/VFSPV/PATSCP 
AGVSGGK 


10288 


24189 


A 


10364 


229 


1 


PLENAKSI YWGKGRIiLKNNARIjFTVA* S 
WNQPQFSLMVDWIKKK\MWRMYTTESCA 
AI * R/ DE IV 5 GAGTWMET iK AMY 


10289 

• 


24190 


A 


10365 


186 


449 


RFFFFFPKFFWVFGLKSPFFFFG/RLGI 
* EKDGGFFFFFFFFPEMASGSVA\RLBC 
SGAVSAHCNLQFPGSGSLPASAPRVA 


10290 


24191 


A 


10366 


445 


3 


RVDNGVRGR YRGKRGERGR* TRWGWGHP 
QRWDQDRS RERPRETGRERAAGRRRETE 
AAETGRRRRERGRSDRNRN/RGRPRERM 

* 

IEAGRDAGRRER*ARQESWTEKKTARQE 
EGDRD/RQGHQETRAHKGWGNN*D*GRQ 
AERSENRRMY 


10291 


24192 


A 


10367 


1 


491 


NTAPFPTSLESAFWVAVQDRVS/LIjSP 
GPECRGAI \LTHCC/LRTPQ/FKPISHL 
S/LPS/SWDYRHVPPPLDNLILCRDNVS 
QSGLKLLASSNPLTSAFPST/YDYRRET 
PGPRAS P * TQMQPCFMP/RS * SQP*TQI 
A/R*QSNWGSQ*VI*NNREIiAFGTKFFGQ 
RPKAFF 


10292 


24193 


A 


10368 


572 


40 


LHSFRQHFSNTYPQCTPGTP*VTQSLiAA 
RPAGLPEPTSASGGAGAVTPPPRTD\PP 
S VRWSGKKGLDP VLRGGAPRPS PANPAP 
APQGPAFKx^TKCVPrASKTPRPGGRWE 
PAAP PHAHPGKDLQ P P STLVKE CPEARC 
PRALTGDLPSSGENRNGLSAFQGQRRPA 
QKSGENCEN 


10293 


24194 


A 


10370 


36 

• 


517 


PFLFSDIDFLFSFYFLSFFFF*KRGLIL 
Pr / O ehtesjSaKsei AU * W FJaHPRG * RJE PPNIi 
TPQTRGNKGNRQTGPVI FGILRKNGFPP 
FGQGGFKPPGLRBP PGLALQRGGNNGQN 
PYP/ P / PGKFF *RTKRGEKS PQPPYNKP 


10294 


24195 


A 


10371 


60 


403 


DVEQPKLRSYPTLKCKPRSAILTICPWA 
XjQYRSHGLI FGFSWGFLSFETESCSVSQ 
AGMQWCDHGSLQT/PTDSTSQAQAIDPP 
A5Ar X ivifiuriri AUJjlr Vr Jj*RWG 
LAKP 


10295 


24196 


A 


10372 


358 


33 


DSLGVPGAPLLS PGVTPHKQ WFFRP YTQ 
IKKLGFFPQIFFGFKV*KAPFFFWGFGN 
LKKKGGFFFFFFFSEMASGSVA\RLECS 
GAI S AHCNLQFPGS S SLPTSAPRVA 


10296 


24197 


A 


10373 


264 


3 


RQTSVSGSNDGKSWPPADIDTI \KYVIG 
NRI»ITTQQ*LIKLTSKQMITIHNTKGRT 
*SLILVSLI IFIATTNLLGLLPHSFTPT 
THCI 


10297 


24198 


A 


10374 


298 


384 


TL VI * KDG VHTTPHHQQHA YPGPKE YML 
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SEQID 
NO: of 

n ii r IpntiHp 

sequence 

• 


SEQID 
NO: of 

nentirfe 

sequence 

• 


M 

e,th 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 

location 
correspond 

inp in first 

■ •■K W 111 9m 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

nude— 

otide 
location 

rnrre^nnn 

kVl I vjpvll 

ding to 
last amino 
acid 

residue of 
peptide 

cpmipnrp 
SCl| UCIILC 


Amino acid sequence (A=Alanlne 
C=Cysteine, D=Aspartic Acid, 
E=Glutflmic Acid. F=PhenvIalanine. 
G=Glycine, H=Histidine, I-Isoleucinc, 
K^Lysine, L^Leucine, M»Metbionine, 
N=Asnara?ine~ P^Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Th reoni ne, V=Val ine, 
W=Tryptophan, Y=Tyrosine, . 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, ^possible 

IIUUICUIIUC IIUCI uuu 










• » 




HKAPLLHS VSRDPS P FAQS SNCYNRS I K 
QE PVDPLTQAEP VPRDAGKMG BtTPLSEV 
SQNGGPS/HK*GQYSGDV 






n 


IVJfJ 


QA 


JlO 


JLriiuJAyiwAtfOJJi^Xi uvwiujy n 1 lulu / \S A 

QhR/ PMLGGNFI ALN/AHI KKLGRTQIN 
NLTSQLKELEKQD /RNPKARGRQE IT\ K 
I*AELKETET*KKEIQKIYBFRV/WFLE 
NTNK"P\ PAQRTjTOPMTRPNTTPNrnitRNV 

TSAPPDV 


10299 


24200 


A 


10376 

i 


70 


363 


FHQKNLVPGLWVQLLGRRTLQSQHPPAP. 
PPSSSPBVNVPSEVQAAD*LCCIFYDVK 

/ CLC* KRGEKNLVGKT 


10300 


24201 


A 


10377 


299 


2 


Y P VAGRGNR * G CGAHALG VGGAAMP LGA 1 
TISSVSLTISSLCAFPRRGGXiEPSLPAP 

HTDDDDDT7 /D*AQCPQPf2PXJWTA7PPPMn 

PGLPPLPPPRLSPLV 


10301 


24202 


A 


10378 


302 


3 


WPTVATTCS CERKSHTSLT *NQKLEGTQ j 
LSEEGMSKAEIGQKPGLCHQT/VQWNA 
KKKLLKEIKNATPVNTQITRK\QNSLIA 
DT\EKV*VIQIBDQTSHV 


10302 


24203 


A 


10379 


125 


375 


HLSSGAQD* PGQRGKTPSLQNIQKLSRH 
GGMCP /RRLRWEDHLS /SRGG *GCSETR 
ST 


10303 


24204 


TV 
A 


10380 


A CO 

458 


3 


J. L J.o 1 XoUJVHlor liKAr'KHoKr'Jr JrUJiVY V 

SQAGPAPLRGWEN*GTRRRDDWPEASLK 
DPRRSGEPEAPL/PLRLML*SRLPPPLR 
WRRRPPPPDSHTPSPGNGKSPRVPRTGS 
KD KR P EALP P * FAAGEPLEANVP PVQHG 
GGSVRSRGSYLTAT 


10304 


24205 


A 


10381 


327 


2 


•LSAPKKFFFQKSLFLGANFRSPPFPGRP 
FTNRFSDLNVNPLTKKIYFSHKSSNPPT 
FFFFFFETGSNSVT* SGVEWRNHNSLRP 
ESPG\SGDRP*V*HTTGAPYHTWLV 


10305 


24206 

• 


TV 

A 

• 


10382 


754 

■ 


1 


G* LIMPDLHNHPLQQTSGRNQRLQGEVT 
DPSPGMSGGAQTGPKAPPRPPSALRHSG 
SIjQGQP * FAHLYCGTI PYPCYQTAPLPR 
APPPTPTGH*CPGEA\PQDELPVCGCSP 

CS PCSQSKQSPSRPPCVNDS / PGQGPGP 
KAPAGPPPALPEASLGSVHCSP*VGVMT 
PTRPGHFLVPGKVTLEQAPGGGASGGFV 
p 


10306 


24207 


A 


10383 


39 


433 


S HDQT YTEEELL FMDBQRKW FLIiMESTP 
GEDAVNWE ITTKNLEYCINLVDSTVA/ 
GVSNFERRSTVGQMLSNS IVCYGKI F/R 
* KR/SQSMWQTSLLF\NKLSQPP * PSTP 
TILISQQPSMWRQDPPSEERL 


10307 


24208 


A 


10384 


283 


1 


NVFSCKPTSSCLI S PDS PTPDRRFSKAD 
TYRNFMFPLRRHTHMHTHTYMHTHVHVC 
TQLCLiHIiRPH/lTHGIiStj/TCAPPVL*R 
PSFKPLHPTPV 


10308 


24209 


A 


10385 


12 


471 


S P P WWTHGFRYWE PTPPRFSVPHDALGQ 
FHTALFRLTRRFQLLS S C IN I S PHTCLP 
PCLGLCVGGPLEPAVCWAR/CPP*RGMV 
SGWKASPLG/CPPTLLPPC 
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SEQID ' 
NO: of 
nucleotide 

mm If vvifUW 

sequence 

'i 


SEQID 
NO: of 
Denude 
sequence 


M 

etb 

od 

* 


SEQID 
NO: in . 
USSN 
09/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 

location 

correspond 

ing to first 
! amino acid 

residue of 
. peptide 

sequence 


Predict- 
ed end 
nucle- 

otide 
location 
correspon 
din? to 
last amino 
acid 

residue of 

nenfirfe 

sequence 


Amino acid sequence (A= Ala nine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenvlalanlne. 
G=Glycine, H=Histidine, I=lso leucine, 
K=Lysine, L=Leucine, M=Methionine, . 
N=Asparagine, P=Proline, 
O^GIutamine. R=A win inc. S=5erine. 
T=Threonine, V=Valine, 
W^ryptophan, Y=Tyrosfne> 
X=Un known, *=Stop codon,/=possible 

nnp(pntirlp riplpfinn V=nfiMihtp 

liUtlvUUUC UCICllUIIf ^Ul|90lwfv 

nucleotide insertion 


10309 


24210 


A 


10386 


287 


1 


. SFCVRSRPAPTSGRTPFPAAPPFPAAPP 
FPAATTPSTAJ&DGVPG+ G / PAPAANCS P 
HS S QRGSAAMG I PLPQAS /PPAPTPAPP 
SP*SLPFPPAPMY ' 


main 

1 vJ Its 






lyjjot 




•a 
J 


DKPTKMGRNQSRKAENS \ KI KS ASS PPK 
ECSSS PATEESWMENDELREBGFR* LVI 
TNFSELKEKTGKLNPAALVCGRSBDQTT 


103] 1 


24212 


A 


10388 


15 


470 


APSPDAMGRN FTEEDKATITSLRCKVNV 
EDAGGETLVRLQVVYP WTQR YFDS FGNL 

HLDDLKGTFAQLS ELHCDKLHVDPBNFK 
LLVNVLVTVLAIHFGKEFTPBVOAYLHK 
MVTWANSLRYK 




2AL\o 




i Also 




4 JO 


AGEYGAEALERMFLAFPTTKTYFPHFDL 
SHGSAQVKGHGKKVADALNNAVAHVDDM 
PNTLS DLNDLHAHKLL VD P VNFKLLNHC 
LLETLDGHLSAE FTP\AVHASLBQFTAF 

HTTMT.T 


10313 


242X4 

* 


A 


10390 


2 


444 


TMS FNTLAI CLDCLCSTLQPTRS I PG YP 
SSPIiPGNPTPPMTPSSSVPYMSPNQEVK 

Q D PT ,Pni .If P WT.WQ T .P C C P QR CftDniPT .P 

LTFPVRDGW/ SGALPPAAQPGC 


10314 

■ - 


24215 


A 


10391 


405 


3 


KMILTCLALPAQHCPPSACPQTMAQPPIi 
PLS I KGAASMSDKLPYKVADIGLATWGH 

GACIADCLQITVETAILIETLFSLGVQE 
OWS S CS I FSTOEHAVAVFAEAGMP 


10315 


24216 


A 


10392 


248 


5 


RLNKVGGHGGNYEHLYVNKLETLQEMYK 
FLDIYNLPNLSLEEIE\NLNRPITSNKF 
ESV I KHF PTKKI S GLDGFTAE FTjTNN 


10316 


24217 


A 


10393 


262 


3 


LPRDRQAGRS QG P WPQAVGRARKMP P F 
ISYQVYS KNFFE I ESCS VTQAGVQWRN I 
SSMHPPPP\GSSDSPASASQAPGIPGAH 
HHA 


10317 


24218 


A 


10394 


287 


410 


ALKGSSRILIMLGAWLTPVI PA/LWEAX 
AGRSSEVRS 


10318 


24219 


A 


10395 


175 


1 


GGREVDDEIPLVRESTIFHIFFFFFETE 
SHSVAOAGGOWRDLGSLPSS \ PPG\SCH 
SPA 


10319 


24220 


A 

* 


10396 


661 


963 


STL I AF I VISTLFP LLDMTE I YFSLLDE 
IVDTLGEGAFGKWECIDHKAGGRHVAV 
IQVKOTDRYCEAARSEIOVLEHLNTTDP 
NSTFRCVQ\MLEWFEHH 


10320 


24221 

■ 


A 

« 


10397 


1 


268 


PQTHREATMGLSTADWTNVKAGWGKGGA 
HAGEDGAEAXiERMFLSFPTTKTYPPHFD 

LSHGSAQDK3GHGKKVADALTN\AVAHVD 
DMRNAL 


10321 


24222 


A 


10398 


33 


302 


S RGATL I YVDNENGE PGTR WAXDGLKL 
ESGPS I KALDGRSQVSTPRFGQTFDAPP 
ALPKATRKASGTVNRATEES V\ KTRGHL 
VQEHP 


10322 


24223 


A 


10399 


16 


233 


P TP YE FNTTT YRE CG PRRP E FS TS LDLL 
S QP CRAVYM\VGE KND I PFELR I VDL I T 
GKSILGFGEPKSQEGDR 


10323 


24224 


A 


10400 


177 


1 


KLLDLGQPQS PALAACLEYSGAI AAM \ C 
NLCLLGSSOTPTSASRIARI TGVHHAEP 
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SEQID 
NO: of 
nucleotide 
sequence 

• 
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SEQID 
NO: of 
peptide 
sequence 


M 
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SEQID 
NO: in 
USSN 
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26 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
EKJIutamic Acid, F=Phenylalanine, 
G«Glycine, H=Histidine, I-Isoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Senne, 
T=Threonine, V=Valine, 
W=Tryptopnan, Y^Tyrosine, 
X=Un known, *»=Stop codon, /"possible 
nucleotide deletion, \=posslble 
nucleotide insertion 














YA 


10324 


24225 


A 


10401 


185 


419 


HVNKDTLESHLVWKLKSALLGRKSESVT 
GPSCHPEPSDKTVEL/WQGAVAHACHTif? 
TLGGQGGWITLAQEFETSLDNIV 


10325 


24226 


A 


10402 


104 


3 


GIILIPKPGRDTTKKENF/RPLIPIDAK 
ILNKILAN 


10326 


24227 


A 


10403 


3 


344 


SQVMAVAGPAPGAGARPRiiDLOtfLORFL 
QILKVXFPSWSSQNALMFLTLLC/LTLL 
EQ FGNYQ VG LI PNQ Y YG VLGNKD LEGFK 
TLTFLAVML I VLNSTVRS FPLS VSLVLS 
SPV 


10327 


24228 

* 


A 


10404 


3 


404 


araqrrgdls atgrnws plp paglpatv 
vlrhsgslmaatce isni fsnyfsamyr 
s \ ed ytlasvp paatfgaddlvltlsnp 
qtslegtekanwmg\eqsqfwpktqgld 
wis fqaeknkyeas aigfprwdk 


10328 


24229 


A 


10405 


254 


443 


NQLSSIMVMFKKIRSFEEDFNDPEKVYG 
S GD KVAG \ RL I VEVCED TRVKAVR I LAC 
GVAKELRM 


10329 


24230 

- 


A 


10406 

• 


26 


461 

- 


GEVARRS CCGAMAGTALKRLMAE YKQLT 
LNPPEG I VAGPMNEENFFE WEALIMG PE 
DTCFE FGVFPAI LS FPLDYPLS P PKMRF 
TCEMFHPNIYPDGRVCIS I LHAPGD/DP 
HGLREQ \ PERWS P VQSVEKI LLS WS ML 
AEPNDES 


10330 


24231 


A 


10407 


1 


463 


QQAAKMAENSGRAGKSSG/ T/ PARGRGR 
C / PAEQVT AGFNRLRQEQRGLAS KAAEL 
EMELNEHSLVIDTLKEVDETRKCYRMVG 
GVLVERTVKE VLPALENNKEQ IQKI I ET 
LTQQLQAKGKELNEFREKHNIRLMGEDE 
KPAAKENS EGAGAKASS P 


10331 


24232 


A 


10408 


180 


446 


KKAAAS VSAASGS HLSNS FAE PS RSNGS 
MVRHSSS PYVRYPSDKPFLNSDLRRS PS 
KPTLG YPESNSRAI FSAL / KNLQD/ KIR 
RLNL 


10332 


24233 


A 


10409 


1 


417 


EAGLVTMEEIGILGEKAQDE I PALSVSR 
PC3TGLSFLGPEPEDLEDLYSRF\KKLQQ 
ELEFLEVQEEYIKDEQKNLKKEFLHAQE 
EVKRIQS IPLVIGQFLEAVDQNTAIVGS 
TTGSNYYVRIIiSTIDRELLKPNASVALY 


10333 


24234 


A 


10410 


305 


53 


LEAQ PHAP PLLGHHLPCWRI PSHLGTCR 
RPRAMRVQGSEEEGVA/GQAPPLPWVHR 
G S L P LVRRRPAGKP TGGNARE E KGKVE G 


10334 


24235 


A 


10411 


419 

■ 


2 


NNSRKFGNGTLKLVEVEIYPLI SESIPD 
KSELLLTPQDKKHAPLPSAVPNPSA/CP 
DLRPHTTPASPVS\PAKTVETRPSAPQG 
PLPGPVRRPRGRFSVRAASPQTSGRPGA 
PRPPGPALREATDAPRAATP P IAALAGH 
S 


10335 


24236 


A 


10412 


74 


322 


MDEMATTQISKDELDELIEAFAKVDLSS 
NGFI CD YELH E L FMEANMPL PG YKARET 
I QKLMLDGDRNTDGKI S \ YDE FAYVSM 


10336 


24237 


A 


10415 


1 

• 


415 


P QTHRE P TM WAS ADKTNVMAAWG KVGA 
HAGEYGAEALERMFliTFPTTKTYFPHLD 
LSHGSAQVKGHGKMVADALTNAVAHVDD 
MPNALYALSDLNAHKLWLDP/VNFMLLS 
HCLL VTLVAHL PDE FTPA VHACLD KLL 


10337 


24238 


A 


10416 


19 


454 


APS PNAMGHCT EEDKATITS LWGMVNVE 
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sequence 
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NO: of 
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sequence . 
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SEQID 
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USSN 
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26 
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beginning 

ft A * 1 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last ammo 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E^Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine,I=IsoIeudne, 
K=Lysine, k=Leudne, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=VaIme, 
W»Tryptophan, Y«=Tyrosine, 
X^Un known, *=Stop codon, possible 
nucleotide deletion, V=possibSe 
nucleotide insertion 


- 


• 


■ 


■ 






dagretlgrlLwypwtqrffd^fgn\l 
s sdsai mgnpkvkahgkkvltslgyaim 
hlddlkgtfaqlsellcdklhvdpenfk 

LLGNVLVTVLAIHFGIEFTPEVQASWPK 
KMNAVA 


10338 


24239 


A 


10417 


2 


395 


ALSSSQPSAPMHPFYTRAATMIGEIAAA 
VSFI SKFLRTKGLTSERQLQTFSQSLQE 
LLAEHYKHHWFPEKPCKGSGYSMIGVNP 
\KMISLILQACIEGGI 


10339 

• 


24240 

* 


A 

■ 


10418 

• 


2 


374 


LSTPHAFGTGKWGTF I SLL FLFSSAYS R 
GVFRJRDAHKSEAAHR I KDLGEEDFTALV 
VIAFAQHLQQWPFEDHWQLANDVTEFA 
TTCAA0ESSENCDQSLHTLFGDLLCSRA 
TIiDESHCVQDGRP 


10340 


24241 


A 


10419 


48 


347 


DPKAQLPEPLRVLWQLTYAMAAGSR/TS 
LL/LAFALIjCLPCFKAGPPNRPLSRIiFD 
HAMLQAHRRTTAI DTYQEFENLYPKD/ Q 
QYS/FLMTPTSSALDSIPTPS 


10341 


24242 


A 


10420 


166 


1 


nflkkpgpfqkffqrglpliltfffffx 
fetkphsvsl\lecggtvsahcnlclpv 


10342 


24243 

■ 


A 


10421 


2 


251 


LGCTQHRSQELVAAATSHQTC IQASEDV 
KE I / FARARNGKYRPLKIS I ENGQLM IG 
SY/SSQPSDSWDNDYDSFVLPLLEDKQL 
CY 


10343 


24244 

■ 


A 

• 


10422 

- 


251 

• 


1271 

■ 

■ 


KEDLSPRAPMSGTQSTITDRFPLKKPIR 
HGS I LNRES PTDKKQKVERIASHDFDPT 
DSSSKKTKSSSEESRSEIYGLy\QRCVI 
IQKDDNGFGLTVSGDNP\VFVQSVKEDG 
AAMRA\GVQTGDRI IKVNGTLVTHSNHL 
EWKLIKSGSYVAIiTVQGRPPGSPQIPL 
ADSEVE PS VTGHMS P IMTS PHS PGASGN 
MERITS PVLMGEENNWHNQKVE I LRKM 
liQKEQERLQLLQEDYNRTPAQRLLKEIQ 
EAKKHIPQIiQEQLSKATGSAQDGAWTP 
SRPLGDTLTVSEAE TDPGDVLGRTDCSS 
GDASRPS SDNADS P KSGPKER I YLEENP 
EEQKKG 


10344 


24245 


A 


10423 


198 


569 


QRNMVGQRLVEPRRLKPGFINVKSYNGD 
WE WHQGFFLVCFFFLRWSLALS PRLECS 
GAISTHCNLCLPGFKQFSCLSLPSSWDY 
RHAHHCTQL IFVFLVEMG\ FHHVGQAGQ 
GFFSLEKSLTI I 


10345 


24246 


A 


10424 


2 


343 


PQTQREPTMVLSPADGTNVYAAWGKVGA 
HAGE YGAEALERMFLS FPTTKTYFPHFD 
LSHGSAQVKGHGNKVS \ DALTNAVAHVD 
DMPNALSALSDLHAHKLLVDPVWFKLLS 
HC 


10346 


24247 


A 


10425 


31 


342 


RAAVMPREDRATWKSNYFLKI IQLLDDY 

Tiirr'D'nrnnnxTCV^eti'rtVinATDMOT DflimT T 
i'K.LZr IVGAUNtfcoKUMyUlKMoiiKUKAV 

VLMGKNTMMRKAI \ RGHLENNPALEKLL 

PHIRGNVGFEITKEDLTEIR 




24248 


A 


10426 


196 


2 


KNLIISQTKKVICPNTLFLVFPKRVFFF 
FF FFETE SRS / VS PRLECSA\ S I SAHYN 
LCLPGLSDSS 


10348 


24249 

* 


A 


10427 


1 


405 


RLECWLE P PHGAGLQGLG WVWS CS VSTG 
PTMQALV\LIiLCIGALLGHMSCQNPASP 
PEEGYPDPDSTRALEDDEDPIFIDPENK 
LPEAGSNFGYDIiYRLRAIMSPTSNVLLP 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F=PhenyIalanine, 

-+m. % p mm mm m . m m m mm mm m m 

OGIycine, H=Histidme, I=Iso1eucme, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prollne, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W«Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














PUWATDLWDLSLEAQQLTES I I 


10349 


24250 


A 


10428 


223 


361 


EGPTEENMAAKVFES/ IGKFGLALAVAG 
/GPVNSALYNVDVGHRAVIFE 


10350 


24251 


A 


10430 


3 


359 


LTQREPTMALS PGDMTNVKAAWGKVCAH 
AGDYGAEALERMFL YFPTT \ KTYFPHFD 
LSHGFAQVKGHGKKWDALTNAVAHVDD 
MPNALAALSDLHAHKLRVDPVNFKLLSH 
CLLVTLA 


10351 


24252 


A 


10431 


273 . 


1 


NHKDGKKQSGKTEKSKNQSASPPPKERS 
SSPATEQSWTENDFDEFREEGFI\RQSN 
YSELKEE I RTHGKE VKNLE KKLD DRLTR 
ITNAQK 


10352 


24253 

» 


A 


10432 


211 


3 


S FLW KFCLRG I PSHVS CQS A/ LHX3GAS 
QLGYSGVRDPLEEAVCLFSDLKLHAGRT 
TTLFNAVRQGHLSLQ 


10353 


24254 


A 

■ 


10433 


281 

* 


2 


TNQE KKKGDPNKI RSGKGDTTTDNTVT I 
IRDYYG/HTYTNKLKN/LEEVDKFLDTY 
YLPRFNQREIENMNQPITSNEIESVIKS 
LQTKKSPGPMAS 


10354 


24255 


A 


10434 


219 


1 


SFLWKLRLRGAPGCMRCQSA\LLGGVSE 
LGYSGVRDPLEEAVCPYSDLNLCAERTT 
TLFKAVREGCLSLQKF 


10355 


24256 


A 


10435 


133 


2 


FPKPPGNFFFFFETESHPVT\RLECSGA 
ISAHCNLHLPGSSDSPA 


10356 


24257 


A 


10436 


248 


376 


KGGVFFFFFFFFETESCSITQTGAQWHT 
\YGSLQPRPP 


10357 


24258 


A 


10437 

• 


250 


1 


KGTRKELTFXEQLLSAKYWAAGCMHYFR 
GSGKYSNG YPVFVFVFI FSBTE S CS \ VA 
QAGGQWCDLVSLQPPPP\GSSDSPGASF 


10358 


24259 


A 


10438 


354 

i 


2 


KRGLKKNPFFNPRNMGEFLKLWGPNPKK 
I FKNCKI FKMGLKKKVLGF YKNSKNLKR 
GKLNPFVIFFFFFFFFFFFLETGSYSIA 
\RLKCSNSGTNTAHCSPELPGSRDLPDS 
TTQVAG 


10359 


24260 


A 


10439 


126 


1 


GMYDCVCLFCFAIKEGREREKEREKBRK 
RERQREKER/ERERDKERKREERK 


10360 

i 


24261 


A 


10440 


29 


347 


IQKPTADTKKFFKKKLKYTTRENHLHNK 
EDRKEGKKEEKTTKQHGSSKSLLINKTL 
NINGLKS P I KTHRVAE / WNKNKDPMLYC 
IQETPFTYKDTHRL/RIQGWKKIFH 


10361 


24262 


A 


10442 


405 


1 


IFFPHPFFFKKSFRLSPQVEIQGIFLGP 
WKLFPPGFNNFS/CPNLPNNLGFKGLPP 
RPSYFFRI FCKKKGF/H/RVYPEGFPFIi 
ALGFFPLFPFKSWGFQGDPPSWGPFFFF 
FFFVL VAE GTL S E G CRLIRGRW KNL VP 


10362 


24263 


A 


10443 


122 


372 


lynd p i s x yqvtl yrrvffwvlysanws 
lrwkanflf fs ffkmasdpvaqagvqwc 
dldslqpqppXglXsdtptlasqvsett 

G 


10363 


24264 


A 


10444 


171 


3 


RHPTMAGSSEMFLGSGFGCRLTVFVLFE 
TESCSVT/RECSGAISAHCYLSLPGSSN 


10364 


24265 


A 


10445 


173 


3 


CMNWCFS IFIFLKINFILETESHSVSQ 
AGVQW\LITVHCSLKLMGSSDPPSSASQ 
V 


10365 


24266 


A 


10446 


187 


329 


S FLWKFCLRGVPGHVRCQSA\ LLGVASQ 
LGYSRVRNPLEEAVCPFSDL 


10366 


24267 


A 


10447 


27 


226 


YSYPVYLLLHIFLCWKCSLYFIYFVFEW 
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M 

eih 

od 

• 

• 


SEQID 
NO: in 
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nucleotide 
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peptide 

sequence 


Predict- 
ed end 
nude- 
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corrcspon 
ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A D AJanine 
OCysteine, D=Aspartic Add, 
E=GlutamIc Acid. (^Phenylalanine. 
G=Glycine, H=Histidine, I=Iso leucine, 
Kr=Lvs ine, L=Leu cine, M=Meth io n in e, 
N=Asparagine, PHProline, 
Q=Glutamine, R«Arginine, S^Serine, 
T-Threonine. V-Vallne. 
W«Tryptophan,Y=Tyroslne, 
X=Un known, *=Stop codon, Impossible 
nucleotide deletion, \ppossible 
nucleotide insertion 

mm %m mm 9 mM w U mm mH m m Itjwl mm LA 




* 










. ESCSFAQAGMKWCDIjQPLPPG\SSDSLA 
PASQVAOVDAA 


103fi7 
1UJO t 


?A7/yt 




1044& 


1 1Q 




SPIHYILVI I ICFIHVILLI I IFSEVGS 

mmm mm mm mm m* mm mmm m mm mm mm mmmm mm mmm W *AMia mm mm mm wmf mm V mmmmr 

HSVAPAGV\SGAI I AHCNHKLL3SSS PP 

vsvs 


IvJOO 




A 


10449 


1 1 1 
111 




SLKKLVFFFS I YTHNFFPFFDPTLKNST 

mmw mmmmmmmmmmM w Mm m m wmff mm m m* m mmm m> m mmm v m mm w m 

rhs lylslsls lflshs v\ s rlecggt i 
iahcslnlpgssdphtsmsqvVagptgs 
c 




9A990 


A 


10450, 


91R 

Mm 1 O 


1 
j 


QRFWGPGPNKVFKNLGSAFFIjQRGKLFF 
LBIWGGFFFFFLVFFLRQSHSIAQAGVQ 
WCD\LNSLQPLPPRFK 


10370 


24271 


A 


10451 


179 


3 


GPFKLTLFFMGKFALQRBKCHFFFFBTE 
SCTVAPAGLQWRDLGSLQPPPP\GSSDS 

mm m m\ m mm * IS » • m—M^m* **m* mmmW^^ tm* mm mx mm ^ ^mww mmw m*uw mmw WmW 

PAS 






A 


10459 


71 


232 


SSPLSDLLFN I VLEVLAKDIMQEVE/ I K 

4M#% * mmM mmS W mmmm mtmmmmw mmm m t^|M V J m\m mm* 

LILIGKDEVKWSLIVDKIILYVKNSKH 


10372 


24273 


A 


10453 


153 


3 


ARGELYFLEIGGGFFFFFLVFFLRQSHS 
IAQAGVQWCD\LNSLQPLPPRFK 


1 ATM 




a 




JJO 


991 


YRHEPPRPVIPS 


1 A17d 


• 




1A45<J 
1 \rtjj 






YNYKYLIHYYS YLFFIFI FF /NI IHYI F 

A &t A XV 1» mJmUmmmm m m mmtmV C mmm. Jk Am mj f mm, ^ JU** A ^ m 

FILFFYLTSFIFLSYFFFFFITLF 


10375 


24276 


A 


10456 


155 


2 


DRVSLLLPRLECNGAILAHCNLCIiLGSR 
HS /PRVAGITVAHHHARLIFCIF 


10376 


24277 


A 


10457 


870 


1249 


BGPRWADHLRPGVQDHLG\QHGETPSL\ 
OKIOKLARRGGTC /LOOSQLLGRLSOEH 
CLNLGDGGCSEPRLHHCI PTWGDE\GDS . 
PKKKKE 


10177 


74.97 R 

£>*T£ / 0 


A 


1045ft 


979 


404 


KQRPGRAQWLTPVI P KLWBAELGR \ MLE 
ARTLRPAWAT 


10378 


24279 


A 


10459 


122 


362 


LSBDQLNCYPQRIiHHFIFIiPVITRVQMS 
HYLCQQS CLFHFF/S FFFLYFEMESS S V 
T\RLE CTGMI SAHSNLYLPGS S QSSG 


1037Q 

lvJ i7 


949K0 


A 

mm 


10460 




102 


AASTLALSPRLECNSAILAHCKLH/LPA 

m mmm mmmw a AH vwmp* * ^mmmrngmmmmf mmmm mm 

FTPFSCL 


10380 


24281 


A 


10461 


133 


339 


KLSKFQGS KKPE I KANSLDLALFYHNY I 
TRQCFTSYSNFFGDRVSLLSPRLECNGV 
ILAHCNL/RTPGFK 




949J19 


A 




•i 


193 


ASNMMTELKSQI P ILTLNVNGLNAPLKR 

• +mmm+mm m M • mm mm* mm* mmm** M f M A Mftl V *4 MA«»V 

HRVAGWINDPT I CYLQETYFT / CKNNHR 
LTAK 


10382 


24283 


A 


10463 


126 


314 


RLGGSTI KVQRDLS SDRSKPGRFLYTSN 
S S YVR KGKRNKACFTERVP / P VNDI I 


10383 

• 


24284 

■ 


A 


10464 


1 


332 

• 


KKNLFLS KDCLSSLFRFSKS PPAILGPP 
RGWTPKTRFIGPKAFFFKKGSLSLGRKY 
HFGLKPVP /CPPFIiSKTRFLPGVPFFFF 
FFFETESHSVA\RLECSG/AISAHCKLR 
L 


10384 


24285 


A 


10465 


173 


365 


RFVCSTIKVLRDLSSDRSNPGRFLSLQI j 
PP\LRKDKRNKAYFTK/RPSPVNDIIST 


10385 


24286 

• 


A 


10466 


335 


3 

* 


PFKKFYFKISTRALLEPVFFGKKIRVPP 
QRLSP I WAPLFKRGPPNSFQVNGVPLFF 
FFFFVRHGXiALSSGVECSGTRIiECSGAI 
/ STAHCHLCHLDASHPPTSASWVARPTG 
T 


10386 


24287 


A 


10467 


818 


1008. 


VHTDVLADiLkSlNNAEKRQT/RRCFLS 
\ WLQGHCP FLTVMMQHGLCGVELGGLPG 
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li^-Asparaginei rojincj 
Q=Glutamine, R=Arginlne, S=Serine, 
T=Threonine, V=Valine, 
w — i rypiopnan, jr— tyrosine) 
X=Unknown, *=Stop codou, /^possible 
nucieoiiuc aeietion, x^possioje 
nucleotide insertion 
















10387 


24288 




10468 


111 


1 


MVS FGRPRQADHKV\GVRDQPGQHGETP 


10388 


24289 


A 


10469 


204 


1 


LWOPILKIFCQILKTICEGNFRQPTFFI 
FFFEMESQSVTQARMQWCDLSSIjQPLPP 
GS\SNSRASAS 


10389 


24290 


A 


10470 


190 


Tin 

330 


COT Wr\TXT C T pr , T/W7TUT?TT?VncnDT T7~\r 

KGWKK\ I FHTNKNQKR I WT 


10390 


24291 


A 


10471 


162 


2 


IKGPKRAPPANKGRAFFFFFFFFETBSC 
SCLQAAVQWHDLSSPQPPTP\G9SDS 


10391 


24292 


A 


10472 

i 


120 

» 


3 


SWGFF FFFF \EVES YSVAQVGVQWCDIjG i 

^ft» » V^*W«ftL«ft^«F WW" 

SLQAPHVMILVK 


10392 


24293 


A 


10473 


298 


1 


RRALPCPANFLYFYFLSLLLFFETEFHS 
\VAQAGVRWHBLGSLQTPPPG\SSDSPA 
SGEYSANYTGEYSANYTGEYSANYTGEY 
SVNYTGEYSVNYTGEY 


10393 


24294 


A 


10474 


1 


380 


AVTGRAGSMVAPR \ PLRRLVL FYQGKLR 
SMAGNFWQS SHYLQRIIiD KHD LLNERQK 

ft^^ ■» ■ <0 ft^ ft»^^ft> »f# » AnnTwitnTT j 1 1 ' I t i t 

DLKVLSEEEYWKLQRFFTKVIOALCEHL 
KLRQ P GIATATAY FKRF YARHS LKS IDP 
VLMAPTCVCLASTAN 


10394 


24295 


A 


10475 

■ 


289 


2 


TQG FP P INNS FYALYPKQKI PFFSDKLF 
RVKGFKGPVFHYGVKDFKNKDGVFFIFF 
FEMASGSVAQAGVQWRNLGSLQPPI \ PG 
SSSLTTSAPRVAS 


10395 


24296 


A 


10476 


138 


402 


ITGRAP PKP FPLKRG YLSGGLTNlGFS F 
SFS FFETESRSVTQAGVQWRNLGSRSWL 
EST/SPDSSNS/PAPASRVAAIT 


10396 


^AA.«fti 

24297 


A 


10477 


2 

* 


336 


ARETNDTTQTRLLPANNTMTKSDPYI S I 
LTLNVNGLNTPX KR\KRVASWINKQGLM 
GCCLQETHLSSHETHNDTHKLKIKTWRK 
IYQANGKQKKARW/ XuXaVKrUrKP 


10397 


24298 


A 


10478 


51 


250 


RKVS LFKKW CWNTLFI CKK \ LS LDTYRF 
AKINFEWITDLNVREKT/ I KLLKGNI KE 
NICDLGLGKDFF . 1 


10398 


24299 


A 


10479 


216 


2 


GFTRKALSGFFKTGWFWVFPNFSKLFFK 
KNFSERFFFFFFFEMESHSVT\RLECSG 
PISAHCNLHLPGSSLV 


10399 , 


24300 


A 


10480 


191 


*> 

3 


FEMKSCS / VS PRLECNGALSAHCKLCLP 
GS CHS PARA 


10400 


24301 


A 


10481 


135 


768 


LGGQVLSCASwPVSljrPLFFFIilSFIjGC 
LLAFLSACFASTFS CVLGS PETS FSP CL 

tUrr jrZS.VLtooxlr' \rrLoKr/ ^.FUxrJri\JL7/ F 

PLPKHACPP 


f A JAI 

10401 


24302 


A 


10482 


126 


333 


MVVwyKiLVnurVi \ VJUvJXir LtLtv / www 

FliW/CSVYIVYWYGYAVVLFIFFFFFFS 
]_uj ^_ r v v_ v njjjji tt\JC 






A 


IvtOJ 






ATEKNMGLGTSVRGLSGSNPSYGRVRER 
IGGGVCLWSQILLRGC\LRWEDHLGLGG 
GGCSEPRSSHCTPTWVTEQDPVLV 


10403 


24304 


A 


10484 


136 


2 


NRPS Y I FS FFF ETE SHSVAQAGVLWUDL 
SSLQPLPP\GSSDS PARA 


10404 


24305 


A 


10485 


161 


328 


EKNFKPYGSCRHFYLFIYFETESRSVA\ 
RLE CSGTXSAHCNLRLPGSSN/ SPAS 


10405 


24306 


A 


10486 


193 


347 


XLFS FLLFIYFFETESCS VAQAGVQWHD 
HGSLRSRAPG\SYNAPALDSQGTE 


10406 


24307 


A 


10487 


114 


2 


GAWQSEVQWRDSGSLQAPPPG\SRHSP 
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X»Unknown, **=§top codon, /^possible 
nucleotide. deletion, \=possible 
nucleotide insertion 














ASASQVAGLV , 


10407 

• 


24308 

■ 


A 


10488 

i 


314 


49 


IRRSTERSEPRYSMVCRRQPRQGGPOA 
RSQLLGRLRQE DHLS P I AGDCS EPRLH\ 
LACTPAWATEQE P VLKKMS PTLFCLFLW 
PLPCQ 


10408 


24309 

• 


A 


10489 


291 


1 


KSPPPPNRRGLTLRSRGQIGNGVSFYYE 
APVTDFIFFFQSLVLSTQAGLQWCNLSS 
LQLPPPG\SSDSPASASRVAPVSQGHAT 
APQPGRQTLSSC 


10409 


24310 


A 


10490 


226 


3 


GSRCTMPRRRLAHAFPAARMP KRKVSST 
EWATEEE PKRRSARIiSAEPAPAlCVETK/ 
PEKAAGKDKSSDKKVQTKLV 


10410 


24311 


A 


10491 


2 


153 


arglvffwlvemtlchvaqAXV^l'Fs"" 

SEPP\LGLPKCWDYRREPPRPT 


10411 


24312 


A 


10492 


123 


342 


THYLCLLRFFFLDRGFLLLPRLDC/NAP 
ISPNLNIiRLPASSDSPASASRVTGITVV 
NHYARLTLLFIFONETLF 


10412 


24313 


A 


10493 


194 


2 


IPGWPFKNFSLFPLFFFFFEMESCSVAH 
AGV/GSSDFPASASQVAGTTSMRINftWL 
IFVFFSRN 


10413 


24314 

■ 


A 


10494 


373 


1156 


KVQLKYYLFFFLRLSLALLPRLECSGTI 
LAHCNLPPSRFEQFSCLSLPSSWNYRRP 
PPHHARLIFVL/ SVETGFHHVGQAGLEL 
LTSGDPPASA/ FPKCWDYRHE 


10414 

■ 


24315 


A 

■ 


10495 


181 

• 


3 


S FLWKFS LRGVPGRVRCQSA\ LLGGAS Q 
LGYSGLRDPLQEAVCPFSDLKLCAGRTT 
TLFK 


10415 


24316 


A 


10496 

• 


156 


2 


AASTTRLGDRDQPGQHSETPS LLKIQKL 
G\ SGGGHLRLRQENCLNP\GGRGC 


10416 

♦ 


24317 


A 


10497 


157 


300 


IQSWFNILKSIHVIHHSNRLK/EVNHIV 
STDAEKAADKIQHPFLIKKQ 


10417 


24318 


A 


10498 


3 


289 


LVKKMLTGSTLGKSYRHSPFSINQGHNA 
LRKAAG\ PLPRKAGY\LQGFS plrygl\ 
WD3KDLT IHQPDTREGS VLS RI S KRGRP 
L/CS CLPLGTECLS I 


10418 


24319 


A 


10499 


138 


1 


WYGLI YLLLEME S CYVAQAGVQWHDLP S 
IjQPPPP\NSSNSPASSC 


10419 


24320 


A 


10500 


138 


I 


WEGLI YLLLEMES CS VAQAGVQ WHDLGS 
LQPPPP\NSSNSPASSC 


10420 


24321 


A 


10501 


200 


343 


LEAHS FLS FFIF 1 1 1 FFDTKSHS VT\RL 
ECSAAISAHCSLLLPGSSNS 


10421 


24322 


A 


10502 


84 


339 


PRDGKRRDKQKQBTGGEHYKS QLRGRLR 
QEDHVSPGGRGCSEPRSRHCTPAWAT\B 
SNSVSKKS 


10422 


24323 


A 


10503 


141 


362 


EWNCCVCC/ CCCCCCRCF 


10423 


24324 


A 


10504 


198 


320 


TIKKQKNTGGLIFNSYILPPLFLEPGDL 
\RFLDVDNRVKLPV 


10424 


24325 


A 


10505 


30 


356 


G YPCNP I LY YMSTVGFTTSLLDFD I PLB 
HPILKVCPTHTRALRDLSSDRSNPGRIL 
STSNSSLY/EKDPRNKAYFTK/RPSPVN 
DUST 


10425 


24326 


A 

V 


10506 


16 


375 


KKFFKKKKKKI FFP P FPLKNFFF PKRVN 
FFGRGWPKKS PPKKKVFFKKFPGGFKKP 
PLKKKKIFFSPPVKMGPPKGFFKKGPPP 
FFFFFFFFFFFFFFFFFFFFL/LCLIPR 


10426 


24327 


A 


10507 


156 


2 


AASTTRLGDRDQPGQHSETPS LIiKIQKL 
G\SGGGPLRLRQENRLNP\GGRGC " 


10427 


24328 


A 


10508 


313 


2 


QSKWLLEEESTLGEDAVNIVEMTTKDLE 
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OCystetne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparaglne, P=Prollne, 
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T=Threonine, V=Valine, 
W=Tryptopban, Y^Tyrosine, 
X«=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


■ 




• 








YYRNLVDKAAAGFERIDSNFESSSTVAK 
ILSNCIVC/Y/REILHEGKSRPMWQTSL 
S YFKELPQTTQHSAI ITLTSPRA 


10428 


24329 


A 


10509 


325 


1 


ECAKEMNAETKRNIAIDATETQRIIRDC 
YEHY1OT1ENLEAMDKFLDTHNLPRLNHE 
EIENLiNEPIMSNETETV/ 1 /NSLPSKKN 
SGSEGFTAEFYQRYKTPLILKLFKK 


10429 


24330 


k 


10510 


126 


14 


DSSGQVQWLKP 1 1 P V/ LWEPEAGRS PEV 
RSSRPACPTW. 


10430 

• 


24331 


A 


10511 

i 


435 


769 


PFKFPPLSHACDGGSLFCRLRLECSGVI 
T\AHCKPQLPG/SQSDFPASTSQELGP/ 
TGACHYTRIil CCREEVL\HVAQAGLELL 
CPKDPPTSASQHARDYRH\GHPAQL 


10431 


24332 


A 


10512 


147 


412 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNE 


10432 


24333 


A 


10513 


251 


466 


QTCP TGS PASLLEL FNS IATQ WELVRSL 
IAGNALKDEND\SAVKMLASLIMSYKAA 
AAEDYKADCPPRNPGP 


10433 

• 


24334 

■ 


A 


10514 


286 


1 


SFLRKFCLRGVPSRVKCQSA\LLGGASQ 
LG YSGVRY PLEEAVG P FSHLHLHAGRTT 
TLFKAVRQEHLS LQROE LLPLVGRVGRPN 
DSRCDRVIGH . 


10434 


24335 


A 


10515 


28 


428 

* 


RFVCSTI KVLRDLS SDRSNPGR FLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10435 

■ 


24336 


A 


10516 


263 


2 


I KKPADS E S AGAL I LDFPASQTVRNSFL 
LLTSHSVCYPCYSS PNGLRQQLKI I IFF 
FFLRQSCSV\SRLECSGMILAHCNLCLP 
GSNN 


10436 


24337 


A 


10517 


62 


292 


PATLFSGVI LPPTNS INAFSGSDSKSAH 
K/CEMLGTENFNFFTCHTHI IAEHSNST 
HTHTHTHTQTHTHTHTPLLFYB 


10437 


24338 


A 


10518 


172 


2 

■ 


RGKKI FFPLEKGKNLVWWITALFFFFFE 
TDSHS V\ TRLKCSQTXLYHCSLCLPGSS 
DF 


10438 


24339 


A 


10519 


42 


194 


I LVETF \CLRFVSIiENLVKMRFIiKLKKK 
KKKKIOCKKKKKKKKKKKRKKKKI 


10439 


24340 


A 


10520 


120 


I 


I FAKRWPI KFFPFFFFFFETVFHS\VAQ 
AGVQWCDLGSP 


10440 

- 


24341 


A 


10521 

• 


202 


375 


S FLWKLHLSGVPGHVRCQSALTGG\ASQ 
LS YSGVTDTLEEAVCP FSDLKLRAGRTT 
TL 


10441 


24342 


A 


10522 


272 


402 


PKIFYVTKKAWNYYPYTITEYTCSF/IR 
KFFIHIETKYEDNKGSN 


10442 


24343 


A 


10523 


147 


439 


IiILYVNYISRKSTVYDSRHPCTYYLIPE 
KNCYKYHCNFIjCT/SSSIPFPTPSTPSR 
ASPKKKKKKKKKKKKKKKKKKKKARGG 


10443 


24344 


A 


10524 


7 


415 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/ rps pvndi is 
T 


10444 


24345 


A 


10525 


321 


442 


TSFSFLFFF/LETEFHSVA\RLECRGTI 
LVHCNLCLPGSSNS 


10445 


24346 


A 


10526 


477 


3 


IEGKLPNRKDIHTKNPSVCRHHQRPKVD | 
KTTKMGKKQ3 RKTGNS KNQSAS P P PKER 
SSSPAMEQSWTEKDFDKLREBGFRGSIY 
SKL/REEIRTNGKEVKNFEKBLDEWITR 
I TNAKKSL KDLMELKAEAQELRDECTSR 
AASANCVAVPQAFDRSHGVQ 



7 
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SEQH) 
NO: of 

n 11 pImiHHp 

11 U LICV UUv 

sequence 

• ■ 


SEQ n> 
NO: of 
nentidfe 

sequence 


eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 
beginning 

nucleotide 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
seouence 


Predict- 
ed end 
nticle- 

otide 
location 
correspon 
Hin? to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A»Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenvlalanine. 
G=Glycine, H=Histidine, I=Isoleucine, 
K= Lysine, Lr=Leu cine, M~Metb ion i ne» 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argioine, S*=Serine, 
T=Threonine, V=Valine, ; 
W=Tryptophan, Y=Tyrosfne, 
X=Un known, *=Stop codon, /^possible 

m Wt mm m^mm mr w w m* m mp ^mr ■w-»^ •^•mw -mmt -mf mmm W mmr ^W'mm-mr mr mm* m-mmt 

nucleotide deletion, V=possibIe 
nucleotide insertion 


10446 


24347 


A 


10527 


265 


411 


. RFVCSTI KVLRDLS SDRSNPGRCLSTSN 
S/S PVRKDKRNKAYFTK/RPSPRN 


10447 

• 


24348 


A 


10528 


81 


390 


RFVCS T I KVLRDLS SDRSNPGRVLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDII 


1A/MP 
HrrrO 


24349 

i 


A 


10529 




23 


RI FLFYFHS FLRQGLARAEGQ PHNHS S L 
QPPPPG\SGDPPASASRAARTTSAYHHI 
PLFFFRSYCVAQAKIINMIiSLL 


10449 


24350 


A 


10530 


10 


398. 


RFVCSTIKVLLDLSSNRSNPGRFLSTSN 

ssly/ekdkrnkayftk/rpsp 


10450 


24351 


A • 


10531 


289 


423 


RFVCS TS KVLRDLS SDRS S PGRFLS TSN 
SSL /ORKDKRNKAYFTER 

mmvmmT mmW f \£ A\ftw> Mr\m\ M> m± mm mm Wmg+% 


10451 


24352 


A 


10532 


152 


401 


PHCS I HSQVR I QANVHKGHRQRTYGS VI | 

PHILPLHVLKKTFSLRDFHFSVSLKKNL 

VLTCLHLFL/GVRTPRNDPFVSMMLLFT 


10432 


24J53 






2 




LDPT qLLLPP^KCSGPVSAHRtilVWiP /6 
FKRFCS LGLPS S WDYRRAS PRVAKFVFL 
VDTGFHHVGQAGLKLLTSSDPPSLTY/P 
QCWDYRREPHLAAF 




24J54 




1U534 


4D7 




rl X MOiuiifiHV. w tr v/ inCinivuUiOiJii 

PGKQRL 


10454 


24355 


A 


10535 


354 


47 


KP KNLPGGGGP P FYS SFS KGKKIPLPW K 

KKERKKKKMATVQKGM/ PHKYYHGKTGS 
YNVI QHAVGNAVTNRTRGS 


JU4jj 


24 J jo 


TV 




1 




TPTf KT? FPPKKPTfGPRFPPPlCKirFCKT F<! P 
x jr x\.i\_r\.r r xr xvcvtr rv\j v c c c rrxwiuuu\ir o t 

PPKFWPPQKNFKKAPPPFFFFFFFFFFF 
FFFFLKSNPFFFFF/ sknpilkpplppl 

FRVFPPPKKKKKKDP 


10456 


24357 


A 


10537 


204 


435 


TSGSL YKLLGMGDLGLGKTT I FKQGGHQ 

NSS \ QWHPEATTG / VNWAFKQIiHWDPKA 
now \vnnircini iw^ •iinnciwunnwcivn 

LAHLQLWNIAL/ QER/FGKMTKACY 


10457 


24358 


A 


10538 


208 


382 


RFVCST I KVLRDLS S DRSNPGRFLFTSN 
SSLFER/DC3RNKAFFPK/RPSPVNDIIS 
T 


10458 


24359 


A 


10539 


63 


409 


GWRLTNFPVGKTPSTSGTGLLFIFLLLN | 
LMGRLKP S S I SM KRE FRKQNRGQKC / E F 
MKKKKKKKKKKKKKKKKKKKKKIGGGAF 
LKIFRGGPNFPGGEKIFFFFLGGGIKSR 
GGGFG 


10459 

• 


243^0 


* 

< 


10540 

■ 


3 

■ 


1322 

- 


RGYAWPNGALPASTVPCGFAACPGEFLC 
SVNGRQAPAGSSGFLPPVPSLCPHTVCR 

APIPSVPGVPCGTFTFQCEDRSCVKTLS 
A55 PIiPUjIIjCPPHHIj PPSPDCGLOGPSSR 

IVGGAVSSEGEWPWQASLQVRGRHICGG 
AL I ADRWVITAAHCFQEDS MASTVLWTV 
FLGKVWQNSRWPGE VSFKVS RLLLHP YH 
EEDSHDYDVALLQLDHPWRSAAVRPVC 
LPAR/ SENREQGLQ /CS WITGWGALREG 
GPI SNALQKVDVQL I PQDLCSEAYRYQV 
TPRMLCAG YRKGKKDAC /QGES PG 


10460 


24361 


A 


10541 


154 


3 


GVGNFFFFFLRWSLTLAQAGVQWRSLGS 
LQP / LTSGSLFRQFSCLSLPSTRP 


10461 


24362 


A 


10542 


3 


406 


LMVLWTAHLPALAPGSRTSLLLAF\ALL 
CLPWLQE\AGAVQTVPLSRLFDHAMLQA 
HHAHQLAIDTYQEFEETYIPKDQKYSFL 
HDS HTS FCFSDS I P T PSNMEETQQKCNL 
ELLRISLLLIKSRLEPVQVLTSMF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 

out 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PnenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K«=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginirie7S fiS Serioe, 
T=Tnreonine, V=Vaune, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
n ucleotide insertion 


10462 


24363 

■ 


A 


10543 

• 


3 


414 


HSSAHAYGTYESSGKRKYLQRPSRTAEL 
KKALKEKKNRTtTiTiQQRSKSVTS SS SSSR 
DSSARDSSSEREETSTSSSSEDSDTDES 
SSSSSSSAYSTNSSSSFDSDSDSSS\SR 
GRSTSTDSSSADD/STNEEESELE 


10463 


24364 


A 


10544 


240 


509 


T£r"Tsn itiXJtTllLGEFCKCllNriFGl /GKFR 
MKVACQIETLGILSFFFETESRSVTRLK 
CGGAIIiAHCKL 


10464 


24365 


A 


10545 


275 

• 


2 


GCSRP FDTAI pLMGP YPEEKKLLFEKNK 
/DTCTCMFIAAHFPVAKMWIQPKCPS IN 
EWIKNPVTHIYIHTHTYMCIYIYTHICV 
CVIHIYI 


10465 

• 


24366 


A 


10546 


233 

• 


2 


WSWMAIQSVLPQTAFTELKNKYSCIQET 
P ILF YFLKQSLPLS PS LECSGVI IAHCN 
/LQ/RPGSTTFPASASKVERTTGA 


10466 


24367 


A 


10547 


2 


420 


KKKI PNRGGCLKRDTKAFLKTPPMLQRR 
TPPAFPKGPGGFLPDFNPLPPSKGNPPI 
YSWERFFPRFFKNIWAPVPKANLLSFFF 
FSI FWRTGSRYIAQAGLELLGSS YPPAS 
CLPESWDSRR\DHRJ^RPTI 


10467 


24368 


A 


10548 


190 


1 


AASTFFACVCFRIiFFyFVFETESPSVAQ 
AGLQWHSHSSLQPPPNLG\SGNPPALAS 
PVAGNC 


10468 


24369 


A 


10549 . 


155 


1 


ARGEKYFLEIGGGFFFFLLVFFLRQSHS 
IAQAGVQWCD\LNSMQPLPPRVK 


10469 


24370 


A 


10550 


223 


392 


RFVCSTI KVLRDLSSDRRNPCRFLSTLN 
/SLAVRXDERYKAYFTK/RPDPVYDIIS 
T 


10470 


24371 


A 


10551 


183 


3 


WQKLLFL FGTES CS VARVGVQ WRHLSS P 
KPPPP\GSSNS/PASASRVA\TGAHLHA 
RIiIFVFL 


10471 


24372 


A 


10552 


297 


10 

■ 


TDEELLCMEEKRKWFF \DMAS I PGEDTV 
NTDEMTKT / DLB YYINLVDKVSAGFERT 
OTNF\ERSSTVSKMLSNSITCYRKI/F/ 
RERKSQQMWQTS FLSF 


10472 


24373 


A 


10553 


509 


3 


LLTDRTLS CRSVGVPCSVRCQCA\XiLEG 
ASQLG\SRGSGVRDPLEEAVCPFPDLQL 
RTGRTTALFKAVRQGHLSIiQR IJiTjSF\ V 
CLCPAPRGGAYRGTQASLSCGGLHPVRA 
SRIjLCLPKQAWANVGTP PP \ TSLP PCSL 
ISDCCASSQRDS/VGVGPSKPGAGYNLV 
VRRF 


10473 


24374 


A 


10554 


262 


453 


GWGILTANLLVFGGGSLSISERI FFSPIi 
ATGFCS VAQAGVQW\ LI IAHCSLE FLGS 
SD 


10474 


24375 


A 


10555 


112 


372 


KKKGGGPPFFFFFIFFLIRQRKAKIiWL 
AFNRRGLRKPD/ 1 FYYAGVGKI VKKKYS 
GFF IEL V i I UUKNYIjMCALiAVIDPRDSN 
I I RS 


10475 


24376 


A 


10556 


50 


358 


RFVCSTIKVLRDLSSDRSNPGRFIiSXSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10476 


24377 


A 


10557 


203 


350 


QLHADKVSRFATHAE CRGVIMAYFNLNL 
PH/FKRSSCLSLWSSWDYRHAP 


10477 


24378 


A 


10558 


98 


387 


I QDTDL IRGVYRYRDKKRAREKDS PPER 
EGERERE I E KQRKKER/ ERKKERKKEST 
HQKE 


10478 


24379 


A 


10559 


237 


3 


EVFFKNIFEIESCSVTQARAQGQDHGSL 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 

w 
"* 

- 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M, 
etb 
od 

■ 
• 


SEQ ID 
NO: in 

09/515,1 

* 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
pep ii u e 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding 10 
last amino 

BCIU 

residue of 

peptide 

sequence 


Amino acid sequence (A»Alanine 
C=Cysteine, D=Aspartic Acid, 

17 f*\ ., i j-,.. L jj. A —?-J Y~\ 1)1. _ . • 1 1 _ _ • _ . 

E=v>iutamic Acid, F=rnenylaianine, 
G=Glycine, H^Histidine, I=Iso leucine, 
K^JLysine, JLPLeucine, lVr^JYietnJonine, 
N=Asparagine, P^Proline, 
i^triuiamine, K s Argwtne, apocrine, 
T=Threonine, V=Valine, 
w— a rypiupnan, i— tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














QPPTPG\SSSPPTLASQSAVIT3VNHHT 
SjbljLi 1 r U b KAXAs KML. 1 b sCsu 


10479 


24380 


A 


10560 


37 


359 


RLKCGGP VTAYCKLE I LGS SNTPTSTS W 
VAuTT / 1 UlHAQ 


10480 


24381 


A 


10561 


170 


3 


RPFFLPWGGKKFQGVSFFFFFFLVFFEI 
GSHSVP\RLKCSGTITAHCSLDLLGSSN 


10481 


• 24382 

■ 


A 


10562 


• 

1 


388 


IjPKFijMvLKliU^ijVAMAl^bKTTlaljX 

FALLCLPWLQEADAVQTEPLSMLYDHAM 
LQAHRAHQLD I DTYHELEETY I PEDQKH 
SLLHDSQTSFCLSDSIATPSNMEETHQK 
SNLBMVRISLLLIETWLB 


10482 


24383 • 


A 


10563 


224 


392 


CVDP ILKWFPLRTATRQGCPLS \LFNI V 
LEVLATAIRQEKEIKDP/QIGKKBVK 


10483 


24384 


A 


10564 


185 


2 


RYFGGPDSPQNGKTVFNPLFFFFETRV\ 
HSCCPNESNGTISTHCNLCLPGSSNSPA 
SASQVA 


30484 


24385 


A 


10565 


175. 


324 


APGYGTKYRVI LCKII1LLRRI1RHKNHLN . 
PGGRGCSEPRSHHC/ITPAVTVTS 


10485 


24386 


A 


10566 


3 


367 


HELLWSTIKVLPYLRSDRSNPGGVISTS 
NSSLC / E KDKKNKAYFTK/RPSPVNDM I 
ST 


10486 


24387 


A 


10567 


128 

■ 


391 


GETCEGVYGPQQGGKQTWKVFHS KKDBG 
P CS KTWDVGLGSS PQFI FLKKVRFFFFF 
FETESHSVTRLECSGTPGT/HCNIiCYPG 
TS 


10487 


24388 


A 


10568 


251 


485 


SFLWRFPLKQVPGHVRCQSA\LLGGASQ 
LGY SGVRD PLEDAVCLLS DLKLCAGRTT 
ALFKAVRQGHLSLQRFLLPLFSY 


10488 


24389 


A 


10569 


176 


3 


FVP P I LKKMG P FLKKKEGKP FFFFFSEM 
ESRSVT\RLECSGA/TISAHCNIjCLPGS 
SNS 


10489 


24390 


A 


10570 


212 


386 


RFVCSTIKVLRDLSSDRSNPGRFLETSN 
SSLY/EKDKRNKAFPTK/RPSPVNDIIS 
T 


10490 

■ 


24391 

• 


A 


10571 


134 


489 

■ 


SLF FLPS PRDRWSLTQSTEMPPSWKFPG 
REIRSPLIiAMLHSIiLiLQPTAFLCPSSTL 
LLSLKPDSLRTGSLFLSFFS FLYPEACS 
VA\RLKPSGTISAHCNLCLPSSSbSPAS 
AYXvSG 


10491 


24392 

• 


A 

■ 


10572 

• 


1 

- 


470 


GQSRGI PLLLTLDLEKPVSLLLSVTMLY 
SKNSAQFSTILQTLSFPATFTPSPSIPL 
SSAYFFFFSDRVS/LLSPRQECSGUJLG 
AHCRPQPPRFKRFSCLSPSS\SWGLTRR 
HTTTPGL I FC I F / S VEAGFHHVAQTGLE 
LtL>bJjbiN PPAbA/ FPKC WDHRR 


10492 


24393 


A. 


10573 


242 


410 


VMLRMQI YFSLHNPI FFFFETGSHFIA\ 
RLECSGM I PAHCNIiCFPGSSNAPAS ASQ 
V 


inhere 

• 

* 








236 


1 


CPSVLLDRDSLSERVCEILCIHAC/VMR 
IHEHTHTY I YAHTHTHTHTHTHT 


10494 


24395 


A 


10575 


241 


35 


RSS YLFI FNFFVETRS VAQPGLKLCCSA 
WAGTPG\SSDPPTSSSQS7\GI IGMSLNIt 
FLQWLIIDQACarrN 


10495 


24396 


A 


10576 


294 


72 


FKNFKS PLFSLFFFLILGTRFCYVAQAG 
LK\SWPQGDPSALASRVAGTRGVHRHTQ 
LQVSFNYYKVLAMHSGSQL 


10496 


24397 


A 


10577 


1200 | 1625 


PDGWSYSSGHSWLPEEASNGRKAWSSCW 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
OCysteine, D=Aspartic Acid, 
E=Glutamtc Acid, ^Phenylalanine, 
OGIycine, HNHistidine, I=Isoleucine, 
K=Lysine, X^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


■ 


• 


■ 




- 




vppqlsppqappspathlgptppla/sp 
pqappppathlgpapppplssapXpppd 

TLLGPAAPPPPSSGPDPFRYSP/EVPPQ 
LSPPQAPP 


10497 


24398 


A 


10578 


226 


325 


ERHTHTPHSHTRSHTRLTHTHA/ YTHTH 
AHTHA/YTHTHAHTHFPS 


10498 

• 


24399 

• 


A 


10579 


108 


320 


T INVFLKVNVMKLS YLRFNTRNCS FFFL 
AGVQGCDHGSLQLQPPG/SSCDPPSSAS 
RVAGTTGAYQHTQLI F 


10499 


24400 


A 


10580 

i 


1 


288 


ARGERERERERERERERERERERERERE 
REREREREREREGLSFFFSRSRGGGKLW 
CVC/RPSARPRERERHAVFSAPPSLQKK 
KKVFGGREGTHSLFYPPCGNSIiRR , 


10500 


24401 


A 


10581 

• 


1 


476 


REWG LAIiS PRLE CSWI XAHCS DGSLGL 
KRFSCLRLPSWKDYRCAPPCPSLNLFFV 
ET\GPHYVAPGWSLLLGSSD/PPPLASP 
KCRDYRHE 


10501 


24402 


A 


10582 


210 


346 


PHCS IHSQVRIQANVHKGHRQRTYGSVI 
PHILPLHVLKKT/FSLRDF 


10502 


24403 


A 


10583 


316 


461 


LPNI LGWLFFVSETGFCS VTHAGVQWC 
DEHGSLQSQfIFG\SS|PLPTSD 


10503 


24404 


A- 


10584 

* 


172 


383 


NLDL YLTS YI KINS KWI IDLNVIAKTI K 
IiLE \ KNVAGENI PDLGLGKE FLD\KTQK 
ALS\TNKKIDKLNFTR 


10504 


24405 


A 


10585 


273 


374 


GPGTEAHTRNPSTLGGRGGAIT/R/GQE 
FKTSLAKM 


10505 


24406 


A 


10586 


150 


344 


KLKCKI 1 1 KLGI KI SFAVI WVEIGFHF 
CYLGWVQW\PIRTHPGSSDPPASASQSA 
GITSMSHHT 


10506 


24407 


A 


10587 


199 

• 

< 


3 


SFS YIRVHLGHI KHTRHKASCEVYANCK 
AKGIITGIKKK/RFWPGTVAHACNPNTL 
GGRGRWITRLV 


10507 


24408 




10588 


216 


1 


PRGAPTCMRCQSAL\LGGVLQLGYTG\V 
RDPLEEAVCP I S E L KRCAGRTTALFRAV 
RQGRLSLQKFLLPFLV 


10508 


24409 


A 


10589 


190 


3 


I SAQPWAAFEKRFI FI YRAPA/ LKFMI F 
FSPFKGVPFFFTFFFFFEMBSRSVARLE 
CSGVISAH 


10509 


24410 

• 


A 


10590 


369 


40 


KTERNS ININKNDVHTKTPSKGRQHQRP 
KADKSREMRKNQHKKAENSKNQNPSSPP 
RDHNS S PARKQNWMENEFDELTEVG/ FR 
KVGNSSELKEHILTPCKETKNRTRGW 


10510 


24411 


A 


10591 


114 


398 


RFVCSTIKVLRDLSSNRSNPGRFLSTSN 
SSLC/EKDKRNKAYFTK/RPS PVNDIIS 
T 


10511 


24412 


A 


10592 


132 


1 


I LYLWCVCVCVCVCVCVYLLILV/ CKVL 
S CLGE PALS VGAFQMS 


10512 


24413 


A - 


10593 


3 


287 


IGFITLNVGLYLYLLCYIYVYFFQVNYI 
VFILVIAVPLLKNRLLRYNTALYNEEVT 
REKISHYFI/NSYIHTVGKINKND 


10513 


24414 


A 


10594 


2 


363 


RAEVGVMAPLYLLLLLSGALGLTDTWAG 
S HSLRYFSTAVSRPGRGEPRY I AVEYVD 
DTHFLRFDSDTAI PRMEPRKP WVEQEGP 
QNWE \ WTTGYAKANAQTDRVALRNLLRR 
YNQSEAGE 


10514 

* 


24415 

* 


A 


10595 


156 


3 


NGLELDSDDGClALXiFFYFETEFCS\VA 
QAGVQWCDLGSLKPQPLGFKPTRP 


10515 


24416 


A 


10596 


80 


306 


MDEELLFRDEQRKWFLEMES TGE VAVN I 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

« 

• 


SEQID 
NO: of 
peptide 
sequence 


eth 
od 

■ 


SEQID 
NO: in 

09/515,1 
Zb 

• 


Predicted 
beginning 
nucleotide 
location 

Am* At* mma»^Am\A^. tt*. ^pm At*, m*m* fm 

correspond 
ing to first 
amino acta 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
iLr=v*iutaiTnc Acid> r — r neny laian we, 
G=Glycine, H=Histidine, I-Isoleudne, 
iv— juysine, lf 1 leucine, ivi s ivietnponine, 
N=Asparagine, P=Proline, 
1^=^71 uia mine, k— Arginine, cF = &enne, 
T=Threonlne, V= Valine, 
W=Tryptophan, Y=Tyrosfne, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \™po$siDie 
nucleotide insertion 








■ 


■ 




V/EMPI KDLG / Y YRKLVD KAVPGCERVD 


10516 


24417 


A 


10597 


232 


389 


ILCWVFCCCCCCCFETBFLVIQAGVHWC 
DLDSLQ/ SPPPGSSDSPASAS 


10517 


24418 


A 


10598 


A aT 

463 


154 


M/™i/"V^ VtJODBT ODUDT7VT nnnr VnDOOI O 

prvqgrakkvpgtlaswgqkillpqppr 
/rvrqenclnpggggfsepkippsfpaw 

Gil KGGo JuIjKJtt JjRJtKJv 1 l\H 1 1. 


10518 


24419 


A 


10599 


208 


38 


LGPPQGFFKTAPPFFFFFETESRSLAQV 
GVQWWDIjGS JjQLiP PPG \S SDS PGLiSKKF 
LZi 


10519 


24420 ■ 


A 




200 


3 


CQPELSCMRC\RQSLLGGLSPSGGMEVR 
D PVE EAVCP LAE L KRCAGRALLVR I CCS 
LQSQQAGTFK 


10520 


24421 


A 


10601 


73 


427 


RWCSTl^febLSSDR^PGRKSfSiT 
SSLY/EKDKRNKAYFTK/RPSPGUDI IS 
T 


10521 


24422 


A . 


10602 


193 


401 


j**A m 1 ■ W*. ^ J"l tat— V «M /U**Mf « • * Jill 1 T~l f _ / W 

GEVSLLSPRLECNGVISAHCNFR/LPGF 
KRPSFFFL 


10522 


24423 


A 


10603 


181 


395 


RFVCSTI KVLRDLS SDRSNPGRFLSTSN 
SSLY/EKDKRNKAYPTK/RPSPVNDI IS 

T. 


10523 


24424 


A 


10604 


384 


10 


PRRPGGECVHCLGDSLVTDRRYSGLRS S 
DQTLSGDPMEAVPWSNSLSCSHYRLKPA 
SGITWGCKQAGFFSFFLFFKMESHSV\T 
RLECS GAISVPCKLCLPGPSNS PAPGGS 
LGPRSSRPPWTT 


10524 


24425 


A 


10605 


215 


379 


RFVCSTI KVLRDIiSSDRSNPGRFLSTSN . 
SSIiC/ EKDKRNKADF1K/ RPSPVNDI I 


10525 


24426 


A 


10606 


1 


163 


QTHRE PAMVLS PADKTNVKAD \ WGNVDA 
HAGEYGAEALERMFl^FPTTKlTcFPR 


10526 

• 


24427 

• 


A 


10608 


283 


2 


IQKHGLI IRCPQETPFKNNDVGMQKVKG 
KGEKbXMQILt/ Y r Jfr b XjNo/HIjSPRJjB/ 
CSGAISAYCNLRLSHSSDSPA/VDGTIG 

jv ^TJ U'PrtT TT7T T7T Ty w 1 
AL-xirl 1 (JIjX r jut JUVCt 


10527 


24428 


A 


10609 


144 


327 


CSWSWFCGCVRVSTLIKKKWFFFEMBFR 
S\VAQAGVQGRDFGILKPPPPG\SSDSP 
ASASRV 


10528 


r\ A inn 

24429 


A 


10610 


137 


1 


EEIMSFGKTWMELENIMI 


10529 


24430 


A 


10611 


201 


334 


LGYSGVRDPLEEAVCPF . 


10530 


24431 


A 


10612 


146 


2 


KTPLKKGLKRVPFFFFFETESCSVTQAG 

VQWnM XGalJU V ir f i* \USRIiSF 


10531 


24432 


A 


10613 


205 


259 


TTMPGLSFCTRTKSNKNLS FLTLDGSAR 
VDJjFIiX liiiLiifK 1 1 rUNir/ CUKjDK VXjLCC 
PGWSQTPELKQLICLDLPKYWDHRYEPL . 
CLAYPFALGQNLTKI 








10614 


IDS 




V? V JLum V OJUU V UOL'tXirliVUf CwC JJX* f / uf IS X 

QSHS I \SRMECSGVI SAHCNLCLP 


10533 


24434 


A 


10615 


213 


8 


KPPFFFPFFFLFFFLRWGSHCWSGY/C 
KGTIIVHCGLELLGSSCSPPAFQVSGIT 
GLCYCASIFIFEE 


10534 


24435 


A 


10616 


146 


1 


ILTIRDYWVKITLFFFFEPKFHS / VLPR 
LECSGAI S AHCNLCLPASSDS 


10535 


24436 


A 


10617 


208 


2 

* 


NNFWFSSSGKYGQYFSSETESRSVTWGH 
DLYSLQPPPP\GSSDSPASASQVAGITG 
SCHHAQLILVFIVE 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 

_ 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

eth 

oo 

• 


SEQID 
NO: in 

09/515,1 

ZD 


Predicted 
beginning 
nucieotiae 
location 
correspona 
ing to first 
amino acia 
residue of 

npnfiH a 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
aing to 
last amino 

residue of 

pcpiluC 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
£s=vriutamic Acid, r— .rnenyiaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 

l/" — T \rc ina T • — -¥ Aiming K/IaA/|A4liSAnina 

jv^iiysine, jl*— i^eucme, ivi— lyiemionine, 
N«Asparagine, P=Proline, 
v^ = v>iuiainine, iv— Arginine, o— ocrine, 
T-Threonine, V=Valine, 

X~Un known, *=Stop codon, /^possible 

ilUCICvilUC UCICIIOIl} \ B pU»ll/IC - 

nucleotide insertion 


10536 


24437 


A 


10618 


120 


2 


SFFFFFETBFRSVAQAGVQWHDLSSLQP 
PPPG\SSDSPSL 




244 Jo 


M 




1 02 


J 


LGYSGFWHPLEEAVCPFSDLTLCAG 


10538 


24439 


A 


10620 


193 


419 


TFFF FETES RSVAQAGVQWRDLGSLH/S 
QPGQQSETP 


10539 


24440 


A 


10621 


313 


460 


KPGLWRGTRSQKQVLFFEPESHSVA\KL 
E CGGTI SAHCNFCIiPRSSDr SA 


10540 


24441 


A 


10622 

i 


243 


505 


PTGCPKQEQVQAMLRPWFFFVFETQFN 
S/VPRVECSGTISAHCNIiRFPGSRDSPA 
LASRVAG I CRR/ CATTAQLI FVFLVETG 
FCHLVG 


10541 


24442 


A 


10623 

• 


6 7 


430 


LARRTWKNQHSSTHGCDLFFFFFFFFFF 
L I KKGVFFF PP EGGHGHRFFF I N I KFLN 
YKRFFFCLFEDVLMI SVGPPLAFFFFFF 
WRGGFSYliLLRCV/LTFLALGAPLFAPM 
FFLFIiFGEEL 


10542 


24443 


A 


10624 


3 


207 


PGGQLGSECSGVRMDEDVL\TLKSLIIG 
ESGVGKSSLLLRFTHDTFDPELAATIDP 
S S VVNPARNAATP 


10543 


24444 


A 


10626 


151 


380 


KMLFGQVQWLTP I I P^LWEAEAGGSLE \ 
LRSSRPALKRSETSSPPSLPCQKKKKKK 
KFPRAGGQMIiEVPFPGRVGAG 


10544 


24445 


A 


10627 


210 


3 


NFCQVKGFWKQKS SQ I KGFGKNKKKKGG 
SFFFFr rETESl^VT\RIjECSGTI IVHC 
SLDFPGSSDPPTSA 


10545 


A 111/ 

24446 


A 


10628 


237 


2 


LiCr JjfatjAi) P Y IDV Y I b VNCjVNPTr I 1 
LTNKTNTFGPLFLI I FFFGETESLSIA\ 


10546 


24447 


A 


10629 


1 


352 


RGPLS IQDYISKALFFLLLFGFLLLLLF 
FEVRSCSLTQAGVQWCNHSHISSLQSPT 
PG\RSPSDPPASAS 


10547 


24448 


A 


10630 


48 


200 


AlKPKNFLFFSETKSSb'VTQAGVQWRDL 
SSPQPPPP\SSSDSPAPAN 


10548 


24449 


A 


10631 


209 


376 


SFLWKFCLKGVPGCVRCQSA\LLWGASR 
IiGYIXjVRDPLYEAVCPFSDLKLCAGRTT 


10549 


24450 


A 


10632 


141 


1 


AGFFFFFLFFFLETESRFVA\RIj\QCSG 
AJ. 1 An UJ Jj Ldj Wijb SJM inAb 


10550 

■ 


24451 


A 

• 


10633 

- 


18 


522 


PLYSLTKRHTPREA/ kdnlkstqllsvi 

DAXSEOPIIIiOPVIXHjKSVIjUNST^ 
EGNTNISGVTWFRAGEQBQTPPEGFES 
SGSETVLGTEVKYDTPITRTITSANIDR 
LRFTFGVQALVETTSKGDRNPSEVRLLV 

v 


10551 


24452 


A 


10634 


127 


2 


NLSPLFFFFETGSRSVTQAGVQWSDLGS 
LQTPPP\GSRHSPTK 


10552 


24453 


A 


10635 


336 


452 


RI cr Flir ETEALtS VAPAVAQWFDJjGSIjQ 
SPPP\GSSESPA 


10553 


24454 


A 


10636 


259 

• 


3 


GTRIFFPFNSSPLSFPPVQKKSVSPFFF 
SF/LFFFFETESRSVT\RLECSGAISPH 
CKLHLPGSRHSLASATRNVLRPLWRIHA 
SCA 


10554 


24455 


A 


10637 


96 


413 


GDNTSQHSAI LINYFI FFFTFLFF/ETG 
SHS VAQAVRLE YSS VI TAHCN FRLRGS S 
NPPKKKKKGGAVLKDPWGGQSLPGLATY 
YFFPYRGANKNLLGDFWEGPLFV 


10555 


24456 


A 


10638 


180 


473 


CYMGRKAWLVLSNLLTLKKKKKKKKKKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptioe 
sequence 


eth 
oo 

■ 


SEQID 
NO: in 

09/515,1 
26 

• 


Predicted 
beginning 
nucieoiiae 
location 
correspond 
ing 10 lira i 
amino acid 
residue oi 
■ peptide - 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 

cnrrocnnn 
CUlicopUIl 

ding to 

tact a m inn 
lad I Ullllliu 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Add, 

G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionlne, 

M^A cno ram n a Ik— Ovn lino 

ii — /\s|j2tragine ( r^rTOunc, 
Q=Glutamine7R=Arginine, S=Serine, 

W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide Insertion 














KKKKKKKKKKK/ EMPVKQKNRGRALYKK 
KKGGRPFPIFWPFFFFFPLOGGKKHPGO 


10556 


24457 


A 


10639 


312 


480 


STVDSTSGEDDVNIAEMATKDSBDYINL 
V\QKAVAEFERTDSNFE\RGKMMSKSI 


10557 

* 


24458 

i 

i 


A 


10640 


2 


516 

■ 


MKPLENLS QTAS LARGATLLRP VLRRLC 
GliPGJjQRPAAEM PLRARSDGAGP JjYSHH 
LPTS PLQ KALLAAGS AAMALYNP YRH/ E 
ERPRISTS TLDLGKLQSLPBGSLGREYL 

QRYREVHDMLHTLLGMPTNILGEIWKW 
FEAVQ 


10558 


24459 


A 


10641 


56 


455 


PLCSYIQFIYHSKFNSNQISLVHTIPIiS 
KLFGYFSQLYS I YSYLCQHFIHLLS IPX 
TVFC I VF YWVFVHLI SVPMYI CVCLCTN 
VCVCV/CCLW 


10559 


24460 


A 


10642 


272 


33 


GRWFLFLGPAKYFLTGGRFPSDFFGPLK 
INPPFFFFFFFDTESHSVAQAOVHWHNL 
G YLQ / S P PPG \S S DS PAPAEFHHT v 


10560 


24461 


A 


10643 


285 


1 


MGNFLKRFFKNEKKFFFPILRAHPFNFF 
PGKS P FKTPRRALP JjGVPPKHP FF FFFF 
FSETESRSVARLBCSGMISAHCNL\NLH 

br*ySGlSPALi 


10561 


24462 


A 


10644 


100 


466 


FLLKFRLREVPSRVRCQSAMiLGPTSQL 
GYSGVRDTLEEAVCPFSDLKLCARRTNT 
LirKl VKUvraLio JjUKJTuIj VUlA.lrV.lrKU 
GVYRGRQASLSCSGVHPVRASPPLCLPK 
VPPPS 


10562 


24463 


A 


10645 


260 


460 


LKPHAERETIDKGRLPYYSFFFFETESC 


10563 


24464 


A 


10646 


360 


504 


QIGHI CAYVEKTELRLIFFFIiFFFFRQS 
CSVT\RLECSGTILAHCNIiCL 


10564 


24465 




10647 


241 


45 


WEVEVPRVAPCHlLKRDY^Fl tWLFtP 
FEMES YS FT \ELE CS SAI STHCNLCPPG 

CDnQO/lTBDXI 
O ICIJo i. Ir C XI 


10565 


24466 


A 


10648 


76 

• 


462 


FLWEYQEKAHI LWS LFS KLiILSRSRFTV 
KEKVRQKPGILFIYLFIFEMBSCSVAQA 

GVQWRDLGSLSPQKKKDS/DQSKAITFV 

WTXIC VD/lTMAnTYSDm? /iipct.wtviti?t.t ' 
iSLi JLINo XVK1jv» rl IXyfttV/ rlL-o-u AX"! J. IT JL» J. 


10566 


24467 


A 


10649 


3 


359 


QTQRE P TME LT PAD KTNVKAAWGKVGAH 
AGEYGAEALERMFLSFPTTKTYFPHFDL 
onuoRy v i\.vjrioi\.ft. v «uo u 1 WoJiritirj jj i n 
LNALTALNDIiHAKKLWVDQVNFKL\LSH 
CLLVTLA 


10567 


24468 


A 


10650 


266 


3 


tp BkKKKl dGLGAk J? pg YtKici Wpp^fIT" 

re jvis. i ivr r r ruKVunArrrAor r rrrr jt 
EVESCSVAQAGVQWRDLDSLQPSP?\GS . 
RGSP 


1UJDO 




A 


luOJ 1 




HQ* 


KKI ICGFFFDTGS CFVAQAGGQWRDLRS 
LQSPPPG\SSDSLTSASRWGMSRHMRA 
VF 


10569 


24470 


A 


10652 


161 


456 


VFFFPGENIAFVENKTTLSYHFMFTRMT 
AAKKMBNYKGCWGWETIGSFMPCWLESK 
RVQSLW\KVWQFFKWLMVKLLYGLAIPL 
TGICPKKLKKYVQGY 


10570 


24471 


A 


10653. 


91 


486 


PCFNHGHTTWYMYFFSLSLILFFVFSLK 
P VILAYKTFS FP I T IRVLLQRRHRQ KFV 
LDGFNGVDQNNILGS I CVSLIKI FFLTE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 

nenf id& 

sequence 

• 

• 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

corresonnd 

ing to first 
amino acid" 
residue of 

peptide 
sequence 


Predict- 
ed end 

nnrlfw 

otide 
location 

correspon 
ding to 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A==AIanine " 
C=Cysteine, D=Aspartic Acid, 
f^Glutamic Acid. P^Phenvlalanina. 
G=Glycine, H^Histidine, I=Isoleucine, 
K=LiVsine. lr=T murine. M = Methionine. 

N=Asparagine, P=ProIine, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine. V= Valine. 
W=Tryptophan, y=Tyrosine, 
X B Unknown, * e =Stop codon, /possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














S C FVAEAGMOWCNIGSLOAPRP \GSPHS 
PGISPPVLG 


I vJ / 1 


Lf¥rldt 










P I FCFWRR I LTLSPRLE C/ RQ/MI FAHC 
NLCLPGSGHSPASA 


10572 


24413 


A 


10655 


775 


3401 


TFPSQIRYGLRKRPNSLTLSPRLBCSGA 
ISAHMORPPPGFTPFSCLST.PSSWHYRR 
PPPRPAICVCVCVCVCVCLVETGFHRVN 
QDGLDLLTS/S/ IPPASA/FPKCWDYRR 
E 


10573 


24474 


A 


10656 

1 


287 


454 


LPIilYFLYFYETECHPLAQAGVQWRDLS 

S LOS P PP \ G SND 3 S AS A 9 PTVPT S AM 


10574 


24475 


A 


10657 


212 


64 


LFLSKFYFFEMESCSVAQAALQWCDLGS 
VQPLP\PGSEDSPASAWGYLD 


IUj / j 






IUOjo 


J DO 


All 

47a 


TTEYYSAL 


10576 


24477 


A 


10659 


358 


3 


ILAVFCFCGSAPGFEGGKLTPFFNRGRE 

FKQPPNPNPNS FFFFFFETESHT I ARAG 
VQWHNLCSLQSPPPG\SSDSPGRWSLQR 
TBIAPL , 


10577 


24478 


A 


10660 


2 


235 


KRDLIRHYPKEDIYMANKYILK\CSTLT 
MTAVGVHI CNPNTtiTGHGRG IA 


10578 


24479 


A 


10661 


9 


538 


CVTVRIPSRPTRPLSSDRSNPGRFLSTS 
ST 


10579 


24480 


A 


10662 


374 


38 


SFLWKLHLRGAPGCIRCQSAXLLGGVSQ 
LGYSGVRDPLEEAVCP FSDLKPRAGRTT 
TLFKAVRQGRLSLOKFLLPFVQLCPAAR 

R 


IXJDoV 






iUOOj 





An 


AT. T PLQ SLT T <! APWT ,T .LT.PTVC! TQ Q D IfM 

ECSGAITSHCSFNLPDSSPSPTSASR/V 
TGTIGARHOAOLMFTYPC 


10581 


24482 


A 


10664 


64 


451 


FSSERKSHMSLTLNQKLEMIRLSEEGLS 
KAKVGQ KLVRIiHQTVSQVVDAKEKLLKL 
IKSATPVNIGMIKQHNLIADIEKFGMIW 
TDCOTSHRTVLCORLIOSK/ALTLFNSM i 
KAERGKEAABEKLEVRRG 


10582 


24483 


A 


10665 


355 


474 


ilfyfi ffetescsvaqagvqwrjjlgsl 
oapppgVsrdsp 


10583 


24484 


A 


10666 


317 


481 


GHTCPWQTFFFFLFRDRVLIiHHPGWSAV 
TQSWLTAALT\FGPKRSSCLNLLNDWD 






A"" ! 


1UOD / 


57 

j I 




VKNTOWGKDSLPNKRV\ PKNWASTVRR T 
KLDXLTSYAKINSKWIKDLNVPJuEIVKV 
LQVE YPS FKI LGNGS VLDFVFFYSG I FA 
LHLMGEHP 


10585 


24486 


A 


10668 

A \r*0^*\0 


86 


468 


ENYKI PLMGGKNFLLFPS I PPYFFFFSR 
LGLTLL PRLKCS GDHCS LQP RP PGLKRS 
SCL\GFPKCWDYRNEP/CVPR 


10586 


24487 


A 


10669 


254 


25 


GSH I CKVPAAI YSNTCTS S EUGDGGWG 
GCSSGSTTHPSPVADSFFVWETVSL/S 
VTQAGVRWCDLSSLQPPPPGI 


10587 


24488 


A 


10670 


206 


3 


YGRPRKLCN IGLKSGFLDPTHWGQHRVI 
S FFF IMETDS RS VAQAGVQ WHNLGSLHP 
PPPG\SSDSPAS 


10588 


24489 


A 


10671 


20 


355 


GFTSQSELLYSIDPITRPTDQVTIGISA 
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SEQID 
NO: of 
nucleotide 
sequence 

< 

■ 


SEQU) 
NO: of 
peptide 
sequence 


M. 
eth 
od 


SEQID 
NO: in 

TIC CM 

09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ammo acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cy5teine, D=Aspartic Acid, 
n>— oiutamic aciu, r— jrnenyiaianine, 
G=Glycine, ENHistidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
V^iviuiaraine, jtv— /vrginine, o^oerme, 
T=Tnreonine, V^Valine, 
W=Tryptophan, Y=Tyroslne, 
a— unknown, — otop coaon, /=pos5ioie 
nucleotide deletion, \=poxsible 
nucleotide insertion 










* 




QSYCRVHVHNRVYDLDVE3QHPNOAPAI 

NS S L Y/EKDKKNKACFTK/RPS PVNDI I 
ST 


10589 


24490 


A 


10673 


112 


359 


NKAQQCVHENHFKLKDANTLNI KVWRNI 
CHS S FNQKK x Vj-LA X LtNixUKbO rKbKKU 1 
GDEE/HFIKIKKSVIQEDFSIINIYA 


10590 


24491 


A 


10674 


317 


2 


TGPGFFPQIwKvFPPFPIiKIF FTQKFLV 
SFWGFKTIKVGPFVFFPKGPQPRFFSPF 
DGVQAEKFFFFPFFFFFFFFCERGSHSV 
A\K1jECSGAISAHCNIjvPPGFK. 


10591 


24492 

i 


A 


10675 


240 . 


495 


DHRPEKTKSSTCQAGEVPLLGLFVPFRS 
i STDWVRPAFI /MEG/HI CPTQSTNSNIN 
i P VWKFi P HRHTQ DNV W PNAWS FHGFVKliM 

HKLI 


10592 


24493 


A 


10676 


271 


484 


NPAGQTCRIKSFFFSFFETESCSVfQTQ - 
VQWHDHGS MQPQSLGP \ SDPPTS ANSS A 


10593 


24494 


A 


10677 


364 . 


121 


KQEVVRSLSHTtALCNDQAsPLFGSG 
SKQKLSKAAPCAGSSKHKHL/HKECWKH 
THTHTHTH IHTHTHTH IHAHTKDRFA 


10594 


24495 


A 


10678 


354 


539 


FF FFFF I VFEESHAIAQAGVQWCHLSS P 
HLR\SPPPGFN\DSPASASSAGirniSS 
SVRL 


10595 


24496 


A 


10679 


123 


853 


RWS LCHPRLE \ CSGT ISAHCK/ L/RAPG 
FTP FS CLS LPSS WD YS ARHHARLI FFVF 
IjVETGFHHVSQIXsIiDljJjT/ SG/ IPPAuA 
i /FPKGWDYRRE 


10596 


24497 


A 


10680 


251 


30 


ASLGVSAPIiCIHVSPCPPNGDLGKTWWV 
CSLCLLSSSNSPASASRIPP 


10597 


24498 


A 


10681 


203 


1 


NFPAPGKLGPPRDSLKTAPPFFFFFETG 
SRSVA/EAAVP/W/CDLDSLQPPTLPDS 

SUSPNSASGVAfcjl 1^3 


10598 


24499 


A 


10682 


237 


397 


DSLTLSPRIiECNGS ISAHYNLC/RLGSS 


10599 


24500 


A 


10683 


8 


388 


LYMCWFRPGFLAHNSHDHGYSLTLSCWG 
KKKKKKKKKKDS 


10600 


24501 


A 


10684 


201 


463 


t vracn ^1^^ tx ra tyo t .TCDTrtnvTcirtPT /tLn r» 
JL I WriJJl VytrX VAUKLil oK&rtiUoJvr XtonU 

LMFGQGFLFFEMDSHSAARAGVQWRSLG 
SLRPLPPG/SQVILLPSSDSPAIASRVD 

alnuV 


10601 


24502 


A 

* 


10685 


151 


1 


PEKNRKTQKGAPPFFFFFBTGSPSVAQA 
GLQWRSHSSIQNEPP\GSSDP 


10602 


24503 


A 


10686 


165 


3 


CVFVEDIISNFFFFFMETESRSVAQAGV 

AliJDt?tir , C\ A CDT7"A 'C"T w Pr > \7UTJTJ AOT.TT7D 


10603 


24504 


A 


10687 


292 


384 


NTEIGWVWWLTTVIPALWEAEAG/EITR 
SG 


1U604 






1 A<OQ 

lUottfS 


101 


itO 


LSSDRSNPGRVLSTSNSSLY/EKDKRNK 
AYFTK/RPSPVNDII 


10605 


24506 


A 


10689 


286 


3 


QPSSVWSFRSVICAPAPFSGMGLSPELQ 
Q CI VGNFAS RYTMT KS SAVLF I L I FSL I 
FKLEEL/REAPASLVSFLPPQMLRIKWE 
SEQWLVLLCDRG 


10606 


24507 


A 


10690 


169 


2 


SDTPAWWPRKHVFPPGFFFFFETESRFV 
T\RLBCSSTISAHCKLCLPGSSDSPASG 
S 
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SEQID 
NO: of 

IIUUCUUUC 

sequence 

— 


SEQID 
NO: of 

npntifip 

sequence 

- 


M 

eth 

od 

■ 


SEQID 
NO: Id 
USSN 
09/515,1 
26 


Predicted 
beginning 

n u r Ipnt id *» 

location 
corresnond 
ing to first 
amino acid 
residue of 
peptide 

sentience 

1 


Predict- 
ed end 

nude- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue nf 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. (^Phenylalanine. 
G=GIycine, H=Histidine, I=Isoteucine, 
K=Lvsine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serlne, 
T=Threonine, V^Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known. *=Stoo codon. /=D0Ssihle 
nucleotide deletion, ^possible 
nucleotide insertion 


1ftA/V7 
JUOv/ 








94.0 


AftfL 


DG I RRLS F FFF FF FETKFGFVA\ RLEGR 
GTIWVNVNLHLPGSRDSPASASQVAGIT 


lUOvo 




A 


1U07Z 




"I 
j 


OVPLNLSLHLKLGMLFPGCINGSIiSOFL 
QFFAQISPYQTGFPSFYKTSIYHNSLHQ 
YSLS FLP YFI FLFFFSFFFFLKQSHSVS 
\RLE CSGMVSAHCNIjRLTATS ttqvqvi 
LV. 


10609 


24510 


A 


10693 


196 


3 


YDAGHTKKTLHFPTVYPFVLFCWWECKM 
VQPRWKI VRR/ FLKKLNTKLP YDPAI P F 
LSVHOKELRT 


10610 


24511 


A 


10694 


245 


391 


KTDYQPGAVAHFGRPRREDHPRLGVQDK 
PDOLEKPVSTKNTKLVW /SWWL 


10611 


24512 


A 


10695 


252 


389 


KTQGKVLFFFFETESRSVAQVGVQWRAL 
RSTiOPP\ OPGTSDCPASAS 


10612 


24513 


A" 


10696 


385 


1 


P PNKAKM X SSKDNKNLHWGKDTLLNKWC 
WESWIVTCITMKLDSHLSPYTKIKPKWI 
KD/LKT I KI LGENI KKTVLD I GLHK\NM 
SKTSKAMTKILDLIKLKSFCPAKEIISR 

VTJRK^TRWFCKVPA^YT.^ 

v X*! XvXXO ± Ei n CVKV V (nu X IJ^vl 


10613 


24514 


A 


10697 


296 


430 


KHIQARRGGS/CGN/RQHSGRPRRADHL 
RSGVREQPEQPGEKP.HL 


1U614 


24515 








4oo 


r>VT.r"T. /QT.QT VT.QTVT.PT YTJ5T TWOPTV 

HLF I YHLSNLS S I S I FFTKWLS 


10615 


24516 


A 


10699 


15 


393 


RSVGVLGPVRCQCA\LLGGDSQIiG\SQG 
SGVRDPLEEAVCRFPYLQLCTGRTTALF 
KAVRQGHIiSLQRLLLS F\VWLCPAPIGG 

EAWAMAGAPPPASLPPLS 




x43l / 




iu /uu 


1 T< 


i 


DWTTNFTTFrjYN Pit PS *5 IMP YTjSHT .FKT 
^#rt A iiif x x f uinc ivr <L3k3XJi ic^ x xjjjxxjxt i\i 

lxR/MWPGAV7UxARNPSTLGGR0GWIMRS 
GDE 


10617 


24518 


A 


10701 


494 


80 


FNKKDIHSDTPSEGHQIK3RPNVETLKKM 
GRNQCKKGENPKNQNASSPKDHNSSTPR 
EONWMKNESDELI EVGFRRWVITNSSEL 
YKX3\DVLTQCKEAKNLKNRLGKVLTRIT 
SLEKNGHGLMSVKNIAQELCEASAGWR 


lUOlO 




A 






din 


KKGPLFTPPGGGGG/POKKPPGPLNPGG 
QRDSSFFPPPGGGNTGETPPGG 


10619 


24520 


A 


10703 


125 


3 


NRGNKGQV/ QWLMP VI PALWEAEAGRS P 
EIRSRDQPRQHGBTL 






A 




J o*t 


? 


RLRVLAPCRHLPRAPRTWPKRPFFFSFF 
LFFPQTQSHS VA\RLECSGAI SAHSNLC 
FPPTRP 


10621 


24522 


A 


10705 


187 


406 


LFLWKFCLRGVPGHVRCQSAL\LGGASQ 
I^SSGVRDPLEEAVCLFSDLQLRAGRTT 
TLFKAVROGHLRLORILL 

mw mmt m *Vm4sm- V A%|^W*4M(\MW4\A MAI 


10622 


24523 


A 


10706 


82 


410 

• 


ILRGKFGKHYFNRIDWREALRQSLSLFN 
FTTFSNFIiASTjHKPEMETEIiKGSFIEIiR 

kalfqlnardasllstvdsdfsfcrkfs 
r/cskcgq 


10623 


24524 


A 


10707 


390 


2 


S KDCRTAKI AACS FLWKLRSRGAPARCK 
PELSCMRCLSA\LLGGVSQSGGTGIRDP 
LEEAWPLAELERCVGRSAALFRASRQE 
HLSLLKMHPQLPLPSGALSQALX3SFIYK 
PLTGASAFLS EMP CQERR 


10624 


24525 


A 


10708 


229 


392 


YSWWRQMHSVAHAGVQWCD/LVSLQPQP 
PG\SSDPPASASLAARTTGAGHHNQLIF 


10625 


24526 


A 


10709 


193 


2 


RYLCYQDHVI LLLLLLLLFLRWSLTLSL 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methiomne, 
N=Asparagine, P=Proline, 
' Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=»Valine, 
W=Tryptophan, V^yrosine, 
X=Unknown, *=Stop codbn,/=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 














RtiECSGT IMAHCSLDS\SSNPPI SASQA 
AGTTGACYH 


10626 


24527 


A 


10710 


180 


11 


LLFFFETRS/HVSQTWQCGMISGHCNL 
RLSGSSDP PPLASRVAGTTGXHQNIWLS 
R 


10627 


24528 


A 


10711 


234 


408 


MKLIMLKVILFPQIHLQIQPNFYQNIFP 
FETELCSVAQSGVQWHNLSSLQHPPP\G 
SSD 


10628 


24529 


A 


10712 


312 . 


470 


TGPHCVT \RLECSGAITAHCSLDFAGLS 
TSPTSVSQVSGTTGTWMKLETI ILS 


10629 


24530 


A 


10713 

i 


64 


392 


PKMVI RI SS ETS L YAS LPLQMKGQRQK/ 
CEPQPKKKKKKKKKKKKKKKKKKKKKAR 

G . 


10630 


24531 


A 


10714 


169 


427 


NNQKTNNKMVG VS F YLS III LNVNELNA 
P I QRHRVAE W I KKE KKKKK/ DPG I CGLQ 
QTLFFYEDP/HDPLRLKIGGW/RKYYPS 
RGTQIK 


10631 


24532 


A 


10715 


176 


410 


ARSSWGLHTAVPVAFFSLRFLLEIFLDR 
DTRCS P PAAFFFE TGS CCVA \ RLEGRGA 
ITAQCSLNLLGSSNPPTSASRVA 


10632 


24533 


A 


10716 


234 


389 


NSGNMDRYKDVQNI IQNPICWPGTVAKA 
CNPSTLGGRGGQ I L / RGQEFBTS LA 


10633 


24534 


A 


10717 


190 


2 


GPFPPHPGGFLRGFFVPNTI PPPTFFFF 
FFLRRS LS VAQAGVQWCDLS SLQ PLP PG 
P\SNSPCQ 


10634 


24535 


A 


10718 


201 


1 


WPLFLPKMFFFFKRFPHMGPSPWVGAQA 
KKKKI FFFLTGSHSVT\RIECSGTISAH 
CSLNLPGPSH 


10635 


24536 


A 


10719 


195 


419 


EYHTSLVTCGNPCVYRSNNKLNQTSRRK 
VITKI RAELNE I ETEK/ LQGSGETKXWF 
FE KINKTGLELLGS SDPPVW 


10636 


24537 


A 


10720 


218 


407 


GKKNIiAFKKKKRKEKKRKKSPSQSNMNS 
AKIEARTNI KL WKHG WKNCE 1 1 DA\LQ 
KAFGDNA 


10637 


24538 


A 


10721 


231 


409 


GTQLHLGGFFFSEKELSFCFWF/MFALF 
EMEACSVT\RMECSGrVLAHCNIiRLPGT 
SNSSA 


10638 


24539 


A 


10722 


40 


401 


PLCPSESSGNTLMAS SDPSTPAVP PPNT 
THPPLCLS KS HL PLRP KQGL PSGNLLQL 
PLTLLIPLLGAPVACWQLPQQCTIiSTFF 
FETKSHPVAQAGVQWCCLGSLQPIiSPG\ 
SRDSPDSAC 


10639 


24540 


A 


10723 


210 


14 


HVMGLLLL FLNKLTVKNFGWIiAGFWCLG 
FFWFFS FFETRSGS VT\RLE CSGM ISAH 
YKLCIPGSSH 


10640 


24541 


A 


10724 


347 


462 


TFFFFFFETXSRCDI QAGVQWCDLCSLP 
PSS\PDSSDCP 


10641 


24542 


A 


10725 


169 


386 


DKKQAKT I KWGKNS FSNKWCWNNCIATG 
KRMKIiDP \YLTPYKK./ INSKWIKDLjSI 


10642 


24543 


A 


10726 


257 


54 


PLFFFFKRQGLTLSPRLECNGTVTAHYN 
LKLLGSRDLPTSASP\SAGITGISNPAR 
LFTYLNPTHLQT 


10643 


24544 


A 


10727 


2 


401 


NNYDRAETQIYQYMCLNPTFYCLQETHL 
T CND I YRLKVKGRRE I MQIENKRVGVAI 
LVSDKTDFKPTTVKKKLHYTIRGSIQPE 
DLI ILCTYSPNIRASRFIKRIPDLRKEI 
A/HTVKVGDFSISLNRLSRQNT 


10644 


24545 


A 


10728 | 154 


1 


PMCSLLGLSKGGIIFFFFFEABFCSVAQ | 
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SEQID 
NO: of 
nucleotide 
seouence 

* 


SEQID 
NO: of 
nentide 

V V|# WW 

sequence 


eth 
od 

- 

* 


SEQID 
NO: in 
USSN 
09/515,1 
26 

P 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
' amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A ra AJanine 
C=Cysieiae, D=Aspartlc Acid, 
E=G\utamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleudne, 
K=Lysine, L=Leucine ? M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutimrne, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=0Tryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














AGVQWRDLGSLQPPPPG\SSDS 


10645 


24546 


A 


10729 


34 


151 


PRPPVPSLLDRGRLQLWRQRGLRHRAHS 
NGFIGGKQQIMKLLKNYVRRPVG/VAVA 
IMFD P DPRYPR S W I EDD FNYGGSVASAT 
VHIRMGSLVENNKS 


10646 


24547 


A 


10730 


2 


408 


VFLLTVRTLI CRS VGVWWRSTPDLVCLG 
ISSGGCRTANIGLYQMLLPDHSSGS FCL 
RGVPGHVRCQSA\L3jGGPSQLGYSRVRD 
PLEEAVCPFSDLQIiHAGRTTTLFKAVRQ 
VHXi I LQRFLLRFVWLCPAPRG/GVYR 


10647 


24548 


A 


10731 


274 


89 


ENIPIVFNPPVCSPLLWQPQH/CEYPKL 

cklclklhkhrvfy\ flpfffetephsv 

ARIiECSG 


10648 


24549 


A 


10732 


251 

» 


1 


tfqmmqkcfshrki fhnlldkasyki vy 
kedpfspsslsssvslknnffflbtesc 

SVTOAGVLWCHLS SLOP PPPGVSRDS P 


10649 


24550 


A 


10733 


375 


1 


APFPPLWVRGSPPFSPLVFPHPKLKPPL 
AE I LGFFKERKWGS IRKPCLFKVKKLVS 
VWPGLELQIFKFIGEFPFSFPSLVGRKP 
NFFLGPVFFFFFEKESRSVA\RMBCKGT 


10650 


24551 


A 


10734 


82 


398 


SFLWKLRLRGVPSHVRCQSA\LLGGASQ 
LGYLGVRD PLE EAVCP FSDLQLPAGRTT 
TLFKAVRQGHLNLQRFLLPFVQLCPASR 
GGVYRGRQAS LS CGGLHPVRAS 


lUOjI 




A 


IKJ/JJ 

■ 


7 




TFCAISWLESGVEDGPRSRISYRISTFF 
FFFKGAPEPKEVRGRPKPGLAPTSLPGA 
HGTTGLSGWTTPS IGGWQPPPPPRENPK 
GEHPPAPVAGDTFQ\PKKPPPILKVFFP 
K 


10652 


24553 


A 


10736 


171 


1 


RVFYLLAFALFVDTGS PF VSQAGVQWCD 
HSS I QS QTSG\ SNDPPAS AYRVAGTTGV 


1 I/O J J 




A 


10717 

L\J fj f 


1 90 

1 7V 


1 
i 


EHKTTVRKI PKMEEMADSGSNMLRI IIF 
LFLFFETKSCSVT\RLECN/GSISAHCN 
IiHLPGSSNS 


10654 


24555 


A 


10738 


252 


413 


GLLGLQNFSYKVHLTKAHLKKCSWLDAV 
AHAYNPITLGGRGEW/ IQEFQTSLTNV 


10655 


24556 


A 


10739 


101 


249 


AGSTSRCI/QELSDLLEHIiEQENCLNPG 
GRG CSE PQS CHCT PAWVTETQKK 


10656 


24557 


A 


10740 


185 


403 


LGLPVHTMKSNYHYHLCHHHHHLQHHHH 
HHHCHCHYHS /HHHNSQHP P PP PPPPPP 
HHHHHHHHHLP 


10657 


24558 


A 


10741 


119 


425 


IKFHLHSMERTHLFLWNCWAHKQRNCFK 
ADFVITDDIKQLCPPQSWWTRAGKLPLG 
AGRGG S HML S HH F WR PRKAD I L YLGVLD 
QPGQQGSTPSL/LKNTKSSW 


10658 


24559 


A 


10742 


112 


2 


GETFF FETGS S S VA \ RLE CS GAI SAHHN 
LHLPGSSKSP 


10659 


24560 


A 


10743 


188 


3 


PPPLFFFFFFFFFKTGSGSAT\RLECTA 
HCNCCLPGSSHHPTSAYQVARTTDVCNH 
AWLIFV 


10660 


24561 


A 


10744 


176 


3 


SLYSKNKNPSHLLFLPIPIKFFFFFFBT 
EFRSVAQAGVQWHDLSSLQPPPPG\SSD 
SP 


10661 


24562 


A 


10745 


250 


1 


GPRRI FFLKEFYPRFVFGKNPAPGGFFS 
GGKKPGPFPFNPRPIKFFFFFFETEFRS 
VAQAGVQWHDLSSLQPPPPG\SSDSP 


10«2 


24563 


A 


10746 


155 


3 


' PPHPf'PF^PLPIKFFFFFFETEFRSVAQ 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

\ 

- 


SEQID 
NO: in 
USSN 
09/515,1. 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

• i t m ft 

ing to first 
amino acid 
residue of 
peptide 
sequence 

i 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A B Alanlne 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanioe, 
G=Glycine, H^Histidine, I=Isoleucme, 
KpLysine, L*=Leucine, MNMethionine, 
N=Asparagine t P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tnreonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X» unknown, *«Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGVQWHDLSSQQPPPPG\SSDSP 


10663 


24564 


A 


10747 


157 


1 


TPRPFPFNPPPLKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10664 


24565 


A 


10748 


157 


1 


TPRP FP FNPHP I KFFFFFFETE FRSVAQ 
AGAQWHDLSSLQPPPPG\SSDSP 


10665 


24566 


A 


10749 


153 


2 


PPPLSWTPAPLRFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10666 


24567 


A 


10750 


157 


1 


PPPPFLFFPRPLKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10667 


24568 


A 


10751 


155 


3 


NTFPFFFTPLTPKFFFFFFETEFRSVAQ 
AGVQWHDFSSI*QPPPPG\SSDSP 


10668 


24569 


A 


10752 


86 


406 


SFLWKLHLRRAPGCMRGEUJl\liZiGGVSQ 
LDYSGVRDPLEEAVCPFSGPKLHAGRTT 
TLFKAVRQGHLS LQKFLLP FVQLCPAPS 
TGVQEGRQAS LS FGGLHP VQ3 SR . 


10669 


24570 


A 


10753 


364 


469 


GQFLHSLD/SHWKKSEDFCFLWFPHSEN 
VSAIHQDH 


10670 


24571 


A 


10754 


250 


61 


GE KKKKKPRVFF FFFFFETE PGS VT\ RL 
ECSGVLSAHCNLRLPNPNDSPASASRVA 
ASAKLG 


10671 


24572 

• 


A 


10755 

-. 


66 


446 

■ 


SFLWKFCLKGVPGCVRGQSA\LLGGASQ 
DGYSGVRYPLEETVCPFSDLKLRAGRTT 
TLFKAVRRGHLSLQRLLPPSVCLCPAPR 
GEAYRGRQAS LS CGGLHPVRASRPLCLP 
TQALAMVGAPPPGSL 


10672 


24573 


A 


10756 


252 


482 


RLPRQPVRKWVAGVRGCCVWGWSKSLQ 
RSTTLDWQGPPQRGGPILLFFFFETLCH 
/TRLEGNGEISAHCDLCLPGSN 


10673 


24574 


A 


10757 

• 


205 


488 


PLESLRSPLGPLSWRTASSVSPVTCIYY 
IYIYIYIYTHTHY/HIHTHTHTHTHTHT 
QYI FVCVVFLKDRVLLCHPRQSAGARSW 
PTRTSASQRQEM 


10674 


24575 


A 


10758 


272 


471 


YS YVLFFIFLGI ES CSV/AFSAGVQWHN 
HSSLQLQTPGLKQS3HLSLP/ASASPVA 
GTTGMR YHARL I F 


10675 


24576 


A 


10759 


17 


342 


GTLSSGAQVLIGRIESIVWGLKPWALG 
GCPSPRAVHWLLASSDWRPSLQDGAEGW 
KKGEANGNHKR/GIAVI SDQ I DFKTKT I 
KGD\ iCKSHYVMIKGPIQQEAITI INI 


10676 


24577 


A 


10760 


42 


498 


EFRERGREKEREKERKTEERMEDRERKA 
E REREKERQEGRE RERE KGRQRERERKR 
ERERKRERE/RHEPGSL 


10677 


24578 


A 


10761 


I 


305 


ASWDDPAHNNNFHIPGGVVAHFFFCRLL 
D CP PWAPRPPAPR/ VLL/TPAAAAAAAA 
AASRPEKKRAEENVGAI PPRKRPPRWEE 
RRGPRKGSAP PGRGAGRRAR 


10678 


24579 


A 


10762 


364 


2 


FPQKQNGQYLPLFPKPTGLCXjKGKXRLG 
FFLNYFGKKKKKRGRELGFPPFCQIWGT 
S IYRGSMGYFFFFFFWRQVSLALLPMLE 
C:\RGMIRVHCSLDLVGPSNPSTSASQIA 
RTTGTCMCHH 


10679 


24580 


A 


10763 


152 


484 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYBTK/RPSPVNDIIS 
T 


10680 


24581 


A 


10764 


208 


420 


S FL WRFPLRGVPGHVRCQSA \ LLGGASQ 
LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLSLQR 


10681 


24582 


A 


10765 


319 


85 


GKLLNNNRCFGS KFQG IMFFFGETESP Y 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 


M. 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
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nucle- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~ Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenytatanine, 
OGlycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Metblonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonlne, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon, /^possible 
n ucleotide deletion, \=poss>bte 
nucleotide insertion 


■ 








■ 




ITQSGVRWHDFGSLQPP\PPGSRDSPAS 
AS\RGITGARKHTQPIFLLVSNS 


10682 


24583 


A 


10766 


122 


3 


KFFFFFETE FRS VAQAGVQ WHDLS SLQP 
PPPG\SSDSPSL 


10683 


24584 


A 


10767 


240 


86 


RAPPFFFFEMESCS IAQAGVQWLSLGLL 
QAPPP \GSCHSPASASCELMFSKI 


10684 


•24585 


A 


10768 


244 


85 


RAPPFFFFEMESCS IAQAGVQWLSLGLL 
QAP P P \ G S CHS PAS AS CELMFPKI 


10685 


24586 

* 


A 


10769 


35 


296 


EVKSPSARQPPRLGSEEHLRPAAAPSGR 

EVGGQPPPGQPPCPGGEG/PPPPGSPDW 
BVRSPSARQPPRLGGEPNSSLRTGHDDD 
GGFV 


10686 


24587 


A 


10770 


260 


484 

■ 


MDEELLLMNEQRTW FLEVES S PGEOAVS ! 
I IQLATRDLEYDIiNLVEKGAAG/LERKH j 
YSFERSSTVDKILSHNTACY ] 


10687 


24588 


A 


10771 


248 


393 


TQEGKKL INWPGTVAHACNPS TLGARGG 
RTTRGQELETK\ LANKIKPCL 


10688 


24589 


A 


10772 


239 


2 


SFLWKFCLRGVPGRVRCQSA\LLGGASQ 
LG\SQGSGVRDPLEEAVCPSSDLQLRAG 
RTSALFKAVRQGHLSLQRLLLSF 


10689 

- 


24590 

* 


A 


10773 

• 


158 

- 


455 


LFFTLCPSLLQHIAVMLELGLKGPKSIQ 
P I FW VFLQGTE P / HFLVTPVR/ CCLPLL 
KLFCLLVFGMES P S VPHAGECSGVI SAH 
CNLCLLGSTDSSASPSRV 


10690 

« 


24591 


A 


10774 


166 


403 


KKTFLGE P LFWGGAKKKKPGKKNPGFF P 
RG/ IKPRVFFSRFFFFGPPPKKGFPQKS 
FFLKSLPGFFLFGGCPPPFFFFFFFFEM 
ESCSNTRLECSGVI LAHCNLCLPGSSDS 
P 


10691 


24592 


A 


10775 


208 


1 


RLCFFYFRKALLGKAQIKNIAFFPRKGS 
FFFFSETESRSVAQAGL/LDCSGAISAH 
CKLRFPGSRQSPAS 


10692 


24593 


A 


10776 


192 


29 


IFRKBFPCLNFFLLFFETESRSLAQAGV 
QWRDLGS LKAPP PG\ SRRS PASRDTGV 


10693 


24594 


A 


10777 


275 


490 


KPGFFLLQMAKVYM I FFFFEMEFCS \VA 
QAGVQWHDLGSMQHRPPG\SGDSPSCLP 
SI/AGIPG 


10694 


24595 


A 


10778 


340 


3 


LKVPVGKPRSTLLGVKKVPFFGNKRKKF 
LALFFFPRPPPGEGFSTAFLAQKPTPRV 
VPALGFPKNQSPLSPFFFFFFETKSCSV 
TQAGVQWCGLRSPQPMPPG\SSDSPASC 
L 


10695 


24596 


A 


10779 


313 


1 


ANPFGGPS GGDPFSSRVFPP PGPKNETP 
F FKNKTTAT KKTGNKGG WAP S PHRGGP K 
KGPALWDKKGKNLWPFFFFETESHSVT\ 
RLQCSNTI LAHCNQCLPGS 


10696 


24597 


A 


10780 


116 


422 


ILEI)TNIQTIETLLAIREVQIQTTLRNH 
FTLTGMAI I/RKTDNNKCWRECGKIETL 
I CCWRE C KMLCNE VGTALE IVWQFXjQSL 
NIELS YDLAMPGFI IFPRE 


10697 


24598 


A 


10781 


120 


418 


TQTTGAPQLHLASR WLSRSGGLTS S PQE 

ipklfwsiesplgsskhlslqvfvclfv 
csfvpemescsvarlecsivisahXctl 
hlwgsshfhasasrva 


10698 


24599 


A ! 


10782 


134 


1 


SS FFFLCQTESCFGVQWHDLSSLQPSTS 
RAQA\LSLPSSWDHRR 


10699 


24600 


A 


10783 


2 


435 


CSHRGDS S S YSQLSG IRAGDLGGGGKD I 
FRLLPTTLNIFAGKESYDWCVTHERMC 
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Amino acid sequence (A=Aianine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGlycine, H=H*istidine, I=Isoleucine, 
K^Lysine, L=Leucme, M^JVlethionine, 
N=Asparagine, PHProline, 
Q=Glutamine/R=Arginine, S^Serine, 
T=Threonine, V=Vallne, 
W=Tryptophan, y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 








• 






LC/SCLFIYMCLCMCVHV/HYIHEACVF 
MCLCL 


10700 

• 

* 


24601 


A 


10784. 


432. 


161 


FSRAGFH/RVSQDGFDLLPS \CLPPLGL 
P KCWD YKREPPRPAWKLMCRQVHRKCTW 
LFIQ IGSTLFKTNGGLS AVAHAYNPNTW 
GGRGBRIA 


10701 


24602 


A 


10785 


270 


442 


NVRLRLGLALSPTLECRGTIMAHCRLDF 
PGLMQS S HLS HRVAGTTG / TCHHAWL I F 
KFL 


10702 


24603 


A 


10786 


12 


424 


LIQLPRLECSGALIARCNFTY/SLGSGD 
PPTSASQ/VLKTTGVCHHAQL 


10703 


24604 

* 


A 


10787 


.350 


3 


DSSS VQTNKGIPGQVLSDILS CAVKAS V 
LVI I PLHNALSSWLVPALFtiWKS WQVGK 
SQQAHSSVSGLC/MHYRSHTHAHTHSPH 
RHRHTYTYARAHTHTHTQMLSAYLPSKQ 
PSGSLS 


10704 


24605 

■ 


A 


10788 


155 


3 


HAFFALCIRNRLECNGVILAHCNLC/RL 
LGSSDSPVSASQVNG IAGACHHAQL 


10705 


24606 


A 


10789 


188 


2 


RRDLSSLQPPRGGQ/MRGCIYTHTHTHT 
HTHTHTHTVHWGWGKRHHVPKGKMSANG 
BSGRAK „ 


10706 


24607 


A 


10790 


19 


405 

> 


I RPT I SRVE RG INS LVASEGQRLPWDG I 
ACSQGLVWQQTWGPLGPFPSLLGMPHR 
PTFRDLNSEPAPGVANVSGTLSTPLPGA 

• 

S HGLLVF F FETE SHFVAQAGVS WGDLRS 
LPPPPPG\SSNSPVSAS 


10707 


24608 


A 


10791 


157 


1 


KPGPFPFY PGPLKI FFFFFETEFRSVA\ 
RLECSGT I SAHCNLHLPGSSDS P 


10708 


24609 


A 


10792 


157 


1 


QPRP FPFNPRPPKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10709 

■ 


24610 


A 


10793 


83 


387 


SFLWKLHPGGAPACMRCC2LA\IiLGGVSQ 
LG YTGFRDLLEEAVCP FSELKHHAGRTT 
AVFSAVRQGCLS LQKFLLP FVQLCPAPR 
GGVYRG/RQALLS CHRIiHPV 


10710 


24611 


A 


10794 


204 


419 


KGVYGHSGS FSPPAPIiACFRDKTLFPVS 
LWKEFVHSPWCKCTLPQPLWKTWRYLK 
NFKME\lPYVPEIPLIiG 


10711 

* 


24612 


A 


10795 


,207 


2 


RRG FTMLVGQNS LD PS TSRSAHLS LPKC 
WD YRCKP / PAPS / LREGFS YLLHS LLHP 
TPAPGS HHLWAAL I C 


10712 


24613 


A 


10796 


2 


237 


FFFLRER I LLALS PRLECSDANMSHCSL 
NLPG / FSQS FCLSHP \ SRWDHRHMPPYP 
VKFFGI FVGDRVLALFPKLIS YI I 


10713 


24614 


A 


10797 


342 


40 


DRVFFCSPRLECSGAIIARCS/LRTPGL 
KQSSHLSLPKCWNHRCKPPRPARCSUSIE 
SHSAKKWPGQSTDS SACEEVQ PTS PFPT 
VPYPTLLSPSAGHGRGR 


10714 


24615 


A 


10798 


209 


2 


CSVLAVILNPECIYWPWLELLAHDPIP 

t)T ^ r/^Tji/c t?c d?Ti/T?poin\ r\7\ n'xrritiTZTTXT 
XJjVL.lriN.or r r c* 1 tvr Lo VAUw VUWrUJlAj 

SLQPPPPG\SSNSP 


10715 


24616 


A 


10799 


142 


2 


IFFFFFEMESRSVARPGVQWSDLGSLPP 
PPP\GSSDSPASATPSPMQS 


10716 


24617 


A 


10800 


276 


2 


ILPIIIRNTCCCFFLKESRSCSVTQARV 
QWHII/GSLQPQTPG\SSNPPASAFQVA 
TGAHHHTSLIRNTLI I INWEKNTHKLSG 
CTSMHFRFL 


10717 


24618 


A 


10801 


12 


364 


LHHYKTVSG IYKCLCVLNGPPTRFFPDF 
LPPIjGLPYSLRHNNIEISPIKNPPIASK 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspnrtic Acid, 
K=C?lutemir Atirl F=PhenvlalAnin# 

llr vJUWimiC ALIUy M? M UvUTlallllllUC} 

G=Giycine, H=Histidine, I=l£o!eurine, 
K^Lvsine. L^Leucine* M—Methionint. 
N=Asparagine, PHrYoline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine V= Valine. 

A X mmm VUlHIIti T T IIIIIIW| 

W«Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, Nppossible 
nucleotide insertion 














YS RERMS CT S IiTLNOTLVMI KLNKBSIili 
KANSDHKLCLLR/SVYQVVHAYEKFL 


10718 


24619 


A . 


10802 


235 


3 


HRPSEDLDFPPHWAVMRAPQYPSWDTRE 
GGPRSFLFLFLFFEMESYSVAQL/BGTI 
SAHCNLCLPSSSDSPTSRALCSL 




Z40ZU 

1 








1 


wyi fr rrrr rwiaiioi ijuuo \xi?xxux 
LLKYPEVA/ESASQRDPEWBAAVWRWLB 
GPQS AQ P PSAPAKGQELDPVVGQRPVPS 
PDDKVQWP YTNAVLLE IQRFI S WKRTL 

1UL( i. XJ X 


10720 


24621 

4 

1 


A 


10804 


365 


3 


I DVCNV CVRKQ YRGFCDQKYRAWIS PVY 
PHKCGWHRVYNTPTPHCETEWLWVILHA 
QEH/TFSLTGRHTHTHTHTHTHTHTSKL 
At rAo KAJjr\? VAa V cJvjvJ\Xiivb trS>0v?K X 
LAITMFVQPCI 


lU/zl 


Z4622 




10805 


406 


96 


GLGWRI FKVFFWS PPKVQTSLFFFSKT 
GSHSVT\RLECGDTIIiAHCDLCAPGSGD 

tfcJnUinl. x> V X V X vVaJJtr JtrWtrnK 


10722 


24623 

* 


A 


10806 


58 


369 


FFFFKGDRAQNNS / WGERCIjIiNKGYWD I 

BJT QTr , K"KTVTK' CTD VT .TT JITUT TQC IffST ,imT , 
n,J.O A \— JVI\J T lI\.t3 J. If X JJlLiniAlOSIUaliAiJD 

I IRAKRIHLLKKYIGINLHDLGLK\DFL 
NMTPKTLATKEKIDTLDFIKIK . 


10723 


24624 


A 


10807 


126 


1 


KEPFFFFFFETESCSVAQAGVQRCNLS3 


10724 


24625 


A 


10808 


204 


415 


HLGFDSIjTTCISLSDGLKYKATVFLVFF 

feresnc\avwaecngpisvncnlrlpg 
sgsspaspsrgveit 


10725 


24626 


A 


10809 


168 


2 


kiskrpffffffetgshyva\klbcsgv 


10726 


24627 


A 

-• 


10810 


97 


389 


lavspslslslslslslsylglpyslrr 
snieitpintpaegsvcsserkghmsls 
fnekievitlsaqdmsntkig/rkldll 
ch/tsqwnaeekflk 


\\)i£l 


Z40<£O 


n 

.M 


i not i 


Oil 

Z4 


4lO 


liCtX XAKKXLA3V V wlar ff ffiiNKvbDlVoA 

gpfkfffprgffstrnoappggpwgplp 
wgggppvgfqkqgkggap \ ppkknrfpk 
ggpltqpnlpktpienpkgpptrgffps 
gpppkkgagpppifrvgpg 






n 
*t 




OU 




J\JVc\XUM 1? tr\^i\JVX O Jr X XjrJjflrf r HX rr JjtV V 

KNQTPPV/ CCFFEAPFSLKKPPRGLKKG 
/ WGKNPPPPLVPOKKKKKDTAGVATLVS 

DKKDFNPTKIKKDKGHYIMVKGSMQQBG 
LGCPDAWVPS 


10729 

• 

■ 


24630 


A 

* 


10813 


618 


1519 

• 


FWFGVNCKSVCVPFLPVLCFWCVWCER 

FKGDYS VTMTPGKIiRTIjCE I DWPALEVG 
WPSEGSMDRSLVSKVWHKVTCKPGCPDO 
FPYIDTWLQLV/YRPPPY 


10730 


24631 


A 


10814 


179 


15 


KVKRIiKTP FFFFFFETRTCS VAQARVQ W 
HNHSSLQPQPPGPKRS\PTSVSREAGT 


10731 


24632 


A 


10815 


61 


422 


NCFFLKGPPLFFFFFFFFFFLLQIIVFI 
YKFFAFLFQMEFRSLPRLDKCNGAISAH 
CNl^LPSSWDYRNLPPRIANFFLFLVET 
RQPASA/ FLTCWD YR 


10732 


24633 


A 


10816 


170 


440 


RQGLSLLPRLECSSMTTAHCDLKLLSSS 
NPP I S APH I ALG/ LTGLCHHTQL 


10733 


24634 


A 


10817 


322 


443 


FI FIFYFFETP^HAVAQA^QWRxxljlir" 
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C=Cysteine, D=Aspartic Acid, 
Jb— oiuiaroic a ciO) r— * nenyiaianine, 
ij^vHycme, H^JnisuQiue, j=i»uieucine, 
K=Lysine, LHLeucine, M=Methionine, 
n— Asparagine, r=* ruiine, 
Q=Glutamine, R=Arglnine, S=Serine, 
T=Tbreonine, V=Valine, 

W— Trvn^Anhnii Vs^Ttrrnqirtp 
w — i rypiopiiaii, i — x jrusiiit;, 

X°Vn known, *=Stopcodon,/=possible 

nucleotide deletion, V=possible 

nucieoiioe insertion 














QPPPP\GSSDSPA 


1 ATI A 

10734 


24oJ5 




lUolo 


134 


5 


ISAHCNLCLPGSSVSV 


10735 


24636 


A 


t aoi r\ 

10819 


76 


245 


ITT T.D VfiTDrifinDT.Q Q /T.T.l?lS.TT\rDIMTT.AD 
r IjIjKJvvj i KUuU r Lio tS / Jjlir JN X V KUVJUfiK 

AIRL\EKKIKDIQIGKEEAELSLFADMI 
IF 


10736 


24637 


A 


10820 


233 


440 


GGI KAAKTAS LKKHLQGWGKVMLMANKL 
LRGERAW/FPPRIRG 


10737 


24638 


A 


10821 


293 


402 


RNP FF FIiETE PHS VAQGEVQW CNuS S LQ 
PPP\PGS 


10738 


24639 


A 


10822 


168 


1 


PLRGLSDSSLGIMKRMATDLSSLQPLPP 
G \ S SDS PAS AS RVAG I TDSHHHAWV I F 


10739 


24640 


A 


10823 


97 


405 


LCVNI CFNFPWLGVEWLNHI IGVGLTFF 
FEMES CS VARLECSGAI S AHCNLL/ LP P 
SFGSSDSPASVT 


10740 


24641 


A 


10824 


186 


2 


BPGTI S LVALKLQKW PRR/ SDHLRLAVR 
DQPGQHDETPSLLKNTKISWAQWQAPVI 
PALEPV 


10741 


24642 


A 


10825 


199 


1 


KKTQ IGGAPGGALFF FFFFETGFPFVA\ 
RLQCSGTNQAQCSLNLLSSSNPSFPALQ 
VTGTPSACP 


10742 


24643 


A 


10826 


252 


381 


GQQEFFFPCSVPQAGGQWCNLSSLQAPP 
PG \ SHHS PASASRVAG 


10743 


24644 


A 


10827 


77 


427 


IPQVHCPMSPPVPMACIPRVSSFTSWVF 
HNLLPPSECPLGPLVPASSHPRPCVCCR 
PCTSWS\CPLWPRPPCSNSPV/TCVPCL 
PCLCISEIPSCVPWP/WTYSSLCPMSHV 
PDSPCPIiP 


10744 


24645 


A 


10829 


20 


518 


S FAFSLLQHLTETS FAXNSCSEATLLFL 
S VFL / RAQ TLTAP CQTRG P RRGKDRG SG 
SSSPSGPKATKESSVERRKSFDSWGHRF 
AA \ QRLMDNQ AERES EAG VGLQRDEDDA 
PLCEDVELQDGDLSPEEKIFLREFPRLK 
EDLKGNI DKLRALADDIDKTHKKFTKAV 


10745 

» r 


24646 


A 


10830 


351 


3 


LH FS PFLQHRQNI K VWLMRAS MQRHS RT 
HGAFPLHQDEIQTRPRIiRSAASSGPSIjIj 
SDHILSTLPAFTCANLSSRFCSSSPCSC 
SCIiRA/CCTCHFDLSTLIHAHTRTHTHT 
. HLtLlmi 


10746 


24647 


A 


10831 


8 


376 


GMLPAHLADVLRHNSVGRPKHMRVMAGA 


10747 


24648 


A 


10832 


336 


38 


GVATEGVGEAAQGGEPRQPEQ/ PPPQPY 
' PPPPQQQHEEEMAAEARQA/AGAPMDDG 

X7X CT T^C T5 C VT7T VGTYD RlTtaiM^TT^T T TT r\hJT T 

e La IiUb Jr 1> X V Jj I o LIKAa yUMJX Uii VljyiN V 

GPNPWQI I YSDKYTLWK 


4 A** .4 A 

10748 


24649 


A 


10833 


206 


1 


SSLRLECSGIITASCSLNLPDPNDPPAS 
ASOVAETTCATMY 


10749 


24650 


A 


10834 


226 


376 


RISQAIISFYFYFLFETESRSVAQAGAQ 
WSDLSSLQPPSPGV\SDSPALPS 


10750 


24651 


A 


10835 


213 


1 


DRVLLLSPRLGCSGMITAHCNLHLPWFK 
RFSCLCPPE/SSWDYRCPLP/PPRLTSV 
FLVETGFHHVGQAGLR 


10751 


24652 


A 


10836 . 


351 


447 


RENLWLT P VI P ALREAKAG I \LEPRSSR 
PAWAT 


10752 


24653 


A 


10837 


71 


472 


S ASTAPMAP VKKLWKGG/ K/ KKKQVLK 
FTLDCIHPIEDGIMDAA/NSTNYEQFLQ 
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OCysteine, D=Aspartic Acid, 
E^Glutamic Acid, ^Phenylalanine, 
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T=Threonine, V=Valine f 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *~Stop codon, A^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


■ 










■ 


ERIKVNRKAVflLGGVVTIBRSKSKLTVT 
PEVPFSRRYLKYLTKKYLKNNNLCDWLC 
IVANSKESYELRYFQITQDKKKKK 


10753 


24654 


A 


10838 


37 


382 


SRCIMAFYLYGIRSFPELWKSPYU3VGP 
GHSYVSLFVA3RCGVRNQQRLCSVKTMS 
PQNTKATNVIAKARYIiRKDEGSNKQVYS 
VPHFLIAGAGKERSQMNSQSEDHK/LA/ 
PVRNAV 


10754 


24655 


A 


10839 


313 


617 


AFFLI VFQYVARERERVSRS / VNGGLYQ 
HCYFRTL 


10755 


24656 


A 


10840 


309 


.511 


WEQGSWEHVLQAPCSLIAPRITHAYTHT 
HTHTHRYTHTHLYPH/ APTS XL 


10756 


24657 


A 


10841 


269 


450 


T PWP LKKE FF FFLKWS LALVAQAGVQ WR 
DLLGSSDSP\LQPSRFKQFSCISLPSSW 
DYR/PC 


10757 


24658 


A 

- 


10842 


420 

• 


1 


LERGENGTRRDRRKGLSHCHQPMDSVLP 
PLCHPPPLIjVTMEEEITMLFIDIGSSMY 
KAGFAG / DD AS RAM F PS I VRCP WHHGVM 
VGMSQKDSYVGDEAQTKHSIIiTLKYPIK 
HDIITNE\WDNMEKIWHHTFYNKLHVAP 
CI 


10758 


24659 


A 


10843 


431 


1 


GEHSWASDLAEDVTKVTRGPLGPLFWGL 
CSGNLSFSGCVPGLPAGAVPPWVPVPFQ 
GGA/SWVWKGPS PWTIjHLLLRTWGLAGG 
VGGRSLGRWRAW PGNPGS QGQGAPPAHQ 
ATGTPRSRTGSTGGIETVTIIiEGSHVSG 
MGIR 


10759 


24660 


A 


10844 


55 


297 


QRW PGTLLGVQPG / AP PDSTS ASGSGGA 
RGGPVPTLEGGNTGSRKWEDPCWGSQNS 
PSSDGSKPPPGPLTSKVCDDSRLSE 


10760 


24661 


A 


10845 


109 


427 


QTGPSAAGLLEFARGPLQTLFAWVPAAV 
AAEQQ I F VNRECCCL I VPLB FCLRGVPC 
CVRCQS A\ LLGGASQLG \SRGSGVRDPL 
EGGSCPFSDLQLHAARTTALFTA 


10761 


24662 


A 


10846 


17 


416 


SFLWKFCLREVPGRARCT»RA\LLGGASQ 
LGY SGVRDPLEEAVCP FSDLQLRAGRTT 
TLFKAVRQGHLSLQRI LLPFVWQCPAP I 
GGVYRDRQASLS CSGLHPIRA/SRAAVP 
TQASAMAGAPPPDSLPPCSLSSN 


10762 


24663 


A . 


10847 


265 


429 


LSGAS CCLFSLP I VLFAT iT 1T1LS VLSFS F 
C^CVPSWSLSPTLFLTFVLHFL*RPIS 
CF*VNSNPKNLTFMVNFQGKGMKIGtAL 
ERICICGCSLVFPEAILETSCQNLFCTY 
ACAGVLSSVYDYLALLAVSFLFL*FFSL 
FFFFLYYLFLFALGASLPGLYLIiLYFLL 
LCCISYSLF 


10763 


24664 


A 


10848 


12 


462 


QTLGTKMNEGLFATFI APTI LGLPDALL . 
I ILFPPLLI PTS * YLINNRLI ITQH*LV 
KLTSKQM I T I HNTKGRT* SL I L I S LX X I 
IVTTNLLGLLPYSFTPTTQLS INLAMAI 

PL*AGAEVIGFRSKIKNALAHFLPQGTP 
TPLIPILVII 


10764 


24665 

* 


A 


10849 


2 


462 


TTLHAFGTMKREAF I TLLCLFTS ANSRG 
VYARDAHKSEAAHRI KDLAEKDFLALVL 
IAYAQYLQQCPFEDHVXL*NEVTBFAKT 
CVADESAETCDKSLHTTiFGDSLCTVATL 
RETYGEMVDCCAKHEPVRYECFLQHWDD 
CPNLP*WRPEVD 



T 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 

v ro oik.? 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

• * 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=*ProIine, 
Q=Glutamine, R-Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *~Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10765 


24666 


A 


10850 


164 


1 


RGFQVRGE *NSGRETER *RERE * KLARE 
RAIGQEREREKSREAGRERERERKKE 


10766 


24667 


A 


10851 


417 


100 

• 


MMLWGNLETRLWSQAQTIPCTKQR5SL 
FFPVLNAAI FRS KS KVLNCLE YFLGNTL 
FSWDLIPEILFPLSHYHHYHHHHHNHQY 
*KS*QWSASNTRFQGILDKNSY 


10767 


24668 


A 


10852 


339 


3 


ETTHS EARRGRS AAAS CRGSALRRGRFP 
ESRRGREAAPVCPRHVLL*GAQSKQAAV 
AGKRSGTRHASRWPKSLFT PRRRRIS LK 
RALHFWQQSADPSPSVSRIPPHGVGSS 


10768 


24669 


A 


10853 


1 


373 


LQQKGMRKRAGQSEMAPAGVSLRATILC 
LIAWAGLAARDRMYLHPFHLVIHNEST* 
E QLTKANAGKP KDPT F I PAP I QAMTS PV 
D EEALQDQLVLVAAKLD TVD KLMAAMVT 
MLAI FLGFRI YG 


10769 


24670 


A 


10854 


1 

■ 

- 


423 


VSCSFLKLKTMKHGLLLLLCGFLLKSQG 
VNYTEEGFFRARGHRPLDKKREEAPSLR 
PAPPPITGGGYRARPAKAAATQKKVER* 
APDAGGCIiHADPDLWV* S P TGCQLQEAL 
LQLERPITNTVDELNNNVEADSQTSSSF 

h 


10770 


24671 


A 


10855 


343 


3 


RGCEAHPLPRSEGPAGSALAQPVMYCTI 
FAGTLITAISSH*FFT*VGLEINMLAFI 
PVLTKKINPRSTEAAI KYFLTQATAS 1 1 
LVIAILFNNILSGQ *TLTNTTNQ*SSS I 
I 


10771 


24672 


A 


10856 


147 


1 


TRTPTGQCVSPKSMFU3AVAHSCNPITL 
GG*GRRIT*GPEFDPSLANMV 


10772 


24673 


A 


10859 


189 


422 


NHTMDDFERRRELIRHKREEMRLEAteRl 
AYQRNDD DE ERAAR * RRRRARQERLRQK 
QEEES LGQVTDQVE VNAHNS VP 


10773 


24674 


A 


10860 


90 


273 


SHQE I EQNSAMAPRKRGGRGIS FMFYCL 
RNNDQRYMT*RL* SGIGWMLLSSGRMGY 
ALPG 


10774 


24675 


A 


10861 


25 


411 


APCAKPCGDWRSRGSLVWAMSGCNARKg" 
DCCSRRCGSHL*N* I PTDWPLN* FFLTS 
AKVKECFPKKESIYSQTVY*SPG*KM*T 
*DKEHPRYLIP*LCIQFYHLCWVTGTGG 
GI ILKHGDEI YIAPSGV 


10775 


24676 


A 


10862 


33 


302 


SRRATLIYVDMENGEPGTRWAKDGLKL 

SLPIATIRALGTVNRATEKFVKTNGPLR 
Q*QPRS 


10776 

» 


24677 


A 


10863 


1075 


1521 


YCHTGKGEQLGERFCEGVSRRGPAERGS 
DSQTPWPWPLCAAAGTSAGTSHSGSSSG 
AF$S WP CWTAAE PAAR KRGRPAGS WSS P 
ATGAPGRCRHRI LSRGAGG S AG FVCSGL 
AESGL* ESSSPGRSQG * PQGQRHPQPNG 
liPAPPSTSV 


10777 


24678 


A 


10864 


245 


418 


TS KLAFP I S IP VI YANKVCP * FSKKKKK 














KKKKKKKKKKKKKKKKKKKKKKKKKKVF 
FFEGPGFFF 


10778 


24679 


A 


10865 


132 


398 


LNMGKGDPKKPRGKMASYAFFVQT*REE 
HKKKHPDASVNFSEFSKKCS *RWKTMSA 
KEKGKFEDMAKADKARYEREMKTYIPPK 
GETF 


10779 


24680 


A 


10866 


115 


455 


LLTRNMDRLIilloAGGMPGLGQGPPTDAP 
AVDTAEHVY I S YLALLKMLKKGRAGVPM 
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SEQ ID 
NO: of 
nucleotide ( 
sequence 

• 


SEQ 10 
NO: of 
peptide 
sequence 

* 


M, 
eth 
od 

• 


SEQ ID 
NO: in 
,USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

■ 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
IXJIutamic Acid, F=Phenylalamne, 
OGlyrine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methtonlne, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /= possible 
n ucleotide deletion, ^possible 
nucleotide insertion 














EVMGLMLGEFpDDYTVRVIDVtAMPQS * 
TGVSAEAVDPVIQAKMSDMLKQTGKPEM 
D 

* 


10780 

- 


24681 


A 


10867 

• 


64 

• 


483 


QGERPAAAMKI *SLTLLSPLLLAAHVLL 
VEG KKKVKKGL Y I KVDSE * KDTLGNTH I 
KQKSRPGNKGKFVTKDHTNCTWAVTBQE 
EGISLKVECTQLDHE FSWAYAGNPTLCL 
KLTDE I VYWKP VARNLRLQND I M I Y S PQ 


10781 


24682 


A 


10869 


229 


1 


DTTILKliGRLIILQWLLSEGKSHMSFTL 
NQK*EI IKLSKECMSKAELSQKLGCFCQ 
VGSQVMHAKEMLLKE IKSAT . 


10782 


24683 

* 


A 


10870 


934 


515 


KKVLLCLPGLGCGGVILAHGSLALPGSS 
SLHLSLQSSWGLTGM*YDARLIFVYFFR 
EMGSCHI CQAGLNS YNSS I PPTLVSQNV 
GTTGVSHRAQLAPVFCFCFVTLVDVKAL 
HFGKQENRIFFFFILFPGGSRDEDRVA 


10783 


24684 


A 


10871 


373. 


424 


EFKTNLANMG * CLKSQHLGRPRCANHMR 
LGVQNQLGQHGE I P FLUC 


10784 


24685 


A 


10872 

■ 


1 

• 


469 


RSRSGDSLPACDRPSGASALATCETIFS 
AISCFWDLPAPSLRLTPSCQPTMSSQIR 
QNYSTDTEAAVNS LDNWYLQAS YTYLCL 
GFY FDRDDVALE G VS QF PRE LAE ENREG 
YDRFLKMQNQRGGRAIiFQD IMTPAEDD * 
GKTHNAMTAAMGLETK 


10785 


24686 


A 


10873 


244 


2 


N I YS CKETFSVPLLA I HVYHFLVGRGGS 
QGTEMLWHRVDLRYREQAGHSGSHL* SQ 
HFGS PRQVDQLRYGVSDQRGQHGE 


10786 


24687 


A 


10874 


83 


427 


ISLNMIRIAALNASSTIDDDHE* SFTRH 
NTQTKEAQEAKAFAL YH * AXiDLQKHDRI 
E ESAKAYHELLE AS LLREAVS S GDENEG 
IiKHPGLILKYSTYKNIAQIiASQREDIiET 
AME 


10787 


24688 




10875 


94 


1 


KSQKACNPSTLGG*GGWIT*AQEFTtfgE~ 
ANT 


10788 


24689 


A 


10876 


2 


413 


GVTRG FNMRIEKC YFCSGP I YPGHGMMF 
VRNDCKVFRFCKS KCHKNF KMKRNPCKV 
RWTKAFRKAAGKELTVDNS FEFEKRRNE 
P I KYQLEL WNKT I DAMNRVEE I KQKRQA 
KFIMNRL * KT* E LPKVQD I QEVQLN 


10789 


24690 


A 


10877 


65 


417 


RFAGAGAI PEARAWPTDVHAAKEEKEMD 
LP DLAS RVFCGR I LS MVNTDDVKAI ILV 
QKNMLDRFE KTLEMLLNFNNLAS ARLEQ 
MS ERLLRHTRTL * DMKPDLGSLFRPIRT 
LEWKL 


10790 

- 


24691 


A 

* 


10878 


4 


442 

• 


APTPDAMGHFTEEDKVTITGLWGKVNVE 
NAGRETLGRLLVDYP WTHRFFDS FGNLT 
SGSVIMGNPKVKAHGKNVLTSLGDAIKH. 
IiDDLKGTFA* LTELHCDKLDVDPENFKL 
LGNELETDMAIHFGQDFTPEVYAYLQKM 
VTWAN 


10791 


24692 


A 


10880 


1 


419 


GKHIRQYHEEKETGQRlNIHEYYLCSNML 
AKNLLFEKEREAEEKEKSYEIPTKNIQG 
QMTP YYPVGMGNGTP CS * KQNRPRSSTV 
MYICHPESKHEII»SVAEVTTCEYEAVIL 
TPLLCSHPKYRFRASAVNDI PCQSLPG 


10792 


24693 


A 


10881 


54 


335 


REIFTMSGALDVLQMNEEDVLKNLAVIT 
HLCCTKTDS PMEQ IIYGS *TE YI YS IKL 
NRT* QNLMLAVP * NYD I *NHDEVTVMTF 
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SEQID 
NO: of 
nucicuuuc 

co fin pn pa 
Self UCJJl>V 


SEQID 
NO: of 

• 


M 
eth 

ou 


SEQ ID 
NO: in 

09/515 1 


Predicted 
beginning 

n tiff pfktirlp 

II UIICUUUC 

Incstinn 

correspond 
ing to first 

-ft m inn nrtrt 

•<l 111 lit v flviu 

residue of 

peptide 

sequence 


Predict- 
ed end 

niif*lA_ 
vuuw 

location 
correspon 

Hinp to 

last amino 
acid 

residue of 
peptide 

>CU UCULC 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 

TCssf^liitsimir* Arirl PWPhpnvlntaninA 

£T^JIU VI 1IIIV /1iUU| M A llvll T IHIttUi 11 

G=Glvcine. H—Histidine. I^lsoleucine. 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
O^^GIutamine. R=Ar(?inine^ S=S€rine« 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 

llUvlvtlUUC I1I9CI IIUU 














NNTLQMFEP 


10793 


24694 


A 


10882 

• 


216 


3 


MS I KS MS I CCLRKKGKHGFQ I RGK* NS G 
RETERERERERKRARERARGQERERQKT 
SEGGRERERERKKE 


10794 

■ 


24695 






3 




flVT .VP G V"H IC CCl CM Q ZV IC OPT T W T TnZiPFT. 

KGQPRGGVEEGPTVLIKAGLLEKLKEHE 
r*n\ rNrn Yon t ,p f an t pnt> q a ft? t vtcntpp q 

VGKASEQLAGKVAEVKRSG* ISLVLCVN . 


10795 


24696 


A 


10884 

i 


3 


375 


S TMRAW I FFLLCRAGRALAVPQQESLPD 
ETEWEETVAEVTDVSVGANPVQGEVGE 
FDDGA* ETE * D WAENPCQNHHCKHGKV 
CEIjDENNTIjMCV CQDPTSCPAL IGBVEK 


10796 


24697 


A 


10885 


95 


2 


KCTQG P S AVAHACN PNTRGGRGG W I T * G 
QEF 


10797 


24698 


A 


10886 


1 


114 


MGFHHVSQDGLDLVTL *GACFS LPKCWD 

VDDTPTyTTJT.RlI 
I KKiS trl rlliArl 


10798 


24699 


A 


10887 


1 


368 


E PTME L TTAE KTNG KAAWRKVGAHAGE Y 

GAEALERMFLTFPTT*TYFPHFDLSHGS 

AHVKGHCKKVADALTNAVAHVDDMPNAL 

S ALNYLHAHKLRLD P VNFKLLSH * LLWT 
T C7SUT tT\n?T? r r 


10799 


24700 


A 


10888 


336 


9 


FRGQVI PAI FFPFQTTSLSYFQTGQPN* 
TPFFQKI LKLTRGGGKLLDFQTPGRIiRQ 
KREDPLFPGV*NCN* P *SHPWWGTKQNP 
v a ajuuvKJUjix i o o ± r v riK vsnoirx 


10800 


24701 


A 


10889 


3 


362 

* 


GFIiIAPLEMQELGTPMNRILQtTtAfiQE 
TFLTPALLLP I PHQTYSTASAVPLAKPD 
T * PKD VG I LALQVHFPAQYEDQTDLENY 
NNVKALKYTEDLGQTLMGFCSVQEDINS 


10801 


24702 


A 


10890 


66 


333 


TLPGNIGISFVERVMEVLRPQLIRIDGR 
NYRKNPVQEQTYQHEED* *DFYQGSMEC 
ADEPCDAY*VEHTPQGFRTTLRAPS*LY 
TPIVG 


10802 

i 


24703 


A 


10891 


405 


49 


I PWMGTERTRASAQRDVAGPARNPACPL 

WClf* P Dn D a FWTfl PT .TP VXVZ C* T .P PfWF 

ERHGPGCWGPRERIiSSGQTHPPSPQGCA 
NTCPCHKHQQELLAGGVAVFYKLFYRYG 
NSPKWDL 


10803 


24704 






2 




PODT .PVT .WTZiWT.ZlAM2VPCiQPTQT.TiT .APA 

LICLPWLQKAGAVQTVPLCTLFYHAMIjQ 
THRAHQLVIDTYQEIiEETYI PKDHKYSF 
LHDSQTCL* FSDS IPTPSNMEETQQTSN 
L*LPDIPAPGFCPDIiPAIiASKGWCRPNR 

PT .PP YAPNAfiRAPTCM 


10804 


24705 


A 


10893 


3 


334 


DQLPEPLKVLWTAHLLAMAPGSRTSLLP 
P YALL * LP WLQEAGAVQTLP VSRLFDHA 
MLQAHRAHQLAIDT YHE FDETYI P * DHK 
* S FLHDS QTSFCLPDS I PTPSNMEET 


10805 


24706 


A 


10894 


3 


341 


LLTPGVS DAICY I LD5GYI IMSDTFTAY 
VIG*RFEWG*HATVRFAAVVPPVALPW 
LGV*WDNPERGTYDGTHEWTVYFKCRHS 
TGGFPIRSNKVNLGTDFVTADKNLYVLD 
Y 


10806 


24707 


A 


10895 


25 


351 


AMIQTRDLQGGRAFGLLKAQQD*RLDEI 
CTQLLDDLKYSNDEDTjPSRLEGFKEKYM 



1206 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

— 


M 

etb 

oa 


SEQID 
NO: in 

09/515,1 
*>< 

• 


Predicted 

beginning 

nucieotioe 

location 

correspono 

ing to first 

amino acid 

residue of 

pepnoe 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acia 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 

OCysteine, D=Aspartic Acid, 

jLf — V7iuiamic Acio, r— i nenyiaianine, 

C=Clycine, H=Histidine, I=Iso leucine, 

iw— L>ysine, j-r- jueucine, ivi = iYieunoniue, 

N=Asparagioe, P=Proline, 

Q=Glutamine, R=Arginine, S=Serioe, 

T^hreonine, V=Valine, 

w — i rypiopnan, i — tyrosine, 

X=Un known, *=Stop cod on, impossible 

nucieoiiue oeietion, \ppossiuie 

nucleotide insertion 














E FDLIGNGHIDIMYLKLMLYKLAVPQTD 
LQLPRS IGEVYSGSGETISYPDFLM 


10807 


24708 


A . 


10896 . 

• 


364 


38 


rbUr J\iAVr UiUs.li'r JrUroFr rv^K'w'*^ 
LKFS KF FPNKRG FF PHPFF* KRGPFFCP 

KMGIKGPTPPPGFFFFFFFFFFSW 


' mono 

10808 


' 24709 


7V 
A 


10897 


136 


3 


T*GQQFKTSLTNMVKP 


10809 


24710 

* 


A 


10898 


193 


269 


GNKYCLTI LLLLLLLS FKRQSLTLLPTL 
E CSG * TRTPGPKGS S LLNPS * VAGHGGV 
CL*SQLIRRIiRWEDHLSPGV*GH8DPRL 

HHCTJj J WATtS * Do V5 w IVl'l"! 1"! X JL A 


10810 


2471 1 


A 


10899 


316 


374 


CCCCCCX:CCC*CCCCCCCCCCYKETNFB 
QQ 


10811 


24712 


A 


10900 


38 


206 


VYCVLVFTICTLLCNTSLGLFHPEFFFF 


10812 


24713 


A 


10901 


364 


2 


DYQHISPEKHCRPEGSGMVYLMCRKKKK 
RKKKNLSTKILNPQPSFIL*KPRNSARK 
S*QKBITKIGADSLIIBNRKRIEKIYIN 
ETMS WF I E I LNKI DQPLARLTM I ERKDS | 
TKFRNERG 


10813 


24714 


A 


10902 


132 


3 


ATSSPWGYSAYRVAI LiliFLYFSNKIiAFT 
VIiYGFV*NYFLQEI 


10814 


24715 


A 


10903 


234 


352 


NFCFFETGS CSVT*AGVT MAHCS LDLPG 
SSNSLTSVSEE. 


10815 


24716 


A 


10904 


250 


356 


TMEMM^DKKQI * A* FIj^+ FK^GfakAAET 

ittt XT TMMIV T 
TliNXttMAii 


10816 


24717 


A 


10905 


280 


322 


QT *SL I LVSLI I C I ATTNLLGLL P YS FT . 
PTTQLS INLAMAI PL* AGAVVIGFRSKI 
1QIALVLCTSTPLPTS TMLPVHMMDTSSD 
ikPKI *SIR*QTCIQHRLIiLWLKC 


10817 


Jilt D 

24718 


A 


10906 


271 


365 


FKT 


10818 


24719 


A 


10907 


135 


358 


LRYKLTNPKLTLLTCGLI FLKGNIVNIG 
QCNGVHM*YQHFGRPGQDCLSPGIQGQH 
GQHKESLSL* KSLKI SWS 


10819 


24720 


A 


10908 


107 


212 


T VKTEV'KJ/"' rD MT M TTT f T D 7\ T Ml? 71 *T7/"V»T»T5 

1 KNEajMGQ V KWLtM V V i yALiWKA* VUG I F 
EVSHSWL 


10820 


24721 


A 


10909 


165 


383 


PKNRP I I PLKRSS I SNPGDFQKS LRPGL 
TPMGPHFKRGNYSYDENLENLCPVGOJK 
G PGSH * GIiIjTCcTCKGF 


10821 


24722 


A 


10910 


119 


291 


GGYRFYVKDTF*K*GFWPLTLFI ILKPL 
AGHGGACLWSWRLMRLK*EDHLSLGGRG 
C SELWLCTLA* ATE * DPVS * KVRLLAIH 
FVYNT KT FS RAWW CM P W1ATHEAEVGG 
SliboviKSKJjv 


10822 

■ 


24723 

■ 


A 


10911 


283 

. ■ 


2 


SDNTTDTFLPFIYKYIFPITWEQGETWK 
KNPRNSRXjKJ^TLiRS BTIAQIPLKCNLiWP 
GMVAHACNP YTLGD * GGWVT* GOE FETS 
LANMARPSC 


10823 


24724 

* 


A 


10912 


1 

• 


360 


PHAFGTMKWVTFMSLLFFLSWANSRGGF 
RQNAPKS E VAHRLKDWGKKNFKAWG * MA 
LAQNLKQG P FENHGKLGKEVPEFAKPGV 
ADDAAENGDKWLNTLLGNYLAPVAAVRE 
TYGERAEC 


10824 


24725 


A 


10913 


270 


361 


S KTWPGTVAQTCN PRTLGSQGGW I T * GQ 
EF 


10825 


24726 


A 


10914 


157 


I 


VFFQLSRLAWKGFSRFFFVFETESHSVA 
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SEQID 
NO: of 
nucicOtiuc 


SEQID 
NO: of 
peptide 


M 
eth 

OO 

• 


SEQID 
NO: in 

0Q/S1S 1 


Predicted 
beginning 
nucleotide 

correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucie** 

At/if) A 

location 
correspon 
ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A B AJanine 
OCysteine, D=Aspartic Acid, 

XT— f2 1 ■ ■ TO m i r» A /»IH IT— Ph on via loninp 
— flivpi n<» TfssfTtci iHinA I=IcnlpnrinA 

V* \JWJ 1-1 II A A X-l 13llUIIICj 1 I$VICUUllV| 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Vaiine f 

TT~" X I jrJJLUJJIlUJl, I — JjrfUSlllCj ■ 

X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














* AAVQWRDLGSLQAPF PGFTPFSS 


« AAA V" 

10826 


24727 


A 


10915 


1 


357 


TPPKKPARKGEEKPGPFKRAITTI FVAR 

HTGAPNCPPC PliAAFP KDVPNFNPKMVE 
AVKNPIL 


10827 


24728 


A 


1 AA1 ^ 

10916 


102 


471 


PTTI FMCLYQEFI ISN*HLPTTQTTHLS 
LS FKLDYFLI I F I LLT FLVTWS 1 1 EFS L 
♦YINSNPNINQFFKYLLIFLITILILLT 

AXMnLirULif J.Vj 


10828 


24729 


A 


10917 


2 


138 


RE PTMVLS PADKTNVKAVWGKVGAHAG* 
YGAWLFTWLTLNVISD 


10829 


24730 


■it 

A 


10918 


178 


393 


LVIjCTVIjIjyiWRFHFHDSIjS IliKAQDVTE 
NLSNPTVljRGETGPSAVAHTSNPSTLGG 
♦GKWITQGQEFETSLA 


10830 


24731 


A 


10919 


278 


124 


TAWADWGSNETP FLLKLPKKLT * GGGVC 
ljEFQVi^RVRPKNPFNiAjNQGrN*PKFR 
PWTSTWGAKQNFV* KKKKKNKNKKP *NS 
KxiiPPF 


10831 

» 


24732 


A 


10920 


221 


243 


IIIGSPLIDIKISKC^LKIVAFGQAQWP 

V 1 FAJjWi&AciAvKa r*c» Vtyb F Kir A * Fvyo LiA. 

VHVDNNWESIN 


10832 

• 


24733 

. 


A 


10921 


33 


292 

* 


GTNWLE I RHAVE VQCVLVSELV I PTSGD 
ifT3nnr , *nuvT.TT* VT.TT/airufiT * * T<3na 

JVriSy^ w Url X J-i J. I X LlXJ-iUflWoX J. o Vji 1 

LE KKKKKKKKKKKKKKKKKKKNKI KTGG 
r j. 


10833 


24734 


A 


10922 


108 


3 


KLAVYGSVCL*SQLLGRLR*EDHLNI i GH 
GGCSEP 


10834 


24735 


A 


10923 


106 


2 


KLAVYGSVCL*SQL1^RLR*EDHXNLGH 
GGCSEP 


10833 


24736 


A 


10924 


1 At 

195 


2 


AaJjrivoyLtNKAr X i J_ir liVf Ira 1vtoJL> t 
PMLVCS G VT TAH CNLHVLGSGD P PTSAS 
QVAETMVLi 


10836 


24737 


X 


10925 


127 


2 


YVCILKNLKGRPGMVAYACNPSTLGGQG 
GWIT*GQEFKTSL 


10837 


24738 


A 


10926 


639 


AAA 

282 


OUT O T V/SUTJ/Vl PMD T TT3TDW/*T VOTPrHXT 

Ff LoIKGWVOyr MF i. I F X FWOJbKyilUnJj 
RPGLRDQPVQNS KTPSLLKI * KLARRGG 
ACL*SQLFRKPR*ENCLSTGELKPKGGI 
F IRLVQNKIPTTRGE KRKQSHTGS YQCP 


10838 


24739 


A 


10927 

# ' 


1 


363 


AL LTQ ALTCRQ AGAEAPI IAGATP SLM P P 
SLPQGFRDCSPSDAAYTMEMTIDKKQNQ 

ANKGTVQ WRFKNFCKRDESREDDE* YAQ 


108*9 


24740 


A 


10928 


53 


379 


teaelltlyllpnallnhftspplmfad 
rrlfctnhidigtlyllfga*awlgta 
lslliraeix:qpgnllcndhiynvivta 
hafvi i ffiglp i i iggfgn*lfpl 


10840 


24741 


A 


10929 


180 


350 


E PMAKGKTES PGP KRCGP* I * WVI SQRG 
TLRFRGAGLF FMGE FLRLGKNLLEI PRG 
A 


10841 


24742 


A 


10930 


361 


379 


RRYWWLG* VQWLMPVI PALWEAKAGRS P 
KARNL 


10842 


24743 


A 


10931 


121 


543 


HRNTGSTHASAHAYHIVHTNP* PLTGAL 
SALLMTS SLAMG IHFHS ITLL I LGLLTN 
TLT I YH * WRDVTREST YQGHHTP PVQKG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M, 
eth 
od 

■ 


SEQID 
NO: in 

¥TCCKT 

09/^15,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to tirst 

amino acid 

residue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
it^ijriutamic Acid, r— .rnenyiaianine, 
G=Glycine, H=Histidine t I=Isoleucine, 
K=Lysine, L=Leucine, M=Meth|onine, 
n^Asparagine, j^=Jrroune, 
Q=Glutamine, R=Arginine, S=Serine, 
1*= i nreonine, V s valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \ppossible 
nucleotide insertion 














LQYGIILYIT* * VFFFAGFF*AFYHS IL 
S PT PQLGGHWS PTG I T P INPLKDP LLNT 

s 


10843 


24744 


A 


10932 


227 


339 


VGG VXS VQLLLNCHFSR* MKKKKKKKKK 
KKKKKKKKKKKKK 


10844 


24745 


A 


10933 

■ 


64 


409 


DQRNKSAHLRAHLKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKRGEL* KKIN 
WAFFLPLLKRKFVGENFLKKNFFWGGNI 
GGQISSKKxKGGGK 


10845 

■ 


24746 


A 


10934 


2 


362 

■ 


NKLSGP YPEKVG YTLPVLMNPLAQLVI Y 
STIF AGTljlTAliSSH*F FT*VGIjEIDWXj 
AFIPVLTKKINPRSTEAAIKYFLTQATA 
SII1iIiIAILiFIINILiSGQ*TITSTTSQiS 
SLIILRAM 


10846 


24747 


A 


10935 


154 


3 


GCPLSPLLFNlVVKVLVRAVMO^Tir" 

TAT/"»T CTTTT*e»T T T1AMMTT VT /"*XT 

IQIGjjEVK* SJULTDNMIIiYlASN 


10847 


24748 


A 


10936 


250 


363 


TAWPGYLYSFSFLY* ETTKI WLGAVAHA 
CDPSTIjGGRG 


10848 


24749 


A 


10937 


199 


339 


NVLGILIiVFRI I VSI YKCGRLWLGTVAH 
TCNPSTLGCQRGQTT*GRE 


10849 


24750 


A 


10938 


130 


240 


KNEQDPRDL*DNDKWPNIHVlSVPgSSlC" 
DNGTERVFD 


10850 


24751 


A 


10939 


35 


235 


FILVENTKKMCICPLLNMIYAILYIiFVP 
SVFL*EENKQ*GITEKKKKKKKKKKKKK 
KKKKKKKKI KPGGL 


10851 


24752 


A 


10940 


179 


12 


DQPGQHNKTLSLQENKNKSS WVQCCV * P 
QLLGRLRWEDCLS PGGRSCEEPCSHSG 


10852 


24753 


A 


10941 

• 


297 


278 


INQDNE LTL INQSS KQ I KHKN INQTLRT . 
KMNENL FA* I IAAT IIX3I»SATVQI ILF P 
PIIiIPTSKYLINNRLITTQQ*IiIKLTSK 
QtolTIHNTKGRT*SLLGGD 


10853 


24754 


A 


10942 


3 


318 


FGGGRGVRNYPDAFVLSVLLPSRLLFPH 
LFPHLFPLPPFLGLAPYCFLTL* YF* K* 
♦LYYSQS I IFFFYREMKNKLLVFWAINL 
FFLYYYNYMEMWTL I CYWKN 


10854 


24755 


A 


10943 


190 


3 


KLSHKQSVYFKTPFI*KNHVLFK*KNLW 
VYPNDKFFFFFFES LALS PRLQCNGT I L 
AHCNLR 


10855 


24756 


A 


10944 


281 


1 


KKWLFSSYSSLYGNGLFLI PPFLSvSBS^ 
GKGFWKKFFFLRARGALFWGSPLKGFFL 
GFFWVFFFF* IXiVSLLLPRLECNGI ISA 
HCNLRFPGS 


1 AOS/ 

10856 


24757 


A 


10945 


169 


319 


NGVEDAFKNMVLGWAQWVTSVI PALiWEV 
KVGGS PEVRS FRPTRPI * KYGFRLGTVG 
YICNPSTLGGQGRRIT*GQEFQTNPANM 
VKPC 


10857 


24758 


A 


10946 


312 


1 


FARPGLLK5 WDFQP * PLDPVWGFFFKRF 
PENWLE FE I FP FL 1 1 MAKRKKLS KSLFP 
c jrrxAj r ixj-/ uy iv *. «r r rrrsi vouuDrfli 
LECNGAI SAHCNLCLPGSSD 


10858 


24759 


A 


10947 


208 


329 


I FTGDRS RNNR I G + ARWLT P VI PSLWEA 
EAGGSLEPKSLR 


10859 


24760 


A 


10948 


116 


374 


F YFGYIoLFFFCFFEMAPCSRG* SWSARG 
QSFCNIilFPGSSDLYVSAFRVARITGAR 
DHACI I IVFWDGGCVDTITSGLAGWLR 
RG 


10860 


24761 


A 


10949 


264 


434 


QLAFCTDTLTS VREQCEQL * KCVKARKR 
I ELCDEQ * S CRSHTD * DCTDELFD FLHA 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

• 


SEQID 
NO: of 
peptide 
sequence 

■ 


M 

etb 

od 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- - 
otioe 
location 
correspon 
ding to 
last amino 
acid 

residue ox 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
H=Hjrlutamic Acid, 1* -Phenylalanine, 
i*— oiycine, n=Histiuine, JF=lsoieucine, 
K=Lysine, L^Leucinc, M«Methionine, 
N=Asparagine, P=Proline, 
Vf=vjiutamine, K— Arginme, J>=aerine, 
T=Threonine, V-Valine, 
W=Tryptophan, y^yrosine, 
a— unknown, w =otop codon, /—possible 
nucleotide deletion, ^possible 
nucleotide insertion 














T 


10861 


24762 


A 


10950 


226 


429 


NFGFRLDKTVFFS KI RINAWAQ W FMPVM 

taCT liTCH VT\ /"V"« TT«* 7\ /"\T7T WfT PVTTTTT1 TIIVT 

PbljWgAKAGGlT^AQBljgl b JboN JL VKJPw 
LYKKFKN*VWWC 


10862 


24763 


A 


10951 


188 


433 


YSAE WKI DLG I EVFWVGKMTYKQ * FFWF 
VFKFLFF* RQGRTLSSKLBWMS V ITGHC 
NLDLIX3S RETFASAFQVTWT IGS CHS 


10863 


n An ^ A 


A 


10952 


f% #*V 

298 


A A e 

445 


LFS KCSSKS I KEEG PGWAHACKPGTLG ■ 
GRGGWIT*GSEFESSLANMVK 


10864 


0% A W & A* 

24765 


A 


10953 


244 


1 


KT PQGLWFCLFKKS PLKGI KGVSMGVL 
ICFFLKSRINLFNPRGLGQRKNFFFFFF 
• ETDSYSVAQTGVQ*HNLGSLQPPSC 


10863 


24766 


A 


10954 


199 


3 


QETKKEQNKENKQIK*RSTRKKHRQGTN 
KTKBRGERQTPPVGNRQTPTLGIHARPR 
RRATTSPRA 


10866 


24767 


A 


10955 


157 


3 


YSYTFSFI ITTS ILI I Q TL SGHGGACL * 
SQLLGRLRQDNHLNLEHGGCTRA 


10867 


24768 


A 


10956 


263 


357 


GLSLKVLTQWLGAVAHACWPSTIjGGRGG 
*IT 


10868 


24769 

■ 


A 


10957 


333 

• 


2 


KWKADMLI PLVI KSFFLGI FFKMFTNSK 
RYI FLLQLIAQHSS F,FSFIRCMEL*WPY 
PIPLCYGQSNQPSV+LAYCRDLFNELIF 
FFFFETEFHS CCPGGI SAYCNLCLS C 


10869 


24770 


A 


10958 


130 


228 


gssflggs psvaqagvhlpdhgslq*dk 

SPSVQKMSKWIGCSGACL*SQLLRRLRC 
QGRLS p*gqacsep* SGRCTPAWATEGD 
PPKKLLPYTTDS 


10870 


24771 


A 


10959 


250 


3 


VQACS I TGALYQRRNAEDE PQTARP ISG 
FTTS IAMRIiMLLTCSGHIWP VADLAFSG 
IMP * GL FF PSRQNIiALSPRLECS ARA 


10871 


24772 


A 


10960 


199 


326 


VLGRMWSNQKNYTLIiARM*KIK* *NNTR 
CWGGCX3ATRRIIHC*QECRI*KCQPQSL 
WRTV*QFLNTLNIR 


10872 


24773 


A 


10961 


323 


3 


LLFFHLPDNWQH*YVFCHYNLSFLQFYI 
N*IIQLWFCVWLLSM3IILKSHPCHFS 
K* LS TVTFFFRQSRS VSRLE WSGAI SAH 
CNLCLPDSSDSCASAS*VAGSC 


10873 


24774 


A 


10963 


120 


3 


PFFFFFF * EMRSCFVAQARVQ* CDQSSL 
* PSTPGlilCQSSC 


10874 


24775 


A 


10964 


168 


441 


LTPVI PALWEAEAGRSPEVRSSR* SPP 


10875 


24776 


A 


10965 


19 


207 


APLKLNVAMELS LGQ WDVSRSNLWB X PL 
KKGDTGRVQWLTPVI PALWEAEVDRTPB 
VI bvl KC* PIjKIiNVAMEIjSIjGUWDVSRS 
NL WE I PLKKGDTGRVQ WLTPVI PALWEA 
EVDRTPEVTSVTRC 


t AM/ 

10876 


24777 


A 


10966 


209 


1 


1 KoAliSIfM VvTTxiIGJjUKNFCI VLi*DRV 
SVTQIGVQWHDLSSLQSLSPRIiKDPPTS 
AoKVALj JL luMrlLV 


10877 


4M 1 to 


A 






i 
i 


DYLRSGV* dopgohgktssllkiokxar 

LVGPIISC 


10878 


24779 


A 


10968 


141 


2 


GLAMLPRLENI I FRPGTVAHACNPNTLG 
GRGGQIT+GQEFKTSLSC 


10879 


24780 


A 


10969 


175 


348 


LVYECKHILCVFNTSIiFFPH*TCYK*RG 
VLWLGBVAHACS PGTLGGRGGWIMASGD 
RD 


10880 


24781 


A 


10970 


108 


2 


KT KKLHMRSG WAHTCNPSTIXX3 * GGWI 
T*GQDSC 


10881 


24782 


A 


10971 


184 


2 


SFLWKLHLRGVPGRVRCQSAPTRGASQL 
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SEQ ID 
NO: of 
nucieoiiue 
sequence 


SEQID 
NO: of 
pepuue 
sequence 


eth 

Ou 


SEQID 
NO: in 

I1CCM 

09/515,1 

• 


Predicted 

beginning 

nucleotide 

location 

correspona 

ing to first 

amino aciu 

residue of 
peptide - 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

j4 • run 

aing u> 
last amino 
acid 

residue of 

pepuue 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
n< — Glutamic Aciu, r— Jrnenyiaianine, 
G= Glycine, H=Histidine, ]=Isoleuctae, 

iv- juysine, JLT-Jueucine, ivi— ivjetnionioe, 
N=Asparagine, P"Proline, 

v — vriuxanune, k = a rginine, y^oerme, 
"^Threonine, V=Vallne, 
W«Tryptophan, Y«=Tyrosine, 

X=Un known, *=Stop codon, /^possible 
nucieoiiue deletion, v^possiDic 
nucleotide insertion 














LFLV 


10882 


24783 


A . 


10972 


245 


339 


SQ I SGPGTVAHACNPS I LGGRGE WI T * G 


10883 

■ 


24784 


A 


10973 


295 


340 


KRVFF * ARKKNP FLS PLWGLTLENPLG F 
Kbr LiN r rlAibivloyr r r FKyi FLiAr r r 
WGTLPRPLNLKFGPFFFFFFFFFFF*DG 


10884 


24785 


A 


10974 


186 


340 


RP YLT I RFKGHPPNEP IQ * S FFVLFCY G 

cnTTJODoirpbT prinp OXTO 7\ T7PVTT 


10885 


24786 

1 


A 


10975 


253 


491 


FLECTSLLKRQRVHPGQHSKAPPTKBLQ 
TTGRGGALLQSHLIiRRLRQEHCIjGPGVG 1 
b X 5 a F *IiflHCTFA W V irXjUS SKK 


10886 


24787 


A 


10976 


141 


1 


S FS FFKFS PTGDM I GFF * HFFF FLRRSH 
S VAQAGVQ WCHLS SLKPRA 


10887 


24788 


A 


10977 


3 


138 


HEETGFHLVSQDGLNLLTS*STHLGLPB 
CWDYRRE PPPGREGDF 


10888 


24789 


A 


10978 


186 . 


323 


YESRSATQAGVQ WCELGSL* PSTSRFQ* 
FFCLNLSSTWDYSGLAPS 


10889 


24790 

• 


A 


10979 


183 


3 


IKILFPFFFLRIiSIiALLPRLECSGTFSA 
HCNLCLLGSSDSAS * VAGITGTTTPSPM 
QLV 


10890 


24791 


A 


10980 

• 


309 


55 


KNFGPNWKFLGGKGGEMAFLGKFFFP f 

ffffekeswfvfqg*mqwgdfrslqgpp 
pgvtqisrlgkiggplfpkkkkkrergn 

M 


10891 


24792 


A 


10981 


mm 

637 


838 


sqhlgrpmrvdqlrpgv*dqpgqhgemp 
sllriqr*aghggthl*sqvlrrvrqdn 
crisggggcse 


10892 


24793 


A 


10982 


275 


2 


rnrtlkmeflswfwgfwnwllnmirpkn 
vkdsts ksmentds pwhelfkelgkina 

FDTPDSLLVRGKFSDSIHNTFDHM*RTK 
EYNEARA 


10893 


24794 


A 


10983 


98 


346 


GHGHATLRGLCVLTFSFHITALSVSGTN 
DAEDCCLCETQKF I CG YIERNLLYLL I K 
DVCRVPAW*WERVYSLISRYSLWRD 


10894 


24795 


A 


10984 


30 


410 


LPEFTGRPKRTRTRGFSTNHTDIGTLYL 
LFGA*AGWGTAVSLLIRAELGQPGNLL 
GNDHMYKAIVTAHAFEIIFFIVLPIIIG 
GFGN*IiVPLI IGAPDMAFPRINNICF*L 
IiPPS LLLLIiASAIAE 


10895 


24796 


A 


10985 


141 


360 


QTLRT KMNENLFAS F IAPT I LGLPAAVL 
IILFPPLLIPTSKYLINNRLITTQQ*LI 
KLTS KQMI TIHNTKGRT 


10896 

- 


24797 


A 


10986 


3 


347 

• 


HE LRTKVHEHL IAS FMGPTS LGLPAALL 
IIL*PPLLIPTSKYLISNRLITTQQ*LI 
KLTSNQMITIHNTKGRT*SLILESLIII . 
I ATTNLIiGLLPYS FTPTTQLS INLAMAI 


10897 


24798 


A 


10987 


46 


317 


KSMTPIRKINPLIKLINHSLIDLPTPSK 
I SA* *NFGSLLGACLILQITTGLFtiAMH 
YSPDA*TAFSSIAHITRDVNYG*IIRYL 
HANGAS 


10898 


24799 


A 


10988 


245 


1 

* 


VSCLLEGKLTNRKDIHTKNPSVHHHHQR 
PKVDKTTKMGKKQNRKTGNSKKQTASPP 
PKK*SSSPATEQSWMENDFDELREE 


10899 


24800 


A 


10989 


255 


1 


S CLPWANW I S VQNQHKKTFLV 1 kTFFPl " 
GSFHVT*LLGPPLILISPPRIFFFFETC 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 


M 

elh 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A a Ala nine 
OCysteine, D=Aspartic Acid, 
E^GIutamfc Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleuciue, 
K=Lys ine, L=Leuci n e, M=M eth io n ine, 
N=Asparagine, P=ProIine, 
Q»Glutamine, R«Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














SFTVARAGVHSGAISARCNLRUiGSSNS 
A 


10900 


24801 


A 


10990 


166 


307 


GEKPWGPLIWGGLVGRSSWASCQVLELL 
♦EMVFRHVAQAGLKLLSSSNPPASASQS 
AGITGVSH 


10901 


24802 


A 


10991 


109 


330 


TNQ FKTKKEREAG KKKVKELERERERGK 
KRKEQRKNEKERR* P*NMD+RRIRBRNH 
S FD VYE FLI CGLLNLLHV 


10902 

i 


24803 


A 


10992 


244 


326 


GR YLKG WLDGPA* AVVLGTALSLL IRPE 
LGHPAIIiLGIDLISSVIVTAHAWSILLV 
RVI P 1 1 1 AGFGN* LVPL I IGAPDMAFPR 
INSISF*LLPPSLLLLLPPPI*QMRAGV 
GEREVRVR 


10903 


24804 


A 


10993 


101 


2 


KTFWARFVGTCL*SQLLRRPRQEDHLRL 
GGRGC 


10904 


24805 


A 


10994 


177 


1 


TPSLLFFVNI ICFCLNPGGGGYSEQKLC 
LCTP AWVTE * NS I SRPSQNNS RQS RQVN 
HLF 


10905 


24806 


A 


10995 


214 


326 


KEBRKNPRAI RW * PWGFFFPLFKEISL 
FLFVFRFWHGYPFPGAFFSTQKWLVF* T 
MVLFFFFFS * VSLLPRLECSGVTSAHCN 
TFLPGSSDC 


10906 


24807 


A 


10996 


184 


379 


LCGMVYHFPS I LLL YL* CKLF FSP KLE F 
SSCCPLECNGTVLAHRSLRLPASGDSPT 
SASRVAGHGGTCL 


10907 


24808 


A 


10997 


136 


3 


GRVDGQHIMTHQSHAYHIVKPSP* PLTG 
ALSALLMTSGLAM*FH 


10908 

• 


24809 


A 


10998 


274 


363 


PWAYAEPLTLHDATYSCT * SL IRASFTL 
FIA* TTRLRLLP* S FTPTTQLCMNLAMA 
IPL*AVAPDIGYSSMITNALSHLLPQCT 
PTPLISILGI IETVSLLIQPITIiGVRRT 
ANIT* CHLLMHLIGSATLSISTI 


10909 


24810 


A 


10999 

■ 


305 

• 


1 


NLHTTPSQIPLT* PLPNINIHNLHLTTT 
Q* I KYL I L I PSNLFHNKPNLKTAYS VPP 
SPNLHLPSSSNSPASASQVAGNAGARHY 
AWL I FVFLVETGYS LLV 


10910 


24811 


A 


11000 


154 


2 


IWGILQSRRCGSRL*SQHFGRPRQADC 
FTPGVPDQBYFTLLPEWFFLV 


10911 


24812 


A 


11001 


142 


367 


G VCL YQR I S I Y I KKTKKLS QGLF YFLLL 
LLLF* EGGSHS VTKLECSGGVSAHCNLC 
LLAS SHP PTSSSQVAGTTG 


10912 


24813 


A 


11002 


60 


385 


GNTLLSHTLRAHLIENLHAEFIDRTILG 
LPAA* LI ILFPPLLI PTF* YLINIRLMA 
NRH * L I KLTS ERM ITIRNS IGRT* S I IL 
VSL I I TIATANLLGLLP YS ITRTT 


10913 


24814 


A 


11003 


214 


83 


S KKSASP LTFN I VFQVLDNS VREBEIRY 
TEMEKEEIKLS*FVD 


10914 


24815 


A 


11004 


241 


368 


S YLL * PS P * PLTGALS ALLMTSGLPMRF 
HFHS ITLLILGLLTNTLTI YQ* WRDVTR 
ESTYQGHHTPPVQKGLRYGIILFITLBV 
FFFARFF*AFYHYSLAPTPQL*GHWPLT 
GINPLNA 


10915 . 


24816 


A 


11005 


3 


396 


HEPHALGMPLTADLPSMASCSQTSLLLL 
LHLLHLPWI*EARAYQAAGCSKLFDHAM 
LQ AHRAHQLT I DTYQEVEETY I PEDHKY 
SFLHDFQTSFCFSDS I PTPSNTEETYQK 
SNLELLRISLLLIESWLEP 


10916 


24817 


A 


11006 


156 


335 


FGCCCCFLFF*LVYCCYVMMLYYIYVLI 
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peptide 
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Predict- 
ed end 
nucle- 
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inn 4w% 

uing to 
last ainino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Add, 
Er^oiutamic Aciu, r— rnenyiaianine, 
G=Glycine, H=Hlstidine, I=Isoteudne, 
jv— Lysine, ju— jueucine, lvj^iYieinioninc, 
N=Asparagine, P=Proline, 
v^vjriui&ininc) iv^vvrgi nine, o^derine, 
T=Threonine, V=VaIine, 
W^Tryptophan, Y=^Tyrosine, 
X=Un known, *=Stopcodon,/=posslble 
nucleoli a e deletion, v-possioie 
nucleotide insertion 














Q TJfi V W * VUT M * VP VPT ,WHT VT A7T.17VT . 

LYMS 


10917 


24818 


A 


11007 


107 


2 


KCSSSKHFTKEDSQITNKHXEKCSS*LL 
VREMQII 


10918 


24819 


A 


1 1 AAO 

11008 


129 


A 1 

412 


K 1 K£>Gr KJv JL Fv»s2KIGtvi PIibKKTPKlTGP 
GGGALKPPFSGGFRPENPLNLRDKGSSG 
PKTPTFPSPWGKK* NFFQKKKKNLLYPG 
KIFFFWMKNI 


10919 


24820 


A 


11009 


63 


268 


CTFKFKNHCSTIQSTT*LDRSEKKKKKK 
KKj^KKKKKKKKKGGPL * KNFS LKGG 


10920 


24821 

* 


A 


11010 


154 


422 


NEFSFFFFFWKPKPPFFAQVGGQYRNWG 
♦LKPRPPWLKNFSGFASGEAGITGGVPH 
PG* ILVF*VKAGFHIENLVWNSETGNQP 
ACASQ 


10921 


24822 


A 


11011 


23 


375 


TRYLTKI KKKKKKKKKKKKKKRGAPFLK 
TPWGAPFFPGSAI FI FFFFGGFF* TPLG 
FFWETLFFWGGKILGHLSPKILPFWGKK 
NFFWVKGGKNLLNSPFLKI FFLGV ririut 
FFPPG 


10922 


24823 

* 


A 


11012 


151 


361 


ACTTTPSSFFSNIDTVSLCCPG*VPGHN 
NSSLINFPSFLRSWYYRHAPPHPAHFFP 
XL IRSR YVAQAGLE FIALRDTP I SDCPC 
TVHYP PSLAN* SVFFFFLTQSFVSLPRL 
KRK 


10923 


24824 


A 


11013 


146 


378 


NFWFSIEFLHICY*IYSFICI*IKVFL* 
*KKKKKKKKKKKKKPCKKKKKG 
GGKKFLGGGKNNS F FRNGGCF 


10924 

• 


24825 


A 


11014 


82 


264 


TEL FLYT I ONWSAI L YX LC * RNKVFLS F 
IOTEKKKKKKKKKKKKKKKKKKXKKKKK 
KKKKKKKKIERKI 


10925 

• 


24826 

- 


A 


11015 


381 

• 


2 


GRTALFTLLKGPIHRPQMFFTPLRVFPG 
S YTGIS IRQSGFFFI KR VI FFRL* * KAL 
FLLPFLLPFFSFFLKIGTYSVAQECNGM 
I IDN*NFE LLGRRNPSTS ASQ IAGNTSA 

V^UTV^T 1TBT pnnnr 

r*HTQLVrLiCBiXjI 


10926 


24827 




11016 


1 


251 


PNCI VHS LRTGTLNFI ILLLLCFTFE * P 
GAPLKKTPGGPKFYPASKKKISPQKGG 


10927 


24828 


A 


11017 


234 


1 


DINI KGS WVKGI PKPS VLSLQIiFCNSKI 
FPN* KLIKKKNPNEPGVMVHVCNPSTSG 
GRNGQMA*GQBFKASLftNMAKP 


10928 


24829 


A 

■ 


11018 


107 


342 


FQLCIGAIVHCFLFMKQHSEYXSTDHRS" 
SSKCHTVQGSSQLSTSAFKQK*THKKKK 
KKKKKKKKKKKKKK * KKKGGAS F 


10929 


24830 


A 


11019 


264 


1 


CFKP LLFKYPG 1 1 KAVNLP VS TASATFH 
KF* YVTFSFS *KKFLCVGGKTGSHSITQ 
AECSGTIKAHCSLELPGPGDTPTPTSRV 
AARA 


mom 




IV 


1 UY>(\ 1 


L 


Zoo 


PT*P(TT Q ^TJ A ATT ■ T5*t?'PT7P TTITKrft 

KlAw J, OiSAOKfUll Jj v EEC IT C (UUSvcrCIU 

NLPTTFGKLGNKKFPLNLFPGGPPLPHG 
GLKKGPGAPQVPPLFLKKPQTSPP*GLR 
EGP P PLKKFYLF 


10931 


24832 


A 


11021 


115 


287 


STCE I SFGCSRGCCWDYLMVWEFSHFYF 
* FLETECCS VA*AGVQWSNLS S LQSLHP 
E 


10932 


24833 


A 


11022 


186 


1 


RPKVDKTAKMGRDQSRKA*NFKNQKASS 
PPKEYNSSLAREQNWMENEFELTEVGFR 
KSVITN 
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SEQ ID 
NO: of 
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sequence 
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m 
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M 
eth 
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26 
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ntir ten tiff p 

location 
correspond 

inp to first 
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residue of 
peptide 
sequence 


Predict- 
ed end 

otide 
1 location 

corresnon 

ding to 
lost amino 

acid 

residue of 
peptide 


Amino acid sequence (A^AIanine 
C=Cystefne, D=Aspartlc Acid, 

lfc=fwliifaniir Acid IfcPhpnvfji Inning 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
Nf—Asna rapine. P*=ProIine. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonfne. V= Valine. 
W=Tryptophan, Y=Tyrosine, . 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


10933 


24834 


A 


11023 


180 


353 


IFCFCIFSRDGVSVC*PGWSQSPELMIH 
*NTKI S * AWORVPKI PATOEAEAGESLE 
PG 


10934 


24835 


A 


11024 


155 


1 


PQARRGGTCLQS* LLRRLRLRQEDRLSP 
GV*GYSEL*SCPCTPAWETERLV 


10935 

• 


24836 

■ 


A 


11025 

- 


22 


325 


KLILL YDPQLTF I PLNTRAWAHCAFCC 

WPTKFVPflKTYOFVP^rTjPTPI PVYSOKA 
Ft it mr v c ivjr v jt o uur irn v a juiui 

DLFFKPICKKKKKK2QCKNKKKKKK* KKGG . 
PGIiKN^ WGGPI FWGFP K 


10936 


24837 


A 


11027 


127 


259 


GQVQWLTPVI PTLWEAMVGGSTKLRSS R 
PFFSTP * PCI VMIIjYG 


10937 


24838 , 


A 


11028 


.37 


408 


IASGALFFFFKAAGKRGDFKTEGAYQRQ 
RTI FKNKKRARRE KTGRENLR.GNYKNMG 
RGLKTPRGALGGPYLDKKGPLRGKGPFQ 

RGTT .T.GGGP If T K T DGTT .* TPPNY UP V T P 

RYNRFKXRQKNM 




24835* 




1 IvZy 




5 


T.T .\7T .¥f FiT .P R T MVP CI PTY T YI YT Yr. VT Y 

I YI YTHTHTHT FIT* VLFC 


10939 


24840 


A 


11030 


268 


435 


LMCLRNETN*F*KYE*WIjGAGAHTCNPK 
TLGG*GGRIA*GQEFETRLRNIVRP 


i f\r\ a r\ 

10940 


O AO A 1 

24841 


.A 




TOO 

322 


2 


VCL* S Q PLRRLRQENHSNPGQCTSAWVT 
HTKRMSPTKADTKSTHCGGGRV 


10941 


24842 


A 


11032 


251 


380 


HKPJjHTI YYFVPLK2CKMGPGLVGHTYNP 


10942 


24843 


A 


11033 


64 


324 


FWFFFFFFF FFLNP PGGGKKKKKP PPPL 
GKKKKRGKKGGEKFFSPPPRGGGGGPKK 
IS LKPVIiKF F*NPPGGTKKGGGP FLIP P 
GSP 


10943 


24844 


A 


11034 


420 

■ 


534 


CP G WAHAYNPS TLG S * GRQ I T I DRE I B 
TILANMAKPH 


10944 


24845 


A 


11035 


196 


1 


KNGHSLPPGPQKGTPFPKKKKTPKINLS 
WIN*GQEFE 


10945 


24846 


A 


11036 


10 


248 

i 

i 


PSDR*LFSTNHKDIGTLYIiLFGA*AGVL 
vj x rrxiO xjxj x rv-r\ c» i Ajyruri ui-KJFi unj. in v x 
VTAHAFVIIF 


\f\QAA. 
JU>W 

1 








JZ 


/IAS 


DYVSKRKEKSEKKRNVILETSISSHLVE 
WMLCSRYHPLIKKRSSVRILIALLCPQK 
VRDMS*GIGSKKQCWDSWIiSIWIKMESD 
PFLIPYPKINA*RLKDLCERINLKI I IK 
REYLHDYRVKKV 


100/17 

ivy*/ 


0A9A5L 






JO / 


*rf 0) 


KNPRGFFGKNPFFWGGP FGGPPPPKKMG 
FGEKKKF* FKRSKKKKKKKKKKKKKKKK 
KKKTG 






A 


11 070 


707 




RFVCSTIKVLPJDLSSDRSNPGRVLSTSN 
SSLK*KKK 


10949 


24850 


A 


11040 


213 


403 


VHRGI KYYFLNKLPDFPRQTFKK*G*RL 
GAVAHACDPSTIjGG * GGWITLGQTFETS 
LTRMGKS 


10950 


24851 


A 


11041 


104 


2 


SAFFF * ETES CS VAQAGVQ WCDLS S LQL 
PPPGFK 


10951 


24852 


A 


11042 


177 


3 


AYTAY I C VC IHI YLYK* MYIC INVHVCK 
ITYLYIYKYLKYNAIYLHIGFFSFLHTI 
S 


10952 


24853 


A 


11043 


314 


395 


TVYHjNLS INSVQSVLLSVFHRIiRHQPR ! 
GRVQWLTPV I PALWEAQAGGS P + VRI SQ 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Fnenylalanine, 
ir-triycine, Jtf~Hi5tidme, I— isoleucJne, 
. K=Lysine, L=Leucine, M=Methionine, 
N~Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serinc, 
T=Threonine, V=Vallne, 
W=Tryptopnan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














IKTPA 


10953 

• 


24854 


A 

> 


11044 


310 


403 


RGKGGKKPRNPGGGKS *ALTKKKKKKKK 
KKKKKKKKPGGALKKKPRGGQKKTGGGK 
KKFLPKRGAKKKPPGDFGKKNQIGGGEK 
RGKTP P KKKKP * GKKKNLKGKGG KKTP K 
SGGGKKFPPRW* KKKLPPGO 


10954 


.24855 


A 


11046 


181 


3 


EGPLSLSLIiFFFF*RQISSLLLRIiKYSG 
TIIAHCS FKFPSSRNPPTSASQAASTTG 
VHH 


10955 


24856 

i 


A . 


11047 


392 


41 


KESRSLSQGGREKGPFKFLAPPAPKFKR 
IFRPHPLKKLGPKGPPPSPS* IL3LKKK 
GGPPILARWFWNSCPQGFPPPGPPKRLG 
FKGGPPPPPLF*KKKPPFLGGENQKVKN 
FFFFF 


10956 


24857 


A 


11048 


147 


307 


TVI FVFIiLRKGLTLLPRLESGMi MWHtLK 
S* PPGLKQSSCHSLLSSWDYRRRRG 


10957 


24858 


A 


11049 


118 


I 


HGKIPSPLQ*HK* IAGYGSACLQSQLLR 
RLRWBSRSISI 


10958 


24859 


A 


11050 


54 


284 


RKI RRGGLHIjWSNLLGRIiRWBDL LGRLR 
WEDRLS PRRGGCS E PRLCHCP PAWATE * 
DPVSKKKKKS LFVEKPQGGG 


10959 


24860 

r 


A 


11051 


130 


340 

■r 


HNMHFAAHGSRINFDFFFFFFFFERGFP 
FCPPVGRAGTHFGLLEPFPSRV* KQYYW 

V Jilf ■ ■ i -tf"iL TTTTi*lltn..ril ■ 1 ■ -ft- ^1 W *\ 

LGTVAHVCNPSTLGGHVGES *GQEFKIS 
LANIVK 


10960 


24861 


A 


11052 


140 


337 


NI IMFFFFFFETVSFLLLGLEGKGTIWV 
N*NLCLPG*GDSPLGCS 


10961 


24862 


A 


11053 


197 


428 


IRSINNPTVASQYSSEWKSHI PLILNQN 
LDMIKFSEEGMLKAKIGLLRQTVGQWN 
AKEKFLKKNQSATPLS I * IRK 


10962 


24863 


A 


11055 


180 


1 


AI I ALLYSRLGDYARRFHLRRKKEQWQG 
MVTHVYNPSTLGGQGGRTA*TQEFKSSL , 
GNIA 


10963 


24864 


A 


11056 


158 


426 


FFPLPIiPLPTVSIjFPRSPSlJftEPKLiDCT: 
AAISAHCNLPA+FSCLSLPSACNCRRAP 
PRIiTASASRGAGIADGVSFTQCSMVPRL 
ECSGV 


10964 


24865 


A 


11057 


220 


421 


YALHLNMKNNS YF * MRKKKKKKKKKKKK 
KKKKKKKKQNKKDPGGAVYKKIP 


10965 


24866 


A 


11058 


162 


464 


SCSVGLKLFSMKTSIALSYYCTjTiTiTiLTL 
HYYYYYYYYY*SLVFM 


10966 


24867 


A 


11059 


1 


134 


APENRVD PRVRKTIiVPL I LP 1 1 TTLANP 
CKKD* YP YYVKIS IAC 


10967 


24868 


A 

* 


11060 


382 


273 


SSCL* PQLLRRLRQEDYLS*GA*GCNEL 
* *CHCSPAWVTEQDPVSKNIHTYIHSQS 
INYGSMIGYIHEQKGIIADHKPIIAEPE 
DSTYPRDHT 


10968 


24869 


A 


11061 

• 


298 


378 


SCSASPCCSSRHGWSPCG*QTCTiTiIiAIA 

T DPT TitiTt TiTT 7\ T UTV TV^ POT 17TYEX'71MT O 

AHKAHQLVIDTYQEIGENYIPQDQKNSF 
LLESHTSFCFSDSITTPSNMEETQQKSN 
LKLLRISLLLIETWLEPVRFLTRIVANN 


10969 


24870 


A 


11062 


39 


486 


RPTRPDAYHIVKPSP* PLTGALSALIjMT 
SGLGMRFHFHS ITLLILGLLTNTLTI YQ 
* WRDVTRES TYQGHHTPPGQKGLQYRI M 
LF I TSEVFFFAGFF * AFYHSSIAPTPQL 
GGHWPPTGITPLNPLEVPLLNTFGILA1 
GGS IT*AHH 
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nucleotide 
sequence 
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SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 

• 


SEQID 
NO: In 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cystelne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GJycine, H=Histidine, I=Isoleucine, 
KpLysine, L=Leucine, M=Metbiomne, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonlne, V^Vahne, 
W=Tryptophan, Y^yrosine, 
X=Un known, *=£top codon, /^possible 
nucleotide deletion, V=possibie 
nucleotide insertion 


10970 


24871 


A 


11063 


269 


429 


ATWQ KL PLQ I Q KNY PGLGLVAHACNP S T 
LGGQGGRIT*GQEFLNQPANMVNTH 


10971 


24872 


A 


11064 

• 


248 


65 


IIEGQAQWLMPVIPALWKAEMGSPEVG 
SSKPA* PCDPPASASQSAGITGVSHRTQ 
LILNS 


10972 


24873 


A 


11065 


176 


1 


LFPPPKKVPPWGNTQFKSICLYTTNFFF 
FETESRSVARLECRGTISPHCKF*LPGS 
RE 


10973 


24874 


A 


11066 

i 


2 


397 


MVLWTAHLRAMAPG SRTWGLLTFALLCL 
PWLQEAGAVQTVPISRLCDHAMLQAHRA 
HQLAIDTYQEFKETYIPKDHKYSLLNDS 
QTS FCDSDS I PTP YNMEETQQKTNLEVIi 
RMSLLE IDS *LRPAQSNKR 


10974 


24875 


A 


11067 


172 


399 


QILYL* KYAYTVTMQHLSHP Ili* KKKKK 
KKKKKKKKKKKKKKKKKKKKTGG 


10975 


24876 


A 


11068 


192 


3 


PKLTPMLKLIVPHI ILLPLT* LSKKHI I 
* INTTTHS I I IS I I PLLFFNQI TSNLFS 
CSPTFC 


10976 . 


24877 


A 


11069 


204 


345 


DKSGEHSKTSS I Q KNLRLGAVTHTCNPR 
TLGGQGRWTT* SQEFQTSP 


10977 


24878 


A 


11070 


3 


416 


HELPQPLRVLWTAHI^GATAAGSRTSLLL 
DFALLCLPWVQEDGAGQTVP * SRLFDHA 
MLQAHRSHRLGIDTYQEWETYIPEDRK 
LS FPDDCHT YFCF * HS I PTPSHLGETLL 
TSNLBLLRICLVLIDSWLEPARILTS 


10978 


24879 


A 


11071 


108 


2 


PSPFFFETESCSVT*AGVLWRDLCSLQA 
PLPGSSC 


10979 


24880 




11072 


400 


278 


KVK* LRQENQWNSRGRDCSE PRSRHGTL 
PWTTEQDS IPKNKTKPYKIS 


10980 


24881 


A 


11073 


139 


3 


KLAGCGRAHLS FQLLRRLRREDHLSPGG 
*GYSEPWSRYCTPASRA 


10981 


24882 


A 


11074 


17 


421 


DHVIGQPRWUyiPVILALWEAEPGGLLEP 
KVQDQPGQNCKSLSLLTIKKKRRENFPG 
RGGTQLCS Q LLRRLRVGCC I D PKLHNCL 
PAWMTERDPTFKRE * KKNDVLEAKNFFT 
ILTFMNTTKEKMVAQRIGSLFHG 


10982 

* • 


24883 


A 


11075 


412 


1 


QFFFLGVCC PTQ I FGVGKKGE KKKKKRG 
FPPRVFFFFNPQIFSPFFFPGPFFFLGK 
FPPFFGERFLFFSKS KFFFPGVFKRGFP 
FPPKNFFFF*RIF*KFWFSPPKPFFFFF 
FFF*DRASLCHPGWSTVAQS*LTSC 


10983 


24884 


A 


11076 


279 


410 


VFCCVAGRDGAPGVKAHFPAAQKPWDGG 
FPGP * SAGVGPPPI PR 


10984 


24885 


A 


11077 


12 


386 


IAHLLLFS FYHKDTGTLYIiLFGAGAGVL 
STALSLLI RAELGQPG ILVGNDHI YNVI 
VT AHAFG 1 1 FF 1 1 1 P 1 1 IGC YGN * LATL 
I IGAPDMAFPRINNI SFGLLPTSLLLLL 
AYAI VDAGARTGW 


10985 


24886 


*> 
A 


H078 


412 


3 


LAI LSLS LS LGF P YS LR YKNI KIRPINN 
PTMASKCSSKRKTRTSFTLNQNLEMLKL 
IEKSTAKAKRLKVRPLVPVSQWNAEEK 
FLKEIKSAIPLNIQMIR*QTSLIADMEK 
VSVL* IKDPTSYNI PLSQS* IHSK 


10986 


24887 


A 


H079 


78 


443 


RHRLPSDTCLTAAVP * PGKGSKKKKKKK 
KKKKKKKKKKKKKKKKKKKGGGGFKKNS 
GGGO^JFAGGKKIFFFFFGGVFKWPLWIF 
WKKTFFWGGKISGQLSQKKLSLWGEKKI 
FWGOGGETSF 
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SEQID ' 
NO: of 
nucicuiiuc 
sequence 1 


SEQID 
NO: of 

pcpUUc 

sequence 


M 

etb 

uu 


SEQID 
NO: in . 
I ISSN 

09/515,1 
26 


Predicted 
beginning 

location 
correspond 
ing to first 
amino acid 

■ colli lie UI 

peptide 
sequence 


Predict- 
ed end 

otide 
location 
correspon 
ding to 

loci' ominA 

add 

residue of 

pep LI UK 

sequence 


Amino acid sequence (A B Alanine 
OCysteine, D=Aspartic Acid, 
K=Glutamic Aciri F=Phenvlalanlne. 
G=Glycine, H=Histidine, l=Isoleudne, 
K=Lysine, L=L£ucine, M=Metbionine, . 
N-Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S^Serine, 
Tn=Thrennine V=VnIine. 

M 1 111 VUlilllC) T V BIIUv) 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, ^possible 

n nrlpni'iH p ffolptSnn V=nnK<rihlA 
iiuttcuuuc uviciilmj, v^pu&aiifKJ 

nucleotide insertion 


1 AQQ7 


i 




I l\JoU 






KRVPKXXPKKNS P PGGKKGQS PP* SPP F 
GGPDQPIF*VGGFDPPWLTPENPFFFKK 
KNN* PGKGGMPLILKSLGGWGKKKIFTP 
GGGLPKN* KETP PPPPGGKKKI PFPKKK 
KKKKKRAAARDLELVPTRP 


1 AOft ft 


Z4oo7 


a 


l iuoi 




1R7 


TVSTTKCFSLLIPYYGRKDME*KI * I IO 
KKKKKKKKXKKKKKKKKKKKKKKKKRGG 
PP 


10989 


24890 


A 


1 1082 


173 


3 


NNVI CY I LY I PS VPKYKTTLNMRGS AFF 
FFFFFI F*DGVSIiCHIiGWSAVARSWIiTA | 






TV 


1 IflQl 

1 l\)oi 




^01 


SVAHACNPC3TLGG*GEOITRGWEFETSL 
TNMEKARL 


10991 


24892 


A 


11084 


257 


3 


IKKLREVNMLGWLYYMRBBARPVDYVHW 
RGPEYS SLTKTIRNVXVKGHQHY* KAS A 


10992 


24893 


A 


11085 


42 


171 


ARRMRAGIHRVSQEGLDLLTS * SAHLGL | 
PKCWEDRREPLRLA 


i Anno 

10993 


24894 


A 




170 




T7 VD Q V Ktf TTW WW TfTf WTf If Vtf tfPftft&T . 
r iVKO rv.xS.iS. 2VTvT\_IVIVI\. AJ\jr\J\JvIv^J\JvI\XvV3VXnJ_l 

LKKF * FKRSKKKKKKKKKKKKKKKKKKK 


10994 


24895 


A 


11087 


67 


379 


KKKKKRKKPKTKKRTGPVRQNVRWGRSL 
S PGRAPAEG PQGGS TES P PLLVGRCLIL 
GLRE PGVP PAAYGR FLRD YNNS I QGKLE 
KDRPO * GLPSLTGAAELI KH 


10995 


24896 


A 


11088 


128 


326 


KKKS GGGVFF F I IGGGGHFFGGGRFFFF 
FFLGGFFFKLFFFF*EKFFFLGGNLVKH 
SSKKSSWVGW 


10996 


24897 


A 


1 1 ADO 


1 AO 

lily 




QEFBTSL 




Z4oVo 


A 


1 1 AQA 
I lUVU 


<*> 


J*W 


NNTHFLFLFLSIiFFFFLKQSIiALPPRLK 
ITGLAHPPRLILFF 








1 1 AQ1 


1ft£ 


4ft 


IHl^PPLOICTIiKFFTNKNPHLFLTGCP 
KYALIFLCPGALVKTGFHHVGQAGLQLV 
PLCPDRLGLPKCSD * RRE P PHPAKKQFF 
IDRVMLCYPGWCPALGINPSSPVOLPKR 


10999 


24900 


A . 


11092 


164 


3 


VYSLLLRCPFS*ALTAKFINRCTYIHVC 
MYMCEPYAYTCLYTSTHIHVCMYIY 


11000 


24901 


A 


11093 


148 


325 


CLKLQPFGLGWAHACKPSTYFISYQ*L 
SPVI PALWEAKTGGSPEVRSSRPARP 


11001 


24902 


A 


11094 


162 


3 


LLGLKRP I FFFWGGKFFFFFFFFFE IBS 
PSVAQAGVQW* CNLGSLQPASGRV 


11002 


24903 


A. 


11095 


259 


358 


LRPGTVADTCN* GQARWLI PVI PALWEA 
KAGGS LEVGS LRLAW P 


11003 


24904 


A 

■ 


11096 


254 


2 


SWGFI PMEERERENRYWDRQQGHATVSN j 
IVINAFIiYRYTEREKPGVSLSPRVECSQ 
MIVAHCNLSLVGS S HS PAAAS * VAGTTG 


1 1004. 

1 J Uv/*T 




A 


1 1097 

1 11/7 / 






KKKKRQPRWAHACNPSTLGGCX3GWIT* 
GQKFRTS LVNMVKP 


11005 


24906 


A 


11098 


2 

» ■ 


455 


LNIjEQLI EPLEVLWTAHLEALDPSSRTF"^ 

VLLAFALL*LPWLQEAGAVQTVPLSRLF 

DHAMLQAHRAHQIiAIDTYQEFEETYIP* 

DQKYSFLHDSQTYFCFSDSITTPYSMBE 

TQQKSNLELLRIYLLLIESWLEPVQFLM 

SMFGWWLEYDT 


11006 

• 


24907 


A 


11099 


2 


375 . 


QLPEPLMDLWTGDLVAMAPGSRTSLLLA 
FVLLWLP * LQE aGAGQTVPVSRIiFDHAM 
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SEQ ID 
NO: of 
nucleotide 
sequence 

■ 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

- 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E>Glutamic Acid, F«Phenylalanine, 
G»Glycine, H=Histidine, l=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prolfne, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=*Tryptophan, Y^Tyrosine, 
X=Unkno wn, *=Stop codon, possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LiQAHRAHHIAIDTYQEFEBTYIPKDQKY 
SLLHDSQTSF* FSDYTPTPCSMEETQQK 
SNVELLR ICLLV 


11007 


24908 


A 


11100 


1 


359 


LQLPKPLTVLGTAHLPAMCLAYATALIL 
GCALLCIiA* VE * ARAAQTVPQSNF*DHA 
MLQAHRAHQLAIDTYHEFBETYI PQDQK 
YS FLHDYQTS FCLS YS IATPSNMEJ3TQQ 
KSKLELL 


11008 


24909 

mm • m* w m* 


A 


11101 

1 


1 


370 


LPEPLTDLWTAHLVPMAPGSRTSIiLLAF 
ALLCLPWLQQAGADQTVP* SWLFDRAML 
QAHRAHQLAIRTYQEFEETYI PKDQKDS 
FLHDSQTSFCF * DS IATPSNMGETPQKS 
NLELLPISLLL 


11009 

m A V V ^ 

■ 


24910 

mm » +f m V 


A 


11102 


2 

mm 


361 

mm \m m 


AHLLAMAPQYRTYLLLAFALLCLPWLQE 
AGAVQTVP*SRLFDHAMLQAHRAHQLAI 
DTYQEFEETYI PKDQKYSFLHDSQTSFC 
FSDS ITTPCNMEETQQKSNL*LIRICLL 
LNESWMEP 


11010 


24911 


A 


11103 


8 


366 


PLRALWTAHLAARAPGSRTSIjIiLAIALL 
CLP WLQEAGA* QTVP I SRLLDHAMLQAH 
RAHQLP I DTYHE FE KTY I P KDHKHS FLH 
DSQTSFCF * DS I ATPSNMDETQQKSNLE 
LLRIFLL 


11011 


24912 

• 


A 


11104 


3 

• 


341 


EPLRDLRTAHLVAMAPGSRTYMLLAFAL 
LCLP WL * SAGAAQTAPL FRL FDHAMLQA 
HRAHQLYI DTYQEFEETYI PQDQKYS FL 
HDSQTSFCFSDS ISTPCNMEETHQRSNL 
E 


11012 


24913 


A 


11105 


240 


329 


RNWPGAEAHTYYP STLGGRGRQ I T*GRG 
FE 


11013 


24914 


A 


11106 


2 


316 


HEERERERERERE S AI VRQ IHTQGEGLT 
EIiVFP YRRPAPS GLKFTFVYVQEHILG V 
GGNGLS PLQ IGVI LRDGRGLAH I RCGTR 
NKILRILKSQGRAPDLS*DLY , 


11014 


24915 


A 


11107 


169 


1 


EDGKHSGLHS KQQ KQ PAWPSMVTHTGNP 
NTLGGRGGWIT*GQEFETSLTNMEKPCL 


11015 


24916 


A 


11108 


192 


355 


GMSLTGHSNY I ILNLNFLWPCWAHTCN 
PMTLGGQGGRFT *GQEFENNLAYRGK 


11016 

♦ 


24917 


A 


11109 


145 


383 


EVNKDKGKYHVSQGIQSFSYVAYVYFRK 
LIYTMMTIVNSALLYTELVICQGRRVLV 
FDFGF*FWFFETESCSVTQAGEQ*CDLG 


11017 


24918 


A 


11110 


199 


400 


TRAGFFPPFKKFLKGPGPGAPPFNLNFL 
GGRGGGFF*IPKKKIYPLIjLPFLKVQKT 
PSKPP FFFFFF FLRQVLLCCPGWSAMAR 
SRLTRA 


11018 


24919 


A 


11111 


373 


177 


QFETSLGTMSGPSFYLKN*KSARCGHTC 
MRFQLI*QRQSREDHLSSGVHRCSNP*LH 
RCTTTWVTK 


11019 


24920 


A 


11112 


204 


399 


NHFWDWYHFSS PFFFFFFEKNLCSVPR 
AEDQGMNLS *LNPLPPGLKQFFA* PSQE 
AGXIGPGHH 


11020 

« 


24921 


A 


11113 


192 


16 


KKTENNKR*RGYGETETPAHCGRACKIiG 
*PLWKPV*QFLKKLKMKLPYDPTIPLPA 
IFS 


U021 


24922 


A 


11114 


113 


369 


AGGQG PNL FLQKPG P PGLKG P P PLPLPG 
GGNYGGGPPSRGIF+FFEKKGFGGVPKV 
GPNLRG* RDJJPAP PS KGAHPLGNGFFSR 

N ! 
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NO: of 
nucleotide 
sequence 1 
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SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

■ 
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Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nude" 
otide 
location 
correspon 
ding.to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine . 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
Glycine, H=Histidine, Hsoteudne, 
K-=Lysine, L=Leucine, M=Methlonlne, 
N=Asparagine, P=Prollne, 
Q=GIutamine, R-Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y°Tyrosine, 
X=-Unknown, *= s Stop codon, /"possible 
nucleotide deletion, \=possJble 
nucleotide insertion 


11022 


24923 


A 


11115 


250 


387 


ILCDVEHSLWPGVEVHAYNPSTLGGQDG 
RlA*AQEPETSLNNIARL 


11023 


24924 


A 


11116 

« 


192 


1 


VCDLFWKPSPHALTRRRTSVFHIiKR*F3 
WLGAVAHTCNPSTIiGQ*QPWRIT*AQEP 
ETSLGNMV 


1 1024 


24925 


A 


11117 
i 1 1 1 / 


289 


202 


VFYCQ* KFIRKKKKKKKKKKXKKKKKKK ■ 
KKLSKKIKKMDQQKLRSPTIK 


11025 


24926 


A 


11118 


412 


421 


RA* PTEQVTIiG ITEQS YSRGH INNRVHD 
LDVGS *HPDGAAAI KGS FVQRLKGY 


11026 

■ 


24927 


A 


11119 

■ 


16 


472 


DRLKPSATHAFADAYHIGKANP*PLTGA 

UU X 1 J I H I X k3 V7JL1 -L 1*1 V nuciis X A III* X * TTI f 1 1 X 

NTLTIYQRWREVTRESTYQGQHTPPVQK 
GLRYGI ILFITSEALFFAGFF* APYHSS 
LAPTPOI^GHWPPTGITPIiNPLEDPLLM 
TSVLLASRVSIT 


1 1097 

4 


94.Q7R 




1 1 \ 0(l 




34A 


GLWHATGITP*PLTGSF*xiLlJ»iTSSIiAM 
*FHYHSITLLILSLLSSTLTIYH*WREV 
TRES TYQGHHT P P VQKGLR YG IILFITS 
E&VFx^FC*AFYHSSLAPTPQL*GLWH 
ATGITPLNPL 


11028 


24929 


A 


11121 


174 


420 


FISALLKSAIQRNNAMFSTYYALTCTVG 
TAVSWSSVS* PQRNMMMS * VHAPAMAGR 
LVCHYIRVYRRFEHVCKRAVHGQYP 


1 1029 


24930 


A 


11122 


299 


466 

• 


I FILHE ICIT FFFETRHFCH * AEVHWCS 
LGLLKPPPPGLRCDPSTSPSRVAGTTRPS 


11030 


24931 


A 


11123 


138 


1 


NPFNEKNPPPKGPGPKNAFFFFFFEMBS 
CS VAQARVQ * HDLG S LQ P 


11031 


24932 


A 


11124 


241 


499 


NQRERS *HLYLL YNTVCLTHGLPILI I Y 
SAYOLAI YDLLLRRDALARLOADLADRR 
RGLQPKKRFAFKTRRKDAASSTKVDAAP 
GI 


1 1032 


24933 


A 


11125 

111 CtJ 


252 


391 


PxjSLDLQSNASLSHFTSPPLMFADR*LP 
STNHKN I * TL YLLFGAWA 


11033 


24934 


A- 


11126 


45 


459 


AFLKNHPIxET*SNHSFIDLPTLSNISA* 
GNFG * LLGACL I LQ I TTX3LFLAMHYS PD 
ASTAFSS IAHI TRDVNYG * I IRYLHANG 
ASIPPICLFLHSGRGLSYGAPLYSQT*N 
IGIILLLATIATAFIGYVLP*GQISF 


11034 


24935 


A 


11127 


109 


301 


QDSLMRKKS KNRS PS HER* RS KCLERKR 
SGDKKKKKGGPPKKKIGGFPFGGQMELK 
PFSILGGF 


11035 


24936 


A 


11128 


322 


2 


S CHHVS S LG DRAR P CLQKE KKKE RKKVG 
DFNTPLS I MDGYQHWYQARKI I IMDGTT 
K*KITRK* KIRNQ * DLTDVYRTLHPRVK 
YTFLS CAYGTFSRTHRIRGQAI 


11036 


24937 


A 


11129 


182 


2 


FLPRGFQPPPVWPGPLGFLFPPPSSRD 
PGFPSMPFFFFFFFETESCSVS*AGEQW 
RNLG 


11037 


24938 


A 


11130 


138 


1 


GWVFFLWFSFSFLFFFLR*GLSLSPTL 
ECSGAITAHCSLKliPGLS 


11038 


24939 


A 


11131 


304 


83 


WKKTDFPRKGRNS AS RLPSDSNCNSS F»S 
LQGLSLLYRFWTYHAVTGKGSQSRPQEQ 
VLGYHARKNSR*VHRVK 


11039 


24940 


A 


11132 


247 


225 

■ 


KNFWGGPPLRGGKKNFFFFFGGGYKK*L 
LLCLVNMYI*CV*YI*HKRIISDP!CKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKK^ 
K 


11040 


24941 


A 


11133 


242 


3 


KKKKIFFPPRYFWAPPYFFFLGPPPFFF 
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SEQID 
NO: Of 

_ ■ _ .At J 

nucleotide 
sequence 

> 


SEQID 
NO: of 

„ A* J — 

peptide 
sequence 
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M 

eth 

od 

- 


SEQID 
NO: In 

1 tCC TV! 

09/515,1 

I/O 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

* 


Predict- 
ed end 
nude- * 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
H=Lviutamic Acid, r— rnenyiaiamne, 
G=Glycine, H=Hislidine, Msoleucine, 
k.— Lysine, Lr- Jueu cine, ivi— lYieinionine, 
N=Asparagine, P^Proline, 
Q^GIutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y==Tyrosine, 
X^Unknown, *=Stop codon,/=possiDle 
nucleotide deletion, ^possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFVF*TQFHjjS 
PRIiECSGTISAHYNLHLPGPSDC 


11041 


24942 


A 


11134 

- 


361 


2 


ASQS AG I TGVS HRTRPTHGFKDT*RKPA 
EPEVTIH*I*ITLMSCWIBSPEKVCADL 
IRGAKEKNIiKVKRPAGTP J KAS KXTTKK 
TPCEGSNTRDRFQMRIHKWLTDLHSPFD 
GRVGGRV 


11042 


24943 


A 


11135 


1 


370 


IRQTFFQKRE I VGI I LCYWNVGYYSQIiH 
*SSRMLWIFHFYRVYISVKVILVILVPH 
EKKKKKKKKKKKKKKKKKKD 


11043 


24944 


A 


11136 


273 


2 


GGRAGQQILFCRGIWLGPPRSCQCTFRS 
PPGLAL*PWECFSIiPPALKIRFPSFIiPF 
FF^MEFSRLSPRLEOIGVTVAHCNIjRLP 
GSSNFP 


11044 


24945 


A 


11137 


130 


354 


GQLNKLSGPY PEKVGYTLP VL INPLAQP 
VTYSTI FAGTLI TALSSH* FFT* VGLEI 
NMLAF I P VLTKKKKKGGPL 


11045 


24946 


A 


11138 


366 


2 


LVEMGFLRVRQNGLYLLTS*SARLGLSK 
CWDYRREPPCPASDWVFILTSPLIHALD 
GKEHTHTHTHTHTHTGLGICQSSLGKQS 
GGWGWLSANRGQFS PFAVCLWS FLPEV 
PWTSALFT 


11046 


24947 


A 


11139 


228 


1 


CRAGRGLPAAVLI ILFPPLLI PTSKYLI 
NNRL ITTQQ* L I KLTS KQMIAI P5JTKGR 

■n i mm wmW 4ft> mr m*mm\ mW mm ^mw mm m± I W <V ^"tj 

T*SIiIliVSLI IFKATTNLLG 


11047 

• 


24948 


A 


11140 

■ 


157 


337 


HIPTPPHTTCPERPS I RHNS I YYLKRYC 
LiLR 1 1 LSLLPLQTS P * PLTGALSLLLMT 
SGIiSM* SHLHS ITLLALRLLTYTLTI YQ 
* WRDVSRHSTYQRHHTPPVQKGLRYGI I 

• 

VFTTSKDI VFSGLF * AFYHSRLAPTTQIi 
♦CHWPPSGITPLN 


31048 


24949 


A 


« 4 4 A * 

11141 


188 


3 


I PT PGG P PPLP PQKvIiFKPGNPRAPPGV 
FFFFFETDSRSWRLECSGVI *AH*NLH 


11049 


24950 


A 


11142 


224 


3 


SVSSPCISNTHKVDVRSSALLQESTSSR 
EVMTGHFHYLI I YET* LRSLAQSARLAC 
SGT I SAHCNLRLPGS SN 


11050 


0K A\ £\ AW « 

24951 


A 


11143 


262 


371 


S x CGPGVVAnAYNPS TIX3GRGRR IT*GQ 
EFETSLTN 


11051 


24952 


A 


11144 


244 


379 


TIGKQRCHAGIEEVSDSSQAPWLKPVIP 
ALi* KAAAGUoijisPKbljK 


11052 


24953 


A 


11145 


1397 


1565 


DRLESLLEMHIPGVYPNQWNTNFYLFIY 
rEAESHSVAJyTGIjQ*RHijGoIjybPrPQV 


11053 


24954 

ft 


A 


U146 


284 


1 


GRGRQGGDS SGKGEGNRAMDI TGGLVGK 
GQGAGEAFVEWEKELRGFKVEGEGDMD 
KERWD*GL*GGRERERERERERERERE 

UWlIJODOTJOr TT 

RSRarCiSKao uv 


11054 


24955 


A 


U147 


64 


286 


THALAWTRNQR I GSGHGAMCIi YSHLIiRK 
KPTAKAI FFNLKKKDS WD 


11055 


24956 


A 


11148 


230 


358 


VSVSLII FIATTNLLGLLPYS FTPTfQS - 
SINLSMAIPL*AGAV 


11056 


24957 


A 


11149 


81 

• 


367 


QTLRTKMNKNWASIilDPTIIiGLPAAEL 
IILFPPLLIPSCKDVINNRLMTTQQ*LI 
ELTWEQM IT IHNTKGRT* CLI IiECLIPW 
IATTKLLGLLP 


11057 


24958 


A 


11150 


257 


2 


SVCAVI TKYLRLGCFADAADTQRPliPPA 
TVKLTMFFNIATHGEPIiGCIF* LFVDKV 
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seq n> 

NO: of 

nucleotide 

sequence 

. > 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etti 

od 


SEQW 
NO: in 
USSN 
09/515,1 
26 

* 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D»Aspartic Add, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leutine, M=Methlonioe, . 
N=Asparagine, P^=Proline, 
Q=Glutamine, R^Arginine, S^Serine, 
"^Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, Impossible 
nucleotide insertion 














PKTAENFHALSTGEK* FSYKGSCFHRIL 
P 


U058 


24959 


A 


11151 


257 


1 


WRCGEKETLIRCWWBCKLVQSLGRTVWR 
FVRELKLELPFNPAIPLLGIHPKEKK*L 
DPCTCMFIVALVTIAKMWNQPKCPSMDL 
V 


11059 


,24960 


A 


11152 


17 


367 


NHKDGKKTEEKKQSTS PPQKDCSSSPAM 
EQSWTENDFDELREEGFRRSNYELQEBI 
QTKGKEVKNFEKN* DECITRI TNTENCL 
KELMELKAKARBLCEECRSIiS SRCDOLB 
ERVSA 


11060 


24961 , 


A 


11153 


213 


2 


TFCHFFFFFETESGSVAQPGVRWCDIjGS 
LOAPLPGFTPFSWRORLQ*AEIVPLHSR 
LKLHFKKKAGVFSC 




940*9 


A 

m\*\ 


1 1 154 
1 1 I J** 


i 
j 


14.7 


GTSNHFP FL YTYOVHNSTD YPRNRCCLN 
PSLRFHTSS KPLPAR PHIWTHQS HAYH I 
VKPS P * PLTGALLALLMTSGLAM * FHFH 
S ITLL I LCLLTNTLS I YQ*WRDVTREST 
YQG 


11062 


24963 


A 


11155 


223 


3 


YSTNLKI LRNHF I LFF * DR VLLCLPGW S 
AVASSRLTANSAS CLS LPS S WDYRRAP P 
HPANFCTDRFHHVGQVG 


11063 


24964 


A 


11156 


143 


1 


RGFLNQCPSQNFPFFFFETESCSVTQAG 
MKWYDLS *MQPPPPRFKLV 


11064 


24965 


A 


11157 


149 


355 
j j j 


LQPLP PGFKRFS CLRLLTSWD YRKDVDS 

mmm v£ smmr *p v^to ■ ^m*mmmwmM PNP* ▼ m mmw mm • va vir w mmw mmw 

ALLSNYVT* KYI SKTLCRHQS PE I VRE F 
LTAMKSHKLTKVG 


1 1065 


24066 


A 


11158 

1 1 I JO 


156 

1 Jvl 


370 


RSDGCOKIPRNMLGGWSVLHHTVDEQ*G 
PNRY I KKCSASL I I REMHI INTVRRCLT 
PATMATVRRKTTSVG 


1 1066 


24967 


A 


11159 


298 


4 


HFQWIMWRQLVKGRIfQAQKNAAFTDSH 
THLCYLKCVYFCVCVCMCICVFVCV*SR 
AL* *SLYLFLCI KAVKAGHSVSGLESQH 
YGRLRQADHLRSGV 




24968 


A 


1 1160 


364 


285 


NSWGKIKILKGK*PPPPKKIFFPKYPQS 

mm* *imr ¥ V **mw m»**mM» mWt mm* nmmrn mt W+imJ mm mm mm m 4 ^mm mm mm mm m> m, mm mm 

FFLSPP*KKKTNFPHPRKYLSPPGIFLK 
APPPLI FFFFFFFFFFFFFFFFFWG 


11068 


24969 


A 


11161 


198 


406 


KSVYMTLKKKRGSFHERGDVPAICNFFF 
* IHIYMCVYIHICVYIYIYTHI YIYTHT 
QIYIYIYIYIYIY 


11069 


24970 


A 


11162 


95 

* 


376 


KIITKHNIARTNPYTFCIMN*LEITLQG 
EPKLRPPKPDKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKRGGGGFKKKPGGGP 
KIRGGEKIFFFFLGGD 


11070 


24971 


A 

* 


11163 


343 


391 


WR* LS STNHNDIIiTLYXjIiFRARAGVIiRS 
AVS LL I LAEQGQPGNLLGDDHI YSAI DP 
AHAFAIMFFI VIPMI I GGFDN * LLAVRN 
GAPEMAI PHINNI S C* RLPPSLLLLLAS 
AIVEAGAGTG*TAYPPLE 


11071 


24972 


A 


11164 


1 


158 


QPMTPNRGPLSP* PLTGALSALLKTSGL 
AM*FHFHSITLLILGLLTNTLTIYQ*WR 
DVTRES 


11072 


24973 


A 


11165 


265 


466 


KLLTSLSFFNIiLRETNQVGLLELSQAKY 
KKTPRLGTVAHVCNPSTLGGQGGWIT*A 
QEFETSLANMA 


11073 


24974 


A 


11166 


204 


458 


KRCIaLPLPPDSNTQSSQCRREQIGRAIR 
KI LTFE TQNI KRLKTE KQLLKLLE * SGK 
I TAHCNLDLPGSS S P PTS ASQS AG I TAG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQD) 
NO: in 
USSN 
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beginning 

nucleotide 

location 

correspond 

ingto first 

amino, acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide . 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H~Histidine, I=Iso leucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagioe, P=Proline, 
Q=Glutamine, R=Arginine t S=Serine, 
•^Threonine, V=Valine, 
W=OTryptophan, y=Tyrosine, 
X=tln known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide Insertion 














S 


11074 


24975 


A 


11167 


256 


3 


KILFFKPGVLSPPWGGVNFFQLMPFQGG 
PPKKKRGPLSPPKKGPFFFFFFFF*ESR 
SLLPRIiECSQAISTHCmCRLGSSRTRG 


11075 

• 


24976 


A 


11168 


1 


457 


HRTPSVRTPNCRGDPRVRSTNHKDIGTI* 
YLLFGA*AGVLSTALTLLIRAEIiGQPGN 
LLGNDHI YNVIGTAHAFVI IFFI VIP 1 1 
IGGFGN+LVPLI IGAPDMAFPRINNISC 
*LLPPSLLLLRASAIVEAGAGTG*TVYP 
PLAGNYSHPGAS 


11076 

• 


24977 


A 


11169 

i 


343 


428 

• 


GR* L FS TNHKD IGIiLYLLFGA* AGGLST 
ALSLLIRAELGQPGNIiLGNDHIYNVIGT 
GHAFVI I FFIE I PI I IGGFGW*LVPLI I 
GAPDMAFPRINNISF*LLPTSLLIjLLAS 
G I GEAGAGTG * TVYP P LAGNYS H PGAYV 
DLT 


11077 


24978 


A 


11170 


91 


2 


PRQDPSFFFFFF*TESGSVAQAGVQWCD 
L 


11078 


24979 


A 


11171 


219 


1 


IFKSKILCLPPPEPNFFFFFFFFFFFFF 
FFMRHGGACCSPSYLGG*GERMT*ASRV 
AGTTGVCHYAWLI FVFL 


11079 


24980 


A 


11172 


292 


365 


RI * SLGQAQWLTPVI PTLWEAKVG 


11080 


24981 


A 


11173 


251 


1 


I CDLQ I LTHGITT C VKMGLTTLF VSG I T 
FZilHHSLAIKFGV* KNPS*LGTWHTCN 
LSS *GGRGGRIA* ARE FETCLGN I VRP 


11081 


24982 


A 


11174 


200 


407 


KI FETJD VYVKCSN I LCLrWAS I S FSNYFG 
GPFYLNL* * FLSYFSFLETESHIVAPLE 
CSGM I LAHCNLCL 


11082 


24983 


A 


11175 


11 


155 


STNFFLFSETGSGSVLQAOVQWPGLDLL 
AS SHS PASAS * VSGTTVDAA 


11083 


24984 


A 


11176 


140 


1 


PSTFPDWAMRRVS CLS KGDKGQV* WLT 
PVIPAIWEAKAGGSPESI 


11084 


24985 


A 


11177 


210 


1 


TDEELLLMDEQRK* FIBMESTPDSDAVN 
IVEMPMKDLEYYINWD*WAGLXRTDC 
SFERSSTVGKMLSN 


11085 


24986 


A 


11178 


133 


148 


STSTR*LFSTSRKDIGALHLLFGA*AGV 
LGTAVS LL 1 3AWLAQ LGSNKEA 


11086 


24987 


A 


11179 


236 


1 


KFWKKDPLTKGGCPLKKKGGGGIFSKKK 
KNVFFPKKKFKGGKKKNFFFFKKKKKKK 
KKKKKKNKE * LFI S FAHI AVHL 


11087 


24988 


A 


11180 


210 


1 


QSLGQENLQVFFPLFGP* PNYLDKGEKW 
IRVGKIWGHIiFFFFETESRSVAQSPRLE 
CSGAIAAHCNLCLP 


11088 


24989 


A 


11181 


292 


348 


KKGAL FFS PGG* KKNFP PGGKKKGPLFS 
KKKKNPP * KKPGE KE KTRFP P FLGGI/3G 
GFPLAPGGKPPRNRVIAPPPNPGGKKKK 
KTPFSQKKKKKKKKP 


11089 


24990 


A 


11182 


168 


3 


KPEASMLRMDFFFF* EAGSHS VAHAGVQ 
WYYCSSLQPPTPELSDPLAPSSKVGGIT 


11090 


24991 


A 

» 


11183 


87 


351 


KGIKWSLNIQRGVQPHLRSYNYSFFSFF 
S *DWVLLRLECSGTTTAHRSLNLPGSSD 
PPTS AS QAAGTTGMHCHTRL IF* ETGFH 
HAVQ 


11091 


24992 


A 


11184 


55 


344 


TSLNLCFFFFFWGEKSPFLPPGGDEGPL 
FGQLDTRPWG I EKI FFAPPP * KGKRGVP 
PGGEE FKNP PAPKGE PPLS * KK*NQPGQ 
GAP PCYP P FLGGGAKKI FS I PQGRVSN * 
PKRGPSSPPGGKKGDFSPQKKKKKQRFR 
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M 
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peptide 

seauence 


Amino acid sequence (A=Alanine . 
OCysteine, D=Aspartic Acid, 

E^GIutamii* Arid. PWPhfcnvlftlnnin** 

JW ■ •* Ullll f\» /ft^lU) M> M llvllJUllullllJv| 

G^Grvcine. H=Histidine. leucine. 
K=Lyslne, L=Leucine, M s Methfonine, . 
N— Asoa raeine. P=Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine. V=Valine. 
W-Tryptophan, Y=Tyrosfne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=posslbIe 
nucleotide insertion 














EV**PNSPSGAP 


11092 


24993 


A 


11185 


344 


190 


LVETRFHRVSQDGLNHLLTL* STRLGLP 
KCWDYRCEPPRPANTANSLNFPM 


11093 


24994 


A 


11186 


135 


3 


I FP KVFGGP FFFF FFF * ETES RSVAQVQ 
VQWCNLGSLQAPPPGFT 


1 1094 


24995 

• 


A 


11187 


305 


14 


IML ITS IRRSRLS STAB KGiL^lilikiCG 
RVAKYLINNRLITTRE*LIKLTSKQMIT 

FD*SLLLY 


i iuyj 


9AQ0A 


a 


11 loo • 


I Oft 


i 


T.PnTTTKTMMTTPPN'K* Q T T Q T PC T?f2ITT.'R , Jr T C 

FP FLLFFF* DE VETjIJjSRIiECNGTILAH 
CNLCLPGSSD 


11096 


24997 


A 


11189 

- 


352 


3 


QLCNTPLPKTNHAMLTSSNANTLIYRRS 
ni/iftAQDrJrJ,ijr X^/wJxXXjXaN&xCuJ. lirrlU 
*LIKIiASKQMIAIPNTKGRT*SLILVSL 

T TCPFVC; 1\ T ^ WfTiTt/VT T.T .MP VT.T7W2 T 
iiv.r J-* Vuijiunv-W v v j. oxvJuiji'irv iliij vol 

NRGG 


11097 


24998 


A ' 


11190 


270 


137 


CIINRLyVCVyiYIYiyiYi + lVFYMYV 
YNILCILGHRRESGVI 


11098 


24999 


A 


11191 


18 


342 


ACRKRKMSSKFALRATGNCYYS CTFCS I 

KITV* K* S KTKKNI KGLGAR * ELS LA* L 
LGEPP IFAKGHPLMLKWVEICKRP 


11099 


25000 


A 

* 


11192 


169 


3 


ERSGIIRVYQD*NQCLAWSLA*kllHPW 


11100 


25001 


A 


11193 


407 

• 


I 


KRACFRWLTLL I STLWEAKQENC3jNAGV 
EVRWED* LSQGG*GYSEPCSHHCTPAWA 
RWLVREMWPSCQAS VIQVRRI S F 


11101 


25002 


A 


11194 


110 


412 


VOGFRLLHS PAI LMPLLMPS Y FRRHSNT. 

EIKPISNPTVASKRASERKSCTSLTLNQ 
KTjEMT KLS BEAT * KAEVt^ITlV31^1\WOirv 

SQWNAKEEFLKE IES A 


11102 




A 

mm 


1 1 1Q5 




ijj 


SWDYRRKPPCPA 


11103 


25004 


A 


11196 


289 


453 


NQPGKSFMISSTFHFMMEFTLVHAGV*W 
LDLGLLQPPP PRFKG FS CLHLP SGWDR 


11104 


25005 


A 


11197 


177 


432 


KWEKQAQTLGGGNFPKGPKTRVGKNFBK 

GERGIPGKKGGKTGPPGFKGPMEKIKRG 
K 


1 1105 




A 






■a 
J 


AMVS iZS PRLEC?S SM TMAHCS T/5Sr/5flflr> 
P PTSAP + VAR I TGMHHP 


11106 


25007 


A 

• 


11199 


150 


492 

* 


GDPLPGHHP V * EVRRPFARQLPRLRSBE 
PLRPVATPSGK*GASLSGSHPIQEGGGG 
GQPLPGHLPHPGWRSGVKP P PGQS ARPG 

RFfMT.TiP2VTDTnK'*fif5PA<5rtnPHT.Qf2f2P 

RE 


11107 


25008 


A 


11200 


239 


64 


YQKYYY YYFF* ETE SRS VARVKS SGLI S 
AHCNLHLPGSRLQLCLFPTSAVFGQESH 
IT 


11108 


25009 


A 


11201 

■ 


31 


396 


I LTMRE I VH I QAGQCGNQ I GAKFWE VI T 
DEHGMDPTGTYHGDSDLQLDRISVYYNE 
ATGGKYVP RA I L VDL * PGTMDSVRSGPF . 
GQIFRPDWFVFGQSGAGNNWAKCHYTR3 
AELGDSVLDV 


11109 


25010 


A 


11202 


85 


407 


VLLPHSEALEGADTMPHSYPALSAEQKK 
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sequence 

• 
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SEQID 
NO: of 
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sequence 
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M 

OD 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 
nucieouue 
location 
correspond 
mg io iirsi 
amino add 
residue of 

pepiiuc 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

a CIO 

residue of 
peptide 

Sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 

je/ vji u lain it, /\ciu ; j^jrijenjiuiaijinu, 
G=Glycine, H=Htstidine, I=lsoleucine, 
K-Lysine, L=Leucine, M=Methionine, 

i>— >vipd ragiiic, x— rroiiiicj 

Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 

X^Unknown, *=Stop codon, ^possible 
nucleotide deletion, ^possible 
nucicoiiuc inserxiDn 














ELSDIALQIVSPGKGILAADECVGSMAK 
RLNO I GVENTOENRRLYROVLPNADDHA 

• 1 Illy AW V 1 . > i ^ A ^^JCIA^iAXJC^AJ a V Uv IVwvXW 

KKRIGGVIFFHETL*QKDDDGVT 


1 1 1 w 

* 




7V 


1 


iUZ 


115 


THTJDTTVAT YT i Q T YT T YT J? T YMT »S T YPTT . 

S IYLCRF* LID*DRVSIjYFSGWIjQMPGL 
KPSSCF 


1 1 1 1 t 

1 1 11 1 




A 


1 1 0A/I 
1 1ZU4 


1 *1A 




PMjl YflfiDr' G:\7T A/fiRT* * T P AT .TI7DC.TPT . 

DSKFQRDLVTLRTDFSNFTSNTVAEIQA 
LTSQGESGLDGALGWGQ 


11112 


25013 




1 1 OA< 

11205 


232 


62 


LiECGGMILAHCNIjQIjPGESLEPGRQSLQ 


11113 


25014 


A 


11206 


161 


2 


KKKGFLLPL I EPFCIQB I QLRLGAVAHT 
YNPSTLGGRGGWIT*GQEFKTSLAN 


11114 


25015 


A 


11207 


251 


3 


PNHLPKAPSFK i I LRIRFQHLNFRGAQQ 
TFGLQHKLTS KQM I TIHNTKGRT* S LI L 
VSLI I FI ATTNLLGIiLPHS FTPTTQL 


11115 


25016 


A 


11208 


145 


287 


GFRLKL* INQKYS FLKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKK 


11116 


25017 


A 


11209 


436 


110 


ISGRGEQKFKYFS PAPFFRGINPPPLGG 
E*NQNLKPP*KKIFLGSPKVPKKKGGPQ 
KiliKKGPGGGGPPijr PPPiiGGQGGGF PF 
SQGFKTPPGKKKKPPPKKKKKKKK 


11117 


25018 


A 


11210 


nil 
217 


2 


K* KKNLWPGMVAHACNTNTLGGQGGRI V 
♦AQEFETSMGNTVKC 


11118 


25019 


A 


11211 


146 


3 


QKVSHAQYRCLSWGYFFFILFLETRSRS 

lAQAuVQWKUlAjbJU* FFPA 


11119 


25020 


A 


11212 


187 


384 


GLI CFYLTCFYF I FY FILBTFSLJjCYVM 
I^rWLCmtt,CYVMVC^VMLCYVI*DRV 

oU UrtJriJWoMA 


11120 


25021 


A 


11213 


373 


482 


MIKLRHRLGTVAHTCNPSTLGGRGQWIA 


1 1 tot 
1 1 121 


23U22 


J\ 


1 1Z14 


04 


ioZ 


I IAYCNFELPGSGDSCHLSIiPSS WNYRY 
MPCMPS *LKKNF* *RQGLTVLSRLEKDY 
I*L*DFSALRSFPWGR 


111 




XL 


1 JZl-> 


1 CI 

loi 


J DO 


r*nQ 3X VPT .T .n T VPTP Flf SkDT.TrTf TT? T PMP T 
\*L40f% V FLU tUX i tr ± CiC rvr^L/ J^IvJ\ XA1 XT Irl v J. 

IALFAVAKR * KQPT YPS IDKWI KKLWAG 
CGGSRL 


11123 


25024 


A 


11216 


268 


369 


TTRPDM WHTYNS S TSGGQGGT I T* AQQ 


11124 


25025 


A 


11217 


90 


3 


IGQAQ* LTPI I PALWEPKSGGSPEVRSL 


1 1 IOC 

1 1 125 ; 


25026 




11219 


277 


272 


QT.TBDftT * COT.T>DT>T.DU17nUT CTV20CCT7 

SEP * S CPCTPAWVTQ * DS VSKKKEKKKE 

TTKT? KF TTP KVMnTPlf ATPQ POT> AT • Tf3«TWG. 
ivrvK ivcj xV-Cj ivv y a ir ivrt i. CtOtr\s^t\lJ xvjoxi«j 

LQVQKRVYAHPIL 


1 1 \*^C 

1 J IZO 


25U27 




1 1ZZU 


1 Q 

ly 


34.3 


FLEMKYT PGEDWNT I KMTTEDLKYOTD 
LVDKAAAGLERINFNFEGSSSVGKMLSN 
S I AWYS E I FYERQSQFMWQT * LIiA 


11127 


25028 


A 


11221 


266 


356 


FFEMESHTVA*AGVQWCDLGLLQSLPPG 
FK 


11128 


25029 


A 


11222 


105 


2 


RFVCSTI KVXgRDLSSARSNPGRFI *TSN 
SRPRKS 


11129 


25030 


A 


11223 


145 


350 


RKGRQLLD * DLGWV *WI/TPIIPI LtWEAE 
LGGSLEARSSRPAW 


11130 


25031 


A 


11225 


155 


348 


PLFFFCFHRNTFY* KSVCRFPKKKKKKK 
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sequence 
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1 SEQID 
NO: of 
peptide 
sequence 
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M 

etti 

od 

. 


SEQID 
NO: in 
USSN . 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
E=G]utamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleudne, 
K=0Lysine, L=Leucine, M=Meth ion I n e, 
N^Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S^Serine, 
"^Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKXKKKKKXKKKKMEGGGGGANTKFLY 
GENRKPPLFFFF 


11131 


25032 


A 


11226 

• 


271 


3 


NFLSVFNFTRICLQMCFLSIFLLIPGGS 
LKYQLFLAQGFFFFIiHYFFTSSIHSSLL 
Y*MLDFLDLFFLYLFFCNFNFSVFLFCI 
LSKFF 


1 1132 

■Mil Jm 




A 


11227 


j 


413 


SPAEPHRSYTMWNVKVGVNGFGRIERLV 
TRAAFNSGKVDIVAINDPFIDLNYMVYM 
FQYDSTHGKFHGTVKAENAKLVINGNP I 
TIFHERDSCKI KRGDAGAEYW* STGVF 
TTMDNAGAHLHG * ATRGI I SAPSAD 


11133 


25034 , 

i 


A 


11228 


48 


413 


VEVLVSCGFFFFFFWEKKICFFWPGGKK 
RGKIGGTGTFGPKGKGNFRGPPPKNKEK 
GGPPPPPNNFGFLKKKKFYQGGQGG*KT 
RNFGKP PAPNP PKGGDLGGGPPTQGQNG 
FLKKKKKQKK 


11134 


25035 


A 


11229 


172 


404 


EFENSCSPGWQRRGVMIPGVTVEDMNQQ 
EFIRALVAIiLKKSRKLKVPEWVDTTKIiA 

KRKELTL * DEKWFYT *AASTT . 


1 1135 


2503 6 


A 


11230 


247 


2 


GPKKKS VS KKKKKELPCDPAI PLPGI YP 
KEKKS VYKKDNLHLRVYCRAVFT IAKI W 
NQPKC*SGDK*IKKI*YIYTVDLRR. 


11136 

• 


25037 


A 


11231 


1 


372 


TETTVE VAWCELQDRKIjTKS ERQRFKEE 
AEMLKGLQHPNIVRFYDSWESTVKGKKC' 
IVLVTELMTSGTLKT*VHOYYKS*PRSM 
REFD FLNFRSTS FTLWQPKKGTVLPTV 
TVLAYFLAHKVL 


11137 


25038 


A 


11232 


311 


464 


KKATPRNLCKWPGTVAHACNPSTIiEGRD 
GWIT*GQEFETSLTNMVPP 


11138 


25039 


A 


11233 


64 


418 


VPVASRRRRRCGRVGGGKAMADLDKLNI 
DSIIQRLLEA*GAKPGKKVQLHENEITG 
LCLEPRE I FXjRQP X LL * LVAPLKI CGD I 
HGPYYDLLRLFEYRGLPPKSNYLFLEDY 
VDRRQH 


11139 


25040 


A 


11234 


233 


421 


PXTSNMRSRLLRGAS S * KKTAGPQQRNL 
EPALPRRWGGRSADNPPSGSLRKSGKNM 
QKTPGTA 


11140 


25041 


A 


11235 


343 


I 


NNTHGLSHS S VGRNPGAVGSGS CMSGIK 
LSEGLRSLLEAVCISRAAVIKCHSWVAS 
NNRHFFS PS SGG * KSEAKVWAGPCSL*R 
RVPSLPLPASGGSRHCLVCGCVTPFSAF 
TW 


11141 


25042 


A 


11236 


122 


419 


RTPRGPKPNPGLQRKTFFLIGGRIKTYL 
GIFGKDLYFWGGKNWDTLLPKI * RSREN 
KNF*REKWVNLPPYPGCLKNGARQQHFA 
HLLILRDTKTYMLAF 


11142 


25043 


A 


11237 


1 


379 


AFNHLHAGHGLSGAAMKS LVLLLCLAQL 
WG CHSSPHGPWLI YRQPNCDDP *TEKAS 
LEAIDYINQNLPWGYKHTLNHIDEVKVW 
PQ*PSGELFDIEIDTLETTCHVLDPTPV 
A* CNVRQLNEHAVQ 


11143 


25044 


A 


11238 


1 


397 


QTMTPNRGPLCPSKDLRSSHVISLPLHS 
ATHTRPTNQHTNH I PMMARRS TRKHIRR 
APHTTCPKRPS IRDNPI YYLRNFFLRRI 
FLS LLPLQ PS P YP P IRRALAPNRHHPAK 
SPRRPTP * PLTGALS ALLKTSGLAM* FH 
FH S I ALL I LGLLTNTLT I YQ * WRD VALE 
STYEGHHTPPVQKGLRYGI XLFITSE I F | 
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SEQID 
NO: of 

nil 4*Iaa42#1 A 

nucieoiiue 
sequence 

• 


SEQID 
NO: of 
pepuae 
sequence 

• 


M 

eth 

uu 

• 


SEQID 
NO: in 

09/515,1 


Predicted 
beginning 

ft n j*1 AiinA a 

nocicouue 
location 

PA n*ACn nn/1 

ing to first 
amino acid 
residue of 
pepuue 
sequence 


Predict- 
ed end 
nucle- 
otide 

lUCilUUU 

correspon 
d'mgto 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, DfAspartic Acid, 

IT—/"* 1 ti4-r\ m if A f*t/t T7— -Pt» aniilalanSno 

e/— v^iuiuni il >*nu, x 1 — jt u tny laiunine, 
G=Clycine, H-Histidine, l=I$o leucine, 

Ifssl .vtinp T j=»1 .<*iii*itia lVislVlAttitrktiSn* 

F\. M-iJ 3 1 11 C , Jj"JU^CUV.|IICj 1T1— iTJLCUHUIIlUC, 

N»Asparagine, P^Proline, 
Q^GIutamme, R=Argfnlne, S=Serine, 
T=Threonine, V=Valine, 

WsTrvn^Anhan V^TvrActna 

tt~ ji rypiupuaii, f x jrusme, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














F FAG FF * AF YHS S LAPT PQLGGHWP PTG 

X X Jf Jjli It XJlLLJr'XiXJl* 1 O V XjXiriOU V O X X 


11144 


25045 


A 


11239 


267 

* 


3 


FVSVRITPIYASAATWLSYVWVWASSPN 

V V^f f VjL/K ViiJ-iOr'KV EiUOVsl XOAvKJNXirtXj 

PGSSD YPAS A* PADGWAPPQ YRCPPPRL 
ANF 


1 1 1 A C 

11145 


25046 


A 


1 1 Oil A 


39 1 


31 


vr r r r r r err rrr J? J? 1? r err r rr eetrrif 
LKNPFEPPPNFFFFCPKF*ILFLN*I*F 

# TTTT7* T * VTD/2PPT.KrDDDPl?DP'G , I?l?T?DD 
" XXVI: x MrurrmNfrfrr if ire cc V rr 

GALFFFFFF*EGVLPCSPGWSAMSHLLS 
EKRKLTPRMSYN 


11146 


25047 


A 


11241 


371 


491 


KI FL CWARHDAHACNPS I FRGQGGHIT * j 
vU&rixxSxjANMV 


11147 

■ 

■ 


25048 

T 


A 


11242 


l 


221 


HRIDPLCRNSARAFFFLSTGCHRVSQDG 
LDLLTS * S AHLGLP KCWD YRCKP PRPAL 
NCFFRTISQLLNLCIKD 


11148 


25049 


A 


11243 


382 


333 


T YKI KDF KTF * RW FHPN IS RVKAEKJjFIj 
SRGQRGDFLARPSESSPGGFTltSVR 


11149 


25050 


A 


11244 


169 


477 


TQKDEKES TTTTKNAENS KGQS AYSS PS 
DOJTSLARAQN*EEAEMDELTEVGFRRR 
VIMNFV1\xjKGHIIjTQ^REAE^ 
LLTS ITSLEWNINDLMELK 


11150 


25051 


A 


11245 


200 


462 


S LIYNEIiCTIHILYQYYVA* QRS * CFQQ 
LiPIS Iri * KTljKIsJbi\i^oijL.KVAJjJji PS r UK. 
WECRGWLGTVADT* KPSTLGGQGGRIA* A 


11151 


25052 

• 


A 


U246 

■ 


484 


301 


I FCEKKGL * FQLLRRVRWEGP FNPGNQG 
CS*LLSSPCPPGWITE*DPVSQKKKERK 

PMVT17MFT ,VC TiOPTsnTNTV DP C3VHT .ft&riB G 

MLAFSTTPR 


11152 


25053 


A 


11247 


226 


458 


NTVAVDWGPRCFFSLLLSHAENCDMWFT 

TTT T7T.T7T 1?TWTJT T.T.TTnnT.TTMT.liff * r , 2/^liWli^ , 

NPSTLGGRGGWI T*GQBFETS 


11153 


25054 


A 


11248 


304 


436 


SPSRCRK* KIG * AQWLTP V I S TLWEAKV 


11154 


25055 


A 


11249 


216 


451 


THKYLFYYLGYNPMLYIERETGSCSIA* 
CLANF FFFFIiKKWS L WLPRLG 


1 1 I 

• 




n 
i-i 


1 IZjU 




AHA 


nKVWT.TiVPT.IfaTiNWT^JZXHTWrT * KVQIf T 

1/XV.XI7X4XJ I f XjXvfM ilN V VXKSi-iClXM X IV.V OI^X 

MC*K*DKNMYNFFFSF*DGVSLLLPRLB 
CNGTILAHCNLCLPC 


1 1 1 ^£ 
1 1 100 


*><n<*7 


TV 






WO 


KKKKKKKKKKKKKKKKKASS SQKAQDYS 
KASDYK 


11157 


25058 


A 


11253 


348 


478 


DLGWLKLGTVAHTYNSSTSGGQGRRIT* 

\3\£RJ? oLtrWt v VX\Xr v» 


11158 


25059 


A 


11254 


174 


475 


MHHWVLILQDIDDNT*NCN1jHFSLCSQL 
KS IQKBKKKKKKKKKKKKKKKKKKKST 


1 1 1SO 
J 1 4 J> 




A 


1 IAD J 


TOO 




TRSSSWNAGTFIIFKIVTFFLFFFF*BR 
VSLKLTRLERSGAI PAHCNLHL PEF 


11160 


25061 


A 


11256 


347 


16 


TQSADWCIYNPPARHKSSPSPHPTQ*PS 
WLHLVDPAPGPWVELPTSPAPSACTPQP 
LGSRWDQPPQSKGQCPSGRLGPQGSPPQ 
VGLGKAGWRSQALPVGLTRS IESHHW 


11161 


25062 


A 


11257 


238 


590 


APGAYI FPQQHNPTDSQVI FSLTASHLL 
ICSPLCISVIjHFNGMNIDDENRNAHVHA 
RACAHAHTHTHTHTirniTLYIiMNYRET^ 
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Amino acid sequence (A^AIanine - 
C=Cysteine, D-Aspartlc Acid, 

V=CZ 1 1 1 ta m if» Ariri ItbzPhnnvlnlnntnA- 

JCi VJ> 1 U lil 1 U IK. t\V,HJf M? X UCI1 JrUIIlllJUMSy 

G=GIvcine H=Hi<rtidihe I=Isoleucine. 
K=Lysine, L=Leucine, M=Methionine, 
N = As Dam nine P^Prolinc 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

X=lJn known *=5?ton rnHnn ^mossihlfi 

nucleotide deletion, \=possib!e 

llUtlVUUUv lllOwl HUH 














. MLLKTRGRATGSHARCIFS+SLLQACP 


11162 


25063 


A 


11258 


310 


461 


LSFGDVLQLESDRPGAVARTCGPSTLGG 
*GGW IT* GRE PETS LANMLKPR 


11163 


25064 


A 


11259 

* 


461 


174 


MGFHR VSQDGLDL LT* S SAHLGLPKCWD 
Y * CE P LRLAPVGKLFKCVG YKLVLCHGS 
Q YMSMVI KIRMPQL I WSLEGG3AAWHR 
YWSIWEISFGC 


J 1 JO 4 * 




a 
*\ 


1 IZOu 




477 


AWVTK* DPVSHKQE 


11165 


25066 


A 


11261 

■ 


229 


3 


VKMKLQ*VPVKVKI*KYTPLLGIYLKEL 

If Afi P(5R MTPTTiMF T AATi PT T AMRWMOPR 
ivn\j c uiui jjfic innuf x xiinnnnyr a 

LP* IGECVNNM* YIHTME3 


11166 


25067 


A 


11263 


163 


399 


KLIi PGTRNLHAMEHP I FGWLRNPHATAQ j 
SSCIIAGYPN*EGMFASQHHRVH 


1 1 K*T 
J 1 10/ 




*» 


l mc/i 




A*7*7 


TKTWVT T+ TTNTTTHGT.TTQTTDTT.PTOn 

INNNLFSCS 


1 1 loo 




TV 


UZOJ 


Io4 


41o 


1jGGKKKFIiRGKGGKTFFFFPF*KIGSSS 
SPPPPPGGEQAPPPPPKSPPRR 


I 1 1£Q 

I I lov 




TV 


i Izoo 




i ■j 


KKPKQGFITPPLKQKIKPPPPGGYSGPP 
RVL * KGPPH I FYK 


11170 


25071 


A 


11267 


3 


173 


SHHARPETGFHRVSQDGLDLLTS *SACL 
GLP KCWD YRRE P PRQARKL YCLGMRTNE 


U171 


25072 


A 


11268 


157 


2 


PVDITHVKTTVGLGMVAHACNPSTLGGR 


11172 


25073 


A 


11269 


291 


468 


ISEGKRSLFVILHFILFFETGSCSVAQA . 

GVQWC^IJ3SL*PHIPGPK*SSHLGLHTG 

ACH 


11173 

• 


25074 


A 


11270 

• 


29 

• 


478 

• 


GWN P YNQQD I GT L YL * S GARAG VLGTAL 
AFAI I FFIVTPII IGGSGN*LVPLI IGA 

DnM A VPP TMM T Q I? * T ,r .PTQ T .T .T .T .T A C A T 

AEAGA*TG*TVYPPLAGNYSHPGACVDL 


11174 


25075 


A 


11271 


61 


474 


PYNQGGNDAHAAWPHLTKWRSQGTTEIC 

R R PFC3 AWORPJ5n PP T iHTffSR Q PROT.P AT J? 

QNLPSYSGGSHLLSAYYVPDTISKHFIN 
TIGWIJ3TVAHACWPSTL*GEDRWIY*AO 
EFKTSLANVL 


11175 


25076 


A 


11272 


277 


478 


KIYQPVKIHGPRPGAVAHTCNPSTLGDR 
DGWIT*GQEFETSLANMNSPSL 


111 /o 






1 1* / J 






KPKT ARO AR WT.TPV T P ATjWR AR AGAfi PE 

VRSSRPA*PPP 


11177 


25078 


A, 


11274 


138 


3 


NYINLYAWIjGAVAHACNPSTLGGRGGWI 
A* AQE FKTSIjGNMAKP 


1 1 1*79 
J 1 1 /o 






1 IZ/D 






r v x XM'Hjr ruvvsiUAarixr Uru • OAirXAjXtx' 
KWWDYRH * PPRLALFFFF 


11179 


25080 


A 


11276 


262 


3 


FCMSLVHLFSHSQDNYFTSFSL* IFNIP 
SGPFLFAFQL7rLQ*P*DFIYFYFETESP 
S VTRL E CS GA I S AHCNLRL INS SNS PAS 
AS 


11180 


25081 


A 


11277 


285 


1 


VKKGDSVSKIFPJCiUCjCRJOKIRLHFMIT 
TVNNLGI EGNLLNMI KEI YKNPTANIIL 
NGERLKAFSL*SGTRMSFLTPTFF*ILV 
DTQQVYIFMSH 


11181 


25082 


A 


11278 


160 


3 


TDKE VPL INAQRKW FLEME S I PGEDAMN 
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ing w iirsi 
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nanf i ft A 
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Predict- 
ed end 
nucie- . 

UUUc 

location 

i*A nmc n A n 

vUix espoo 

ding to 
last amino 

residue of 

peptide 

sequence 


Amino acid sequence (A-AJanine 
C=Cystelne, D=Aspartic Acid, 
tv-uiutaniic Acid, r -Phenylalanine, 

oiycine, it— Histiame, i— isoieucine, 
KMLysine, I>Leucine, M=Methionine, 
n*— : Asparagine, i^tToiine, 
Q=Glutamine, R-Arginine, S-Serine, 
T=Threonine, V=Valine, - 
w— irypiopnan, x— tyrosine, • 
X=Unknown, *=Stop codon, /possible 
nucleotide deletion, \=possible 
nucieouae insertion 














IV*MTTKELQYYINLVEKVLARPE 


111 ttO 










/too 
4oo 


JriiJM 1 LA3I*W1jN* *Nr QPGSVAPACNFST 
LGGQDRWIT*GQEFETSLANMVK 


1 1 1 Q1 






1 11 OA 

1 12oU 


527 




r r Vp Jjvfc.1 At HH V AQDSIJjIJjTS* SSRLi 
NLPKCWDYRHE PQRQAKNVGEFSETRS I 
Q 


urn 


25085 


A 


11281 


486 


248 


VRQLFSLLLPRIiBCNGVISAHCNLRLPG 
S CDS SASAS * VARITGASGSQAWLQVQ 
CLQPVQPGEhJLRVVLPQLWhQR 


1 t 1 Of 

11185 


25086 


A 


11282 


38 


157 


STGLHHVSQDGVDUjTS * STLPKCWDYR 
HDPLRPASQRL 


11186 


25087 


A 


11283 


474 


339 


FVFLVQMGFHHVGQAGLEHiTS*SACIjG 
LPJCCWD * RHE PTDPAK 


11187 


25088 

• 


A 


11284 


2 

- 


462 

• 


LETTRTARWNS VIALRHHNS CPYKMANTL 
IDKCVCSDYSKDWPPPIPLPLLRPPYCL 
RHNNMSTRSINNPPWPSKCSSEKTNEEG 
MLKVEIG+KLCIJjCQTFSKDMNAEKKFL 
KEI KS ATPMNTSM I RKQNRL I HOME KVL 
VIRIEYQASHNIL 


11188 


25089 


A 


11285 


65 


280 


S RAVE FNLLTT * S ACLGL P KC WD YRCE P 
PHPASP*FSKHiSSl^LKTYIiSSL*DSHS 
GFYSFVCSLGLI ili 


11189 


25090 


A 

• 


11286 


244 


488 

► 


MRLGRVPSVLFCYVRMQREGI ILEAENK 
PS PDIESVGALILDFTASRTIRNKILLF 
IIFPG*GILDSPNVP 


11190 


25091 


A 


11287 


345 


445 


KRPGT VAHACNPHTLGG* GRWI T*GQE F 
WTILA 


11191 


25092 


A 


11288 


223 


500 


SPGTRPKVLECQSTIjGRQAILLLGEGWL 
WGQSHRl^QSQLCLCVLWDLTCSLSCS 
FQI IKLR*RPGVMVHTCNPSTIjGaCGGR 
IAQGQEFK 


11192 


25093 


A 


11289 


157 


22 


LSCQEVKGEIGEAEICWAQWLPPIIPAL 
WGAKVG*SPEVRSSKPA 


1 J 193 


25094 


A 


11290 


305 


497 


KLVWCIYFKGVSFMVRELYC3«CIV*WLG 
TVAHI YNPSALGGACGWIT*GQEFETSL 
TNMAKPRL 


11194 

■ 


25095 


A 


11291 


478 


1145 


QHVQACP ER PQMMGTLERSRAVASKIGH 
SYSLDSQPARAVGKPWPQQACTRVTELT 
EATGKLIRTSHIGKPHPQS FQPAAATQK 
LRPASQQGVQMKTQGGASNPALQIGSHP 
MCKSSQFKSDQSNPSTVKHSQPKPFHSV 
PSQPKSSQTKSCQSQPSQTKPSPCKSTQ 
PKP5QPWPPQSKPSQPRPPQPKSSSTNP 
SQAKAHHS KAGQKRGGKH* RANSRDL 


11195 


25096 


A 


11292 


352 


468 


TFPGQHGKI PSLLKI QKLAGHGDACL * S 
RLSPR 


11196 


25097 


A 


11293 


295 


24 


EN I QGS KKNERQHMHNGVGTWNSNPG FV 
STWVYRNCLACLDI SFSFI KWVKSHKDT 

^MOWPTa ZX T .T7T T VT * MH Q TY^P QMVTtUW 
oi'jvjt'ir x/uur iiAM iiyi i v^jfori vuvin 

STTRTS 


11197 


25098 


A 


11294 


274 


429 


ECE P VQTLWKI RKFLPCKLKIELPS DPAI 
PLKGIWLKEVKSVP*SDTYTPMCI 


11198 


25099 


A 


11295 


168 


469 


PSLGNKSETPS PKKKKNLFLGFPYFREK 
LVGILLKTVLQKGAGGVFFGPKNPPEKN 
PDPPLDSPPKTFRRLGAFVKTFPKGLKA 
AWFPALV* POGQMGG 


11199 
- 


25100 


A 


11296 


37 


348 


IKI FNKTLKYFKKKKTIKKKKKKGGGPL 
KRPRGGPRIYPDGAPJCNLPHIGTAKKHW 
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peptide 
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Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Htstidine, I=Isoleucine, 
K«Lysine, LHLeucine, M=Methionine, 

awi a • Tt IT* a* 

N^Asparagine, P=Prolme, 
Q^Glutamine, R=Arginine, S=SertoeT 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /■= possible 
nucleotide deletion, lpposstoie 
nucleotide insertion 














GGYWE KNPHGCjJGGNTGPTP PQKTRPLGG 
KNNLKRVQGEKPRPRLGGGKTRSQLFSC 
HA* P * VQGEKPRPRLGGGK 


11200 


25101 

> 


A 


11297 


93 


388 


DEYASDNCCYFPGSSHDSQVKQIGSQFT 
TQIQAHLNLAKPRRAL*NHS * PMEMRSK 
CLLLAYKKKKKKKKKKKKKKKKKKKKKK 
PKYLSSRGKDHF 


11201 


25102 


A 


11298 


61 


243 


YYIHRTTVFMCMN*GLKDNVDK*TIDLC 




























KKKKKKKKPQGGGA 


11202 

• 


25103 

« 

t 


A 


11299 

■ 


250 


341 


I YMSLTLGLQNIi* E IHLTHKDSHKLQVK 
GWKNI FHANVNQKQTGVAILISDNIDI K 
ATTVKKKKKRKNPT I LNIYVPNTGAPKF 
MKHLLLNLKNERDGTTVLG 


11203 

• 


25104 


A 


11300 


3 


285 


DAW E R KKRS Q MNQ KT TN KVAGVS PYLS I 
MAWNVNELNFL X KRHRVA* CI KKTKSMI 
CYLQETNFIKDTHRLKIRGWKKIFHPNR 
N+KLAGWY 


11204 


25105 


A 


11301 


175 


407 


AENNPDVLQLKA* TSRGTFMP * ILLRVK 
RGSNSRNNRCWQGWRGTGMLIjHC * QECK 
LVQPIiWKTVWRir.PDLESEIL 


11205 


25106 


A 


11302 


75 


280 


YHLNLVLYPHPPKNRVC*KKKKKKKKKK 
KKKKKKKKKKKKKKKRGGPLKKNLLGGP 
FFPGGKKKNPFF 


11206 


25107 


A 


1 1303 


299 


3 


IHAS FLFRGC ITLAFTVRCSHRS ITSSE 
TSDLTHYYYYYYYYF* LASQSYALAGVQ 
WDQLGSNHPPTSAS * VAETAGASHHARL 
IFNFFVEMEVSLYC 


11207 


25108 


A 


11304 


137 


323 


FFTVNGRHRKEKKSNKDQLNLGLHDFLD 
NPLLLF*KYLGREKKKKKKKKK1QQKKKK 
KQKKKK 


11208 


25109 


A 


11305 


64 


481 


RPTRPHVYHIVKPI P * PLTGALSALLMT 
CGLSM* FHFHS ITLLILGLLTNTLTIYQ 
♦WRDETRESTYQGHHTPPVQKGLRYGII 
LFITSEDLFFAGFF*AFYHSSLTPTPQL 
GGHWPPTGITPLNPLEVPLLNTSVLLA 


11209 


25110 


A 


11306 


142 


330 


FFTVNGRHRKEKKSNKI^LNLGLHDFIiD 
NPLLLF * KYLGR E K3CKKKKKKKNKKKKK 
KKKKKKN 


11210 


25111 


k ; 


11307 


259 


440 


RPWIRDNP I YYLKS CFLRTI FLSLLPLQ 
PRP YPPTRRALS PNRHHPAKSPRS P * PL 
TGALSALLMTSDLTMGLHFHYITLLILG 
liLTNTLT I YQ* WRDATRESTYQGHHTPP 
VHKGIiG YG I ILF ITS KWFFARFF * AFY 
HSSLDPTPQLGGHWPPTGITPLNPLEVP 
LLNTVRL 


11211 


25112 

- 


A 


11308 


433 


3 


FPPPDIRHSRSLGFPPSFGRAGFLNFAF . 

SKGQEPALPGGFYYPSPKKKNLAFCSPP 

IjFYWGKPGSKFSIiG vgplffp* *SPKSF 

CQFLENKGFFPEKSFFKFFPPAGGCVFF 

FSEGGFPFCLFLFFFDRVWLCHPGWISV 

ARS 


11212 


25113 


A 


11309 


2 


423 

* 

i 


GRVGDR* LF YTNHMDIGTLYLLFGA*SG 
VLGTALS LL I RAE LGQ PGNLLGNDH I YN 
GI VTDI1AFD 1 1 FF IAI P 1 1 IGGFGN* LV 
PLIIGAPDMAVARINNISV*LLPPSLLL 
LLAYAI VEAGAGTG * TVYPALAGNYSHP 


11213 


25114 


A 


11310 


234 


1 


TFFGKFPFCLKFKPAQRGGENTPGPFPF 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


• 


• 










LVFFFFEMESRSVAQTCVQWRDLLGSSN 
SPTSAS * VAGTTGACHHARLIF 


11214 


25115 

• 


A 


11311 


406. 


211 


LSLFLHSLISTHGARPTDPVGIiVPTQDG 
LDLLTS * SAHIGLPKCWDYRRAPPCPAC 
GFDLHLPND 


11215 

* 


25116 


A 


11312 


379 


398 


KT*NFTVRGSIPLLNNIPMASMTFSKKK 
KKKKKKKRGG P LKKNPGGAKFNGGRKRK 
I FS LKGGE KKNPGG I LEKKPFFGGGKKG 
ENPPKKN»3LPEKKKF*RGKGGKKPQKG 
GGKKMGS S S AKTPTWG 


11216 


25117 


A 


11313 

i 


390 


489 


RYRLGWAHACNP S TLGSRGRWI I*GQB 
FETSV 


11217 


25118 

* 


A 


11314 


286 


434 


ILWYMNYISIKIiLKIIIYKLGAVAHTCN 
PSTI*GGRGGWIP*GWEFETSL 


11218 


25119 


A 


11315 


322 


3 


FTFYGCRQFQVF* IVTKNDRi I frfcg 
NLLKI YLGLY* FYYVPLLRTKSLKKTGS 
FESNLSFK*VFLWSGTVAHTCNPSTLGG 
RRGWIA*GQELETSLTNMERPR 


11219 


25120 


A 


11316 


251 


389 


TFFFFEREFHFVPQVGVQGGSLGSLQAP 
PPGSHSLA*ASPRCWDGG 


11220 


25121 

■ 


A 


11317 


313 


80 


E YICPRCLSPGFRD * PGQHGETPSLQKL 
QKSAGCGGAHLRSQLLRRLGDRVRHCLK 
KR I YLS LFNS LVLSNNRFQ I I 


11221 


25122 


A 


11318 


145 


410 


APKTLKVALTS AAAARES I CQLGVLDEL 
VT FLE LNRS TDQR PGAVAHTCNPRTLGG 
HGGW I T* AQE FKTNIiANMVKTC 


11222 

• 


25123 


A 


11319 


393 


3 


NFFKIIFTLSPPLPLIFTPPFFFFFKKI 
PPKPLFFGVFHSQTCPQISFFKNFYPFL 
FWFSNPFFLLFSPSRKGPPPFSRFFFFK 
VGNPIiFFLFFFFFFFFFG*EGVSIiIiLPR 
LECNGTI S AHYYLR PRV 


11223 


25124 


A 


11320 


135 


1 


HRDVKLEYAIRKMEVRPGVAAHACNPST 
LGSRGGWIT*GQBFETS 


11224 


25125 


A 


11321 


282 


2 


NSESVLIKEKGDBEEKVLSFLLFSKRKV 
LESKTFPTLHSGCKKRSYFLKITTSSFL 
* PRVECSGVI IDHCKLKLLGSGDPPTSA 
FPVARTTGM 


11225 


25126 


A 


11322 


149 


30 


WGSGGQVRWLTP 1 1 PALWKAEAGGS PEV 
RSS*PVWPCLY 


11226 


25127 


A 


11323 


316 


399 


STIX3GRGAQIT*GREFETSLANMPNPIS 


11227 


25128 


A 


11324 


223 


397 


I I KAYQ * DVNREQ I KT I I I YFQNKLKC I 
KKMIWPGAWHVYNPSTLGG*GGWIT*G 
QE 


11228 


.25129 


A 


11325 


3 

• 

■ 


396 


NYSTAALFLRGH I WG WRWGTL YYG S S P I 
YCRGLNS TLGL YPLNAS SNPHPRKLCKP 
KI YTHIAKCPIGHKI I YPYANTPIH*HH 
TOTHTHTHTL*DSLLWPGVVAHTCNHNT 
LGSWGRQIT*VQEFKTSLA 


11229 


25130 


A 


11326 


113 


442 


KRYIQ IRAHSEVLCGHELWGTLHNPVQT 
IQTWEPFPYLPLCSRVTSGKSHHRFVPL 
LFHL*NRNNSHHHHHHHHHHHHHRNPC* 
E CCRAHSD S YAVWSDKVSQEPVAAM 


11230 


25131 


A 


11327 


145 


3 


E LDI KP PNPFFCHTGVQ * CDLG S PQLLP 
PGFK*FCLSFPSSWDYRCA 


11231 


25132 


A 


11328 


189 


1 


KTNLLLIHSXLRLHLQSSVFQC * S KSEK 
SNVCT * CAFSCLP KLFW LGKVAHACNPS 
TLGGRGG 


11232 


25133 


A 


11329 


193 


405 


ID* QTLTKKKKKKKKKKKKKKKKKKKKK 
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SEQID 
NO: of 
nucleotide 

■ ■ U m*mm'\mT % 9 %m %m 

sequence 

t 


SEQID 
NO: of 
neotide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
()9/515,l 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
dins to 
last amino 
acid . 
residue of 
peptide 
seauence 

■nr *fl mm MM 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. F^Pfaenvlalanine. 

mmmf 9 m9 M4 m 9m mwmr m m\ iflll m Ms m, ■* ■ * J 9mm m 111 9 9M>wmm% 

G=Glvcine, H=Histidine, l^lsoleucine, 
K-Lysme, L=Leucine, M=Metbionine, 
N^Asparagine, P^Proline, 
0 = Glutamine, R^Areinine* S^Serine. 

^« ■ ww wwm m mm m mm wrw, a m 9 m m mm* * WW m lvtV| 

T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














. AYTSKKKT 


11233 


25134 


A 


U330 


209 




GGKRGTPPTINNIFFNFFF*EKGSCCSS 
RAC*CNGEI*AHCPLRLPOSSNSPAAAS 
RVQETATTPS 


11234 


25135 


A 


, 11331 

■ 


38 

■ 


390 


VDQSTVCKMGQSAGCXSWGSDKGIKAGCIi 
K* QRQ KKKKKKKKKKKKKKKKKKKGGGG 
LKKKLFLGPGGEKFFFFFWGPFFFFGGR 
FLKRGGGKKPGYKKNK I SGHPPPFLGGG 
KKKKR 


11235 


25136 


A . 


11332 


38 


326 


VDQS T VCKMGQSAGCGWGSDKGI KAGCL 
K* QRPKKKKKKKKKKKKKKKKKKNRGGG 
FIKKIFFGPGGEKI FFFFWGGFFFFGGG 
FKKRGGEEKNLF 


11236 


25137 


A 


11333 


1 


234 


RTRGKNWMAFIKEGWVPTGRMGGSFIKH 
SFLE S KN1WLGKKKKKKKKKKKKKKKKK 
PHDK I KDLFDPV* NKKONI SIP 


11237 


25138 


A 


11334 


244 


398 


INHL YSNKNDL I FLLVFYI*FFFP*RRPG 
EPPLIKGWLPYLGGVLNLRKDPL. 


1 JZJo 






iijjj „ 


I ft 


i 


DAVAHG CNPGT LGAQGGW 1 1 * GQKFBRT 


11239 


25140 


A 


11336 


148 


411 


LFFFPHCYLLNAPSILYLNEPKWYNLH 

KKKKKKKKKKKICKKKKKKKKKKGGAFFK 
KLLF 


J Hi U 




A 


11^17 
Ujj/ 




4JJ 


GORPtLS FREKGS FFFIiliHTE FCFAPO A 
EGQGHNLG* LQLPLWG* SNS LVS SSQEV 
GITGAGHHPRLFFVF 


11241 


25142 


A 


11338 


65 


347 


VCRVDDF VPEARTRF FKS I * EAWNNKN I 
KPLLSTFSQVPGSENEKKCTI*DQAFIG I 
LEEE I INHSSCENVLAI IS LAIGGVTEG 
KYGSVLFCLK 


1 1242 


25143 


A 


11339 


158 


393 


VPCTHSEGIRKGRKCIiPKTRENAREERD 
KK I P PGLC* AGKLFPQEKS STWPGKWH 
AWNPSTLEGRGGWIT*GLEFKTT 


11243 


25144 


A 


11340 


117 


3 


GIRPGAVAHVCNPSPIjGG*GRWISRDWE 
L*TSLANIR 


11244 


25145 


A 


11341 


292 


60 


SGQGLRLRGAFRGKVQAKEWMPIPKLGC 
LVKDMKIKSLEBIYFFSPPIK*FEIIDF 
FLGAS FKDE VFFLFVCLFVCF 


11245 

- . 


25146 

■ 


A 


11342 

■ 


149 


30 


TQIFPPPP PPGKKKKFFFP KKKKKKKKK 
ERKKDRKNKOK* KGPGVGVSPliFPPPLG 
GQGGGFPQGLGFKTPLAPKGKPWFP*KT 
QECFPRRGGGPPKI PLFGGGGEKKPFSPG 
GOGSNKPKFFP PP PPRGKKKNFFS PKKK 
KKKKRKKERKTEKTNKK 


11246 


25147 


A 


1 1343 


120 


504 


GVHDRCLWDDMTRRPPAAAATACPAGRP 
P PRREEEHS QLLL I S FQGFRWD*DQDVN 
TPNLDHIiAREGVKAKYLMPPLVTMTSPS 
HFTAI TGKRHSAHFTRCPSNPQRPS FP V 
IRSKSSVSSREVBVA 


11247 


25148 


A 


11344 


213 


399 


ALVIHCLTTTIQVSGLSGSKYLMRQGMV 
AHGCHPNTLGGRGGWIT*SQESETSLAN 
TVKPCL 


11248 


25149 


A 


11345 


308 


3 

■ 


NPTPRSGDLWVEEGS PGAGNWGCLSEQT 
LRAI IKATTSYS FYCYFFETGSHSVAWA 
RVQWYNPGCSICGSSHLPTSAS *GAGTI 
GMHHHAWL I FLFFVENE 
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SEQID 
NO: of 
nucleotide 

in AM VPk>v IIUw 

sequence 

■ 


SEQ ID 
NO: of 

sequence 

— 


M 

- 


SEQID 
NO: in 
USSN 
09/515^ 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
inp to first 
amino acid 
residue of 
nentlde 

sequence 


Predict- 
ed end 
nucle- 

U ULIv 

otide 
location 
corresoon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutam(r Acid K=Phenvtalnnine 

G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N = AsDflrflpinp P= Proline. 

Q=GIutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W— Trvntnnhnn YssTvrnslnp 

X=Un known, *=S>top codon, /^possible 
nucleotide deletion, \=possibie 
nucleotide insertion 


1 194Q 


£mJ I JU 






ISO 


433 


GHCRAGTSLTLTHLGGSAMELI IHIiGIjC! 
LGRMKF I HTTC 1 1 IiFLPFLSATQAGGQG 
GDLS *LOPP PLGFKDFSCFSLPRSWD 


11250 


25151 


A 


11347 


257 


3 


KWGELCFLT I KE ERMGFPLAKKRKI KGV 
P*FGLFFFFETESHPVAQAHCNLSLLDS 
SYS PAS AS * VAG I TGACHHNOLIFI FYE 


11251 


25152 


A 


11348 


159 


434 


LSPSCRDVGTYSARAHPPHTHTRVRAHT 
HACTHVRAHTHTHTTAHGSHLTKEQPGL . 
CTLPGSSW* PGLGGRAIATAQAWVHCAG 
ITTAAHPP 


11252 


25153 


A 


11349 

i 


240 


406 


AFKKLAKNNSMGVNRSLSRPGVVAHAWW 
PSTLGGRGRQIT*GQEFETSLANMVKP 


J Izjj 




A - 


1 1 jjU 






POT iT .OT iMffPT jftM * T .T .1?P T ,P PfYW/3T .WT.Fft 

LACSESRSRPCAPVWRTERDSVSSPQPR 
PfiARGPPQFP 


11254 


25155 


A 


11351 


145 

■ 


406 


PLNSGAKPTLPPWS PRRPELNAPPPPPG 
FKP FF PPT /^PGFTjPTVPOFF AGCIRMT V# G 

kppppg*reffapppggdgnkgappppp 


11255 


25156 

* 


A 


11352 


122 


337 


ILWVLLLVRESLKFFFFLLLLFFLFFET 
DVA* AAVO * S DHS S P OP * P PGTjPGSYTS 

AFRIAGAIGlCHHLY 


1 1256 


25157 


A 


11353 


325 . 


416 


KCVY IMG WA* W LTP VI P AL WEAGAGRS L 
EV 


11257 


25158 


A 


11354 


260 


415 


LEELQNTACQKLEPFLSRTETKQGCLLS 
PRLFNI VLEVIA* AVRQEKEI KGI 


11258 


25159 


A 


11355 


160 


2 


ISEISTKRNNYFLKNWLGWAHACNPST 
* RG * GGQTTRGQDFQTSLANMVKP 


I IZDy 






1 13 JO 




i 


LGGQGGRX P*AQEFKTSIiGNVRRP 


1 1Z0U 


ZD 101 


a 
jt\ 


1-1 757 
1 1 33/ 




*HJO 


nCiV TiP A AT . PP YOWWT .WPTTRT.T AHOMfitfT 

VWRFL* PLKKEGPHDPAIPLIiDI YPKNK 


11261 


25162 

- 


A 


11358 


11 

■ 


393 

• 


QTERTPTPKPHLYVSFPPNSQDPQLQGG 

KIGKKQSRKTGNSKNQSASPPPKECSSS 
PATEQS WTENDFDELREEGFR* SNYTLL 
KEEVRTHGKEVKNLE 


11262 

i 


25163 


A 


11359 


283 


475 


FMVSVS IKVYFFIFI FLFFEMESCSVTQ 
AGVQWHDCLSFLC*VLASSPGHSSPVR 


11263 


25164 


A 


1 1360 


266 


407 


LEIHHSKTL*FWLGIVARACNPSTLGG* 
GRWIT*GQEAEI ILAN1VK 


1 1264 




A 


1 1161 




i 
i 


DVLS ILLPRIiBGNGAISAH*NFRIjPQSS 
NSS 


11265 


95166 




1 1369 


131 


377 
oil 


meem i kgrg lskppalss yltlks s * lk 
RVPJUIPJUjFTjKSRFSLDLQVAGFFVCLF 
VFWDRVSLCHPGWSVWQSRLTAALT 


11266 


25167 


A 


11363 


212 


392 


QAEECAPIiFIGVKEPPKNLTHWLGTVAH 
TSNPSTIiGG*GGQITWGQEFETSTjANMA 
KPCV 


11267 


25168 


A 


11364 


141 


3 


NELWLNHQNVLEWLGTVAYTCNPSTVGR 
WIT* V* EFETSLGNMVK 


11268 


25169 


A 


11365 


215 


384 


ANFNIRPEISLASKCPSERKSRTIjLPLN 
QKIiEMI*LSEEATSKAKTG*KLGLLHQI 


11269 


25170 


A 


11366 


48 


373 


EPWTRKEVTjKISSSLKIHCYLSIIiCIilL 
MPTLECTAAIMAYSNLELPGTSCPPTCA 
S*VAGTTGACHRAPTVLKIFLRDRIFFF 
FFTNVTYP WCHTPADPAI FKANAR 



WO 01/64835 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M , 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
OCysteine, D=Aspartic Acid, 
EpGlutamic Acid, (^Phenylalanine, 
G=Glycine, H B Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methtonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Senne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, ^Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11270 


25171 

• 


A 


11367 


310 


2 

■ 


GLICLNRENAYFFVIRQISMSCLEKBKP 
ARYS S PYAKS VTRGEKRRQRFFLSRAPP 
IPFLPFI * IV*AESRSVAQAGGQWLHFS 
S LQTPP * RRTSDS PAS AS 


11271 


25172 


A 


. 11368 


303 


50 


MFALTNRGVQGS P VI FLGVFL 1 3 KMGFT 
CFKGFSRFKVLVFFLGVFPS FFFFLRAR 
VLLCCPGWSPTLGI K* SSCIjGLPQCVLT 


11272 


25173 


A 


11369 


109 


419. 


QSNASLSHFTS PPLS VADR* LFSTDHKD 
IGALYLLFGA*AAVLCTAISLFILDWLG 
QPCNLLRNDHIYYVIVTAHAFARIFFIG 
IPI I IGGFGN*LVPI»IIGA 


11273 


25174 


A 


11370 


112 


258 


GGRFKGSNFTSAGMQRNIFFMGPPK*NS 
RAGV* QRGEGKNPG VTKLNRL 


11274 


25175 

« 


A 


11371 


823 


1078 


SQHFGMPR WADHLRSGVQDQ PGQHGETL 
SLLKIQKIiAGRGGRCL*SQLLRRLRQKN 
CLSLGGRGCSEPRSHHCTPAWATQ*DSV 
S 


11275 


25176 


A 


11372 


184 


420 


I KQQKNLFLALVKELVQLLWKTVWQFLK 
KLNI ELLCDPAI LLLGI YPRELKMYVHT 
KTCT * MF I VELF I KS KK* KQPTC 


11276 


25177 


A' 


11373 


234 


44 


LAT I TDRLLALS DSAI PPLG I YAKEKKS 
VYQKDNCT YI FVAAL FTVAQ I *NQPKCP 
STRTRGS 


11277 

• 


25178 


A 

■ 


11374 


73 . 


456 


PPIGSPLVPYIPSPALSSPQAPRMGSHP 
CLIPSASCHPPLETDFLSLLPTHFCLAV 
FTKERFSPPPPSPYPRF*KFLRS *KFSF 
FLPPFLFFGGTVLLCLSGWSAVAQSWLT 
AASTSLGSSDPPTSTS 


11278 


25179 


A 


11375 


43 


456 

• 


E FFHHVGQDGLDLLT3 * SAKLGLLKCWD 
YRRE P PRPASDGHY * TDATGSL PS SGTT 
* IRTKPSQAPAS WGLWNLAHHPPRSHPS 
CPMANLICSTLSSFDGGSPGTGPGGWCP 
LGLSGS PARAVFKDSS CSLHPLATG I 


11279 


25180 


A 


11376 


288 


484 


EWVLSAVGGSQYGVCLPFLHCFI FFETE 
SRSVAQDGVQ* CDLGSGSSLPPG 


11280 


25181 


A 


11377 


129 


1 


N FNALNL I RAGAVAHAYNP SHLGGEDGR 
IA*GQEFETSLSNTA 


11281 


25182 


A 


11378 


307 

. 


1 


DSEI PSLGLAKFWNHRREPPCLALFLTL 
NQKLEMIKLSEEGISKAKTR*KAGFLCQ 
RVSQDVNAKEKFLKEI KRATLVKTQMMR 
HQNSFIADAEKVWMAWMK 


11282 


25183 


A 


11379 


200 


397 


C I SLFSHY YKDL PETGS FIKKRGL IGSH 
WLGAVAHACNPS ILGGQGRRIT*GQEFE 
TSLASMVKPC 


11283 


25184 


A 


11380 


m ^ mm 

167 


1 


SNGLYSREARMVQHTH I RPG WAHTCNP 
STLGGHGGRIT*AQELKTSLGNIVRPC 


11284 


25185 


A 


11382 


12 


395 


AHSSFLSLDLSVFATCPHPRAETQTGHR . 
FSTLLPLSALYPK* DYFKKKKKKKKKKK 
KKKKKKKXKKKKSGGGA 


11285 


25186 


A 


11383 


160 


2 


TMEMMLDIK* I+EIFLFEFKMGLKAAET 
TQNINNTFGPGTANEQTAQRWFNK 


11286 


25187 


A 


11384 


274 


389 


GQAQWLTPVIPALWETKAGGSLE*GQAQ 
WLTPVI PALWETKAGGS LELKNSRPSLG 
NMVKPP | 


1 1287 


25188 


A 


11385 


389 


37 


WSYESSWSFIFKSFLFSFMISAYDM*YA 
SQILFTPKLYRTHPYFLLVLFLILSILV 
ILETGSRFVSRLECGGGISAYGSFK*LG 
SGNPPAAAS LSSRI PYVHQS CWS DLAFN 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 

T1CCM 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 

^ - - - — — ' a^ Gam, 

correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PnenyIaIanme, 
C»— Glycine, H=Histidtne, l—lso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=As pa ragi n e, P*=Pro li ne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, yKTyrosine, 
A— Unknown, *=»top cod on, /=possiDie 
nucleotide deletion, Impossible 
nucleotide insertion 














PEFEM 


11288 


r\ kT 1 OA 

25189 




11386 


214 


374 


KULKsKrKGSK^lSPGGGiiKNr I* J.CSAfc'K.* 
NSRAGV* KRREGKNPGVTQLKKGVG 


11289 

i 


25190 


A 


11387 


142 


410 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
* KKKI KKKKRGAPFKKKNFFYRGGGGKY 
FFFGAPKINFLGRVFQPGGGENPGRSPF 
KGAWG 


11290 


25191 


A 


11388 


144 


1 


CLSNEDS I KTPKGLGTVARTCNPSTWGG 
RNGWIT*GQEFKTSLANTVK 


11291 


25192 


A 


11389 


167 


2 


RNISSHLLNMAITKQTNNNNCW*GYRE I 
I THIHC* WEFKMMQLLQKLVPQKVKHT 


11292 


25193 


A 


11390 


2 

- 


412 


S DQVQDAGRL X S SEAS LLGLPTATFLLC 
LPTGFALCTRVPGVFLLLRTSVILREGP 
TLVTS FYLNYLLKDLVS IQSHWGLGLQP 
MNFGETQFSL*QKLV* KEKESMPGMVAH 
ACNPRNLGGQGGLIT*GQEFKTILA 


11293 


25194 


A 


11391 


1 


206 


TGRLAGHRVSSLP I RGGRAEAPLTSRTG 
RLNQDGLNIiLTS *SARLGLPKCWEYRRQ 
PLRLAVI SGFLT 


11294 


25195 


A 


11392 


262 


388 


VS^KKKKKKKKKKKKKKKKKQ 


11295 


25196 

- 


A 


11393 


78 

. . • 


414 


I CFFLRAP P FFF FFFuFF* TKI FFFFPAL 
NSMANFFFFWTPPFLGGIFLPPPFFLEG 
GGPGLASPPGGAFFF 


11296 


25197 


A 


11394 


238 


11 


I K WETAGRS S ACL * S * LFERLRRENPLN 
PGVHG*DEI**HHCTPAWVTVRPRLSKY 
INKNKK*ARLAARCGGSRL 


11297 

■ 


25198 

• 


A 


11395 


340 

• 


1 


KH ILVP PF INVS E I QR YSLV* HLLFVGV 

a . *m.mrmp «k mmm am> Mmm 9mmmm»m» aua» m9m\. mm ^mmm Mm am m m m warn m^m MW*a*m mmm> amm* ^mm ^m*. 

* NLFFLFLS FKKP VLCSM * I I CHE I ISP 
PS KPKCKASHLS FV* KKNSNLWXjGAVAH 
ACNPGTLGARGGR IT* GWE I ETSLTNME 

mm 

K 


11298 


25199 


A 


11396 


363 


245 


QMDLNRHLS KDDK* MANQCMNRCLTSLA 
IRQMQIKTIMI 


11299 


25200 

- 


A 


11397 


292 


2 

■ 


DPKKEAGRGGAQL* FQLLRRFKHETHLN 
PGG*GCSESRSHSS PPAGATK*NGVSKK 
K*Y*APKEERHPRPVIiVEPRVPQVPSPG 
S QTYRQDHS PRV 


11300 


25201 


A 


11398 


121 


321 


KKLYTHKS KSDVM I FLKIETVS WLYCPG 
WRvMAHCDIjKLjIjGS S KPPTSAS * VAETS 
GTCHHARLIFL 


11301 


25202 


A 


11399 


118 


394 


QAuPQP I PVLILIiQFFFFFF* GTGFCFV 
AQIGGQGGHYG*LEPPIRGLKPSSRRTL 
Kvjt(jWLJ *UfailriN/\yii X JjOt r oit ioi> t-r V V 
QAGLKALA 


11302 


25203 




t f j( An 

11400 


273 


379 


YTSLGNM 


11303 


25204 


A 


11401 


360 


38 


KI FGQARWLTP VI PAFWETEVDGS PEVG 

R5RPA* rb 






A 




aiDH 


AAA 


G PAGHD FHM I TPS KS TETGLGAVAHVCN 
PS TLGGRGGW 1 1 * RQDLETS IiA 


11305 


25206 


A 


11403 


161 


3 


I LTPTLQY I KINLRGQAWWFTHVI PALW 
KAK*SGGLLELRSLRPAWVIBGDL 


11306 


25207 




11404 


1 


170 


MGVSLLSPRLECNGVISAHCNCHIjL*FT 
FK* FSCLSLPSSWYRCVPPCLATTFLIY 


11307 


25208 


A 


11405 


249 


3 


NAKSRIQVI S IWVFTVKFFHLYYLCNFS 
Q*NEMAGGQSIFRKKHLDFSLEHSGMMT 
THCSLDLPGSSDPPNSASSVGGTRH 


11308 


25209 


A 


11406 


70 


401 


CGKHGVNL I I HS FLNHLKYLLLLQMANL 
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NO: of 
nucleoli ae 

• 

sequence 
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SEQID 
NO: of 
pepaoe 
sequence 

* 
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M 

eth 

oa 
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NO: in 

09/515,1 . 
26 


Predicted 

beginning 

nucieonae 

location 

correspond 

* Mil Am A ma| 

ing to iirst 
amino acid 

V*A{*{ff t*A nf 

rest one oi 
peptide 

(An IIAIIflA 

sequence 


Predict- 
ed end 
nucie* 
otide 
location 

COrrCSpon 

ding to'" 

loci amlnA 

in si am mo 
acid 

roc ■ Hi to nf 

peptide 


Amino acid sequence (A=Alanroe 

C=Cysteine, D=Aspartic Acid, 

jl — vyiuuimic Acio, r ~ x nenyiaianine, 

G=Glycine, H=Histidine, l«I$oleucine, 

K=Lysine, L=Leucine, M=MetbioDine, 

i^^Asparagine} r^rronne, 

Q=Glutamine, R=Arginine, S=Serine, 

i—i nreonine, v— valine, 

W=Tryptophan, Y=Tyrosine, 

a— ufiKiJuwi], — oiop cooon, /^possioie 

nucleotide deletion, \ppossibIe 

nuucuuuc jitocriiun 










>■ 




FISQFLLTCVSEPLIiF*YCHQIIKVGRL 
VPAHACNPTTVGG * GGWIT* GQEFNT 


1 1309 


25210 




11407 


257 


389 


GGSLLQKYLRPLVAFLFSFFFFSFETES 
* SWRLE CNGMI SAHC 


11310 


25211 


A 


11408 


364 


420 


SQPFGRPRQADCLS SG * AQWLTP WPAL 

VVll/\JSJ\uKJjljJLLiKoX>iK 


11311 


25212 


A 


11409 


312 


413 


TSGMGPGWAHACNPITLGGQGGWIT*^"" 
REFETS 


11312 


25213 


A - 


11410 


283 


425 


HGHKDYVHWLGAVARACGPSILGQRGMW 
IS*GQEFETSIjANMVKPCI» . 


11313 


25214 


A 


11411 


389 


3 


CRIDGLFFNLVFLRLWWKF* VI FFLSY 
FSFLVI ILQFLTYMLNCTFF*QIUILFP 
ILVLLTKI LNTLLS TENYCHFS LLYCLE 
FWFHPIIKHKMYNLFLI "FILIFILRQS 
LARS PRLKYSGT I S I H 


11314 


25215 


A 


11412 


267 


3 


QE FQQEDTSEIiDXiE I RSLCKRKGGLKQG 
DTKDNLS ECRVWETDKGKAAWWPGAVTH 
ACNPSTLGGQGGW I T*GQBFETTLTNRC 
PRV 


11315 


25216 


A 


11413 


3 


281 


GAWSCRIPGEGSLSASQAlQPFSI^PVrt 
PQ* I SLS ARKKKKKKKKKKKKKKKKKKK 
KKKGGGG VLKK I YFPPRGEKKFFFFWGP 
PKKTGGGGF 


11316 


25217 


A 


11414 


275 


2 


KTAHFQLFNSDFKFPGSRNPGQSWSPRA 
KQSFCTQALLKYFLK*MNEMKLNKSRSG 
AVARTCNP STIiGGQGGW IT * GQEFETSL 
ANMAKPR 


11317 


25218 


A 


11415 


319 


1 

• 


RNLGVQVYEYVVLVYKYCSVSSNNKMKN 
KRDLXiVI * VPGI VIHLI YRL IKENPEIiY 
I SLF I YSMRSLKKLT YRPGAVAHACNPN 
TLGGRGGWIVCSQEFETSLGNM 


11318 


25219 


A 


H416 


121 


428 


LSCDKWGNRDMTKI^SKELKiu^ 
KKKKKKKKK3aOCKKLGDEEBDKE*QSSW 

KLGAAVYI ILGEKT ILWL 


11319 


25220 


A 


H417 


79 


253 


KIFNFQFSNMLENNSHTTRINNYFPGRE 
K W Jj 1 F V JL F trlj VY IS AKiwyKo IaH V Ko w Kf o C 
P 


11320 


25221 


A" . 


H418 


180 


4 


E VGFM KPGL CGVFLEAGSHS VALT I VQW 
CDLGSPQHPPLGSRDPPP+ALQIVGNTR 

P1TD 


11321 


25222 


A 


H419 


154 


3 


STLKDKENQKLS IQPDTMAHVCNPSTLG 
uKbunl 1 w vyiSJF n±&US*Cim V JVJr 


11322 


25223 


A 


H420 


436 


478 


AIPLPH*RKKILHAALFFFFLKGSFVLS 
PRLECSTKIWAHCSLPPPNISNPPASSS 
QEGGTT 


11323 


25224 


A 


H42l 


. 143 


398 


AAFGFWFVLIRFLRQRLALS PRLDCFFA 
TT Q PfJPRP P * 17 TOP VPWT J?T . PIT QWTYNHPP 

DLLCAAGLVWVYLS VSRLHINE KHQPPP 
P 


11324 


25225 


A 


H422 


i 


138 


MVFHRVGQDGLDLLTS * SAHLGLPKCWD 
YRHEPPRPAAHSYFLLHK 


11325 


25226 


A 


H423 


2 


390 

* 


LELPILLLNTLILFLFLtlLYIYLVFSPN 
AFRILFITIVLESFSFILLVPSKPWKYS 
LILSNAFVLCMMISSPPFPLFSLSTTRI 
NKKLDFWLGAVAKAYNPSTLGSQGGWII 
* GQEFETS LTNMAKPHL 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 

09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ammo acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- . 
otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
Es-OJutamic Acia, r— rnenyiaianwe, 
G=Glycine, B=Histidine, I-Isofeucine, 
Jv t =ijysine, i^jjcucine, ivi— ivietniomne, 
N-Asparagine, P=Pro!ine, 
v^vyiutamine, k— Arginine, s>— oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknowu, *=Stop codon, /=possible 
nucleotide deletion, yppossiole 
nucleotide insertion 


11326 


25227 




1 1 A~IA 

1 1424 


267 


Ay A 

414 


GGRIT*AQEFETSLGNMVKPC 


11327 


25228 


A 


1 1425 


189 


73 


HQDGLDLLTS * STRLGLPKCWDYRHKPP 
RSARPNAWVAP 


1 1328 


25229 


A 


11426 


318 


33 


KKT I S SHARGRKLGS FKTAAP FF FFF * T 

f-% r*y 0*-t «f> ^ A%/W7T7r.T^I^TmAllAf\T\TSn^1f VATS 

GSCSIiAQAGVKWCSHSSMQPRPPGLK* F 
SASGSAGIIGMSHHTRPIAGFLIiAKLKK 
KSNSTGYGGAHV 


11329 


0% gm 0*.0* 0\. 

25230 


A 


11427 


4* 0K 1 

291 


393 


DRVSLCLPGWSAVLPS *LTAAQI SWAQ v 
, IFLPQP 


11330 


25231 

i 


A 


11428 

i 


51 


399 


KRVARQNPPNGQGFGWCSCS P ILYGQSR 
AQGBNIYKLLGSGEWPGQLI IDVEREQL 
EDWRQGDLGRGMWMTKARVHKGPGLVAG 
TCNPSTLGGRGRW I T* GQE FENS LANMV 
KPCF 


11331 


25232 


A 


11429 


242 


420 


VKKKKRGGRFKGS KFTSACLQG I SFFKG 
APKLNSRAGF* QRWEGKNLGVPQFNRFG 
AHP 


11332 


25233 


A 


11430 


77 


405 


TKRGDSS FCL YR I W 1 1 DSVMKWNQS KAA 
AAAAFFFFLRGTLLFLFRLKGKVGASVI 
* NF C LRG WGG S PAS P£ KEVGME GACHN S 
GLFFVFLEKRRFPLVGQEFKTTRAN 


11333 


25234 


A 


11431 

• 


121 

• 


439 

• 


LPLPTSGPEDPLALSFKPTPFPSTLSGM 
I YTGHPS PEHTS HLM PAPP VNNNKACRF 
FKT*KKKKKKKKKKKKKKKKKKKKGGPL 
KKKK I FAP PGGE I F F F FWG P KK 


11334 


25235 


A 


1 1432 


109 


283 


LTPWNLDSRCEIKGNGKPQENHI^NIJGGG 
GCSEPKSHHCTPAREKRQNSVSKK* INK 
IK 


11335 


0* mm AM V 

25236 


A 


11433 


0%. #v ^ 

296 

4 

• 


1 


FLKKP S KG FS KL PHL PQNLNL FFPPK* K 
CPRAKKDI PPPS F* KI SSLFGEG*GFTG 
SRP PS LNIoENP P KR I FFFFFFLDRVSHC 
KPtaWSAMASSCPKV 


11336 


25237 


A 


11434 


t O £ 

385 


A 1 D 

418 


UbbTJjSQKQK i K.* E * S WLGAVAHACNVS 
TSGGRGGWIT *GWEFENSLTNMVKPCL 


11337 


25238 


A 


11435 


3 


405 


LLHQGRLMLGQWGNGRTKTLLMRENVGK 
FPAE PSCSAlAsvW JjQALiAC iPGLVKRQ* 
ANHWEKKKKKKKKKKKKKKKKKKKKSGG 
G 


1 1338 

• 


25239 


A 


H436 


221 


3 


D C1.R YQWQRPAAVAQAI TEHFGRPRQAD 
HLRSGVSDQPGQHGET 


11339 


25240 


A 


11437 


288 


366 




11340 


25241 


A 


11438 


160 


370 


LWYHYFPVRDGLTI LVT YQLKAI RRLLT 
EFETSLANMEKPHL 


11341 


25242 


A 


1 1 At f\ 

11439 


230 


404 


MQHGQVG Jl x SyGGGWKPVDlU^SQlXjTV 
AHTCNLI TLGGRGGW I T* GQE FKTS LTN 
MV 


1 1342 


25243 


A 


11440 


302 


54 


LSGVWFLTP PP KGGG FPTP * NGGAPGFF 
PPPPFKNPPPEGKLGGPKKKKNFPPPGG 
EKLVFLKGAPPFFFFFFFFFFFFFFIt 


11343 


25244 


A 


11441 


271 


416 


KRTLSTELQYMCCLLIKSKSIFK*KKKK 
KKKKKKKKKKKKKKKKKNPPSLP 


11344 


25245 


A 


11442 


254 


392 


SVTGMTIMTSSCCYKIQKCTKIQKLARH 
GGTCL* SQLLRRLRQENS 


11345 


25246 


A 


11443 


390 


150 


LLKNQKTLGGPARNPPPLGGPNGGVPKA 
GNWGPPGPPGGTPFFFKNQKITPGGGGG 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
Dentide 

mf m kr mM mm «■ 

sequence 


M 

etb 

od 

• 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude** 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A°Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glydne, H=Histidine, I-Isoleocine. 
K^Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, ^Proline, 
Q=GJutamine, R=Arginine, S«Serine f 
T=Threonine, V=Valine, 
W=Tryptophan, Y^yroslne, 
X=Un known, *=Stop codon, /= possible 
nucleotide deletion, V=possible 
nucleotide insertion ! 














, P*FPPLRGGGPGNPLYPGGQGCH | 


11346 


25247 


A 


11444 


122 


1 


PAPLFFFFFFETQSCSVA*AGVQWRNLG 1 
SLQPPPDDGTRL 


11347 


25248 


A 


11445 

t T 


272 


403 


RPKVDKTTKMGRNQSRKAENSKNQSASS \ 
PPKDHNSSPAME*SWM 


11348 


25249 


A 


11446 


70 


397 


LCHCAP AWATERD PVLGEKKRTMMS YCH 
P PMRMVE KKTVGDI KC* *GYGETGFLTH 
C*WECKMVQIL*NIV*QCLKNLNVHSPy 
ELAIVLLGIYPREMKTYIFTKKTSM 

m**mWm B^«Bp W ^BB" ^BB» ^B A ^^b^Bi m\ Mr » mmm ^B *B^ ^B ^™ Btmn V BB # V 


11349 


25250 

* 


A 


11447 


3 


401 


HASAHASAHASAYYRLCAYPKSSFLYPS j 
SFLLWSNFPLNILDTTIGPKATETLFI 
KHVMHDVISMQLKSKTFYNYFFLLSI2TV 
FYFVLSCCKRRKRS OLGMVSH I CNPSTL 

mm mm> mm W nv^v ^b> ^bi v*w ^£«M^b*b • w b> m mm ^mWmt v mmw 4w 9M9w 

GGRGRW IT* GQE F E TSLANMA 


11350 


25251 


A 

. 


11448 


333 


1 


KSGCIPPKGQWLFFLQKTAKIGSPKPV 
S PPRGKGQFFPOGLRGFY*FLPPPPGVP 

%mjt 9m mmt m\ ^*mm< mm-m^mm B> mm mm ^m* mvb>^>^b> m bb 41 mmmw mm mm ^m mm *mw w bs 

EIFFFFF*DSSHSVTLSPRLBYSGMNTA 
HCSLHLPGSSNPASDSHIAGSTGTCHH 


11351 


25252 


A 


11449 


104 


402 


WCRPLLGGVSQLGYMGVMD P FKEAVCL 
FS ELKKCAGRTTAL FRAVRQG CLS LKNF 
LXiPFFQLCPAHRGGVL*RQ*ALLSCGGL 
HPVRASRPLCLPTQA 

• Mr ^ • ^B> mmmmr m* » ^ bm bm » 


11352 

* 


25253 


A 


11450 


198 


2 


KYKFMAGLGGSCL* SHHFARPRQQDGLS 
PGV*NQPGQHGBTPSLQKKVSWAWWHAS 
VEAEAGEQP 


11353 


25254 


A 


11451 


171 


3 


TPPPRATSLFFFF*TESHSVSQECNGTI 
SAHCNLCLPGSSDS PAS ASRVAG ITGA 


11354 

* • 


25255 


A 


11452 


177 


461 


PMTGGKYI SNSTCSKPNSLSPHLPLKTQ 
TKNMEHFGRLTRGDCLS PGV* DQPGQRG 
KTLLLQK I SQALWDVTAHAWWPGLPTR 
ERRRSLSFPTG 


11355 


25256 


A 


11453 


247 


3 


EGECCG I E S R * I LHLKGRI CWHINYLKI 
TKIKN*VILKKMYKQKLRPGAVAHAYNP 

mm. A Vli A\«T W Bfc bbIm w * bb B> ^ * ^ ^m*m ^ Bj V B> B B> a>M BV « 4b 

SALGG*GGRIARAQEIiETRLGNIVR 


11356 

• 


25257 


A 


11454 


328 


57 


E VHSN ICATLS THQM KI LTKGTRHQHXS 
*GFF*EGEVLTLSTKIiECSGVIMAHCNL 
NLGGS SDPLALVSQVAETROAS YLLLAI . 
CPKVNK 


11357 


25258 


A 


11455 


70 


435 


ATRAKLHLKKKKBGRKKEFFPHWQGQPT 
PALVFPQPRTHF I FKVS IFSLKKNREQL 
PYTVQTQSLLPLNSHWGQVA*AGVQWRN 
LRFSCLSLPSSWDY 


11358 


25259 


A 


11456 


112 


401 


KCWDYRRERPCPAPNI C I F*LTFFFLGK 
KLNQSFFFFFFLKGSLVLPPRLKGGGAF 
LFN*NLTLQGKNNS PAS PS *RAETLGlJG 
PHAKLWFVFLKK 


11359 


25260 


A 


11457 


198 


3 


E KTS VKQ S GDSNKKAWLEMKTNKKS LKR 
+GWAHACNPSTLGGRGGWIS *GREFET 
SWTNMEKP 


11360 


25261 


A 


11458 


179 


1 


FFHMYNHIVCEYTITLSVNINSQGPGPR 
AHAYNPSTSEDRWI P *AQEFKTSLANMV 
KPH 


11361 


25262 


& 


11459 


210 


5 


RLIHYFFKNNLPSGGGSTA*GQEFEIRL 
DNTVRPHFLKK* KI SQVWWCMVWSATQ 
E VEVGNGLS PGV 


11362 


25263 


A 


11460 


139 


2 


RILALSKLFFFFFKQESHFFLPMLECSG 
VISAHCNL*LI J GSSDLS 


11363 


25264 




11461 


388 


272 


S * KKKKRGGRFKGSMFLS PAGQGNP FFM 
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SEQH) 
NO: of 

niirfpfiHHf* 

II U tICUUuv 

sequence 

• 


SEQ n> 
NO: of 

sequence 


M 

etfa 

od 


SEQH) 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
corresDond 
ing to first 
amino acid 
residue of 
peptide 

QPflllPflPP 

JbUUCIIliC 

1 - 


Predict- 
ed end 

11 U VI w 

otide 
location 
correspon 
ding to 
last amino 
acid 

rMidiiA ftf 

iciiiiuc ui 

peptide 
sequence 


Amino acid sequence (A B A!anine 
OCysteine, D=Aspartic Add, 

F.=fvliitflmic Acid F=Phpnvlnlnnlnp 

G=Glycine, H=Histidine, I=Isoleucine, 
K = Lvsine. L=Leu cine. M=Methio nine 

N=Asparagine t P^ProIine, 
Q=Glutamine, R^Arginine, S=Serine,~~ 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, V=possible 
nucleotide insertion 








■ 






GPLKSISLAW*QRREGKNLGAPHITRL 
GAHPLFAGWHKTOGAEF 


11364 


25265 


A 


11462 


2 


387 


CVW KSNNKKAP P CVAALRFRWKDDMDEM 
APGKHWGGLTS KVQSGGLMGRYTGPLTK 

PRWEDYLRPGGQGCSBL+SSHCTPT*VT 
EGDPVSKKLKRTI FGG . . 


11365 

* 


25266 


A 


11463 


325 


398 


SLPTWAVCAPRPSG*GSGIPELKTMLAG. 

V XJjC/Ui LiXJ JL J\Vi r VuiijLliinluol 

L FLGKVGP FVHLS VM I AAYLGRVRTTT I 
GEPENRSKGNTRM 


11366 


25267 


A 


11464 


174 


2 


YLE I LFFFETGSRS ITQAG VQWSHPGSIi 

QTPPPCPSNSPASA*VAAHAS7U3AWADA 
w 


11367 


25268 


A 

a 


11465 


105 


411 


TDVELLLMDE * RKWFLELETTPGEDDVN 

TVTfMTTJmWTVQ TNT.VriirM/QTTPlirP TRQ 

YFERGS AVGKVLSHS I VC * GE I FHERKS 
PLMQQISLLTFILFYFSE 


1 IJOO 












IT+GQEFKTS 


11369 


25270 


A 


11467 


64 


236 


THASGLDLLTS * S S^LGLPKCWDYRHEP 

I 


11370 


25271 


A 


11468 


278 

* 


416 


NPQ I S CLKQHVWGAVAH I CNPSTSGG * G 
GRIT*GQELKTILVNMVK 


1 1 j / 1 




/i 




I /o 




p d wwtwss c cd TViDWP vi*2/2T .t .in/in cm? 
RVPQKKKKKKKKKKKKKKKKKS KKKKKK 


1 1372 


25273 


A 


11470 


264 


60 


DINHQSSWLSGRWTPLSWVSISQREDRR 
GFSASIYIYIYIYIYIYIYPPIHIYIHT 
YT * S YHPNONT 


11373 


25274 • 


A 


11471 


95 


399 


DFPHLLLFLPLSQFCTACCPVPENRCFM 
YFFLIFSALCWEMNSRISHSNTARVBQC 
CXVFKI LQMKS VLGQ VQWLTPVI PAL WE 
AEAS*LPEIRSRNOPGO 


11374 


25275 . 


A 


11472 


139 


2 


NHVSTEKLSLHTMAHTCNPSTAGGQGGQ 
IT *GOEFKTSLATMAKP 


11375 


25276 


A 


11473 


179 


3 


RARWNPQ VFC * GFLP VFPKI LP I WPLRK 
GVPFPPNFFFFFLRVSLCHPHWSAVARS 

RL . 


1 137A 


75777 


A 


1 14.74. 


7R7 


17 


KYIRNE* IVLFVIYFCRNRISWCCPGWS 
*TPELKRSSCLCLPSCWDYRHEPLCPAE 
MNKLDRLNKMDTTKWI00KKKNHRTORO 
DNKII 


11377 


25278 


A ! 


11d7S 






VCMOLRTD FOPGAVTAHACNPSTIiRC * GG 
WIT*GREFKTGLNNMEQPC 


11378 


25279 


A 


11476 


61 


397 


PQTPRLKQS YHVSLPSS WDYS WI YHERL 
VNPGRETDWATCYSGGNIQLQENKLNTP 
TDSTLWVSLCCPRLECSSAIPADCSLNL 
PG*SDPPTSTSHVAETTGACHYAQLIFG 


11379 


25280 


A 


11477 


156 


3 


F I FLKSSQNS Y I * KCWPGAVAHACNLDT 
LGG*GGWIT*GHRFKPTRRPRV 


1 1380 


25281 


A 


11478 


224 


383 


CWVC L I HRAFLAPRP VLG CQ * Q KKKKKK 
KNKKKKKKKKKKDSRGGG 


11381 


25282 


A 


11479 


339 


58 


NFKNKLCFCASGYLDSKDSQHYIIFFIR 
WSLALLPRLRWBDCIiNPGGGGCSEPRSY 
HRTPAWVT E * DP VS KK* IK* NKI K* KQQ 
DWLRLHDKIF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M, 
eth 

00 


SEQID 
NO: in 

1 1 ecu 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

mm. ti A Ia 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=AspartIc Add, 
il— oiutamic a cio, r— rnenyiaiantne, 
G=GIycine, H=Histidine, I=lsoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
l— I nreonine. v— valine, 
W=Tryptophan, Y^Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11382 


25283 


A 


11480 

* 


244 


407 


SFGKKIGFLFGPKKFNGGSYGNYYKINR 
Krr r F I US a G r 5> rrtVI If * NIiE TKXTBGG 
LLETTKrTFNLLGRKIjGS FLGPRS LTAA 
PTGTTIKLTGGFFFQFWGKVFPRMYIFW 
KDEWKKGF 


11383 


25284 


A 


U482 


308 


30 


SPQKNPPTYSGFKKNLSASPGQKAKPRF 
FLKNQFJjPGI mgggrq fpllrrvr ppnc 
FNFGGKRCN*QKLGPCPPGGPKBTVSKK 
KKKRERQQ 


11384 


25285 


A 


11483 


226 


407 


■» XT T T m TTf T/ _J_ r-w ,1 v> »r* T fill* ITiTT' n % V ^* S^m 

NHWNKK* FW P KVAHTYNLS ALGGQGGR I 
T*GQEFBTSPSNMRPHAS 


11385 . 

• 


25286 • 


A 


11484 

■ 


10 


411 


QPPIYFLSLRISLFWVFHINGIIQYVAI 
CVWLVSCHNFSRFICVAACDQDPIVFLW 
IiNNIPLaSYNILCIHSlCPUWRTTGLFP 
LLAIIIKL*TFVKKKKKKKKKKKKKKKK 
KR 


11386 


25287 


A 


11485 


184 


1 


TDEELLPMDEQRKWFLEKKPTSGEDAVN 
IFBMTRKDIECYVH*VDKAAAMYERIDS 
NFLSN 


11387 


25288 


A 


11486 


435 


222 


AKNLNRHFSKKVI*MASRHIKRCSISLT 
IRET * I KS I IR * HVI PGQMTF I QKTGDM 
FGSVS PLKSHHLKL 


11388 

■ 


25289 


A 


1 1487 


317 


408 


FRPGTVAHACNPS TLGGQGGW I T*GKE F 
KT 


11389 


25290 


A 


444 a n 

11488 


340 


M A*J%. 

469 


GKGDTKCHLWLGTWHACDPNTLGGQGR 
RIA*GQEFKTSLSNM 


11390 


25291 


A 


11489 


38 


389 


KRPTSLKKSNFCSLQGFFFFGLSQKPGK | 
S NRS PGKKKNP FFSGGKAP * KGF* KI FL 
SPGLGGSPQKPHLLGGLGEKNYLTLEKG 
GCRDPK* CPC F PPWAGEGNP FS KKKKQK 
FPFFG 


11391 


25292 


A 


11490 


235 


2 


F VLMWLMPKLH I P VTEFRRLFAFPE FL * 
FTjFCFVLSYFVI^ETGSHYVTYAECSGA 
ISVHCSLNLPGSSDPPTSVLV 


4 * AAA 

11392 


25293 


A 


11491 


25 


417 


GTLCLRIGFINISHHCMILKENSLFFFF 
WETKFYFAPQADGRGPNLG*WNPPSPG* 
RGSPGPTSRKRGNGGGGPPGPVIFGFIjR 
KNGVPPGGLKGPKSLAPGIGPPGPPKDR 
GLRGGPPAPGLLKKNPKVW 


11393 


25294 


A 


11492 


234 


438 


MWI IAQQCDOTECHY*LKMAKMINFML 

WTWtfT If TIM 1 ffll TTTTy Tt llf Tff?T TTTTT TTTfT l^ft^fc T^IT^ r^T^O T^y ^» 

FIFYHTHTHTHTH L H THl HI KJciRGPS X X 
YGKVS FKLTHL 


11394 


25295 


A 


11493 


t A A 

344 


477 

• 


CFTMFIS* IKT*KFNISQLVAHTCNPSI 
LGGQGGRIT*GQEFKT 


11395 

* 


25296 

* 


A 


11494 


35 


ji n't 

482 


GI PG FHHVGQNGT >T?T iT >TS * S IHLGJjPKC 
WHYRPGPPRPAI S ITPST*AYLLOIQKI 

GVTHFLAI LALFFSGTKAM ISP KYVCI I . 

pvt t xrffUT r , ^t7vc , T & tm*t nn»nT<^iyvtrr 
r YJjL>Vhr.rlijljLr I r J-»AJ.N JJAJL t-VUVxKJj 

FDTYMYGVESHATHEFISYYFFKILCSL 
SIRLYAVLL 


11396 


25297 


A 


11495 


307 


484 


LS PS E RHELKC VLW PGAVAHAYWQSTVG 
GRGRQIT*GPEF 


11397 


25298 


A 


11496 


487 


349 


FLI KTEFHPIGQEGFDFLTL* FARLGLP 
KGWD YKGE PLRLAFWHPF 


11398 


25299 


A 


11497 


224 


417 

* 


ASGASALKVELCSFSRYKIYPRHKRRYA 
RTDGKVF* FLSAKCEGITFFLS FFFFFI* 
EVXiFCSPG 


11399 


25300 


A 


11498 


17 I 411 


krlvfggrgrtdrfisqyv*XeyVs!ni> 
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SEQ ID 
NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 

■ 


M 

eth 

od 

■ 


SEQ ID 
NO: in 

1TCCM 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
iL=\jiutaroic Acid, r— rnenyiaianine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K= Lysine, L=Leucine, M=Methionine, 
N=Asparagine,PHProline, 
vj=uiutamine, K— Argimne, o=aerine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X— Unknown, *=Mop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 




• 










LI F LT KjCKKKKaKKKKKKKRGGALKKKP 
WGGQNKTGEKKKNF F FLRGGKKKP WGD F 

FKGKGGKTPPYFCFLKKMG 


1 1400 


25301 


A 


11499 


117 


370 


VSESADLAWSLGICFFIEFPSDADVAGL 
BTAr r LGGWTG5 CS WQ PG V * WCNHSSL 
QPRPSGPRQSSCLSLPSSWDYKYMPPCL 


11401 


25302 


A 


11500 


244 


1 


EELFECYSCS PTFSSDPLTTPIiLILTT* 
LLPLTIMASQRHLSSEPLSRKKLYLSIL 
ISLQ ISLI ITFTATELI IFYI FFET 


11402 


25303 


A 


11501 

i 


130 


55 


RFVCSTI KVLRDLSSDRSNPGRFLSTSN 
SSLKKKKKKGRPF*RDNSAILF 


11403 


25304 

* 


A 


11502 


186 


421 


SFRLKIAGIIGVHHCTWLWLILNLVLL 
K* ITFALKIMMFSKYSDIGQAQGLMPVI 
PALWEADAGGSLQPRSLSLGNK 


11404 


25305 


A 


11503 


1 


213 


GELLFLYI FA* WLFLLPKLLKFVIVCLC 
ELQFNERFIMVLCI 1 1 ILGVGGFFLFCF 

mm mm mmm, amm 4wmmmm wmniwmn**Mn\*mm\ mmmwm mum**, 

LL* FCFDFFFFFWRG 


11405 


25306 

- 


A 


11504 

* 


82 


6 


AGVQWPRSWLRLP*LFFTQVLMIFFPPF 
YHQNFFFP*KGFFFLGGLSHFFPPPNKG 
FFFNYPQGFFFS PS L.KKKFF FFS S P I FL 
AP PG I FL * GP PR.FFFFF FFFFF * EGVSL 
CHLGWSAVAAI LAP PAG FTPS SCLSLPS 
S 


11406 


25307 


A 


11505 


245 


440 


V mtmj, aam \ mm **** J. m*m* mp mw w m mm\. mmm m mm.m m*m ^ ^amm m m m m-* _■ 

WAPCIG*QIiLKDPQVLFAGYKVPHPLEH 
KI I IRVQTTPDYSPQEAFTNAI TDLISE 

mp mwmmm mm mmm yiy^ f H *} 

LSLLEERFR 


11407 


25308 


A 


11506 


254 


370 


GKLM*GHrtVGQ)U3LKLLTS6bPPALASR " 
SAGITGVNRHAPPRLNVTLYTKGQTI IL 
YHNQYNQ 


11408 


25309 


A 


11507 


1 


149 


GCMWPRTVLFAFQHKFVNFLKTLGDFGQ 
VQWLMRVI PALWEAEAGKLPEARRWRPP 
* PRTVLFAFQHKFVNFLKTLGD fgqvqw 
LMRVI PALWEAEAGKLPE 


11409 


25310 


A 


11508 


316 


420 


YTY I YEPG * VQWLTP VT PALWEAKAGGS 
LEVRSLR 


11410 


25311 


A 


11509 


136 


2 


LDNRLWPGAVAHACNSRTLGDQSGWIT* 
GQE FETSLANMMKPHL 


11411 


25312 


A 


11510 


167 


420 


PTPRSGNTALQPPHHTHTHTHTHTHTHR 
HTHTHTHTHTRACI * KVRVRALFS PKKK 
PLr C JUS VbQI xVGRCii LiKRIiliF I FL YRH 


11412 


25313 


A 


11511 


124 


415 


I YI S VANT I ILI F I AIHVLAI S FFFFKK 
EVLFLPPRREGGGAF*VN*NLCFWG*GN 
FPA* PSLKKGITGALYTPGLFFVFLKKT 
GFRHVGQAWLDFL 


11413 


25314 


A 


11512 


121 


1 


PPVKFWAPFFFFF* IGSPSVAQAGVQWN 
NLSSLQSPPSCL 


11414 


25315 


A 


11513 


245 


13 


VLF I LSGFFGP INYPTS P P PQLPFPAS G 
SLCRPGWSAVARSQLLVPSDS 


11415 


25316 


A 


11514 


434 


2 


RPPFFP VWGSPPPFFFFFGEG I WGQEER 
KGASS P P WSRG I LRGGGPFGGPGAPPT 
PLKKPPLFFFLAPPFLGVFPRFFSFPTV 
FFLPFFVCPKKRBALLGVPKGPRPFFFR 
S FLFFFSGD * VSLCRPG WS AVARSQLTE 
SLAS 


11416 


25317 


A 


11515 


247 


390 


QGLRIYSQDFFFFLETKSRFVAQAGEQG 
GNLG*LEPPPPS*SDFMASP ! 
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SEQID 
NO: of 
nucleotide 
sequence 

. ■ 


SEQID 
NO: of 
peptide 
sequence 


eth 

00 

■ 


SEQ ID 
NO: in 

09/515,1 

■ 


Predicted 
beginning 
nucieonoe 
location 
correspond 
ing to first 
amino aciu 
residue of 
peptide 


Predict- 
ed end 

M||J||a 

nucle- 
otide 
io can on 
correspon 
uing id 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 

IT— #~* 1 • ■ +n m i t* A ni/) 17— DL AH |i| n | n w% tu 

iL^v>iuramic acio, j 4= rnenyiaianine, 
G=Glycine, HMHistidine, I^Isoleudne, 

iv— -Li/Sine, Li— jueucme, ivi = ivieinionine, 
N=Asparagine, P-Proline, 

v— oiuianiine, i\ = Arginine, o^aenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=0Tyrosine, 

A. = un Known, — oiop coaon, /^possiDie 
nucleotide deletion, ^possible 
nucleotide insertion 


11417 


25318 


A 


11516 


269 


368 


FFFERESCSVTRAGVQWHRLSSL* PLPP 

KT7DD T? 

f\r KKr 


11418 


25319 


A 


11517. 


256 


404 


NILXI FYLRI + KKKNQLGWAHACNLKF" 

T /2T'T3 flT'lil T TO nf\ # T? VTO VP VHftTV 
£AA7KV]1jNllKbU T r JXlOlUS2VHV& 


H419 


25320 

< 


A 


11518 


211 


404 


YLCFTRCSVHSHLK+ VFLHKCVI *MYRP 
rl 1 w 1 vSJj UMy V ri I H 1 n. 1 rl l nAnTHTriTJr 
LSQGGDCI 


11420 


25321 


A 


11519 


41 


232 


E S VK PRRRR VPRAQ I TP LHCS LGDRARP 

HHCTAAWATEQDPV* KKKKSLNFKTVRA 
INLKMLGALISFHLFLWGIWF 


11421 


25322 ■ 


A 


11520 


146 


2 


LIDQQGVNQLPRME CSGAI TAHCCLELP 
GSSDPLTSAS*VAGRPPTRP 


11422 


25323 


TV 

A 


11521 


297 


437 


TDHFWNR I LS S CGDT FRDTWLG WPHA Y 
NPSTLGGRARRIT*AQEFE 


1 1423 


25324 


A 


11522 


244 


460 


NCATITTNSRI FLQLQKKHCTKWLSLS I 
SWQSLISF**ICFVLFFETNLAVSIjRLB 
CSGAI LAHCNLCLL SS 


11424 


25325 


A 


11523 


222 


1 


ESKDLDIFRCQLAPRNSSCFVLFCFVFP 
IiEIGS CS VAQAGAQWCNHGS LQPPPP XA 
SAS * VAGTTNVRH YVQ LF 


11425 


25326 


A 


11524 


308 


424 


I TFLTMFKI YLQ FDPAI PLLGI YPKD Y * 
SCCYKDTCTRM 


11426 


25327 


A 


11525 


98 


2 


NIGGRPGAVAHAYNPSTLGGRGGQIT*G 
QEFE 


1 1427 


25328 


A 


11526 


113 


399 


LDRFLTLLPRLRREDCLSPdGQdlCSfaP* 
SHRCVPAWATE * DLLS KKKKKGVLEKPF 
LPPPSVGKPPFSPPERLFFFI LRPPLGG 
VFPPPSKKIIS 


11428 


25329 


A 


11527 


280 


3 


P KYCQ I * VLQCIAQLFKLS I YHT fTHTH 
THSLSLSLSLPPPKVNKWLMSSEFTLTF 
KM XT JrG PGK VTHTCN P ST1A5G * GGW IT * 
GQEFETNL 


1 1429 


25330 

» 


A 


11528 


284 

• 


2 

■ 


FWGPQKKNFSFPPPGLKIGSFKRAPPFF 
r FFFETK5 CS VTQAGVRWCDLKiS JjQPLA 
WATERD* LNK* I KGMSKKLHT * VCMFRT 
KuKIKvjK 1 Ku 


11430 


25331 


A 


11529 


134 


2 


DTLLPRLECNGTITAHCSLKLPGSGDPP 

hand ifto 1 iunu^iui 


11431 


25332 


A 


11530 


104 


1 


GRGFFFFFFFFF* EVESCS VAQAGVQWL 

WT/V3TO 


11432 


25333 


A 


11531 


157 


373 


CVFYTYSTSQLRCYI FIGNTRSVFRCHE | 
IYT*KNRFWASTVAHTCNFSTIiGSRGKW 

T T * f2 * R I7MT Q T .lVWlHTP 


11433 


25334 


A 


11532 


237 


389 


ICKCSKTGFG*DFQEFETSLGNIVRPCF | 
YRKI KAS QV* WHAP I VLATWEAE 




25335 


TV 


11533 


259 


122 


nloro VUKJtll: r Jji* i r * i KFrlb VAUAJb Vy • 
WCNLSSLQSPPPGFKRF 


I I4JJ 




A 

** 




I OH 


1 


S SVVSOALILLFLKJCTCFMPGAVAHACM 

O V V ^'UJXUJJf J-J AVIV X >nh* XT Tl rwAViUUV^I 

P CTLGGRGERIT* GQ KFETSLAS INL 


11436 


25337 


A 


11535 


106 


310 


GGDKGTVWDTLHF FDGQKRLDHNKDHWM 
TIHAELPHELSA*CYAFKKECISGIGSI 
*AEKEYKIQFDG 


11437 


25338 


A 


U536 


167 


378 


ASFFGTTITYKTILVYVYMFEKFIIYIH 
MYTF*RQGLVLSKLECSNVI IGHCNLKH 
LGSSNLPTSASRVA 


11438 


25339 


A 


11537 


101 


396 


VNNLKQRIGLSKRGEKKLLVLISGFGGG i 
FPGAGW I KKGKPPPGFPQGLGP * KRGPP j 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

• 


SEQ ID 
NO: of 
nentide 
sequence 


M 

etb 

od 

r 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

corresDond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

Uvll WW 

1 


Predict- 
ed end 
nude* 

mm mM^mmlmm 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Grycine, H=Histidine, I=Isoleucine, 
K= Lysine. L=Lcucine. M=Methionine. 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=0Tyrosine, 
X = Un known. *=StOD codon. /=D0ssible 
nucleotide deletion, ^possible 
nucleotide insertion 








• 






GKKGNFNKMGFNPKTLVFKNQNFGPGGA 
KPGTPGVRGPLGDL 


11439 


25340 


A 


11538 


18 . 


361 


QIGPRLY*DQNGETPSLLKIQKKKIPGF 
GGGPLKSQLLGGLGWENH*NPGGGGCRD 
PK*PQGLPPWGTNQNFVKKKEGGREKGK 
EGRKEGGGEKKKTLPKGIGALNLGGLLT 
3P 


1 IHHKJ 


95141 


a 


1 1 S1Q 

1 J JJ7 


114 


165 


PGFONKKTGGGGPFRPKTKTNFKVF* KY 
LKICFGFWPKRPPPPGFFILETRSLFFI 
FFLGFFF*ETKSRSVTRLBCSGVILAHC 
NPRLVGSSHYP 


11441 


25342 

» 


A 


11540 


2 


318 


1 TTERFIYRITPGPKETEVNDLPGAPSEP 
VAALRIiEARSFILKL*SFSLVFDFFFLP 
F*KTGS CLSPRLECRGEI ITVHCSRELP 
S*SHPPTSASQSSGSGGFHLG 


11442 


25343 


A 


11541 


294 


409 


GTIS FFRTCRLGMVAHTWNPSTLGGQGG 
*II*G0EF0T 


11443 


25344 


A 


11542 


1 


123 


GKQRQE PAIiS YDHATVLLTS * S AHLGLP 
KCWDYRS DPPCLA 


11444 


25345 


A 


11543 


69 


350 


VGAKHWIHMDTKIEXADPGNYYSGERGK 
RPRAEKLPVCYYAL^LGNGI ICNPKPPC 
CWKGHDFI YIYVCGHI YIHI YI CTHI YI 
YFF*DGGLLC 


11445 


25346 


A 


11544 


103 


340 

• 


LFCAFFLSPSKLDFYHLSFYV*TLFLKD 
LISLAFFFLFWEAGSCSVTQAGVQGPNS 
THCNLCLPGSNNPS TSASQAAGT 


11446 


25347 


A 


11545 


256 


354 


PYPFGKPKRGDHWGIjGV*NQPGQQGETP 
SLLKI 


11447 


25348 


A 


11546 


196 


335 


EIVARLKRSPVGRAQWLMPVIPALWKAK 
ESGS PEVRSS * VKKI TCRPGTVAHACNP 
STLEGQGKWIT*GQEFKTSLANMVK 


11448 


25349 


A 


11547 


202 


377 


TIHTKGVMVGNFILWFYQNFFKTFF*D 
KIIjGWARWPMPLI PAFWESEMGGLLEPK 
SL 


11449 


25350 


A 


11548 


39(5 

• 


26 

- 


PPRIKKEKRSPNGAFYPFPCWNF*KKKN 
LSLAKRFPRDSVFFFVSW*GSYLPTTFL 
QLLRWYVLTSHIAVS FKHLCE PHKNFTXi 
QHSIPYLYI IE IEVYS YFINRNTI YKIN 
MDFS I VGFVFL 


1 1450 


25351 


A 


11549 


232 


371 


REVPFENIKIGQVQWLTLI I PALWEVKA 
GGSLEARS L * KYQNWPGAVAHAYNPSTL 
GGQGGR1A* GQKFE IS PANMAKP 


11451 


25352 


A 


11550 


174 


381 


NRDEGFRYADRAEVQRLLTGTILEHCGL 
QRPGL * RSS CFTLPCGWDYRH* S PSHVA 
GTTGISHHTRLIKKAFFSQTGVSSC 


11459 




A 

mm. 


11551 






D YSNLGLVW WLTTVI PALWEAKAGRSSE 
VRSS * PALWEAKAGRS S EVRS S 


1 1451 


1 51 54 


A 

mTm. 


1 1 S5? 


7ft 




LHKCSFIYINHCNNPIYGLISLYPKOLF 
S YENFKDFCNKI * P + 1 PNFELKTSTP* L 
E KL*NWAMARHGGS RL * SQHFGRPRRAD 
HMRSG V PG Q PGQRGE S PST 


11454 


25355 


A 


11553 


319 


56 


TFFLGPPGFF*KIFFSVGPPLIFFLPKK 
FIGEQKKNAPGKNFFFGFLFSLFFFFLL 
TLSSAIiECSGAMSAHDNLCLPGSSNSPA 
SAF | 


11455 


25356 


A 


11554 


230 


399 


MMGEIINQANIISVCQIIRRLRHESHLI 
PGGRGYSELRSYHCTSTWVTE*DSCLKK 


11456 


25357 


A 


11555 


376 


396 


F*IFWRDGVSPCCPGWSQILRLQQSTCL 
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SEQ ID 
NO: of 

mm mm 

sequence 

. ■ 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 
eth 

00 

• 


SEQ ID 
NO: in 

09/515,1 . 
zo 

r 


Predicted 

beginning 

nucieotiue 

location 

correspooa 

ing to first 

amino acid 

resin ue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 

1 AAA 4i A mm, 

location 
correspon 
ding to 
last ammo 
acid 

resiaue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
jb=o(utamic Acta, r —rnenyiaianine, 
G=Glyclne, H=HIstidine, l=Isoleucine, 
jv— i_»ysine, 1j— Leucine, ivi—ivietnionine, - 
N=Asparagine, P*=Proline, 
Q=Glutamine, R»Arginine, S»Serine, 
i=J nreomne, v™vaiine, 
W^ryptophan, Y=Tyrosine, 

mF lull MAitfM Vp^C^am AM^GJfcoa Apma AJUtllkaA 

a— unknown, — otop coo on, /«possiDie 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GL P KCWDYRP PH PATAAS CP YFYHYYLL 
KIWRYSFDLSCLWNLVVWGLAFG 


11457 


25358 

* 


A . 


11556 


176 


384 


KAWPNVRNNYR*NFFFQCCLNNFRKRIjK 
CFRCGADKFGKTGFSCLQYALKPTNWGR 
EQWLTVNPTNPGK 


11458 


25359 


ft 

A 


11557 


295 


391 


P YKLL I THTHTHTHTHS RTHT I * HTTL S 
LYTP 


11459 


25360 

* 


A 


11558 


1210 


1753 


NT W CME RLAVAFG S FNRNS PLRGWALWL 
TP I I LALWEAEAGGPLEPQEFETCLGNM 
VKH CLYKNLKNVSGV* DLPGQHGETLSL 
QKPKKCVKKGGTCL*SQRIiRRIjSWEDCL 
SQGGQGCSEP*SHQCVPAWVTELDPDSK 
KTKTKKF FGLPS PFLFVCPLTHI YVNKE 
YAFVLAEEASGKTTSKLTMVTSRNGLGK 
TKKFFGL 


I 1460 


25361 


m\ 

A 


11559 


172 


3 


KOKFFFPPPSHQIKIFGGGAPBMPFFFF 
FFEKE SRFVA* AGVQ WRNLS LtiHPP PHP 


11461 


25362 


A 


11560 


143 


1 . 


GQAQWFMPVIPVLWEAKAG*SLEARSSR 
PVCATATPPQL I PK* YEGL 


1 1462 


25363 


A 


11561 


84 


A 

2 


KPLDTLGGQGGC I T * GQE FETSLANMV 


11463 


25364 

♦ 


A 


11562 


12 


387 


QGILLPCFSMSSEERGRIYSNTFSFFFF 
FGKGGQINPQGGGQATEPRLRKPSPSGL 
TXQGPGNGGPPPPPGQNFFFKKKGGYPG 
GAGGVLNPGPKGNTPPGPPKARKKRGGP 
PGPGQKFL*NPKG - 


11464 


25365 


A 


11563 


230 


375 


RAKGKLLSES LLS IQS FKKP S VHTCNPS 
TLGG* GRWIT*GLEFTTSLA 


1 1465 


25366 


A 


11564 


287 


55 


PPDGRVQWLTPWPGLWEAKEGGStiKSR 
SS * SQ I NPVGSVP YS LRVLR 


11466 


25367 


A 


11565 


115 


411 


FFFFFFFLGENSFFSFRGGPRGGGHI FW 
NFPPPG*GVFPPPPPPGGGNKGGAPPPG 
FFFFFP*KTGFPPLGPGGPPPPLFGPPP 
FPPPKGYNFRRDPPP 


11467 


25368 


A 


11566 


139 


356 


CLLI GW YNTGC I TQGSTRKIGTTFFLRQ 
SSS VAQAGVKCSGM I LCYRNLRLPGS SN 
LPDSAS * VAGTTATYP 


11468 


25369 


A 


11567 


101 


410 


FLFFFFFFFLKKI FFFPPRGGGGGGN* K 
KQTPPIjGGKKNFPP P PP KRGGGGTPPPP 
PLLFLVF * GEGGFPGGAGGGKKTPRGNP 
PPPP P KGGEMGRGPP PRGG 


11469 


25370 


A 


11568 


417 


56 


EDCLRTGVRDLLPNPHPNLQIAGRDGAR 
L*SQLQEHHLSPGFRGYSEL*SCHCSPA 
V^TE*AYRKQNKTKQNYRSSALLHLYTP 
MAT AW I KIAFLRAGDI FLS VLS FLPNS Y 
PLFSHPSQ 


i i Am 

1 1470 


25371 


A 


11569 


301 


A A1 

443 


£ul 1 F V Cr ovna CS VAQATDS PASAS * I 
AG 1 1 GVCHHARLI FVFLVE 


11471 


25372 


A 


11570 


44 

• 


408 


RGRGHGPHSPWRRRRRLRWEGRKATBS 

KIPTGL**LK*PSFIFIFIYLFWKQGIiS 
IiAPMLE CRGAVLAH * HLRLPRS SLSHAS 
ASQVAVTTG 


11472 


25373 


A 


11571 


201 


1 


AERPVTGPPVYAEPGLQGRQG*KNMFKN 
MQHPRWADHLRSGVQDQPGQYGETPVST 
KNTKITQAQCI 


11473 


25374 


A 


11572 


343 


1 


FGLIt FAFTLEYKLYE S INHFAHCCVCRF 
SSGTLHWVCNRHPNKLDEWNSGGGGRQ 
DLTLTPRIiGCSGVI MAHCNL VLIjGIjSN P 
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SEQID 
NO: of 

nnf IpaH Hp 

II ULI CUIIUC 

sequence 


SEQID 
NO: of 

npnfirl p 
Pv|Jt.lUc 

sequence 

• 


M 
eth 

uu 


SEQID 
NO: id 

09/515,1 
26 

• 


Predicted 
beginning 

OUvlcOUUC 

location 

m/i ■ ouvuu 

ing to first 
amino acid 

rpcirttip nf 
■ cdiu uc ui 

peptide 

i 


Predict- 
ed end 

nuur* 

otide 

tnr^tinn 
•Vwiuvu 

correspon 
ding to ■ — 

latzt n m inn 
11131 aiumv 

acid 

rpntriiip fif 

peptide 

cpnnpncp 

3CU Uvllvv 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 

IT — ; (~1 Into rrl I f» Ajvirl E^-Pk rly> Innina 

Ej vFiuiuiii ii. /tciiij J 1- r iicnjfiMiaiiiiiCy 

G^GIycine, H^Histidine, I^Isoleucine, 

K—l-VlInP I^=T pitrinp IVfclVTpfh inn inn 

*V MJj9tl»*&f V^LiCUUUC^ Itm— ITXC III lUlllUC^ 

N=Asparagine, P=Proline, 
Q=Glutamine, R»Arginine, S=»Serine, 
T=Thrpnninp VsVnllnp 

A J Ul CUIIIIIC, r r allUC) 

W=Tryptophan, Y=Tyrosine, 
nucleotide deletion, \=possible 

niirlpfitiHf 4 in^priinn 

HUlflwvllUv ■lldCrilvll 














VSAS * VAGTIGAHSHAWIDKLFFVKTGP 
LY 






i A 


1 1 






VC3P PPlC55TiPRni?PU' PPPPTf CFPR T.KK'K'Tr 
vur c riujurour rrrc \<riv\.r xr jvJ_ii\.i\ivEv 

KKPGLVAYTCNLNTSG\7RGKKIT*AQEF 
KKSPSNIVKPWY 


11475 


25376 


A 


11574 


312 . 


430 


QRYARPGPWHCYNSSTVGGRSRGIV*A 
OEFATSLCNMT 


11476 


25377 


A 

• 


11575 


125 


3 


NSLGPVVHACNPR£LGG*GGRITSAQBF 

PT<?T/31T T R RPT.V 

Ci X O XJV9ZvXIVlVErX4 X 


11477 


25378 


A 


11576 


190 


362 

■ 


CVNOTID*EKKKKKKKREREKKKKKKKK 

IfTTIOfTinf +flflC!IlPTrTfTrTrif P^PftfiflTifKn717P 

L 


11478 


25379 

* 


A 


11577 


231 


340 . 


KRI TRVNQDGLDLLTS * SAPLGL P ECWD 

X Xl*Ullr I 


11479 


25380 


A 


11578 


<!0 


3 


RMQVSV* I FTPMSLQAYLI YSSSVAAGA 
QSGIEECKYQFAWDRWNCPERALQ 


11480 


25381 


A 


11579 


285 


406 


KKGNW PGAMAHTCNP S TLRGRGGWIA * G 


11481 


25382 


A 


11580 


121 


1 


NLNCPGPVAHASNPNTIGG * CRP IV*GQ 

ttcX IK IAIN iVRP 


11482 


25383 


A 


11581 


199 


384 


KKTPPGGIFFCPKEAGPAPPL*TPFC*T 
PLFFSPAPFKPKRALKFFLKPKRGFFQI 
PFFFF*DGVSLCHPGWSTVARSCPRV 


11483 


25384 

- 


A 


11582 


677 


934 


YGSRHCICFLQAISEILFLKNPARHGGS 
LSSQNFGRLRQEGH1jSPGG*GCSEL*PR 

UP 

rlL. 


11484 


25385 


A 


11583 


104 


2 


KKIGGGPPLFFFFETESCSVAPOiE*SGA 


11485 


25386 


A 


11584 


349 

* 


9 


GGRSl^PRKTRVQWGIPEFPPLHLGGKA 
KPCFPKKKKKKVKGLNKHYSKEDIQMVN 
R YMKKCSTSLPRERKI KTIMRYHFPSIR 
iwnvT TPTTTirnpvnri*nPTT?wvi qwoomv 

L 


11486 


25387 


A 


11585 


163 


1 


TAVR I KHNS YTL T P CLRHSR YL I NVTCW 

tr\3JT vjt\ru\\~i!4 rg J.xAj\jrtVji\Vt J. x \Syc»F 


1 14*7 


ZJJOO 




I l^ftA 

I I DoO 


1Q7 




<3 ICS K YH T YGD <i AT .Off ^ TWTCW T TRPTCTf fS 

* DDVKDKAHSTRLSMS ICEEKE 


11488 


25389 


A 


11587 


30 


374 

* 


GWSPEDLFPGSLPPALKGFSGEGG* PFL 
P FOKKRGGGGGS S PPF PVLKRVRPPWRI^ 

YPGGGGFR* PKQGPCPSAWGAEPDSLSK 
KKKGFGQKKKKKKVLKEKT 


11 /IRQ 




A 




DO 




S L.CFFVFF VFF * TE 5? CS VTOVG V* W YNT. 
GSLQPPPPKKKKNIKKQKKKINWGRGFK 
RRL 


11490 


25391 


A 


11589 


243 


402 


FLHLKKKXKJ<KKKKKKKKKKKN*KDAG 


11491 


25392 


A 


11590 


205 


3 


I RLLG 1 1 KN PTPAHCQ * K* KTGE PLW* V 
VW* FFRRLHTELSYDPAIPLLCINSTEV 
WLAPGIPPRFR 

t •§ 4 , it X X- VJ & X. A\ X. 4\ 


11492 


25393 


A 


11591 


404 


152 


MGFHHVSQDGLDLLTS * STLLSLPKCWD 
* CWDYTREPLCPGHFAI SFNKVVIxNTTT 
WLYFYLIILHCIYCPYKLLFLFPFYIY 


11493 


25394 


A 


11592 


311 

« 


386 


T VYPPIxAGNLAHAGA* AGI VGTALS ILI 
RAELGQPGALIjGDDQ I YNVIVTAHAFVI 
I FFIAI P 1 1 IGGFGN*LVPLI IGAPDIA 
FPRIlWISF*lxLPPSFxxlxlxLASSIVEAG 
AGTG*TVYPPLAGNIiA}iAGASVDLTIFF 
LH 
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nucleotide 
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i 
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SEQID 
NO: of 
peptide 
sequence 


M t 
eth 
od 


SEQID 
NO: in 

irccv 

09/515,1 
26 


Predicted ■ 
beginning 
nucleotide • 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic And, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
. K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R»Arginine, S=Serine, 
T^Threomne, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
n ucleotide deletion, V=possible 
nucleotide insertion 


11494 

• 


25395 


A 


11593 

■ 


39 


397 


EGCPLFFFVFFFFFFFGGKKI PFFPPRG 
GGGAPHKIPGPPPPGGNFFFPPPPPLRG 
GKRGPPPPPGFFFFF* KKGGVSLFPRGG 
VFFFSFWRGGRGGFPPPPPQKKI FLKGP 
PPFYKKK 


11495 


25396 


A 


11594 


243 

■ 


46 

■ 


GILSCCFHDSLSLNSL*CD*VWISLCLS 
YLVS IELLGQCGLI FFIKRBS FWLLLLK 
MYLQPLSFLS 


11496 


25397 

> 


A 


11595 


327 


2 


PRAILRIQFLLWPMRPYVMWSQPCSLCF 
CHTAKTVPTSRPLLLGFFSLPGIAHISM 
WLVLSYGLGFSSKVTLWPGVVAHVCNPS 
TLGG * GGW I T * G WE FKTSIANMVK 


11497 


25398 

• 


& 


11596 

• 


292 


384 


LGTVAHTCNPSTLGGQDGWIT+ IQBFBT 
SLR . 


11498 


25399 


A 


11598 


125 


2 


PFQGQRSGGTRGPGAVAHTYNPSTLEG* 
GGRIT* GQEFETS 


11499 


25400 


A 


11599 


302 


408 


TGGWLG WAHACNPS X LCa^fedRQ IT*G0 
ELETRLA 


11500 


25401 


A 


11600 


409 


131 


GRSGGVPKSGVLKPPGPPGGTPFFFKKP 
KITRGGGQPPLFPLLQRVRPENCLSLGS 
GGCH* TKLPP CPSTWGAQ* KLVS KKKKC 
EGGGRIYS 


11501 


25402 


A 


11601 


196 


330 


HHCMLAWVTGRPYLNIYICIYADiyiHI 
YICI YI YTHTRVYI YLLAI * I YIYVYMR 
IYIYIYIYVYIYTHTHGYIYIY 


11502 

■ 


25403 


A 


11602 


41 


252 


TPGRARGQRLGHSSTGLDLLTS * SACLS 
LPKCWDYRREPPRPAEDIKFMF*AILSB 
T I LVRGVRS RAT P S 


11503 


25404 


A 


11603 


271 


440 


KKKKRGGRFKE S KFTSAGLQGN I FFIGP 
PKLI PRAAV*QRGDWKNIjGVTQLNRIiCH 


11504 

■ 


25405 


A 

<• 


11604 


429 


2 


FSSFFAEKGCHCVSQAGFKLLSSRDIiPT 
S APHIiG IRLLGRLRQEDPPS PGV* GCS E 
LQSRLCTPAWITKRDPVS*KFKKQKTKT 
ETH * RKPTKGQDHQ YHCLPP PHVMPLKV 
FRGSNVHGAVRTRGSRGSIPELSKAVPH 
AP 


11505 


25406 


A 


11605 


113 


2 


TEATHLVCTYKN* KLGWAWWIMPVI PAL 
WEAEVGRLL 


11506 


25407 


A 

• 

* 


11606 


301 


3 


WVMVC S VKN IALGHACG KFH YEGRGKPA 
EHPS WVLLVLES LQKKLFLS YFLFTFLL 
QPPSPRLT* PVFRFFFFRDRIMLCCPGW 
S A WP ERVGRAGNS E 


11507 


25408 


A 


11607 


117 


431 


SCDFCLSFSLSLFFFFFRKGIFFLGPRG 
ENKGGNKIKWTLGPGGKGNSRPSPPKGG 
GKRGGPPPGANIjDF * KKTGFPQGGRGGE 
KPPPPGGRAGGAPQRGGKPAG 


11508 


25409 


A 


11608 


277 


88 


RRTTSS WLARNC I QGPAGYHTGS * TPDL 
KGSACLSFPKCWDYRWRLDIi 




2541 U 




1 i j£An 

UoOy 


200 


27 


VKXjKPGDSRDPDMQW XYI * ENRPARCGG 
SLL*S *HFGRKRKADHLRLGVQDTRSSA 

R 


11510 

■ 


25411 


A 


11610 


17 


414 


TVWLVMQDPENEWIVSGWGGKEEPGRYF 
MCEARSSSMDRVFFFF*KKTLFFPPGGG 
GGAKI WVNG IPPPQGS PLS VAYPLGVWE 
KKGPP IHPHKKRVPPV* PGGVLFSEPGN 
PPPGPPKGVGKRSSPPSPVL 


11511 


25412 


A 


11611 


296 


428 


RIYENWRGTVAHACNPSTLGGQGGWIT* 
GQSLKTSTGTVAKP 
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Amino acid sequence (A= A la nine . 
OCysteine, D=Aspartlc Acid, 

V' — rf"* Into rn IP A piri F— PK anvlalanina 
JC/~V?iU LHlilll. ntlU T F *- * UcnjlalUninG, 

G=GIycine, H=Histidine, I=IsoIeudne, 
K=Lysine, L= Leu cine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, " 

I — J ijrtuiJlIICj v — valine, 

W=Tryptophan, Y=Tyrosine, 
A-unKuuvvii, — oiop cuuon, /^possiDie 
nucleotide deletion, \=possible 
nucleotide insertion 


11512 


25413 


A 


11612 


20 


362 


. NF VRG VARGS RGGATRCB WVGLRPVGMA 
YGRSPSWLL*VKPSAASHRPPLPRAADT 

Df3T21 T> 21 DTDTDH "D 11 21 21 ©2VT.VDQ CADflQ 31 

ALTLEEELQEAIRRAQRWGLTMLPRLAS 
NS 


1 1 ki i 


*}KA1A 

Z3414 


a 

A 


I lot J 


1 JO 


>> 

L 


r jvrr .fl dtp r /v^t . * vrr .t, wt.tz t.pp r.T .& on r. 
LSNGSSLKDIiKWTHSNYRASKBTK 


lid i 
11514 

■ 


25415 

■ 


7\ 

A 

■ 


11614 


3 


410 


DCtMfV^T tJT T1»MVAt7KTDTrDUTlJVaHDWr 

KoiSV iVaijVijl 1 WiUVNKVFWlnKiiARiJT 
RVSRWARPRSKISESCYWLPQAHRTKS* 

Dr.\rDTT.CRT.PCT Ty2T7l4DTT.l?CT.T.JlT r nT.V 
trXj V, ft X XwlJUvo XjXA? r flc'JjAjg j 1 il iH.l UXJ V 

PLPS FALLLLLELFFLPGMPKLAP * SPT 
KHS PS F YFPRDRVIjLCCPGWS OTP 


11515 


25416 

* 


A 


11615 


189 

• 


397 


YKVLFLVEKNGFFFLFFBTKPCCVAPTG ! 
VQGPNFG* LQAPPPGLPLFSALAAPBVG 
ITNGTPPPPRFFF 


11516 


25417 


A ' 


11616 


158 


1 


KGKPTPKTFFCQKQIFFLVFFF*TESHS | 


11517 


25418 


A 


11617 


319 . 


425 


KVWLYSQAV* KVWL YSQAVAHTYNPNTL 
GGQGWR I A* AYEFKTSL 


11518 


25419 


A 


11618 


94 


2 


CLT I S WLG TVAHVCNPS TLGG * GGH I TR 
SR 


11519 


25420 


A 


11619 


381 


9 


FFPLPLPLPTVSLFPRSPSBAEPKLDCT 
AAISAHCNLPA* FSCLSLPSACNCRRAP 
PRLTAS AS RGAG I ADGVS FTQ C3 MVPRL 
b CoO V 1 bAnxNFin ItFAi b JjUJj f 
LCPDEIVDPRI 


11520 


25421 


A 


1 1 Z"OZ\ 

11620 


1 A A 

144 

• 


431 


WDRWVAEDHVLRE PS BNRRLARVRCLTP 
VSQH FGRRRRLDHE VRS LRP ACATWRNS 

VQTVNT VQnUZl C* 
VOX X M X 


11521 


25422 


A 


11621 


174 


449 


VFSLPIiLGGASQl^YlaEVRDPIjEEAVCP 
FSDLKPHAGRTTTIjFKAIRQGHLSLQRF 

HPVRASRP 


1 YjjLL 


ZD4Zj 


TV 




_ 

j id 


Alb 


KTSL 


11523 


25424 


A 


11623 


183 


2 

« 


PKQVYLRNGLLSHS KCNKIHHINRLKRE 
KIQ PG WAHAYNANTLGGQ GRW I T *GQE 
FENE 


1 J3Z4 


25425 




11624. 


IKK 


165 


SPGG*GYSEPRSCHCSPAWAAEPDPVSK 


11525 


25426 


A 


11625 


154 


3 


T AD VL F LGQ AGRGN FVVMKKNCRPGTVA 
HACNPSTLGGRGG* ITRGLEFK 


11526 

• 


25427 


A 


11626 


412 


3 


LGRGGNFPPPQKPFFPPLF*IFPRPP*C 

VdQT DD1?TiT?DT.r , TI , OTmrK T r'DQT7T?T?T.T3WftQ 
AljOJlrrr XJX" r*XjV-r ^AAAl tr&r J?J? i-tFW \jO 

PPICEKI FLFPKKGFLKFKKPRFLKKKG 
PFFFFFGNAVLLCOPGWSAVA0S0 


11527 


25428 

■ 


A 


11627 


290 


423 


VFEYFFKFFKKNNSWPGWAHTYNPSSL 
GGQGGWIT*SQVLRPP 


11528 


25429 


A 


11628 

« 


213 


3 


LSGIYSGNRRLVRYSKSVSVIHHINRLK 
KKNHIIISIDGEKPIiEQIQNSFMT*KRN 
KLKWEDHLRPADA 


11529 


25430 


A 


11629 


152 


2 


FQKNRDLEPSKAGWPGMVAHACNPSTLG 
ARGGQIT* GQEFETSMVNMVKP 


11530 


25431 


A 


11630 


422 


189 . 


PLSGFCPSTWGAKQRALSKKKKRSEAGH ; 
SGSCL* SQYIjGRLKQKDHPS PGGRGFS E 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartfc Acid, 
E=Glutamic Acid. F^Phenvlalanine. 
G=Glvcine, H—Histtdine. I^lsoleucine. 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
O^Giutamine. R=Areinine« S=Serine. 
1^=Threonine, V=Valine f 
W=Tryptophan, Y=Tyrosine, 
X=Un known. *=Stoo codon, A=DOssible 

an W Trill • W ww ww u J * m* ww ■ ■ w 

nucleotide deletion, ^possible 
nucleotide insertion 














LLNCT P AWVAKQD PVPKKRLK 


11531 


25432 


A 


11631 


239 


424 


KEDNNRVFVIGPLGGLNEIiAHDKAFKTQ 
NHHWLGVGAHACNPSTLGGQGRWTT* GQ 
EPTTSL 


11532 


25433 


A 


11632 


343 


401 


AREREHKSAACQ * GMERS V I KHCHVATE 
*EREQKRERQQERASEKERQQESERDSK 
RERARESERBRESTRAQLVRKIVNQIS 


1 IJJJ 




A 






J ll 


G CHEKMTAKI I HS CKLS PDAL* SQLGVP 
L* THTHTHTHTHTHTHTFS EKKGLVP 


11534 


25435 


A 


11634 

i 


235 


433 


AALTSFSGPFSFCSYLINTLGKNFPNHE 
ISTT*FTLQRL*KIIFWPGMVAHACNPS 


11535 


25436 


A 


11635 


93 


385 


FTLKS ERRKHS VPQGKKQNKI KLLS PDC 

KLENKRFGWARWLTPVI PAL * EAKAGES 

PEGNGEREEATE P FGS I ILGF F FCTQ IA 
ttt xto ctyyb'pmcti c 


11536 


25437 


A 


11636 


190 


398 


NTKKSQKAFFKTQGKPPPLDPKKKASKK 
TKFFLPKKISTRKFRTGPWRGTNFSLKK 

FFF FLF F * KDKVPLCJHRGWSAWQSQLT 
ATSISQMY 


11537 


25438 


A 


11637 


190 


380 


TPKKWPKAFFKTKGKPSPFHPRKKALRK 

FFF * KDKVPLCHRGWSAWQSQLTATS I 
SQMY 


11538 


25439 


A 


1 1638 . 


261 


91 


EKTIiWAGHG^TCL*SQLLGRIJlWENHLN 
K 


11539 






1 1037 


i 
l 




FPJVNPTVKITAH<?TH<TnNPOYi'iVF.t?TTP , P 

R I YES KYWKE E C FGLTAELWDKAMELR 
FVGGVYGGNI KPTPFLCLTLKMIiOIOPE 

KDIIVEFIKNEDFK*VQCSLANIRGMY 




25441 


A 


11640 


193 


457 


DGILLCGPGWSAVAPSQIiCSGTISAYCN 
LCLPGSS ESPVSAP * YTjGLQACAHAPLM 
FVFLVYTGTCMLPLSAHHSRAANSSHFS 
YPLL 


11541 


25442 


A 


11641 


257 


373 


NT I S CFRSLRP VAHTCNPS TLGG * DGQ I 


11542 


25443 


A 


11642 


211 


367 


DTTSHPFEWL* FKKSRK*RVIiDRIWGN* 
MLVHCWWEHKMVQl^*lCrVWWLFK 


11543 




a 
fx 


1 10*W 


1 iz 


OOi 


EOGATGTL I HCWV7E FKI I PVI PALWEAE 
VGGLLEVRSSRPAWPTRENPIFTKIIKN 
Y*KYKOTPYIIR*LQIKTIRYHHILTRM 
AKIONTDTNKOTOECGATCTLTHCWWEP 
KIIPVIPAl^EAE^GGlxlxEVRSSRPAWP 
TRENPI FTKI IKNY 


11544 


25445 


A 


11644 

1 1 \J W W 


310 


335 
j j j 


YHLLAGS1jGDIERLSML*IGKAVCHNKY 
* *GCGEPGFIxLHCWWEWKMVQSL*KIVC 
QILQT 


11545 


25446 


A 


11645 


143 

• 


455 


KRKGPSLWLLGKSMGGIRFF* KKVFFSG 
FKSLWFFFWPLWKPLERLN* IKISFYPP 
AKKKGGQRKAQFFFFFFPRSFSLAGCSG 
S I SAHCNL CLLGLSDS PTS 


11546 


25447 


A 


11646 


182 


458 


SRGGIRFF*KRVFFSVIKSIWFFFWPLL 
KPLGRVN* IKISFNPPAKKRGWERKAQF 
FFEFLPRSFSLAGCSGSISAHCl^CLLG 
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ing to first 
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peptide 
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correspon 
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last amino 
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peptide 

sequence 


N=Asparagine, ^Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T*=Threonine. V=Valine. 
W=Tryptophan, Y=TyrosIne, 
X=TJn known. *=StoD codon. /=nossible 
nucleotide deletion, ^possible 
nucleotide insertion 














LSDSPTS 






A 


1 ID**/ 






RS TKYS PRGWE PSMWFTjFIiHVFS LPFRS 
ILGPPLLGTPLRGSPPPPPPFBGKGFIY 
GLTFPFP * EPPCEI FFFFFLDRVLLYCP 
GWSAWQSAHCS LELKDKVCI 


11548 


25449 

• 


A 


11648 


165 


2 


DQSGRPRKILCAHTHTHAHMHTSLGQT* 
QDPV*THTHTHTHTHTPLYAAAQRIN 


11549 


25450 


A 


11649 


290 


3 


SKQNTTHWATLKSYIYPSQFWRliGSPSS 
RWQLDG FLVRAL FLVCRQP P S LR I LTWW 
EfCTEERKREREK* DREREPNR I ERNERE 

RERERETQRMY 


11550 


25451 


A 


11650 


220 


2 


TPTLP KGTGLTALDRI PGRDPGPRRAGR 

P F* AGVAAENCPDPAEGGSTRPLGAPGR 
fjcapp *PWP/'3PAfV5PP 


11551 


25452 


A 


11651 


29 


387 


FLIFDVTIVIDLACHEPHSYKMAHLINV 
VCVLTAPPTSYFPISFLGPPYSLRHNDI 
E I RPINNPTRTSQC$SERKSHTSLTIjNQ 
KLEMTTLS KDGMS KAKTG*E PGL* C*TV 


11552 


25453 

♦ 


A 


11652 


267 


2 


KFFFPVFSTSKLNFYPGTPFPLFLFSFF 

EEEEE P17MP Rfi E2V VTWVPfiAVA Pfi & VT . & 
r p c r c c rFLKCAjfAV i. n v rvanvAr unvun 

HCSLELVGSGKPP I SAS *GAGTTGMHYH 

TRMY 


11553 


25454 


A 


11653 


50 


411 


KAVAGGMEKSMMNIiP KGPDTLCFD KDE F 

L YYKL LKTAMVEL XNKDYADFVNLSTDIi 
VGMEQALNQLSVPL* QLREEVLSL * S FV 
IERISPSS 


11554 


25455 


A 


11654 


303 


403 


VRNMVEWLGVAAHACNPSTLGGQGEQIT 
*GQEF 


i i tee 

11555 


25456 




1 1655 




t>5J 


SQVAWQSKGDTPSSCCIINNSNGNRTIV 
LHDT*GPRASPCYNPPISWLNQFPHSPP 
WASQLPPSVASVYQK 


11556 


25457 


A 


11656 


442 


2 


GRPILMRPAVL*SEPRTPACVDIQQQIM 
TI IDE WKACAKVQTLSAP INSASRMQS 

YKKLFVLDDREAHNEVEPLCILDFYIHE 
SVQPJIGHGRELFQYMLQKERVEPHQIjAI 
DRPSLV 


1 1 55 / 

■ 


x545o 


» 


1 100/ 


1 AO 




E * VL * KKVRHSQLQKENVAQNAP WQNA 
AYIDQPSPAHVGQQGLSKLPSRPGDQGV 
E PONLRTLOGRSVIRSATNTTLPHMI^IS 
QRVIAPNPAQLQGQRGPPKHGICIRTTTP 
NMNPANV i 


11558 


25459 


A 


11658 


342 

- 


3 


ENQLKNGEHSKSQSASS PPNDHNTSLAR 1 
A* KWAGAEAEIALLTEAGFGRWVIMNFA 
ELKEHVVTQC3CEAKVHDKTTQELIAGFE 
RNITYLM * LKITTREPHIAITS INSRMA 


11559 


25460 


A 


11659 


204 


17 


STCTPLPTA*SQSRPHPPPTPTPPAPWR 
SSNGPLSRGSSSRNSTSSWSIAXiATGPW 
THRSEL 


11560 


25461 


A 


11660 


505 


514 


GS*GNHFNPDGASHGGPQDSDRTGPRGP 
CLPSPWGAQILRDWYPQEEMGTLGKQ 


11561 


25462 


A 


11661 


80 


374 


FLFDIYNRCYLIFINII*YLSSNFD*IF 
I I I YLVAI YRLLCLGLFRAPRQCKHPRP 
QFSraKrGTKR*AWPGAMAHTOTPNTtiG 
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S£QU> 
NO; of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

r\A 
00 


SEQ ID 

NO: in 

TTCCKI 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue ot 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- . 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 

OCysteine, D=Aspartic Acid, 

k— Olutamlc Acid, v— Phenylalanine, 

/""* Pliminn Tl Till liilinn 1 Tnn l/»n»;«» 

\r~ oiycine, n— JHistiaine, i— isoieucine, 
K=Lysine, L=Leucine, M=Methionine, j 
N-Asparagine, P=Proline, 
Q=G)utamine, R=Arginine, S=Serine, 
1— inreonine, v=vaunc, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \==possibie 
nucleotide insertion 














GLGAQIM*GQEFET 


11562 


25463 


A 


11662 


142 


437 


GQRVQI FFFFWKRFPFFAQGGTEGAQFR 
L»IKPSPSGLKEIPPPNPPRKGE*RDGPP 
FRGD F V FLRKNG VP PCGQGGS KPPAPGE 
PPPFTPPRGENNRG 


11563 


25464 


A 


11663 


120 


353 


QALQVMLKHVNFFFFLFFLFWKQ3FVPL 
PRLE CNGATS VHCNLCLPG * RNFPAS AQ 
EFRIAGSYDHAICPPASGKGE 


11564 

■ 


25465 


A 


11664 

i 


1 


404 


NTWSDL 1 1 SY J?HPRPS AFFLRQGLALS P 
RLEYRGTITGHYSLDLWGSSDS PASI SR 
VAGTTYRCM*FQLLGRLRRENHLSPRGR 
GCSDP *LCPYTPAWVTERDPVSKKKLRG 
GGENS LIiSSQTMY 


11565 


25466 


A 


11665 


493 


3 


NIWIPDVLPGHIiARQ*LGSPGKLSAGF* 
PLQFPQTLSSARAPPFCLPRLQVRNTVG 
FLPSPHTSRPSHCLPACPLPSRTQPWVP 
VKPGPTACRGFLQHPPTPGSPSP*SFQG 
S SWWDTDLRALECLLRI LRGRNCSVFVT 
TQGQLSHEGMNKEFLLTCAKKKK 


11566 


25467 


A 


11666 


3 


401 


CMERAVTVLLPGSATQSPPVYAPRAIiAR 
LWLTAAMMISGF IADJ5TEA* SS RCSSACP 
AGDSLSYYHSPADSFSIMGSPVGAQDFC 
ADLGVSRANF I PTDTDMSTS PDLQWLVQ 
PALVYS VGPSET I APHPLGVP 


11567 


25468 

■ 


A 


11667 


1 


446 


LSCAKPPQRPLRHGI IKIFNVFEETRAN 
KHLLGVSKLKPRGFPALFLSVSSFGQKK 
HVPQWLQHSHSHILVISLDFFFFFFFEK 
KFPFFPPGGRERDQTPEN*TPPRGG*RE 
S PPFPSRGGGTQGWAPHCP 1 1 FRFFQKK 
GVFPLRAR 


H568 


25469 


A 


11668 


18 


403 


DPLQRPQYRGGLRSVGCDLQSPTPRGPS 
GRWPNPAQARGPGEAQAWAWPGGGPREA 

ta>_ jta ~*h* ^tatab tab tab Hb ^ _* U a tab* *Mh «k ta-» A *_ tab tab A 1 _ |M 

FSGQGRPPPLGIiHPHRRKGAG PPGPMGP 
* HEGQGS C* GAS CRGGIACTAFPAASS P 
PIFGGLTPPNLNFPEE 


11569 


25470 


A 


11669 


157 


1 


PRPQGFVFVFFFFEMQFGRLECSGAISA 
H* NLHFLGS SNS PAS ASHVYAPLY 


11570 


25471 


A 


■* 4 /MA 

11670 


396 


7 


EKVGQARDKSACEGHS CADARS CLHLRV 
DQGGQHGSHGACPGRRLRGPDTHRQSCY 
QCRCNRSPDRLQPGRAGTP*CAEAAAEP 
DAAVxirGuPAPAKLtPPALFGRGIiL PIjPH 
PGGSQRVYC VRC IGS KLR 




25472 




11671 


209 


3 


DDLfc 1 HKPGKSUVK VTPKEAGENPSAVIi 
PCYIjHLVPFV* EDEVKPEDS I PDMPCaiE 
YAREFLAHAPLY 


11572 


25473 


A 


11672 


270 


101 


LSLNRWILGAYI I FFETESHS VAQAGVR 
*HNLGLLQSLPPGSRRPIAFKVAFLFIG 


11573 


25474 


A 


11673 


421 


592 


SSCTPGVI I laPCVRSNHMKLPGPGMVA 
A 


11574 


25475 


A 


11674 


176 


6 


LTFVYMPCFCALSSLIMTYGLAM* FHFH 
SMTLFI IGVLSVAIiAI YQ*WLDVSRVSV 


11575 


25476 


A 


11675 


1 


418 


NTVCVCVCVCVCVCVERKAGHRLTDSFP 
I QAG PGAQARTLCLRES TGGALS SGL PA 
SRYQDPGRGALHPHCVWTPRGWNWRLNY 
P PARS PSREAVCYFRQTLKKWRS PRGQG 
GRILRVI RTASTERDTS *TESACIX3ARP 
CLDGK*VCESVPCFSLHTHTHTHTHTHT 
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SEQID 
NO: of 
nucleotide 
sequence 

* 


SEQID 
NO: of 
peptide 
sequence 


M, 
etb 
od 

• 


SEQID 
NO: In 

T TO CXI 

aq/cic i 

uy/MS,i 
26 

- 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino add 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
oude 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cystefne, D=Aspartic Acid, 
ti— Glutamic Acid, r —Phenylalanine, 
I*— Glycine, ti— xiistidine, l B l50ieuctne 9 
K=0Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P»Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W'sTryptophan, Y==Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V 3 possible 
nucleotide insertion 














■ HCI 


11576 


25477 


A 


11676 


305 


698 


SFMGRSPRKIDQFCNSRNMVHGSVTFRD 
YAIDFSQEEWECLQPDQRTLYRDVMLSN 
YSHLISU3SSISKPDVITLLEQBKBPWV 
V Vfa KK I bRWTx PGK* B* SKQGBAI XVPDS 
PCAQRGVTPLRCGLENFLQ 


11577 


. 25478 


A 


11677 


107 


397 


GGDGRETLRAPADDGCI *QERAETPLAI 
SCPRSSSTCRGRARRYDGCRTRDGHQLI 
QRGQQQESSRRTAQPGSSKLCRGPGGHR 
PSLGTVGVHGCSL 


11578 


25479 

i 

! 


A 


11678 


514 


554 


RAKPTPP*GGGEGQNFGLLKPPVSGI*PP . 
LACPTPPKRWKPKPPPPAPPHPLPP* *K 
RGLI F* PGEVFNLRNWKLP PS PPQRGGN 
KGRNQP PPRGTFMF 


11579 


25480 

• 


A 

■ 


11679 

• 


266 


3 


GPPLFIjGLKN*TFFSPTKV* IFFSQVFS 
SFPQGLKGGVREI FFFFFF* DRLECNGT 
ISAHCNLRLPGSSDSQASAGTTGVCHYT 
RMY 


11580 


25481 


A 


11680 


1 


408 


NTCAIiRHRDHFS FWSDGKTKEGEERI VQ 
ENWLISWNNFSVPETLLTLSCPLGKEGM 
PGE DGTAGAGKVPGED KI PG * DGTAGED 
GTEDENGTTGEDETAGVGKTPGAGGTTG 
EDDTES EDGTTGEDETAGKGGTAG 


11581 


25482 


A 


11681 


244 


420 


KDSESTKAYIRDNISSTKKKKSWPGAGA 
HPCNPSTLEGKGG* IT*SQEFETSLANM 
VKL 


11582 


25483 


A 


11682 


294 


443 


QMNTCTHIQPAYVH* EKYFRPSAMAHAL 
NPSTLGGRGGWIT* SQEFQTRP 


11583 

■ 


25484 

• 


A 

■ 


11683 


346 

• 


1 

- 


YNTNQFTLRGTQASVYTCLTALIVLKLL 
NQPYTLSAVLLNAGNQPFHAFPSPPSLH 
PDAFHSKIMSHIIILLLGFIPFLPASQS 
LTLS PSLERSGTI S AQCNLCLPGSSNS P 
ISA 


11584 


25485 


A 


11684 


140 

• 


423 


SSHQASSPPTTASHSMKISVAAIPFFLL 
ITITLGTTCTESSSRGPYHPSECCFTYTT 
YKI PAQRIMDYYETNSQCFKPGI VFITK 


11585 


25486 


A 


11685 


163 


3 


IKIPCNRLLKFGPPSCCGFFFPLLPPFF 


11586 


25487 


A 


11686 


179 


1 


PIVFLGMLFVAFAFRGPFESVTKPPNFF 
RST 


11587 


25488 


A 


11687 


192 


1 


EQGFDGHLASWISLKDTAKWKNGDELCS 
YYQS C CTD YTAE CKP Q VTRGDVFTMPBD 
EYTVYDDG 


1 1 coo 

11588 


25489 


A 


t 1 /DO 

11688 


on 

89 


3 


SGGYCCCLCCRCSRYSCCCCRLREGPTK 


11589 


25490 


A ' 


11689 


168 


2 


QDRLQPHESKRKLGPAPQLRRSESDTPS . 
WNFQSTESQLMSKGDEDTKDDSKETV 


1 1590 


25491 


TV 
A 


11690 


301 


397 


tio^rireT ir*r/^i\>TKTT5T envTMtTnnovT inMin 
HrVFSJjVl \arliN tf.Li5 P X IjNVDi'R I iiVQVK 

LRFY j 


11591 


25492 


A 


11691 


18 


421 


TKALQITC YLHS TMSEESDGKLIEDSLI 
QLRCHFTWKLLIEAPEIPDLENRIWEEI 
Q FLDTKYNVG I HNLLAYVKHLKGQNEEA 
LVS LKKAEDL I QKEHANQAD I RSLETWG 
NFAWVYYHMGRLAEAQTYLDKV 


11592 


25493 


A 


11692 


188 


3 


PLQGKGFGAKERGQKKGFLRGGVRASRP 
RPPFGPSLFFFFFFETDSHSVAQDGVQW 
CDLGS 
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SEQID 
NO: of 

nUCIcOIIQc 

sequence 

• 


SEQID 
NO: of 

peptide 
sequence 

- 


M 
eth 

OU 


SEQID 
NO: In 

09/515,1 
26 

* 


Predicted 

beginning 

nuaeoDae 

location 

correspond 

ing to first 

amino acid 

residue oi 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, D=Aspartic Acid, 
n^oiuiamic aciu, r — ruenyiaianine, 
(XJIycine, H=Histidine, Iplsoleucine, 
K=Lysine, L^Leucine, M-Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine,'S=Serine, 
i— threonine, v=vaiine, 
W=Tryptophan, Y«Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


11593 


25494 


A 


11693 


24 


391 


APRADAMGHFTEEDMAT I TSL WGQGNVE 
DAGGETLGRLLDVY P WTQRFFDS FGNLS 
S AS AIMGNP KVKAHGKKGLTSLGDA IQH 
LDDLKGTFSQLSELHCDKLHVDPENPKP 
LGNGLQTALQ 


11594 


25495 


A 


11694 


138 


425 


NSGVKAAFQLNPQNPNLEGKPGPGFKTH 
PGQLGETHFLLKIHYLSGLVGKPWESQL 
LGNLNRENHWTPBRGGPRDLRHCHGIQP 
WATNGNFVL KKK 


11595 

* 


25496 

♦ 


A 


11695 


1 58 


838 


CGTVHSCDAGQRQRATAPSHPCDHGNQQ 
PILYRVLCCQLFWILCFVFSHSIMSTKC 
P SVS P VSG EMKKRKAI TLEMKLKI IAQH 
EGGKPVMAIAREI/3LWQSTISTILRDKK 
Q I SDAAKS S AS VKS TV I TKKRAGP I DDM 
EKLIiVMWMEDQIQKRIPLSLLMIQAKAR 
SLFNMLKDRAS D PTYTQMFKASHGWFQR 
F KRRHNFHNVK I TGEAARAGNEGAI AFK 
EQL 


11596 


25497 

* 


A 


11696 


834 


1431 

• 


S S ACQGGSQGWP PRQP WWMWGWSRRAPS 
VTHRTGERCSTSGSAPLPLLLSSLCSSL 
PRSWEHSTLRQPPHPPLLPPGCLPPGRE 
AQRLSGAGHGPGPR PAAAS YI PDTQADR 
KQQQHHPG P DQHVGHREGLALEDAATHL 
GLVAALSCWLLGRAEAGYQVPHGHDHPK 
DQHPQADGGQRIVRAIGLGLGHHVSGRR 
ARP 


11597 


25498 


A 


11697 


143 


1 


IV ww ww ww «w w& wa WW ww. yte «► WW WWW WW WW WW, WWW _A ^w «te «lw w tote 

IS KERGASRFSGPWVFFFFFLESRS VAQ 
AGVQWCNLGSLQAPPPGFT 


1 1 (AO 

1 1 598 


25499 


A 


11698 


mm t+ 

58 


459 


KGKEEKVKRKEAEQNFSP YAQDKQERI K 
GNENDEKTKQGKETIIDIELFKGLDETG 
ENMDS TLTRT P FE PLENNKQ I LVLGLDG 
AGKTSVLHSLASNRVQHSVAPTLGFHAV 
CINTEYSHMEFLEIGGSKPFRS 


* 1 1 coo 


25500 


A 


ll 699 


215 


1 


GTKKALiWAGGGGF FP YLP PMGPLG P I CG 
GVGKGS PGLGCP I FFFFFFFETESCSVA 


11600 


25501 


A 


H700 


300 


420 


KPKILFGNVFAAPHMENLKCRGETVAKE 
ISEAMKVKAMC 


11601 

* 


25502 


A 


H70l 


351 


466 


R I KNADLSQAQ WFTPVI PALWEAKVGRS 
LEVRSSRPVW 


11602 


25503 


A 


H702 


172 


400 


SNRLRNKIQGVFLNDSSISPFILRKQSI 
GQAWWLTPV I PALWKAEASGSPEMDSTT 
LLPS S S QVPSLVKMEKLNYS 


11603 


25504 


A 


H703 


284 


408 


ASVFSSFFFVCLFVFEMBSRSFAQAGVL 
WRDLGSLQAPPPG 


11604. 


25505 


A 


11704 


20 


447 


LPGADYGGGHLSLRLFHLLIASAAWVPD 
ESQVTLNSAICVLSTVLIMEFPDLGKHC 
SEKTCKQIiDFLPVKCDACKQDFCKDHFP. 

KKGQ I PD WVGDH IDRDCDSHPGKKKEK 
I FA 


11605 


25506 


A 


H705 


1 

» 


455 


HSCSLQFTPTAWDCTGSVSSEQGERPAA 
AMKI CS LTLLS FLLLAAQVLLVEGKKKV 
KNGLHSKVRSEQKDTLGNTQIKQKSRPG 
NKGKFVTKDQANCRWAATEQEEG I SLKD 
ECTQLDHEFSCVFAGNPTSCPKAQDERV 
YWKQVARKLRS 


11606 | 25507 


A 


H706 | I 


428 | 


DAEADKMAAAAVRGGRSGGSGGCSGAGG 
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SEQID 
NO: of 

tiitrl^otiffp 

sequence 


SEQID 
NO: of 

sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 
beginning 
nucleotide 

MM U VIWUHW 

location 
correspond 
ing to first 
amino acid 
residue of 

peptide 
sequence 


Predict- 
ed end 
nude- 

otide 
location 
correspon 
ding to '* 
last Amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
E^Glutamic Acid. FHPhenvlalanine. 
G=Ghycine, H=Histidine, I^Isoleucine, 
. K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
<£=Glutamine, R=Arginine, S=Serinc, 
T=Threonine. V=Valine. 

M M. Ill Wl/lllIIVf T ▼ ullHV, 

W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


- 


- 


• 


• 




• 


ASNCX3TGSGRSGLLDKWKIDDKPVKIDK 
WDGSAVKNS LDDS AKKVLLE KYKYVENF 
GLIDGRLTICTISCFFAIVALIWDYMHP 
FPESKPVLALCVISYFVMMGILTIYTSY 
RE 


11607 


25508 


A 


11707 


1 


422 


RSQARS SAAAAARAS VPLRGS PGPSAIM 
PMFIVNTNVPRAS VPDGFLSBLTQQLAQ 
ATGKPPQYIAEHWPDQLMAFGGSSEPS 

ERLRISPDRVYINYYDMNAANVGWNNST 


1 1 £J\0 

l louo 


ZjjvJy 

i 

i 




I 1 *7AO 

I I /uy 


oo 




AADAM KO APRAMG AFMRIAYOBLb I DPT. 
KEDKMLLNLEGNKREHABRLGMGLVSRS 
S V8HS VLS EMLVI EHETPVSAKS SRSQL 
DLFDDVGTFCSGPSKYTDNPFSLWESIG 


11609 

■ 


25510 


A 


11710 


393 

- 


130 

• 

• 


NPSATAQS FHLAARKLSTLDRPGSQAPQ 
S KAHPAX SLLSTEEG I FKAVPSPASSCS 


11610 


25511 


A 


11711 


331 


440 


KIFFLNFlilSRAWWCAPLVLATWEABVG 
ESIiYPRRS 


11611 


25512 


A 


11712 


122 


478 


SGLCPQQPPRANSCPPSSMASCAEPSEP 
SAPLPAGVPPLEDFEVLDGVEDAQGEEE 
EDEEEEEEDDLSELPPIiEDMGQPPAEEA 
EOPGALARE FLAAME PE PGPS PVPKEWL 
DILGNGL 


11612 


25513 


A 


11713 


102 


2 


TNIiGNPRRPPPFFFFETVSLIiAQAGVQW 
CDLGS 


11613 


25514 


A 


11714 


126 


2 


FFAPFLKI FFFFFFSEMESCSVLQAGVQ 


11614 


25515 


A 


11715 


178 


449 


LSGGNLFGI IILLYCAI IGGKLLGLIKL 
PTLP PTiPSLIiGMIiIjAGI LI TN I P VINDN 
VQIMHRWSYSl^SIAIAIILVRAaLVLE 
SKALEK 


11615 


25516 


A 


11716 


48 


417 

. 


GSGGNHSVGCDTMEGGGGSGDKTTGVLA 
G FFGAGE AG YSHADLAGVPLTGMNPLS P 
YLNVDPRYLVODTDE FI LPTGANKTWGR 
FELAFFTIGGCCMTGLAAFGAMNGLRLGL 
KETQNMAWSKP 


11616 


25517 


A 


11717 


103 


2 


PKSPTQWLMAVIPALWEAKAGGSRESRS 
SRPAL 


1 101 / 






1 171 a 

1 1 / I o 


i i 


4.1 1 


WPSGOVIiVG CLSFSLYGWKRS LKRKNPG 
FEVLLKI FLKNHPRCHTNRM I QLTATPV 
SALADE PAH I RATGL I P FQMVS FQASLE 
DENGDMF YSQAHYRANE FAEADLNHAAS 
LGGDYLGDIiALRLLCTLEPVFPQSY 


K lOlO 




A 


1 1720 


1 16 
i to 


467 


AGMLPAVGSVDEEEDPAEEDGPELVPIE 
TTQSEEEE KSGLGAKI PVTI I TGYLGAG 
KTTLLNY I LTEQHS KRVAVI LNESGEGS 
ALEKSLAVSQGGELYEEWLELRNGCLCC 
SVK 


11619 


25520 


A 


11721 


167 


407 


EIYSLTRFIEVKMSKXISGGTVVEMLGD 
EMTRIIRETLKEKLIFPYVESHLHSYDL 
GIENRDATNDQATKDALEAFNKPY 


11620 


25521 


A 


11722 


254 


423 


NQLSS IMAMFKKI KS FE WFNDPEKVYG 
S GE KVAG RV I VEVCEVTR VKAVR I LAOG 


11621 


25522 


A 


11723 


3 


424 


VSCDTMEGGGGSGDKTTGGLAGFFGAGG 
AGYSHADLAGVPLTGMNPLCPYLNVDPR 
YLEQDTDEF I LPTGANKTRGKFELALFT 
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SEQID 
NO: of 

nucicuuuc 
sequence 

• 


SEQ ID 
NO: of 

peptide 

sequence 

• 


M 
etb 

Cut 

■ 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 
nuctcoiiae 
location 

ing to first 
* amino acid 
residue of 
peptide 

• 

AA/I ■ « AM AA 

sequence 


Predict- 
ed end 

hucjg- 
otide 

titration 

correspon 
ding to 
last amino 
acid 

i*ap!rliiD AT 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartlc Acid, 
til — oiumirjic aciu, r— * nenyiaisoine, 
G=Glytine, H=Histidine, I=Isoleucine, 

T^=T.v«jinp T — T x> 1 1 r 1 n M= iVTofh innino 
xv xjjrsiiic, jLr~Jucuuue, in — lrlciniuillllc, 

N=Asparagine, P^ProJine, 
Q=Glutamine ) R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
a— u nKnown, — otop coaon, /^possioie 
nucleotide deletion, ^possible 
nucleotide insertion i 














IGRCCMTGAAFGAMNGLRVGLKETQNMA ! 
WSKPRNVPILDMVTRQGALWANTLGALA 


11622 


25523 


TV 

A 


1 1 724 


2 


343 


AFG IMK.W VTJjVSVIiFIjFSSAi SRGVFRR 
DAHKSEVAHRFKDLGEENPKALVMIAFA 
QDLQQCPPEDDAFSTSEVPEFATTCQDD 
DHFEDRCRRVHTLVVVQLCT P ATLL ETD 
Y 


11623 


******* 
25524 




11725 

i 


1 


359 


HAFGTMKWVTVI SLVFIiFNSAYSRGVFR 
RDAHKSEVAHRFKDIiGEENFKALVLIAF 
- AHYLHQ C P FEDHVKLVND VTE FAKTCDA 
DESAENCDKSLHTL FGDKLCTVATLQET 
YGEMADC 


11624 


25525 


A 


11726 


1 


349 v 


GAMSDRKFS APRHGS LGFLPRKRS IRHR 
GKAKS FPKDDPS KPGHLTGFLGYKAGMT 
HIVREVYRPGSKANTNEVAEAVTIVETP 
PMEVADIAGYMETPRGLRTFNTAFAEHM 
SDEC 


11625 


25526 


A 


11727 


81 


349 


TKGSWS CCVCLCVCLCT WSHLCRLVTW 
LPDMPDDVLWLQWVTSQVFTRVLMCLLP 
ASRSQMPVS SQQAS PCTPEQDWPCWTPC 
SPEGC 


11626 


25527 


A 


11728 


264 


388 


QADPKD I MKFPGPLENQRLS FLLEKAI T 
REAQMWKvNvRKM 


11627 


25528 


A 


11729 


2 


471 


PGCSASWSKRGSGPDMLSSMAAAGSVKA 
ALQVAEVLEAIVS CCVGPEGRQVLCTKP 
TGEVLLSRNGGRLLEALHLBHP IARM I V 
DCVS SHLKKTGDGAKT FI I FLCHLLRGL 
HAITDREKDPIiMCENIQTHGRHWKNCSR 
WKF I SQALLTFQTQ I L 


11628 


25529 


A 


11730 


160 


377 


LQGFGRPSVYHAAI VI FLE FFAWGLLTT 
PMLTVSIAELGLCFVRERDKFLGTYHCV 

9 


11629 


25530 


A 


11731 


45 


438 


KLGQRKEATVTKE S CKSCSRKVGS PDRF 
RS PQKRSKGRQDCFTLFCFWKVLDKNME 
LISPTVIIILGCLAIiFLLLQRKNLRRPP 

KVCGRGRRGLQRRQCFLF 


1 IQOU 




A 
n 


1 1 TV} 






PAS AS RAAGGAHL 


11631 


25532 


A 


11733 


473 


600 


KFWEKRTEGRGKS KS KNSETGE I VS ISA 
LSTTEVAMHTSTSS 


11632 

>. 


25533 


A 


11734 


19 


349 


APSPDAMGHFTEEDKATITSLWGTVNVE 

SASAIMGNPKVKAHGTRVLTSLGDAIKH 
LDDLKGTFAQLSELHCDKLLVDPENF 


11633 


25534 


A 


11735 


234 


359 


F YHLKSFTVSS VQSRWLTPVI PALWEAE 
AGGSPBVKSSRPAS 


1 1 A 

. 11634 


25535 


A 


1 1 TO £ i 

11736 


15 


372 


KLPLKAIjTGBEKTHINIDI ighvhs vks 
tttghli yksrg i derti r i fekeaaem 

GKGSFKYACILDKLKAERERGITIDISM 
RKFETS KYYVTI I DAP GHRD F I KDMTTG 
"PSHADGA 


11635 


25536 


A 


11737 


1 


357 


IWKAAMASPAGSWARPPRPMREPQTLAM 
PTNAAEDQKLKLERLMKSPDLAVTIPEK 
MSEWSPGPPPEFDRDVMGSRAGAASGEF 
HVYRLLRRRE YQRQD YMDAMAE KRILDA 
EFQRRLE 


11636 


25537 


A 


11738 


24 


335 


APNANAMGHYTEEDKATI TS P WGKGNVB 
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SEQ ID 
NO: of 

nuCieOtluc 

i 

sequence 

- * 

• 


SEQ D) 
NO: of 

pep uue 

sequence 


M, 
eth 

uu 

• 


SEQ ID 
NO: in 

09/515,1 
26 

• 


Predicted 
beginning 
n u cieoituc 
location 
correspond 
ing io iirsi 
amino acid 
residue of 
pepuue 
sequence 


Predict- 
ed end 
nucie- 
otide 
location 

corrcspon 

ding to 
last amino 

nCIU 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alaoine 
OCy stein e, D=Aspartic Acid, 

Cr^vri u Lain ic i\ciu t r~- j iicn jiaiunint^ 

G=Glycine, H=Histidine, I=Isoleudne, 
K^Lysine, L= Leucine, M=Methionine, 

li^Aj) para gin C, X * » unirCj 

Q=GIutamine, R=Arginine, S=Serine, 
T=Tnreonine, V^aline, 

w— a rypiopnan, i—i jtusihc, 

X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














• IxnvylMVXjJUJ&JulJlJv xxrini 1XUCT CUKV \ja UO 

SDSAIMGNPTDKAHGQKVLTSLGDATKH 
LDD LKGTFAQLS ELHCDKLH 


l loo f 


0553D 

ZJJJO 


n 


I 1TJQ 

I I f07 


141 


775 


MOPLPCIPII.KSLEKSVASHSOTVHSDI 
ISTVE FNHTGELLSTGDKGGRWI FQRE 
QBSKNQVHR 


J 1638 


**> con 

i 25539 


7V 


11740 


1 


337 


DAGGETLGRLIiAVYPWTQRFFDSFGNLT 

<?H «5 & T MfSWP TTVNA WfTT KVT.T <3 LRtl A T KH 

Ot/iSAl FIlJlMIvIS, V Xm AIlO J. AVJJIO UuUnl JVXl 

LDDLKGTFAQLTELHCDKLHVDPENF 




* 


/i 


1 1 *1A t 

11741 


loZ 




OPIICT QPP\7TvTr>Tr > >T/3ZXQVTggT?r\7VTV , P 

oiuioxojrir vr<yx v ifi&ui v v xjix 
AR VSP AOWT ,MPVT P AT i WF APIAGBS PEVT? 

NSR * 


I J04U 

* 






1 1 /4Z 




1 


nrounaur v^vivuisiJxxjxjfrAxxcriMOOXJx i 
AHCSLHl^SGDPPTAASWVSGNTOVHy 
HA 


11641 


25542 


A 


11743 


75 


218 


KIII^RAQW^TVIPALWEAKVORSPVS 
ASQSAWDYRREPPCPSCTI 


1 1042 




ii 


11744 


loo 


*> 


xjrvrJjWoJjvjir V xjuvj VVj^uorVjrbrrcCr r 

FFFFETESCSVARLECSGPILAHCSLR 


11643 


25544 


A 


11745 


83 


2 


REQRFLLPFVQLCPAPRdGVYRGR(iAS 


11644 


25545 


A 


11746 


156 


3 


FHSGSGRVENPS FFFS FFETGSCS VTQD 


11645 


25546 


A 


11747 


244 


330 


KDRAQWVTPVI PALWAAKAGRSLEVRS S 
R 


11646 


25547 


A 


11748 


201 


1 


Xby fiilKX 1 Jjb ire r£ bKDGOJbxTCjUJrKb 
DPELPGSSNPPSCSASRVDGTT(»ICHHT 
RLIFFSQTDKK 


11647 • 

■ 


Af fin 

25548 

• 


A 


11749 


262 


** 
3 

■ 


ViloJjirfr'lu) ViN LiJj 1 w V o Jr Ko vb I ISUN 
xijv^xujr v i iciLirr vwji; vv»x«r v^r r rv 
MESCSVAQAGWWRDLGLLQAPPPGFTY 

X X 


11648 


25549 


A 


11750 


225 


3 


PIINFSVPQFLHLYNGIIIESTPPSCCI 
EAWTLKTI LANMAKPHL 


11649 

i 


25550 


A 


11751 


185 


2 


VSTPFNSFPSPPWDLVFLKGFFFFFFFF 
FFETEFCSSPFJiERNGAJ>rJUiC2JLRLPR 

SSNSP 


1 luJv 




A 

mm 


1 1759 


961 




DRVSLWS PRLECNGAI S SHCNLRLP 


11651 


25552 


A 


11753 


282 


381 


TDIiLYKKKCGLGAVAHAYNPSALGGQGG 
RITCS 


11652 


25553 


A 


11754 


202 


45 


NPRXVKLQWALILPLPFNLGGQLKSRFQ 
KKKKKKKKKKKEEKRFCKKTLVVI C 


1 JOJJ 


95554 


A 


1 1955 
1 1 / Jj 


9QT 


759 


L.T PVX PAL.WE AEVGGS PE VRS 


11654 


25555 


A 


11756 


264 


1 


TLPINWCCDHSTGPSLISLPLLSLPYS 

SLTLDQKLEMIKLGEEGLLKAQIGQKLG 
LLHQ 


11655 


25556 


A 


11757 


126 


1 


I LPGFLKE LRI ELP FNPAI LPLGI YP KE 
KKLLYQKDTCTHMF 


11656 


25557 


A 


11758 


399 


226 


I EKGTGRVWWLTPM I PALS EAKAGGS PE 
VRS PRPACIGLLKFWYYRCE PQHPARLS 
EI 


11657 


25558 


A 


1 1759 


235 


380 


DTTOLKLGQLITLQWASKCSSERKSRTS 
LTLTQKLEMVKLSEEGM5KGD 


11658 


25559 


A 


11760 


249 


357 


NRASGQTWCPTPIMPAIiWEVEAGGSLBP 
RSLRLTWA 
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SEQID 
NO: of 

sequence 

* 

• 


SEQID 
NO: of 

sequence 

• 

• 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 
beginning 

location 
correspond 

inp tft first 

lllg iv 111 Ok 

amino acid 
residue of 
Deotide 
sequence 

i 


Predict- 
ed end 

nitrlf*- 

■I u t iw 

otide 
location 
corrcsoon 
dine to 
last amino 
acid 

residue of 
peptide 
sea ue nee 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
K=(*lutamir Acid F=PhenvlalfltiinA 

Jw wlUUlllllV rlvlUy A, * U VIJ Jf 1(1 I4JU III C| 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N = Asnar&pine. P=ProHne. 
O^GIutamine. R=Arffinine. S=Serine% 
T=Threonine, V=Va!ine, 
W=TryDt0D h a n, Y=Tvrosl ne, . 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


11659 


25560 


A 


11761 


168 


372 


KLHCLLCFYX.SEFFVFCFLETRSLSVAQ 
SKCSGTVITHCSLKLLCSSDPATSDSRV 
AETTGTGYHAGL 


11660 


25561 


A 


11762 


303 


377 


QFAGCRWLMPVT PALWEAKADGSPE • 


1 1661 




A 


1 1763 


135 


1 


ATTPGliFFIFYFFETESHS IAOAGVOCM 
ISVHOJLCLPGSSDSFV 


11662 


25563 


A 


11764 


219 


2 


KLRSSQLSEPEVRPDSTLSMLMHARAHT 
hththphiiflphp csrphtstptcmptv 
THSPIYLKFHSHTYIIj 


11663 


25564 


A 


11765 


102 


1 


NRPINNPTMASMCSSERKSHKSLTLNQK 

1 Hi II A nu 


11664 


25565 

i 


A 


11766 


245 


3 


PGEPEEDQLVKFRPSLLWVFMFLFFLRR 
GFKQFSCLSTjLSGWDYRHVPPCPS 


1 100 J 






I M6TI 

I I /O/ 




y i 


GSVPAS POVTRPTLMTSLADKWF^tdLT 

SLRVNFGVALILSGS hs vsodtldlltl 
FSTHLGLPKCWDYKRVPPCSAQFLVTFL 


1 1666 


ZJ JO / 


A 
*» 


1 1 / Oo 


l oo 


3Q1 


LGS VAGDIjLCFFGGVI FPCS FMFPMST.C 
YYLYIWYNSCVFLFFKTEPHFVTHSGVH 
WCDGLLOPPPPN 


11667 


25568 


A 


11769 


131 


2 


YTLYIPNKSPIENIQJiSWAQWLVPVIPA 
LCGAETGELLEPRSS 


1 1668 

4 i TOO 


75569 


A 

** 


1 1770 


165 


1 


VLI PLIHLCI YI YI YTYI CVYI YTRI YT 
HICIHTHVYIHIYVYIHTYIYTYMYTY 


11669 


25570 


A 


11771 


72 


1 


ILLFFFFFEAESCSVAQAGMQWCN 


11670 


25571 


A 


11772 


121 


3 


TSFVLFCFETEFCSCCCPGLECWGAILA 
HCNIiRLPGSSD 


11671 


25572 


A 


11773 


152 


3 


HVCLNLTLPFLFEKNIFSLCVCVCVCVC 
VCWIiCKKKKClEKDFYFERTF 


11672 


25573 


A 


11774 


190 

i 


2 


GFSPRQRGAPRVPPPIAGPFPPIFFFFF 
FFE^SHSVAPAGVOWCNHSSLAAT.TSP 
DSGDPP 


11673 


25574 


A 


11775 

• 


287 


1 


GAHRRKLFITPGESLRDKQIAGFEHRGG 
E KKKlTiFYKKXKJOiKKKPJCEKKGI^KKM 
L PKCLTTTMKNHQ VP YS I RP TRE CADLR V 
FYFIKFQILKY 


11674 


25575 


A 


11776 


121 


3 


KCASRDLSKFFFFFFIxETESRSVAQAGV 
QWCDLGSLQAL 


11675 

* 


25576 


A 


11777 
1 1 / 1 1 


142 


i 
i 


EKTLHVRNTIl^SEGSLVIKIHGRIiGVV 
AHACNPSTLGGRGGQITRS 


11676 


25577 


A 


11778 


179 


3 


SHQ P VPGTLDLPRG PQKLQ S TS EAESEA 
SMSEASSEDLVPPIiEAGAAPYREEEEAA 
KK 


1 1677 


25578 

X>->-> / o 


A 


1 1770 


147 


6 1 

V 


KTPGLKKNSNFFFFFFFFETRSHS IARA 
GVRGCDLS SLQPPPLGLK 


1167ft 


7SV7Q 


A 


1 17ftft 


io*t 




GFGFYI I PNYRIiFFFSLLIGP FFFLIFF 
ETEPCSVARLECSGVISAHCNIRIiLGSC 
DSPA 


1 1679 


25580 


A 


11781 


120 


319 


VKSLAKFLVNYQLHRQTCSGQAGRGALR 
QRFGRPRQVDHLRPGVRDQPGQHGETPS 
LLEVRKLSSG 


11680 


25581 


A 


11782 


91 


3 


Y I CLS L I YTHTHTHTHTHTHTHI YIYVC 
V 


11681 


25582 


A 

# 


11783 


221 


332 


SRl^MVDHPRSGVRDHPGQHGETPSLLK 
IQKLARRSG 


11682 


25583 


A 


11784 


243 


379 


LKYSVP I KNCKP WPGTWDHTRNPSTLGG 
QGGWITLDQKFETSIxAN 
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SEQ11) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M, 
eth 
od 

• 


SEQID 
NO: in 

necM 
UooIN • 

09/515,1 

26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino aciu 
residue of 
• peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A)anlne 
OCysteine, D=Aspartic Add, 
H/— Oiutamic Acid, r— rnenylalanine, 
G=Glycine, H=Histidine, I=lso leucine, 
. K=Lysine, L=Leucine, M==Methionine, 
N=Asparagine, P^Proline, 
v=v>iuiamine, tsr- /vrginine, s— ocrine, 
T=Threonine, V^Valine, 
W—Tryptopnan, Y—lyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \= possible 
nucleotide insertion 


1 1683 


25584 


A 


11785 


327 


5 


GRVDSQ P 1 1 PKNP PQS FP KI YPGS FKTL 
FPYLQSRHSFSIPRSKLARFHQAKPLFC 
SKFSQSLS PHSELKiuUJLSQARWLTPVl 
RALWESKAGRSPEVRSSRPTWPT 


1 1684 


25585 


A 


. 11786 


245 


359 


F KGMDKljRSGVKlJQ PGQHS KTP SVUKIQ 
KLAGHGGMRL 


11685 


.25586 


A 


11787 


356 


67 


RDIASITARLRYDRGVGITRLBVLSKSD 
EYTKGSVLDFSRETEPIGYIYIYIYIYI 

VQNFDSPKLSYH 


11686 


25587 

* 


A 


11788 


103 


3 


KYIIPVCIPFFFETESCSVTQSGVQWCE 
LRRRG 


11687 


25588 


A 


11789 


136 


1 


VHQFTIR I LNYYYYYFFETESYSVTQAG 
VRWCNLGSLQPLPPGFK 


11688 


25589 


A 


11790 


291 


163 


SLIJ^LPRIiECNSTI SAkRiJpJlLLfiSSDS 
PASGRSLEPRSSKLO 


11689 


25590 


A 


11791 


107 


2 


I FMYVFFETESRSVAQAGVQWRYLGSVQ 
APPSGFS 


1 1690 


25591 


A 


11792 


329 


3 


KNPRNIIGKSRGPYITNFPGDKGPKNSS 
PRGNDL S I N KNVJ AQ A FQPRGQ KKLFFP K 
KKKKKETQSRSIPRLECNGPTIAQCNLK 
LLASSNPPTSASHSAAIISMSHSG 


11691 

• 


25592 


A 


11793 


213 


3 


ISPWLFLLLQATIiFRSQYPGPFPPFSLS 
PYLFPFKQKLGSGFFFFFLFFFKTESRF 
LARLECSGI ITAQ 


11692 


25593 


A 


11794 


1 


400 


KRAAPQ PAPEQRDLKKKKKKP PLAPSSL 
FRFWEKLLPSPQFRQPRGPFLGTRGVSF 
AFHRRRFWQ YGNMGE KQTWGNPGS S SNP . 
PPGTAGS LSGNRGPG WGGKIADRFWLYA 
RDS FGHS RG LKGS CCQAHLGK 


11693 

• 

■ 


25594 

■ 

• 


A 


11795 


3 


769 


RKEQTRNARAEVLRQAKANFEKEERRKE 
LKRl^GEDXWMLPDVKERXEQFSQEHSV 
KKKKKKDKHSKKAKKEKKKKSKKQKYEK 
NNESSDSSSSSEDEWVEAVPSQTPDKEK 
AWKVKDEKSGKDDTQIIKRDEWMTVDFM 
SVKTVS S S SLKAEKETMRKI EQEKNQAL 
EQ SME I FQS KLED AE KAAST KED YRRER 
WRKPTYSDKAQNCQESRESDLVKYGFCS 
RDR YATTDTAKNSNNEXF I GDE KDKRPG 

pT ™ 


11694 


25595 


A 

• 


11796 


110 


13 


HTHTHTHTHKHTHTHTHTHKDRSGKIKC 
vp*rv 


11695 


25596 


A 

* 


11797 


184 


2 


SEIFGKFHVFYILTVHFMPPTQVYVERD 
VLGQEQLLT? V I FALWEAKAGRS PEVRS 
SRPA 


11696 

• 


25597 


A 


11798 


83 

. ■ 


387 

* 


G £j RRKKRKRRL WAP LQ AiGCKKKKKKKKK 
KKKKKKKiOOCKGGAPLKKPPGGPHFSGG . 
RQKNXPPIjKGGEIjKRAPAGDFKNPGRGK 
IARGGFFEKNIiSWGGEK 


11697 


25598 


A 


1 1799. 


153 


2 


i p wedttvysgypqflfshsffyfyffe 
tgscsltqtgvqwcnhgslqpr 


11698 


25599 


A 


11800 


93 


3 


vwgrawwltsvisalweakvgespevrs 
s 


11699 


25600 


A 


11801 


115 


1 


NP LFFFFFF FEMES CSVAQAGVQWCNLG 
SLQPLPPRFK 


11700 


25601 


A 


11802 


295 


137 


RCFI FILFINKLYFVYLFI YFFEMESHT 
VaQAGVQWRDLGSLQAP P PNKLYF . 


11701 


25602 


A 


11803 


287 


1 


MGPHAPFKGTCGGFQDFRFKNLEISAGL 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 

• 

■ 


M 

eth 

i 

od 


SEQID 
NO: in 

f ICC XT 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid " 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acta 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartic Acid, 
Hr— oiutamic acio, r— .rnenyiaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lys ine, Lr=Leu cin e, M=Meth ionine, 
iv-Asparagine, i^rroiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V^Valine, 

\i/MnrM« * » _ 1*. _ _ v» »hp« imam ■ m m 

W— tryptophan, Y— lyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


• 


• 










THS P IiIiAQNFRVGAVPS PR YS PRPGS KG 
GGG FFFF F FF FE TE SHSVAQQ AG VQ W CD 
JjGoliUAliP Pbr 


11702 


25603 


A 


11804 


202 


3 


WSVCCFKQGLGLSPRLECNGAPGFKQSF 

nr OT T OMUPVBPBDDT3T MUPUPPPPMOO 

CSLAQAGVQR 


11703 


25604 


A 


ft ft ft f\A* 

11805 


3 • 


*> ft. Mt 

315 


FMJjJj I IjTLi FIjIjRWIJKJjVQC.UVRS 5 WCIj 
GFLLGWSVILYPLMAAFMPTMWIFLFIL. 

FSMLFFYVFFLFLLFFLLFLYSFPLFCF 

t 


11704 


25605 


A 


11806 

i 


126 


3 


KRGFFFFFFFFFFETESRSFAQAGVQWC 


11705 


25606 


A 


11807 


130 


2 


QS I KMTCSSLYFFI KKNYAWWLTPWPA 
LWKAGAGG5HEPKS 


11706 


25607 


A 


11808 


149 


1 


GGKKAILFFFFSLSPEiliHeS^AlPTHCK 
LCIjPGS rhspas afrwgtag 


11707 


25608 


A 


ft 4 AS\/N 

11809 


258 


3 


KYYYQGNRQI KrVNAKSNKNIiPS YSQKA 
SWFSSRNIRMMGQHQPMHLDNPTSFIKK 

T no7\ T«7T»TT TamTTTJTiT UTEP A T? 7\ O T T7PD OT 

JjGQAWWLiRP v 1 PAijWiiAaAG\7oLiiS5Ro li 


11708 


25609 


A 


11810 


153 


285 


CASPIRSHQKLNKNQWVQWLMPVIPVPW 


11709 


25610 


A 


11811 


160 


3 


NHLFLLLNRYKS I F IGRVQWLMPVSSAF 
WEAKAGRS FEPRSSRPAGQHGKTL 


* * M ft A 

11710 


25611 


A 


11812 . 


89 


2 

■ 


At r r r r ETVSC5 VAQGtj VUijCnijrlSIfULi 
N 


4 *• ft ft! 

U711 


25612 


TV 


11813 


254 


^ ^ ^ 

336 


J-iLitjyFyWJjl P V IFAijWtAilAIjKi'iSCt VK 


11712 


25613 


7V 

A 


11814 


108 


2 


L.vbvlAjr r vUbrr IS 1 HbLb v/iyAvjMy Wrl 

DLLGSLH 


" ft 4 Ml ^ 

11713 


25614 


A 


ft ft. A ft A? 

11815 


166 


289 


5n&vjWr»AVWKlJ\jI>ijyPLiI rur I\J<r FL.LiK 

VPSSWDHKCAPPC 


11714 


25615 


A 


11816 


192 


2 


DKDPG KVTRPFLLLC I FYLF I CLFLRRT 
LAVSPRLEFSGRGCSEPRSHRCTPSWVT 


11715 


25616 


A 


11817 


228 


343 


LLQYS IQNEDAGWAWWLTPVI PTLWEAK 
AGRSLEVRSL 


11716 


25617 


A 


ft v f\ ft a 

11818 


322 


407 


VliRrlLIiHCL)RliLJUjVUPi-«WKl VWQ^I-LKiJ 


11717 


25618 


A 


11819 


119 


1 


WEKIVRGKKCQKENTHTHTHTHTHTHTH 
TRTYRESKRERLV 


11718 , 


25619 


A 


11820 


269 


1 


FVQ VF Y I LWTQS FCLTCCvWFIjIjGYHLS 
SVSNRFMRSGKSTFFFMTSEFFFFEMES 
Rt v rwAKVijCCoVsro 1 oHrtv>J\LiriijPGPKci 
SPALV 


11719 


25620 


A 


11821 


217 ' 


389 


EH X ET VYGKLG I S EH WGKGGL FNKWLT I 

TilWTWfCltjTf^WTJT JWQT.7T T3TA T Ml? 1kV\Tf3n.GV\T 

H 


11720 


25621 


A 


11822 


188 


I 


GSHMPCRVISSVKESMNEFPAVPTSYPA 
NPQPRBRAWRNQREKEDKKERSQRSVGR 
VOAGLV 


11721 


25622 


A 


11824 


126 


3 


KLQGQVQ WLTPVI PAP WEAEAGRS PEVR 
SSRPASTWRNLV 


11722 


25623 


A 


11825 


224 


3 


AIjIIKKEVSAMELEGSNRIQAYGVQSIT 
GARDYHAAASRVPAI KGTHHHARVIFVF 
L VETG FHHVG QAGLDS C 


11723 


25624 


A 


11826 


3 


364 


HELPEPLRVLWTAHRVAMAPGSRTSLLL 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLQAHRAHQLAIDTYHEFEETYI PKDQK 
YSFLHDSQTYFCFSDS I PTPCNMEETHQ 
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at 111 
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Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 

Ifefiliitnmif* Acid. F = Pherivl Alanine 

Ur^\JIU Willi /l«rlU| M. M. llwilTlillfllllllM 

G ss Glvcine. H=Histidine. I=Isoteucine_ 
K=0Lysine, L=Leucine, M=Methkmine, 
N=Asparagine, P^Proline, 
Q=Glutamine> R=Arginine, S=Serine, [ 
T=Tbreonine, V=Valine, 
W=TrvDtoohfln. Y=Tvrosine. 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KSNLEIiLR 


11724 

■ 


25625 


.A 


11827 

1 


2 


376 

- 


ARELPEPITVLRTAHLKAMAPGSRTYIjL 
LAPALLCLPWLOEAGAAOTDTIjYMLPDH 
AMLQAHRAHQLAI D TYHEYDETYI PKDH 
KYSILHDYQTSFCFSNSITTPYNKEHTQ 
OKSNLELLRISLL 


11725 


.25626 


A 


11828 


288 


3 


IHTKNPSVHHHHQRPKVDKTTKMGKKQS 
RKTGNSKKWSASPPPKEHSSSPATEQSW 
TENDFDELREBGFRRSNYSEIiQEEIQTK 

fc Jfc ^ fc^ fc fc^fc* V^IW A *^*^fc*^Y A W AJili^f MM M» ^( A 

GK^VENFEKN 


1 179/; . 

1 1 /AO 


Z JOZ- / 




11890 


IV/ 


i 
i 


KRSGVRLQRGLVRWLTPVI PTLWEABVG 
GSFEVRC 


11727 


25628 

i 


A 

_. . 


11830 


220 


354 


QGNKS FKNI IAPWLGTVAHACNPSTLGG 
QGGRITGHEIETILANM 


1 1 IJ.O 






1 1 Q7 1 


nil 
Hi 


35Z 


E^l^(^tQWLTPVIPAliW^AK)^tt^ 


11729 


25630 


A 


11832 


239 


488 


SQTHCSGKTENSHGTNSDRNQVPTALHK 
VSAVASQWERTGKPFNPLLGETYELIR 


1 1730 


25631 


TV 


1 1833 


i on 

lev 






11731 


25632 


A 


11834 


1 


332 


GTSPEPLTVLWTGDLLAMAPGSRTSLLL 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLODHRAHHLAIDT YHEFEETYI PKDOK 
YSFLHDSQTFFCFSDSIPTPSNMEET 


11732 


25633 


A 


11835 


330 


175 


PRELAQLGSPNVRVSRVKPRGQ PQMFFF 
FFWROES CS VAQAGVOWHE PEOGE 


11733 


25634 


A 


11836 


96 


1 


VFVETGSRSVAQDGGQWCDLSSLQPQPP 
RPRA 


11734 


25635 


A 


11837 


267 


334 


S WVQWLTP VI LALWEAEAGGS P 


11735 


25636 




1 1838 


t OA 


2 


C^GRAQICSVCVCVCVSVCTCVHVCAYV 
SSC 


1 1 /JO 


25637 




1 lopy 


loo 


*> 
Z 


GRGQAQWLMP VI PALWEAKVGRS PEVRS 
LSSC 


I 1 

I I IS 1 


25o3o 


74 


1 1B4U 


i'A 
54 


im 


nPWfZTJT .PPPT .TfRT .WT AHT.VAM A Pf5SRT«5 
LfiT4Vt\S\iiJ*T CtJT JJTv.VjJjrv inlliiVnl'lnfUQni O 

LLLAFGLLCLPWLQEAGAVQTVPLSRLF 
DHAMLOAHRAHQLAIDTYHEFEEPYIPQ 
DOKNSFLD 


1 1 1 Jo 




A 




1 17 
lit 


1AA 


IOYVKOINETEMRNNESYLNHTSLTITI 
HTLCLMGSYLEHFQNQCKGRARRLiMPVI 
PALWEAKAGRS PDVRSSKPA 


11739 


25640 


A 


11842 


750 


968 


RAQGPRWILKIPFSSPFRLFPLIPLVFL 
YNSPPLFPPFPRTKATNNSPFPPPLPPP 
LPFSPPPPPSPSRPPPS 


11740 


25641 


A. 


11843 


757 


1000 


WELVWYCCLFSPFFPLiPSPPLFPPPPP 
SPPHS FPPPS PFRPPLPLAPTRPPPPFS 

4. X. ^ * » W « * *.wfc fc fc^ *fc • * vfc fc fc> fc fc fc# - 

LFQAPPLLPSFCSPPPPPPPSLSFL 


11741 


25642 


A 


11844 


278 


361 


TLLFFFFFFEMESHFVARAEVQWRDLGS 


11742 


25643 


A 


11845 


260 


2 


GSCLLEGKLTNRKDIHTKNPSVHHHHQR 
PKVDKTTKMGKKQSRKTGNSKKQSTSPP 
PKERS S S PATEQS WMEND FDELREEG FT 
RA 


11743 


25644 


A 


11846 


194 


2 


TKFAKEPS PPPCWQEVFKTS PLGFFFCP 
RSFPVNGFPRLKNLFFFFETESCSVAQA 
GVQWCARA 


11744 


25645 


A 


11847 


219 


1 


SLGHTLKQSESSLSFSPGGTPLLEFKPS 
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Amino acid sequence (A=Ala nine 
OCysteine, D=Aspartic Acid, 

E=frliitflmic Arid K=Phenvlfllanine- 

ju/ vjiu lu line riLiu | a 1 ^ ijbii rmiauijivy 

G=Glycine, H=Histidtne, I=Isoleucine, 
IC^Lvsine. L^= Leucine^ \t=Metbionine. 

N-Asparagine, P^Proline, 
O^GIutamine R^Arpininc S=Serine.. 

T=Threonlne, V=VaIine, 
W=TrvntoDhan. Y=Tyrosine. . 
X=Unknown, *=£top codon, /=possible 
nucleotide deletion. V=POSsible 
nucleotide insertion 














VYDPSVP PFQVSKGGCVGFPFFPQTESC 
S VAQAGVQ WHDLGS PRA 


11745 


25646 


A 


11848 


126 


3 


RFFFKGLFLHFFFFFFFFCFETESHSVA 
QAGVQWCDLGLV 


U746 


25647 


X 


11849 


216 


339 


KC FKVGKVYKLNRE T FYLAQNFFDRNWA 
TOKMWKTLLELI 


11747 


25648 


A 


11850 


190 


334 


VS YHI EMS FFEVYNE KIHDLLVCKDENG 
QRKQPVRLKQFI ICFEFLFL 


1174ft 

1 1 (to 


25A4Q 


A 

AM 


1 1851 


108 


357 


FVPRLECSGAISAHCTLHLPGEL 


1 1 749 




A 

*m 


1 1852 

I JO J— - 

i 


371 


i 

1 


PPKKr if? TOVFTP PAP1? APWP VT i\7T?1f infrt 
c it jv rujuxy v c i, <tr c rwr ivrvr wjrvijvjc aamj 

FPQGNLVLPGFFSKVS IQGVPPFCRKPF 
SGWGEAFQ I GFF INRGKKFF FFPLGFFL 

AGVQWRSLGLV 


1 1750 


25651 


A 


1185*3 

J JOJJ 


176 
i to 




LHTFFFLTE SRS VAOAGMOWCDIiGS LHS 

c 


11751 


25652 


A 




281 


375 


OLTFKKYFLGWARWLTPVI PATiWRAKAf? 
RSP 


11752 


25653 


A 


11855 


146 


356 


KCGALIAEIEVPLF^ELRDFLLAYSRMT' 
ELCFKRCVPSLHHR7\LDAVDDDCLHSCS 
CTDYP INRSLMNAY 


1 1753 


25654 


A 

* * 


11856 


136 


i 

■ 


LSSLKPPPPRLQQCHSC 


11754 

ill «/*T 


25655 


A 


1 1857 


777 

■£,11 


361 
Jul 


T VT.RGAMWr.TPVT PTTiWRAPPRTlQPPVP 


11755 


25656 


A 


11858 


295 


152 


WFGFGFFETESHFVAQ PGVQ WCNLGS L 
CSLRLPGSSNSPASAFQVA 


1 1 756 

* * / <jy 

■ 


95657 


A 


1 1850 

I 10J7 

• 


i 




GTPTiPPPTiTVTiWTAWT/^AMAPr^QPTQT.T. 
TjAFAIiLCLP WTjOEAAAVOTAP VS RT.FTJH 

amlqahrahqlaidtyqesebtyipkdh 1 

KYSLLHDSQTCFRFSDSI PTPYNMEDTO 
HK 


11757 


25658 


A 


11860 


244 


332 


TIEMMLDIKQIQVIFLFEFKMGRKIABT 
T 


11758 


25659 


A 


1 1861 


1 


330 


GTRWTI CCVOIjHAYAHFLYFFEMEARS 
VAQAGVRWCDLGSMPPPPGSSSSSSSS 

s 


11759 


25660 


A 


11862 


311 


394 


GMLGAVAHACNPSTLGGLGGWITGSQEF 


11760 


25661 


A 


11863 


no 


1 
1 


VP PS VRTFFFF FFE TESRS VS OAGVOWH 
YLGLLQAPPPGSTSC 


11761 


25662 


A 


11864 


3 


320 


immyalfllsvglvmgfvgfsskpspiy 

«M> 4 ** 4 *4M«4 M^Mt^MW ■ M^ » A * »»J M. W 4. 4J Jv rit> 

GGLVL I VSGWGCVI I LNFGGGYMGLI V 
FL I YLGGMMWFGYTTAMAI EE YPEAWG ! 
SGVEVLVSVLVGLAMEVGLVLW 


11762 


25663 


A 


11866 


165 


309 


GLILLPMLECSDRISAHCSLYLLGSYDP 
SDSGSHVAGTTGTCHYAWLR 


11763 


25664 


A 


11867 


208 


314 


GSRNKLSGQAQWLTPVIPALWEAEVGGS 
PEIRSSR 


11764 


25665 


A 


11868 


102 


1 


KKPFFFFETESHSVAQAP^VQCNLGSIiQA 
PPPRFT 


11765 


25666 


A 


11869 


126 


3 


NNALQLHSSYCKKPFFFFLKTESRSLTK 
AGVQWCDLGLLQ 


11766 


25667 


A 


11870 


1 


273 


KQLPVNFLNWVRLELTGLLtTLNLSKPC 
MIFIILVIVKYWFLLFCNIFKNHIFSQA 
QWPTPVI PALWEADMGGSHGYWITTI VD 
FMCATLT 
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NO: of 
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SEQID 
NO: of 
peptide 
sequence 


M, 
eth 
oo 
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SEQID 
NO: in 
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09/515,1 
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Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
• peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
locauon 
correspon 
ding to 
last amino 
acio 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid f 
Hi— Glutamic Acid, r— rnenyiaianine, 
G=Glycine, H=Histidine, I=Iso leucine, 
A—ijysine, ju^jueucine, m— Methionine, 
N D Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V^Valine, 
w— l ryptopnan, y— jyroslne. 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, v= possible 
nucleotide insertion 


1 1*7 ATI 
1 1 10/ 


ZOOOo 




i i mi 
1 lo/I 


IK 


Z 


cdvrt T.TnyptrppgPTruiTTPCDCtraoaPTr 
-r JrJSAljJLiv Vr* ccr r r rCilxSoKoVAQAKV 

QWHNPGSLQCPLR 


1 J /DO 






1 1 mi 

4 


269 


A 

9 


y v £>v»KJr W FLU. r X K. 1 GKIwrMJ L.FN r P Qua 
P PRGL KKKS FS KKKKKKKKKS IAVGQAQ 

MT .T DTfT T5IV T JaTP JW\7r*T30 tJCXftrO OD r*TS 

nul^V IrM^nAivvbKbFfi VJvboKrnLr 

TW 


I 1 1AQ 

I I /oy 


•ZjO/U 


1 a 


1 1 Q1A 

1 1 6 / 4 


Mi 

1 14 


1 


LinHivroVr y oCj V JjJjVJ. r r rr KTxsSnS V5 
QAGVRWCDLG 


1 J / /u 


200 /J 




1 1 0*7C 

llo/5 


1 A £. 

146 


1 


VAGPGGAHLQSQEVQVGGSL 


1 1771 


25672 

• 


A 


11876 


2 


159 


SLQFFFFFFRWSFVLVAQAGVQWHDLQS 
PQP P P PG FKHS PAS AS QVAGRGGR 


11772 


25673 ( 


A 


11877 


124 


240 


FRAFSSGQAWWLTP VI PALWEAETGGSL 
EARSSRPDWRT 


11773 


25674 


A 


11878 


135 


5 


QVLFYFYFSDSFTLSPRLECSGVTLPHC 
NLCLPGSSDSCASAS 


11774 


25675 


A 


11879 


170 


2 


GFFFPEKWGYKNSFLCFFFFFFFEPEFC . 
S VAQAG VQWLHIA3S LQPPP PVFKVDAAP 


11775 


25676 


A 


11880 


97 


267 


GHGHATLRGLCVLSLFFHI PAPSVSGTS 
DAEECCLCVTYRLI CGYISRNCLYFSDH 
R 


11776 

■ 


25677 


A 


11881 


I 


292 


LPEPLRVLWTAHLQAMAPRSRTTLIiLAF 
ALLCLPWIiQEAGADQTVPLSRLFDHAML 
HAHRAHQLAIDTYQEFEQTYIPEDQKYS 
FLHDS QTYF CFSD 


11777 


25678 


A 


11882 


93 


2 


KYQMGWAW WLTP VI LALWEAEAGRS PET 
SC 


i i no 

11778 


25679 


A 


11883 


145 


2 


PPLGLRLQVQAPTPGFFFFFFETESRSP , 
RLKCKGAILAHCNVCLLLV 


11779 


25680 • 


A 


11884 


202 


1 


TWWRWGVTVLVRLVMNCRPCDRHKSASQ 

i 

L IGR VRQENGLNARVGGCSE PRSRHCTP 
VWVTSENPSSC 


11780 


25681 


A 


11885 


42 


155 


GERSGLS PG VQDB P GQHSGTS S LQKILK 
IiAGHGGTCP 


11781 


25682 


A 


11886 

• 


215 


1 


STMARHCPLS PMLFNSAMEVLVRAISQE 
KEIQGI QIGKEEVQLSLFTDDM I FNLEK 
RKDCS KNLLQLMNLV 


11782 


25683 


A 


11887 


214 


1 


GYFFGLNEVLGKLEKPSLKVPQNPSPKR 
Pfr x? I/5FFFFETESRI VARLECSGAISAH 
FNLCLPGSSDSPVSC 


1 HOI 


25oo4 




1 1 0>00 

1 1588 


215 


3 


WtjfGr Pir r or r r rr LiRQGIiAVIRJjKCSG 
TITAHCSLNFLGSSDPPASVSLVATTTG 
n Pig a x lunboL 


1 1 toff 






1 1 ft CO 
1 1507 


1 to 


1 


PT?l?TMr , PNrD f rVT7T?T?'PPl?TC , <2T T DDT CPC 

GVISAHCNLSC 


1 1785 


25686 


A. 


11890 


267 


83 


HCLRSGVQDQPGQHGKNPSIQKIQKLAT 
c TTtrvcT irTVTDDT.DT?irKTrMr r pncirce/^cc 

LSPHS 


11786 


25687 


A 


11891 


134 


2 


DRLAVLPRLECSGMIFLLPLPSRFKRFS 
CLS LP SS WD YRGAPRA 


11787 


25688 


A 


11892 


116 


1 


S KG VGHFLFFFFFFETES RSVAKTGVQW 
CDLGSLLCLV 


11788 


25689 


A 


11893 


397 


475 


RFVCSTI KVLRDLSSDRSNPGRFLST 


11789 


25690 


A 


11894 


115 


331 

* 


KNVCLFVFVEMKNHLNPGDEGCSKPRSR 
HCTPS WAAQQDS IS KY ICI YRYLYLDIY 
IjSKLVVRLRINQLPNS 


11790 


25691 


A 


11895 


134 


251 1 


INPPVSRKKXKKKKKKKKKKKKKKKKKK 
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Amino acid sequence (A^AIanine 
OCystelne, D-Aspartic Add, 
L-Glutamic Acid, F—rnenylalanine, 
t»=vriycine, Jri=tiistidine, i=isoieucine, 
K=Lysioe, L=Leucine, M=Methionine, 
N-Asparagine, P^Proline, 
i^=uiutamine, K=Arginine, o— serine, 
T=Threonine, V=Valine, 
yv— i ryptopnan, i— tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \— possible 
nucieoiiue insertion 














KKKKKKKGGGL 


11791 


25692 


A 


11896 


186 


402 


PGKKDTLENVPRENKVGEKAPGQNEPPV 

IAjvjvj Ci I UL/lrmjN V JUj v t»F H FiUrval? oiUJu 

PKRLADTFDLIEGDGD 


11792 


25693 


A 


11897 


100 


2 


KGPPFPPFFPFFFPPFFPFGQSGQVKLK 


11793 


25694 


A 


11898 


92 


3 


SVARIMPYALFIiLSVGIiVMGFVGFSSKP 
S 


1 1 /y* 


25695 


A 


i i onn 

1 1899 


283 


362 


MT YTMT I VHAIiWAS V CLLLNHAHD PLD 


11795 


25696 


A 


11900 

i 


248 


1 


VMSAQPGLSVILRFMGKWSNWTGMQYTE 
SEVER YRDRGRARETERQRDRERERENP 
RNRKLFFYGRFKHSPQDFFMPQKVHF 


. 11796 


25697 


A 


11901 


253 


2 


KLQENPFKLLNF I INLCS VSLNVP 1 1 LN 
INIKPSFIAPKPRCPSKFQRNYIYIYPS 
I YLS I YLS J YLS IYLP ITI SKVLKDVE 


11797 


25698 


A 


11902 


179 


3 


LAVTKNEIMGFSTTWMEMGDINFFFFFF 
ETKSHPVMRLEC3GVISAHCNLRILVSS 
SC 


U798 

■ 


25699 


A 


11903 


1 


403 


GTSS QESFGGCCVSGL I AMGTKAQGERK 
LLCLFILAILLCSLALGSVTVHSSEPEV 
1 1 PENNPVKL I CAFYDLS FFSSS FSYPF 
HLSFFIYYPYYLFVSCFTFYIVILLSFI 
YSHHLSSVSYYSYVLLIYTFIL 


11799 


25700 


A 


11904 


37 


242 

• 


KG PTRDQLQHPKARLPAPLRVLWTAHLA 
AMAAGSRTS LLLAFALLCL P WLI YDGAS 
HIVSLFMLFI IF 


I 1 OAA 

I I SOU 


25701 


A 


11905 


1 


298 

# 


GTSQHRGRKDSRTGSHSSSDHPGAKLLS 
TEEKOAAETMRPPSAPPRRGCIPWQGLL 
LSSS 


11801 


25702 


A 


11906 


225 

■ 


30 


ACYLQKDGAFAILPDVLPDMFKGHSSIjY 
PCQHAE CDHI KN I YNCVCVCVCVCVCVC 
IVICKLNV 


11802 


25703 


A 


11907 


196 


320 


HS SS S TT PS QDTHTHTHTHTHTHTHTHT 
HKI PQREIiLPSVPD 


11803 


25704 


A 


11908 


130 


3 


NGFVFSPFPPFFFFEMESHSVTQARVQW 
CDLGSLQSLPPGFK 


11804 

* • 


25705 


A 


11909 


381 


2 


ASICGQKLIFFQHWBCKI ILVQQVGLQKK 
NSTLKRAWEPLSFFQPAPPLGSPIiKGIN 
ILNKGAKKLKGKTNFPGPIVPFFSVLPP 
FFGLKKFQKKVPEQGKSPFFFFFFFETE 
SRSVAQAGVQWHDL 


11805 


25706 


A 


11910 


138 


2 


S VPLKE F 1 1 SQARWLTP 1 1 PALWEAEAG 
RS P E VRS S RP AS TWRNP 


11806 


25707 


A 


11911 


231 


54 


IRASFGIQIRICLTIFFFFETEPHSWQ 
AGVQWRDLGSLQP P PRRQEQNS VSKQTK 
TN 


11807 


25708 


A 


11912 


3 


461 


DAWGRVEGPPLRP PATSRRWAGP TLWRM 
EVTGDAGVPESGE I RTLKP CLLRRNYS R 

CiWnu VAV £>L.LiJ2iU1jKo iVALiJ XLtHJ. UJxbiJ 1 

P VTLVLAEDGT I VDDDDYFLCL PSNTKF 
VAMAINE KWAYNNSNGGTGWI SQES FDV 
FEAYSGATLLFPF j 


11808 


25709 


A 


11913 


318 


407 


LCGRLWWLMPVI PALWEAEAGGLLKLRS 
MR 


11809 


25710 

• 


A 


11914 


340 


5 


DRVAKFSEARLFRLFFPFFPLKI FCPPR 
GFKIFRGVCPLLFPPEFWGLFQKGPGMR 
XiFFPPLGGFPFFFFFFFFFPFPSFFPPP 
FFPFFFFFFPFFCmNKTLLKKVHNS I 
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last amino 
acta 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=Aspartk Acid, 
tr-vriutamic Aciu, r— rbenyiajantne, 
G=Glycine, H=Histidine, I^lsoleudne, 
iv— i-»ysine, l— leucine, m— met nio nine, 
N=Asparagine, P=Proline, 

^WOI||4a*M IMA A MM m~* ■ — S f> A O-^O All • *k A 

Vr^vjiu famine, jtv^/vrginine, 3 =s oenne» 
T=tbreonine, V=Valine, 

tt — irypiopnan, i — lyrasine, 

X=Un known, *=Stop cod on, /-possible 

nucleotide deletion, \= possible 

w% ■ ia!aa4i/Ia ■ W% 0 am£b am 

nucieoiioe insemon 


11810 


25711 


A 


11915 


274 


385 


IKPQRWGETRAEKLK3LKIRVPLLLQRN 


11811 

• 


25712 


A 


11916 


259 


441 


DTKLPKVNI KLNFHALKKKKKKKKKKKK 
KKKKKKKKSSSLRG 


1 1 Q11 

J Joli 






i 1017 

■ ll7l./ ( 




a no 


CRLRLPGSYHSPASGCQLLGR 


11813 


• 25714 


A 


11918 


223 


1 


NxjNS JP KKKF r YE b VS5VPVFFGVF P PSP 
LKGSPRAFLKLAWRPPLPLFFFLEMBSR 
SAAQPEVQWCDLGSLQPP 


11814 


25715 


A 


11919 


97 


3 


KPEFFPFFFFFFEI ESRSVTQAGVQWRD 
LGS 


11815 


25716 


A 


4 4 A 

11920 


4 ^ A 

154 


384 


KEFFLMIiFFFLPPPPPFFFFFFLGKGLF 
FFPPGGGGGGQFFS IGPPPPR6KKDPPP 

SFPKMGGKRGTPPrPvSYFFFF , 


11816 


25717 


A 


11921 


225 


369 


GTLNLLTYKIKSWSGAVAHYNPSTLGGW 
GKWITSGQEFETSLANWKP 


11817 


25718 


A 


11922 


116 


372 


ME YTWNKMAMSYWIiHIYQTYKNSVIEHP 
TGVLPHSRVI MVKn I VLCI S KQLKBLLR 
RLMWE EHLSPGGGGCSE PRS RHCTP AWA 
T 


11818 


A, 4 A 

25719 


A 


4 4 A 

11923 


271 


409 


KKKiCKKKKEKKKKKKKKKKKTGGGA 


11819 


25720 


A 


11924 


121 


228 


KKKKKKKKKKKKKKKKKKKKKKKKKKKR 
GGGFKKKPWGGQK 


11820 


25721 


A 


11925 


55 


423 


NKPKKKNFLKKKKKNFFFPFYPLKFFFF 
PKSLNFFRRWPKISPPKKKFFFKNSPS 
VFFFPPLKKKNFFFLTPLKFGPPKNFFK 

F1T1TIT1T LIUL*LILHJLIUL)L> Lff OOmT^ITWVT 

RPPPLFFFFFFFFFFFLSGNCIEKL 


11821 


25722 


A 


11926 


253 


443 


YQHQRPKVDKSMKMGRNQRKKSENSKNQ . 
NTSS P PKDMNS S PARQQNWMENE FDGIiT 
EVGFRRW 


11822 


25723 


A 


11928 


175 


413 


KKKKKKKKKKKKKKKKKKKKKKKKAGGG 
A . 


11823 


25724 


A 


11929 


490 


182 


RKQK I KG CKKPNP LAQGGVKKKGKKGPP 
FNFFLKQKGQDFLKNWNLGQKKKI P I PP 
QVFS LRQKGAF PGR I FFFFFF FETES CS 
VAQAGVQTLSYKNKNKIK 


11824 


25725 


A 


11930 


110 


2 


KKIPRKPPFFFFFFFFYKAGSHLVAQAG 
VQWHDLGS 


11825 

■ 

* 


25726 

■ 


A 

* 


11931 

- 


169 


987 

• 


YLEKI MSEHSRNSDQEELLDEE INEDE X 
LANLSAEELKELQSEMEVMAPDPSLPVG 
MIQKDQTDKPPTGNFNHKSIiVDYMYWEK 
asrrmle eervp vtfvksee ktqeehee 
IEKRNKNMAQYIjKEKLNNEIVANXRESK 
gssniqetdeedeeeeddddddegeddg 
EESEETNREEEGKAKEQIRNCENNCQQV 

KLALDTS FLKVS TR PSGNQTDLDGSLRR . 




79*7 




now 




IRQ 

JO? 


SVQTHPNIiRSCS VIiKNAMHFYLIjGYVI S 
GCTEPAKAI KPIDRKSVHQICSGPWLS 
LSTAVKKIVGNSLDAGATNI 


11827 


25728 


A 


11933 


134 


289 


ASTQNMGQMLRLMS S PTLLSLSLSHTH I 
HKHTRMHTHTHTHTHTYTPTKKV 


11828 


25729 


A 


11934 


128 


1 


IKKGEFFFFFFFFETESHSPRLQqSGTI 
LAHCNLHLLGSSNS 


11829 


25730 


A 


11935 


131 


19 


MYIGWVQWLTP VI PALWEPKVGASLEAR 
SLRPARTSE 


1 1830 


25731 


A 


11936 


104 


3 


LKGQLIRAQWLMPVI PSLWEAEMGRSPB 
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nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 
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M 

etb 

od 
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SEQID 
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U55N 

26 
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location 

correspond 

ing to lirst 

amino acid 

resiuue oi 

peptide 

sequence 


Predict- 
ed end 
nucle* 
otide 
location 
correspon 
ding to 

1 A A ^ f« (H I ■% A*\ 

last ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cystelne, D=Aspartk Acid, 
E=Olutamic Acid, r— Phenylalanine, 
k*— Glycine, Jri— Misname, i— isoieucine, 
K=Lysine, L^Leucine, M=Methionine, 
iN^Asparagine, r=rronne, 
Q^Glutamine, R=Arginine, S=Serine, 
i— i nreonine, y— vanne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stopcodon,/=possible 

■■iiaIaA^i Ja <1 aIa^A A M |HahAAA||ftla 

nucieotiae aeietion, \=pos3iDie 
nucleotide insertion 














VUOSK 


U83I 


25732 


A 


11937 


146 


2 


GGRLG 1 1 YHAWNPEGFFFF FEME S CS VG 
r»an\7r4Mr , Krr.ccjT^DDU'iV2T?ir 


11832 


25733 


A 


1.1938 


149 


1 


DKCFLCVCCLFIFIYLIiIYFYFFETESC 


11833 


25734 


A 


11940 


176 


380 


QS VAQL PRLECNGMIIiAHCSLHLLGFKR 
QAWCQSLTILWN 


11834 


25735 


A 


11941 


133 


415 


KKKKKKKKKKJCKKKKKKKKKKKKKKKKK 
KKKKKKSG 


11835 


25736 


A 


11942 


222 


420 


QGDKFLDNNNSTILYMEKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKYSR 


11836 


25737 

* 


A 


11943 


163 


425 


KNPPKNSQLSPSTLKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKTG 


11837 


25738 


A 


11944 


143 


3 


KMGKKQSRKTGNSKKQSASPPPKERSSS 
PAMEQSWTENDFDELREE 


11838 


25739 


A 


11945 


200 


3 


QNSQVGYKGAPHKEKTMALQARVNLGPP 
RGPLKRPALFFFFFETESPSVTQDGVQW 
HDLGSMQPP 


11839 


25740 


A 


11946 


221 


3 


RKIEGGPSTSRKSVRGVRCFQPLGLGFP 

/s/^r firm 13 BwtfTifHWfl aat rnt>T n^inrnnffl*! 

GGLFF FFFF YETES CJS VARL ECS GWSA 
HCILHLRSTNEGRRGR 


11840 


25741 


A 


11947 


68 


177 


ISSSFKWKCIiCEKiCKKKKKKKKKKKKKK 
KKKKKRKR 


1 1841 


25742 


A 


11948 


210 


329 


RHTHTH I n 1 HTH I lib Lib LSHTHTHAHTL 
MYFLAF FDIiRS 


11842 


25743 


A 


11949 


183 


20 


LGT VAHS CNFVFSLS RGFLHVG KNGLRL 
PTS DDP PTS ASQ SAGTTGMS PCTWPS 


11843 


25744 


A 


11950 


232 


329 


VGI KMS I SSDEVNFL.VYRYLQESGFSHS 
AFTF 


1 * OA A 

11844 


25745 


A 


11951 


412 


239 


LFYEKGSRFVSQAGLEL»LELKQSPCFGL 

P KCWD YRHE P PRQADLP F CVP S SAWAS W 

* 
it 


11845 


25746 


A 


11952 


310 


393 


PHTDISGTPEIMHYVHVHRVTTQPRNKP 


11846 


25747 


A 


11953 


193 


3 


LSSWPTLISGAFKAEENAVEGNDDSRIL 
RGFFVLFCFVLFFRQSLALSPRLECSGV 
ISAHCNL 


11847 


25748 


A 


1 1954 


185 


416 


nirrt<ntir>M7 ir> o riot tt tncrTvutrcvT t j""i'Dr»iv 
SVQTHPNLiRSCSVLil^AMHfYLiL^TaPA 

KAI KP IDRKSVHQICSGPWLSLSTAVK 


11848 


25749 


A 


11955 


100 


1 


AHLSKVFPI FFFFFEMESHSVTRLECSG 
TISAH 


11849 


25750 


A 


11956 


35 | 


521 


FFFFFFFFFFSFFFS 


11850 


25751 


A 


11957 


214 


1 


KKKIFFF KE I F YP KLS RKKKNF FRGAGK 
ILNLSIFPEKKKIFSPFSFFFETESCTV 
AQAGVQWHVLASSIiQP 


i 11 RSI 


ZJ I J I. 




1 10^8 




i 


PVF QHO AOWLTPVI PATi WF AFJIGR5 PPV 
SSSRPAWLTW 


11852 


25753 


A 


11959 


188 


12 


SKCQCTFFSNFKKTEMESHYLAQAGLEL 
LGFSHLPTSVSQTVGITGVSHCAQPNAH 
LY 


11853 


25754 


. A 


11960 


241 


556 


SS I P FL PN1CHLLLS FS TLS SLGREVS IQ 
DMCQGTYQKQPFPFPLPHVFHLS IRLX3E 
LCVGRDVHLTC^TOIiAQVFVCLFVIiGVL 
LSLPRAGVQWCHLSSLQPPPP 


11854 


25755 


A 


11962 


362 


462 


KIGQALTPIIPLLWEAEAGASPEARSSR 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 

TC=<"»lutamie Acid F^=Phenvl» Inning 
G=G lvcine. H=Histidine. Is Icofencine 
K=Lvsine, L=OLeucine, M=Methionine» 
N=Aspflragine, P=ProBne, 
Q=Glutamine, R=Arginioe, S=*Serine, 
Threonine, V=Valine, 
W=TryDto n h a n . Y=Tvrosine. 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=DossibIe 1 
nucleotide insertion 














SAWPT 


1 iOJJ 




A 


1 1 7UJ 




1 


CCCYCCCCCCYCC 


11856 


25757 


A 


11964 

• 


30 


415 

■ ■ 


GLGFKQKI FFFPFIjGQGPNPDLiLRAKKR | 
P FLP PP FFP KNP KNRGVFLLGP SGVPKP 
RPRQGDVPPLKPRSLGVFLTSKKTPLGP 
WANPPPPPQDGP ! 


11857 


, 25758 


A 


11965 


149 


2 


SKTWLKLAKYLKNTYPGWVRWIjIPIVPA 


1 lOJO 




TV 

* 


1 17OO 


170 


a 


KICNI FFP PPVKFGPPOGFFKRPP PIiFFF 
FFFFFFFETESRSVAQAGVQWLS IGSLQ 
AP' 


11859 


25760 


A 


11967 


245 


382 


DVTDS KMHPDFRSS HVKMWLG WAHACN 
PSTLGGSRGQEFKTSIiAN 


11860 


25761 


A 


11968 


264 


10 


LSTLECKGVSPGVFSPKGWRKGljGyiFS 
PFTiE 5CPH I P PLG I S MYFFFFFDGVS VAD 

AGVQGRDLGSLHLPS PGSSYS CAS AS LS 

S 


I loo I 




zv 




39#C 


/in/; 




1 looz 


23 /oi 


7V 




i OA 
I2U 


l 


AQDGVQ WRD LGS 


11863 


25764 


"3T 


11971 


84 


370 


RNGAELKTPLAGQTQITINCRYSWVTLV 
LGDLESILCWHIFKKTNLFICGFFSSFL 
LGAGRMKSHCVVRLECRGMISAHRNVCL 


11864 


25765 


A 


11972 


196 


3 


SRGKMGEYPFVLFKNPSWAKAGGNKGNP 
FFFFFEAESCSVAQAAVQWCNIiGSLQAP 
PPRPTPRA 


11865 


25766 


A 


11973 


559 


644 


KIGCSAYGVAILLFLYFFNKLAFTLWKK 


1 1 Q£JL 


ZD/O/ 

• 


n 


1 1 O'JA 


j 


701 


LIiAFGLLCLPWLQEAGAAQTVPLSRLFD 

W AMT iO AHR AHOT « A T DTYnRT (RPTYT PKD 

QKHS FLHDSQTS FCLSDSI PTPSNMEET 
OOKSNLELLRISLLLIE 


11867 


25768 


A 


11975 


138 


365 


LKFECHSTLCANHGKQTFTFVFYNKFLL 
KYTFYFSVLTYAINWYIKNLINVFKKKK 
KKKKKKKKKKKKNFKKKKN 


11868 


25769 

• 


A 


11976 


275 


433 


MS ILDL5 KARNFFLSFLETGSCS ITQAG 
VQWIjNHS SLQPQTPGLRDPPASASQ 


1 1 8£0 






11 077 


l\JO 




RCGLFFFETRAS PCRPGWHNLGS LOPLP 
PRFKRF 


11870 


25771 


A 


11978 


270 


408 


FENNVWLGRAQWLT P VI PALWEAQAGRS 
PGPENQNHPDYKGNPWPY 


I lo/i 


-O / /Z 


A 


1 1070 


99fi 


All 


OATjKFVI EMIjLSLKKCLDVSVI fnrhkk 
IELLQKKKKKKKKKKKXKKKKPSQKKKD 
SSRGKDS 


11872 


25773 


A 


11980 


2 


447 


GALAL^TTADVWRLNFLVSGLHWKRWL 
QQTSLSKWVKIKECSILKKKKKKKKKKK . 
KKKKKKKKDKK 


11873 


25774 


A 


11981 


124 


445 


KYGEMSQNPARGGPKFSKNQKYSEHLRI 

HCCPPFTFLISKKBIGDRKYSICKSGCF 

YQKKEEDWFCCPCQKTKTSRRAKSLKRP 
KQKPVAP PGGVKAPAKPRSLPRF 


11874 


25775 


A 


11982 


48 


429 


KESNGSQDRLLPKIHSPDVSGGKSGGMS 
QNPARGGPKLSKNQKS SEHLRIHCCPPL 
PFLISKKEIGDRKYSFGKSGCFSQKKKB 
DWICCPCQKTKLKGKIRPPPKKNGPGGS 
LNGRTTWVSGLFVHN 
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SEQID 
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correspond 
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peptide 
sequence 


Predict- 
ed end 
nude- 
otide 
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correspon 
ding to 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartlc Acid, 
J£=GJutamic Acid, J 4 — .rnenyiaJanine, 
v*— uiycine, it— jusnoine, l— isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
iN^ASparagine, r^iTOiine, 
Q=Glutamine,\R=Arginine, S^Serine, 
T^Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
a— unKnown, — otop coaon, /=possiDie 
nucleotide deletion, ^possible 
nucleotide insertion i 


I lo/5 


ZD/ /Q 


A 




41y 


500 


o JuJj V K.V c, K.y W tr\jf* V AiiAiJpJ f£> i lAitj f\» 


11876 


25777 


A 


11984 


362 


496 


LWSPIRILLLRWAWWHVPTVPAIWEASV" 
GRIjIjEPRSLCNMVRPIS 




Jo/ Jo 




11 985 


142 


433 


Rif VL.0 1 IKVJUKlJI^oIJK&NPvjKr LibloN 
SSLKKKKKKKKKKKKKKKKK3CKKKKKKK 

KEKFFFFGGGFIN 


11878 


25779 


A 


11986 


177 


2 


CQRRWVTLSHHTHS I TRPLGQLKKG WS P 
(aAVAcli. X WJPbJ? iiUsy W^WVT xVHKTRGK 
TR 


11879 


25780 


A 


11987 


178 


443 


DKKKKKKKKKKKKKKKKKKKKKAQGGLF 
KKI 


11880 


25781 


A 


11988 


329 


410 


LIFFFIFFERESHSVAQAIVQWHDLGS 


11881 


25782 


A 


11989 


146 


1 


DRAS ALQPGRQGRNFVS KNKNNNVI YI I 
NTLTTTIFLGTTYDALSPEL 


11882 


25783 


A 


11990 


240 


420 


GVFAPLLGDVSQSGYTGFRDPLEBAVCL 
FLELE CHAERTTALFRAVRQGCLS LQNL 
SVAF 


11883 


25784 


A 


11991 


239 


410 


GTLFIYLLGDLFIYFRNKSLTLLPRLEC 
GGVIVAHCSLDLLGSGYPSISALPSSWD 

c 


11884 


25785 . 


A 


U992 


126 


3 


YTQEYKS FCHKDTCTHMFTAALFTIAKT 
WNKPNGRVGGRV 


11885 


25786 


A 


11993 . 


300 


22 


SQLLGRLRQENHFNWGGRGCSEPRSCHC 
I PAWATRANS I FCGFQASS VEVRRS ARK 
KLFSD ILKRHNTI S WR VSGLLLVDS YPG 
RLATPVRAQ 


11886 


25787 


A 


11994 


303 


2 


EGEEIFLPSPPFPPGQENPGSNCSPPLF 
P FGTFLDI RVAGSHKVQAKGVFKKRP PS 
LHLFLI KKRFFFFFFETESRSVPQAGVQ 
WRDLGSLQTPPPGFKR 


11887 


25788 


A 


11995 


171 


457 


SFSDHLIIA3WIGFQ«NLSKIALCSSPIK 

NKNDDLQKKKKKKKK^ 

SYQDSS 


ll 888 


25789 


A 


11996 


357 


204 


EKTGF PHVGQAGVDLLTS RS TRLSLPKC 
WDYRHE PMRLAGHLYYF YPQMK 


1 1 con 

11889 


25790 


A 


11997 


219 


1 


PRVrwAPPPRYPPGALFWAPRJPVGVSLG 
ARAPTKAGPKQKGARGTFFFFFEMESHL 
SPRLECNGTIWAHCNLC 


11890 


25791 


A 


11998 


254 


1 


AHIjRGNRQLP KHTFFQ YMTTNIjKCAFS V 
GRQSYSIPWRSFYTSLFFKYSCVFHTHT 

rn'Lri'Li'ntJ'i'iJ'p'i vuv^TiurtrT t t wot o w/^o 
riirl 1 11 rHTnTIiXf QIM VLLiLiPSliRlVKGS 


11891 


25792 


A 


11999 


167 


2 


NFKSFFQGLSRGVLNPKVNLFFFLETES 
CSVSQAGMQWGDLDSLQPPPTRPPTRP 


11892 


25793 


A 


12000 


104 


3 


RPWTFFFFSETKSRSVAQAGVQWRDIiCS 
LQPPP 


1 1893 


25794 


A 


12001 


178 


2 


KI FQRGAKNS P WGLPFRGGENKKGAPP C 
DKGFGRPRFPFFTESRPTTOTGVOWrDl. 
GP 


11894 


25795 


A 


12002 


163 


282 


GIGGE WCLSKYVI KYVKKKKKKKKKKKK 
KKKKKKKKKKKKKICKKKKKK 


11895 


25796 


A 


12003 


195 


2 


KWGRI PLKKGGFCFTKKKSGQTLLIKSP 
PPAFFVFFIFFFLETESRSVAQATVQWC 
DLGSLQAP 


11896 


25797 


A 


12004 


152 


1 


CFVDVSAEKLRPKDFMTKTPKAMATKAK 
IDKWALTKLKSFCTAKETIIRV 


11897 


25798 


A 


12005 


359 


1 


EWPIWPPPKRRVLSKVSKQFFISAPIR 
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. peptide 
sequence 


Predict- 
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nucle- 
otide 
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correspon 
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peptide 

sequence 


Amino acid sequence (A=Alanlne 
C=Cystelne, D==Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l^Isoleucine, 
K=Lysine, L= Leucine, M^Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y^yrosine, 
X=Un known, *=Stop codon, /"possible 
nucleotide deletion, ^possible 
nucleotide insertion 










• 




. KKPFPCQVGLILAPFRVLLIGRPFFFFF 
GVLWEIQRWSFISLWVWRARIMMYA 
LFLLSVGLVMGFVBFSSKHCPVYGALVL 
VSIGGRV | 


11898 


25799 


A 


, 12006 


189 


12 


DGRLPGSVCYYLI FLRQSFTLLAQAGAQ j 
WCHLGS LQS P P PGFKRGRRVSRI RAVRG 
DQ 


11899 


25800 


A 


12007 


131 


3 


KI FFFFFFFEMESHS VAQAGVQWRDLGS 
MQDP PPGFMPLLFY 


11900 


25801 


A . 


12008 


150 


2 


IILFLVPIEAVFFFFETVAEAGVQWHNL 
SSLQPPPPGFKQSPASASRVA 


11901 

• 


25802 

• 


A 

■ 


12009 

• 


305 


3 


KFFFSLKAS IRGRWFGLFFTPPKKGFLP 
KIPHQVFKGGPLWBKLQLGKAGLNLGPY 
KGFFKGAARFFFFFFETKSRSVAQAGVQ 
WRDLAHASVHASAHAS 


11902 


25803 




12010 


263 


517 


DKGFQLPPRW CPGQDPLFFYGG I VFHGG 
YGPPFFFFFFKAESRLVAQA0VQWQDLG 
SLQPPPCRTRGSP 


11903 


25804 


A 


12011 

■ 


107 

■ 


374 


WMGTWGSLPVLLTTRSTCPPRS PRLHRR 
TGATIWIFFRLFETGTVLPMLECSTVIM 
AHCRLQ LPGS S S S PTIAS Q VAGTKAHLS 
DCFVY 


11904 


25805 


A 


12012 


144 


261 


FPKKINNFYFTFLQDPTGI FSLDKTIGL 
GTYGRIYLVS 


11905 


25806 


A 


12013 


371 


3 


IFHLRKIFTFLPGIFWPIRLLSEKKKPK 
RFWVKI KKFKNG YPLKNP IN PKY KG P RF 
FFKGEKNPNGKVPVLKKIKIPSFSKALG 
FLQKKASKPFFFFFFDTES CTVDQSGVQ 
WCDLGSLQAP 


11906 


25807 


A 


12014 


101 


3 


RGAFFFFFFETRSRSVIQAGVQWCDLGS 
LQAP 


11907 


25808 


A 


12015 


205 


2 


VFNPSGINVMYVDAIILNKRLATRIQQH 
IKQRI LHDQVGF IHGMQGWFS IRKS INV 
IQHINRPKDKN 


11908 


25809 


A 


12016 


168 


1 


GCVCQFQKKLGNGGLNGFFFPLFFFFFF 
EMKSRSWQAGVHWRYLSSLQTPPPEFK 


11909 * 


25810 


A 


12017 


311 


2 


RGLRFGQWKTQMENPNTP FSALRPGQLS 
S I RS GL LHTYPGLGDR5 E PLS CSILSSS 
KYWWQAVTSALSSNKPGASQGHWKDDF 
FLFFETESCSVTQAGVQWH 


11910 


25811 


A 


12018 


383 


247 


LVEWGFRHVGQDGLDLLTSQSAHLGLPK 
WWD YRRE PPRLAT I EVL . 


11911 


25812 


A 


12019 


2 


379 


RVLWTAHLAAMAAGSRTSLLLAFALLCL 
PWLQEAGAVQTVPLSRLFKEAMLQAHRA 
HQLA I DTYQE F I S S WGMEAY I TKEQKYS 
FLHDSQTSFCFSDS I PTSSNMEETQQKS 
NLELLHISLLLIES 


11912 


25813 


A 


12020 


3 


389 


PE PLRVLWTAHRGAKGAGS RTSLLDGFA 
LLCLPWLQEAGAVQTVPLSKLFDHAMLQ 
AHRAHQLAIDT YQE FEET Y I PEDQKYS F . 
LHDSQTSFCFSDS I PTPSNMEETQQKTN 
LELLRI FLVLIELWLDP 


11913 


25814 


A 


12021 


247 


386 


PMLGHVS QSGGNG VRDPLEEAVCPLAKL 
KHCSGRSTALFRAGRQKR 


11914 


25815 


A 


12022 


17 


371 


PLRDLWTAHLEAMAPS FRTDLLLAYALL 
CLP WLQEARAVQTVS L YRLS DHAMLQAH 
RAHQLAIDTYHEIEETYILKDQKYSFLH 
DSQTS FCFSDS I PTPSNKEETQQKFYLE 
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LLPISL 


1 1Q1 < 

i ► 


ZjoIO 


A 


* 






LRALWTADLGAKAPGSRTFLLLASALLC 
LP WT iFIK AGAGQTVPLS KLFDHAI LQAHR 
AHQLA.IDTYQELEETYI PKDQKHS FLLD 
SQTSFCFLDS IPTPSNMEETRQKSNLEL 
FRTSLLL 




/Jo l / 


A 

** 


17074 


2 


363 


PEPLKGLWTDHLGALAPSSRTSLLLAYA 
LLCLPWLQEAGAGQTVPVYTLFDHAMLQ 
AHRAHQLAIDTYHELDETYMPBDHKYSF 
LHDSQTCFCFSDSITTPYNMEETQHTSN 
VELLRISL 


11917 


25818 


A 


12025 


314 


393 


GQAWWLT PV I PAF WEAKGG RS LEARS 


I iyiB 


KQ 1 Q 




IZlrZO 


ou 




PLiIACSFFLFLFFFFGKKILFLPPRWKG 
GGG I WVNGNPGFRGQAFFLAS S S KKPGM 
GGPPLTPGKRFVFLKKKGFSPGGPGGSK 
FSAPGTPPTGPPKGWE IPAK 


i 


Z JOZU 


n 




i &s 


£. 


WOKLLFHFGTESCSVARVGVQWRHFSSP 
KPPPPB FKQLSAPAS RVADRRPPDAW 


11920 


25821 


A 


12028 


1 


338 


VF VALKFLMCLLS VC I FS LENI YSNS LP 
OKNCLPLI FFFLFFEMEACSVTOAGVOW 
GDLGSLQPPVSHLNLGGGGCCEPRECHC 
TPCSRPGDRDFVFNKNTMIQDNHLMELT 






2i 




o 
z 


us 

» 


HEERERERERERERERERERERVGRTVG 
GVGETYKAELPRVGGAAQKRAHFSARGR 
LFMEICGDMCGEKPGKPPHSLTEECLSR 
CGERFFDTS LAITRGCAQS V 


11922 


25823 


A 


12030 


216 


3 


ERIIPRGVRQRNRLFSGRDTASPPLFYP 
PPRQ IHKRGVEDREHERRGERERERERE 
RERERERERARAAR 


11923 


25824 


A 


12031 


114 


329 


QTBRNS IN INKKD IHTKTPSVGHQHQRP 
KVDKTTKMGRNQSRKAENSKNWKASSPP 
KEHNS S PAREQNWMES 


11924 


25825 


A 

■ 


12032 


177 


1 


IKKKTFFWQNIPLFSKKKTCRQKFFFFF \ 
FFETGSLLPRLVCSGAVLAHCSLCLPGS ! 
AFL 


11925 


25826 


A 


12033 


127 


1 


PSFFFFFFFNETESRSVAQAGVQWRDLD 
SLQAPPPGFITPSC 


11926 


25827 

a 


A 


12034 


47 

• 


314 


GAPVAS VS I S CPS CS ATDGWRNGKSTA 
GHQRYLCSHCRKTWQLQFTYTASQPGTH 
QKIIDMAMNGVGCRATARIMGVGLNTIL 
RHLKN 


11927 


25828 


A 


12035 


15 


408 


GAIPGAMGHFTEEDKATITSLWGMVNAB 
DAGGETLERLLVDYPWTQRFFD S FGNLS 
SASAIMGNPKVMAHGKKVLTSLGDATKH 
LDDLKGTFAQLSELHCDKLHVD P ENFKL 
LGNVLVTGMAIHLGKEFTP 


11928 


25829 


A 


12036 


122 


2 


KLVDVLPPPGGARVFFFFFFPLETEHCS 
VAKAGVQWHDFG 


11929 


25830 


A 


12037 


154 


1 


GKPPPKLGFWGGPRQRVRGRGPPFFFFF 
ETESPSVAKAGVQWCDLGSLHPL 


11930 


25831 


A 


12038 


58 


254 


DPRVRQQRTTAHCSLNLPGLQRPPSLSL 
PSGWDYRHAPLCATQKILAYRVAQRERS 
AHGSYYQAS 


11931 


25832 


A 


12039 


270 


124 


DG VLLIiL PR LE CNCAI S AHGNLRL PGS S 
NSPASNKTKQNNNKKTLSNKF 


11932 


25833 




12040 


63 


355 


LGRGTAHRRLWRPLSRPASRVSYLSAAT 
NKRSFAPPSRAFPWDNKWERGGFYFFET 
GSHRRRPGRMECSGAITAHCSLDFPGSE 
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. MGFTMFPSLVSNS 


11933 


25834 


A 


12041 


184 


3 


GVS RIiLGGVSQLG YMGVRDPLEEAVCLF 
S KL KHH PGRTTAL FRAVRQGCLS LQKFL 
LLFA 


11934 


25835 


A 


12042 


118 


3 


DRVLLCYLGWECSGTISAYCNZiHIiFGSK 
RFSCLSLPSS . 


11935 


. 25836 


A 


12043 


40 


436 


LSEGKLTNRKDNHTKTPSVHHHHQKPKV 
DKTTRMGRIQSTKAENSKNESTSSPPKE 
HS TS PAT EQS RMEND FD ELREDFRRS VI 
TNFSELKEDVRTLCKEAKNLEKRLDEWR 
TRINSMEKTINDLMELKTMA . 


11936 


25837 


A 


'12044 


101 


2 


HLGQVRWLTPVIPILWEAEVGGSPEHRS 
SRPAW 


11937 


25838 


A 


12045 


123 


3 


LTVIKELPLGQARWLTPVIPALWEVKVG 
GSPEVRSSRPA 


11938 


25839 


A 


12046 


3 


432 


PIFGGGGERIFPFFFFFSQGGGVFFPKT 
KNKGFFP FIjGFFKKI FLRNF FFLFFP FF 
FLNPFFYFGAPPFFFFFPGGGFFFPLFS 
RFFKFFFQNFPQGFFFGGGFFFFFLGFF 
FFFFFL 


11939 


25840 


A 


12047 


325 


164 


KNKRKNRGQARWLTPVIPALWEAEAGGS 
PEVRTTGVSHGTRPHLLLNTVSEYX 


11940 


25841 


A 


12048 


187 


381 


IiMDKRVSLWGDENILKVESNSGCTTLYL 
LVCFEMESRSITQAGVQWRDLSSLQPPP 
SRFKRFSCL 


11941 


25842 


A 


12049 


84 


464 


SYRVPSHPDTLVIiSRISAQEAGEKSPFC 
FPERVWPCPRPLSDLGRRLKLECGPDLL 
DSTFLS FFFGEGLKTGSHSGALE CSGVI 
RDHCGLCLSGSGDPP I SACLRVAGTTGV 
SHHCI FCRQDLPGS 


11942 

• 


25843 


A 


12050 


232 


20 


LLQCSSRAKIHTSLTGNQKLEMIKLSBE 
GMSKAEISQKLGLLHHLGLVANAKQRFL 
KVIRSATPEFRHAE 


11943 


25844 


A 


12051 


401 


518 


FFETESCSVTQAGVQWRNLSPLRA 


11944 


25845 


A 


12052 


191 


1 


LFPLKKEKI FFLGPGTNGAPPMFFLKAP 
PLFFFFFFFFFFFFFFFFFFFFVNLFLL 
LSFLHILEP 


11945 


25846 


A 


12053 

• 


12 


369 


PLPIYSACIEVGSNPQGPGIDAKSVSEN 
NCYLKE KKKKKS NG S QNNLLP KIHTPNG 
SGKKTGEKIQNPPRGGPKLSKNQKYSKN 
LKKHCCPRLTFL I S KRBKGNRKYTF3KS 
GCFYQKK 


11946 


25847 

• 


A 


12054 


89 


513 


NFTVRGS IPLLNNI PMANLLLLIVPILI 
AMAFLMLTERKILGYIQLRKGPNVVGPY 
GLLQPFADAIPLFPKEPLTPAPSAFALC 
FAAPALALP FALFLCP PRPLGRP 


1 1947 


25848 


A 


12055 


158 


3 


SQLTWPIMAQTDSMdiWIVYFDDNIGWA . 
WWLMS VI PTLWBAKVGGSLKHSN 


11948 


25849 


A 


12056 


113 


1 


KSHGGHGVEVLVSVLVGLAMEVGLVLWV 
KEYDGAALA 


11949 


25850 


A 


12057 


44 


263 


AR I ARS AHEGKMPRKYTGARKNAENRPQ 
RELQLKASRSTIDLAEHACNALWYQLFL 
ISVGSWKNYILFYLTYT 


11950 


25851 


A 


12058 


142 


1 


KKI FFRYKKEGFS FPKFFRI SLFFFFFF 
ETESRSVAQAGVQWRDLGS 


11951 


25852 


A 


12059 


12 


347 


QTERNS ININKKDIHTBTPSEGHQHQRP 
IVDKSTKMRKNQCKKAENSKNQSASSPA 
KDHNCLPAKEQNWTENEFHKLTEVGFRI 
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ITNSVELKEHILTQCKEAKNLEQRLBEL 


11952 


25853 


A 


12060 


245 


3 


MPEGSFFLBIPEITRVFVKKPGKFLGGV 
GPSLLFFFPFFFFFFMROSLAVSFRLEC 
NSAISAHCNLCLLGSSDSRSSGSR 


1 1 y j j 




A 


19061 


94 




KGGI FFFFFFFFLROSPSVAOAGVOWCD 
LQ 


1 1054. 




** 




1 OR 
1UO 

• 


c 

D 


KmjT.C CflVR DO PGOHGF.TTS TjT .K T DKT , AP 

HGGRHL 


11955 

■ 


25856 


A 


12063 


298 


1 


kkkk i nthaq krg p p ff fkpp pekgkap 
ppqrvtkkggIbkdsfipkrrekknpppl 

C P PF PT .VP VHVP T?P 

OCCCf UVAVIIV lUlJUMWUWUUtKfiRfilUi 

ERERERERERERERAR 


11956 


25857 

l 


A 


12064 


122 


1 


VCEGVCVGGWVRDRERERERERERERER 
ERERERERERAR 


11957 


25858 


A 


12065 


124 


1 


ATCTKFRNDNRALLRVRVFERERERERE 


11958 


25859 


A 


12066 


48 


351 


FFFFFFFFKKGPKIVPPGGGGGGDPIFL 
EPPPPPLKKFWGPPLRGRGNRGGGPPGW 
VNrvjl FLKKKGIjPWGPGGVxvTPAFKGSP 
GPTPPKGGNNRKNPPPW 


1 io<o 

■ 


25860 




12067 


1 


365 

* 


Ka X Kit 1 VJbW I AriLiVAMAirtjoK x SijLujDr A 

UjCLPWLQEAGAVQTVPLS RLFDHAMLQ 

LHDCQTSFCFSDS I PTPSNMEETRQKYH 
LELLRISLLF 


11960 


25861 


A 


12068 


174 


1 


P AWVMQHNP VS LFFCFFETESRS LTRLE 
CSGTISAHCNVRLPGSSDSPVSPSRVAA 


11961 

i 


25862 

• 


A 


12069 


270 


375 


TRIKRCNGGRAQWLTPVIPALWEAKAGE 
SPEVRSS 


J 






1Z0/U 


15f7 

i 




jrVTVT TTT TTKTTTMTM T T DT7T\T T^XTT T 17U 
JXlVl I If Jji 11N IXulULil ±c£ JjiiiJJN i.iAjr W 

LGAVAHACNP S TLGGRGG W I TRS 


1 \ QA1 
1 l?0j 




A 


1 


■j 
i 


J/o 


LLAIDLLCLPWLKEAVAVQTVPLSRLYD 
HAMLQAHRAHQLAIDTYQE IEETYIPKD 

LHKSHLELLRIYL 


11064 

* 


95R65 


A 




919 


i 
i 


GGGPPKKWAGVQKWVFFFFFFFWEMESC 
S VAQAGVQWYDLGS PRA 




9SR66 

£ JOOO 


A 






a 


VFGP P PF P FFFCE TESHS VAOAGVOWRD 
LCSLQAPPPGFMPSC 


11966 




A 

** 






3S7 


MP INOP VKKMCVC VCVCVYT Y THTPTj YT 
RVYMYMCMYI CVY IHVYI WMC I Y I HTHV 
STYVW 


11967 


25868 


A 


12075 


193 


350 


KI PHLTS L YHTQNYLKYC IGQAWWLTPV 
IPALWEAEAGGS PEVRSSRPAWPW 


11968 


25869 


A 


12076 


252 


1 


GVFGPFPKKGGFFWVKGFLGTPQIFWVG 
VFFKKKGGALWVKGLLGKPPPPPLFFFF 
FETES CS VARAGVQWRDSVKKKITIARA 


11969 


25870 


A 


12077 


273 


366 


I FI YLFI YLFETES CS VSQAG VQWPNIiG 
RLR 


1 1970 


25871 


A 


12078 


127 


2 


KKKKTLFFFFFFFFEMESRSVTQAGVQW 
CDLSSLQPPPPIiV 


11971 


25872 


A 


12079 


206 


334 


HNRVTIVNNNLIVHFKITKRCWTQWLTP 
VI P ALWEAKAGGS S E 


11972 


25873 


A 


12080 


135 


2 


KMKRKFGEDSQ I PKTLQTFFFLTESRS V 
AQAGMQWCDLGS LHS C 
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11973 


25874 


A 


12081 

M mm* W ~mW a 


147 

m\ V V 


3 


ATTPGLFFI FYFFETESHS IAQA0VQCM 

isvhcnlclp'gssdspvlv 


11974 


25875 

mmmw W f mf 

i 


A 


12082 


106 


2 


YFI PINEGYCFI YLFFETESRSWQAGV 
QWCDLV 


11975 


25876 


A 


12083 


164 


346 


CPYKKRKKANQAKWLTPVIMLFGRPRRV 
DRPSSGVQDKPGQHGETPSLPKIQKKSQ 
AWWCA 


11976 


25877 


A 


12084 


186 


1 


KSLFSKGNPLFKGPPPPPFFFFFFFETR" 

SRSVTQAGWGAVIMAHCCLNLPGPSDPP 

TPAPRA 


11977 


25878 


A 


12085 


98 


2 


cpffrvnffffemesrsvaoagvoWrdIj 

GSRA 


11978 


25879 


A 


12086 


154 


2 


SPTPEKGVWELPSPFFSWGPKIWGCIFF 


11979 


25880 


A 


12087 


326 


3 


KTSVITCGLPMGWSWCiP^l>K6PWWC3CKF" 
CVCKFWANQTLKKNLGAGRFQKPFSAAA 

PTiT?MRm?P P*l?fV5QOT?PT7T7PPT?13'P 

f JJOrlwU* XT £ XV* X" i\C c uuoyc CCCvlfui 

ESCSVTQAGGQWCNLGSLRPLPP 


11980 ' 


25881 


A 


12088 


154 


3 


KE FS FFAPGGKQRGE I RSLRGPP PQVKP 
LFFFFFFETESRSVAQAGVQWR 


I 11981 

1 I 70 1 




A 


UU07 






HTHTHTHTHTHTL YSQ I MVLLLPSLRKK 

GS 


11982 


25883 


A 


12090 


265 


30 


WFIVYSEISQLWGLPSFNTLGTWQSLSF 
IFVAEMRSCCVTQAGLELLASSDPPVSA 
S 0 SARI RAMS PS VAWVI PGRS R 


II983 


25884 

- 


A 


1209 1 


257 




GS VMRMHTE EOYPENKVEQS S SGFT RPH 
LVYRVCFYLSVCLSVCLSIYLSIYLSIY 
LSFFLSFFLLRKSLYLSIFLSFFLLSPR 
V 


11984 


25885 


A 


12092 


311 


404 


LYHQNPWLYSVFFFIjETESCSVAQAGVQ 
*WCD 


11985 


25886 


A 


12093 


331 


83 


GDDYKGARENSRDDKLFYVLIMMWIQL 
HVFVKIHKTIjQLNGYILLCKLYLINLTK 
NNFQKPKTNSQKTLHIFHPDSTAANI 


11986 


25887 


A 


12094 


82 


2 


LREPNLLNPGGEGCNEARWCHCTPAL 


11987 


25888 


A 


12095 


255 

muwrnJ mf 


393 


HGI LFSS FVCNLNQFLSQHWLMPVI PAL 
WEAEVGGSLEPRSSTLAW 


11988 


25889 


A 


12096 


80 


3 


PFFFFFFFFETESRSVARLECSGMI 


11989 

i * mw 


25890 


A 


12097 


367 


i 

■ 


PRVDKPTKMGKKOGRKTGNS KKO SPSAP 
PKERS S S PATEQS WMENDEAKLREEGFR 
RSNYSELQEE IQTKGKEVETFEKNLDEC 
ITRI PNTEKCLKELMELKPKAQELREEC 

mm *w mw ^ mm* mu mm* M> m-mf m- » V* mmJ A 1U AJlVfc * ^ atf %p 

RSLRSRCDQL 


11990 


25891 


A 


12098 


94 ! 

• 


236 


MWADYFCMSPSIDEGLKKKKKKKKKKKK 
KKKKKKKKKKKKKKKGGAP 


11991 


25892 


A 


12099 


410 


212 


NSLHPSTSLFRFSSHFHIKLISGQAQWL 
TPIIPVPWEAKAGGLLETOSSRLAWAAQ 
EDPISMPVCK 


11992 


25893 


A 


12100 


140 


324 


NLG QAQWLMP V I PAR P WEAE VGRS LAVR 
SSSNSPASASQSAGPTGPSHHNWLMFLQ 
LLTRL 


11993 


25894 


A 


12101 


247 


357 


KQIQGRAWWLTPVFS PLWEAKAGES PEV 
RSLRPAWPT 


11994 


25895 


A 


12102 


245 


2 


DVLVGGLCLPAGGTLHLTS IFLLSGCWC 
GSKDEEAPC KQR I SVQRESQ SRTLRAGV 
SPKKAHPCEMCGLILEDVLHFPDLQ 
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25896 


A 


12103 


230 


379 


KEAS FFETESHSVSOAGVQWRSLNPLGS 
GNPPTSTSRVAGTTGMSHHVWL 


1 1 77v 


25897 


A 

mm 




86 


2 


DGVS VLS PRLECNGL I LAHCNLRLPGS S 


11997 


25898 


A 


12105 

* 


116 


2 


SQKTPFFFFETVSLLLPKLBCNGAISAH 
CNLCLPGSSD 


11998 


25899 


A 


12106 


271 


416 


QFTFSKAFNLVFIKPSFHISFYRTHYSQ 
AWWLTPVIPELWEAEAGGSP 


11999 


25900 


A 


12107 


194 


3 . 

* 


HVLGTVL VAEDTASNKI RSLQGCLLLP"E~ 
LFSWLEVLARAIRQEKEIKVIQIGREE 


12000 


25901 

• 


A 


12108 


142 


3 


LCLL Y WDC KRHAWWRG Q VRWLTP VI PAL 
WEAEAGGSPEVRSLRPAW 


12001 


25902 


A 


12109 


233 


1 AC 


nUKoijlijWrt.UVV.LiMfc' V XrrtJjMfcAElHJjVor' 

EVRSLKPAWP 


12002 


25903 


A 


12110 


234 


349 


GFSPFYDFCLFVFETESCCWQAGVRWH 


12003 


25904 


A 


12111 


136 


333 


RLNFF YFYF FETAS CS VAQAGVQWHDLA . 
HCIPAWETEQDS ISKNTVQKRKKKVTRA 
GISKTKNENF 


12004 


25905 


A 


12H2 


345 

• 


110 


QDGLCLVMTLQETQPILAYSLWFPXVFP 
TTKACNVQGDS KFIjIjKKKYIiGQVQWLM P 
VI PTLWGADVEGS PELRSLE PA 


12005 


25906 


A 


12113 


A AO 

108 


285 


xNAMKrlRr JjKI lijWWlUNy luWAKWUMAV 
I PALWEAKAGRS PEVKS SRS RPGAVAQV 
DAA 


12006 


25907 


A 


12114 


1 


364 


RWAAEMGKFMKPGKAAtaDLAfiRYSGRK 
AVI VKNIDDGTSDRPYSHAL VAGI DRYP 

QLMPTRYSVDIPLDKTVVNKDVFRDPAL 
KRKARREAK 








121 ID 


12U 


J 


QWCSLGSLQA 


12008 


25909 


A 


12116 


120 


3 


TQIWGPFLTFFFFFFFEAKSHSVAQAGV 
QWCSLGSLQA 




25910 


• 


121 17 


231 


1 


LGSTPEAAAQ WQWVS FADSD I VPPAST 
WVFPTLGI MHHNKQATENAKE 






IV 
M 


111 to 
12115 


2lO 


i 
1 


FFWPRFSPKTFFFFFYEKEFHSAAQAGV 
OMCDIiGS LOP LPRG FK 


12011 


25912 


A 


12120 


267 


1 


KKKKKAFPSPPLLGCFIRLQKHLLGILV 

NTHSLLANI VP IAI LVFAACEAAVGLAL 
LVSIS 


• 




A 


19191 


ID/ 




I SGODLL KETKRVKRP FOODDVP FIN I F 
VPTPGAPRSLRQILELRGKTDPSPI IVG 
DFNTLFSALDRS 


12013 


25914 


A 


12122 


124 


2 

mm 


GLLKFNI FAPFFFFFSETESHSVTQAGV 
QWCDLGSLQPPP 


12014 


25915 


A 


12123 


121 


1 


RFTAASANSGAVSAAVGYMGDPGSE I IE 
SVPPAGPVSSW 


12015 


25916 


A 


12124 


176 


2 


RDLRGFSRFSNYKFWNFGRVGGIFKFVF 
GKGVKRFFFFFFETES CS VAQAGGQWRD 
LG 


12016 

■ 


25917 


A 


12125 


137 


2 


KLGFFGGGGGKI FAPQKNFFFFFFFFFE 
MESCSVAQAGVQWRDIG 


12017 


25918 


A 


12126 


103 


3 


LEENLGNTIQDIGRGKDFTSKTPKAMAT 
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SEQ ID 
NO: of 
nticicoiiac 

■ 


SEQ ID 
NO: of 
pcpuue 

• 

■ 


M 
eth 

nH 
uu 


SEQ ID 
NO: Id 

09/515 1 
26 


Predicted 
beginning 

nil* lanfri ft a 
lorn fin n 

correspond 

■no fn fifC'f' 

■■■K IU 111 31 

amino acid 
residue of 

nentiffo 

sequence 

i 


Predict- 
ed end 

IiUCIc- 
VIIUv 

location 

ding to 
last amino 

nrirf 
nuu 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCystelne, D=Aspartic Acid, 

IT— 1 nf*Q m if* Apirl E^PfiPn vlnlanina 

JC^^VJfl UmiU IL AUUy X * UCIIY IBIttlllilCi 

C2=f^lvcine. H=Histidine. I=L«?o leucine. 
K=Lysine, Lr=Leucine, M=Methionine, 

N=A 4na rapine P^ProIinc 

Q=Glutamine, R-Arginine, $=Serine, 
T=Threonine, V=Valine, 
W=Trvntonhan Y=Tvrosine . 

X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \-possible 
nucleotide insertion 














KAKID 1 


12018 


25919 


A 


12127 


212 


351 


LITFEILNICLGDNILDLGLARWLTPVI i 
PALWEAKAGGSPEVRSLR 


12019 


25920 


A 


12129 


187 


3 


RGSTMQQTNTRT PG VRVLYYI LMIATMA 
T P Y I PMANLT.LI1T VP ILIAMAP I MLTER 

KILGY 


i imn 
J2U2U 


23^21 






13 


3zy 


n ^T?\7T T WVM7X <?N <?TTCS FT . ADAR VRFOFT. 
nun V 1 X rv VPinOLv O 1 Zwr ItnlJnU X UAvAU 

DANSALMELBKGLRSGKLGEQCEAWRF 
PRLFOKYPFP ILTNSAPTiKLADVPRVGN 
NFLRLCVIiKVTQQSEKHLEKI 








IZlol 






SPEVR 


12022 


25923 


A 


12132 


95 


2 


KGKCFPPFFFFETESRSVAQAGVRWRDL 

call 


12023 


25924 


A 


12133 


129 


1 


DLALLPRLWCSAMIMAHCNSELMSSSDP 
PISASQIPGTIGMCH 


12024 


25925 


A 


12134 


196 


301 


GGWAWW PTP VI PALWKAKTGGS LE PRS 


12025 


25926 


A 


12135 


120 


1 

1 


nun opt cccDUTCPCovcifA AAPxrnuPM 
FjVPFkrlir r t r r r r» 1 Cib 10 v AUAtoVUWUM 

lAJwitnirririr or 


12026 


25927 

* 


A 


12136 


235 


3 


KKGIRIERDRLKQLSVCQRFQLAKFPPF 
penvft p w nMPPPK inoc inrtf op aowt/ppv 

I PALWEAEAGGS FEVRSSRPA 


i inn 




A 






i 


TWfPPPPPftPOTP^PPrHPPFPPFPPPP 

LEAETRS FAQAGVQWCDLGS LQS SWVTE 
LAAAS 


12028 


25929 


A 


12138 


158 


1 


LRRGGVFS I FFCGGTMVLS PADKTNVKA 
AWGKVGAHAGE YGAEALERMFLS F 


1Z029 


25930 




12139 


70 


3 




12030 


25931 


A 


12140 


157 


2 


FFSSPRIiLKRRPGNFLGARENFFFFFFS 
ETBS PS I AQAGEQWRDLGS LQAP 


12031 


25932 


A 


12141 


234 


388 


LSQLMS ANLFFVLLFEAE FHS VNRLECS 
GMIWAHCNLHLPGSSESPASTSQ 


12032 


25933 


A 


12142 


1 in 

1 12 


453 


LtoKK^ AAbMlvby 1 £J 1 H V uynu Vn J.tyJ>tAU 

WELYCLEHGIQPDGHMPSDKTIGGGDDS 
FNTFFWETGAGKHVPRAAFVDLEPTAID 
EVCTGTYRQLFHPEQLITGKEDAANNYA 
RG 


12033 


25934 


A 


12143 


44 


443 


aVT/'TDVT.DT VJ VMnVTT TTT .TXTTITTTTniT 

/UvLaj lXiujlruJS^\J\nl7XSjB* JL Jl JUXJN x V v lua 
VDWGKTTTTGHL I YKC3GG I DKRTIEKLE 
KEAAEMGKGSFKYAWVLDKLKADRERGI 
TIDISLWKLDTSKNYVTIIDAPGHRDFI 
KNMITGTSHADCAVLinAAGV 


12034 


25935 


TV 

A 


12144 


3 


3oo 


KlL/i/i x L V 1JXH ^ X X Jtl V VJrtrtSj V yjN uNil^ W IS Xj 

YCLEHG I QPDGHMPS DKT IGGGDDSFNT 
FFSETGAGKHVPRAVFVDLE PTG IDEVR 
TGTYRQLLHPVQL I TGKEDAANNYARGH 
YTIGKEIIDLVLDRIR 






n 


lZltJ 






AGATYIDRIiRVALFEATTFTLWVRPELS 

/lUili A h m r F*i 1 f li V ft 1 1 Jw Ural X X> A UK V IVcXiAJU 

SGEATTMRECMS IHVGQAGVQNGNACWE 
LYCLEHGVQPDGHMPSDKTIGGGDDALN 
TFFSETGAGKHVPRTEEVDxiEPTEIGEE 
GTGT YRQLFHP EQHMM 


12036 


25937 


A 


12146 


3 


385 


GRAT Y I DRLRVAT J iKATTS TLRIiRRELG 
S REATTMREG I C I HVGQAG VQNGNACWE 
LYCLEHG IQP DGQMPRDKTIGGGDD FFN 
T FF I ETGAGKHVPRAVFGDLEPTDIGE I 
RTGTYRQLLHPEQLI 
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SEQID 
NO: of 

nnrlpntiHf* 

IIUtlEV UUV 

sequence 

• 


SEQID 
NO: of 

sequence 


M 
eth 


SEQD) 
NO: In 
USSN 
09/515,1 
26 


Predicted 
beginning 

nucleotide 
location 
correspond 
ing to first 
amino acid 

rrairlnp nf 

■ vOIUUC Ul 

peptide 
sequence 


Predict- 
ed end 

nude* 

otide 
location 
correspon 
ding to 

Inct nminin 

1 iXO L (1 III III U 

acid 

residue of 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartlc Acid, 

|T=r^ 1 11^51 mir Arid IfePhprtvlfllnnlnA 

l^^W 1 U ill III IL LIUj A A. llwlJTIIIUUIIUCt 

G=Glycine, H=Histidine, l=Isoteuctae, 
K=Lysine, L=Leucine, M^Methionlne, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

A A Mi CUII 1 lie, T ▼ nillic, 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, A=possible 

n im\ontir\a t\e*\t*iif\n l=r\AQCtHlA 

nucleotide insertion 


12037 


25938 


A 


12147 


11 


381 


HTPEGRTRVPVLIAGWYCQEALRDWGR 
VTAS S TGAMAF LRS MWGVLTAI/3RS GAK 
LCTGCGSRLRSPFSFVYLPRWVSSVLAR 

TTELIRRIAQR 


12038 


25939 


A 


12148 


321 


509 


YSMDQ YFI PFYCQIMHLCFFLKTRVSLL 
F P WLE CS GATS AHCNFCLLS S SNS PAS A 
SRVTGIT 


12039 


25940 

i 


/t 


12149 


51 


ai\ A 

400 


AA 1 PIKE* L. X £> 1 n VviUiMj Vrll UfiAUW&Jjifc. 

LEHGIQPDGQMPSDKTIGGGDDSFNTFF 
NETGAG KHVPRAVFVDLEPTVIDBVRTG 
TYRQLFHPDQLITGKBDAANOTARGHYT 
IGKE ■ 


12040 


25941 


J\ 
A 


12150 


25 


399 


EATTSTI^LRHQljGSRESTArlRfiCISIH 
VGQAG VHI GNACWE L YWLEHGLHPDGQM 
. PGDQTIGGGDDSFDTFFSETGAGKHVPR 
AVLVDLE PTVI DEVRTGTYRRLFHPBQL 
ITGPEDAANNYAR 


12041 


25942 

• * 


A 


12151 


4 

1 


402 


TKEEELCLACRQRRAEXjGWRSQMNAGSD 
PVGI VSAARTI IGS FNGALTAVPVQDLG 
STVI KKVLKRATVAPEDVSEVI fghvla 
AGCGQNPVRQASGGAGIPYSVPAWSCHM 
ICGSGLKAVCLAVQSIGIGDST . 


12042 


25943 


A 


12152 


3 


400 


1 UKJjK V ALiKK.1 1 Lbtoc CrPHHliGSRJSTA 

TMRECIS IHVGQAGDHIGYACWELYCLE 
HSZjQPEGQMPGDKTIGGGDDYFNTFFSE 

lunV JMlVcrxCnvr VL/JjCFX V ±UCt v&Xvja X 

RQLFRPERLITSKEDAVNIY 


1zU4j 


235*44 




111 o 


*> 




LMDMDMS PLRPQNYLFGCKIiKAEND YHF 
I VANDENEHQ I SLTTASLRAGANDDMN I 
VEAEAMNYEGTP I KETLATLKMSVQATD 
SLGGSE I TPPSVLRLKCGSR 














ANAMLDFVTTVEDPGGWDSKNLQKKWSH 
YSFLTGIRPKI 


12045 


25946 


A 


12155 


229 


397 


TK1AHHKAGFALISKKITKNLKMFLSKF 


12046 


25947 


A 


12156 


2 


399 


GGVPHCVWATAWGMRPGLPGPTGLCAQT 
SSRGQKS VLKQKES CGIWQLYHFLSRKQ 
E PRWE PCVSGS S SGDGAVADLADELRGY 
PALCGTLPVHSYRSWAGIRPOTMNGPIjH 
PRPLVALLDGRDCTVEMPIL 


12047 


25948 


A . 


12157 


85 


414 


ALLPQS EALQGAVTMPHS YPALSAEQKK 
EVSD IALRI EAPGKS I LAADES VGSMAK 

iUJCvXU V £*1Y J. OOwftUy A Ivy V Xif pnUUjUi 

KKCIGGVI FFHBTLYQKDDHGVPFVR 


Invito 






nice 


55 


Am 

*Hl/ 


fiTVTtPH^FPI/TRAVTMPHCinPAT.YZlFWFF" 

DVSD IALR IGS PGKG I LAAYESMGSMAN 
PLNQMGVENTKQNPJRLYRQDLFRAEDRE 
KKSLAGVI FFHDTLYHKDDNGVP 


12049 


25950 


A 


12159 


85 


406 


VLLPHSEALEGAVTMPHSYPALSGEHNK 
ELSD IALR I VAPGKGI LAADESVGSMAK 
RLSQIGVENTEENRPLYRQDLFSADDRG 
KKSIGGDLFFHDTLYQKDDNGGP 


12050 


25951 


A 


12160 


275 


146 


EKTFI ISFFFFFETESHSVAQTGVQWHD 
LGSLQAPPPWFRRIA 


12051 


25952 


A 


12161 


212 


2 


KRSRPFFFFFKEMGSCYIAQVGMQWLFV 
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SEQ ID 
NO: of 
nucleotide 
sequence 

■ 


SEQ ID 
NO: of 
peptide 
sequence 

• 

• 


M 

eth 

oa 

• 


SEQ ID 
NO: in 

09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspono 

ing to first 

amino acid 

residue oi 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptiae 

sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
ii^vviutamic AciOy i*— Jroenyiaiamne, 
G=GJycine, H=Histidine, I=Isoleucine, 
iv— i^ysine, Lr-jueucine, m—ivietnionine, 
N=»Asparagine, rM*roline, 
Q=Glutamine, R=Arginine, S=Serine, 
*— inreonine, v— vaune, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stopcodon,/=possib!e 

n a!aa4ij1 A /Ia1a4iAA ■ ■■' >V #%, a M ■ ft» |j* 

nucleotide deletion, v— possmie 
nucleotide insertion 












• 


bill At I UetSuLAJii iSUctr 1 oAbQV IvirTx 
GTHTHTQLI FVFLV 


J21D2 

I 




B 
A 


12162 




2 


PDUHC PPPT UMPMaDVCTDrT'T T tTVT CV ™ 
IjKVJJaJSuLJjAPIljJNIiiXSJnolr tA.luiVXllC X 

LS S RQSHS VAQAGVQWHDLGS LQPPPPG 


12053 


25954 


A 


12163 


228 


1 


LLGLW FEQLGRFTKI AKTKGERSQTQSG 
HSMDMRVPAQLLGLLLLWFPGSRCDIQM 

lUoFrAVbAoVurJLJKvliTCR 


12054 


25955 


A 


12164 


122 


1 


RLLFFETESRSVARLECSGAI SAHCSLR " 
LPGSGSSPASAS 


12055 


25956 


A 


12165 

i 


239 


350 


GQARWLTPVI PALwEAKASKSLEVRSLR 
LAWAGHGGS 


12056 


25957 

< 


A 


12166 


235 


359 


TDKAIS KRDLSVliREI IFFEMESRSVAR 
MECSGVI SARCNL 


12057 


25958 


A 


12167 


209 


396 


QFSGI PFDQYKCX3FKNYIYI YI YT5fIYI 
YIYIYRKYTRAYTKRYMCVFSRATHNIV 
YLCAYI 


12058 


25959 


A 


12168 


327 


130 


GGVGFLGGPGEKPPRPSPFFFFFFEKES 
RSVPQARVQWPDFGSIjPAPPFGFTPFIjR 
ESFLFSSLMI 


12059 


25960 


A 


12169 


317 


422 


RGEKPLFFFLQKKKKKKKKKKKKNIKKG 
GRRYKPS 


12060 


25961 


A 


12170 


260 


375 


EKVTCS WTQWLTP VI PALWEAKAGRSLE 
FRSLRSAWAI 


12061 


25962 


A 


12171 

• 


304 


3 


KFFFFFKGFFFLGGVGPIFPPPKKRFFS 
KIPPGVFFFPPLKKKIFFFFPPVIIjGPP 
RVFFKGAPLFFFFFFFFFFFSETEFRSC 
UFUKJjEUS vAvSAHCK 


12062 


25963 


A 


12172 


105 


3 


Li I F LRQS S AJjLP RLE CNGAI S AH CNLCL 
TGS5D 


12063 


25964 


A 


12173 


146 


17 


RIFFFFFLGQSLAVAQAGVQQRDU5SLK 

AFirFo VnAiLiljFU" * 


12064 


25965 


A 


12174 


234 


2 


LFPRKPKPSVSLSLSLSLSLFRLRKLSS 

DMDinntT otra o tmmd n lumotr/^o bi? o v or 
r MKHJNtJ la vKPliMrJJrAIVrioKCbbERKSJj 

VSFTLNLKPGMIRIiSEEGMIiK 


IzUCD 


23>0O 


A 


1 21 75 


292 


380 


v KoJjvKVArliAJrf Jro 1 JjuuUotjrW XTuuUo 

F 


IzUoo 






121 /O 


1 AO 


1 


SPKVRSSR 


12067 


25968 


A 


12177 


170 


291 


L INFYF YFLRQSHS VTQTGVQWCNLGSL 

V* * c AAKivKKltvJ 


12068 


25969 


A 


12178 


260 


380 


LTYIALFI FLRRSLTLLiPRLECSGMISP 


12069 


25970 


A 


12179 


54 


166 


PKRGFHRVTQEGLNLLTLGS PRLGLPKG 


12070 


25971 


A 


12180 


273 


359 


APAGHGGSCLQSQHFGRLRQADHLGSGV 
K 


12071 


25972 


A 


12181 


a / 




IGOAOWLT PVTLAFWEAKAGGSLEVRSS 
RPAWPIYFLVLLASYLRRFCLIQGHSDL 
LYTSSTRGS 


12072 


25973 


A 


12182 


1 


155 


VHKI F I AYKYSGFQTVFRGRAWWLTPVI 
PAliWEAKANRS PEVRS SRSGRRG 


12073 


25974 


A 


12183 


91 


2 


FNLFFFFFFETGSRSVARLE CNGAI SAd 
C 


12074 


25975 


A 


12184 


122 


2 


GQAQWLTPVIPPPWEAKVGKSPEVRSSR 
RTRGRTRGRTRG 


12075 


25976 


A 


12185 


389 


20 


TDRGRRARRACFCGKVFDGELS FALKLA 
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SEQ ID 
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nucleotide 

location 

correspond 

ing to first 

amino add 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M»Methionine, 
N=Asparagine, P=»Proline, 
Q=Glutamine, R=Arginlne, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Mop cod on, /= possible 
nucleotide deletion, ^possible 
nucleotide insertion ! 






a 




• 




REMGRPDWRAI^IAGMSSTEYADWHRFYS 
TH YFHDVLLDMH FS GLTYTVLS LFFSDP 
DMHPLDFSLLNRREADBBPEDDVLMQKA 
AGLAGGVLFLI 


12076 


25977 


A 


12186 


180 


393 


LNPGSGGCGEPRSRHCTLAWVTERESVS 
KNKIKIKINKQIEEGFLFQPFGQGFIWC 
YFPAKEFIIALSKT 


12077 


25978 

* 


A 


12187 


37 


443 


PDFPIPFPPRKVQHLARPSPKTARPAWT 
IRR I LPP PPKKRHP PGLARGKFGPREIiA 
KAGLAKE CPAGG P T P AVR FAT PTRGGGG 
RVAGFQTPMG I PMGKARLGLLTFLGFAS 
GWIGAYRPRETLCGGELVDTLQF 


12078 


25979 


A 


12188 


3 


411 


AFPENAATGST FQDP VPAS KGRAEVGNM 
RLSVAAPISHGRVFRRMGLGPESRIHLL 
RNLLTGL VRHER I EAPWAR VDEMRGYAE 
KL I Y YGKLGDTNQRAMRMAD FWLTEKDL 
I PKLVQVLAPR YKDQTGGFTRMLQ 


12079 


25980 


A . 


12189 


2 

• 


414 


QEFGTRKRATF I S LLFFFS S VYS RGVFR 
RDAHKSEVAHRFKDLAEETLRALTLIAF 
AHYLHQCPFADHVKLGNEATEFAKTCVA 
DESAENCDKSLHTLFGDKLCTDAPLRET 
YGEMADCCAKQE PERNECLLQHKDD 


12080 


25981 


A 


12191 


3 


514 


PRLLMEAGPHPRPGHCCKPGGRLDMNHG 
FVHH IRRNQIARDD YDKKVKQAAKEKVR 
RRHT PAP TRPRKP DLQ VYLPRHRD VS AH 
PRNPDYEESGESSSSGGSELEPSGHQLF 
CIjEYEADSGEVTSVIVYQGDDPGKVSEK 
VSAHTPLDPPMREALKLR I QBE I AKRQS 
QH 


12081 


25982 


A 


12192 


182 


3 


RGLFFIPPPPIKTKIWGPFWKWFMGFG 
PPLFFFFEMESRSVAQAGVQWHDLGSLQ 
ALP 


12082 

- 


25983 

• 


A 


12193 


53 


400 

■ 


PETPSWLGPVRRF YFI I IKS VWKWKKKK 
KKKKKKKKKKKKGGGPFKKTLGGDKFYW 
GVKKKIFFFLGGSKKHPWGFFEKKLFFG 
GGKCGAP PPKD I S CLWGKKNFLGAIGEK 
TCCC 


12083 


25984 


A 


12194 


182 


3 


KGIF IQI APPKKKKNWGPLWKWFKGFG 
HPIFFFFEMESRSVAQAGVQWHDLGSLQ 
ALP 


12084 


25985 


A 


12195 


200 


378 


KSSKSQWLGYLFIGLLIYLFIKFFLETR 
SCFVTQAGIIAHCNLKLLGSSDPPTSAS 
QAA 


12085 

■ 


25986 


A 

■ 


12196 


341 

■ 


54 


SLSFHGGLS VLCI FSTYKKLSQEQWLTP 
.VISVLWEVKVSRSPQVRSLRPGDPPASA 
S QS AG I TGVS HHAR P HLS LNP RLE MI KL 
SEEGMLKRMTG 


12086 


25987 


A 


12197 


1 


462 


GGPPRP FRMKAAVLTLAVLFLTGSQARH 

SGRDYASQFEGYALGKQLNLMLLDNWDS 
VTYTFSKLREQLGPVTQEFWDNLDKETE 
GLTHEMSKDLEDVNAKVQTYLYDFQKTW 
QKELTFTAESEPVP 


12087 


25988 


A 


12198 


80 


1 


SNKVFFLETESCSVAQAGVQWCDLGS 


12088 


25989 


A 


12199 


221 


3 

* 


TMGIMLDKKEIQAI FS FKFKMGHAVAET 
TCNIX^SGPGTAmCVTVQWWFiaCFCRG 
DESLEDEEHHGRRLEV 


12089 


25990 


A 


12200 


239 


336 


FGALP ITTCKFGKKKKKKKKKKKKKKKK 
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ing to first 
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peptide 
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Predict- 
ed end 
nucle- 
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correspon 
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peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaroic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methiomne, 
N=Asparagine, P=ProIme, 
Q=Glutamme, R^Argmme, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptopnan, Y^Tyrosine, 
X=Unknown, *=Sitop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKKKK 


12090 


25991 


A 


12201 


279 


366 


KKKKKKKKKKKIQCKKNKKKKKKKKKGFK 


12091 


25992 


A 


12202 


153 


40 


GFLFLI FFFFETKSRS I IQAGVQWCHLG 
CTLAWATETD 


12092 


25993 


A 


12203 


317 


406 


LGKLKCNGAVSGHCNLRLPGS SDFSGS A 
S 


12093 

i 

■ 


25994 

i 


A 

■ 

- 


12204 


2 

■ 


925 

■ 


RSGPDHELTLDSGKGCTRSMVPASAASE 
DRRKL P 1 1 VE DEGGPTRS RACSS PARGS 
RP P PS AI GC S PVAQASDS AAG PARRTAL 
QSLSSWLGYQIDRHSVPVYVFKSPLFSV 
IMAPKHKSSDAGNLDRPKRSRKVLPLSE 
KVKVLDL I RKDKKS YABVAK I YGKNE S S 
IRE I VKKEKE IRAS FAVSPPTAKVTATV 
RDKCLVKMEQALHLWVEEMNRKRVP IDS 
NMLRQKALSLYQDFSKGCSETDTKPFTA 
S KGWLHRFRHRFS HHYKKKKKGIMAQVA 
VSTLPVEEESSSETRMVVTFLVSALESM 


12094 

• 


25995 


A 


12205 


272 


1 

■ 


KMARAGLLVI EGKVWRTVYYRFATREER 
EGKMSTNLMNKLDTIGFDNKKDLLISVG 
DLVDRGAENVECLE^ITFPWFRAVRGNH 
ERQPGQ „ 


12095 


25996 


A 


12206 


166 

• 


478 


ISASGLLPTSPLTGTSKLQDPNEHLNLL 
MLNRVSLLLPRLECNDTILAHYNLRLPG 
SSNS P DS AFQVAGI TES FTCE INALKDS 
SQVALWS FRCEMDHSS I CNR 


12096 

■ 


25997 


A 

• 


12207 


102 


497 


PLLALLFGSQKTFLCLQIiSIiKPTRSSLL 
S PGSAGNP ENE AP CPQLNPEATS LKKKK 
KKKKKKKKKKKKKKKKKKKKKARG 


12097 


25998 


A 


1220S 


131 


2 


FGFLGGDKAKGKGPGPPFFFF FETES PS 
VAKAGVQWCDLGSLH 


12098 


25999 


A 


12209 


221 

< 


499 


QTRHKLTNSRSGRTGLQ ILI KEQHSSGR 
WCXSTGDILWQSLEDTICYSVPKSLIiDFL 
LF I YFETESHFVTRLECSGAI SAHCNLR 
LLGSSDSPA 


12099 


26000 


A 


12210 


297 


398 


HNFFFFLKTESCSISQAGGHWYDLSSPQ 
PPPPG 


12100 


26001 


A 


12211 


341 


452 


NWSPGLKQSSRLSLPKCWDYRREPPCLA 
GLLAYLFRR 


12101 


26002 


A 


12213 


214 


468 


FFFFFFWGKGVFFFFPPLEGRGGNSVYW 
TPPPRGKKNFPPLGLGKMGNNPPPPPPR 
FFFFFFKKKGVFFLGPGGVKTPSLRETP 
P 


12102 


26003 


A 


12214 

■ 


223 


477 


CPTETQLQLIFAREKHKPSPVLLDLDLY 
GFFFEMESHSVS PLECSGP I S AQSNLYL 
LGSHNSPA 


12103 


26004 


A 


12215 


263 


55 


TPKRYGKMQTATYLPKSKIYI IISDLRK 
QD YI VYKQS KKPEGWAQWLTP VI PAL WE 
AKAGRSPKVRSSR 


12104 


26005 


A 


12216 


328 


471 


GVIHAQVLHAHTHTHTHTCTHTQVHAHS 


12105 


26006 


A 


12217 


434 


1 


WDHSAS PPTSVGTPLRADGPHPRIiTAPG 
HWGS S AMLS APAAGRRGLS LGRWCDRD 
KLSQRG PQP KW I L WWGNLPGGG FLHRH I 
HKPLPTI PI PSTVRGEEVLQQLLHSAS F 
ILFIFI FLDRVSLCHPGWSAVARSRFTT 
TSTS 


12106 


26007 


A 


12218 


304 


43 


EKPIGVAPSRVKKRAGGGEKTECCLLKG 
PLQGFQGVLKGPLKLNGPLFFFFFLRRS 
LALFRLECNGTILAYCNLCLLGSSDSPA 
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E=Glutamic Acid. ^Phenylalanine, 
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nucleotide deletion. ^possible 

m m mm mm m mw mwm mm* ^m* mm? mm m mm* m* m mr m>w m m mm v<f mm* m mw 

nucleotide insertion 














. SBI 


12107 


26008 


A 


12219 


171 


365 


ALVQPRAGREAVTVPACCS CS DVFL YET 
NKVAR IQS IN YGT I KFFHVI VPS YVSKE 
HLKRKLEK 


12108 


26009 


A 


. 12220 


171 


452 


ARYKFLHPKFAFGLYPS S SQ IQGARHQL 
WILIKCSFYLLFLRQCLTLLPQIiECSNM 
ITAPLQPQLPGLKQSSHLSLPSSWDYRH 
EFP 


12109 


26010 


A 


12221 


323 


458 


FRVQRS LFLF I Y F FETESHP WQAGVQ W 
HDFGSLHRISP 

mm 4*mm* mm. ^mkw mmw 0mM3^ mmmy ™ mtm) w mm 


12110 


26011 


A 


12222 


189 


56 


DRVSLVQPTLECSGTISVYCSIiNIjIiGSS 
DPPTSASPVASTTGGI 


1 01 1 1 

121 1 1 


i 


a 


1999** 






FLCSS YAYDDHFLRPFFP I HIiI IJ3ITAF 
SQDTI X SNLLFQDACSVP KKAANLGQAQ 
WIiAPVI PVLWEAEVGRLLEVRS SRPAW 


12112 


26013 


A 


12224 


55 


353 

• 


RIQGCS CWVKLCPKVEKWVQRTDAEQES 
QTKAE I QDMKQELSAVNMMDKFARSARL 

ISVAFYKLPGTVIRIj 


12113 


26014 


A 


12225 


199 


23 


LKXiSS I Y YVPGT I LKYFTGWARWLLP VI 
PRl^WEAEVGGSPEVRSSRPANSTSWI ER 

HQ 


12314 


26015 


A 


12226 


369 


14 


DCVRVGLSYPQTVCPSCESVRKAPFSSR 
PHRGVLESGQPRCKS RRVD I FFSMERER 
DRERERERERERERGQKRTRRVWGGERE 

KLGYSHAWGWRGGRQLGWGGMCGIPPPC 
TSDVHQ 


12115 


26016 


A 


12227 


206 


2 


QNGHMNGGPCRVS LS LFLETRS FLF FLA 
SLSMLECSGAILAHCNLRLPGSSNSPAS 
AS OVAG I TGACH 


12116 


26017 


A 


12228 


231 


501 


RSWEGQALHGSDPIASCTRIQSNYMALQ 
RTNOELEDKLYRMGOHYEEEKRALSHEI 
VALNS HLLEAKVT I D KL S E DNVSAAHTW 
PMEDCR 


19 1 19 

■ 

* 


ZOvlo 






art 

oO 


- 


DLPTFPVP PHPG FSGPLLGO SELPSNCO 

m^JLmiMT mm mm mmT W mm m» i4£ W *» mmW^m* At J^JjfcJ^J^^ W4JAJ» M*i»V|| 

TGGRSHLSVSVLCPTLATTPLGLDFPTC 
GTSQLPLGPLQIiAHRSELAGLSLPGLST 
NCTSQRLPDRPHRHSDAPHTHTHTQTHT 
HTHTHTPATVT 


12118 


26019 


A 


12231 


211 


96 


KSHFKAGGGGPPRYS PPFGGGGGGVFRV 
GGFPPPGLTQ 


12119 


26020 


A 


12232 


228 


3 


KRWFIKGFLSKGLGPPPPKLKFLGFFFF 
FFFEQS FVLVAQAGVQLECSGTIIjVHFN 
LYLPGSSNYPASASHPRV 


12120 


26021 


A 


12234 


137 


3 


VPLPGTLKNHSMVPCFLFVCLFVCYETE 
SHSVAQAECSGAISAH 


12121 


26029 


A 

mW A 


12235 


139 


1 


FKLCFGQAWWFKPI I PALWETDASRSLE 
VRS SRPADAWADAWADAW 


12122 


26023 


A 


12236 


80 


1 


VLLIFIFLEEMGFHHVGQDGLDLLTL 


12123 


26024 


A 


12237 


187 


494 


TERNS FN INKKDI HTKTPFVGYQHQRPK 
VDKTTKMGRNQSRKAENS KNQSTS S PPK 
DRS SLPATEQS WTENDFDELTEVGFRRS 
AITNFSELKEHVLTRRTE 


12124 . 


26025 


A 


12238 


252 


503 

* 


CQGVGADDLQSQAE IQCELCAES I DRFD 
ILFFFFLGLGP PHKNP PPPTPQKWKPGK 
APPLALWDI PRGAKGPGRQ PGRPAIHTC 


12125 


26026 


A 


12239 


2 


471 


RIALCPAVRIRHEERERERERERERETP 
GHTQLS PGARRTPPLERERAFNDCFSLN 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
Er=Glutamic Acid F=Phenvlalanine^ 
G=Glycine, H-Histidine, I=Iso leu cine, 
KMLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

0=Glutamine- R<=>Ar?inine. 5?=5?erine 

T=Threonine, V=Valine, 
W~TrvDtODhan Y=Tvrosine. 
X= Unknown. *=Stoo codon. /=oossible 
nucleotide deletion, ^possible 
nucleotide insertion 






• 






- 


YPGNLCPQDLIEVFRSVYQHCALYIjQDG 
YFINITPVYGI PAFFTSAKS VFFSRALE 
KTHLLKDVVGSDTYRINYICYETLPSLS 
VBEI ITRSEFVIGQEA 


12126 


26027 


A 


12240 


14 


149 


FMTACRIRHEMGRGLLRGLWPLHIVLWT 
RIASTI PPHVQKSALPH 


12127 


26028 


A 


12241 


588 


671 


RSRPSFQEQAFESSQKYKEGKYIIELNH 


12128 


26029 


A 


12242 


384 


464 


RGGQWWLKPWPALWEAKAGRSLEAG 


12129 


26030 


A 


12243 


325 


459 


TYLFIYLFIYPETESCSVAQAGVQWRDL 
VSLOPLPNSPALPTN 


12130 


26031 


A 


12244 


323 


457 


SLQRECGPADTFFEMDSCSVAQAGVRWC 


12131 


26032 


A 


12245 


241 


466 


GAXjFFFFFFFFFFFFFFFFFFIiSNKAVY 
F 


12132 


26033 


A 


12246 


385 

j 


37 


QARRRCLQTSRMEALGRWS YKVTI EFPL 

OTIiPPPPTT.KVTCVT Pf? TK TTPNPttPT.PP. 

YEEGQRLGKSTVSWVFNEDTRIiINIWNR 
HITVNPRTNSONRMLKMAHESKY550VPF 

PLQA 


12133 


26034 


A 


12247 


318 


456 


LILKYFMYVCMYVHMYVCMYIKSCSVAQ 
DGVQWYDLGSLQPPPPGL 


12134 


26035 


A 


12248 


362 


473 


S PCQNNMNS AKTE ARTNIKFMAKLMWKN 


12135 


26036 


A 


12249 


310 


463 


FSIKTRKNALGKLQSLLRCHQFYLYVCV 
CVCVCVCVCVCTRTYAILTCIAV 


12136 


26037 


A 


12250 


169 


54 


EAKSGQARWLTLVI PALWKAKAGGCPEV 


12137 

• 


26038 


A 


12251 


49 

• 


449 


GI PATS TS CVQVI LL PQPPLLLGLQDS F 
T»pnp t t TTrvRf?pnTfl<?T,TTP<?rt'P , r!VTJirTir 

uryQi x i ai/ DuXjUivjiDJJi iroyDVjviNc IN. 

IVTVDFTREEQGTCNPAQRTLDRDVILE 
EPANGVKIERFTRDDPWLSSC 


12138 


26039 


A 


12252 


130 


428 


RWGLAMLPRLVLNS WSQVI LP PQPQVGA 
TFGGFPNP PGHWKLLGG I SSGEEAGVEE 
AEEGQALGFLGQFPPSSPQLLIiQTSWCIi 
RTTRTCRWStiLWKAH 


12139 


26040 


A 


12253 

A A*£+-J m* 


204 


54 


PGMHKTQKSDLGQVQWLTP VI LGLWETE 
AGDSPESRSSRPAWAKHKIVR 


12140 

i 


26041 


A 


12254 


375 


470 


SLCLFFS FFFQTE PRCVAQAGVQWRHLS 
SLQP 


12141 


26042 


A 


12255 


432 


512 


GQVRWLTPVI P ALW EAEGGRS PEARS S 


12142 


26043 


A 


12256 


232 


1 


KGALSKKTQS FWGGQfCFFPRPGVFKGVF 
FWGGGFCFFVFGGFFFFFFFFETESCSV 
AQAGVQWCSLGSLQPLPPRFK . 


12143 


26044 


A 


12257 


22 


123 


RFVCSTI KVIiRDLSSDRSNPGRFLSTSN 
SSSTRP 


12144 


26045 


A 


12258 


71 


362 


QTERNS I S INKKDVHTETPSKGHQHQRP 
KVDKSTKMRKKQRKKAENS KNQNAS S PP 
KDHNSLPAREQNWTENE FDELTE IDFRR 
WVITNSSELKEHV 


12145 


26046 


A 


12259 


375 


3 


SPNIEAPPKAFFFFNQRHFVFCFFSCRA 
ENTHEIVWVKVYPFSNQSKAKLFFSFFV 
QRDRS S PDS YLMPLQLQMWDTAGQERFR 
SIAQSYYRSAN7kLILAYGITCKESFRCL 
PEWLRQIEQYA 


12146 


26047 


A 


12260 


98 


1 


KYQ IDLG GRGS SNS PAS ASRVAGI TGTH 
HHTQ 
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sequence 


Predict- 
ed end 

II UUIr 

otide 

Incntion 
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dine to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=AIanlne 
OCysteine, D=Aspartic Acid, 

F=f"21iitamif» Arifl 17^= PK an via Ian in a - 

G=Glycine, H=Histidine, I=Isoleucioe, 

K=I J v^inp I «=I .Piirinp lVf =TWe»t h mri !na 

N-Asparftgine, P=Proline, 
O^GIutamine. R=A rpinirtf* S=S«rine. 

T=Threonine, V=Valine, 
\V=TrVDt0Dhan. Y= s Tvroslne. 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \= possible 
nucleotide insertion 


12147 


26048 


A 


12261 


125 


3 


LFLIQKYI TGQERWLTPVI PAPWBABAO 
RSPKVRSSSPAW 

■ Mm Mr** ▼ Mt\rmmmw mmw %mW Mm Mt M>9 W 


12148 


26049 

Mm Saf ¥ 


A 


12262 


228 


385 


SCLYLQMI PSS I EKSQGJMYKKATRGRAR 
WLTPVI PALWEAGAGRSPEVRSSR 


12149 


26050 


A 


12263 


227 


369 

JO" 


AKVGGS PEVKNSRPAWPTW 


12150 


. 26051 


A 


12264 

■ 


102 

• 


487 


OONRFFGTERGRGTGRGVS <? FPfrf A PO <! 

LELAS S PS P SVGRLGSAASQRSRGGQER 
PKAOLRRPPRPSS S PRLRGRENCAHHAR 
TPQG I LDRP PPTLGTNLLKGGRPAWMAG 
NPPEAHKTANGPGEOA 


12151 


26052 . 


A 


12265 


391 


476 


PRSYRVECSGSISAHCNLGLLGSSNPSS 


12152 


960S3 

6UWJ 

1 


A 


19966 


37R 


476 


ftp OXvuvjo U Jjj V ul UUiUUVnii if VX riUJn IS 

AKEGG 


12153 

i 


26054 


A 


12267 


166 

1UU 


58 

JO 


NF FF FF FE TDS CSVAOAfiVOMHTiT J3PPN 
LCLPASSW 


12154 


26055 


A 


12268 


209 


49 


TPPNFYI KI SRSGQAQWLTPVI PALWEA 


12155 


26056 


A 


12269 


127 


1 


S S SFFPLKTFGFIi FFFLCFFESESRS VA 
QAGVQWCHLGSLQA 


12156 


26057 


A 1 


12270 


154 


27 


IQTPGKKEFFFFFFETKSCLVAQAGARL 
CPKKKEKENSNFVL 






a 


1 7771 


ioU 


7*> 

22 


V JjlvlHW J. UxCAU W V i <b V 1 riiLWBVKAGGS 


12158 


26059 


A 


12272 


207 


422 


GDSSTRGADEKPKEGVKTDNNN1.INLKV 
HFVMPRWADHLRSGV 




26060 


A 


19973 


*y 


4.70 


RVQRRRCEMSTMFADTLLI VF I S VCTAL 
LAEG I TWVLVYRTD K YKRXiKAEVEKQSK 
KLEKKKETITESAGRQQKKKIERQBEKL 
KNNNRDLSMVPJVIKSMFAIGFCFTALMGM 


12160 


26061 


A 

* •* 


19974 


3*?4 


430 


VKTGECS lAICTWNOPkCPTTT hWYlfJCMW 


12161 


26062 


A 


12275 


366 


499 


TRAQYFWWGIGSQCVTLECSGMIMAH 
CSLDLLGSSDPPTSAS 


19169 


96063 


A 


19971? 


1 JO 




JVr r^oSJ VilWXjJVir V X irAXjIVJSVJlWvjlJOlrlSVK 

SSRPA 


1 12163 i 


26064 


A 


19977 


410 


4£ 


nn?PTiM\/QT?TiT\7QQQr!PPT.Tl«*T T TVDDDV 

FLRYNNSEIRPINNPTMTCKCSSKRKSR 
I CLTFNQ KLE 1 1 KL S E E QMS KI KTGQNL 

GIIiCHVSQVVDEQHKFVKATKSATAKlN 
LSISKYIMSGA 


12164 


26065 


A 


12278 


190 


1 


RKGVFS PQMG PGLGKGKS PFLAKliNFPF 

inrT'T^TrpppTPTrscr'iTVA f\ Ti mumuT.nbc c p 
jvi\.ir r i:r r n hjscc v>iy/iiji*iy WijUirooJ^ 

DAWADAW 


1216S 


96066 


A 


19970 


976 

z/o 


jay 


AISAHCNLT 


12166 


26067 


A 


12280 


425 


1 


PPHPPPGKPPPPPRPELFFPKTKNKKER 
VLFFSPPPKKNFFSLTTPRFFFSPPQKK 1 
KKKLNPPPQKGPPPP VPI KPPPPLFFFF 
FF F FFFFFF FFFFFFFFFFFFFFFFFFF 
FFPFFFFS FGLTVRVIMTCWLI VDIGLL 
I 


12167 


26068 


A 


12281 


253 


90 


IWPPQGSFKTAAPFFFFFVFLVEMGFCH" 
WQAGVKLLTSSSPPASASQTAEKLF 


12168 


26069 


A 


12282 


61 


387 


PKSSSSAPFQI SSWFSLPASTSASTMS I 
RVTQQPYKVFNSGPWAFSSSSYMSRPSA 
HISSLIVSRVGSSSTSFQGGLGAGMGLP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methlonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LHSSLANRARLSQKKKKRGGRSKEP 


12169 


26070 


A 


12283 


186 


305 


LHHSTVQVREEEHKSLHTQVHTHTHTHT 
HPHLVIMSSHS 


12170 


26071 


A 


12284 


2 


190 


PRVRFDRVNNRS LNPL I SRTMRI EP I PE 
NPKFSVKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKGGGA 


12171 


26072 


A 


12285 


17 


197 


SLGVGGGQGGGGVHFHPCKPFIVLSPV5 - 

LSKHCSYIiPKKKKKKKKKKKKKKKKKKK 

KKKKKKKKKKK 


12172 


26073 


A 


12286 

i 


158 


405 


GVASYAEQRKYFLEMEYFPGGEAISIVE 
MTTEDLHYSINLDDKAPTGFERVYSHFE 
RTCTDDKMLS YCITCYREI FCERNKL 


12173 


26074 


A 


12287 


364 


142 


GCAFFLGGPLKERRKRRAVPPLFFFNFF 
FETESCSVTQAGVQWCNLGSLQPLLIDW 
WGQPGTVAHACHPGTLGQ 


12174 


26075 


A 


12288 


171 


380 


VFEAGHAWLQENLTS PDFWGFS FFPTDT 
GMLE KKKKKKKKKKKRKKKKKKDR 


12175 


26076 


A 

4 


12289 


194 


2 


GPRGFWQFWVTKGFFPKKGFVKFYPPAG 
GWFFFFPKRGFPFPSFLFFFDRVWLCHP 
GWISVARS 


12176 


26077 • 


A 


12290 


179 

■ 


416 


LIFLETFACAVCFQIiARFHFKIHI ETAL 
RCLI ILSCPFTHPIiYKMYREGQARWLTP 
VI PALWEAEVGGS PEVRSLRPAC 


12177 


26078 


A 


12291 


3 


408 


LAYGFHDRTT YDTPSALMMVLLVDEL IT 
GNKNGSGE AP E YLP ED FRDGEDDAAVTL 
EKQEDLTTLVTLPVTLGEHQRQRETQLE 
AKLLKKRLELGSLLDLVEDLELI IQLKK 
KKKKKKSPLWGAPFKGPLGAPI 


12178 


26079 


A 


12292 


119 


278 


PKS I EAGVKS VLDHPLPNKAKTHLSCKK 
LKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKK 


12179 


26080 


A 


12293 


119 


3 


MLFTGFHTKVMSLNS ITGWAWWFTPVI P 
ALWEAEVGKS 


12180 


26081 


A 


12294 


205 


1 


ILHFYSTETKQPKGLFLALFYSDCEQEK 
RKGGKLKKI FKKAQGQARWLTPVI PAFW 
EAKVGRS PE VGS 


12181 


26082 


A 


12295 


215 


1 


AFKIKLTMHIFSSPLCLPQDEFHPFIEA 
LLPHVRAIAYTWFNLQARKRKYFKKHEK 
RMSKEFHGARLBGNQ 


12182 


26083 


A 


12296 


185 


411 


DVPLMFHWLHLHSDFRIISTEIGVETVG 
KRVLHIRILLLLLLLFFEMESCSVTHAG 
VQWCDLGSLQAPPPGFTPF 


12183 


26084 


A 


12297 


281 


388 


RILFFFFFETKFCSVPQAGVQGRDLNSL 
QPPPPGFM 


12184 


26085 


A 


12298 


178 


3 


TIjLSVTQVPILPGSASSPLLFSRKDIKI 
PTSVFFS FQTE PRS VTQAGVS WCDLGSIi 
QP 


12185 


26086 


A 


12299 


199 


450 


TLIRHMI CNYFLLSHRSPFS S VDYFLCY 
REI YSLI KTLNKLGIGGFFLNI IKVMYD 
KPAAS I IRSSERQKAFPLRSGTRIRCPL 


12186 


26087 


A 


12300 


3 


407 


EFKDHSTAMDTEPNPGTSSVSTTTSSTT 
TTTITTS SSRMQQPQIS VYSGSDRHAVQ 
VI QQALHRP PSSAAQ YLQQM YAAQQQHL 
MLHTAALQQQHLSSSQLQSLAAVQASLS 
SGRPSTS PTGSVTQQSSMVQTS I 


12187 


26088 


A 


12301 


227 


3 


KRGTEGQTLFFPAPRARKSFPPPKMPKK 
VEKEFPKGFFFFFFFETESAPSPRLECS 
G AISAHCS LNPHASAHAS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


eth 

QQ 

■ 


SEQID 
NO: in 

09/515,1 

■ 

■ 


Predicted 
beginning 
nucieonoe 
location 

1* A »• MAG A A It /I 

curresponu 

ing to first 
amino acid 
resiaue oi 
peptide 

CAyi haw n* 

sequence 


Predict- 
ed end 

« II A Ia 

nucle- 
otide 
locution 
correspon 
ding to 
last amiDO 
acid 

resiuue oi 

peptide 

sequence 


Amino acid sequence (A B AIanine 
C=Cysteine, D=Aspartfc Add, 
tir=ijriuianiic Aciu, jp— JroenyiaiauinB, 
G=GIycine t H=Histidine, I-Isoleucine, j 

IV~l_<ySIIlti, l-r-'JjtULIIICj ItI— 'iriclll tUUlUC, 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Sertoe, 
i— i nreonine, v = vannc, 
W=Tryptophan, Y=Tyrosine, 
a— un kd own, = oiop couon, /— possioie 
nucleotide deletion, \apos9ible 
nucieoTiue insertion 


12188 


26089 


A 


12302 


292 


380 


KCYRWARWLMPVI PALWBAKAGGSLEPR 
N 


12189 


26090 


A . 


12303 

■ 


13 


186 


VDBRHIRLSBMAWVALYLLSLLWATAGT 
STQTQSSCCEYDVFPRRSGKQLPWGRGC 
DL 


I'M OA 

12190 


26091 

i 


A 


12304 


253 


409 


J. 1AV xAAftAAV V<jV i XxnlrJr KoIjItKIjCj 

CNGAI S AHRNLLLLGSSDS PASAS 


12191 


26092 


7\ 


12305 


153 


64 . 


nn7\ r\U7T MT3TT'D7AT UUAVnnPDT.niTDCCD 
bKAyn liMF J. J. FAljWE»Al\Aljij f JjU VKooK 

PA 


tni fit 

12192 


26093 


A 


12306 


265 


420 


PVI PTLWEAKAGOSLEARCSRPAW 


12193 


26094 ' 

i 


A 


12307 


295 


1 


RPCLKNKKTS PHWKKKF aNPIiFGKAP VF 
PLDKKPFKPHYRES PRPLKKKNGPKGFF 

E r C , T?"C r C ,r PT? Cy/^CTrT/^lA nnrATiTDT^T /ICT/MIDD 

PGSYHSPSKLIIKN 


1 *"l 1 Ct At 

12194 


26095 


A 


12308 


247 


421 


PGGGALKKKKIFS 


12195 


26096 


A 


12309 


168 


3 


GQALIjLLCLTVAFSKTTVVCHAPILGWA 
WWjjTP V I PAL W iSAb VGKJj F£i VKKSKP 


12196 


26097 


A 


12310 


117 


3 


VAYNIAVFRGGVSLLLPRLECS6VISAH 
CNLRLPCSS 


12197 


26098 


A 


12311 


256 


101 


EPPTPFFFFETGSCSITQAGAQRRDHSP 
PQSPTPGLSNTPAPASRACESGR 


12198 


26099 


A 


12312 


146 


3 


FVALCIQLFPIiVNVI CFHKKHKTGRARW 
LMPVIPALWEAKAGGSPBV 


12199 


26100 


A 


12313 


207 


3 


vRWTTGVIMALRGLCFSViSKSMTKDEL 
IARLRSLGEQLNRDVSLTGTKEELALRV 


12200 


26101 

• 


A 


12314 


282 


2 


NCLAEKVKFMVSAISLQPQGISILFLSF 
FFEKESCSVPQARVQWPNFSSLQSPPPG 
r KJLtr r CLibPCjUCatH-bcilr Ka\JKuxe\>WUU 
NETPSQKKK 


12201 


26102 


A 


12315 


298 


Oil 

84 


i\l fyPP Vr AojofLNr UrirlwJL AFlr r if Jr r ir 
FETKSYSVAQAGVQWRDLGSLHSSLGHR 

aDT.WT.lTVP IfPTfQ TWT . 


12202 


26103 


A 


12316 


124 


1 


SKGHFFFFFFSETESCS VAQAGVQWHNL 
GSLQVPPPRFTPF 




ZOIU4 






1 JZ 


J9o 


v v ifuivr ox v x r org vwio a vAUrtHi j^n 
LVCYLLRETYQKLNQEIKP PTLSVPKKR 


12204 


26105 


A 


12318 


262 


392 


PHYCQNQ PQ ARRSGSHQHFGKQRREDHL 
SLGVRDQPGQYSETP 




Zoluo 


a 




loo ■ 


J 


QVQCLTPVIPALWEAKVGGSPEVRSS 




ZOI07 


A 




48 


<2QC 


STMIMRLPLPCGSPIKPPFLPKKKKKKK 


12207 


26108 


A 


12321 

1 if J* ft 


218 




NLSPIFPEGNFFFFFFEMESRSYAQATV 
QWRDLGSLQAPPPRRKF 


12208 


26109 


A 


.12322 


197 


1 


KETWPFPPKKEQGLFSFFKKRIFFFFSK 
KIPSPPFFFFFFFETESCSVAQAGVQWH 
DLGSLQAPP 


12209 


26110 


A 


12323 


128 


36 


KGLFFFFEMGVSLLPRLECSGTTSAHCN 
LP 


12210 


26111 


A 


12324 


140 


3 


I LNSNLKP I KKNTFFFETE SRS IAQAG V 
Q WHDPSS KQPPPLGFKR 


12211 


26112 


A 


12325 


273 


369 


RKEWFLKKKKKKKKKKKKKKKKKKTICKK 



t 

1281 



■ 

t 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

oa 


SEQID 
NO: In 

26 


Predicted 
beginning 
nucieonue 

IUUIUUD 

correspond 
ing to first 
amino acid 
resiuue oi 
peptide 
sequence 


Predict- 
ed end 
nucic-* 

UUW 

location 
corrcspon 
ding to 
mst amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A B Alanine 
C=Cysteine, D=Aspartic Acid, 

£/ — OIUIH 111 It. /\CIU, F ~~ F IJCIiy IHlalllDc, 
vrijciuc, n — jui>iiu mc, i— uuicuuuc, 

K=Lysine, L= Leucine, M=Methionine, 
N-Asparagine, P=ProIine, 
Q=Glutamine, R=»Arginine, $=*Serine, 

TsThrpnnlnA V=Vnltnp ' 
I -~ k iJicuiniic, v — valine, 

W=Tryptophan, Y=Tyroslne, 
X~Unkoown, *=Stopcodon,/=possible 
nucleotide deletion, V=possible . 
nucleotide insertion 














KKK 


12212 


26113 


A 


12326 


140 

• 


415 


NASl^PEVWAPSLSGPETKPPAHSPSD 
PGVHSSCSASQGSQGPGPLTPSSRIHEP 
KPPGPFFK 


12213 

■ 


26114 


A 

• 


12327 

• 


2 


506 


FVPLSETEAPGCSGSRPPEIiBPERSLGR 
FRGRFEDKDBQLBE££cIjCiSEEEEcc£D 
MSHFSLRLEGGRQDSEDEEERLINLSEL . 
TP x XIjCS I CKG 1 Ii I DAxT J. TbL-liHTr CJV 
SCIVRHFYYSNRCPKCNIWHQTQPLYN 


12214 


26115 


A 


12328 


1 


419 


PRVPSCARCQYLPLLRGASQLGYSGVRD 
PVEEAVCPFSDLKLCAGRTTTLFKAVRQ 
GHLSLQRFLLPFVWLCPAPRSGVYRGRQ 
ASJjS CGGIiH PVGAS WFKCIiPTQAS AMAvj 
TPPPASLPPCSLISDCCASNERGSMGV 


12215 


26116 


A 


12329 


259 


414 


NMENSLRCVWVPKIjAFVLFGASLLSAHL 


12216 


26117 


A 


12330 


302 


404 


SCYRVSVISQARWLTPVIPALWEAEVGR 


12217 


26118 


A 


12331 


94 


375 


S RDMS PGIiLTTRKEAjLMAFRD VAVAFTQ 
KEWKLLS SAQRTLY31E VMLENYSHLVS L 
GIAFSKPKLIEQLEQGDEPWREENEHLL 
DIjCPGWSAMA 


12218 


26119 


A 


12332 


mm ■ - 

7 


246 


NPX VDFbPCGoXRVKl PAvjKCjviPALiRVK 

PBTWEEAGEKMPSESLCLAAQARLDSKW 
LKTDIQITCFITRKISLLPLFCHHIi 


12219 


26120 


A 


12333 


150 


1 


Ylxr FHliVKGljPGSt»xL/riX r VVVVvNliS 
SPSVTQAGVQWCHWSVQHKPP 


12220 


26121 

■ 


A 


12334 


34 


426 


EPGFLFVFFFLIRGKKTPLFFWGTRKNP 
KSGKPLFTPPGKSPSEPPKSRGGPLFFQ 
TtCJKJjl rybWtX^AJrJt^rJjUUbFFl^nli 
AGLGGDPPTPRAPHRADQTLEQP IQAPT 


12221 


26122 

- 


A 


12335 . 


185 


3 


TVITPLHFSLGHRARLWSRVPAPPAENR 

ASMS 


12222 


26123 

■ 


A 


12336 


60 

■ 


316 

• 


GWGPIPSTDVTVLSPQKPILLQGHERAI 
TO T KYNREGDIjI»FTVAKDPVS vgwrgsg 
RGGIIiLPGRWVDTPVLPLAGNQRASHCE 
K 


12223 


26124 


A 


12337 


3 


167 


SFRIQVQGNHTSKHYPKIQYSDQAQWLT 
SVIPALWEAKTGRSLEVRSLRAFRAP 


XLJLJL 1 * 








L j 6 




RPIiS VMCGRLTS I RVE DSGARC W F YLF I 
YETE FRS VAQAGVRRRDLGS LQT PPSG 




ZD 1^0 








All 


GWAO WFM P VI PALWEAKAGVS PEVRS 


12226 


26127 


A 


12340 


238 


382 


LL I VYNMMLLPDAVAFTCNS S TLGGQGR 
RITRGQE FE TSLANMVKPHL 


IJ.LJ.I 


ZOIZo 


a 




tm 

iUI 


t 
i 


KTKOTNKKTRSVGOTWWFTPI I PAYWEA 
QAGEL 


12228 


26129 


A 


12342 


276 


23 


GSCLLEGKLTNRKD IHTKTPS VRHHCQR 
PKIDKTTKMGRNQSRKAENSKNQSSSSP 
SKECS SIiAATEQS WMPHDFDELREEGFR 


12229 


. 26130 


A 


12343 


122 


1 


YMGVNERGCGQSIFKSSLSSQLWWWAPI 
I PATQ EAEAGE W 


12230 


26131 


A 


12344 


180 


360 


LHWYFFSVGDFLPPPPPPLDDSSALPS 
I SGNFPP PP PLDEEAF KVQVRAEVKVML 
GKS 
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SEQID 
NO: of 
nucleotide 
sequence 

't 


SEQID 
NO: of 
pepiiue 
sequence . 


M , 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 

• 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
'amino acid 

residue of 

peptide 

nnencf* 

9EUUGIIVV 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

seauence 


Amino acid sequence (A»Alanine 
OCystelne, D=Aspartic Acid, 
E=Glutamfc Acid, ^Phenylalanine, 
G=Glycine, H=Histidine ) I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Giutamine, R»Arginine, Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyroslne, 
X=Un known, *=Stop codon, /^possible 
n ucleotide deletion, \=possible 
nucleotide insertion 


12231 


26132 


A 


12345 

- 

♦ 


35 


411 


TMT I VDKAS E S SDPSAYQNQPGS SEAVS 
PGDMDAGS AS WGAVS S LNDVSNHTLSLG 
PVPGAWYSSSSVPDKSKPSPQKDQALG 
DGIAPPQKVLFPSEKICLKMQQTHRVGA 
GLQNDGNTCFANA 


12232 


26133 


A 


12346 


278 


386 


I I YCYKQTI IGQARWLTTLIPSLWEAKV 
GGSPEVRS 


12233 


26134 


A 


12347 


111 


2 


KSQTGQVQWLTPVI PALWEAEVGGLLEL 
RSSRLAWA 


12234 

t 


26135 


A 


12348 


215 


316 


LNKRIJ4ISFIKKKKKKKKKKKKKKKKKK 
KKKKKKKGGPL 


12235 


26136 


A 


12349 


265 


404 


SN I SY FLNNNFS KALS LNIKYM I SQARW 
LTPVI PAIiWEAKVGGLPE 


12236 

• 


26137 

• 


A 

• 


12350 


372 


1 


PVFPLPPKKLGEQXiPPPALRFIAVSPLP 
KAAHEQE I KEKVLAVHKNP I DPVYGFPK 
KGPTNFFLFPLFQRVI FLGAQKGVSWFG 
FYVKGLSLAPKLGGPPFFFFFETESRSV 
AQDGVQWCDDGS . 


12237 


26138 


A 


12351 


318 


416 


QGRAQWLTPVILTLWSAKAGGSSEVRSS 
RPAPP 


12238 


26139 


A 


12352 


144 


1 


NGYAI VE CWL FCFVL FLRQS LSVAQAG 
VQWCNLGSMQPPSTSIiVQAI 


12239 


26140 


A 


12353 


116 


1 


S GR YS FKKMKCGQERWLAS VI PAL WEAE 
VGRSPEVRSL 


12240 


fill 

26141 


A 






7 


STRYKKQPGGWERWIiTPI IPVLWEAEVG 
GSPEV 


12241 


26142 


Tk 


nice 


1 19 




LISPSCPMWPSPHGPTWLSRLISQHSPS 
NLKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKRGGGAF 


12242 


2614J 






1 "*R 

1 JO 


7 


SFFLKVFFFFFFEMESRSVAQAGVQWCN 
LGSPQAPPPGSRHCPVD 


12243 


26144 


A 


12357 


148 


394 


PGEPPGEKGEGGKGKKPGGRGWGEPPCR 
GGFKEKFPRGKEKAPPLKENSRENLGWG 
TKIPPLGKKKPPPPQEQEMILGPPQF 


12244 


26145 


A 


12358 


97 


3 


EKFS PC FVRARTHTHTHTHTHTLS LSLF 
GFH 


12245 


26146 

4 


A 


12359 


204 


2 


KHYTPAACFAPCLPDEAPI IAAAKPATT 
TSEQKMAVPPKYANLGKSARNVENKGYG 
YGL INLLKTKS 


12246 


26147 


A 


12360 


156 


391 


NRGLFKVEES FS I PCS VRCS IHPSALVS 
PTP PTTDTTNWAL FFBTES CS IHQTGMR 
WRDLNLLQPLPPGFKRFSCLNL 


12247 


1£1 AQ 

26145 


a 






i 

j 


ALKKLLIIDDNSHKTNIVKHFSFRNFTF 
LFLLETESRS IARliECSDANPTHCNLRL 
SGP 


12248 


26149 


A 


12362 


218 


391 


TILKDCTFKMICIiSQAQWHIPWPATWE 
AESRGLLEPTS 


12249 


26150 


n 




7 


JJ\3 . 


TNSHVDNS ITQKPEFCI PRCCGYIVCQL 
QLVERAFI FNFFFFFLKRS FVLLPRLEG 
RGAI FRVTQEGSNLLTLGPAPLGLPKGW 
NYRGDHPGPAYFYFLMGKRLLQIQGGRG 
LKRNLL 


12250 


26151 




12364 


94 


1 


KNPPLFFFFFFFESESCYVAQTAVQWHD 
LGL 


12251 


26152 


A 


12365 


237 


2 


YSSLLFKPAESAKGILRHHVTPKSTSAI 
SKILFCFLFFVETEFLS PGLKCSGAIKV 
HCSLNLSGS SNSPTSASQVPGS 


12252 


26153 


A 


12366 


340 


442 


GLFVCFFVW FLETESCS VTQAGVHWCDL 
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SEQID 
NO: of 
nucleotide 

M j> «ta aB MM 

sequence 

• 


SEQ 10 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 

TTCCKT 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to iirst 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nucle- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartic Acid, 
f£=Lr!utamic Acid, F=rnenyiaianine, 
o=Hjiycine, ri=tiistiaine, i—isoieucine, 
K~Lysine, L=Leucine, IVHMethionine, 
n=Asparagine, r^rroiine, 
Q=Glutamine t R=Arginine, S=Serine, 
T=Threonlne, V=Vaiine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion) ^possible 
nucleotide insertion 














SShQPS . j 


12223 


26154 


A 


12367 


1 AO 

208 


45 


is , riot7i?T7irT3 , T?t70Trc , 'c , 'C r trD JM*>\rr Drv 1 d/^TiT c? f 
ISLiuiSif r C r rrbvr r r vKAljVLib'tA-FtjWo 

ETPGSSDPPALASQSAGITGATVNST 


I 12254 


26155 

• 


7\ 

A 


12368 


60 


443 


GCTEATAGKGMRKCLVAWSTCAGARPPS 
RNRGSQGHSARFQGPRLPTWRLPPPPP 
QAKPFCKKWMLGGPISIEPKPQHPPGPG 
ISJUtWAGKAAPRliVFT 


12255 


26156 


A 


12370 


341 


421 


SGWYRWLTPYTPALWEABAGGSPEVGR 


4 *N A F r 

12256 


26157 


A 


12371 


AAA 

280 


393 


NTIDHxVTQWLTPVIRTLWEAEAGRSPE 
VRSSRSAWPT 


12257 


26158 


A 


12372 


268 


399 


TLCSLSSAES I IHS YPEFS CKDNYLMTH 
NEAYKACMRRAFH I PL 


12258 


26159 


A 


12373 

■ 


209 


468 


EAVPDE TSHPS PADS FS TS SNLACGTHH 
LRRLLKLYVEEELEYYPRKIQCFHFSFF 
FFESRS LiLiT iLPRLECSQS ISAHCNLCLL 
GS 


12259 


26160 


A 

• 


12374 


381 


488 


TITCLFQKCQRGRARWLTPVI PALWEAE 
AGGSQGQE 


12260 


26 16 1 


A 


12375 


276 


395 


GHWKT KKLKL I FVFFEMES CS VAQAG V 
QWHDLGSLQPPP , 


12261 


26162 


A 


12376 


295 


472 


VYS SVAIiNT FTIjLCNHHHQPS PEVIjTFP 
NKKCRGAILAHCNLHLLGSSDSPTSASR 


12262 


26163 


A 


12377 


390 


511 


PGAVAHACNPSTLGGRGGQITRIPSL j 


12263 


26164 


A 


12378 


417 

• 


511 


AHRS FF E ME S CS VTQ AG VQWRRLG S LQ P 
SPP 


12264 . 


26165 


A 


12379 


133 


406 


KLSLNKRDYKKQVCKETKKKKKKKKKKK 
KKKKKKPGAQKKKSGGS PR 


12265 


26166 


A 


12380 


247 


345 


HMWSSQLNKGIjINKKKKKKiOGCKKiCKKK 
KKKKKKKKG . 


12266 


26167 


A 


12381 


156 


15 


NFFFFETESCSSVTQTRVQWFDLGSLQP 
GKQEQNYVSKNKQKELIH 


12267 


26168 


A 


12382 


275 


1 


KIVFLKGPPLFFFFKKKFVFFLKVGFKG 
GLNFI KTFPPGGKP I SKKRQ I FFFFFFF 
ETE PHS VARLECSGTIS PHCNLHLP S SN 
DSLASTS 


12268 


26169 


A 


4 S^**ft t 

12383 


43 


379 


LLGYDESRSLSWICLCLSCLGQLLSFIjN 
LLVYVFCQTCEVCFHYFFQYFSTLPLLT 
FWDSNVLNIRHFG I VPQVLDDLVFFS PN 
KE I FIjGQIjQWIiMP V I PAFWEPEAGRSPE 


12269 


26170 


A 


12384 


303 


1 


KGPFFFFGLSPFFFLKKVFFFPPFFLRT 

PPPFFFFFFFFFETRSRSVAQARMQWGD . 
HGSPQPQPPGSSDPTRP 


12270 


26171 


A 


12385 


141 


1 


KKEPLRAPPPFQLFCFFYFETTPCSVTQ 
ARVQ WCNbSS LQF P PLRYK 








12386 


31 


AM 


KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKSGGGA 


12272 


26173 


A 


12387 


291 


412 


LSS EMUjFCFVYFIiRWS FT F VAQAGVQW 
CDLGLLQPPLPG 


12273 


26174 


A 


12388 


310 


1 


MTPCPFLFFCPKKRKAGGGFIKKALFFN 
PQKEVFLGPPPKGPFFLGFQPTPPWGKK 
I PRLNPL F WRS S I F FFFLLDGVSRHiPR 
LECNGVISAHCNLRLLGSS 


12274 


26175 


A 


12389 I 117 


2 


KRNPQRGGPLFFFFFFFETESRSWQAG 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

etn 

od 


SEQID 

NO:in 

USSN 

09/5154 . 
26 

• 


Predicted 
beginning 

ft i • 1 

nucleotide 
location 
correspond 
ing to first 
ammo acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I~Isoteudne, 
K=Lysine> L=Leu ci ne, M=Meth ionine, 
N=Asparagine, PMProline, 
Q=Glutamine, R=Argmine, S^Senne, 
T^Tbreonine, V^Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














AQWHDLGSLQ 


12275 


26176 

■ 


A 


12390 


117 


319 


SFLKKPKDSTKNLLKLINEFSKGSGPKI 
TIQKS VAFI YPNGEHSBKKI RBVITFTI 
AAKTIKCPPQA 


12276 


26177 


A 


12391 


95 


2 


SKS WTS LKLRTLGWARWLTP VI PALWEA 
KAG 


12277 


26178 


A 


12393 

• 


127 

■ 


395 


IPGSQGFCLIKQSAIiHLLNKSSFSFLSF 
IETQSLSVTQAGVQWCYLSSLQPPPPRF | 
KLRQKNRLNSGGRGCNKLRAHHYTPGWV 
TERDS 


12278 


26179 


A 


12394 


252 


391 


FFCFCFVLLCFFEMESCSVAQAGLQWPG 
LGSLRQGFAMLARLVSSS 


12279 


26180 


A 


12395 


275 


3 


TFRLVI LKKSKETVEINCLETKSHSVAR 
AGVQWCSQDWWEEKAGFNWRPSKARRCP 
TSVLKCHLGWAQWLMPVI PALWEAKAGG 
SPEVRS 


12280 

* 


26181 


A 


12396 


111 


3 


GNNVKIRGLRIRPLKGWACWFTPVIPAF 
WEAEAGG 


12281 


26182 


A 


12397 


124 


1 


VSRGRTVHQEKPTERAASPRLECNGTIS 
AHHNLCLPGSSNS 


12282 


26183 


A 


12398 


259 


387 


YQQESFSCLFIFIFLFFIFYDTESRSVA 
QAGMQWHDLGSLKPP 


12283 


26184 


A 


12399 


103 


) 


KGRFFF FFF YETES RS VAQAGVQWRDLD 
SLQAPP 


12284 


26185 


A 


12400 


308 


416 


CGNNFEKAGRVRWLTPVI PALWEAKAGG 
SLEVRSLR 


12285 


26186 


A 


12401 


92 


1 


HTWGLTQWLMPVIPALQEAKVGKLFEPR 
SS 


12286 


26187 


A 


12402 


265 


388 


GLSWLFRRPGHLFYFIFFBTMSVSITQA 
GVGCCDLGSLQPP 


12287 


26188 


A 


12403 


279 


1 


SLSSKMESGSLTFSTIDIWGWI ITLSCC 
PGHCRMFSS IRGLYPLXtASSTTS SSCNN 
QNSPDMGTCSLV?SQLLRRLRWEDCLNTG 
GRGCSEPGS 


12288 


26189 


A - 


12404 


387 


152 


NPPPGFNFGGPLKKNFFSPPRGEKFVFL 
KRP PP FFFFFFFFFETESHS VAQAGVHW 
RDLGSLQALKSFCDIiVSAFKGF 


12289 


26190 


A 


12405 


262 


464 


1 1 FHWIiACILVIHS SFDGHGIiFLPLAYA 
DSAAVKHSCTSICLNIEYIPGNGIAGSC 
GISVLNFLRNC 


12290 


26191 


A 


12406 


204 


67 


KCFGQAWWLMPVIPALWEAEAGRSQELR 
SSRPAWAKELLNNRTRAS 


12291 


26192 


A 


12407 


134 


1 


IQRPC IQLFSTFFLI LFYFI FTE IDSCS 
VAQAEVQWHDFGSLQP 


12292 


26193 


A 


12408 


145 


403 


TYMYSLINYYKANSHVTKVNKHNTARFL 
ESLLCAHPNP PIKLI ISLLLRE KKKKKK 
KKKKKKKKKKKKKQK 


i 12293 


26194 


A 


12409 


65 


415 


RNKRI ISQSWNGPSRKPQSSALLOGLGH 
LTSGVRARVS PVQGCLVRI KGGSGPSKP 
KKKKKKKKKKKKKKKKKSSKKKKAQKGG 
ALKKK 


12294 


26195 


A 


12410 


124 


404 


PWWEFLHHRNCQMWPGTVAHTCNPGTLG 
KKKKKKKKKKKKKKKKKKKSSSS3S 


12295 


26196 


A 


12411 


164 


2 


TGAGLTIMDMVGVGPDLLQLRFCFVLFC 
FLVFFFETESDSVAQAGVQWCNPSSL 


12296 


26197 


A 


12412 


114 


3 


PGMVAHACNP S TLGGQGGW I TRGQVYKT 
GLAKTVKP 
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SEQID 
NO: of 

n UL-lcUlIUc 

sequence 

■ 


SEQID 
NO: of 

sequence 

• 


M 
eth 

uu 


SEQID 
NO: In 

09/515,1 
26 


Predicted 
beginning 

n it j*lAn£i#t ^ 

nucicuLiuc 

location 
correspond 

in cw 4 n fitted 

ing tu iirsi 
amino acid 
residue of 
peptide 

CIUIIIMIM 

Msquciive 


Predict- 
ed end 
nucit>* 
otide 
location 

corrcspun 
ding to 
last amino 
acid 

Mnr i/l 1 lO <\T 

peptide 

5CQUCI1CC 


Amino acid sequence (A=AIanine 
OCystelne, D=Aspartic Acid, 

iL^oiuia rnic i\ciUy f^jt ncnyiaianincy 
G=Glycine, H=Histidine, l=Iso(eucine, 
K=Lysine, l>Leucine, M=Methionine, 

Q=Glutamine, R=Arginine, S=Sertne; 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
a. — uniuiown, — oiop coaon, / — possioie 
nucleotide deletion, \=possible 
nuciGuuuc insertion 


12297 


26198 


A 


12413 


583 


391 


LADTGFHHVGQADLKPLTSCSTRLSLPK 

fWnVR n? P PHT VPTYTVT .QTP(TT . T C PTTC 

FLPLCTFR 


12298 

4 

? 


26199 


A 


12414 


280 


2 


KVSSMNAQTSVLFSTMLGI SETLKYRGR 
5CITPR BPGPRfiflP EflP E P VDR f*P WP QPP 

IEGGKKRGRKEYREEGRRRIEVSANEBT 
PRDHPANK 


12299 


26200 


A 


12415 


264 


352 


RFLVHGCDIYICINIYTHTHTHTHSHTH 

. in 

T , 


12300 


26201 


A 


12416 


359 


485 


TDARMDWCYQEGTGSQAWWLTPVI PAFW 

PlVPIHT/lGTTPtTPCT P 
EtACiAvj\?o f iiiVKo LiK 


12301 


26202 


A 


12417 


I7l 


1 


PLFRWWGFELFPPFGYQKKKGYPPFFFF 
ETEPCCVTQAGMQWYDLRSLQPPPPEFK 

K 


12302 


26203 


A 


12418 


265 

• 


1 


WHKSNEELAAEASXpt/KARASNTILGPH 

•Jtin VDD POTTOT YnxrTrT T\C*/*rr rargTOI L'L'PPP 

iHoi\JaKL.F VoLiKJN VIjJJ&GVKoTNF FSF F 
ETDSHSVTQAECSGAISAHOTLHIiMGSR 


12303 


26204 


A 


12419 


278 


388 


SSLTGFFIRRRNEEPGQHGBTPCLLKIQ 
KLAGCGGVL 


12304 


26205 


A 


12420 


205 


413 


WNJj rvi it Y Q j7HKYAR,yY S YN 1 1 LFTNKK 
RAIGQAEWFTP 1 1 PAVWEAKVSRLPEVR 

G\7T3T OT TJ VTaTD TTO 
t> vKJuoJjJr J\WK1. 1 o 


12305 


26206 


A 


12421 


103 


2 


RSLSKHRTETILGLPAAVLI ILFPPLLI 


12306 


26207 


A 


12422 


372 


478 


NIFVFLKETPCQSQWLMSVIPTLWEAEA 
GGSLEPR 




ZoZUo 


TV 

A 


12423 


1 74 


416 


KMH x v- Vlio At? Jb X JjHijUTVAL I i*b I CSTJj 
DMDQFMRKRIEAIRGQILSKLKLTSPPK 


12308 


26209 


A 


12424 


572 


733 


RFSCLSLPGSWDYKNCIjNSGSRGYSEPR 
<; hhp t p Ziunrrpnn q v<5 TrnMwrxrn c n 

OrU1\*±cJvtl V J. dyiJo V dAXJJNJYi K^oH 


12309 


26210 


A 


12425 


123 


2 


GYIFIFIYFFETGSCSVAQAGVQWRNLG 
SLQLPPPSSSDS 




ZOZl 1 


a 


12420 


224 


417 


skftaaglqknifflsapnfialavvlk 

RRDWETPG 


12311 


26212 


A 


12427 


us 


3 


FFETGSHPVTQTGMQWCLKNSIiQPLTNR 
LKQSSHLSL 


IZJiZ 




TV 
A 


12428 


250 


398 


PVIPALWEAEAGRSLEVRSSK 


12313 


26214 


A 


12429 


215 


3 


KKGLWWGANKL WVFTPQKFFLNQ FSGW 

altprgkswfskrrapapffffffwete 
shsvaeagmqwhdl 


12314 


26215 


7A 
A 


12430 

• 


345 

• 


2 


r bHiif AKFQlio£ir IKUPIjGIjEGKETRPR 

lndrfmagpsfpaqqglrmpflgkewer 
ffffffeags psvtqaecsgs itancsl 

EL 


12315 


26216 


A 


1243I 


348 


3 


LYSSLGDRVKLRLEKTKTNKWCLHSYIS 
TCKI INLDHYLIPCTKTNSEWI KYLNVR 
TNSIKLLEEDISVNj^ELGSDNSFIiAMT 

pkaeiikekiyklhfikikifhvsedii 

EK 


12316 


26217 


A 


12432 


160 


52 


ntaighphqaiylsyls iiyls trLsx y 
lsiylsiyl 


12317 


26218 


A 


12433 


180 


2 


GRSKLHICREHSICAIEHVCECDTENNT 
NLCQVQWLMPVT LAI WEAEAWRS PEVRS 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 


eth 
od 

. 


SEQID 
NO: In 

USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acta 

residue of 

peptide 

sequence 

■ 


Predict- 
ed end 
nucle- 
otiae 
location 
correspon 
ding to 
last amino 

A A>t#t 

acio 

residue of 

pepnoe 

sequence 


Amino acid sequence (A~Alanine 
OCystelne, D=Aspartic Acid, 
j^Hjiucamic aciu, r— rnenyiaianine, 
\j— oiycine, jci— xiisnaine, i— isoieucine* 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 

vj i u ta m i n e, i\— Arginine, o^^denne. 
T=Threonine, V=Valinev 

VX/ ■ — Ti-vntnnhan VsTvrActnA 

w — k rypiupiiaii, i— lyrosioB, 
X=Un known, *=Stop codon, /= possible 
nucleotide uciciiun t v— possiDie 
nucleotide insertion 
















1 T) 1 O 

12318 


26219 




12434 




i 


Pfi&FNP P P KfifiP P FF FP FPRGF P PGVFF 

LKKKGFPFFFFFFFDTESRSVAQAGVQW 
HDLGSLQ 


12319 


26220 


A 


12435 


321 


Alt 


NVEGDFLQR I KNI RYWPGAVAHTCGPST 


12320 


26221 


A 


12436 


116 


2 


HDRFCLFVYDRVSTLS PRLECS SAI SAH 
CNLRLPGSSN 


12321 


26222 


A 


12437 


302 


400 


TH THTHTHTHTHTHSUJ-JjI X LF KAiKLJS H 
EFCW 


12322 


26223 


A 


12438 


329 


399 


LALQARLECNGI ISAHCSLCLPY 


12323 

• 


26224 


A 

• - 


12439 


310 

* 


2 


KPI LSRGLFCARPCFRRSNMRS FNPSQS 
YQVGKDTFGFTVKVTKLKVLGPSRGPKG 
ARQGWKGKCLLTLPKMLRPLFF FETES 
CS VTRLECSGAI LAH CNL 


12324 


26225 


A 


12440 


151 


401 


EGMPHLPGSHSSSNILKRGRESGREAGR 
KGRKERRKEGRRERRK2EEKRGRGKTRR 
RKDRGENEEEENNRLSSKVSLEISTCP 


12325 


26226 


A 


12441 

i * * 


276 


1 


FHPFGRPSVSPRVQPKIGI I IHTCVARI 
EKGGSCKALSLCPQYVGSNFFFPFETRS 
FSVAQAGVQRCDLGSMQPPHFPGSSDPP 
APASRVAG 


12326 


26227 


A 


12442 


140 


1 


NSHFPKNIiGSPSSFFFFFYLETESRSVA 
QAGVQWLDLRSLQPPPPG 


12327 


26228 


A 


12443 


310 


3 


LPPCSWGLYGAFARCPLMDDKQLSQVPL 
WACPLSTAGRTRLCVACIKAAGKAQGFF 
FFFETESHSVTQAGVQWHDLGSLQPPTH 
AKPKHTHTHTQTHTI PDT 


12328 

■ 


26229 


A 

- 


12444 


337 


2 


GFFAI VPKKGQKKMFSPFPKEGI FSGPC 
FISSTPTLLGVKKI LPPNTS E I RP VLS S 
I INTSVHIREQKKTTYVGIFTGKSRFFF 


12329 


26230 


A 


12445 


112 


9 


GRVRWLTPAI PAL WEAKAGRS PE VRNS R 
PAWPTR 


12330 


26231 


A 


12446 


172 


3 


I PGELLIGRAKFKAKNWPWAPRGI PLGR 
QKF5F r rrr rSTbbKo VAUA^MUWJ-'WAAji' 


12331 


26232 


A 


12447 


221 


418 


RKAERLEVFFRRLCQRALRP FPHCLAAA 

FMJr Li JL V JatdfJr X LnjKAW W Jj 1 rvl Jr ALiWiiiA 

EAGGWLESRS 


12332 


26233 


A 


12448 


280 


392 


PLGRVRWLTTVI PAHWEALAGGSPEFRN 
LRPAWPDMA 


12333 


26234 


A 


12449 


410 


129 


PSQEIFPlPGPPGKTPr PJjKKKIuONRGG 
GGGLPFPPPQKVKKKKFFYPGRGRFKQP 
KFNPLPPPRGEKKNPKPPPPKKKKKKEK 


12334 


26235 


A. 


12450 


159 


1 


KKKRGGFIiGLTF I KKTG PDP F F FFFETE 


12335 


26236 


A 


12451 


270 


1 


KPLKFFFFSLFQNPLKKDPAIFNPFFFF 
FPR I FWP FFGGVS KAG I WFFF F PFFFFP 

C *^ aVC II AT A7 A* *J\J V mmf A^A m %A& mm- V V «b A* Aw Am AV «W Am AV Am Am 

FFFPPFETES RS VAQAGVQWRDLGSLRT 
RGRTRG 


12336 


26237 


A 


12452 


249 


33 


DKLVKLPLILKKLNSFFFFFETESRSVA 
QTGVQ V7RNLGSLHP P PPG FKPNAWAYAW 
AHAWWPGLPAQNLR 


12337 


26238 


A 


12453 


190 


86 


TNDLGWTQWLTPVI PAL WDAEAGRS LEL 
RSTRKA 


12338 


26239 


A 


12454 


318 


2 


P PTFPLKTLFFPVFLPKLFSTLFSPKKK 
FSNFLVSAP PFFFLTLGNSQGVLLKGG P 
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SEQID 
NO: of 
nucicOLiuc 
sequence 

• 


SEQID 
NO: of 
pepouc 
sequence 

• 


M 
etb 

nd 


SEQ1D 
NO: in 

09/515,1 
26 

♦ 


Predicted 
beginning 

nucleotide 
location 
v-urreopurm 
ing to first 

A in i ha ■ o i\ 
mil in u adu 

residue of 

peptide 

sequence 


Predict- 
ed end 

nucio- . 
otide 

InpafSnn 
luta HUH 

correspon 

last amino 
acid 

residue of 
peptide 

SCIJUCIICC 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

E/~\jl U I a III II. ACIUj *■ — J ucil j laiUHUJC, 

G=GJycine, H=Histidine, I=Isoleucine, 

If— T vcino T — T am»ino M=Mo+h inn in n 

N=Asparagine, P=Proline, 
O=f^liitomtnp R=Art»inine. SfisSftrinc 

^ VJI U IXt 111 1 XV TM I glUIIIV, WVl UK) 

^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=^top codon, /^possible 
nucleotide deletion, \=possible 

iiucicuuuc iiiscruuii 














FFFFFFFFETESRCVSQVGVQWRDLEKK 
EKKKCNSNGGRHYISTHLKYK 


lzjjy 


Z0Z4U 




1Z4j3 


ZU4 


Zoo 


T OT .WTUTUTTJ TPT tTTH THTH'I "W PTOTCMW 
•Luxjim ininifiixiifiinixi ± a i. n Ci ±.\Ji\rin 

I 


12340 

■ 


26241 


A 


12456 

• 


40? 


13 


QGGAPPIPIMLSFLQEAGLLHVAQPGVK 
LLTSQDPPSPPSKSAWRQRQETRLNLQQ 
prep g t?hp pppp pp zvwa Tf>nn wQVPWVft 

TKNNKKIRKSLCKKTLEKEMLAVMI ITK 

VT.T.PTnVDQTVT .PTHT .YVT.T 
T i«ii* LLfVtrO X V X n±J 1 1 Jj X. 

■ 


jzj4i 

i 


Z0Z4Z 


Jv 


1Z4D/ 

1 


ZU4 


1 
J 

1 


PFVAQAGFKPLGSSDPPIIASQSXAITG 

1 rill XXX* X w DAT XX 


12342 


26243 

4 


A 


12458 


289 


3 


KAREKKKVGATKNI RI VDLS KGRTS LCK 
KTEMPPVGNRNKQKPQNPSNLRISTRPE 
TESRSVTRLECSGAISAQVILPGSSDSP 
ASNWRQSETLS 


1ZJ43 


Z6244, 




IZ45i/ 


261 


JOO 


KTTKI SQAWWHAPW 


12344 


26245 


A 


12460 


313 


391 


RVGRA/HWLKPVI PALWEAVAGGSPEV 


12345 


26246 


A 


12461 


112 


7 


H I C I Y FFETE SRS V^QAGVQWHDLS S LQ 

PPPACarFK 


12346 

* 


26247 


A. 


12462 


389 


3 

■ 


ALHEGAARGS PPPGGKKKKRGS PFPMGN 
KPGVFYKPKKVPTPKDPPPPTLWGGWGT 
WRHLGPLQPQPPAHAS 


IZJ4 / 




a 


1 ^AtLI 


Zo 


J4V 


WQFLFFFFKRGVYFFFPGGIKGPGFGLR 
EPIiP PGT KGVPRPNPLGGGDLRPP P PPP 

INLGFFKGKGVFMVWPGVGNFL I 


1ZJ45 


Z0Z47 


A 


1Z400 


1 C7 

15/ 


D4 


EVRSSRPAWPIWRTRG 




ZOZJU 




19 AAA 




jj 


1 1 OIjROONS FIi FI CFD IKS CS VADAfi VO 
WHNLSSLQLEVPGFKHAPSH 


12350 


26251 


A 


12467 


107 


3 


FAHCLIGNSFFSFFFFFFEVGSCSVAQA 
GVQWSD 


12351 


26252 


A 


12468 


120 


1 


GNW CRAQWLTLVI PALWEAEVGRSPEVG 
SSTRLGLPTHRP 


1ZJDZ 

1 


ZoZdj 


' TV 


iz4oy 


1 AC 


Z 


wt . v p t ppt .p pi .rty3<3 cg VTOiinvnwTrn 
LRSVTS 


TOIL'S 


Z0ZD4 


71 


1Z4 /U 


Olfi 

Z-Jo 


/1AC 
4UD 


PH T PLP T T .TCNOVS KMYTHTVS PfiTjTjPt T T 

YCCLLQWLTPVI PALWETKVGGSLEVRS 


12354 


26255 


A 


12471 


425 


1 


LNPAPAIFFGGPKKKNFFSPPRGDKFFF 
FKRAPPFFFFFFFFFFFFQKKKKPFFPA 

riTT V ftt? R PPfiP i <?0 WP"LaCJPIjlC T TT1 PP D 

\4 r^r\\Jt\\3 C tr\3ty^HJ~ii3\i yVArxxMAjX" XJXN.X xy P lr Jr 

WSRVPFSPLLFFFFDMESP SVADAGVIiW 
HDLLHL PGS S D P P T PAS RVAG I TGVCHH 
A 


12355 


26256 


A 


12472 




2 


QNKYS ELRMNTFGRPQWFT P VI PALWEA 
EAGGSPEVRSSR 


12356 


26257 


A 


12473 


144 


420 


GAPRP I SRPRGKTPGGGGKRGKPNKNGL 
GENPFPPKGVKGKNPPIFFPFLGQNGPT 
PLVLGS ANMG FYLYTiKRKGPGKGREALR 
AEPGQPFY 


12357 


26258 


A 


12474 


80 


1 


KGGQVQWLT PVI PAFWEABAGGS PEV 


12358 


26259 


"A" 


12475 


54 


294 


LIKSWQHRQEYVSLMSFIFFPCHIWYQS 
QRS PGTTQI ILE I SDTCKNKYFNCTGSH 
PJCHTHTHTHTOTHTH1TIAHTHTL 
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SEQ D) v 
NO: of 
nucleotide 
sequence 
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SEQ ID 
NO: of 
peptide 
1 sequence 


M 

eth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 


i Predicted 
beginning 

1 nucleotide 
location 
correspond 
ing to first 
amino acid 

I residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


| Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid, {^Phenylalanine. 
OGIycine, H=Hlstidine, l=Iso!eucine, 

K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Areinme, S=Serine. 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stopcodon,/=possibIe 
nucleotide deletion, \= passible 
| nucleotide insertion 


12359 


26260 


A 


12476 


364 


234 


EEMGFHHVSQDDLNLLNSW3ACLQLPKC 

1 wdyrsetsrlaslpf 


12360 


26261 


A 

■ 


12477 


288 


450 


PGTVAHACNPSTLGGPGWSRTPGLKRll 
J CLGLPKCWDFRRELLPSLPCLLIFL 


12361 


26262 


A 


12478 


271 


396 


TISFLSFFFLFLFFFFFFWRKSGRESRF 
j SQKKKKYKKKNIYY 


12362 


, 26263 


A 


12479 


289 


2 


GS CLLERKLTYRKD IHTKrtPSVPHHHOR 
PKVDKTPKTGKKQGRKTGNSKNQSASPP 
PKBCSSS PATEQKWMENDFAKLREKGFR 
| RSNYSELQEEI | 


12363 


26264 


A 


12480 

- 


323 


t 

• 


YSMVYPYVFLTAKKDQLCVNNTCiliWaCfl 
7QLYHC INHSTLOTHNISTLMIIiGHIPQ 
LCI P VNLS EAWAAT PALHFVKLLLTELT 
1 HHVCRAIiDIIILAIVSLVALITS | 


12364 


26265 


A 


12481 


279 


429 


I mngqrkcvyvytlytytteyVS'tLKrRK 1 

1 WPGTMAHACKPNTLGGHGGWTA j 


12365 


26266 


A " 


12482 


308 


423 


LTFFKN I TGFFFFFFETEFCFVPQAOAQ 
GGDLSSLKAP | 


12366 


26267 


A 


12483 


' 272 


474 


MVSWGLSRAKIPWVLSFFFFFFKKRWFP 1 

FQGGGRGGDIiSWEQPLPPRSKGFSCFTIi 
P$TRDYRPAGP 


12367 


26268 


A 


12484 


84 


229 


LFKSVCHLLPLSSCSSHVRQACLPFTFC 
HDCKCSEASPAMLPVQPAEL 


12368 

_ 


26269 

• 


A 


12485 


392 

• 


3 


TGKKKRFPFFPKKKI FNSKPQFSWRKKK 
LFKI FS P PGGAPF P PGVQKKKTFFFFYP 
FSPRTPFFSIFFPGKGPPKGGGLGPPFF 
PQKRGGGFPNSKWETWEKFPPLFFFFFF 1 
ETESHSVAQAGVQWHDL j 


12369 


26270 


A 


12486 


76 


. 1 | 


KLI S WTWWHMP WPATWRAEAGDFS | 


12370 


26271 


A 

« 


12487 


204 


8 


FFI FFFFFFFFFFFCFKTGSHSYAQAEQ 
EWHDHGSLQPQPPQSFHLSLPCATMPGL 
FFLFLFLFT 1 


12371 


26272 


A 


12488 


96 


3 


KKPLGQAMFFFFFETES CS VAQAdVQWC 
DIi | 


12372 


26273 


A 


12489 


126 


1 


KP PHPFLPFFFFFETESHS VAQAGVQWC 
YLGSLQAPPPGFTP | 


12373 


26274 


A 


12490 


107 


2 


IAGRAWW LMLA I P TLWEAEAGGS LE PRS \ 
SQPTCAT 


12374 


26275 


A 


12491 


302 

* 


2 


GGFG FFP PRE KGG FFQTVL FGVP PGF FS 
P P VFKTGPGVFFLGAQKKKI FFPPPGGK 
IWFFLRGAPLFFFFFFFFFFFEMTSCSV 
AQAGLQ WCDLGS LQHP 


12375 


26276 


A 


12492 


248 


398 


PTEQVTLGITAQSYSRVHIUJNRVYD)^ 1 !/ 
GSGHPDGAAA I KGS FVQRLKS Y 


12376 


26277 

- 


A 


12493 

- 


341 

- 


1 

• 


KTSHFRE I YPLEHAS SLSKKI ETGALP C 
SQELLNSQENSVMNKFPCLNQIiPFQDVN 
SCI IRFQTKDTLKHTAIQPEAKTLSLPC 
YTHTHTHTHTHTNNHS I FELLCMQCDS Y 
N 


12377 


26278 


A 


12494 


95 


3 


PLFFFETKSCSVAQAGVQWQNLGSLQPP 
PP 


12378 


26279 


A 


12495 


62 


420 


CAAKLCTEAHTAASADTHTS PHGSVSGL 
FCFHFPPHRREQRSRPGRKPGAHRLAGR 
ALSQKP VGSGATPHNLHHQ I RTQTNGL I 
QLLGDRQAPWVTPVLPVLWEAJBAGGSLE 
ARSLRPA 


12379 


26280 


A 


12496 


58 


492 


NSPPPPAPSQRTSPPAAAASPTTGSSSA 
PCPASSSWPRSSIALITFYPDQPCGLSF 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


seq n> 

NO: of 
pepnue 
sequence 


M 
eth 

aH 


SEQID 
NO: id 

09/515,1 
26 

• 


Predicted 

beginning 

nucieonue 

location 

correspond 

ing to first 

a mi do a ci a 

residue of 

peptide 

sequence 


Predict- 
ed end 

mm mm A 

nucle- 
otide 
location 
correspon 

J ■ mm tTW W£%. 

oing co 

last amino 
acid 

resiuue oi 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
xv— v»juia[ii ic acioi r— rnenyiaianine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Aspnragine, P=Proline, 

f~\ — 1 * 1 4o m in a PcA ft*o iaIna CsCftvinft 

i^^vjiuiaiiiinC) Jx^ArgiiimCj o^-ocnne^ 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
A E unKnowii| — oiop coaon, ^possiuie 
nucleotide deletion, V=possible 
nucleotide insertion 






■ 






• 

■ 


NNLAWKQNVLCFKERLLEKMFRRIQVLG 
JuLiVn I blAbliS Xr KVJrArvjW VFlr VAVr I 

WVT .TVFFT . T T Y T TMTYTP T POVPWTTVr* 
vvvxjx vr run iiit'iii xkx ,cy v trri x x v v_ 

LCFKR 


12380 


26281 


A 


12497 


2 


198 


RLQG I LS AGFLVPGDTSVS S PRPRAAGS 

Uoo XoftVljVVJ\Xj0v/v*XuJtvuX ViiVrHjVoiir 

TLQPAAHTG 


I23ol 






12498 


71 


340 


uHrinl oiS-Ci xr Xn iy rUuiNoUrArlXAxAJr . 
GGLSAKAPAMTPLMLDTSSRKLVAWDGT 

TITf32i 21 WIT T .AV21 ATVYPQTTT/TI? VlfQ/yPP 
X UUAAVuJLiJH.VAnLIU10 X 1 XjX C X X r. 

RYEDVLWP 


123o2 


2o2o3 




to Ada 


122 


5y 


UriTVT VTU r PUTU T PtIT VTVT VT 

n^x xxx initiininii x x x x x 


12383 


26284 


A 


12500 


220 

4 


3 

. 


PKKESVRCWAMKWIIiLILCQTSIiHWSL 
FUoSLiAAi FCLIaNLKjx VUVxjytjJjuAVAJa 
AWNPSTLGGRDGWITS 


12384 


26285 


A 


12501 


125 


1 


inrrxrr oudt t?\it tt cEfPTPODoiiTDnvr) 
xi v Jj ViibHr JjJp VJLir Jbr rbl bbKo V x KFKK 

S S AVS AHRNIRLS 


12385 


26286 


A 


12502 


373 


476 


ILTRDGVSLLPRLECSGTI SAHHNLCLL 
USUDYP 


12386 


26287 


A 


12503 


440 


573 


SWEDHLSLGNRGCSEPRLHLCTPTWVT 

E M 


12387 


26288 


A 


12504 


146 


5 


HPHLQTMNGAPIPQELTPLLEKERDGLR 
CRGNRSPVPGI PPRWSRDL 


12388 


26289 


A 


12505 


140 


44 


AQGLTP VI PALWEDEVGRS PEVRSSRSD 
WPTR 


12389 




A 


12306 


247 


420 


XjCiXb V 1 r fc>LIfc(r\jCJLPKKJiln.l HlHlHiH 

THTHTHTHTHTYI RT I YTTLARPLS ALH 


12390 


26291 


A 


12507 


346 


473 


KAY I IGLNCFYETKSHS VAQAGVQWRDL 
NSLYS 


12391 

- 


26292 

• 


A 


12508 


87 

• 


414 


RTGVYR VGKDGRS RS PDLL I CPPLGLPK 
LSPFLSFSLPCSPFLYIFFPPPLLPISL 


12392 


26293 


A 


12509 


386 


467 


LLLFFEMESHS ITRLECSGAILAHCNIi 


123 V j 


202V4 


a 

A 


12510 


344 


472 


Jvr JjFt UrAVbxjxAiX XFiUSislvbr xyiUJiL. 
TRMFIAILFriANTW 


i 


2o2V3 


TV 
A 


i *><1 1 
12311 


.424 


132 


K.LrEilaFi'KokyXXjo «• VV* fFLaUi X ir FrXjlvK.y It 

ITRGGGGGPLFPPLKRVRGENSFPPGGK 
SFHGAKFPFCPPPWATKRNSVSKKKKKQ 
TKKGSRIILYNNE 


12395 


26296 


A 


12512 


351 


510 


GGTDF CKQS KLQSLQPAVE I F I PGRVQW 


12396 


26297 


A 


12513 


362 


463 


NTITWLGAVAHACNPHTLGGPGGRITWG 
QEFETS 


12397 


26298 


A 


12514 


137 


1 


EEISLLLPRLECNGAITAHHNIjHLPCSR 
NRRPGKAVHVRQAYSFQ 


moo 


2o2yy 


a 


12313 


231 


5y2 


FVESVLLG I VILLAYRLEFTDTFPVHTQ 
GFFCYDSTYAKPYPGPEAASRVPPALVY 
ALVTAGPTLTILLGELARAFFPAPPLAG 
PV 


12399 


26300 


A 


12516 


3 


413 

* 


SWGGRKFLCPPRSLSGSRELHPAQGDRP 
GPLSSSGKRETGTHRETLGKKKVSSAPE 
AQGAGLRLSQALGGLC YDLS PLTE PRLP 
LAATAFPRPCPALPHPQPRVTMGSVSSL 
I S GHS FHS KHCRAS QNKLRKS SHLK 
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SEQID 
NO: of 

nucicutiuc 

sequence 


SEQID 
NO: of 

pep li lie 

sequence 


eth 

• 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


i Predicted 
beginning 

nnrlpotirip 

MM UlilCUllUC 

location 

crt rresnond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nude- 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sentience 


Amino acid sequence (A^Alantne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenvlalanfne 
G=Glycine, H=Histidine, I=lsoteucine, 
K=Lvsine* L= Leu cine, M=Methionine» 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valiue, 
W=Tryptophan, Y=Tyroslne, 

X=TIn known *=Ston codon. /=DOMihle 
nucleotide deletion, \=possible 
nucleotide insertion 


12400 


26301 


A 


12517 


115 


2 


KSHGGHGVEVLVSVLVOLAMEVQLVLWV 
KEYDGAALA 


12401 


26302 

■ 


A . 


12519 

* 


444 


34 


YCLRQKHTE YRRRSNLEMAS S CLSORKT 
CTFFTLYQNLKM I KLS EEGTWQABWLKS 
RPLAPNSKWEAKEKFLKKI KSVTSLHT 
YI LRKP NS L IADMEN I L VI HIE YQTS YN 
IPSSORLIGSKALCLFNYLEMECRI 


12402 


26303 


A 


12520 


1088 


935 


ILIIFINYRCEFLRSKKSSEEITQYIQS 
YKGFVDRTVMYNSNFFLKFLDYL 


12403 

• 


26304 


A 


12521 

• 


2 


347 


AMAAGSRTSLLLAFALLCLPWLQEAGAV 
QTVPLSRLFDHAMLQAHRAHQLAIDTYQ 
EFEETYIPKDOKYSFLHDSOTSFCFSDS 
IPTPSNMEETQQKSNLELLRISLLLIE3 
WLE 


12404 


26305 


A 


12522 


50 


205 


WGE S DRQS SGFWASS AHACNPTWEVET 
GGSLEARSWRLOCTMFASVKSTRP 


12405 


26306 


A 


12523 


105 


2 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLYER 


12406 


26307 


A 


12524 


279 


455 


KTKNWLAIjGGPPCFPGQYGETPSLLKNE 
KLAGHGGAPLYSRLiLRRLRRGNSLTPGD 
RGC 




JLOjvo 


TV 








LYFLLFPFOI INRMVLFFVFCFFFETES 
HS VPQAGVQWRDLGSLQAPPPG . 


12408 


26309 


A 


12527 


363 


459 


CTILGQAWWLTPVILALWEABA3RSPEV ; 
RGSR 


12409, 


26310 


A 


12528 


182 


36 


QH I WFHKEHGQVRWLT P VI SALWKAEA 
GRS PEVRS SNAAWHACRNS A 


12410 1 


2*311 


A 


12529 


388 


530 


KSFLNFFPRDGILNCVPQAGLELGSSDP 
PASAFRVAGTTS VWHHSQ I 


12411 


26312 


A 


12530 


134 


1 


GHKKGVLFFFFKTESRSVAQAGVQWCTL \ 
GSLQPPLPAHATRPRV 












i 


SQKREKPFFFFFFETEPHSVARLECSGT 
ISPHCNLHLPSSNDSOASTS 

AT A»*A**W*» W X^ Ai Mw^4*LJ A 


12413 


26314 


A 


12532 


145 


3 


KKLE ILS AI I PPKPRKE I FLFFFYETES 
CSVAOAGVOWRDLGSLOAP 


12414. 


26315 


A 


12533 


365 


2 


LFFFTSLIWQITLGFIilLNCPCIPGVCI 
YIQFANTFRKFAFLFTNEIAYNILILPL 
TKQDYTNLI KL I KKHS FFSNFLKQLLFV 
FCFVFVS LFAMES CSVAQARVQ WRNIXJT 
LQPPPPRFK 


12415 


26316 


A 


12534 


313 


475 


FFWS CSSFQQGFGGGITI III ILRWILA 
liSPRLERKWRDLGSLOPSPSGFKRFF 


12416 


26317 


A 


12535 


124 


3 


MGSLI MGAELSVYRQVCDCKPQGVCVCL 
CVCVCVCASTCM 


12417 . 

• 


26318 


A 


12536 


333 


58 


ACNPSKVRSCRPAWSNMVKSCLSKNAKI 
TKEWWWANFFFVFVFLVEMMFHHVGQLS . 
LKLLTS SDLAASASOS YGITGVNHYAOP 
ARTRGSFR 


12418 


26319 


A 


12537 


214 


3 


SD PRRACRC P KEAPEQQRRLPGGSVP AL 
SIFFFFKTESCSVPLSPRLECSGVISAH 
CKLRLPGSCVPPAS 


12419 


26320 


A 


12538 


146 


1 


MNFIiAFTNPPRPPQQILKPLLFILRRES 
HSVAQAGVQWCDLSSLLPPA 


12420 


26321 


A 


12539 


195 


1 


IFI PPQAQKRGDP FLS FFFFETRSPSPR 
LECSDAITAHCSLHLPGPGEPPTPPIPK 
SWDHSHVPP 


12421 


26322 


A 


12540 


151 


334 


LLGRLRHKNHLNPGGGGCSGTIMALCNI 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

• 


SEQID 
NO: of 
peptide ■ 
sequence 


M 
eth 

OO 


SEQID 
NO: in 

1ICCM 

09/515,1 
26 


Predicted 
beginning 

IIUHCULIUC 

location 

Wl I wUVUU 

ing to first 

nmtnn nciri 

ilUIIIIU uvtu 

residue of 

peptide 

sequence 


Predict- 
ed end 

mtf*|p» 
uuuc* 

otide 
location 

correspon 

Hino to " 

last amino 
add 

residue of 

l/VUIIUV 

sequence 


Amino acid sequence (A^ Ala nine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. P=Phenvialanine. 
G=Glycine, H=Histidine, ]=Iso leucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
0=GIutamine, R^A ret nine* S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion. \=oossible 
nucleotide insertion 














LLPRSS SSPTSAS KVAG ITGACHHACCL 
RYFLY 


12422 


26323 


A 




1 1 A 

314 




NliNS G P AWVTLM P VI P ALWEAEAGGS PEV 
G 


.12423 


26324 


A 
A 






i 

3 


GS CLLEGKLPNRKDIHTENPSVHHHHQR 
P KVDKPTKMGKKQNRKTGN S KTQSVSP P 
PKERSSSPATEQSWMBNDFDEIaREGFRR 
SNYCELRED IOTKGKEVE I FE KNFERV 


12424 

• 


26325 


A 


12543 

i 


31 


365 


RS AVMPTEDRATWKFNYFLKI IQLLDDY 
PKCFI VGANDVGS KQMQQ I RMSLRGKAV 
ALKGNNTMMRKAIRGRLENNPALBKLLP 
HI RGNEGS VFTNEDLTE I RDMLLGNKV 


12425 


26326 

» 


A 


1 KAA 

12544 


2/5 


/tin 
41U 


FS PKNVTAGRVWWLM P VI PALWEAE VGG 
SPEVKRSRTTLANMVK 


12426 


26327 


A 


12545 


88 


276 


EVPQAHRKLPSAPQLRPLQAISTKGPGP 
PO S PO PSP RPEGS SAHS RGE KRTRERGK 
EEKRKKK 


12427 

• 


26328 


A 


12546 


251 


3 


PVEDNSKRRFQTGEEKVTQEEQEALTPW 
PTMFPPFSDRVSVAOARVOCCDS OSLO P 
QLPGSSDLPTSASQVAGTTGMRHHAC 


12428 


26329 


A 


12547 


106 


2 


DEHLPFLTS FETES ^SVAQAGVQWRDIjG 


12429 


26330 


A 


12548 


225 


1 


GFFKKEPHPRPCKKKTPPKLKPFWGGLN 
TC^I PtiS FVFS KKGPLPKKFFFPFFFFET 
GS CS CHPAGVQ WCDHGSLQ 


12430 


2633 1 


TV 
A 


1254y 






VP P PGTjVPGTjPO 1 1 PKGFPKTKGP P KGP 
PPGDKQKSGPPNKKNFFFFFLRRESHSV 
AQAGVPWCDLGSLQP 


12431 


26332 


A 


12550 


276 


24 


GPLKMVEAPPQNPLS PKNKI FFFFFFET 
ESHSVAQAGVQWRNLRLPGSSDFPASDS 
RGAGITGARPHVQIilFVFIiVKTGRKRR 


12432 


t£1tO 

26333 


n 

A 








KSASSMSFNSKGWGWWIjTPVIPALWEAE 
VGGSLEVR 


1 *iA 11 

1Z433 


xWJ4 


A 


LZjjZ, 


I JO 


1 
I 


RHTGSKTTATALVDVFLLLFFETESHSL 
SRLECSGTTPGHCNIiYE 






A 


I ZJJ_> 


<M 
iVr 


1 
I 


IFFCLFSFEMESHSVAQTGVQWCDHGSL 


12435 


26336 


A 


12554 


3 


349 

- 


HASGPEELSKDPSLVSQGQPHRKPGLKR 
CSSRPLGPPDKLGGEGKQGLLGFTLWLS 
GP I KPCDDEE KNKKKKKKKKKKKKKIX3G 
GPFKKKLFFPPGGGRNFFFLGAPKFFWA 
GRF 




ZOJJ / 




izjjj 


^-~7 f 


417 


KQLHLLQGRLFSPS FLP S I SKLFFFEME 
SRSVTQAGVQGCDIiASLQAPPPGFT 












403 


Ymrlsfqdlvmfkdvavdfsqeewecln 
s y qrnlyrdvi lenysn 


12438 


26339 


A 


12557 


286 


426 


DMLI KTCCHLVCE KKKKKKKKKKKKKKK 
KKKKKKRGGALKKKPLPPP 


12439 


26340 


A 


12558 


114 


1 


PLKEIANRHMKKCSSSIiAIRQMQIKTTM 
RYYFTPVTMA 


12440 


26341 


A 


12559 


106 


1 


FFFFFFFFFFFYFFFFFFEMESRSVAQA 
GVQWCDL 


12441 


26342 


A 


12560 


118 


2 


KNYSNNLKTNAGRAQWLTPAI PALWEAE 
AGKSPEVRNW 


12442 


26343 


A 


12561 


196 


2 


MCCFSKNWQEGAQMTPGFEPPQDNSEKT 
KLSDTRLLFFFFERESRSVAQAGVQWRG 

LGSLQAPP 


12443 


26344 


A 


12562 


128 


3 


RAPPFFFFETEFRSLPRLECNGAISAHR 
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SEQID 
NO: of 

Mil A IaA^I t^A 

n ucieoiiue 


SEQID 
NO: of 
peptide 

sequence 


M, 
eth 

0(1 

• 


SEQID 
NO: Id 

26 

— - 


Predicted 
beginning 
nucieonae 

lUCaUOn 

correspond 

mg iu Urol 

amino acid 
residue of 

' peptide 

sequence 


Predict- 
ed end 

mm muA 

nncie- 

oiioe 

location 

Currespon 

ding to 
last amino 

HC1U 

residue of 

pepuuv 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
£/-<jiutamic Acta, r— rnenyiautnine, 
v* = v»iycine, ti— rttstiuine, i— isonucine, 
K=Lysine, LHLeucine, M^Methkmine, 
rv^Asparagine, jr^Jrroiine, 
Q=Glutamine, R=Argfnine,S=Serine, 
T=Threonine, V=Valine, 
yr ~i rypiopnan, i = *yrosine, 
X=Un known, *=Stop codon, /^possible 
nuwicuiiuc aeienon, r^possioie 
nucleotide insertion 














NLRIPGSSDSPAS 

■ 


12444 


nc?A ^ 
26345 




12563 


277 


3/U 


MCt7TT.T7Dnar , 7AOUr»nT/2CTJ^TJlVT CD17VD 

FSR 








1Z004 

* 




Z~J J 


KFPKTGENFHTLNNKDKGPGS CFHRI I P 
fir J. v-vvjl/JJr 


12446 


. 26347 


A 


12565 


129 


3 


KYSVIiRPGTVAHTCNPNTLGGRGGQlTvP 
GWEFETSLANVAK 


12447 


26348 


A 


12566 


172 


1 


P PP KKKKPRP PKKKNFF r r rGAI S AHGN 
LCVPGSSDSPASPSRVAEITGSRCHABP 
Y 


12448 


26349 


A 


12567 


257 


380 


GI CRPLLGGVLQS GATGVRDPLEEAVCP 
LAELKHCAGRS TA 


12449 


26350 


A 


12568 


120 


r 


GVFL FCF VLFETES CS VARLE CSGM ISA 
HCNLRLLGSSNS 


12450 


26351 


A 


12569 

i 


250 


2 


IGKPKTPQFWFLKTKRGGPNAAFSFKKF 
FFQLKPPPRVFIiKIFSCQKKIFFFFFFE . 
TESRSVPQARVQWHDLGSLQAPPPGF 


12451 


26352 

* 


A 


12570 


338 


3 


VST PEKNFVFHTRGFS KKSSKYKVCNFF 
FQKKLGS P P FSLKKVFPGKP P YS CCSGR 
VSTAPYFIjNPGPSTKIJjPPFFGPIiGPEE 
KFTjSFFFFFEMECRSVTQAGVQWCDIjG 


12452 


26353 




12571 


5 


556 


I GCCIiCF KIN I r QIjHFNKINF F eVTRSG 
PVTQAGVQWCNLGSQQPQPLS SKQS 


12453 


26354 




12572 


1 15 


2 


AGITGARHD 


12454 


26355 


T\ 


12573 


279 


ill 


/~» /~\t rf\ rijT Mnin rr» t\ T7jvi'ct\c , t\ p/^t ty>neoots 
^UVUWJjMi' V VPAr WbUciAhbijJLiUi-'aobK 

PAWAT 


t2455 


26356 


A 


12574 


117 


1 


KTPLFFFFFFFEMESRSVAQAGVQWYDL 
GSLQAPPNGFT 


12456 


26357 


X 


12575 


253 


374.. 


S LPGrf^TVHACNPSPLGGRGRRITWGRKF 
GTSIiANMAKPCXi 


12457 


26358 


A 


12576 


130 


2 


KFPGPFLGFFFFFFEMESRSFAQAEVQW 
RDLiGPPQAPPPRFT 


12458 


26359 


A 


12577 


132 


3 


RVPGPPLLFFFFFFEMESRSFAQAEVQW 
RDLGPPQAPPPRFT 


12459 . 


26360 


w 

A 


12578 


3 


370 


JjKKNCIjriliGGRECSEIJunHCTPAWVTE 
RDSVS KiCKTLS VLMEKWPNFPLRPLALS 

EGADLDCYDPRTSPLFAKSWERNFPFPV 
HPGPVEGCPK 




26361 


A 


12579 


1*51 

131 


2 


c lib Lib x v Ltb Juil iiSbljbvAyAtjVyWrlJJV 
GSLQTPPPRFTPTRA 


12461 


263 62 


A 


12580 


AAA 

202 


2 


QTREQNWMENEFDELTEVGFRRWVITNS 
SNLKEHVFSC 


12462 


26363 


TA 
A 


1255 1 


i An 

109 


2 


Mt?r r r r r CiliitbKbVPUAKVUnKDXiualj - 
QAPPPGF 


12463 


26364 


A 


12582 


105 


375 


LIMILLPLQ ILVHS GTFCFLFFG FETEF 
HSTPRLECNGVMSAHCNIiCXPVKTSPAS 
TSQIS 


12464 


26365 


A 


12583 


106 


3 


RPIFFFFFFQTESRTVARAGEQWCDLGS 
IQPPPP 


12465 


26366 


A 


12584 


484 


700 


S S S WNRAF S RKKD KT WMHTP EAL S KHF I 
PYNAKFIiGSTEVEQPKGTEWRDAVRKL 
KFARP I KKSEGQKKKK 


12466 


26367 


A 


12585 


301 


1 


TQPKVRTQMKNNIFEHAFLSSTFPHPLP 
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. SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
pepuoe 
sequence 


M 
eth 

■ 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing 10 iirst 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue 01 

peptide 

sequence 


1 Amino acid sequence (A=Alanine 
OCystef ne, D=Aspa rtic A cid, 

j Ur-Lviutamic aciu, r— roenyiaiamne, 
G=Glycine, H=Histtdine, l=Iso leucine, 
K=Lysine, L=Le urine, M=Methionine, 
iv^sparagine, t^rroiine, 
Q^Glutaroine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y==Tyrosine, 
a— unknown, w — otop cooon, /=possiDle 
nucleotide deletion, \=possible 
nucleotide insertion 








• 






NLPADFYSATSQTKPDILQSMQKAFKNS 
oIoQAWWIjI PAI PAALWKAEVGGuLEPR 
NLRPAWVYGKTLSLQK 


12467 


26368 


a 


12586 


76 


379 


PLHPPASCPAPSLSCLIPGSPSDPLFSD 
SARPGWGRGWGRAPQLPPS PPFLYNLIK 
KWSRFCHQKKKKKKKKKKK^ 
KGFPFWRAPPFFFFGGGFFS 


12468 


26369 


A 


12587 


109 


376 


LEFFKMSGWPTAPEQPAGEMENQTKPP • 
DPRPDAP PEYNSHFLPGPPGTAVPPPTG 
YPGGLPMGYYS PQQPSTFPLYQPVGGIH 
PVRYQ 


13469 


26370 


A 


12588 


323 


1 


KKFCPPRFFPPLFGFQKKGTNPISPFSG 
GFKKNLGGFNPPPRPPWSPQIGFFLGRS 
KELIIA3LFPTRIGIGGFFS FFFFCETKS 
HPVMRLE CSGVI SAHCNLRLLVS 


12470 


26371 


A 


12589 


262 


1 


gdppttsgpqtnqpkehlmnfksdsqly 
eotlagrsvliknltpqti^prwtgpyl 
viystptavrlqdpphwvhrsriklcps 


12471 


26372 


A 


12590 


284 


374 


TESRNV7GWAQWLTPV I LPALWEAEAGGS 
PE 


12472 


26373 


A 


12591 


297 


387 


GQAWWLTS VI PAFWEAKEGGS PDVRSSG 
LA 


12473 


26374 


A 


12592 . 


206 

■ 


407 


VKPQNPRKAQKLKGLVYIHIYVYIYVYV 
YIYIYIYMYICKYISLTVVYTVNSKEKG 
LDTAAHTCNSR 


12474 

■ 


26375 

• 


A 


12593 


222 


379 


IiTTGS I MGNFS FLI LVYTRKVMGSVQWL 
TAV I PLL WEAEAGGS LGSRS S RPA 


12475 


26376 


A 


12595 


301 


189 


KKIQSQAQWLP P 1 1 PALLEAEAGRS LEL 
KNSRSAWAS 


12476 

< 


26377 


A 


12596 


209 


3 


SVKPLFFINYQSQWLFLFFRGRLKRSG 
TI SAHCNLHLPGSRFS CLGLPKCWDYRC 
APPTLKIDLTKD 


12477 


26378 


n 
A 


12597 


306 


3 


NGKKGFYMAPEPPFFFHFFFFSPLFPTL 
FFFFIFPLPFCFALSFLVGSSSLSPRLL 
CPMQHYFCDAWNTFDALIWGS IVDIAI 
TEVNVSTWRLSLTVRV 


12478 


26379 


A 


12598 


1 


249 


NTGAQCPLEFS IQELQLLFLGGGEDGVS 
LIjLT.RLEOJGAISAHCNIiRFPGSGDPPA 
sasrvag i tgacrdhtcepprpvppss 


12479 


26380 

♦ 


A 


12599 


228 


404 


RLSSLSPVTEFGCLSPPDLILKCDPWPG 
TVTHTCNPSTLGGRGGWITRGQEFKTSP 
VNM 


12480 

• 

• 


26381 


A 


12600 


101 


580 


LSLTKNCALLGEETMMEQEMTRLHRRVS 
EVEAVLSQKEVELKASETQRSPLEQDLA 
TYITECS SLKRSLEQARMEVSQEDDKAL 
QLLHDIREQSRKLQEIKEQEYQAQVEEM 
RLMMNQLE EDLVSARRRSDLYESELRES 
RIAAFE F1CRKATE COtnCTitjIf 


12481 


26382 


A 


12601 


464 

« 


3 


CAI QQTL YEHPMKS SRLGP TQLKI FTCE 
YCNKVFKFKHSLQAHLRIHTNEKPYKCP 
QCS YASAI KANLNVHLRKHTGB RFAGDY 
CS FTCLSKGHLKVH I ERVHKKI KQHCRF 
CKKKYSDVKNLIKH I RDAHDPQDMYCGR 
SRGSARMSRALPS 


12482 


26383 


A 


12602 


140 


1 


W IGS FLSS EEDGTWQKI SGVQRS LSLY 
VYI HTHTHTHTHTHTHTV 


12483 


26384 


a 


12603 


112 


2 


ESLRPGAVAHTHNPSTLGGRDGWINKDK ™ 
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SEQ ID 
NO: of 
nucleotide 

1 

sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 


M, 
etti 
od 

■ 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A B Alauine 
OCysteine, D=Aspartfc Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N«Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S^SerineT 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X~Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EPKMCERV 


12484 


26385 


A 


12604 


139 


1 

M 


KKRVKRGPPGGFKGLGGKFFFSPKTFFF 
FFFETESCSVAQAGVQWR 


12485 


26386 


A 


12605 


204 


460 


SGQGNGEICHPRGFCTLPKAISSLLIAR 
VCLKHFNKGVASLSQTANGCFFEGFSET 
ESRSVIQAGGQGGDPGSLQPPPPGSCLS 
h 


12486 


26387 


A 


12606 


•ill 


1 


KRPLFKNIFQKPFFKNGAPRKKGPRCIF 
GPNQKNPLFFPAPLSPP FWETKKGAOFF 
FWGFPFPPTKI IGGQKVPFPPTFPPFFF 
FFETBSLSVTSLECSGAISAH 




26388 


A 


12607 


248 


367 


TQI LSSGSVQSSIiHHPP I CFIHTHTHTH 
THKHKHPHS 


12488 


26389 

■ 


A 

- 


12608 


2 

- 


387 


QKQQQRAGRETSTGSLRI ISAPTMATFV 
ELS TKAKM P I VGLGTWKS P LGKVKEAVK 
VA I DAGYRH I D CAYVYONEHE VGEAI OS 
KIQEKAVKREDLFIVSKVQWCIWLAFNT 
IVPKSQWQSLQTPQRC 


12489 


26390 


A 


12609 


258 


23 


MWE RG I IRPQTQTLTGE PEGLDHGRS I S 
PNS VMNG3TVPHI S ILTLNINGLNAPLKR 
PGI AEWIR I YQTSMCCLQETGR 




2MQ1 


A 


1961 ft 


176 
i to 


433 


DALSSAWGTVLGTOIPSHFTEILMIiSPP 
AWGSSSLTQTLFYVPSGAKRTGSYVIAR 
VGQKYKTLWLGAVAHACNPS3 LGGRGRW 
IT 


12491 


26392 


A 


12611 


312 


410 


AHM YRTCSRGWVS WLTPG I PALWEAEAG 
GSLEP 


12492 


26393 


A 


12612 


3 


289 


VFEFLSRKLSYILRMFWTFKEWFWLERF 
WLP PTI KWSDLEDHDGLVFVKPSHLYVT 
I P YAFLLL 1 1 RRVFEKFVAS PLAKS FGI 
KETVRKVTPNT 


12493 


26394 


A 


12613 


294 


432 


FMKLFFFFFFFERRSCFVAQSGMQGGYN 
ASLQPLPFGSSNFSGLPL 


12494 


26395 


A 


12614 


138 


375 


S PNATCGHADPLPLCCLHI CQALKRFTR 
QTFFFEGKPYLGILLYAVFFFLRLESCS 
VAQAGVQQPNLSSLETPPPSA 


12495 


26396 


A 


12615 


232 


1 


GVPQRAKLRAGLCPPQAMDTFSTKSLAL 
QAQ KKLLS KMAS KAWAVL VDDTSS EVL 

^^^^ — 

DELYRATREFTRSRKEAQKML 


12496 


26397 


A . 


12616 


236 


419 


LEMRNGWVW FFP P VNPTLWETEGVRDQ P 
GQHEETLSLKI I FKISRHGGMRLGSQLL 
RRLRQ 


12497 


26398 


A 


12617 


197 


1 


LCHESEGFLSRVSALLFTLIiIjLVIiMBSP 
PSVSQPGVQWPHFSSLQPLLPRFKQFCN 
TLPSGWDYK 


12498 


26399 


A 


12618 


132 


3 


NSFMFHLFTY FEMES CSVAQAGVQWRDP 
GPLQAPPERVGGRV 


12499 


26400 


A 


12619 

* 


255 


2 


MNSLNSFTDIAVIPLHYNRILPHFKI IL 
SYFLKS KFQVKG IGRVKWLTPVI PTLWE 
AE VGRSLE PRNCPAHQPGQYGKI LLYKK 


12500 


26401 


A 


12620 


141 


2 


FHFTYIIISCVCVCVTESRFVARLECSG 
AISAHCKNHRPGSSNSPA 


12501 


26402 


A 


12621 


274 


421 


TYVMEVCCTYYFLAQVLSLAPFNYFI IR 
PDPRTDPRPDPRPDPRPEPRP 


12502 


26403 


A 


12622 


132 


2 


NQS I KKKI ELQGFQRSGGRGKGQVQWIiT 
HI I PAR WEAKVGRS P 


12503 


26404 


A 


12623 


390 


86 


IFPFFYQNTNGMAPGNQIPQGFSSRFFF 
FLRQVLALWPSLEYTGEITDQCRLKCSS 
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SEQID 
NO: of 
nucleotide 
j sequence 

't 


SEQID 
NO: of 
, peptide 
sequence 


M . 
eth 

00 


SEQID 
NO: in 

TTCCM 
U52>fN 

09/515,1 

AO 

i 


Predicted 
beginning 

mm t ■ mm IaA A 

nucieoTitJc 
location 
correspond 
ing to first 

uiiiinu uliu 

r ACirt hp nf 
I Caiuilc ui 

peptide 

cpnupnrp 


Predict- 
ed end 

ottde 

lrw*otinn 

correspon 

lflct amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alonine 
OCysteine, D^Aspartic Acid, 
R=GlutJimic Acid F=Phenvlalflnine-. 
G=Glycine, H=Histidine, l=Lsoleucine, 
K=Lvsine. L=Leucine. M=Methionine. 
N^Asparagine, P^Proline, 
O— Glutiimine. R^Arpinine. S**Serine. 
T^Threonine, V=Valine, 
W=Tryptophan, Y^Tyroslne, 
X=Unknown. *=StoD codon. ^possible 

<# F* • • ■» ■ M w ¥ MM % »^ ^^-^F ^F* V J F mmi-mnmmwmmm- mmm 

nucleotide deletion, V=possible 
nucleotide insertion 














S LS LCS S WDNRCAHPPLVN I L FYVETGS . 
HYWHIGLKLLHSRYPG 


12504 


26405 


* 


12o24 






I IRVRGRAQWLTP VI PALWEAKAGGS PG 

mm mm. AV\, W i^Vjiul^ FT AmJ XL ~ ^ « A V F Wi *™ W^fF ^F* «FT « ^F> 

VRKSR 


m. f*h m* A** AW 

12505 


26406 


A 


' j 2625 


iU3 


to 1 


KFHLVPSINTMSGSOELOWMVOPHFL 


12506 


26407 

< 


A 


12626 


318 


152 


GDKRRLVS KKKKNL WNFGR I Q WVP PVI P 
TT.T .rgf AGG R PRVRSFRPTGOOSMTPP 


12507 


26408 

i 


A ! 


12627 


48 


388 


QLGNGKVRIiLYQRHSSSFFFFWGEGFPP 
GPQGGGGG PNLGPRES PPPGVKGTP PPS 

YPGGPPTSGPKGEAPPTPPKGGGGIKGI* 
P 


12508 


26409, 


A 


12628 


116 


3 


KVATNQAQWLMPVI PALWEAKAGGS SEV 
RSSRTADAW 


12509 


26410 


A 


1 2629 


Mm*. Am, *m* 

227 


400 


VJjr VLict {.JSJLrirtv It d.c LilLtf V JolMJuV. 

KKKKKKKKKKKKKKNKKDSQKKD 


12510 


2641 1 


A 


I2630 


182 


3 


TNWLFPPPKFGPVPLFFLPPFFFFFFET 
GSHTVAQAGVCSGAILAHCNIiRLPGSSN 
TPA 


12511 

- 


26412 

* 


A 


I2631 

• 


134 


473 


ASTIMDLLFGRRKTPKELLRQNQRALNR 

QGQMDAVRI MAKDLVRTRRYVRKFVLMR 
ANIQAVSLKIQTLKSNNSMAQAMKGVTK 
A 


12512 

■ • 


26413 


A 


12632 


100 


462 


HMAPSLLEPPSLLVTQICKLSAFSGPSI 
xtts dttc TnCKKTrK KICK JO KKKDRGGAPLKK 

KK 


12513 


26414 


A 


12633 


313 


441 


VI PTLWEAEVGGS IiE 


12514 


26415 


A 


12634 


129 


400 


VSQQCWPTPPALYS I SRQALAAS PQGRP 
WDLQPTMPESPLLP PRAPAWPKPItRRAIi 
P PaPPrT«GPSTAOGLRSAGAWCX5TGRHL 
HLRPOC 


12515 


26416 


A 


12635 


308 


399 


E I RGWTQWHVPVI PALWEAEAGGLPEVG 
SS 


12516 


26417 


A 


12636 


285 


390 


I FGVLINS FIYFETESRSWRAGVQWRD 
LGSLOPP 


12517 


26418 


A 


12637 


260 


2 


GQGPFFFGGRGSPKKKPPLKGFFLVGGS 
KTQFFFPRPKKGPFGVFFFFFSQTASRS 
VTRLECSGMILAHCNLRLPGSSDSPASA 
SR 


12518 


26419 


A 


12638 


76 


3 


ALQVQWLMPVIPAIjWEAKAGGSIjE 


12519 


26420 


A 

■ 


12639 


A1 A 

219 


484 


ELSLESILCSYLKQALKVHQEVNCIJ^IF 
LGE CE EE LLALKKL KKS ERGLL YGVPMS 
LKDT 


12520 


26421 


A 


12640 


33 


462 


EGLSWGYREHNGP IHWKEFFP IADSDQQ 
SPIE I KTKEVKYDS SIjRPLS I KYDPSSA 
KIISNSGHSFNVDFDDTENKSVLRGGPL 
TGS YRLRQVHLHWG S ADDHGSQHI VNGV 
SYAAEIiHWHWNSDKYPSFVEAAHEPDG 
LAG 


12521 


26422 


A 


12641 


2 


414 

* 


SGPAAPATPMS IFPELYFNVDNGYLEGL 
VRGLKAGVLIQADYLNLEQGETLBDLKL 
HLQS TD YGNFLANEAS PLTVS V I DDRLK 
EKMAVEFRHMRNHAYQPLAS FLDFITYS 
YMIDNVILLITGTLHQRSIAELiVPK 
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SEQ ID 
NO: of 

□ ucieuiiuc 

sequence 

• 


SEQID 
NO: of 

pep LIU C 

sequence 

■ 


M 

eth 

od 


SEQID 

NO: in 
USSN 

09/515,1 

26 


Predicted 
beginning 

11 1 ■ i» Iaa t\ A 

location 
correspond 
ing to first 

a III 1 11 U aUU 

residue of 

pcuiiuc 

sequence 


Predict- 
ed end 

nunc* 

otide 

location 

correspon 

UllJg w 

last amino 

nrirl 

nuu 

residue of 
peptide 

HUUvllVv 


Amino acid sequence (A^Alanine 
C=Cysteine, D-Aspartic Acid, 
R^niiitamic Acid. F=PhenvIalanine_ 
G^GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=m litem ine. R = Arpinine. S=Scrine_ 

T^Threonine, V=Valine, 

WsTrvntODhftit. Y=TvrosintL . 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 

niirl*»rttirfft insertion 


12522 


26423 


A 


12642 


161 


463 


KEKARGRPKPLT.T.T.PITSATTAM3LTIS 
SLFSRLFGKKQMRI LMAGLDAAGKTTIL 
YKLKLGEIVTTIPTIVFNVETVEXNNIC 
PTVWDDGGQERIRPLWN 


12523 


26424 


A 


12643 


124 


2 


TVFGGQAQWLMPI I PALWEAEAGRSPEV 
TQI.TJ PAWPRTP.G 

1*77' 'IT* rM "~ i\ 1 


12524 


26425 


A 


12644 


258 


454 


NR PE LCD YQGLREDS KRKGE INQTACQ F 
LWPRQKMSSQTKFKKDKEI IAEYEAQIK. 

I? TP THT .T7T?0 


12525 

t 


26426 


A 


12645 


636 


182 

■ 


APLBRPSMIPPRCSGGQPSDLRSSPAQG 

PRGSRQRFGPQRRQTNAHTKRVRTRLSA 
AFARHS PAFIACRQVGRPRLSARRSGKK 

PT4T.TVMr , PF<?FODPTnnTTTiOT TTJ.ST.Ti 
GGDTFS DFPCS 


12526 


26427 


A 


12646 


199 


414 


IYHTSRSLLDRKLRAQRYDELPHYGGMD 
flVRVP 2V <!M VOnPHAPGP T P P VWWtMHGP 

PFHAKQNYGAHAPHPN 


12527 


26428 


A 


12647 


130 


3 


DRGFLCVHTPLKKKKNTHTHTHTHTHTH 
TNTHTHS HTHTHTK , 


12528 


n/inn 

26429 


A 


12648 


1 (V> 

192 


AA1 

443 


SPLFACWFFWGFCLFVWLFVWFEPKSGS 
VLTLECSSVITAHCSLDLPGSRDPPASA 


\Z5zy 


Z04j0 


n 


iz&*y 


oyi 


no 
1 fy 

■ 


S PQSNQEADGQARWIjTP VI LALWEAKAG 
GLPELKSSRPAWATP 


12530 


26431 


A 


12650 


263 


2 


KAPLLVYKWEPNPGGGNFSPGMGTPNTF 

n\FD T?nr , T? f PK' DT. VT? 1? T? (rem QH , <J\7Tn&fTU' 
\p vrcr vjVjCi x rutr JL>r r r r ruiuansviyHyv 

QLCNLDSLQPPPPSVKPSQYSWDHRKQS 

NTK 


12531 


26432 


A 


12651 


355 


111 


QDPCF IRLHLPLS FLS VFLECTKISLLD 

WPMT»T?PT.T/3nPPHT»AT«SQT.V9PTKTrMNY 
VVIvCuJf It JjIAjJL/Jc c fllJrtl POUVOflH X fill 1 

SFQQLMMFRDVAVDFSQEBWECLDL 




20433 






Hi 


3 


LKSAFIFNLETDHLNCFPGS ISAKSYFI 
LFYFFETEPHSVAQAGVQWRDLSTLQPP 
PPRFT 

Ai A» A>4* M 


12533 


26434 


A 


12653 


333 


2 


PPS PGVFGNP PS PGDPLGKTFFFPRGPP 
PRFWGGGFFFILAPFPFGGLFPPFPLPP 
SQGPPFFPFFFFFFFFEMESRSVSQAGV 
QWRDLSS LQP PRARFSDSPL YFLEGR 


12534 


26435 


A 


12654 


310 


444 • 


TYLSNIFYFILFLVETESPSVAQDGVRW 
CGLGSLGPPPPGFGRFS 


12535 


26436 


A 


12655 


336 


411 


DQPGQHGETPSLLKIQKLVECGGAH 


12536 


26437 


A 


12656 


114 


2 


G I FF FFFGLESRS VAQAGVQLCDLNSLQ 
SPPPGFKOP 


12537 


26438 


A 


12657 


191 


1 


KNFILTFSFRGAKNGDVFFPPPGYIQTG 
ENFFFFFFLKWSFTLVAQAGVQWCNIaSS 
LQPPPPR 


12538 


26439 


A 


12658 


1 


445 


LRTGSE FSGRDS KGLAAAE PTANFX3LLL 
ASIEDQGAGGGGYCGSRDQVRRCLRANL 
LVLLTEAAGVAGVALGLGVSEAGGALAL 
GPERLSAFFFPGELLLRLLRMI ILPLVG 
CSLIGGANSI^PGALGRLGAWALLFFLG 
TTLLASAL 


12539 


26440 


A 


12659 


2 


224 


ESTASRIAFEAWQPETLPKGLNYSGASP 
WLNAVL PLKKKKKKKKKKKKKKKKKKK 
KKKGGGVFKKKK I KGGGG 
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SEQID 
NO: of 
nucleotide 
sequence 

'i 


SEQID 
NO: of 
peptide 
sequence 


M. 
etb 
od 


SEQID 
NO: in 

09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

mk • t aIa 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 

T?—/~2 Info m i /» A T? — PhonvlnloninA 

OGIycine, H-Histidine, I=Isoleucine, 
• i\— Lijaine, L^i^eucine, iti— iTieifiiwiiine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S^Serlne, 
T=Threonlne, V=Vallne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \ B possioie 
nucleotide insertion 


12540 


26441 


A • 


12660 


96 


1 


GLKDGRGQVQWLT PVI SAP WEAKVGGS P 

JSVWo 


12541 


26442 

• 


A . 


12661 

* 


1 


457 


CWVHLQKGCHFTLGRLHSGQAEALLTSQ 
QAEALLTSQTGQPGRGAPHLPDGAAGQR 

Kti o Ir lr lUCKnAlaUAUa S f c'KKfwwUJUKJjO 

SLPRQGGRAEALLTFQTGRPGRGAPHIP 
DEAAGQRGGAG 


12542 


26443 


A 


12662 


175 


351 


KGIFFFPPSWKAGGAIWFKGTPAPRGKG 
NPPPNPSGEGGPKKKKPGPQI PRGPPLG 

nntr 
PPV 


12543 


26444 


A 


12663 


247 


401 


SSNQEPGCKGPCVLTNFYFILGRAWWLM 
PVIPAIWEAGTGGSPEVRSWRPT 


12544 


26445 


A 


12664 


152 


251 


RENSCCFIKKKKKKKKKKKKKKKJCKKKK 
KRGAP 


12545 


26446 

■ 


A 

- 


12665 

• 


2 


365 


APTRSRCLRHAS CTACRTVS TDTS SLRR 
ADPKGRSALLAD IQQGTRLRKVTQ INDR 
S APQI E SS KGTNKEGGGSANTRGASTPP 
TIjGDL FAGG F FVTjRP AS QRD VAGKEEF I 
LAPLWTVSP 


12546 


26447 


A 


12666 


179 

• 


1 


LELSKSGSHWMRHFLSRSFYSLISPQLN 
TTVWPTI ITPILLTLFLITQLKILNTNY 
HLP 


12547 


26448 


A 


■m. *± 0* ^ 0m 

12667 


279 


35 

* 


NQAPFKARFt 1 J. FKNVFIiKEI KKKRCWS 
GAPPI I PPPWGGGGGGPPWGGNLSPPGP 
P VSTP FFNGAQKKKKKKDRYGGACL 


1254S 


26449 


A 


12668 


299 


399 


ISIKKQRKGPVQWLMPVIPAtWBAkAVG 
SPEVR 


12549 


26450 

» ♦ 


A 


12669 


1 


385 


KMQNRGFFFLTFLALLGLTSGAAKKKNK 
GKKAGPGSKSPNWPWGPCPPSSKDCGGV 
FRKGTWGAQTHG I RGRGPCNWKKEFEAN 
CKSKFKNWGAGDGGPGTKVRQGTLKKAP 
SIAKGQETIRVTKPCT 


12550 


26451 


A 


12670 


357 


1 


AGKIAKI CPVSSMQAPTOGFPVGGNDNQ 
GQAPDGQFQPPijQQNQTSSPDrSNENSP 
ATPPNEQGQGDAPPQLEDEE PAFPHTDL 
AKLDDMINRPRWVVPYDPKGEIjEVLIjKK 
GIDFSKK 


12551 


26452 


A 


12671 


107 


373 


TGNYTPLEDCAQEQMRLIAQVHLLETRV 

DLGSKRQYADCSETFNDGYKLRGFYKIK 
PLLNP 


12552 


26453 


A 


12612 


196 


45 


I It I fc nOoKciUiJUAIuvi X DiUjr Kb 


12553 


26454 


A 


12673 


33 

■ 


448 


KEGAERAGAAAPVWSFLLDRRDSARTRS 
GTSLGSADMGDMKTPDFDDLLAAFDIPD 
IDANEAIHSGPEENDVPGGPGKPQPCVG . 
SESEHTASASAGDGPGVPAHASDHGLPP 
PDISVLSVIVKNTGLSRAGWRPWVDV 


12554 


26455 


A 


12674 


250 


29 


KKQTNMWQSPYEDYRIFYTISSSLYHLS 
IYPPIYLSI YLSIYLSLYLPTHLSI ITD 
YVSILEHELRPLWRIQP 


12555 


26456 


A 


12675 


377 


442 


SDRQWWCMP I VPAT WEAEAGES . 


12556 


26457 


A 


12676 


1 


553 

• 


R I F PGRRFRVKLP S C PDPAMGTRLLFWV 
AFCLLGADHTGAGVSQSPSNKVTEKGKD 
VELRCDP I SGHTALYW YRQSLGQGLEFL 
IYFQGNSAPDKSGLPSDRFSAERTGGSV 
STLTIQRTQQEDSAVYLCASS LATAWHS 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

• 


SEQID 
NO: of 
peptide 
sequence 

- 


M 

eth 

oa 

• 


SEQID 
NO: in 

TTCCM 

09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue ot 

peptide 

sequence 

» 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue or 

peptide 

sequence 


Amino acid sequence (A«AJanuie 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^.Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K.= = JLysine, JL=JLeucine, M— Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginirie, S^Serine, 
1—1 nreonine, V— valine, 
W=Tryptophan, Y=Tyrosine, 
a— Unknown, w — otop codon, /=possioie 
nucleotide deletion, V=possibIe 
nucieotioe insertion 














RLLPAHKPHPSLSLQLIiDTLNRQFSLLL 
PPHGKQVDLDLSLSFG 


12557 


26458 


A 


12677. 


232 


3 


bWis bLJL~UUKAKV 1 c*J& Kli bK.UK.bHl AKJM P 

SVHRHHQRPKVDKTTKMGKKQNRKTGNS 
KTQSAS PPPKERS SSPATCI 


12558 


26459 


A 


12678 


134 


3 


IPIVEFGGKPFSCTSISLSQWLWCLPIG 
IGELLWGQVS TGTL Y 


12559 


26460 


A 


12679 


212 


47 


NGTHPHG YTLHTHKHTHS CAHTHTHTHS 
LLLVRKSFLPQKNTTISFRSCCKLGKP 


12560 

* 


26461 


A 


12680 

i 


224 


2 


MKMASFLAFLLLNFRVCLLLLQLLMPHS 
AQFSVLGPSGPILAMVGEDADLPCHLFP 
PMSAETMELKWVSSSQCI 


12561 


26462 

i 


A 


12681 


35 


397 


KPLISQNKIHSVSSSQTGFFFFFFFFGE 
KKP P FAPRGGGKGPPP PLP PGGGNRAPP 
RGBKKG KGKP P P KNPGAFP PQKEKKTPG 
GGGGKKKTPAPPTPKEVDQPLPRREQRG 
PGRLPPRGA 


12562 

■ 


26463 

• 


A 


12682 


188 


514 


HTTHLVYVLSMAAFFFFFFEKNFLFAPR 
VEKRGKDLGSLKLPPPGFRHFSGLTLQG 
SGNNGAPPP S P VI FJjVFFEKRG FP LVGR 
EGLILPPLQPAPFCJSFRGAINGPS 


12563 


26464 


A 


12683 


366 


3 

■ 


ISQTQTQWTDWVSLELQGCEIAHRDTVA 
IPQTRSFPFPLMISCMTLFSLLQMDRAF 
PP F P PQTP IT I QGQI STS PATWPLTHLH 
STPGLSVEYSSNTHKSSLSPTSQWIRVD 
PMLASPTV 


12564 


26465 


A 


12684 


99 


1 


SS CIRFFLEQAYGQQS YGTYGQPTDVS Y 
TQACI 


12565 


26466 


A 


12685 


247 


31 


FLKE KRNQF YL F I FQRWLDPNKPIRKQL 
KRGSPYSLNFRVKFFVSDPTKIiQEEYTR 
WVYGYIFTiKNIVKTLI 


12566 


26467 


A 


12686 


416 


253 


PSPMTI PVTGAPRDADLWSSHAKMLAQP 
LKDSDVEVRFLGSSQIFLCWRPSFNL 


12567 


26468 


A 


12687 


116 


3 


CLLGRM WWLMPVI PALWEAKAGRSLEVR 
SLRPARPMY 


12568 


26469 


A 


12688 


380 


1 


SPESSKbGLWSKHVSPLLCIGVCPPPVS 
MAELRQVPGGRETPQGBLRPEWEDEVP 
RS P VAEE PGGGGSS SSEAKLS PREEEDL 
DPRIQEELEHLNQASEEINQVELQLDEA 
RTTYRRILQESASV 


12569 


26470 


A 


12689 


157 


488 


REFVSQGGSHLKAQVRLEALLLTIGTPP 
WAHLS I LHMTAMGQRENKREAAS SRIQV 
LLNHSQAI KQES 1 1 LFFEMESGVWPRLQ 
YSRMISASCTLFLSGSNNSPVSAPRI 


12570 

■ 


26471 


A 


12690 

■ 


514 


3 


PS I RAGLLCGSAENATP FLCG I TMAAGP 
LYTYPENWRAFKALIAAQYSGAQVRVLS 
APPHFHFGQTNRTPEFLRKFPAGKVPAF 
EGDDGFCVFES IAIAYYVSNEGLRGSTP 

LGIMRHNKQATENAKEECMRPIiRVSSLV 
GP 


12571 


26472 


A 


12691 


304 


4 

i 


AESLRVAHERLDTRSTSSDI FNFPQTQS 
NLEMNS E I LE S WANYQS STS YS INTELS 
LFSKVNGKFSTEFQRMKTLQVKDQAITT 
RVQVRNLVYTVKINPL 


12572 


26473 


A 


12692 


221 


2 


VGPATRDLCFADE P WGRQKQKQWWWVL 
CLS PNRGLAGRPDTLHI TCASAHMRTHT 
CMHTHRHAHAHVHTHCI 
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SEQID 
NO: of 

DUCIcOUUc 

sequence 

'i 


SEQID 
NO: of 

pcpuuc 

sequence 


etb 
uu 

■ 


SEQID 
NO; in 
I ISSN . 

09/515,1 
26 


Predicted 
beginning 

mi f»|a/\+t ft a 
[] u ci cuu (J c 

location 
correspond 

In (? tn firct 

lug IU 111 91 

amino acid 
residue of 

sequence 


Predict- 
ed end 

n it d Aw 

11 U LIC- 

otide 
location 

cnrT&snnn 

VUI I vJUUll 

ding to 
last amino 

4VIU 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine t D=Aspar tic Acid, 

-L^vjiuiuniic /_ctiif jt — r lieu j wtiaiiinCf 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L= Leu cine, M=Methionine, 

N^Acnnrfloinp PssP'rAlin** 

Q 3 Glutamine, R«Argtnme, S«Serine, 
T-»Threonine > V«Valine, 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


12573 


26474 


A 


12693 


3 


205 


YML CS I LtS TQEAFS I C LGE KFI YLFTYIj 
RARLHIiSKKQK 


12574 


26475 


A 


12694 


233 


3 


IMGPAHLFKGNHWNGGWGLMYKAFFFLF : 
FFKTGSHS VAQAGT I P VHCNLRLPGS SD 1 
S PAPAS RVAGNTNVPHHTL Y . 


12575 


, 26476 

• 


A 


12695 


408 


3 


PAREMSKFRVCVRKRPLGMREVRRGEIN 
I ITVEDKETLLVHEKKEAVDLTQYILQH 
Vr X r DBVFGEACTNQDVYMKTTHPLI ; 
I FUGGNATCFAYGQTGAGKTYTMIGTHE 
NPGLYALAAKDI FRQLEVSQPV " 


12576 


26477 


A 


12696 


123 

■ 


631 


RKAVQGGKGSGGGFGSRDSRSSGAVSAA 

VGDMGDPGSE X IESVP PAOPEASESTTD 

ENEDDIQFVSEGPSRPVLEYIDLVCX3DD 

EJX FSA i x S D X L»r P KMP KKQGDr tilir JjNV 

KKVKTDTENNEVS KNHCRLSKAKE PHFE 

YIEQPI IEEKPSLS SKKB IDNLGASDCW 
n 


12577 


26478 


A 


12697 


429 


629 


LALLYPLKWVGKPVNFMEEDILGPLPPP 
LNEEEEEAEEEEEEEEEEENPVHKIPDS 

ITPT'PT VUr'TVP 
IfDJ. 1 Lilvflo 1 J\.l 


12578 


26479 


A 


12698 


158 


404 


LPLLS VERLVE I YAYTFIQDLFSASNI P 
YSSSVFRIEMDQPRTHSGPTTASNPAPS 
STNSSSAPSATNSKQERSSSSLSKPS 


12579 


26480 


A 


12699 


185 


3 


HRRPISIPSSVIHPAMVRLGLQYSQGLV 
HLVY 


12580 


26481 


A 


12701 


181 


3 


TVWAS MFLS AALRARAAGLAAHWGTHVR 
PCI 


12581 


26482 


A 


12702 


373 


1 


ILVQERDSQRVI RM I DKLVS SRDRVGRG 
VEKQMTDHRVI YSVRKLQKRITPEPFI P 
SRTTPKLLFLLVLPSGKTPGHPVSSRTI 

PPLPI EPHETMY 


12582 


26483 


A 


12703 


272 . 


508 


TKLGKKQNRKTGNS KQQSAS PP PKERSS 
LREDIQTKGKEVENCEDNLEE 


12583 


26484 


A 


12704 


341 


1 


VPQHPHRPLAPPSIiGPQSWPLMEGSRPR 
SSLSLASSASTISSLSSLSPKKPTRAVN 
KI HAFG KRGNALRRDPNLP VH I RGWLH K 

V 








IZ/Uj 


i c*f 
lo/ 


j>OD 


GRS AMEQPQEES PE VREEEE I EEMAHAE 
GTP 


12585 

■ 


26486 

* 


A 


12707 


238 

. ■ 


596 

- 


LWLVRVKYS I MSGAALGLE I VFVFFLAL 
FLLHR YGDFKKOHRLVT IGTIJiAWYLCF 

LIVFILPI^VSTTIYNRCKHAAANSSPP 
ENSNITGLYATANPVPSQHPCFKPWSYI 
PDGIMPI 


12586 


26487 


A 


12709 


131 


2 


DRVSLSPRLECSGTIIiAQPSRLKQSSHL 
S LTGS WD YTHAAP C I 


12587 


26488 


A 


12710 


151 


2 


YRQGLILLPRLECRVMIMALCNFESPGL 
SDPLTSASRVASTTATCHHTC 


12588 


26489 


A 


12711 


151 


1 


NFPEFDHFTVGCLRAGSSLIHYWWECKL 
VQSLWKAVWRF I KDLKI DLPLY 


12589 


26490 


A 


12712 


28 


411 


RWPARPAGEPREPHVSWVMKLNPQQAP 
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NO: of 

n ucicoiiuc 

sequence 


SEQID 
NO: of 

llAIttlfl A 

uvuuuc 

sequence 


M 

eth 

nd 

■ 

- 

• 


SEQID 
NO: in 

TISSN 
26 


Predicted 
beginning 

11 u c lev UUC 

location 
correspond 
ing to first 

aillllJU allU 

restaue oi 

peptide 

sequence 

i 


Predict- 
ed end 

QUClC** 

otide 
location 

— — — J| — h ^fc, *A 

correspon 
uiiig m 
last amino 
acid 

residue of 

pcpiIUc 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

F—fZl nfn m in Arid F^Phenvlalnnlnp 

vj — Glycine, h— insiiuine, l—iso leucine, 
K=Lysine, l>Leucine, M=Methionine, 
iv^Asparagme, r^i^roune, 

i— i nreonine, v— valine, 
W=Tryptopban, y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

UUvIvULlUv UCIClIUIl, T - fJVMUJIC 

nucleotide insertion 












• 


LYGDCVVTVlilAEEDKAKDDVVVYTjVP'L. 














GSTLRHCTSTRKVSSOTLETIAPGHDCC 


- 




• 








ETMKVQLCAFKEGLPVFDVTEKYFLLPR 














MYRYYRCLYTYPSAAV 


12590 


26491 


A 


12713 


351 


675 


ENPRHT F I YS LPLGLQKHQVLTVDIG FG 














GTAI MTVGKSS KMLQHIDYRKRWI LQDG 
PKNAKQ P ERE E KRVLGL VLL RGEN 






n 








ACS hhcyrksoap PPAHLWPD CPHPGS T.P 
LLNPSVCNPGREVS PT 


12592 . 


26493 


A 


12715 

i 


200 


1 


TITKVCVFTRTCGLKGSLMAKALFDQRK 
DRESETECQVWWLTPVIPALSEAEVGGL 
LEPRSSRPGV 


12593 


26494 


A 


12716 


208 


405 


KKERENKKHTNKKGRKKMVHICQWHSSL 
EAEVGGPPEV 


12594 


26495 


A 


12717 


157 


3 


AKET PTHKGS CKKKF FF FF FETESRSVA 
QAGVQWRGLGSX.QAPPPGFTPCI 


12595 


26496 


A 


12718 


1 


472 


SPAI LPRLAI L P YLLFDWS GTGRADAHS 
LWYNFT I IHLPRHGQQWCEVQS QVDQKN 
FLSYDCGSDKVIiSMG^ILEEQLYATDAWG 
KQLEMIjREVGQRIjRIjEIjAD r£LiKDFTPS 
GPLTLQVRMS CECEADG YI RGS WQFS FD 
GRKFLLFDSNNRKWTW 


12596 

* 


26497 


A 


12719 

- 


537 


1 


LHTMNGGNESSGADRAGGPVATSVPIGW 
QRCVREGAVLYISPSGTELSSLEQTRSY 
juXjS DGTCKCGtiBCx? uNVr? KvrNFDFIjAl? 
VTPGGAGVGPASEEDMTKLCNHRRKAVA 
MATL YRSMETTCSHS S PGEGAS PQMFHT 


12597 


26498 


A 


12720 


369 


3 


AAKI I PATRKKASLELELPSS PDSTGGT 
PKATI SDTNDALQKNSNPYITPNNRYGH 

ONGa YAWHFF. AP KSOT LKCME GGSSHD 

TLQELTAHMMVTGHFI KVTNSAMKKGKP 
IVETPVTPV 


12598 


26499 


A 


12721 


230 


3 


KFrrnnjAGiiTGKMSPASAS ijNyvHISPr 
ur SKnnJr ro uGr liS o f v Jjby<JO KKKy JjJLi 
ATLRALESASIjSOHPHVPV 


12599 


26500 


A 


12722 


26 


110 


REQYAEGNMRGPAPGKKTSGLQQKNVEV 


12600 


26501 


A 


12723 


224 


1 


WQNQDQASDPPKYSFTSQCCLSFARLAR 
VQQGSAFADRPAFASFRV 


12601 


26502 


A 


12724 


120 


530 


KKVARGRSRSRERSRRRRSRPKAIT\NR 














T*OTRCTPRRWRSTVLGMRSCTRAR*QR 














RSGLSRGHTRSAJGLCDHGSVRAGSGDGA 














MTYaGGrjRRtiGTiGRDSrJJtj^fiQOSAAFfi 














SSASGS S S FFSATQPLRMLLEYFWL 


1260? 


^UJUJ 


A 


12725 


4^4 


222 


KEEKS RGEVKEQD* EKDREEEVEKSRE / 
RRRSRRRRRRSEV*YRKGKRRREREDIL 
VAERSHRSFRNS I PSTL 


12603 


26504 


A 


12726 


268 


370 


SNILIHMESMFFFFGTDIPSV*KNLPA* 
KTPGPDGFIDKLYITFRGELTTFPHILL 
H* FTEGAVLFNS FSK\AAS I TLTPKPNY 
DIMRKENYSPISSYIWNQCSFFLEQIFP 
LSFKLNCPGLIIPH 


12604 


26505 


A 


12727 


103 


636 


VCFISMKQPHGSRHPPCS/PPRPGS*VC 
QQSRPLGPQPSAFGQIA*HRRGLLGLQG 
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sequence 
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SEQ 10 
NO: of 
peptioe 
sequence 
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etb 

00 


SEQ ID 
NO: in 
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Predicted 

beginning 

nucieouue 

location 

correspond 

ing to first 

amino acid 

residue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A°Alantae 
OCysteine, D=Aspartic Add, 

T-— 1 1 1 to m i A f*tH VT^Pfionvlalaniim 
JC^vrlULUIIlIC ALIU, F~~A IJCUj lUUUIIUCf 

G=Glycine, H=Histidine, I-lsoleucine, 
K=Lysine, L=Leucine, M=Methjonine, 
N=Asparagine, P=Proline, 
Q=G luta mine, R=Arginine, S^Serine, 
i — inreonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


- 


■ 


■ 


• 


■ 




. PRSPMQPVPHGGRVSGQTLPKGPTPKRP 
ruriur Kriovyno W IiJj J\ w uVLAUr Jr ficr o 
♦ASQAVFTLQVPGKPQVWTPCPVPVRAP 
TP\ PLSNGGLGVS ERH* DGH* QAPTS P * 


12605 


26506 


A 


12728 


1193 


1638 


GLGFAMLPRLALNSWPQGDPSVPASRAA 
GTRGVHRHTQLQVSFNYYKVLAMHSGSQ 
L* SQHFRKPRRSDHLRSGVRDQSGQHGE 
TPSL/LKNTRISWAWWHTPIiVPATREAE 

NTCETPPQKK 


12606 


26507 


X 


12729 


304 


101 


RHLHWPCPSPLAPTPDISHCPEPPKSQT 
SP/CPGT*CLGPLKCPSSCHPA*K/PLH 
PSEPARKSLAPDTCP 


12607 


26508 


A 


12730 


91 


264 


SQRSISGLRVKENLIFVMIMFPPIYSSQ 
TFWSQTFLMIiKI VFMGISIS ICYIL/NT 
E KNLGQG WWLAP 1 1 PALWEAEAGGLL * L 
R VKENL I FVM IMFP P I YS SQTFWSQTFL 
MLKIVFMGISISICYILKLRKI 


12608 

» 

■ 


26509 

• 


A 


12731 


1026 


406. 

■ 


IiAHFRSQIFSFSHILVHFERMVLNYRYY 
LACVRR WFRFLHVLAYFGR I VLKYRYF 
LVHFRREVFRFRHMLiACFRRMVLYCYFL 
VLRKI LL I YVT FLTY FRKVFLRHNFLRG 
GY*RKYFGYAPRWLSFFLYRCQCFLHYF 
LFYLWLRLHLQF/ CCFWSFCLFDFLFL 
FSAACVFS VLQ IAIVMFPFHALQLFFF* 
WLCFSNSRFQ 


12609 

• 


26510 

■ 


A 

- 


12732 

• 


1508 


157 

• 


QDVGGSGFKVDTHPRGRMAS I FSKLLTG 
RNASLLFATMGTSVLTTGYLLNRQKVCA 
EVREQPRLFPPSADYPDLRKHNNCMABC 
LTPAI YAKLRNKVTPNGYTLDQ C IQTG V 
DNPGHPFI KTVGMVAGDEES YEVFADLF 
DPVIKLRHNGY*PQG*LKHTTDLDASKI 
T\QGQFDEHYVLS SRVRTGRS IRGLSLP 
PACxRAERRBVENVAI TAIxEGFKGDL \A 

gry\yk\lsemteqdqqrliddhflf\d 
kp vs p ll»t cagmardwp\d arg i \ whny 

krvilsr\fcrg\lkeverli\qergwe 
f\mwk*ap * eyilt\cpsnlgt\glra\ 
gvhvrdpqsfsqdptaflki lekprtpr 

rsegelwqividgvnylvdcekklerg 
qdikWppplpqfgkk 


12610 


26511 




12733 


261 


487 


^SETDdAltPt^^t^VPmfiMKMti 
LLKNKQFW\RGWAHACNPSTLGGQGGW 


12611 


26512 


A 


12734 


382 


668 


YKRITDIFVDSETVHILINKRQSCRIPG 
FIOLVOLI S HOLAAP RD YTVSHS VAOAG 
VQW/RNIiGSLEPI.PPGFKRFLCIiS*HAIi 
KNLSSCDTPPQY 


12612 


26513 


A 


12735 


401 


27 


GDRAEESAEPRAWSHSDNSHRYTTLFIC 
LTHTHVHN P VHS \ HTHTHTHTHTHTHTH 
TVSHRHTETPPLLLKQTGLKFY*NSRDD 
TPRSRPGSSGLQRLSSSPPVPFQPGTVB 
ASADFCGHDLLTT 


12613 


26514 


A 


12736 


202 


182 


KYLPI FINL ITMNILFFFETHSCSVGQA 
EVH*S*LKPMPPG\SSDSPASAFQVSGI 
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nucleotide 
i oca Lion 
correspond 
ing to first 
amino acid 
resiaue oi 
peptide 
sequence 


Predict- 
ed end . 
nucie- 
ouue 
location 
correspon 
ding to 
• last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A a Alanine 
OCysteine, D=Aspartic Acid, 
Hi — vyiuiam ic acio» r~rnenyiaianuie, 
Vy— wi/cinc, ii — nidiiuinC) i— jsvieucints, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i—i nreonwe, v — valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TGSPSSSWLIL* FFLKIGFTKK3 


12614 

• 


26515 


A 


12738 


319 


2 


QNTPPHIYEYIPRVDAFLLFLEMSYSVY 
WAVLS PLY FVNKLALTLHCRLALNS FL 
HKIQEPS\LGSGSG\PLSCNTI I*RLCL 
RSLHLVFRSLWILPICDSSSVF 


12615 

• 


26516 


A 


12739 

i 


1363 


2000 


DRVFVIPGWKCSGTIKVHCSLDLPGSSN 
PS TSAP * VAC \ATGTYP PCPGLNFLYF * 
MEERGFTQVA\QAGSQNSWGSKPNGTSE 
ASP!CVLX3MNK;P*NLPNPGRSXtiN*DGSI 
RYKDNYPA/WEPPKRS YKCLRQKKSI *L 
SAGPRDYEASQPRKNYP ISSLLTFCTSS 
LEFNPTELTCSSQLQKS IRRKLKSEESC 
PPLPACSLKHTQAIIKITF*RLHRTA/L 
FYLLK*NCFRN* S* *KYFLLSVSNI.KI / 
Q WVRS S LHPGAPG YTPTV* RP KHKSNVL 
I * IQS / ITRCLFYILVYMRTI fplqlfl 
SKHPSTYKEDFA*VLPLSFFFFEMESRS 
AARRSLSSLQPLPPRLKRFSRLSLLSSW 
DYRCAP PGLAS PS X LSS *GS CCSIQFI L 
RMSTISHAINVLVLKNTYLVLVSASEHS 
LIKKPC 


12616 


26517 


A 


12740 


1696 


743 


GGQIMRSGVHDQPDQHGETLSLIjKIQKS 

agpggmhl*sqllrrlrqenrl\nrgcs 
e pr/s rhct pawateqds vs kki kk* kk 
♦nhleskqqgqpaleppeqaggqlrkte 

QLQDGRRELAADMTL* PGHS KRLGALPR 
PLLSAYYFNKVHAARRQAYLETPGTFTS 

yqreegeqalilgveaqassptvfhrer 
rqsqalcshrksqeapvrpahpprrvpg 
lgkpssqglsahlgqdrapaprraswdq 
rsqapisvtfpsvldkeesvpcgpgfpg 
hapaprgihgatwegahsrgypghflal 
pqhnsdeqrpn 


12617 

* 


26518 

■ 


A 


12742 


2473 


445 


rgarrsrrrsrhrrrrhqsrpvraaprq 

PEQGRRRGAPTHGPQLIMMDLELPPPGIj 
PSQQDMDLIDILWRQDIDLGVSREVFDF 
SQRRKEYELEKQKKLEKERQEQLQKEQE 

kaffaqlqldeetceflpiqpaqhiqse 
tsgs anysqvah i p ksdal yfddcmqll 
aqtfpf\vddnevs satfqs lvp wyspg 
hies pvfia\tnqa\qspetsvaqvapv 

DLDGMQQDIEQVWE ELLS I PELQCLNI E 
\NDKLVETTMVPSPEAKLTEVDNYHF\Y 
SSI PSMEKEVGNCS \ PHFLNAFEDSFSK 
HPLHKNDPNQLTVNS LNSRM / P TVNTDF 
G*WNFILVF\MAEPSIRQQAWPSPATLS 
HSLS \ ELLNGAHGCFLDLFTL/ CKAFNQ 
NHPEGTA\EFHGF\DSGISLNTSP\SVA 

nn\ t?tjc\ CV/^TVPT TJTlT.Cn GTJT7T7T7T. 

DSAPG\SVKQNGS *NTMYSSSGDM\VQP 
LSPSQGAEHFTCMDAQCEEHTRGKDLPV 
\SPG\HRKNPISQKDKHSSPLGGLISQR 
DEL\RAKAL\HIPIPCRKKSFNLPVG\D 
FNEMMSKEQFNEAQL\ALIR\DIRREGV 
RNKS /AAAS GICR KKENWEN I VELEQDL 
DHLKDEK\EKLLK\EKGENDKSLH\LLK 
KQLSTLYL\EVFQHAYRDEDGKPYSPSE 
YSLQQTRDGNVFLVPKSKKPDVKKN 
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seq n> 

NO: of 

nucleotide 
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SEQfl) 
NO: of 
peptide 
sequence 


M. 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 

• 

- 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residne of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 

• * 

acid 

residue of 

M • _1 

peptide 
sequence 


Amino acid sequence (A-Alanine 
OCystelne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidioe, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=GIutamihe, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 

««</ rmm a % ytw fwn • 

W=Tryptophan, Y=Tyrosme, 
X=Unknown, *=Stop codon,/= possible 

ft A * 1 » ■ J* t ** % 

nucleotide deletion, \=possible 
nucleotide insertion 


12618 


26519 


A 


12743 

> 


6 


457 


RPRNRPG I RVD PRVRGRVGHAPHEGLVP 
ERMYRGS PTACETQAAA* ERAF6PSS PT 
C\RLPI PRMSTSVPQGtfTWTQRVKKDDE 
EEDPLDQLISRSGCAASHFAVQECMAQH 
QDWRQCQPQVQAFKDCMSEQQARRQEHL 
QRRQEQAGAHH. 


12619 


. 26520 


A 


12745 


297 


12 


QERPVRMLVLHFGRLRLADHLKLGVRDQ 
PGQYGETPSLLKIPKLSGM\MWAP*IC 


12620 

r 


26521 

i 


A 


12746 


98 


428 


YNTS FNE TVBLLTT P I TIDCLYTRKDAX 
APESACGVCPCSIiLGVGA* IPKWIRLD 
LSKKHVTTAYGGFMCAKCVLDRINRAPL 
IDEH\KI VPKV* KALSQSS KAFCEHET 


12621 

* 


26522 

- 


A 

- 


12747 


3 

- 


638 


LWLWSIiCWVWAVSLPLQFILGSIiHPOQG 
QAS WRE VDLLHEG S QEASS KPS SGSVP V 
. GLLLDQEMVTPLLVCICGILLHQEMVSP 
LLVYTRG I LLHQEMVTPHPGLYPWDSP P 
LDGHS PPGLF\RGAAPPS *GG/ PLPVLV 
YTRGAAPPSGDS\PPPPGLYPWDT\PPS 
GDRHS PPGLYPWDT PPSGDGHS PPALYP 
WDSPPSGDSHSPPGLFPRG 


12622 


26523 


A 


12748 


2 


333 

■ 


DMVLLCHPGWSAGS I * KKTKERERMMNL 
HR** IDRERERBRERQIQ* KE * KRINEF 
EGS KERFTQS VKQRNK/ELEKIRTLDTS 
ETIYLS I YRN * S LRRGQERRTE RSRMRH 


12623 


26524 


A 


12749 


30 


333 


KTSYLLPVQWWAQNDNERYSSSKNTIMA 
LPLPIiPVFPRSPSDAERKLDCSAAISAH 
cnlpa\ds PASACRVPAIAGARRHA* lv 
FGFFWWRRRFAVMAGLVS 


12624 


26525 


A 


12750 


231 


39 


INDLL/CLF*FKKLIWGRVW*l4MPIIPA 
RWEAKAGGSFEPRSLKLQ+AMITPMYSS 
MGGREQDPVS 


12625 


26526 


A 


12751 


49 


273 


HLQVTE VFWFWCVFFRRWGGSHCV / AQ 
AGV* WLFTGTWPCCS PELLGSRDPPAS 
AS * VAGTTGACLAANCRGF 


12626 


26527 

■ 


A 


12752 

- 


505 

■ 


897 


SYLRVQFYSAFSLFFRHEVLPOiTGWNA 
W* SQLTTASN/ CLGPSNLLKAPHLAN* 
KKI F*RQGLSMLPRLVLNSWPHMILLP * 
LPE * LGLQARATAPGSGFTFCLTQDSIL 
MCSPTVHKLSDSIALETKQ 


12627 


26528 


A 


12753 


335 


542 


CCNE FFLSQVWWLMP WPATR/ SAEAGG 
LPGPGSQRLR*ARSKPVNSHCSAGGRCG 
IDPIS IKQQKNNNR 


12628 


A * AM _n. -PL 

26529 


A 


12754 


356 


72 


WHEYYLLMEHTKKCHLS *GLYNGLNN*Q 
/ WCTH P WPAT WEAKAGESLE PRS SRL * 
CTMITPANSHCPPAWATARSCLNQSINQ 
SSNNWQGMITGGK 


12629 


26530 


A 


12755 


340 


127 


NYLFIYFRDSLTMLPRLECSN*FTGSII 
VHITLKLLGSSLP/ASASQVAGTTGTRH 

nVUJboJM XrJ\-LK. J. K-flvP j 


12630 


26531 


A 


12756 


290 


511 


KKNQPGT/ CGSRL* S QRFGRLRQADHL \ 

DQ lgqhgkapsl/ lkntkis walwrtpv 

FPSSQEAEMEELIEPTSSRLQ 


12631 


26532 


A 


12757 


254 


549 


ypgakkqrpggdsvrgthlqsrcwcvll 
hncq* pksntqtfp* pknlre*nat*ks 
tdl/wpgaaahlcnpstleg*gewit*g 
qefetslgniprpqiy 


12632 


26533 


A 


12758 


617 


451 


NKKRRGLTLLSRIiKCNGVIIAHCSLKYP 

gssepptsaf*vagity\gt*hhaway 
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SEQID 
NO: of 
nucieouae 
sequence 
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SEQID 
NO: of 
pepuae 
sequence 

> 


M 
eth 

00 


SEQID 
NO: id 

09/515,1 
26 


Predicted 
beginning 

location 
correspond, 
ing to first 

uiniDU a LIU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

*\ Inn tr\ 

aing io 
last amino 
acid 

resioue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
x^Vjiuiauiic /\ciq» r — Jrficnyiaianine, 
G=Glycine, H=Histidine, I=Isoleurine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
\£ — v^iuutniinc- Jv^Argmine, o^ocnne- 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
A-un Known, — oiop coo on, ^possinie 
nucleotide deletion, \=possible 
nucleotide insertion 




i 


ft 
/I 


\LiDy 






VTJVTlTCr'T . VT T.VO T 17 COVT.T.C T WP V\7X. 

AQTLGKLS KTVMCPALEETVAVNGLQ I K 
SKK* /WLGMVTHACDPSTLGG*GGWIAW 
GREFGTSLASMAKPCLC 


lzoJ4 


20jjj 




12/MJ 


10 


221 


yortiN i Aii^cJoUxinr yjvjJrwyo vi»c» x xjnxi w 
TIFLCVCVCVCVCVRHVYHM/CYIYSDF 
ICPSIECEYFSFAHR 


12635 


26536 


A 


12761 


309 


55 


IIRYKNYISIKLEGWKRLYHATTHEKKV. 
GVAVL I S E KVDFRAKN/MYQDKEGRFLL 
*R 


12636 


26537 


X 


12762 


640 


896 


T I VLKCNFVLFCFRDE VPLCCPGWSAVA 
/ QSQTQLW*TTQPGLKQSSCLSLLSSWN 
* RHVP PHIjAMFIiK 1 1 FIxjL YTCGHLBTN 
IC 


12637 

* 


26538 

• 


A 


12763 


1 

. 


357 


IVPLHSSLGDRVKPHLLKKKKTCSTSLV 
I RETQ I KITRYTYSNG*S* RKNKAGNNK 
CWHG YTATGTL I AD / CKSKQMLWKTF * Q 
FHIKLNIHY*YNSEIQFLGLYPRDKNIA 


12638 


26539 


A 


12764 


129 


320 


RWKCVKLKNTGQAWWLI PVI PALWKNQ \ 
AEGSLEARSLRLQ*TLIAPLGYSLPG*W 
5Gxxj5xjE«Jv 


12639 


26540 


A 


12765 


617 

• 


435 


GVKGLILPKLQKLLKSIE/I/E*KLPKS 
FYDASLTMIPKLHKD\RPISHVKIEARI 
LHKISINII 


12640 


26541 


A 


12766 


243 


5 


IGIPERGEKVYNTENIFEGI IQ/ESVPN 
L/AYLVREVDNQIQE IQRTPVRYYTK 


12641 


26542 


A 


12767 


101 


16 


RTQL F VS VFWFCFLFC FETGS CS ITQVG 
VQWCHLRS1^QLPPPP\GSRDSPPPSSQN 


12642 


26543 


A 


12768 


362 


92 


RVPHMDLGEECTS P YMYI * PMTCVFKHG 
•IAjWrCVl irAllXI X 1 XJ./ iJjoVlVoI 
*I*CYLCVCVYIYIYIYIYIYIYFIYKY 
MKERRL 


12643 


26544 . 


A 


12769 


1161 


824 


QAKDLS THFTKENM * MG/NKQMKRLSES 

DKICP*RCX3*MRA/LLQC*WKYKIVSPL 
WKTWPFLKT\lNMNISCm»MILFLGFY 
PIY ! 


12644 


26545 


A 


12770 


241 


237 


K*RIWA\GVMAHACKPSTLAGCDGWIA* 

oyCif IS XINXioiM XVaxCf v» XJ X Bis 


12645 


26546 


A 


12771 


326 


6 


DTVSRKNKSGKI FQLSSRV* I YERSQSG 
VKVYKCKTFGKAFTQ/HF*AHMRMYTOE 
KPYKY*ECGKFFILVLLT.TiT,MIQKYFHL 
I KTVRLYli I RKKVS CKQPSNKI LQS 


12646 


26547 


A 


12772 


413 


1 


KKTFYFFFFETTQAGV* *CDHNSPWP*P 
P\GSDNPLTSAS *MA\GLQSCMHHHAQL 

GITAVSHHTRPILYILIVLSYAISFFSI 
LYSAYFLQPHLPIHNNFLFSYISSSC 


12647 


26548 


A 


12773 


2 


1055 


FFFFLRLNLTLLPRLE/ CVILAHCNIiHF 
PG* SNS PASASRVAE I TGSRHQARI IFV 
FLVEMGFHHVSQAGLE / S PDLK* SAHLG 
LPKC*DYRRDITPGHH* SF* * RYFLKTY 
I SRVL/ WCSLLGTFMFMIAFDHM* TW*H 
♦QNHFHFLLQIWKLISERLRSAQGQQAL 
KTVTSTFS S KYVCRAFIAEPGW IAI L YV 
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NO: of 
nucleotide 
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'i 

• 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 

♦ 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

■ 


Predict* 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lso!eurine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine t P=Proline, 
Q»Glutamine, R-Arginine, S°Serine, 
T=Threonme, V=Valine, 
W=*Tryptophan, Y=Tyroslne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


- 


■ 

■ 


■ 


• 




• • 


, TF* SLDSRRTCLWAVSNFFYSTLFLKL 
RQQVLLTQ/PYHLFSARPSSCGSDKKAR 
KLNItS LS L\CFF FFGKRVSGTQAGVQ * H 
DHS S LQLWFSRL KQCSHLSLLS SWDYRS i 
ITPHLANF/CNFLSGWSRTPGLK 


12648 


26549 

i 


A 


12774 


315 


98 


FRCFLNS INPLES PHVRKTCAGRAQVfLA 
PVIPPFWEABA\SQMPQVIHSR+LBARS 
SRPAWPTWQKPCLKYKN 


12649 


26550 


A 


12775 


275 


59 


NPSPYKIQKISWAWWYMPWLATRXRLQ 
*AEIAPLHSSVGDRARPCLPKKQKTNKQ 
TKTSGEPVLVQGPATVC 


12650 


26551 , 

* 


A 


12776 


73 


381 


SLLNSELSSIPRPTWESPSTSDSAWPMQ 
NLASEPLPLP*TTPPPDFPALSA/PB/P 
LPFTPSPRDTLPPPLSPSYKQAPVLSVH 
PPVMPLKYVPSSPLIiLSFLYL 


12651 


26552 


A 


12777 


274 


342 


LNCVIA* WLMPI ISALWEAEVGVSFA/P 
/RSLRPAWATW 


12652 


26553 

* 


A 


12778 


2 


478 


TIYYTKYTTFRVPNLGLQLLTAFFCVCV 
CVS LNG F KNAKD YGGSHLKYS VYI TGFL 
LQFSLKFDCFLCV CVCVFP * MA* KMP K I 
M/RGSHLKYSVYMT*FLF* FSIiKLKYFC 
DPILRKRI /WPGAMAYTCNPSTLGG*GG 
QiT*AQBFETSLAKWKPCLY 


12653 


26554 


A 


12779 


680 


453 


SETDWRKNKFFPSNFPSNLRTNSWTNSC 
SKKTFKKH/RVGLGVLAHTCNPSTLGGR 
GGWSP*GQEFENSLTNMVNHFS \ 


12654 

• • 


26555 

■ 


A 

- 


12780 

■ 


59 

■ 


487 


SLHKHPERGLQSFQGCWTQEMLGGSHVQ 
QRAWE L CAPPNLTC P GYLS FFLRQDQAL 
LSKLE CS GTI PAHRNPSLPS PQLKVSSH 
LSFPSSWNSRCTTPHQLLLLFF*DRIRL j 
FCPSWSAVAQSQPTVT/SSLPSPQLKVS j 
SHLSFPSSWNSRCTTPHQLLLIjFFNFL* 
RQVFTMLPRLVSNS * VQAILPPWPPKVL 
RLQA 


12655 


26556 


A 


12781 


109 


361 


LSPWHHSRNYTLIYPYTLNFWMSAFPTP 
KEEFFFSFPETRPHS VAQAGAHW\AI IA 
HCGLDLLGS \ SDPPTSAS * AAGTTAI I P 
G 


12656 


26557 


A 


12782 


3 


204 


LIDGS LALSC * L FCGGAI I AHCSLELLG 
IjGQ \ S PVSATRVAETTGVCHHAQLAFVN 
LHLRTRS KHCGH 


12657 


26558 


A 


12783 


2357 


6366 

- 


LTGS\NSHITILTLNI/NMGLNAPI*RH 
RL\ANWI KSQDPS VCCIQETHLT\CRDT 
HRLKIKGWRKIYPSPMGKQKKKAGVAI\ 
LVS DKTD FNPTKI KRDKEGHYIMVKGS I 
QQEEIiTIIiNIYAP\NTGAPRFIKQVLSD 
LQRDLDSHTLI \MGD\FNTPLSTLDRST 
RQKVNKDTQELNSALHQADL\IDIYRTL 
HP\KSTEYTFF\SAPHHTYSQNWTTIVG 
S KALLS KCKRTE 1 1 T \NYLSD\HS A\ I K 
LKLR I KNLTQNHS TT\ WKLNNT J >T rNDY\ 
WVHNEMKAEIKMFFETNE/NKKTPTYQN 
FWDAFKAVCRGKFI \ALNAHKRKQE\RS 
KIDTLTSQLKKTREAKSKQHS KASRRQE 
ITKI RA\ ELKE I E TQKKTLQP \ LKKIS E 
SRSWFF\ERINKIS\RPLARLIKKKREK 
NQIDT I KNDKGD I \ TTDLTB I QTTIRE Y 
YKHLYA\NKLENLEGMDKFLDTYVSLPR 
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eth 
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ing to first 

amino acid 

residue of 

peptide 
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Predict- 
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nucle- ' 
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location 
correspon 
ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
fc/=v»iutamic Acid, Jf— rnenyiaianine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, LHLeu cine, M=Meth ion ine, 
N=*Asparagine, P»Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
. X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


■ 

■ 

> * 

> H 
• 


i 


• 


i 

• 


* 

• 


• 

• 


LNQEEVT£SLNRP\ITG\SAIVAIINS\L 

PTKKSP \GPDGF\TAEF\ YQRYKEELVP 

\FLLKLFQLIBKEGILPKSPYEASIILI 

PKPGRDTIKKENFRPISLMNIDAKILNK 

ILAKRIQQHIKKLIHHDQVGFIPG\MQG 

WFNIRKSINVIQHINRAKDKNHMIISID 

AEKGFDKIQQPFMLKTINKIGIDGT/YY 

FKI IRRAI YDKPTANIILNGQKLEAFPL. 

KTGTRTGMPSLTSPLLFNIVFCPIVFWA 

R\AIRQEKEIKGIQLGKEEVKLSLFADD 

MIVDLENPIVSAQNLLKLI SNFSKVSGY 

KINAQKSQAFIjYTNNRQT/ EEAKS* LMS 

ELMSELPFTIASKRIKYLGIQLTRDVKD 

LFKENYKL\PLIKEIKED\TNKWKNIFI 

PCLWVGRIS\ IMKMAIL\PKVI YRFNAI 

PIKLPMTFFTELEK\TTLKFIWNQKRS\ 

RIAKSILSQKNRAGGITLSDFKLYYKAT 

VTKTAWHWYQNSMVLVPKQRYIDQWNRT 

EPSEIIPHIYTILI\F\DKPLLEKNKQW j 

GK\DSL\ FIKW\CW\ENWLA\ I CRKLNL 

\DPF\LTPYTKINSRWIKKKDLNVRPKT 

IKTLEENLGITIQDi'GVGKDFMSKTPKA 

\MATKAKIDKWDLI KLKSFC/TI CTAKE 

TTIRVNRQPTKWEKIFATYSSDKGLISR 

IYNELKQIYKKKKKTTPSKKWTK\DMNR 

HFSKE\DIYAA\KKHMKK\CSS\SLAIR 

EMQ\ I KTT\MRYHLTP VRMAI IKK\SGN 

NRCWRGCGE IGTLLHCWWDCKLVQPLWK 

S LWRFLRDLELE I P VDPVI PLLG I YPED 

YESCCYKDTCTRMFIAALFTIAKTWNQP 

KCPTMIDWIKKMWHIYTMEYYAAIKKDE 

FMS FVGTWMKLET 1 1 LSKLS QEQKTKPR 

IFSLIGGN 


12658 

i 


26559 

• 


A 


12784 


787 

• 


926 


PQAIRRPRPPKALG*HNSVDLGWAWWFT 
PAIPTLWDYTHEPLYLAKIS*CFKKVYK 
FVLNC I QNCPGPHAARHS/ SGKTS * V* S 
NYLFFSFCFETKFCSVTQARIiQWHDLNS 
LQPPPPGFKRFSCLSLPGGWDYRRLHTR 
S ANFC I FSRNRVS PS WPGWSPTPDLiRRS 
AVLGL PKR WD YRRE P P CPAKI YT I MAPQ 
KVNSHSSQPVSFLS 


12659 


26560 


A 


12785 


248 


285 


C * VI F * CVMIL YVli I YVEKI TS X KIAVY 
W/CI ITRLFVK* ICSCLLGC* FLFFNV 
C*LYI FI I FVFFFFFFFFFFFFFFFELN 

TT V 


12660 


26561 


A 


12786 


243 


37 


RRSAGHGGSCL * SQ Y FGRPTRADHKVRS 
N/RGETI*SLVKIQKVSQAWW*EPWPAT 
RQAEAGADAWVDR 






TV. 




/J 


Z07 


n VP WR P R PT7 l^r i YTFPF MnfiHS VAP ZifTRH 
u i i\_rix-j It xvir r rui i r r ciriuono vnmwon 

WCDliGSL*PST\*GSSDSSASTSR\QRH 
HAWLI FGFL* RRGFAC 


12662 

V 


26563 


A 


12788 


83 


256 


RKMYVVGLFVTAQN* K* PKYPSTEE/YI | 
K/RLW YLHTMEYYS AI KIEKLGTRAQNR 
LILY 


12663 


26564 


A 


12789 


197 


499 


QSLKSVEETVFRNNKQ/SIPTFQVILMP 
WWLIP 1 1 * ATWEG* GRRSPLNPGV*DQP 
GQHSKTPSLQKKTKFS * AWWRVP WPA\ 
IGEAEVSGI PLSPERSRL 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIantne 
C=Cysteine, D=Aspartic Acid, 
iLr— Glutamic aciq, r— rnenyiaianlne, 
G=G)ycine, H=Histidine, I=Isofeucine, 
K=Lysine, L=Leudne, M«Methtonine, 
N=Asparagine, P=Proline, 
v=vviutamme t K— Arginine, y^oertne, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cod on, /^possible 
nucleotide deletion, \»possible 
nucleotide insertion 


12664 


26565 


A 


12790 


394 


?1 


FYNSTKFKWKNIQFIilTRCRGDQIOVIR 
AHAVQLGFHVLyF/CLSQNLSIYLYISI 
YVYTYR Y IHTHMCIHI I * YLYI YTYTv I 
KH* KEHWW X KHCFPTERK 


12665 

i 


r% S r* S 

26566 

i 

i 


A 


. 12791 


735 

■ 


75 

■ 


FFFRQEQFLSPPKRGGVPPPHTKKYFSP 
RGSFLWGWGTSRPPPRKCFSPPPPPVFL 
PPPKKKKIISFSPTKLAPPPEYFXSPPP 
PS \ PPP PSSPSPSTFFFFKNFPFS PSS F 
SSPPFFFPSPAWEDPPVQSRRIY*FLPP 
PF* J PE I PR\RVLKKKPGEGRS WGLFPP 
PKGPKRRVPLGPEFTCTRLGHKTKPRPPK 
KKKKKKRFLISCLWWHMPAIPSTLGG 


12666 


26567 


A 


12792 


482 


342 


MGVFVFFFE TGSHS S VTWAGVQ* CNLSL 
LQP/PASSDPPTSDSRVAGST 


12667 


26568 


A 


12793 


405 


169 


FYKKKKTGSGLALS PRLECSGTI IAHCN 
LELLGSRSLRL\SNTPTLAS *AARTVSV 
CHQTHL IF* FFVETG S CYVAS A 


12668 


26569 


A 


12794 


270 


94 


YVCYSYKQIKICSHVKRRE*NTFKIYLL 
KYIALGIjTGHFFPL * TFSKTVTIjWPGAV 

ahacnlsplegqgrwii/r/gqeiktil 
anm * ipvflhvni flsvcsys i httldy 

LFYYSCYILISL 


12669 


26570 


A 


12795 


284 


415 


NFIKI I KKKVMGLF YGKTFLLNLKKKKR 
KKK* WPGTVPHA* NPST\LGGLGGWITR 
GQEFETSTLATN*NPTKSISRGF 


12670 

• 

■ 


26571 

• 

• 


A 


12796 

• 

- 


1387 

• 


32 

• 


APSSFAIRSFFSGPiyilSBVFFSSMVWKRPW 
PNLEWS MNLRS I FS RARRFVCTS KDLR 
RVSTRFLVmUTAFLG YKAGMTH I VREVD 
RPGSKVNKKEWEAVTIVETPPMVVVGI 
VGYVETPRGI^TFKTVFAEHISDECKRR 
FYKNWHKS KKKAFTKYCKKWQDEDGKKQ 
LEKDFSSMKKYCXJVIRVIAHTQMRIjIiPL 
RQKKAHLMGDQ VERGAPVPE KAD\ WAPR 
EALSS KVLVTQVFWAGI K*SNFIGGDPR 
AKGYKGGQPVCWAHPRKLPPQRPHPRAL 
RKGGPVLGAW \HP \ARVA\FS VARRWGR 
KGLPFHRTEINKKIYKIGQGYLIKDGKL 
I KNNASTD YDLSDKS INPLGGFVHYGBV 
TNDFVMLKGCWGTKKRVLTLRKSIiLVQ 
TKRRALEKIDLKFIDTTSKFGHGRFQTM 
E EKKAFMGPLKKDR I AKEEGA 


12671 


26572 


A 

■ 


12797 


86 


364 


EQDNRIFFSFLSLFFFFGETEFGPFAQG 
GGQGAILGPLKPPPRGL\SSFPT*GSQE 
VGTTGAPHD I FCF FNKKGET PRLYKKNK 
NNRGGGATP 


12672 


26573 


A 


12798 


101 


440 


HCSRYATGIFEWYSGILVLILLRFGIYE 
PMNLNIGDPYDSPPSRLYKMRLWERMAK 
S L I EVS LKNSHFWLGMLAHACNPRTLRG 

JdCLU T a * TUm?\ Tf T Q T y^XT T AD DWT .VT .tTTf 

I 


12673 


26574 


A 


12799 


364 


195 


NPPGDIiGEETLLWGGEIMGTTPP*FPPP 
KEGFLPQDPPGGFNSPPI/QGRSFSFPP 
PGKFGPPQGFFKRPPPFFFFLNNNQ 


12674 


26575 


A 


12800 


86 


489 

■r 


PTAMAEEG IAAGGVMD I NAALQELLKTA 
LIHDGLAHGIRKATKALDKRQAHLCVXjA 
SNGDETVYVKLVEALCAEHQINLIKVDD 
/NKKLGEWVGLCQIDREGKPRKWGCSC 
IWKDYGKESQAKHVIRENF*CKK 



1308 



I 

WO 01/64835 PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 

■ 

• 


SEQID 
NO: of 
peptide 
sequence 

■ 

• 


M 
eth 

00 

• 


SEQID 
NO: in 

09/515,1 


Predicted 
beginning 
nuciconoe 
location 

r/ti*i*oenAnH 

currcspuna . 
ing to first 
amino acid 
residue of 

pcpuue 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
a cio 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
0=Cysteine, D=Aspartlc Acid, 

Jc^— \riu ia in ic /\ciU| J 1 — rneny laianine, 

G=Glycine, H-Histidine, I=Iso leucine, 

I\. — Ajjrol IICj l_i — JLnsUCIIlV, Irl — irj.t2lllIUUIIlc, 

N»Asparagine, P=Proline, 
' Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
w— i rypiopnsn, i— lyrosine, ■ 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucicoiiuc insertion 


12675 


26576 


A 


12801 


64 


2ll 


GNHKKSNYSLAPWW YTTWPAT * EAEAG 


12676 

* 


26577 


A 


12802 


300 


3 


NNVTTGSNIQFHSFPLIRKDICVSYIKN 
n zlunv t ttntq ntpmht? Town * vc NT , 

KIWLGKVA\NPSTLGGHGGRIA*A*KFK 
TSMGNIVTPCLINKQK 


12677 


26578 


A 


12803 


208 


394 


SQHFERPRREDCLS \QEGQ YSE I PSVQK 
RRSRLE 


12678 


26579 


A 


12804 


183 


348 


AFILGKGKYEPYKPCLGWA* WCMPWLA 
SWEAEAGG\SLEPRSSEL*CAMLIGCLH 


12679 


26580 


A 


t#\OAr 

1 2805 


397 


238 


ETEr PFCCPGWKWGDLiGSQQPIiPPGFKR 
FSCFSLPSK\WGQ* SLAPVGKGCK 


12680 


26581 


A 


12806 


l 


293 


PTRPRTS YEKQGYLLLPPVFS IVLEVLA 
RAI SQETE I KS VQMGKEIEVKLS L \FVCv 
YIYA*ENPVESTKTVDANLFDTRDWFHA 
I FPQTVG EVGMVLG 


12681 


26582 


A 


12807 


297 


281 


AKNPRRQ PRE I PGG VFPTPG FP P YFKAK 
M PEGP FPGGFFFKGGVLWGTP PPFFFFF 
F FSETES RFVTR\ LQCSGAI S AHCKLRL 
PGSCHSPETC**R*G k( 


12682 


26583 


A 


12808 


255 


59 


AGCSGSQLFGRPRGVDHLS PGV* DQSVQ 
HGEILSTKNTKISWVWWSVPWG\EVGR 
SIjEPGR*RuE 


12683 

» 


26584 


A 


12809 


247 


310 


PIiHS ILGTPTTPAPTPGKPLLIRTSILY 
* KP I LS QAWWCAPIj VPATQ / EAEARG SL 
EPRDSRRS*AMIKPVNSHCTPSWALPRP 
LPQPI*ESHC 


12684 


26585 


A 


12810 


I0l 


477 


IGKEEIKLLLFPDNLMEYTVNF*MYKPL 
EIjIS * FS KVTBYQVNTEK/SNCI YT*QL 
QIENEIAKTI*FMIASKSIKYLKISLTK 
CN\KWRDILCICIGRI»SIIKVLVLPKLM 
CGGEKI FNPNPTGFW 


12685 


26586 


A 


12811 


1022 


696 


CFFVLFFKEMGSHYVAQAGVRWCDHGSL 
YPPTP\GSSDPPTSAS*VAGTIGAHHHT 
WLI FKFFVE TRS CFL I QAGLKLLASS YP 


12686 

i 


26587 


A 


12812 


166 


143 


RKEERFQXNNLGFHVEKLEKEEQI\NPK 
TNRKKKI IKTRV* INEIEYRKTIGK\lN 
KNKSWFF*KINKIDKPLAMLDSGRRDSN 

V*MTTVC TT?T?r , DDD VPT77\r»VO VT?Trr»"DlTD 

RRRWRQRWR 


12687 


26588 


A 


12813 


163 


423 


KAFWE F INHKKTGP F * RGP IG KVS P PGG 
NLAoo 1 * Dec Jr t? F LiKl^FUOboGOrl? 
PKKP P TGFNLAFGGIiRGRETKRGPFFLA 
GLLP 


12688 


26589 


A- 


12814 


2521 


2842 


INTMYFPP* EMLVGWAWWF/RASNPQHF 

NTKISWAWWRAPVI PATWEAEAGESLEP 
GREGCSEPRSCHRTPVWVTRMRIiY 


12689 


26590 


A 


12815 


173 


395 


LDQPGQQRETLS llkkkqtnkqtkkkng 
FKKFL * LDTWVHTY\NPTPLGGRGGGFF 
* GQDFETTLANMGKRGFY 


12690 


26591 


A 


128I6 


43 


346 


CVYVS /GVCVCVCVCVCLASVCMYVCLH 
L* VLFG I F YV* DHNI CKLRSFCFLFKNT 
FSHVYF * KI I WPGVEL * VTVFLFliS SLL 
HTSCVYEKSYAILIFSAL 


12691 


26592 


A 


12817 


<}47 


!>3l 


SQHFGRPRRADHLRSGVQDQPGHHGETP 
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SEQID 
NO: of 

II U L ICU 11 uc 

sequence 


SEQID 
NO: of 

UCU LIU V 

sequence 


M 

eth 

od 

t 

- 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 
beginning 

nu denticle 

location 
correspond 
int? to first 
amino acid 
residue of . 
peptide 
sequence 


Predict- 
ed end 

nu cie- 
otide 
location 
corresoon 
ding to 
last amino 
acid 

residue of 

nentide 

UCUIIUV 

sequence 


Amino acid sequence (A=AIanlne 
C=Cysteine, D=Aspartic Add, 
Er^Glutamic Acid. F=PhenvIalAnine~ 

Mm* \j»mm>tllV> -m, M. 11 fall T IdMIU IUm 

G=Glycine, H=Histidine, l=lsoIeudne, 
K=Lysine, L=Leucine, M^Methlonine, 
N^Asoarasine. P == Proline« 
Q=Glutamine, R=Arginine, S=Serine, 
T=ThreonIne, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, * : =Stop codon, /^possible 

nucleotide* deletion. \— noqoihle 

nucleotide insertion 


■ 


■ - 






• 




. PLLKI \QKLAGRGGA*L*SQLLORLKQE 
NRLNPG I RS CS E PGS HHCTPAWATE * DS 
VLKKKIGADKY 


12692 


26593 

• 


A 


12819 

» 


392 


2 


PLFFFPFFFLETESHSVTQAOVHNPPGP 
SKP PTSAP * AAGTTGMHY S AHAQ/LI F F 
L* TBL PS CRPGCSAMAVHRHDPFTDQHQ 
SAS Fli I WASS PIiLROTODS S LLGC3HHVT) 
AKFSVDTQLALCTKAQNS 


12693 


26594 


A 


12820 


1183 


870 


DRVSLLTPRLGVAVARNLAQWKPLPPGF 
KRF£CLSLPSSWDYKHTPPTPG*EH/RG 

PTLA/FPKCW\DYR\HDHLAWPRKM 


12694 


26595 


A 

- 


12821 


172 


298 


IYLSIYLPI/YLCIHLSIHPSIYPYLYL 
SINPSYWVSFSREL*LT*L*YIiSIYIiPI 

XX VOX IXlOXfllrOXnXOx X JrUXnirX\7l:iJf 

LENFD 


12695 


26596 


A 


12822 


51 


254 


YDFLKKHLLSSYRAKDLESLFliLHLHI F 
SGN*SVSYICVCVCVCVCICVCVYICIL 
IKSWCS*S/CGVL*AICAGVGWC*LIFG 


12696 


26597 

* 


A 


12823 


199 


668 

» 


QPLPAPSPSAQKGRAHPTPS PARPQAE / 
SLPSQSLGTRTRPPLGRPP * SPPWGQVD 
LWGWLGPSPTLPAPESLSPASLSTEWL 


12697 

■ 


26598 

« 


A 


12824 


33 


442 


GRGKLLKKSQKQRKNALLKYDS KTRVGT 
IiFHWTSVPVTRPLLL*INIRLIKUDVIK 
CSIHKIFHSEFLFFKQHSGL/CI*LSK* 

T .CQ * Tf T WO /5TV A WVPNP Q A T /3fiTH*»P 

RIT*GQEFETN*VNMVGPCLYPPLPP 


12698 


26599 


A 


12826 


363 


75 


HWEPRPAGPTLPSAPGSPSGRPCPPHTT 
SG/PPPPNTHTPMSRSPWSWEGSDQRPQ 

RPPPTTLVRPLHSH 


12699 


26600 


A 


12827 


76. 


486 


CTVNLFIYFEMESCSWQTGVQWCDLSS 

GP*MMGAINSFLCCSYDSK*VS/ARSDL 
FFFFFFFF 


12700 


26601 


A 


12828 


345 


672 


KMKCTADIiSLLBKDT*IKLEENR*QERI 
MLRAEVNEIENKIENINKTGGSSSEKYY. 
KIDNPLTRSIRRKVN\KITSIRNEKGSV 
TTNPTE 1 3CRTVKE Y I MKNFMP TTJT.TT 


12701 


26602 


A 


12829 


208 


1 


PSNSTP I FKRNENI CPHKDPCTLMLMAA 
LFVMS KK* KQLKCS SNDE\KLWYIHMME 
♦YIAIKRNKLLI 


12702 


26603 


A 

« 


12830 


649 


172 


FPUCYSLSTTS VMAPS FTS CGGSQQVLL 
APLSRLLAWPNV/HPPPQPRHPAPHSFL 
SPPRSPPNYSP*SSQGPLLQPQSKHVRP 

PPA/VCPPLPPHSPLRHERAPRLFLPLP 
ATEPAAIjHAVHPGRRQERVT 


12703 

• 


26604 


A 


12831 


1068 


667 


KLSWFFCSRLRLHFGSIFKVPSL/PVPK 
LLLPIPSLCPSLPLQIjQPAQPVPFNTAT 
WLCFC* / P * RV* PAPGYQRVGHFNFFPL 
QLNFIPIILQ*HLGF*KSASLSSSSSPK 
KVSLCHPGCSAVVRSAHCNLKLLIS*SA 
C^LPKC*DxTUJQSPRPP/PTIFKVTKM 
CLLKQLTGSVSF 


12704 26605 


A 


12832 


268 1 


KCFFLIALNWPEFILDYFILNW*NSLRT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K«Lysine, L^Leucine, M=Methioninc, 
N=Asparagine, P=ProIine, 
Q-Glutamine, R=Argin!ne, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, 
X=Un known, *=Stop codon,/=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














VTEKFLFMTLFLTY*NMNRPGMVAHACN 
PS/TIiGGSGGRIT*AQES*TSIiGNIVIP 
CLYQKL 


12705 


26606 


A 


12833 


654 


321 


LVG I RFLKVKYFN I KFPHLLAI IS PS LE 
VFNLQHFYI YFLLFIYF * ETRS CSVA\R 
LECSGVIIVHCSLKLLGSSNPPAIASQV 
AGTITAHHCAQQHFCIIiEKSNLCTYFPL 


12706 

• 


26607 


A 


12834 

i 


867 


201 


RQMS PTLRTKSRDVASRSSFSAWAWGKL 
FTFWDFSMSLVSLQGPGGPFHSPQP*TQ 
GR*VENNQEPLATPFPGPERSLPGQPRL 
EPAHSGKRRGLPLEIRPPVNVSDIHGDP 
SGAQWLVRGVEAG PWAVGGQAP * SEGPS 
PSPPLDVSPDPGSSLPRSSSP*MDLRA* 
LS LTF * TFQKGSGN/DPAS S PA\GKPG * 
EWRKIKGPPQGLLLGPEMWGSQARVLS 


12707 

- 


26608 

• 


A 


12835 


328 


1365 

■ 

* 


YAFCLMMLDKXQI \*AVFLFEFKMVHKA 
AKTTQN ISNTFG PGTANKHTVQWWLKKF 
CKEES LEDEERDMGHSGKVDNDP I * E P \ Y 
*TT* KITEELSVDYSTVIRWQHSKQIG 
KVKKLDKWVPHELSPNQNYRFEV*SSFM 
LRNNNPFLNRI VTC^EK* ILYN/RPAQW 
LGPRGQLQSQFPKPNIAPKKKRSLVHW/ 
SPVGLLAPILDPTYSFLNP/GGETITSE 
KLCSG KLGKMHG KLQ YLL P AL VNRKG P I 
\liHDNT/RDCVFAQPVLQKLKELG\YK 
VLPHPPYSPDLSPSDYHFC+HIiDNFLQG 
KH/SQHGAENAFQEFVKS * STDFYATGI 
NK/LFSHWQKCVDCNGSCFD 


12708 


26609 


A 


12836 


7 

* 


328 


RRERERERERERERERERERERERERER 
ERERERERERERERERERERERERERER 
ERERERERERERERERERERARV*EKST 
RYKSPPPRIPPLER/RRV*RKHPHARAL 
SFPLYKKCTGGERAPSPVCVGGKFTW 


12709 


26610 


A 


12837 


219 


350 


P FNHT YATWS 1 1 1 SNVQVCFMRAQD I YL 
LIYLFEAGSCSVA\RLECSDMIMLHCSL 
DLPGSNNPPISAT*I 


12710 


26611 


A 


12838 


189 


359 


LGUX3VMRVGPCRALGPS*EEKSSRVQW 
LTPVI PA/ LLEAEAGGS PEVRS LRPAWP 
TW 


12711 


26612 


A 


12839 


368 


77 


KN PNFLKFGSKPMGP PI YS PP \ LEG * AG 
GFI NPGF * TP PGYMGKPPFFLKYSNL PG 
LAAPGGCS PF PGGLGRKI S FTPEMEVS I 
NPGS PLS LPPGEQN 


12712 


26613 


A 


12840 


16 


173 


KTDVHSKTCTKLFTAALFLIVKKWKHLK 
PP*VD/EINKMWYKHIVBYYIiAIKS 


12713 

• 


26614 


A 


12841 

■ 


425 


45 


NSFVFFFFLFENRVLTP VAPRLE \CTGV 
IL/APHCTLPALPEFKRFLVPSALLSSW 
DYRPVPVMPWLNFCI FVETGFH\HVAQS 
VLKLVSLKL/ PS PTWDVPKC*DFRYSVR 
CCGLFF 


12714 


26615 


A 


12842 


237 


375 


LLLVWYILKKLWXLGWAHACNPSTLG 
GQGGWIL*GREFETSQVNM 


12715 


26616 


A 


12843 


21 


325 


TSFFFLESYSVAQAG\QWCDPGSLQPQP 
PG/SQ/DNPPTSG*VAVTTG/MHHHARL 
IFVPFCRDGILLYCPGRSKI IFSSCIRE 
L FKK I S D F FLLTML I FCNS NKG 


12716 


26617 


A 


12844 


369 


58 


PISPLQFMLPFLKNQSPYGPFFWKKKNF 
PPPFLGGAVLKTPGF* NFLFKKIQRGFP 
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SEQID 
NO: of 

n nrlentide 

II LI tlwUIIUv 

sequence 

i 


SEQID 
NO: of 
np n tide 

sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/51S,l 
26 


Predicted 
beginning 
nucleotide 

location 
corresoond 
ing to first 
amino acid 
residue of 

peptide 
sequence 


Predict- 
ed end 

nude— 

ottde 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

sentience 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 

E=Glutamic Acid F=Phenvlalnninf> 
G=Giycine, H-Histidine, I=lsoIeucine, 
K=Lvsine. L= Leu cine. M=Methioniiie. 
N=Asparagine, P=Proline, 
Q=Gtutamine, R«Arginine, S=Serine, 
T^Threonine. V=VaIina. 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, ^possible 

nuplpntirip in^prttnn 














. PFPKKXKPRERPKPCPPKFG/SPPFFFY 
PGFPPPKKGPPPPFFFFFFF 


12717 


26618 


A 


12845 


211 


409 


NALKP KI HFFFS GGGLKGI VWGNTLLDI 
GLHKTFF F\ * GD F YYAL* I * AEMALPGG 
GGDCAPHLQVK 


12718 


26619 


A 


12846 


200 


33 


QKTNNTKCWQGWGATGTLIHF*GEFTRV 
QSHWKIVWKFL* / SLNILP/PSSSSSSS 

Q ^ 3 CJVTCNT.fT .HTrar*rWMTTT A AT.TiT.T. 

AKTWKQP*CE 


12719 


26620 


A 


12847 


416 


125 

' ■ 


KIPTRPGKGGPPLYPRPFRG^iCQiGtf- 
PEFGTPLGNKGKS FPFFOKNKM\NWPHQ 

GGP IGPPSPGG* VGGLTLKGEVSTKLDS 
PPAPPPGOOKWPLP 


12720 


26621 


A 


12848 


85 


397 


KIHIFFIFTiVSLLKGLSFLLTFSKN*VF 
/SFIDFLILCVCVCIEFYCFILILFHYF 
CLFGFCLFLMFVS* CQSIjDFYFWSF/YI 

F*VK"COCVTCF /CTT»TF 


12721 


26622 


A 


12849 


1 


246 


RPRRHLVASLVLTQLNGSSPLPIKIISW 
AWWLMPVIPTFWEAEVGGLLEPRSSRQ* 
* amt\ gppppawat 


12722 


26623 


A 


12850 


161 


1 


NKTTSSQIW*PMPIVSTT*EAEVGGSLE 
PRS /L / KLKCAM IS P VNSHCTPTWTTK 


12723 


26624 


A 


12851 




244 


STYYLHSLPFLDISYKWNHTTC\SRYFM 

flAT.PT.TATTf ♦TfOPTTRPQAnPrtT+'P /TMW 

YIYTLEYDIaAIKRNEILIHAATWMNLKN 
IMLSKRSQTQKT\HW*FHL 


iz/m ■ 




A 


1/5 DZ 






TTT A VQ VT .T T »WT?T .Ml? P * TT /lPHTPT /I / W G 

LITES VDHVQWHAPVI PALWEGEVEGLL 
EPRSLRPAWATW ■ 


12725 


26626 


A 


12853 


168 


453 


CQLRGASGTQGPGLSESQCQQPCP/ PEC 
RPSKPRPCGCRI*SPARTSPQPSPAASM 

ATjPTMT? PTP PP AT.KXJ * P FT /fPTf H Q Pfi<5 A 

KS KSPVKS TERTAK 


12726 


26627 


A 


12854 


244 


3 


ELQVYMHI*IYI/C*SIYTKANIYIKYM 
HMCVY I YTHTHTHVC IHLHI YLLRHGFG 
CWLMLGKVRGCEGMAGLMKAQQGWGW 




ZOOZo 


A 




J /O 


4Zo 


Awn Jr^jvjri i jref r Jj v ujn rurtl? v Jjy Jx 1 
RP* GGGQGGKFNPNFSGPREPPPP/ PPP 
GGGKKGPPPPPGVFVFFI*EKGGSPILPR 
G VLNPGAPKGVYTQRGG I KSGNQGATPL 


12728 


26629 


A 


12856 


129 


395 


APPNTHPFLSEAPQSLSLEARPPSAPSN" 
SSSR/PPVWRASQTHPPQDWSSHCSPPM 
GAPPOT * P FSnSIAHP PODWSSHCJ5 P PW 

GAPPQI*PFSDSLAHPPQDWSSHCSPPW 
GAPPQI 


12729 

- 


26630 


A 


12857 


363 


97 

• 


GLATLRRLVSNSQAQVIRPPQPPEVLSL . 
PTCC?*DYR PGR PPA<?T,THVNQR<YP+ KVAP 

/CSKPLAAAAHQGSPGATETTRRCPPSR 
OGPIP 


12730 


26631 


A 

• 


12858 , 


322 


361 


KS E KQS * VMIiAVCTLDMMKMT\ F I S WF 
LPKTHN/ LMSNYKKNTRQ I PMEGHST I Y 
LTRI PQNCQGHQKQ * KSEKQSQQES KKK 


12731 


26632 


A 


12859 


88 


417 

» 


HFTFFFFFFFFKRGGGFFSPRLKRLGKN 
FFFLDPPPPG*RDFSPSPFKEGGF*KP/ 
SPPPLVFFFFF*KKKGFPLGAGGF*TSG 
PGETPPLYSPEVWFFYGGTPGPTPFFF 


12732 


26633 


A 


12860 


217 


2 


LILINKLANMHCFLFVFETSLT*AGVQW 
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SEQ ID 
NO: of 
nucieotiae 
sequence 

- 


SEQID 
NO: of 
pcptioe 
sequence 

■ 

• 


M 
eth 

OQ 

■ 


SEQID 
NO: in 

09/515,1 


Predicted 

beginning 

nucieotiae 

location 

correspond 

ing to first 

amino acid 

residue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- . 
otide 
location 
correspon 
ding to 
ib si ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystelne, D=Aspartlc Acid, 
Hy— oiutamic acid, r— xnenyiaianme, 
G=Glycine, H-Histidine, I=Isoteucine, 
iv— juysine, j_r— leucine, ivi — irieinioniue, 
N=Asparagine, P=Proline, 
Q=Glutamine; R=Arginine, S=Serine, 
i — j nreuuine, v— vanne, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














HDL/GLLLRLAPPRFKQPPCLIIiPSSWD 
iKlAIiliHrTVl KKDeIA 


12733 

T 


26634 


A 


12861. 


375 


145 


LSFFFFF*KPGSCSVAQAGVQWHAHSSL 
QLQTP\GFSDSLNSAFSVA£PVRVHCHT 
RPLSPFNICWRARPKQKVFL 


12734 

m t 


26635 


A 


12862 


525 


852 


NIjLSIYFFETESRSVAQAGVQWCHLGSL 
QPLPPRF* IKKPSNTAIiFM*NLLSIYFF 
EIBSRSVA\RLBCSGAILAHCNLCLPGS 
SDSPASTSRVAGVHHHSHLSSWDYSCAP 
PCLANFFVFF * VETGFHYVSQDGL/DTS 
♦PQVICPSRPPKVLGLQA 


12735 


26636 

■ 


A 

* 


12863 


.3093 

• 


204 


EPDKTGPVLWKVGGGARVPGMAETLSGL 

GDSOAAGAAALSSASSETGTRRLSDIiRV 

I DLRAELRKRNVDS SGNKSVLMERLKKA 

IEDBGGNPDEIBITSEGNKKTSKRSSKG 

RKPEEEGVEDNGLEENSGDGQEDVETSL 

ENLQDIDIMDISVLDEAEID\NGSVADC 

VEDD \ DADNLQE S LS DSRE L V\ EGEMKE 

\LPEQLQ\EHAIEDKETINNLDTSSSDF 

TILQEIEEPSLEP\ENEKILDIL\GET* 

RSEPVN* ESSELEQPFAQDTSS VGPDRK 

LAEEEDLFDSAHP\EEGDL\DLASEST\ 

AHAQS S KADSLLAWKREPAEQ PGDGER 

TDCEPVGLEPAVEQS/SVAASELAEASS 

EELAEAPTEAPS PE ARDS KEDGRKFDFD 

A\CNBVPPAPKESSTSEGAD\QKMSSPE 

DDSDTKRLSKEBKGRSSOG\RKFLGLVG 

LSSTTRATDLK\NLFRQIWGRVGGAPRL ! 

WTNARS PGS FAVYG FVTMSTAEEATKC I i 

NHLHKTELHGKMISVEKAKNEPWKKTS 

\PKRGSDGKKEKSSNSDRSTNLKR\DDK 

CDRTDDAXRCDDES VEKSK\ DKNDQK\ P 

GPSERSRATKSRKSRGPKRTW\MDKSK 

\GVPVISVKTSGSKERA\SKSQ\DR\KS 

ASREKRSWSFDKVKEPRKSRDS\ESHS 

RVSERSEREQR\MQA\QWEREERERLEI 

ARERLAF\QRQRLERE\RMERERLERER 

MHVEHERRREQERI\HRE\REEIiRRQQE 

LRYEQERRPAVRRPYDLDRRDDAYWPEA 

\ KRAALDER YHSDFNRQERFHDFDHRDR 

GRYPDHSV\DRREGSRSMMG/SIREGQH 

YLERH\SDPEPH\GQDSLR*LGWGYEL* 

Q*RLS\EGRGLPSFPPGAGRDWGDTWLE 

DEDDPVMGKGTAERGPMMDK \DHK\RWQ 

\ GGERSMSGHSG PGHMMNRGGMS GRGS F 

APGGASRGHPIPEGGMQGGFGGQSRGSR 

PSDARFTRRY 


12736 


26637 


A 


12864 . 


2 


456 


IHLGSGSEGDGSSGGLGRGNSNTSRFSS 

PGLGEAGAGS STPGDGGELRY/ WPGLLG 
A* GRGIGGDDDDLRTLGLAGVQQOGKG/ 
RRGPRG PGRGQEPRRPRDLGLRG PRRAQ 
GAA/SATAAPPP 


12737 


26638 


A 


12865 


388 


1 


L I KMTKVKKTAITKC * RGWGRTGAFIHC 
SWECKMIQPLWQTVWQLLKKLNIYL\WK 
QPKCPLM/VKWIKKLYISMDSYSAVKKK 
ETLMYTTA* MTLEVALLSERSQTRKGER 
GHILWDS I YINFQKMQTNL 
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SEQH) 
NO: of 

11 lUIcUUuc 

sequence 

♦ 


SEQH) 
NO: of 

pc|IIIU6 

sequence 


M 
eth 

uu 


SEQID 
NO: in 

09/515,1 . 
26 


Predicted 
beginning 

n ii P IpnttH a 

11 ULICUUUC 

location 
correspond 

tno tA firvt 

lllg W III 91 

amino acid- 
residue of 
peptide 
sequence 


Predict- 
ed end 

n ii fit*— 

otide 
location 

ding to 
last amino 
acid . 
residue of 
peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartlc Acid, 

TT^f^liitam t/* Ai*iff Y^sPtiAnvlalanina 
Ej— vviuuijuii, Sm.nu, r~"JT ueii jriHiMniiitSj 

G=Glycine, H=Histidine, I=Isoleudne t 
K=Lysine, L=Leucine, M=Methionfne, 

No A ona rnoinp PzPrnlin^ 
Aapal aglilC| * *I viiiiv, 

Q=Glutnmine, R-Arginine, S=Serine, 
T=Threontae, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 




ZOOJ7 


a 


IZooo 


108 
170 




KVEGSPE\PRRSRIiQ*AMITPLHSSLCN 
RVRTRGRTRG 


12739 


Zo640 






413 




JN Jr V JN LliiUroljijir llDl PIVjflK V CuA J.V7X1V7 

TCP S VP PNTHLHTGWWCLQHHRS CRAWG 
RGGSHTHRCPQRVPDGEHP * I YIAVHGE 
PP/ESPPQPSPLSCCPPQGNYIALRBPP 

HPPFPGFOGLEIiEKGVGDFI IHPPLTLP 
KPLPNPLKPEPTPTPQTHI 


12740 


26641 

* 


A . 


12869 


715 


893 


PCVAPHSIREERFRLTSESTNQKVLWWG 
GGMAP • 


12741 


26642 


A 


12870 


342 


58 


KKKKPKENEQSIWDMWDNFRTSDI *TTG 
VLEEQr KS W GRKI I rKE I VRENSSNIiTK 
/Q INLLTQEV* TTT / HKNMNKS TP IHI I • 
IKLMKTTTKEKI 


12742 


26643 


A 


12871 

• 


110 


401 


SFYSVMILKASRIVICLFT*LYAYIHTL 
xlx TJUUi X xAi x vNXXXCTHJ. \NXHIII 
HVYTYYIVYAS I YYSMYSAWYNI KVSKD 
RQTYENIALITYY 


12743 


26644 


A 


12872 


439 


256 


S YFMRFGGDKI SKL YQF ILSRRWKQSTC 
PSMDE W INKMWH/ 1 TI ME YYSAK*RNEY 
SYMRHE 


12744 


26645 


A 

• 


12873 


356 


43 


GT PERHSHSHSQGS P I JLGArPlG I RAS rv 
YSLIiS CVLLMLS PQRVLFYFS AFKQQQS 
\WSGKVAQAYNSNTYGKAGSIT*GQEFK 
TILGNIARPCFYKKKFLANS 


12745 


26646 


A 


12874 


341 


205 


REIKKMVLNKKFFSLQFLGKKKNFWEFF 
LKKKKKVFKAPPVFQF*VPIF/CSF*KK I 
KVFGFFFSPKIFFFFFFFFFFETESTSV 
TQEENGTILAHCNLGLQGS S S S PASAF * 
VAVSaULH-Al faljLiGUKGKijlljnKiUuUUUS. 
KKNFGGKKKSKNLFFLKRKKMGT 


12746 


26647 


A 


12875 


474 


206 


PIRANGPVFSPPPPPPAPGTCPHPLVIM 

LSLBTQHFKTAHFIFP*H*SSGLGPGWG 
S PVGEGPG/ HPKLCLQ PAAAAEAWPPRE 
GTKARRHEGLP PAPCAWG PAPRDAGSRL 
EFPTLPTI PVSQSPGR*HGAGGRPSCLR 
AFVPSRGGQASAAAAGCRQS FGDLALLP 
LGS P I LGQGQNFS ARG R 


IZ/4/ 


zoo4o 


M 


izo/o 


257 


467 


VPWPT*T.T DDT.PVTMaUPKTT.'nT./'lOC'nD yo 
Irtvri ljJ_iirKJ-LCj X J. rlrtxl LJN J_uJ J_<vj o o U f / tr 

QVAGTTLK* LSRAS RVAGTTGVHHHSWL 
I FK/YFL* RRWRRFSI* 


12748 


26649 


A 


12877 


867 


1409 


GPTVLVGGQDWNS VLQQPRAHS FTPRGG 
AGGLGGG YVLHKP I P FSQELLGAFPCQK 
Q YRVP PPLEAQLLRQARS PAF I Y * AAVL 
COAOGPSPS PCSPMVS PRPSGRAPHLGA 
APQSGQKALASTSSPDAEFV/SVG*WCP 
SNPALQGGQNGAEPRAVS ITSGALSLYD 
CFLDGWWEGRSLGP 


12749 


26650 


A 


12878 

1 T "j 


1416 


740 

* 


irnprwvqldvfaaapmgrssplflllf 
sflktvlrgsmipstvps/fhkgpqacl 
yslslrs*sqaapspapsslptpsphpg 
ltqpgrgrrkadis/cshswafyvp/sl 
phlpwacplslpqpvlfsilfcfvlf*d 
gvllchpg\wsamarswahcnlhlpgfk 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

* 

** 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=Glycine t H=Histidine, I=Iso leucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparegine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=&op cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QFSCLSIiPSSW\DYRCMPP\GLG*FLEF 
♦VEDGGPTILVRLIWNS* PQVIHLPQPP 
KVLGLQA 


12750 


26651 


A 


12879 

i 


961 

- 

■ 


31 


PSPVRDSCVAWAGERRGRLGGSGVWRSH 
ARGPGLMRGLPASKPAPCVLGSRPCFRV 
AFETPWTS PAS CMFERCP L WRPPALVAA 
GLD P FLLS STLTLAVC PRDL I RSWKKGC 
DVCDSRNCWSTHSRPACRDAI LAP ILAG 
RGECIGLWG/ AAAACTGRV/TPVSGLTF 
STVFDFS S Ar!g/ V*GGLPRS TGGPRVRG 
PGQPVPLPGLPAGVALAPLEGRPQBPAP 
ALSHSLTGSLFLGAWLCGCTRHGRAPVL 
LBVSIVACGQLCEAAPQIFIQPGLQQLP 
TPSATSPAYLWVDVLLTLPSVIKQKRVI 
SFLTKE 


12751 


26652 


A 


12880 


232 


28 


PALWVPWARFDGMEETGQNHWSKEKAW 
PGAVAHTCNPNTLGG*GGH/MT*GQEFE 
TSLANMAKPRIPP 


12752 


26653 


A 


12881 


331 


1 


KNGPFFKNPPEFFPPPKWGVPPNPTPYF 
PWPEPIF*LTPGPLKKQKIPFQ\KGEIL 
GWGGKMGNPLPFWKWGGPPRESKSSPRF 
YKTPPPI FKKKKKGGSSRSRTSPRV 


12753 


26654 

- 


A 


12882 

- 


370 . 


287 

■ 


VS I LAGCCGVAM I DNTVLVHKEVKI LR K 
* 1 1 KK* CI * * FRGHSK/HNFSG* AWWLM 
PVI QH FGRPR * G S L EARS LRPAWATWQD 
PHLYEK 


12754 


26655 


A 


12883 


656 

• 


1168 


EPSRGVWPHEDARINGSKKKKSKDKKRK 
REDEETQLD/IVGIW*TVTNFGEISGTI 
AI EVDEGTY I HALNNGLFTLGAPHK/ ER 
I ALKPGYGKYLS INSDELW/GRSDAIG 
PRBQWEPVFQNQEVRNGGPAEMGEEKRN 
GTKWREDTDHTSFPLFPSTGGQPKAHSN 
WRKVCH 


12755 


26656 


A 


12884 


205 

■ 


452 


ASGS AHRFRTFS FETGRVLD KPSQTDHT 
NRKRGGPNVK*SFL/WLGAAAHACNPNT 
LGGRGRRTA* AQE FRTSLGNTVRPCL Y 


12756 

■ 


26657 


A 


12885 

• 


112 


290 


KYNI I Q YN I VFS CHDPLVS FNLFFFF FF 
GKGG F PL S PRLKARGR VWGYIiNP PLPGK 
RNFWAPPPQGFEIKGLPPP*SPPFGGVG 
QAKNLGPGFQPPPAPMGKTPFF* KNQNN 
PNWGGQPLNFKTLGGGGPKI SF/PPGGG 
GLNNPKPSPGPSTWGKGGNPLSQKKKKK 
KD 


12757 


26658 


A 


12886 


406 


190 


TRS LARGGFGRPLQKP YWNEQS F Z MAKK 
\YEQPKCPWTDKWVNKMWSILTVG*YAA 
MKRRDVLTCATITCR 


12758 


26659 


A 


12887 


440 


160 


RVCVCVCVCVCVGVFVCFALWLSSLIYN 
* VYLFLFVHNLGRVS FSYFLNLI * FFEH 

\TKtLXXT /tlMVimMTnUipyi VT irOVT l"l'L.'LT / 

VNHV/HNI VUWIUIM IaJjIJjKo IhttXc «/ 
CPC*FPPILLH 


12759 


26660 


A 


12888 


201 


411 


HIWDVILIRHALGNFLTSIS CYI VKIWL 
DAVAHTCNPNTLGGQGG/WIT* SQEFKI 
ILGNIGDPLSTKKNK 


12760 


26661 


A 

» 


12889 


138 


464 


SCTMNPSEMQRKAPPWRWRHRS *APSTH 
KMNRMVMSEQMKLPSTKKAEPPTWAQLK 
KLTELA/TKKKSLENTRGTQTSKNMLFA 
ALMIVSTGCAGVPSSSKETATIEDKP 


12761 


26662 


A 


12890 


146 


366 


FS VSMKYFV* IH* FYHL/KLLLMVYL*Y 
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SEQ1D 
NO: of 

D UCICIf IIUC 

sequence 


SEQ ID 
NO: of 

|JCUUUC 

sequence 


M 
eth 

uu 

• 


SEQ ID 
NO: in 
I ISSN 

09/515,1 
26 

4 

• 


Predicted 
beginning 

n 1 1 r lAn+tff a 

II U kKUUUC 

location 
correspond 

inn fr\ flr«f 
lllg Ml 1119 I 

amino acid 
residue of 
peptide 

con nonMi 


Predict- 
ed end 

otide 
location 

r*f\ rrpcnnn 

corrcsjiun 

ding to 
last amino 
add 

resiuuc vl 

peptide 
sequence 


Amino acid sequence (A=Alanlne 
OCystelne, D=Aspartic Acid, 

Vf^CZ lntnmir Arirl F=Phpnvlfllflninp_ 

I1/~VJ JU U1U1IL. /*CIU, UCII^IBUIIllllCj 

G=Glycine, H=Histidioe, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 

iT~"/»sp«r nginc, r M rr uiiiic, 

Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 

Y~T In le n niifn Cfnn ftxtt rwn ^anncoiKIa 
/V— UOKIJOWn, — OWip COUQll) t*^yWi9IUW 

nucleotide deletion, \=possible 
nucleotide insertion 














f*PT .A T NT QHN T <? C Q OKTMTfT.WT • * (JBGW 

. ur LnllV i. O^lN lQV>JyMPU> 1 in 1 1 V3XVVJ V V 

AHACNPGTLGG*GEWMA*AQBFNTGLSIT 
TAKPYLDF 


12762 


26663 


J 


12891 


368 


50 

• 


KGRQS6VS SFPVGTVTNYTNLVAHNNRY 
LFPHS P6CQTSEVKVLA\GCSLLKASRG 
pr pppt . pn cjnn q ttp pt .& vnc* t tp t qjx q <? 

CDIiLLWVCVFSSYLAY*DTCHWI 


12763 


26664 


A 


12892 


227 


456 


KLTTLLKLFEK\ I E DRTLS KS FY EVS I T 

*TDQCCCQCCCD / T CT-TNTTT^ATTlf T1MVT D 
M X rOOOOOOOOr/ XOXjXJN X LJtAAJVXRIJXX Jr 

RN^IQQCIKKI I FHKFSTGI YS 


12764 


26665 

* 


• 


J 2893 


461 


on 

89 


W*QCKIMQLLWKI IWQFL* /BLNTEVPY 
DS I PLLDMY/ PKKTBC/ YTSTQTLI YCM 
NPIEYYSAIKRDKVLIHATTWMNLENI / 
ISKRRQSQKSTYCMIPLI 


12765 


26666 


A . 


12894 


222 


185 


RIVMQYLHLNLGACVCVCVFVCD/ CVCL 
CV*MCVHXHXYI& IHi XIHIXVGM*ICR 
LS*+CEFYS 


12766 


26667 


A 


12895 


186 


166 


I FFFFFEMGSHS VTQARVQ WHNHSSKQH 
PPP\GASDPPTSAY*VAKINIFFFF 


12767 


26668 


A 


12896 


55 


485 


T.CWDCRDEPPRPALESVFLTSFLEDEFS 
IEIADLVPHPTPCSHPLQPP* *VIIYFC 
S\KPLVTGSSSPPRKPFPSLHEPKSASL 
*PPSSTIQPVRSSPKG*PSPPLA*VQFP 
KGMT I PTIiCIiS P VXi PVPST I PxjFNVSKP 
LPFY 


12768 


26669 

• 


A 


12897 


923 


312 


VDV I RKVQ I E I TLRCHL IHVRIAI I KKT 
RANKCW*GCGEEGAIiVHC*WEC/NYILV 
QPLWKrrVWRi? IjSKijKXELPyDPAIPIJjG 
ISPKGNEII/CCTPKFIAAQFTIVKIQT 
QPRCSSMGKWIKKL/W/HIERSIDR*LE 
YxTiAFKKKAVLSFAKTWXDLEDIMI^lEI 

E>U* ^ * ** " X O Li X \J\3± JUUxn. V 1^ X XJSXJS 

NKTVITKVRIRWRK 


12769 


26670 


A 


12898 


320 


3 


ESWRWGRIVRPHSVPGETWLCPVPPGPP 

NAPDGSFPLAICPVPPSCRSL/CPP/DD 
PT,ac anv+nnpp t.pptph g putt cwappo 

Luno/UjV "v»VjP f JuirlV 1 QHO Jr Alio WrtUry 

EQPWPPQCKLKVGAPNSTALSEGS 


12770 


26671 


A 


12899 


138 


447 


IFVFHYKTQNIYNEDTIaHVI I INLWCYP 
GS YI KSTGRLRP EVERGIiGPTTMCC* FS 

PSTLGG*GRWIT*GQKFNRD 


12/ /l 




- 




Ziy 


1 . 


RFIR/NLKI EliL YVPATPPVGV* LKELK | 

SAC*RDNCTT/M/FTAALFCIGYGSNliG 

NSTALIGSRVLFP 


12772 


26673 


A 


12901 


235 


183 


I INYGGGPPEKKKKGGKKIPLI FYKKIK 
KFXWPKKKKLFGGAKNPIV*AHHIFTV* 
IDVDTRADFTSATI \ I IAIPTGVKVFS * 
LATLHGSNMK*SAAVL*ALGFI 


12773 

• 


26674 


A 


12902 


180 


19 


LFIYLFIYIiFICEIRCHFVTQAGLQW\L 
I IASCSFALLGSSDPPTSAS *VKGI 


12774 


26675 


A 


12903 


406 


949 

i 

• 


APVPGHGFCVSF/CGENGVSLYCPGRS* 

TPGLKLCSCL\CLPESWDYR/R*APVPG 

\LGFVFHLTL*KPPFFKDYMKSLFQFLK . 

YLIQG*CTLV*GVRYR33FIFFFFFLRR 

SIA/SVTQAGVQWRDLGSLQSPPPGFTP 

FSRLSLLSSWDYRRPPPCPANFAFVFIiV 
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SEQID 
NO: of 

sequence 

• 


SEQID 
NO: of 

pcpuuc 

sequence 

■ 


M 
eth 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

n it f ImiHH a 

location 
correspond 

int» to firtt 

■ UZ£ lu tit 9i 

amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 

■iii(*Ia— 

otide 
location 

Wl ■ vOUVII 

ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 

Tr=r21itf a m ir Arid T5^=Phen vlalaninp 

G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, ■ 

N^Asnfl rapine^ P=T*rolinc 

Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=VaIfne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \»possib!e 

iiuLicutiuc iiiscruuii 


• 






• 






EMGF\TVLTRMVWIS*PCDPPASASQSA 
G ITGVSHR AR P KFVT ^YPKDMGEPI/jGVT. 

HRSHIAVHWM 


12775 


26676 


A 


12904 

- 


265 

■ 


385 

• 


GNGS PS LRW I GVLRGRS PTLELRH* PIP 
LTAASRP^LDRVSVPMWGTFIjSEPLS TP, 

U1/VWAJU9UUA V V XT Li f» VJ A XT 1 IQPg AJO 4>E> 

GLVSRYLTNNLMERIPILYRN/PFNNET 
M/PESLCYRVL I FLSKGYPRVKGRLDTC 
Y<5 PVR R <? P AKKA KT.PTlAPRTiRfTVKPVA 

SVHPEPGSNRTRG 


12776 


26677 


A 


12905 


363 


137 


APGMQA* WCM/HCI PGAQEAEAGEWTEP 
RQ^RT.WCTVTMPMfl 3HCTPAWATW 


12777 


26678 


A 


12906 


315 


565 


TPVDHGWLECSGAIMAHCSLKFLGSRDP 

Pa^ZXQRVa(^TTYlVr*WW^OTiTFTiin?WiTR ' 

SNFVAQAGIELLG\ *VILLPQSPKVLGL 


Ml to 






100H7 

izyu / 


j /© 


41Z 


D A.U PTWRWT .* PRVWMTDR * P PR DMT.TY 

LTA\ PLKYFKSGVMAHSCN\ PSTLGGQG 
RWIT *GQE FET\ LANMEKPCLY 


\d.l 17 


ZD05U 


A 


IZtUo 


01 < 
ZID 


L 


RGGKIS * TQE FETS LANTVRPHLHKKQK 
IALPLPLPLPIiPTVR 




ZOD51 


» 


jzyuy 






LKKKEIL 


12781 


26682 


A 


12910 


279 


123 


KMILS WARWLMS I I PAAWEAEAGGS VEL 

\ LBLP+ftNITVPTiRHPTPAWARR 


12782 


26683 


A 


12911 


1 


194 


AKQHI PLLARLECI /GLFTNTVTAHCSL 
ELLASSHPPASAS * VARMTECVHRAWLE 
WIFYLCVIL 


12783 


26684 


A 


12912 


206 


388 


IELSKQGPVYYKWYVRITPPYFPFFFFF 
ETGSCSVT\RLECS / GTITAHCSLNLPD 

T.QVQ'D , PJVaQ#VaW 
Jjo ISr 1/vu w Vnn 


12784 


26685 


A 


12913 


238 


426 


TPl^PSQQWLKKESLWLGRVVHACNPST 
LKG*GGRDS\RGQ* FBTSLGNIARPCLI 
REKFLKI 




ZOOOO 




1 0Q1 A 


1 ID 


143o 


J\x\xJi\tLc L CiV J./iINi\.y riiVK^O X oxAn*LrvOlTL 

QSKIVRYHLSB* /R*tiFFFNGGNCWQG/ 
CKK/ IQDLIHCWECKMVQPLWNI IW\RF 
L*NIiTCICHA5GIAX*IjSIYPREMKTCVH 

TXTC T R MF T AAT,FVM\7ETWROPT YP^TiO 

G 








1901S 
jtzyi «> 




Z J 


I KPPPCPSTWGTKKKXiFPKKKKEVMKML 
CFHFNLLVQN IF* YRLGNE I S DKAG \ PV 
AHAWNPSNLGS*GELIT*GQEFDTSLAT 


12787 


26688 


A 


12916 

« 


254 


150 


TLPGRSSD FSLPEV/ YIjKKNNH/ WLGW 
AHSCNPSTLGGKGGWIT*GPPHMLKVSF 
LPTDPTSRVKOL YNWVPHSTVL IALFPT 
KNPG 


12788 


26689 


A 


12917 


181 


24 


AQWCDG I P VSKGNL / WLGMVAHACNLS T 
LGGPGRW I T * GQE FETS LANE FHRTD 


12789 


26690 


A 


12918 


75 


298 

• 


IiFILFTS * VD *MRPTHIKEN1!IL\LAQS T 
NPNVHL I Q/ KHPHRHS QNRVWPNVWAPH 
G\QSSCHIKMTIMARPSGSQL 


12790 


26691 


A 


12919 


251 


9 


KPMFLARCGGSRL* SQLSGRPRRADHLR 
PGVQNQPGQHSESQYLPKSTNFCRVWWC 
TPWPATREAEA\GESTEPGRQRLQ 


12791 


26692 


A 


12920 


198 


211 


RFCFSVGDEREGKRA/ RERNRETERKKQ 
TEDTETQREERDRDRKR\ERE TDTERER 
PREERHTEREKK* KAAFPFQTIjGCRAQD 
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SEQID 
NO: of 

IIUUCUIIUC 

<? en nance 

1 

- 


SEQID 
NO: of 

nanHHp 


M 

eth 

od 


SEQID 
NO: in 
USSN 

09/515 1 
26 


Predicted 
beginning 

nil ManKrla 

IvUI UVII 

correspond 
ing to first 
amino acid 

fPGlHllO ftf 

naiuuc hi 

peptide 
sequence 


Predict- 
ed end 
nucje- 

UllUB 

location 
correspon 
ding to 

lasi dm i iiu 

acid 

residue of 
peptide 

sequence 


Amino acid sequence (A=Alanine 
D=Cysteine, D=Aspartic Acid, 

Us— vyiuiamic /vciu, r — jrnciiy miaul Do, 
vj~vriy ciiiCj n~ xiisiiuiiJti, *- launmcine, 

K=Lysine, L=Leu cine, M=Meth ion ine, 
N s Asparagine, IMProline, 
- Q=Glutamine, R=Arginine, S=Serine, 

TsThrPAnina V— ValinA 

i — i iirconine, v— valine, 
W«Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, possible 
nucleotide deletion, \=possible 
nucicoiiQc insertion 














FPKQL FPS RGWGVGPMTVLL 


12792 


26693 


A 


12921 


438 


127 


RWMERI FVRPHI KDSMVWVRSTLLGRG 
DALNDLFLYCWEKEDPRYKERP\RERER 
E RE RERE RDRCWE K* APGWRERP VQS LR 
ACHP P WHRFALAGP VPSAQ 


12793 


26694 




12922 


158 


460 


EAR VE liLi L S S RCKP RAV I TAETQPAVYR 
LQLEIENFPGLSPRLECSGAITAHCSLD 
LLG* SDPPNSASR\AGTTGMCHAQI VNS 

CKADWLVLiLSSRGIjTHR 


12794 


26695 


A . 


12923 


33 


484 


G IQAAC I PGLRLS P PADAHCHRHIPS PP 
P SSTAPRDDVGPMQ ITPDNFPPQDPPL \ 
TPPSQSLLPHRAP*PQGSSVDLPaLLFK 
LPQMPQTPGPEEVDTCVQIHKTSKARBD 
VQMRKTPMAREGRHRAAHRWIGALGKAP 
QSQRRLRH TVG 


12795 


26696 


A 


12924 


335 


332 

• 


. RIKVYVRGHAKE * LRPGP/ VAHTCNCHT 
LGGQGMRI TQAQEFETSLGDIGRPHLYK 
KKKKKGGPF 


12796 


26697 


A 


12925 


302 


407 


AQWLMAI I PAP WEAEAGGLLEPRS * \AW 
VTW 


12797 


26698 

t 

i 


A 


12926 


766 


244 


RVPRL PP PAL * LE FS PSGLRLARSHS FQ 
LRSSPPPVSPPST/PFSPHRSPPLSLLS 
LSVPS /PPLHHSFFSQRAPHILPPPPLI 
FPSSPPPAPPPPPPSSSSPSSSSSPPPP 
/PPINWGTPRFFPPPPFKNPPPEINFGG 
PKKKKI FSPPRA* KFVFLKGPPPFFFFF 
FFFFLGINLL 


12798 

■ 


26699 

» 

■ 


A 


12927 


127 


720 


WOGLIiES TELQLQSLPGLYVSVI FGKI F 
YISGLGI CCTKI FPSALPSAQPAASLPL 
PARSALGI VFFLHFC* IE/CNYKKLPI F 
HHI * L I KTFS YGLT * LFFFFFKTKSIi I F 
S PRLNCRGPFLVN * NLCLRG *GNSPA* P 
FGGAG I EGAPHKAGL I FGGGGGAFLKKT 
GXLHHVAPGGASNSGTGNISHPPI^IPPK 


12799 


26700 


A 


12928 


308 


49 


KEHW *AVGHFNI CVTGGP *GERKKGTER 
VFKEI ITPNFPNLIQTIKPKTQETQHTP 
5 KKNT.K.I / TPR X I X X Kr JjKTSNKE KXIiT 
TAR 


IZoUU 


ZO/U1 




1 TOTQ 

izyzy 




i 


LND\EQADTTFTI * FCVRQKTKKSTDKL 
VKL I / ERFYKVAT YM INFLKKSNS FHTP 

CVRFVGKKLTFPER 


12801 


26702 


A 


12930 


202 


379 


SSENFSGKTVNLCCLGRLHRMISWLHTV 
CLY 


12802 


26703 


A 


12931 


209 


376 


TVGEKMYLKVKRI \ WLGWAHASNPNTL 
GGOGGW IT* POE FETS LGNMAKPHLYKK 

™ * * « J-J UMU A ill ^ Aviv 


12803 


26704 


A 


12932 


280 


31 


I S CKVT KKVTXjFELKY I WNKF Y I YERVS 
IVGTTEDAACEVTSNVITGPGAVAHACN 
PST/RGRWIT*GRELKTSPANIAKPRLY 


12804 


26705 


A 


12933 


1 


189 


QM YTE SE P * CKLWTLGD YNV* M* l?ts*/ 
IQCTTQAGNVNIRE/RLCMCSGGEYMRN 
LCAFLSIXiXt 


12805 


26706 


A 


12934 


108 


409 


HHSHLLKLPNVWFS FCHFIVTVFLSL*R 
VFFVFY* I FLSSSHFLFSFCFCPIMFIM 
FPFKFILS(ZNIVFI*NIIFFLF/VHV*G 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

» 


SEQD) 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N»Asparagine, P=Proline, 
Q=Glutamfne, R=Arginine, S=Serlne, 
^Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














* STLVSLFVFFFFLLQF 


12806 


26707 


A 


12935 


225 


11 


I LLCMEPS FLFWFGFICFVKTGSHYIAQ - 
VEVLWLFSGMIKVHCSL/KLLHSSNPPT 
SAS * VAGNTCSS YCTW 


12807 


26708 


A 


12936 


74. 


12 


GTTHASAGVINILVFI LSVFFLTHSLCI 
HISKYMFVYICVCI CTRTDFYTEFKSFC 
TFCFFGSSFFFLEKKFPLVAGLBEQGKD 
LG*LKPPPPRLKKFSCLTLPG/AWE*RT 
NG 


12808 


26709 


A 


12937 


166 


311 


PHS * WECQMVHAPAIiENWQ FLTKF IRGL 
PL\DAALPLHM/CHRKSCTQLFI IALLI 
VAKKCNHHKNL * ANE WINKM * YI HTTDY 
PLAIKQNEILIHATTWMNWTL 


12809 


26710 


A 


12938 


51 


296 


LCMYTLHTLYMHTC I YAYICYTCKYIHR 
Y/LHT*IPEDANSVTCHKYTHTHIYVHTL 
VYPHTTYLMCSYIYSWVCVCVCVYIYT 


12810 


2671 1 


A 


12939 


426 


28 


KGFNLVNPRVFPS PRF* TPAPVFYFOGP 
I KKI FFFKAGGVKFDSLKRAPLFFFFLK 
TGSCFVP \RVECNG 1 1 TAHCNLCL PRSN 
DPPPSDLQVTGVKPPP\AKLFFFFLW*M ! 
GFHHILDYQGETCLS^QLLGWR 


12811 


26712 


A 


12940 


11 


213 


ATAPGIiFS FFETGFCS VDQAG VQ *HDHG 
SLQP*PP\GSSDPPALASQSARITGVSH 
CKGPEHTSIiLT 


12812 


26713 


A 


12941 


416 


38 


WLNNHSRLGFPNCWDYICKLPRLGPDVA j 
SLLIHNSS \GTWYGATKLESNWYFS IP I 
YKDHQK* FAFTWQGQQ YTFTVIiPQDYVS 
SAL *HNTVHRDLDHLDNPQN ITLVHYSD 
DTMLLDLMS RKYQAL 


12813 


26714 


A 


12942 


361 

i 


74 


I XHPS FS VS SRN IHKNAYPAALGG YS VY 
GVAI LL FHYF INKLAFTLHCGLA/ LNS F 
LR* IQEPPI*G\SGSGPFSCNVFLAPTEG 
TIVQKPDPMATFG 


12814 


26715 


A 


12943 


125 


401 


SSFEPRGIPLNPEVAMQKESVNILCSPR 
SQEFLESRIKIKVLTDLTQDEL*GQAQ* 
LTPIIPS\IWEAKAGGFLEPRSLRLRSC 
HCTPAWVTE 


12815 


26716 


A 


12944 


92 


306 


KRTTNSPWGCSAY/ GVAILLFL* IL/KL 
LAFTLIiHGLPSNSFIiRKIQEPSLG\SGS 
GPIFGNKLVAFGGLMIWS 


12816 


26717 


A 


12945 


322 


24 


NFKKNINLNPFLRGYFLFLKGGFGLTFK 
KVGEPLKKKKKKLNQQFSYFLSTYTVK* 
RN*/WLGAVARTYNPSTLGGRGGWIT*G 
QEPGTSLANMVKKILAK 


12817 


26718 


A 


12946 


2 


448 


GGAPMKRHGSLDRNYRVDTTRPSQLFLF . 
QLYHNYLKFAYEATKEYMHRKETLCLPC 
PAIQF&FPKS NLCYRGYLCMRTCAHTQM 
*MYRYIYTYVYMHIYICM/YRAICTYYT 
FE*KY ILQKCILLCI FLFFLQYLKYVSY 
LNRHHFLIAA 


12818 

• 


26719 


A 


12947 


155 


415 


LLETGGNAGCSSWMHVLLGKIHRYGMGS 
RPXSLMAG* \GPPWLRRVAGLRGRPATL 
GLRYGPDSYGRQQW\EYCTMGASIiMQRR 
RVRD 


12819 


26720 


A 


12948 


247 


250 


KYVYKYIHIVYKCVYI FTHVYGYTYIiYT 
DVYKYTYL YTDV* I C I CI YKCI YLYWYI 
YIHLC INM YI YIHLC INMYIHIHV* IYI 
HIYl\HVYIFIYIFLYTVYK*IHIYBVC 
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SEQID 
NO: of 

IlUCIcOUUc 

1 


SEQID 
NO: of 

Lie U LIU C 


eth 

■ 


SEQID 
NO: in 

09/515 1 

26 


Predicted 
beginning 

Mil J*Iaa4-**$ Ay 

correspond 
ing to first 
amino acid 
residue of 
• pepiiue 
sequence 


Predict- 
ed end 
nucie- 
onae 
location 
correspon 
ding to 
last amino 
acta 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Add, 
£/— Liiutamic acio, f— .rnenyiaianine, 
\» — vjiycjne) n — riiSLiainc, i— isoicucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine f 
Q=Glutamine, R=Arginine, S=Serlne, 
T^Threonlne, V=Valine, 
w—irypiopnan, x— tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














EYIYTLWKL 


12820 


26721 


TV 

A 


12949 

i 


25 


407 


EALS LLLLL vLWGLLLGIARKMY QQMNL 
VQYCLLVL * P * NL * EQ KRFERE VGLDHR 
EP*FHVLGCIHSEKSFRFWSW*BL*IRK 
Li TJ_i VA \ CiG V AH i Y iM P N T-LGHK GRR I T*G 
QEFETSLGNWRPLFL 


12821 


.26722 

* 


A 


12950 

■ 


370 


106 

■ 


ILYLETLLKSFISIRSFSAVDSHFEKSS 
TVGKMLPNSITCTDPI \FNKKKSQLMWQ 
SSLLPFSQKLPQSPQPVSQQPSPSRQGS 
! I PATR* QIiA*GSVQVMLFGS I LP TVELF 
SKWESTAEKLLILINDFSKVSRYKIYV 


12822 


26723 


A 


12951 


280 


319 


DQPGQHGKTLF*VSH*KKKKGWM/ POVY 
AYNPRFLGGRGGW I T* GQGFETS LANMG 
KPCFY 


12823 


26724 


A 


12952 


337 


113 


IQIAISLLGrCPRDLNTVTtS\St)TCTQ 
MFLSVLLMVAKR* KQPKYLQIDEWINKM 
QCIHMES YLVS KGRKYLCKL 


12824 


26725 


A 

4 


12953 


9 


186 


DPTVCCLQDTHFTYKDTNKLKVMGWTKY 
/ CHANGNQKRAGWIQIEYP * S KSLKSE 
IVQN 


12825 

• 


26726 


A 

. 


12954 


46 

• 


381 


NENTYSYKKNTQMFIAALFVVTPNWKQP 
TC PS / SGE W INKLL \ YHLMVYYS AVKTN 
/ ILVYVATSVNLRI I I LGKQPDKKRVLT 
V* FHLYRKYKLI S SDRKQIHGRLEVENK 
GK 


12826 


26727 


A 


12955 


19 


250 


CSDMVLKACIKKLMSYSKMGHIQAMDYY 
TAL/NKEL/LMYATI * ANLEDIMLSKIR 
QTQKDNCLMVDCI YIRYAEQSS L 


12827 


26728 


A 


12956 


173 


163 


GAKHSASGTGWIRKDFTRPGAVAHACNP 
STLGG * GGWLSRS E FETS LTNMVKL/ di 
YK3ICCTRCLLGCFPQALSKAID 


12828 


26729 


A 


12957 


301 


79 


EKDNQPLLTPRNCNVLHAIKESENHRIT 
ELKVTLRPDAVAHTCNPSTLGGRSG/ WI 
A*GQEFEISLAKKVKPCLY 


12829 


26730 


A 


12958 


314 


126 


KHHHFKKHNFRVL C VCVCVCVCVCYKHF 
YLFSLSFSQ*P/CICC*CTOCSFLLCCF 
♦SATYLKII 


12830 


26731 

- 


A 

• 


12959 

■ 


529 

■ 

• 


1697 


VPFMGHISNFFFSPFLRQKYLALLPRU3 . 
VQWQ*SRAHCNLRFPG\SSNSHASASRV 
AGITGACHHGWLLFVFLVETWFHHVG\Q 
AGLKTP* PQV\SACFSLPKCCDRRCEPL 
CPATSS I LNL * TWDRL / PGEPKNQ *NLL 
VSFMCAVFYREGQQHVYFQRSLWCKREF 
I VRS PWAINL E KGGLRMGAV* I RI MLNL 
TLT*GFCTEDEKTVNSRVNQKLTPT*LG 
DE * DVKREKNINDE I KLLTONS B YTKS Y 
RLGR* I KLGAFCFFM \ VPMGQTGRNGQL 
KvQ V*TGTVAHVCNPSTLGG*DRR I TRV 
OE FETNLGDHSESS SIiOKT IfTjARHGAAR 

L*SLSL*VSWDYRCSPPWP\SOTCIFK\ 
RDGVSPYWPKA/WS *TPDLQRSAHFSLP 
KVLG 


12831 


26732 


A 


12960 


1 


395 


GTRE FR I LS EKFKKE I DFF FFFKKQSS P 
PPGPKKQTL/C*KQTTPTKPQKTNIiGPK 
KFGGQPKKPFKISHPQIGPPKKKNYNP* 
GQPVWNPPGEKKEGPTPPKFKDPGANLK 
VMGLKGGVKRKRGGKTFPQRA 


12832 


26733 


A 


12961 


90 


436 


LWRLTS P KS AG WAGNLETQE ESMLQFKS 
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NO: of 
nucleotide 

t 

sequence 
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SEQID 
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sequence 
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26 
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Predicted 

beginning 

nucieotiae 

location 

correspond 

ing to first 

amino aciu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E/-~<jriutamic Acid, r— Jrnenyiaiamne, 
G=Glycine, H=Histidine, l=Isoleudne, 
K-Lysine, D=Leucine, M=Metbionine, 
N»Asparagine, PHProline, 
VpK^lutamine, K=Arginine, aenne, 
T=Threonlne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
x— unknown, =Mop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 




■ 




■ 






EGHR/ PAESPLA* SSVLFSLSTDWMRP I 
LI/MEGDLLYSKPIHENVKISSKNTYRN 
IWNNVWPHV*TL*PSQSD/CLK*TITPr 

VLlLTCTH 


12833 


26734 


A 


12962 


415 


463 


LRAKDQVLPKI LVC * TLLPFCAQI KKKK 
K/WPGAVAHAYNPSTLEGQGGWIT*GQE 
FKTSLGNMVKPSGIPPHWI 


12834 


26735 


A 

• 


12963 

■ 


774 


316 

■ 


SIS PTCSGKIGGKLNRHFS KEDKTG\RY 
MNKCPISLVISECVXjKP *BDTH*TP\LR 
MAKIKRADH/DKC*QRYKMTGTLIQC*R 
ERKMVQPLWKTV^QFLKRLNIHLPDDS I 
PLLG\ I Y* RKMKACVHXNTI IWMPIA3L 
LVKAK\AKKQPKCPSTR 


12835 


26736 


A 


12964 


44 


395 


MYFTLWJGSLGPKFLGGHQPLQGQGPFL 
HFLKAL* *FPFPQTPGNGGGTPPAKTFP 
PFPTPLSSSSPCPSPAEGGSREPSLSTP 
/SCLHLYIGGAASNPSPPSPLPPPQCSG 
LGYPVC 


12836 


26737 

i 


A 


12965 


424 


50 


MAFPFLSCAPDAINNNNKKCCREINNEE 
PPNEPLKWRHIMIFFFEAESHSFAR\L£ 
CSDAX SAHCKLCLPGSRHS PAS AYRCPP 
RHLANFS AFLVQTG P PRVSQDGKDLQTS 
*SALLMPPLDPAIP 


12837 


26738 


k 


12966 


245 


44 


QSKBLGNYPLPSPSPSSPSPAFHGLPPP 
SSPSPAFHGLPLLP\SWTVQPQ*RLTAT j 
SLPDPPASPRPG 


12838 


26739 


A 


12967 


205 


30 


QEGASLS RI KRGPGVGAWLGMVAHACNP 
STLGGQG/SWIT*AQEFETSLGNPHLYQ 
KYKN 


12839 . 

• 


26740 


A 


12968 


114 


465 


DPVSN PETTKPB P PKKTTES * P/ BPKSL 
LWLPS\P*NS3*CANREQQPPSPQPHLP 

9 ******** mmm, mmm m& mm ***.*> mm> mmm. mm m— • _» m mw Vm«m*Jm«iM M>MM%a bsm 

HPLPTSLQVHRLP* *SLPPPNLFTPLYD 
/SPEPSLLPVIQFPA*TSPP*PPVYSSP 
PSPTCPLH 


12840 


26741 


**, 

A 


12969 


1618 


1038 


4 Wm *M »m mm. M M-MM. mm m. MM> M MM *m **mm* M MfVlfe M^tf W VM VW*«UM M 

VHMVQDKD I NLNNQFLSGTMLLFFKETS 
HR*DRGPGDF* S CLYS AP I PTQPLLF * Y 
LPQNFIEG* PS *NSRI KGTSLKTYISRV 
PSFFFS FL * DGVS \TVPQAGVQWHDLGS 
LQPPPPGFECRFSCLSLLSSWDYRHPPLC 
PANFLDF * * RRGFTMLARLVSNS * PCDL 
PASASQIAGITRMSHHTLPHVFPLNV 


1 m* Cm A Hi 

12841 


26742 


A 


12970 ! 


197 


411 


EQALRS TVC YCLNPLP PAPLRPRQDC \G 
PCRFRWMPGGRTRWLAPVI PTL* EAEAQ 
VWLEARS LRQAWATW 


12842 


26743 


A 


12971 


'A* 9 A* 

616 


777 


MGIHGVRHNAQLNFVFFvEDGGEWQLTC 
TGAGDSSWVLGEP*VFAC/PKNVLFYLH 
P*LFG*I * I YQLKITF*NYEGMVP/FVF 
RFLFETRSLLSPQLECSGTITTHCSLEL 

VFFVEDGGEWQLTCTGAGDS S WVLGEPL 
RYYKERLIKWYL 


12843 


26744 


A 


12972 


222 


389 


VITEALGSDELEGRQLWW* I KIHSQVQW 
LIPVIPAL/LEAKAGGLLEAKTSRPAWE 
T 


12844 


26745 


A 


12973 


425 


145 


QIIFFLFQKIKNQFNKASFFKKMTLEQM . 
TSRQKNSAQ ISVFKKKNSKWITDLNVTH 
*TIQLTGKN\NIGKNLQDQGLGKEILNL 
TLKAQSTKG 
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M 
eth 

00 


SEQID 
NO: in 

I [COM 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid ■* 

residue ot 

peptide 

sequence 


Predict- 
ed end 
nucle- ; 
otide 
location 
correspon 
-ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Atanine 
C=Cysteine, D=Aspartic Acid, 
jb— oiUiamic aciu, r— rnenyiaiamne, 
G=Glycine, H=Hlstidfne> I=Isoleucine, 
K=Lysine, I^=Leucine,M=Methionine, 
N=Asparagine, P^ProIine, 
Q=Glutamine, R^Arginine, S=Serine, 
i— inreonine, v— valine, 
W^OTryptophan, Y=*Tyrosine, 
X=Unknown, *=Sltop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


12845 

■ 


26746 


A 

• 


12974 


322 

■ 


411 

* 


NNKKIFMSTS*KFFYCFIVQLIFETILS 

x Tnxrr oottt t*tvt j,vot t/*itptmjt nr 

VRVIiE S 1 1 L INYL * YC 1 1 C I FPFIjFL* T 
* E I RKVTKFTS FPQI EEK* KCRR/ WLGT 
VAHTYNPSTLGG*GGRIA*AHEFETSLG 
NTGRliPS VQKI Ii 


12846 


26747 


A 


12975 


428 


1 


SRKSSSQFRLSFFPSHFLDLGAKGAVSQ 
DRAIALQPPGPRSR*S*RPPSRPRCFRR 
PRPSRSRSNKERAGEKRRPRRSLQPRPP 
HIAGP /VPRRQGSPSAGPEAPLATDEPP 
PQRRSG IRGGSANI RSKQTSSLRPRVRG 
RVG 


12847 


26748 


A 


12976 


60 


294 


NHCTETVLFQ*LTCVCVCVCVCVCVCVT 
ERI /CFFKFLIGKGTYGCIYLWPKTFIL 
IFI YILYSPRGRS I LKKKFFFLT 


12848 


26749 


A 


12977 


6 


341 


DSLTLLPGLECSGAISAHCS LCLSG/ FK 
QFSCFSLPSSWGFTG/VPPRMPQLIFPY 
FSVKTGFH\HVG *AGLKLLI PPASASPK 
GWDYRRE PPRPGDLWNF* PTGFKLRFPQ 

■4KB 

FPL 


12849 


26750 


A 


12978 


296 


42 


SQPASLGFKSVDLPLIiAVCPLGKITLTL 
WDFSFIIYIKQE*WPRAGIFMPY\NPST 
LGGRGGRIS*AQEFltrSIiGNIVKPHYLV 
K 


12850 


26751 


A 


12979 


20 


373 


KLYGG I WGFFFFRRGIiNSATQAEMQGHN 
LG*LKSPPPG\*RNFPASASGEPGITGS 
HHKAGI VFYI KKKPGLKTFTLGKPTPPP 
SFRHWHQPAPMG VYS EI LGEKSLTNGWD 
PQTKKG 


12851 

• 


26752 


A 


12980 

• 


380 


96 


PVFPP PWRAQKGGS IGRE IRPPRGNQGK 
PGPP* KKKKNYPGGGGGPPS PPFPEG\ * 
PKKWHNPGGGTPP * TEFP P WPSTRGGEK 

KPPb KKKKKEK 


12852 


26753 


A 


12981 


331 


71 


ELL I YS WVWWCTP WPATW +GRELVGRI 
T\KSRRLRLQ*AKIVPIiHSSliGDRVRPC 
PP P PTKKKXiANIiCP WXBCjo JjS r a YLiItEQ 
FHSC 


12853 


26754 


A 


12982 


383 


189 


RRALKSCAHSTPAGPKNAALSQAQWLTP 
VIPTL *EAEAGGSLELRS PRLL* AMI BP 


12854 


26755 


A 


12983 


144 


6 


ILAQ*KTWLSWAHTL*S/TLGGQDGRI 
TGAQE FKTSLGNTRRPCLYK 


12855 


26756 


A 


12984 


341 


509 


h I JjxTME * YTAI E KN E J MS FAGT WJjEIjK 
A\VIL\SNLV*EQKTKMLPCS PLMGWBL 
K 


12856 


26757 


A 


12985 


387 


125 


DLPPTPPKI^PRTCSvAUVliVWuJj/Cs&Ij 
QPQPPGP\SDPPA*AS*IAGTTGTRHHA 
* EMFPF* VYFVQTS EHILG INP PGRHTK 
VAWBV 




Zo75o 


7A 
*\ 




131 


OCA 


urWjvar flic J. Jr rzr\j\ K >Ett\.ir\jr i. tr f rwrtxr ir 

PLGFFFPG/RFFFFLRRSLALSPRKWRD 
LGPLQPPPPPPTPGY 


12858 


26759 


A 


12987 


251 


1 


RMS LLEKRRSNN * EARV/ CGRVT YVATE 
VTKIMIGISMVAHAYNPSTLGGQGRRIA 
SAQEFETSLGNMTNPCLYKKYTHKKSRA 


12859 


26760 


A 


12988 


320 


3 


TPWGVKDFSPPPPPGEGVFPGRAPPPR* 
KFFGNFWKKRGFPNWVRGVLKFWG\KNF 
FPPNLPKSLGLRFKPP3PP*KAPPFFPIj 
FFFFFEMEFRLCCPGWSAVAQS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M N 
etfa 

All 

OU 

■ 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing lo nrst 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
OCysteine, D=Aspartic Acid, 
js^vjiucaniic Acta, r— rnenyiaianine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M^Methionine, 
jn— Asparagine, Jr^jrroiine, 
Q=GIutamine, R=Arglnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-tfnknown, *=Stop codon, /"possible 
nucleotide deletion, \=possi»ie 
nucleotide insertion 


12860 


26761 


A 


12989 


353 


183 


EYGTHILTAAIFSIAKRWAHR*GSLTDB 
P 

* 


12861 


*\ fr* fry 

26162 

> 


A 


12990 

« 


377 


2 


PQKTP PRAPKKKKP FFFFFFFSFFKKNF 
PFFFKGQSKGGFFGPP*PLIFRFK\NPP 
PPPPQKIGFKAWVFSTPPPPIWGFFFFF 


12862 


26763 


A 


12991 


134 


298 


PPNSYCDGSITLIPKSDKNITRKEKYL/ 
P/IYLMTIDAKVFNKVLPNRIQQ* IKRI 
I 


12863 


26764 


A 


4 A Ays 

12992 


892 


171 


QT*SH/SLSRLEYSDTI IAHC5LKLMGS 
SD PPAS ASQVAKPTGHCTTWPQPHTBGF 


12864 


26765 


A 


12993 . 


269 


2 


GLWCFVKQLKLTETCPHFAVPQFPLGIS 
KIPIS ILFFKKLLL/ WPGTVAHTCNPST 
LGGRGGWTT* GQE FGLM FHHWPS KFHHG 
QHGET 


12865 


26766 


A 


12994 


324 


4 


SLYPLLFQAEIYFLVTIHRSSMESCIYF 
FSVWPLDSLVQIHVKI /HIKGIGML*V 
N*QNCFKINISE*L*YIWPSTVADTLGC 
QGEITRAQGFKTS LSNMTKPHLY 


12866 

■ 


26767 

■■ 


A 


12995 

■ 


106 

■ 


531 


WLLNLFAFLNDKGFLVPMNRSRI FSQQK ; 
VKQRS FEGCS SFNDVGS C YVTQAGEQWL 
FTGVIGTL* PGTPG \ S SDPPASASQAAG 
TTGS*HHIQIiNLILYSSG*QTI*LL*TT 
DSTHLIFASLYAKEQLITFYFQATSLQP 
CT 


12867 


2616% 


A 


12996 


1 


367 


NTEPFLNI FFKDKVS/CSIAQAGGQWRL 
YS RCS LKLLGS SNS PTS AS LVAG S KGRH 
H YTRL * LFF YVSRPQD YLPWS P VFFFSF 
FLS FFFW KGS FFLP P S WKARAP I WVNGS 
LPPRVKPIFLA 


12868 


26769 


A 


12997 


3 


379 


YRPSPSETKAVLVFFS FFS FFFIiGTPGG 
KGPPKGAPNPTLTWRGKENFFI * KKKKP 
/RLI*KKGPPPKMGFKRERFWVJjKSKGP 
QFGKGEKG/SPNYPHPRGDPKAPSLWKK 
PQFPGGGGGGAPKKAS 


12869 


26770 


A 


12998 


A 0% S\ 

230 


409 


SCRPKKLIiAFIiKSSTEISQRWKQPIYI/ 
PSTDE* INKMWHIHTMEYYSALKRTKVL 
IHATT 


12870 


26771 


A 

* 


12999 


310 


343 


DWCTII*IICDllNL*IECSKVW*SLKF 

WTT TT WfcTTT^TMTJeTfrniT^lTI/TJT f*TV»t7t/+ 

I K. I LiXijr H VC V WFb VLtfN urlKWiiCJtft-VV w 
KQW / LWPGMVAHTCNPSTVRGGGGR IV* 
GQE FETS LGN I VRP S L YNNKS C I 


12871 


26772 


A 


13000 


390 


289 


IKLCQPRGVKLDSFKRAPPFFFFFETRS 
C rVS * AG VQ WCN/ Lob IjQPUI oSHNN fA 
TS A* LSLI FMCSNS YVCI VK 


12872 


26773 


A 


13001 


1 


289 


GILRQISVNYLPVGRSLAQAGVQRCDLG j 

FIFSREG\GHPVKGASSDSPTFGFSKAG j 
1 TRLS PRALACT 


12873 


26774 


A 


13002 


359 


105 


KQ KKNPS PP PGGGGGQGKPG/ PPKP FPN 
SRDFPPKNQSPFFPKLKKKFFNFY*KAK 
NP*KPNPPIiKKGKKPKGFFSPPPPLKKP 
PNV 


12874 


26775 

■- 


A 


13003 


243 


293 


MNSQFLKRIi\RIESPYGSVISLLGTYST 
EK*KACPCKTCTQMFTSALYIIAKKW\BI 
KCPSTVRW/ IKLWHINTMEFFCQ* KKFH 
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SEQID 
NO: of 

niirtpfttiilp 
u u ucu II uc 


SEQID 
NO: of 

sentience 

- 


M 

#h 

od 

• 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 

niirlpntiffo 
location 
correspond 
inn to first 
amino acid 
residue of 
Deotide 
sequence 


Predict- 
ed end 

UUUG^ 

otide 
location 
corresnon 
ding to 
last amino 
acid 

residue of 

peptide 

seauence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
E=CIlutflmic Acid F=Phenvlf>lnninp 
G=Glvcine, H=Histidine. I=Isoleucine 
K=Lysine, L=Leucine, M=Methionine, 
N=Asnaraeine. P= Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=TrvDtoDhan. Y=Tvroslne. 
X=Un known, *=S>top codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CIDMPQFVHLTVEGHLSPLFCYYI 


12875 


26776 


A 


13004 


465 


307 


SEACFILKLFMFCG*CXXFVC\SILLVF 
YFVFCFVLFCFAYYMQFFNQCLFG 


1 zo /O 


ZO/// 


a 




zv / 




DGS LTML PKLE C / RWLFTGVI MAHCHIiE 
/ RLG SND P PAS AS R VAGTTGVCHHA* L I 
FIFFVKMRSHYPTRP 


iZO/ / 


ZD / /o 


a 




z 


JO 


DTjAIjIiFR L * CNOM I TAHCTCT iIC / R PT5* T.T • 

UUt\U US X\ J— I V_ii"vJl'i X i. ^Vll v-XN XJ I\ / XvCrv> XJXJ 

GSRDPPTMVS*STGITGMSHEAHRPSPS. 
CDF * RSCS WOAWOWHDYS SL 


12878 

1 


26779 


A 


13007 

i 


402 


74 


FGASLLKRHFLFFLS KPLVYNKKIPRPL 
VFLKKT\PPPPGRATPFYYCCFFHKEGP 
d PGVFOTEG * PKG 1 S P PPPOROS TP PFFT . 

GAFKKPKVTTPFFKKNPPKTLKGPPFF 


12879 


26780 


A 


13008 


356 


145 


RPP PFFFFFETVSH PVAHAGVQWHDHS S 
LQL* TP \ S SREP PASAS * VAGTTGVHYIi 
PGSQLPILBHSPRR 


12880 


26781, 


A 


13009 


227 


353 


GWSELT* /SN*AQWLTPVILAL*EAEAG 
GSSBPRSLRSAWATW 


12881 

* 


26782 


A 


tin* r\ 

13010 


249 


32 


N F1J b n b WAW WCA£> V V PCj i v KAK VGoSXiS 
SRSLRLR*AMIVP\CTPAWVAQEPLSLK 
KPPTIETLFPSPQDP^Q 


1 toon 

12882 


26783 


A 


13011 


247 


403 


^nr»/^ppin7DDDwnTiAiri'Vfn/^/ ,, »PT?t'NTr"/^/^ 
1 PUrVstaGKx r r r WGPQKKNRCiCaGFKNGGG 

GKTRGPP * KQPAS S S PFAP P PFFWGAPG 

FSPPPRF\KPPPPVFFtt3PPKKKIFPPP 

PGGLFFFFLGGPPSPFFFFFFFLGGGPL 

KS EGLrGEGGNS VLTRVLL I S S YQ I PGNP 
R 


12883 


26784 


A 


13012 


206 


386 


TREAQTFWFPLYS * KAL/ WLGTVAHT*N 
PSTLGGQGRRITSSQESETSLANMVKPH 

T.T.T.PT 


12884 

* 


26785 


A 


13013 


354 


31 


YKMNI WDI IH I *HT \ Y I P WP KGKERKR 
KKLEEYL/QDIMTKNFLNLMKNMNIYFQ 
EVQQTPSRINSQTSTPRHI I I KLSKYKT 
GREQWILSVMLALWEAEWGPPEVRS 


12885 


26786 


A 


13014 


1 


419 


EETSFGKARCLPGYPGFHPRRLRSRSPL 

T . C *MPT. QTTPfV3T3 QHPT3 QM Q G Q D D1TT 

STCSCS P PAAGCTRGRTTGRPFWTASGP 
AELDWASGS S V/EAIiTSGLPAR'GLWGQY 
AGLGVPPRLLGPMTS CCMSLP FPS PARP 


IZOOO 


OA*7C7 
ZO /5/ 


A 
n 




mo 




ST*LRLHPPPPHMCF/PD*SLKSN*Y*H 
YLVNP \ I WPMLTLLFVCLFVCFRDRVSL 
CCP(W'3AVVPSRI.TAASTTjWVOVT1\PP\ 

RLAN* IFFLRWGLAMLPRLV*NSWAGDI 
LPP*PPKVLGLQA 


12887 


26788 


A 


13016 


149 


422 


LKRI FFLKWFTHDTAPGDPDNMCP YSC 
F*NQI I KSGLFFFKKPHL/ WLGEGAYPC 
NPTLLGGQGGRIT* FQEFETTPANMAKP 
LLYKKYKN 


12888 


26789 


A 


13018 


362 


377 


QS PGYPERVSKLPT I TQRGMWMHQPSFV 
CTSL*NV* TGWAQWLTPVIPAF* BD\AL 
AGGLLEVKNLRPAWST 


12889 


26790 


A 


13019 


205 


427 


I QTGGPPARPFLWPWAPGKK* RVPPEGG 
P PKKKKKTPPPHPGG EREGS / PFSKKKK 
KKTPKKRAGVFFPGGPRPKEGPGWGPPR 
LDSKPVGAPPEPGRIjGNGPGKIFFGPGP 
FFPLQPQKGRP* PGGEGGSFFFFLGVPP 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
peptide 
sequence 


M, 
etn 
od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid ■ 
residue of 
peptide 
sequence 


Amino acid sequence (A~Alinine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=Glycine, KNHistidine, I=IsoIeucine, 
K=Lysine, L=Leu cine, M=Meth ion ine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine* S^Serine, 
T=Threonine, V=Valine, 
W<Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LGEPSIFCLDLPENA 


12890 


26791 


A 


13020 


248 


358 


SRCGGEAQ WLL P V/ 1 / PAF* KAEAGQS F 
EPKSLRPAWAT 


12891 


26792 


A 


13021 

■ 


2 


434 


ITEAQSDTQE IHHTLYTHLCVYWRF FTK 
KAQVKKKVWWHM* PQLLGRLEMGGS LEP 
RSLRLQGAMTLPVTSTT\QAWAT 


12892 


. 26793 


A 


13022 


162 


410 


LTQKTAASQI ILSFSSTSLEKI PISSES 
GGGGSVSVTQAAEQWRYHSLIiQP*TPGD ! 
KRSSHLGIPHP\SSWDYRISKDWVMPV 


12893 


26794 

* 


A 


13023 


250 


40 


L I KLNI FSCS LAI WTS S CMTS LFKYFAH 
FSFF* IDFYSSYYILDTNPFFRN/GITN 
ILSSSVTCSFILFW 


12894 


26795 


A 


13025 


264 


1 


VSSTKHGKLRLEPEY*CFIHH*EBKVFE 
SYFEY / PCQTFLRKETVRPGAVAYACNP 
STLGGQCRWI I * EQEFETSLTCIAAALR 
VTLR 


12895 


26796 


A 


13026 


110 


523 


CIDSSSWTQTDRCKEDRLCPGTTGTCAP 
GLLFLSAPPLGPF* PSQAHPSFHVSLSG 
PSPGLRPQLDIPLHPSTSIAPRPLPGPA 
S\LPRLEPHPSLPSPTGACPFLPPCLGE 
QRHWPLPTSLLWLGLDGHCLATPSVS 


12896 


26797 

■ 


A 


13027 


52 


53 


CPTLLQLTLC/LPPQASAMVDAPP*ARP 
LPPSSILACCTSGEQGSVWGLAEPVKG 
YNLLVCRLLRPLE KCGI WVAVSRFSL YS 
LSWLPLARKG\NPLTACASWVR 


12897 


26798 


A 


13028 


262 


381 


HFGRLRQVERLS PGVRDSLCNKVKP * Q I 
IYTF* FSVSLS VINTYLKGPSTVAHTYN 
PSTSGG*GKWIA*AQEFE/TSLCNKVKP 
CLY 


12898 

■ 


26799 

■ 


A 


13030 

* 


1 


846 


FRP I QAGRAP WALTIjPCMGFTEGSPAIiIj 
SHS S RCD I QMGCNTHAVDDS TGEGP FHK 
GRMRESCWLRTPRCGLYTKGVGGPGMEG 
RERRSRSRRQHMPGRGHSTMASWSLRQC 
MRLKLQWASDLLSQLEMQIPSLPPGESG 
SLGLTGPRSLH/ CEQTIGLSPTPGESWS 
EELG \ GG I VGGGRGTVAGRGGHREAGNK 
PSGTWGRQCCASGDKAMS I *LTSLY\AR 
RGEPGPGRPVRKWLRESSQNRIGGGEGR 
WE *GQRRSREGRECP VGWGGRKKPLRWG 
LMWS 


12899 


26800 


A 


13032 


308 


1 


KHYHDFWCWIILALSDAQSLDHTCV\CV 
CVCVCTCVCVLRDQGSHSCL*CEG*LVP 
HFVPGPVLCLLSPVIYVCFICSLTCKVE 
SLSNFFKKLNKALLRHCI 


12900 

■ 


26801 


A 

• 


13033 


763 

* ■ 


244 

• 


FLGPRI IGLRHE I S VETQDHKSAVRGNN 
THDNYENVEAGPP KAKGKTDKEL YENTG 
QSNFEEHIYGNETSSDYYNFQKPRPSEV 
PQDEDIYILPDSY*L/CQNIDFCYWMIN 
IHCNFSTAKTRNQTKC*STVDWIKKMWY 
T YT I E Y YAAVKK/ DTKLTWEQ K I KYH I F 
SLKSGS 


12901 


26802 


A 


13034 


326 


58 


ehyko^ycqkifqkl*/hl*skkhvyaS 
dkyykykdgwpllvsqi ilytfytrgk 
p*sscsnfvqnqi iyigqksckcnkfgn 

TFFKKP 


12902 


26803 


A 


13035 


199 


3 


RFI YNHKVCI TTFLPHFYF * KL YL / WLG 
TVAHAWNTSTLGG*GGRIT*GQELETGL 
DNTARPHLHTV 
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SEQID 
NO: of 
nucleotide 
sequence 

* 


SEQID 
NO: of 
peptide 
sequence 


■ M 
od 


SEQID 
NO: in 

T1CCM 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucie- 
otide 
location 
correspon 
ding to 
last ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
fci-oiutamic Acid, F=rbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
I -Threonine, V=Valine, 
W^ryptophan, Y=Tyroslne, 
X=Unknown, *=3top codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12903 


26804 


A 


13036 


94 ' 


448 


gs cats mafkdtrktave pgvwhg iri 
kple kvcadl itgkkfknl/ kvkgpvcm 
ptktlkittrktpcgesska\wdgfqmr 
iqk*ltnlhspskivkqiisisgapgid 

SEVTNADG 


12904 

i 


26805 


A 


13038 


964 

• 


1717 


hklscvkdlkdfmrqage\vtfa\dahr 
pk\ lnegwe fas ygdlknai e kls g k\ 
eingkninlikgskXrpsrsrsrssdpg 
tqksplgprsrspspwrkp*qpvqear 
srepgarsqtfpvpvk*gpppcpekepqk 
\rgslqskilsppsilwirpevsgprsk 

VPRSVDSGN 


12905 


26806 


A 


13039 


2 


725 


SLLFS PVLPLFLFLPLAYILQNLKPLQF 
SPDLKSKHLIFFCNAA*PQYKLDRSSK+ 
PENGTLNFSILQDLNNSCRKIGKWSEVP 
EVQAFF\SHQSLPSLCAQCNS/CPNLPS 
FPP\PVPSVPTPSVTESF*SSFSTDPSD 
LSLPPQALLLARPS*VPILPQPPLLHPI 
ILLSPPLLTPRPAYSFIP*L/CPPPPAQ 
QFTLKKVAGAKGIVKVNAPFSLSQIR+H 
LGSFSSNIKIQPSSWLVWQQP 


12906 


26807 


A 


13040 

■ 


1131 


2 


DGVSLLSPRIiECSGTISAHCNIiHLLCLN 

DSPASAS* VAGITGPATTPPLI FVYFFF . 

LVBTGFHHVGQSGI*ELPTD\SDEPPASD 

SQSVRITGVSH/AYLACSQF/CTRC*FP 

II*EQCVNMNINSSK*IFILISYLYRWM 

NKDPI/VIIERLLVRMPEGQSHAAW*NK 

TNKQT* I LELVLSVTSCGVSVRFNLSIP 

*VFPSTK*EGWMRCVNPSTLT*FIPQVA 

*TCQEHGLMKIQT*NPGLDPS IVSDHQT 

Q* E PS VKQVIjD I SHDKLPMS LEELYHCR 

HTIiASGKQISGVPRTIFLNHFHGFYFIY 

/ CIFEMESCSV/ SQAG/VQWRDLSLLQA 

RS/PGSR/RFSCLS/LPVQHPPPGPANF 

\CIFSGDGVSPC*PGWSQSPDLVIR 


12907 


26808 


A 


13041 


97 


626 


I CFTQVKVIjNESQVQWCMP WPATWEAG 
AGGVLBPGS*EL*/CPVNSHCPPAWAT 


12908 


26809 


A 


13042 


2 


431 


HDPDTHSQIKVIKIiMQSEFCWGWAW*I*T . 
PVIPALWEAE/AGGSPEVRSS 


12909 

• 

- 


26810 

• 


A 

• 


13043 

• 

• 


405 

• 


1319 

■ 


RYRKKLQNKPLWLH/CLLLLPNSATR* T 
LQK*SK*ISPI»ESSA**KEPPF1jSVLSV 

tsvkt/saipmpphwenvntqvpyqlip . 

lhnqtheynevanlfgktmdrnrikriq 

riqnldlweffcrkkaqlkkkrgvpqin 

eqmlfhgtssefveaicihnfdwringi 

hgavfgkgtyfardaayssrfckddikh 

GNTFQIHGVSLQQRHLFRTYKSMFLARV 
LIGDYINGDSKYMRPPSKDGSYVNLYDS 

P\rnTVT*WWOWT W^f\rPTM\K[OTVT>1?VT THOU 

C v UiJ x WN trill c V V r DANy X lira i Li 1 Ur il 
* FHFQ IS WKEALFFFAGRFALQS SSH 


12910 


26811 

* 


A 


13044 

* 


23 


450 


RSRAAAI QLGQRLRLPGPDS PRLTGRQL 
E PGTGRG SRAKRG PCS I FRFRRETPPQR 
EGGEALGATSRQAPPPCPPRQPWELDAS 
AKAAAVAP PKKVG F KAVS RD PRAEALGG 
KGSP\GGKGSLKGVAHR*GGIiGPIi*GAP" 
EAI 


129U 


26812 


A • 


13045 


172 


16 


AHVS PE VDKTKFW \ LGPVAHI CNSSTLG 
GRGGQIT/R/GQEFETNLTNMEKPHLY 
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SEQID' 
NO: of 
nucleotide 
sequence 1 

'i 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ins to first 

*Mff All ft^V 

.amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 

otide 
location 

ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Phenvlfllanin» 
G=Glycine, H=Histidine, Msoleurine, 
K=Lysine, L^Leucine, M-Methionine, 

Q=Glutamine« R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


12912 


26813 


A 


13046 


181 


9 


YKWG P AML P RL KCRA I \ I G SNWAHCRtiD 

LPGS SDP PS 6 ASQVAOTTGTCHIiGARGG 
MV 


12913 . 

- 


26814 

* 

< 

■ 


A 

■ 


13048 

■ 

■ 


2 


3507 

■ 


YVRVSLPPPPPAAGRPGAAVADDAREEE . 

EEAAPPPPPPPPPRLAAARPPGSQPRPP 

AAGEAQAAADMNHQQQQQQQKAGEQQLS 

EPEDMEMEAGDTDDPPRITQNPVINGNV 

ALSDGHNTAEEDMEDDTSWRSEATFQFT 

VERFSRLSESVLSPPCFVRNLPWKIMVM 

PRFYPDRPHQK\SVGFFLQ\CNAESDST 

SWSCHAQAVLKI INYRDDEKSPSRRISH 

LFFHKENDWGFSNFMAWSEVTDPBKGFI 

DDDKVTFEVFVQADAPHGVAWDSKKHTG 

YVGL KNQGAT CYMNS LLQTLFFTNQLRK 

AVYMMPTEGDDSSKSVPLALQRVFYELQ 

HSDKP VGTKKLTKS FGWETliDSFMQHDV 

QELCRVLLDNVENKMKGTCVEGTIPKLF 

RGKMVS Y IQCKEVDYRS DRREDYYDIQL 

S I KGKKNI FESFVDYVAVEQLDGDNKYD 

AGEHGLQEAEKGVKFLTLPPVLHIiQLMR 

FMYDPQTDQN2KINDRFBFPEQLPLDEF . 

LQKTDP KDPANY I LHAVLVHS GDNHGGH 

YWYLNPKGDG KWCKFDDDWSRCTKEE 

AIEHNYGGHDDDLSVRHCTNAYMLVYIR 

ESKLSSVLQAVTDHDI PQQLVERLQEEK 

RIEAQKRKERQEAHLYMQVQIVAEDQFC 

GEQGNDMYDEEKVKYTVFKVLKNSSLAE 

FVQSLSQTMGFPQDQIRLWPMQARSNGT 

KRPAMLDNE ADGNKTMI ELSDNENP WT I 

FLETVDPELAASGATLPKFDKDHDVMLF 

LKMYDPKTRSLNYCGHIYTPISCKIRDL 

LP VMCDRAGF I QDTSLI LYEEVKPNLTE 

RIQDYDVSLDKALDELMDGDI IVFQKDD 

PENDNS ELPTAKEYFRDLYHRVDVI FCD 

KTI PNDPGFVVTLSNRMNYFQVAKTVAQ 

KLIN I UrMLdjQr r ICbQGiIuXjPGNPIjRHN 

YEGTLRDLLQFFKPRQPKKLYYQQLKMK 

ITDFENRRSFKCIWLNSQFREEEITLYP 

DKHGCVRDLLEECKKAVELGEKASGKLR 

LLE I VS YKI IGVHQEDELLECLS PATSR 

TFRIEEIPLDQVDIDKENEMLVTVAHFH 

QSLLDIQEKEFEKFKFAIVMMGRHQYIN 
EDEYEVNLKDFEPQPGNMSHPRPWT GTiP 


12914 


26S15 


A 


13049 


30 


477 


VRAEHCAVWERNFEETVRWTSVKFLMTS 
PEIASLSWGQMKVKGSNTTYKDCKVWPG 
GSRTWDWRETGTEHS PGVQPAAC\DVKB 
WEKG VQTLVIGRGMS EALKVPS STVE Y 
LKKHGI DMRVLQTEQAVKEYNALVAQGV 
RVGGVFHSTC 


12915 


26816 


A 


13050 


179 


389 


NIETIQSMFPI/DNEMKLEINLKKAIWG 
IHKYASQVLWFMP I FPTLWEAKVGBFLE 
PRSSRSVWETWRDPI 


12916 


26817 


A 


13052 


466 


1489 


PGKGEMRTRS PSPLAI VPRPQRASRPLL 
CAVS PMASASGATAKHEQI LVLDPPI DL 
KFKGPFTDWTTNLKLRNPSDRKVCFKV 
KTTVPERYCVRPNSGIIDPGSTVTVSVM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

«r 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
cor res pon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W»Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 




■ 


• 




■ 


* 


LQPFDYDPNEKSKHKFMV/QIFAPLNTS 
DMEAVWNEAKPHELMDSKWR/ CPNENDK 
LNDME PSKAVPLNAS KQDGPTPQPHSAS 
LHDTETRKLTEECKRLQGEMMKLSEENQ 
HLRDEGLRLRKVAYS DKPGS TSTAS FRD 
VTSLLPSLLWIAAI FTGFLFFFWRRSL 
TLVAQAGVQRRDDGSLQPPSPGFKQFSH 
LSLPSSWDYRRPPPHPANFCIFSREGVS 
PCWPGWS 


12917 


26818 

* 

* 


A 


13053 

i 

■ 


892 

• 


189 

■ 


SYDPGALGCRPAPRSGRGKGSGATRDMH 
PMSRERAKFVKSGL\YCKTTRELMLHAR 
CCLNQKGTILGLDLQNCSLEDPGPNFHQ 
AHTTVT IDLQANPLKGDLANTFRGFTQL 
QTIiILPQHVNCPGGINAWNTITSYIDNQ 
I CQGQKNLCNNTGDPEMCPENGSCVPDG 
PGLLQCVCADGFH/G I QVYAPGLVLTAY 
VLRDSGSHHS IRLHSALGDPAPKSQDFM 
NYIGLTIDLRSI 


12918 


26819 


A 


13054 


423 


12 


SFNQKNPLRQKCSLRPTRPASQSLATWP 
GQSTRFWEQAIiPWALWDPQSKRAE\MQIi 
E PPHAHTWTHAHTCGHTGA\ FCSAHTEV 
YVHLHSPVHAHTHAHITHSPPVHTCGGRL 
HAHTH\PQTPSPPGtiLSLLAVARPQS 


12919 


26820 


A 


13055 


16 


349 


RRSGKNDPWTDQSPRAAASHTRCPHPAV 
AAAAMPKRKTEGDAKGDKAKVKDESQKT 
S /ARMS S KSAAS KAYAKP I KAPAMNGEN 
VPNG I QRKLEAFYDLQ ACRHLTRL. IDFA 


12920 


26821 


C 


13056 


716 


594 


MMQTETGVMPPQARQCLEPPGAGTGRKD 
SPLYPSEQSWPL* 


12921 


26822 


A 


13057 


353 

> 


2 


IGNPGKTTFNKPPPPKKFIFSKFFMFLF 
FLVKTRS CYVAQAGLQTPG \ S SDPSASA 
NQSAGITGVSHHTQSEIIjKYTVFCLTFQ 
TPEYTIPSTWSIPCCSHFSCYLCFLFAC 
FLRQSF 


12922 


26823 


A 


13058 


864 


529 


EVSTSKTAGCRGCALVISMLCIIFPQGL 
FTRAIiLRTCSALLHADWPE I SFVWWF 
FKTESCSVAQDGVQWCDLRSLKPPPP\G 
SSDSPASASPVAKITGMHHHARHGLKS 


12923 

* 

• 


26824 


A 


13059 

■ 


2714 

■ 


1376 


GRYDGLVEQLGGRATPAVGFAMGLERLV 
LLVQAVNPEFKADPWDIYLVASGADTQ 
SAAMALAERLRDELPGVKLMTNHGGGNF 
KKQFARADKWGARVAVVLGE SEVANGTA 
WKDLRSALAVGVILGVGALIGWRYWNS 
HQVDSARSASLAYQNAVTAVSEGKPDS I 
PAAE KFAAENKNTYGALAS LELAQQFVD 
KNELEKAAAQLQQGLADTSDENLKAVTN 
LRLARVQVQLKQADAALKTLD/TPIKGE 
GWAAIVADLRGEALLSKGDKQGLLSVTIj 
LSGCSLFNSEEDWKMSPLPTVENQFTP 
TTAWS TSDGSGIGNFYSNIJiPAIjADNVV 
YAADRAGLVKALNADDGKE I WSVSLTE K 
DGWFS KEPALLS CRVT SWJPNNQTEKFN 
KQRTEGPAVAEPEQAGVRRHYLYPFTGG 
YRRRRAACQAERPAARCATEEHGSRN 


12924 


26825 


A 


13060 


99 


254 


RNHHLVAFKELSALKVKRVDWAPWLMPV 
I PAF/LEAKVGRS LEPRS S RPAWAT 


12925 


26826 


A 


13062 


127 


386 


FS Y Y YYFFFL FFLCVFV IPGEPFLKMKIi 
GKPKTIMLKEKKGGESYSELFFETDQME 
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Dentide 
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M 
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od 
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nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glyci ne, H=HJstfdine, I^Isoleudne, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S^Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \~possible 
nucleotide insertion 














WL/TPV I PTLWEAEVGGS FEHRSSR 


12926 

• 


26827 


A 


13063 


1 


209 


GKYILLLKVFCCFLRQGLCHIiAHCSLRI, 
PGSSDSRASASQVAG/ IS ITGVHHHARL 
NFVFCFFFLVCFCF . 


12927 


26828 


A 


, 13064 


132 


1 


SHAM CCNYLKRFG WAR WLMPV/ IPALWE 
ABAGRS PEVRS SRSAL 


12928 


, 26829 


A 


13065 


269 


134 


TLWSVFASS/WAWWIiTPVILAPWBAKAG j 
GSPEVRSLRICKQKVLS | 


12929 


26830 


A 


13066 


853 


461 


RLWRSVSQFLCACVHMCVPSVCVCXSIiAH 
AGTYAVSTCVH/MCVCV CMCARITLAKI 
CSETSGNLLFRKI ISLLDLFTLNFS I KY 
KVQLVLKSRSLAWWGR I KLLTLSRPFSD 
VNTFERINYWLINDISIIR 


12930 

• 


26831 

- 


A 

• 


13067 

• 


57 

- 

• 


2066 

■ 


AQPTGRQTS TVAPTPATARSRGGRVP I F 
PCPHPAPTLLSTSRWTPASPAASMKAIi 
RLS AS AL FCLLL INGLGAAP PGRPEAQP 
PPLS SEHKEP VAGDAVPGPKDGSAPEVR 
GARNSEPQDEGELFQGVDPRALAAVLLQ 
ALDRPAS P PAP SGSQQG P EE EAABALLT 
ETVRSQTHSLPAAGEP\EPAAPPRPQTP 
ENGPEASDPSEELEALASLLQEIiRDFSP 
SSAKRQQETAAAETETRTHTLTRVNLES 
PGPER VWRAS WGE FQARVPERAPLP PPA 
PSQFQARMPDSGPLPETHKFGEGVSSPK 
THLGEALAPLSKAYQGVAAPFPKARRPE 
SALLGGSEAGERLLQQGLAQVEAGRRQA 
EATRQAAAQEERIiADLASDLLLQYLIiQG 
d APORGIjGGRGIjOEAAEERE s AREEEEA 

eqerrggeervgeedeeaaeaaeaeade 
aerarqnallfaeeedgeagaedkrsqe 
etpghrrkeaegteeggeeeddeemdpq 
tidslielstklhlpaddwsiieevee 
krnrkkkap p e p vp p praap apthvrs p 

QPPPPAPAPARDELPDWNEVLPPWDREE 
DEVYPPGPYHPFPNYIRPRTTiQPPSALR 
RRHYHHALPPSRHYPGREAQARHAQQEE 
AEAE ERRLQE QEELENY I EHVLLRRP 


12931 


26832 


A 


13068 


282 


53 


PGFPQGKFRFKKKILPQDYPFQGAPNSK 
ARPGQGVP PG I PA\ LWR VKKGGPLRS GG 
PGP PGAKGGTP FLP I PNTKKS 


12932 


26833 


A 


13069 


177 


2 


VSLCRPGWMECSGVISAHCNLRLPGSSN 
S / PAliASRVAG I TGARHHALLNSLFKVL 
SRA 


12933 

• 


26834 


A 


13070 

■ 


789 


302 

■ 


PHPEFYYSFQYFIYFFISFHFTRHFIVH 
YFISSHFISCHLASSSSSSSSSSSSSSS 
SSPPAFHYS FHHFIS FHLAIHY\HFI IS 
FHLFHLFS S FHS I S FRDRDLL YCPGWSV 
PPHPAFHYSFHH/VHVMSRHFILRQTRS 
IAQAG IQWRDLCSLQPEVGSSRSA 


12934 


26835 


A 


13071 


193 


2 


P P YLKSHSQVDLS PGVQDQPGQHRETPS 
/LTKN/TKVSWVWWCTPVIPARWEVEVR 
ESLEPRCSTRA 


12935 


26836 


A 


13072 


122 


1495 


LLSDFFFFBTE / SRS IAQACMQWCYLSS 
LQSLPPGFKRFSCLSLPSSWDDRCPPPC 
LANYC I FSRDRVL PCWPGWS RTPDLR 


12936 


26837 


A 


13073 


178 


347 

■ 


CCliCNDNSVLLCMYYLLVCMHACMHALY 
IKTWK/MQLGAVSHACNPSTLGGQGEWI 
T 
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sequence 
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SEQ ID 
NO: of 

|IC|1 LIUv 

sequence 

■ 


M 
od 


SEQID 
NO: io 
USSN 

09/515,1 
26 


Predicted 
beginning 

niiplpflfifip 

U U UCUUUG 

location 

ing to first 
amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

nurlfv 

otide 

lnrntton 

1 VWali wis 

correspon 
din? to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
Er=Glutamic Acid. F^=Phenvlalanine. 
G=Glycine, H=Histidine, I=Isoieucine, 
K=Lvsine I^Lcucine- M^IVlethioninfi. 
N=Asparagine, P=Proline, 
0=GIutamine. R~Art*inine. S^Serine* 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, . 
X=Un known, *=£top codon, /-possible 
nucleotide deletion, \= possible 

nudpntide inqprtinn 


12937 


26838 


A 


13074 


165 


34 


PILSFPFETVSYFVAQAGVQW\I*IIAHC 
SLDLPRSSDSPTSAP 


12938 


26839 


A 


13075 


205 


105 


GGW VQWFMS IIP / AI WEAKAGGLLBPRN 
SRPVWAT 


12939 


26840 


A 


13076 


3 

* 


357 


HERE YTQAEKRKF IAFPFFKMGS CS VI Q 

LIFVFSRTDSRLPSSPRSPLTSVISLI I 
PLSL I PTTYLTTPSRPS ST8 P P SHPS PR 
APLTSDP 


12940 


26841 


A 


13077 

i 


3 


477 


STPALTARGLSGGSRASVAAMVLLESEQ 
FLTELTRLFQKCRTSGSVYI TLKKYDGR 

KKISTWSSKEVNKFQMAYSNLIiRANMD 
GLKKRDKKNKTKKTKAGSNSS / attaap 
AAAATAATTAATTAATAAQ 


12941 


26842 


A 


13078 


1077 


1566 


RQVTSLDPLKTFPRTSLEFTSPTGQLDS 
taftlAr i. VvWFouArl FKvaKKoAAriUUW/ 
YPVGQR£\RMAGLSPRSFCWWEVSFSR/ 
SHSC/ 1 CWVQLGKSAVLPQQSRGPGAHE 
GSRRRGGSSPPCPPLQTQLVFSPQELSM 
SAP IHRLS VTLQGD C I STGGAPSRCRIiA 


\2942 


26843 


A 


13079 


3 


396 

• 


GKLQLRYQEISKRTQPTTKLPEGPSHKL 
FNN YY \ CTRDGRRES VPPFI 1 MS SHKAL 
VSRMPAYS FAEAATDKKAVTPAP S 1 YRG 
EQSSDHPYLRHCTLTVTRLL 


12943 


<\£Q A A 

26844 


A 


13080 


467 


109 


CT Ara\T\TG OUT .V t? D C PD VT . T PNT17VIT A7DTTI? 

QEFWSKEAGSIiFSPPVPPYCWTPSIVFS 
SSFF FLFRCVCFFFFFFCF FFFF F\ LLF 
FGTKKALYLLRARGCKQYKNQKLIWYLT 

~ 1 tojjjj V A. 


12944 


26845 


A 


13081 


458 


51 


VSLFCTVAPLLIiLPEGIFFSCSLLEVKA 
PMRDVSGVFLTLFLYSVLFFPLGVFFFL 
FPFTi T T iPPPKT PPPPPVPVAIjPFPQPAP 

OAVFLF\SLWSSVLPGVGFFTACSFFFF 
FFFF FFF FLRQS FALVAQARMQWR 


12945 


26846 


A 


13082 


23 


327 


RSDSEGGRRGHKYI FI PI YTHTHTYIFI 
YTOT YT CUIT YACLS / LICTHI FI S IYI 

At, Am \^JL A, JL Sr _f r* I im\ r 1 fyT ^ MmAAm\~ A» fiX & ^ *&Am Am Am 

HTYIYISIYAHIYLYIHTHIYLHIYTHT 
YIYIYLHIRIYSSWVCKGS 








1 JUOJ 




ID J 


KKLATR Y YS AI KKN55VT.TH /VSTWMRIiKN 
ITLSLKTQSQRHLYYIIPSVRNVQNRQV 
HKYRS RLVDS ED 


12947 


26848 


A 


13084 


121 


417 

* 


DYDYYCYYYPKIYLTKLLYFY\VGNYIY 
1TIIYVRHVYICETYMPVYTHIHTYICIY 
I YI WKETTWLFS CGI SKTYRTHDIAYTY 
L VFCLTHVT FHVS S T 






/% 




2Aa\ 


1*7 


LSSRTjPMIiTjETFCE ITS KIIjPSRKIj/IjPY 
GWCFKSFKITWVSQIRNRL\AQWLTTVI 

PQLWEA\ sgllkprsrrpawttw . 


12949 


26850 


A 


13086 


274 


340 


GGEKKTPGGFLEKKSFFGGGIFGPPP/P 
TKRGVFPPS PKKFFFPPKTKNPGRGGGP 
KIPPPKKDFFSKNPPGVFFSPP/YKKKK 
IIFPPPVKLGPPKDFtiKSPPPFFFFFYF 
FFFFFFSSQEVNLKALSSTSEI IFFYLV 


12950 


26851 


A 


13087 


241 


405 


PPPS KIH I GNFRKMS D VKNATGWAWWLT 
PVIPARWEAEAG\VSLRPRNSRPAWAA 


12951 


26852 


A 


13088 


537 


375 


KHSLTLSLRIjECRGT I LAHCS / L/ CTSL 
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peptide 
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Amino acid sequence (A a Alanine 
C=Cysteine, D=Aspartic Acid, 

Ei— \JIUUtIllll> /Villi) r — X UcO jlaiUDIOCf 

G=Glycine, H==Histidine, Hso leucine, 

K—l .vein a. I ^1 AitrtrtA IVfsslMAth IahIha 

N=As para gin e, P=Proline, 
Q=Glutamine, R=Arginine, S-»Serine, 
T«Tbreonine, V-Valine, 

W=Trvntnnhnn V=Tvrrwiln<» 
tt - 'i rjrpiupnan, l — ijruslDc, 

X=Un known, *=Stop codon, /^possible 

nucleotide deletion, \=possible 

iitidcuuut? iriDcrnon 








• 






. LGSSDLSLQPQQIAGTTGACHHTHLBKI 

* 

F 




26833 


i n 




445 




wiJb vr r o oUAr rtrrW^yorif rrr r f Ju 
BTGSRS VS KAGVQLHHGS LQPPPPGFER 
SSHFS VTTI /TYLYHHRLALHVFELHIN 
GIMHR 


12953 


, 26854 


A 


13090 


365 


191 


GQEREKEREREKRRQKKREVE/REEBKE 
RVRDLE S ESNRALERERS AL VDRAPLSR 


12954 


26855 


A 


13091 


148 


316 


DKl^ESCSIAQTGVQWHD\LG\YIiQPSPL 
GFKRFCLSLSSSWDYRWETSTEPPQQAV 

F. 


12955 

* 

* 


26856 


A 

• 


13092 


712 


229 


FVAWVKMAEYLASIFGTEKDKVNCSFYF 
KIOVCRHGDRCSRLHNKPTFSQBVFTEL 
QE KYGE I EEMNVCDNLGDHL VGNVYVKF 
RREEDGERAVAELSNRWFNGQAVH/GBC 
TRGGFCNFMHLRP I SQNLQRQLYGRGPR 
RRSPPRFHTGHHPRERNHRVFP 


12956 


26857 


A 


13093 


835 


405 


ELIERLGPNQKPPILMWKPMPSPSNMKA 
S AALLCLLLTAAAFS PQGLAQP VGINTS 
TTCCYRFINKKI PKQRLES YRRTTSSHC 
PREAVI FKTKLDK\ E I CADPTQKWVQDF 
MKHLDKKTQ\TPKLLNIHDWNLENQAMT 
LRKPN 


12957 


26858 


A 


13094 


1137 


424 


LCPSHFAPTTLTQPGAHKNMCCIKSRFK 
RDLGLCRTCLVNKMFTSS ILGKSHRHSL 
VSINQGNNALWKAAG\PLSWKAGYC\QG 
FSPCDSLKYG\SWDEKDLTVPQPDTHKG 
S VLR w I S KRG KP LA VE I EGRATG LPGIiA 
PWGTE \ CLGYKTP I V\HLFNS EMG\ ENR 
PYGGEARHVCSNAALLFFTPLRCLGGEK 
HKSGLRAHPVIVLSLELNYDIDSFAHMF 
FADLLLI ITLLSCYIPFC 


12958 


26859 


A 


13095 


1 


324 


ARGERERERERERERERBKEJRERERERE 
RERERERERERERERERERERERERERE 
RERERP/RPRVLFFVRGREGSHPSKCEN 

7VyDlDT?CDliT PITT g TPTD/lnWO TTTPTV 

CVFVIGAHRVCRSVYIYI 


12959 


26860 


A 


13096 


508 


840 


DGVS LCRPGRTADCSGAI SAHCKLRFPG 
SRQSPGLSLPSSWDYRRLKPMRPANFFF 

PPRLGLPKCW\DYRREATAPGQELLLNR 
TGM 




/pool 


■tx. 


1 "2 AQ*7 




OKA 


Wiry ±J\0\~Lt vrtyiHj vyn sJJXIOiDIjV^W * Jrta 

\SSHPPASASQVARITGMHHCAWLHLII 
LLRAHGS S PLC 




ZOo02 


n 


lJU9o 




an 

07 . 


rjry^Ti T5T T*IfnE'irT.T. , DVT?DDDP T CDTTfOTil? 
latAjiHirJjljJVv7CtX\JjJj±' JSJ^rFJrKJLo Jr J. vbnf 

FAWGPPPGYCFLKKGGPFF\VFPPSFFG 
KKKNS P P PPH INGGAKTFPNP PLLFSSF 
FFF 


12962 


26863 


A 


13099 


253 


444 


EGAFSLGTFYFIFFLVLSNTFLLSIKNQ 
QI WNKJCTVW\ PS PFLPTLIALTTLLLPI 
SPFILIIL 


12963 


26864 


A 


13100 


341 


590 


PMASSQICIGRIPPFLYLKWYFWPGKW 
HTCNPKTLGGRGALITQGQKFENSLAN\ 
MAKPHLY 


12964 


26865 


A 


13101 


55 


184 


RLREPTPLYSE/HAPTKYYRMAHHNYPH 
SPPIPQPRHHYRVFLL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 

■ 


SEQID 
NO: in 
USSN 
09/515,1 
26 • 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

• 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pnenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=*Asparagine, PHProline, 
Q=Glutamine, R=Argmine, S=Serlne, 
T^Threonlne, V=Valine, 
W=Tryptopban, Y=NTyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, ^possible 
nucleotide insertion 


12965 


26866 


A 


13102 


433 


762 


RNLLNCKRGLERKGCRAS WPAVPLFLQR 
RVWPRPTABLNEACMI^QHQNCQNVKM^ 
YFQLSFTKRQEI/WLGWAHACNPSTLG 
G\QGGGIACSQEFKTIPSSKVRPCFYLK 


12966 


26867 


A 


13103 


349 


250 


DLFFSTKPPPPPPPPKNIaSPP \GPFGPP 
FQEG 


12967 


26868 


A 


13104 


3 


236 


YPQHYPS CPTFQ xTLKTKNHNFLIFFBV 
GSCSWQARVQWC3iHGSLQPP/NSSDPP 
ASAFQVAGTPGVSHRAPPKNQNF 


12968 

• 


26869 


A 


13106 


154 


4 


KKVKNTVHYNGKYFLKSQNQSTFLKEIL 
/WLGGVAHTCNPSTLGGRSGRIT 


12969 


26870 


A 


13107 


391 


150 


R PPKKPNKKY S Q I TS KG I KMVYTRKYLT 
QECKKAMSQALWY/V/HAVPATQEAEVGG 
S CE PGS S RPRCTMI M PMNTHCAPAWAT 


12970 


26871 


A 


13108 


90 


252 


KVGDGGRCLVFC FYFKRQGFTVAQAGVQ 
W \ AT I AHCS LKLLGS S DPQNYEDSWGT 


12971 


26872 


A 


13109 


197 


414 


LYEF I KITVLLGLGCPIiGRYGGSNQRFF 
SLEMKFHTVAWARVQWHDLDSLQPLPPR 
PNRFSCLR/YPQSSWDYR 


12972 

V 


26873 


A 


13110 


2 


317 

• 


GRVGANVHKGHRQRTYGSVI PHI LPLHV 
LKKTFSLRDFHFSV$LKX\NLVLTCLKL 
FL/GVRTPRNDPFVSMMLLFTAFLDRPS 
TILGTGLLYTEGLTVAbRLAYLR 


12973 


26874 


A 


13111 


413 


162 


LILLPQPAECLQLQASAATLDCLGLPRC 
RDCRRS LVHS VLNGAQAGVQWRDLGSLQ 
PPPPS \TCLGLPKYRDCSLCPAATPSGK 


12974 


26875 


A 


13112 


40 


296 


ESRLSALYIYHICVCVYNREHLLYGIHM 
/ S IHT YTHTHTHTH I YIHMRAFALEDKF 
ICSLLCSRQDNAFILVSVKLQRKNYFIiT 
RR 


12975 


26876 


A 


13113 


2 


288 


FEPRCKNSARGKVPRGFSPNLPHVTVET 
PEGSKTGFELESR/RRHRQIHSSAQCPL 
TAVPGAGDAI PEDAS GWHTWLPLHAQNC 
FLLY I QAPEQPPA 


12976 


26877 


A 


13114 


685 


340 


LSPPRAGSARPTGPPTDAPGQRSTWTCG 
ALKPRRRAIiRDAAENLFQELQEHFQALT 
ATLNLRMEEMGNR I EDLQK\NVNDL\MV 
QAGIENS IKEQMLKTVTANMSVFGDGAS 
YRSH 


12977 


26878 

• 




13115 


.65 

* 


3021 


GVLTQMGDEKDS WKVKTLDE I LQE KKRR 
KEQEE KAE I KRLKNS DDRDS KRDS LEEG 
ELRDHCME ITIRNS PYRREDSMEDRGEE 
DDSLAIKPPQQMSWKEKVHHRKDEKRKE 
KCRHHS HS AEGG KHAR VKEREHERRKRH 
REEQDKARREWERQKRREMAREHSRRER 
DRLEQLERKRERERKMREQQKEQREQKE 
RERRAEERRKEREARREVSJUfflRTMRED 
YSDKVKASHWSRSP PRPPRERFEliGDGR 
KP VKE EKMEERDLLS DLyDX SDS ERKTS 
SAESSSAESGSGSEEEEEEEEEEEEEGS 
TSEESEEEEEEEEEEEEETGSNSEEASE 
QSAEEVSEEEMSEDEERENENHLLVGKN 
LPGDRVPVP2SRFDRDSGESEEAEEEVG 
EGTPQ \S SALTEGDYVPDSLPLS PIELK 
QELPKYLPALQGCRSVDEFQCLNRIEEG 
T YG WYRAKDKKTDE I VALKRLKMEKEK 
EGFP I TS LRE INT ILKAQHPNI VTVRBI 
WGSNMDKIYIVMNYVEHDLKSLMETMK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

* 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D==Aspartic Acid, 
E— Glutamic Acid, r— rnenylalanine, 
OGIycine, H=Histidine, 1-Isoleurine, 
K=Lysine, I^Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=GlutamIne, R=Arginine, S=Sertae, 
T=Threonine, V=Valiiie, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /possible 
nucleotide deletion, ^possible 
nucleotide insertion 


. 


• 

• 

■ 




• 

• 




• 


QPFLPGEVKTLMIQLLRGVKHLHDNWIL 
HRDLKTSNLLLSHAGILKVGDFGLAREY 
GSPLKAYTPVWTQWYRAPELLLGAKEY 
STAVDMWS VGCI FGELLTQKPLFPGNSE 
IDQINKVFKELGTPSEKIWPGYSELPW 
KKMTFSEHPYNNLRKRFGALLSDQGFDL 
MNKFLTYFPGRR I S AEDGLKHEYFRETP 
LPIDPSMFPTWPAKSEQQRV\KRGTSPR 
PP \EGGLG YSQLGDDDLKETG\ FHLTTT 
NP£GLLPRGPGFSI*RVAEVSBIiDPVIGR 
NSAGDHRRGYL PAWNCDGTRNS SS Y FVL 
PCFVFVFLVCKLSRIKSFSLLWRKELCF 
LRDLPRAERVGI FRGAHVGQHKPHHHPL 
PLS TRTG LGWAV I W KGTGGS RVDCL I FG 
AGDLFLCWDERCPIiAPTHSSRPALSTTS 
PRPPTTQPQLLPAGLEQVFYYKWGVLK 
CIKIFFEE 


12978 


26879 


A 


13116 


469 


202 


Mil I VGLCEYTKSAVLVQHSDPLPPAPG i 
VFFLAVLDWDIKI WHPVFCLFVCLYE j 
MESCPVAQAGVQWRDLGSLQPPP\PG3S 
TSCF 


12979 


26880 , 


A 


13117 


193 


383 


AARMASTFFVFWFWFSTF \SLYSICVCV 
CVCVCVCVCRMHTV'QLYFFQVGCQCSSL 
LNPLFPMN 


12980 


26881 


A 


13118 


146 


350 


HYL VKFLLLAH PG S KAP PLNTLQP P P LA \ 
TREQPPLTVIFHYPPTSYKMAP\PYIjPS ! 
LTLFGLSPPAPR j 


12981 


26882 


A 


13119 


124 


254 


HALLQTLRDLSQAWWHALVIPATW\RAE 
AGGSLEPRSSRPAWAT 


12982. 


26883 


A 


13120 


150 


45 


SLPRLECSFTVLAH/ C/NLHLLGSSDSP 
ASASQWGGI 


12983 


26884 


A 


13121 


336 


446 


IPLKISSWAQWLTPV/IMPAVWEAEAGG 
SLEVRSSGPA 


12984 


26885 


A 


13122 


181 


39 


QNLTAQDGVQWCDLGSLQPP \LPSSWDY 
RRESLCPSSFFIFWKRWGFT 


12985 


26886 


A 


13123 


463 . 


2 


GPAVPSGLYLKGKPMRRLCASHS PESHS 

hdkgqgtlpripq/ pllgeggsr/plaw ! 
gavaglp snrpr i vplpaptrsgtrvrp 

HTGHQPGM5PGCGCESKWPH5AAAWPEA ! 
YPHFFLLPFPTQGCEVLGPLYTADPWVI 
CVSLLPS CPNSTAVDVD 


12986 


26887 


A 


13124 


429 


263 


DHFS FLCVTGS QS VAQAGVQW \ L I IAHC 
bJjls^JjAboL'FFAij Vr y & 1KJL lvjlrr JjoITI 


12987 


26888 


A. 


13125 


310 


486 


NKKTTRGES S KQRQPRVSHHPGWGTVAP 
SGFTAASAS\RARVILLPRPPE 


12988 


26889 


A 


13126 


124 


461 


GSPLQLLPSPLAAIiTRDCSEAPMGSCSV 
AQDGV QWRDLGS LQRLPPGFEP FS CLS P 
TPAP\FPSGWDYR 


i on or* 

12989 


26890 




13117 


2 


337 


Kwi/irrAiiPl/lV liiunnKX rVJLflj VfV\jVt\Xriy 

ARGFGSDQS ENVDRGAGSI REAGGAFGK 
REQAEEERYFRAQSREQIAALKK\HHDE 
E I VHHKKE I ERLQKE I ERHKQKI KMLKP 


12990 


26891 


A 


13128 


681 


422 


CRSDRWAKEHRGKRGQDSS KDVMARXiME 
APKQTAQYFFIFYFFETKSYSVTQAGVQ 
WLDLGSXiQRPPPG\SSDSPASASCAWPQ 
TAH 


12991 


26892 


A 


13129 


299 


579 


LVMFCKVTKIERLNLWRPGTVAHVLWSQ 
HFGRPRWAVHVGWGVRDP/RLTQHGETP 
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SEQID 
NO: of 
nucleotide 
sequence 

- 

■ 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 


SEQID 
NO: In 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- '. 
otide 
location 
correspon 
ding to 

■ A » 

last ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
&=Glutamic Acid, F=Phenylalamne, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, I>Leucine, M-Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Vahne, 
W=^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stopcodon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SLLE\NRNISQAWWCMPVVPATREAEAR 
ESLBPGGQRLQ 


12992 


26893 


A 


13130 

• 


268 


438 


KVDKSM KMRKNRP KNAENS KNKKAS S P P 
NDCNSSPARVQ/ SWTENEFDKLTEVGFR 
KW 


12993 


26894 


A 

* 


13132 


1108. 

■ 


1529 


KIHSS FVLLNAKLVAVAAKTPKFYKKKT 
DLPQTYHSAQTAVPE I PVSAHSTFIRS F 
IHSFIFETESYSVAQAGIQWHDLPS\PP 
GS \SDSP\ASVSRVAGTIGVCQQAWLFF 
V\FLVET\GS FPHLS PLTMIRNKKQLFK 
KGH 


12994 


26895 


A 


13133 


497 


370 


RPSTVSPRIiECSGLILAHCS \LDPPA3A 
SQTAETTYGHHHAS 


12995 


26896 


A 


13134 


217 


373 


CNHTEITYVNKKKHRPGTVAH/SLYNPS 
TLGGQGGLITQAQEFDTSLGNMTLS 


12996 


26897 


A 


13135 


257 


507 


PVSPGFPPL\CPVPHTPHLCPPCRYPES 
PGQPAAQHCGAPRQPQPNPRACSSRGLF 
RCPPACHHAPSCLLCVCPPLPQCCRVQG 


12997 


26898 


A 


13136 


137 


309 


PSFPTHPQFCFLFCFLFVFETE/SSIVA 
QAEVQWCDLGSPDQGSSASPPSASSLPL 
SIW 


12998 


26899 


A 


13137 


297 


419 


GGSAVYYI WEIVQVLWLMPVI PT\WEAY 
AGGLFEPRSSRPA 


12999 


26900 


A 


13138 


213 


492 


Q VG INYQ P PTWP VGDLAKVQ RAVCMLS 
NTTAI TEAWG LPGA I S S AKCAL VHW \ YV 
GEGMAVGE FSEAREDLAALED YETCGPW 
NPVETEAERR 


13000 


26901 


A 


13139 


246 

» ..... 


534 


DRVLPLLPRLECSSAILGHCP/APASQV 
TGTTGTHHHTQLIFLLYCWLCPPPLASE 
ASAESPPLLPRGFLCFSSTTSSITAIPA 
TTRD YVDVS RSGS 


13001 


26902 


A 


13140 


1 


206 


SLiEWPFFFSFSETGSHSFTQAIVQWCNH 
SSL\SPTSASQIAGTTGVHNHAQL/VFV 
FCFAFMS PQLSNSC 


13002 


26903 


A 


13141 


164 


582 


GLVLYNLYLRS YLMVF I YINSRLLQVTD 
ASRERRENESSWFCRSVCRM/PPVGPGP 
SCVHPCSPRGSISPCTPHCPLKTTQAPA 
\PPRPLPRVGPGPHLDQSCXVAARPIiYD 
P I LVAAHSS WPAP PS PVPVPQEDTEHGA 
G 


13003 


26904 


A 


13142 


235 


367 


LALNTFCWLGVVAHACNPSTLGGQGEWV 
TRGQE \QANMAKTCLY 


13004 


26905 


A 


13143 

* 


91 


403 


GALQ PATAP WE PLS GIiAEAGAI S PCLQG 
G/ SGGRGAS ENRGCTRPGR VPGGRRLNR 
PRAWSSWPAPPAPGSERPSAINCPRAEE 
CGRRVWDWQAALPAAPAWVPV 


13005 


26906 


A 


13144 


558 


147 


GHSFQNWLLGCPLQLQRHLLHPPDASEE 
CGSPPWTPLRPHIHPSAGPHGSTHK\HA 

*+gm *mw wnt T%nrt t Mill WS^t/* W tit flNT ^MAa\ 

GGCLFPLRPSPTHPCGPPVPLPWSQXQA 
PLCPHPPIHSPHDWECDPGKSALIPPPA 
AQSPSRLIKHAANEPECSPRNRRPGIP 


13006 


26907 


A 


13145 


321 


421 


GCAQWLTPV/ 1 PALWEAKAGES LLFRSL 
RPAWAI 


13007 


26908 

* 


A 


13146 


3 


398 


X ELLQSHDKTLTD KKLLLLDEQRKWFLE 
MESTPGEDVNNVELTTKDLEYHINIiDD\ 
KAGFER I DSNFERVS AVG / KMLSNS IAC 
YRE I FHERS VKVAN FTVAI LPQ PHKLSV 
TINLI IQQPSTLRQDLLLAKR 
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SEQID 
NO: of 

nit j*loAtifl A 

sequence 


SEQID 
NO: of 

JJcpUuc 

sequence 


M 
etb 

• 


SEQ ID 
NO: In 

09/515,1 
26 

- 


Predicted 
beginning 

nil f 1 A/\4t /fa 

location 
correspond 
ing to first 

a IIIIII V aCIU 

residue of 

peptide 

sequence 


Predict- 
ed end 

otide 

location 

correspon 

last amino 
acid 

residue of 

pcpuuc 
sequence 


Amino acid sequence (A^AIanlnc 
C=Cysteine, D=Aspartic Acid, 

R=f?Iiit'imir Arid F^Ph^nvl alanine 

G=Glycine, GMHistidine, I=lso leu cine, 
K-Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P^Proline, 

V; UIU Vanillic^ K\ gllllUCf p- OvIIUCi 

T=Threonine, V=Vallne, 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, ^possible 

n itf*l<*rvti#lA ri^lo^tfkn \sn#\anllilflk 

uuneuLiuc ueiciiviif r^possii/ic 
nucleotide insertion 








1 31/17 






S KTORS SR YYSIjS VCVC*VCVRVPT YT Y T 

ffffetefls\apqagvqrcdlsslrlp 
pp\gssds pfstsqsawdyrrepqrpts 
lhyfvqlkhnlhrvkrrdlrnttr 


13009 


26910 

t 


A 


.13148 


3 


275 


KHLSDLQ I LRLGLS SKTRKTRY SHFAQG 

TNLNVNI \ TKKH PHRNVQNNVWPNVWAP 
WPKEVDT 


13U1U 


/oVJ l 




1 "3 1 A O 

l Ji4y 




ICO 


HWnHT'K'T.PQT7V T rZl'K'FT r r /DWVTJAWPWR 
J. XVJjIj £> C X i-H-tVC il/ tCT^IVKy * * *S 

KIF^NYPSDBGLVTRMYKSLKQLYRKKI 


13011 


26912 


A . 


13150 


391 


29 


FCFLFLFMFLFFFKMQDGVQWCNLSSPQ 
PPPPG\SSHSPASASQWEABNCLNPQO 

rviPCT JTVD aT3T.T3^k'D VT>1?DiyDt>'T r l?0 7DVD 
L-KtOi 1 AtOK AK1 jFIJK KKKKKKK 1 KKKKKK 

ERASCAFCGAITFVNMWOQKHHTCNHLP 
WLTVFBTNKL 


13012 


26913 


A 


13151 . 


351 


157 


P/^VOCDT PITVTWPDM'M t 1 £»hr\rkiVTf»T VTItf 

r uibKr lit-V x lWr K 1 JNliSX XWX ACx X Ut 
LYL\NIHTzTLIYTYVHTYTHIHIFYIY 


13013 


26914 


A 


13152 


402 


111 


TFFGGBKPPFTLRPGPPRREGGGGPPGP 

■P'RYVSWFPVDT/tlUPrtPQPDTr /DOWlTCrSDV 

W^PQI FIiWGGKSRAIKESGKGFFFSRIT 
LLVFATEKKKKTTLF 


13014 


26915 


A 


13153 


310 


141 


RE HE KQ KP LS Q VR W CTP LVQAAQEAEAG 

RPT.PPP Q QP7/y<3MT V TPVMQWPPPHPV 

T 


13015 


26916 


A 


13154 


153 


21 


LNKCVIS WLGAVAHACN/ PSTLEGRWIT 

fVJnRTrKTQT /5 .ZVK"P"HT ,Y 

UUyuI £\. i. O JJVSOUrUvATXllJ X 


13016 


26917 


A 


13155 


429 


275 


S VD I PLE KTL INKD VFiffiAAKtCkV'S / 
QARVKFKKR Y KPVKTKW F FQ KLWF 


i mi *y 




A 

• 


1 3156 


mi 
J 03 


42 


THTHTRMHAHTHTHPAII PAFAKNGLGT 

FHWTffiPT YWT) IfVTf PP T JR TFTVa T P <!NT .K^ 

REFR 


13018 


26919 


A 


13157 


198 


13 


SHGVAILLFFHFLNKRAFT\YSPAMNSS 
KYFAKI 


13019 


26920 


A 


13158 


192 


2 


ISNSHRKDIKDPEVLLIEQKYILSGQMQ 
WLTPVI PAL /V/ EAEAGRSLE FRSLRPA 
WATDRDSVSK 


1 JVZKf 






ill 


Aft 
4U 


1QA 


P\PGSRHSPASASQVALFLNRK 








111 £A 


t 


1 *7fl 

I /o 


PTMVtiflPADKTNVKA /AWfiKVfS&WZVflRY 
ririvjJorAiJAiiN vxui/ nnvjiv vuruuvja I 

GAEALERMFLS FPTTKT YFPHFDIiSHGV 
SSYTj 


13022 

* 


26923 


A 


13161 


411 


41 


ESliQICVGLHAKPSWDACRPWLGQAWFR 
YFLLTFLGPLVFLLVFWFS KI YFLYHLK 
NPSLSSCYNLDXFNTQYLKYCKKKKRTiG 
AVAYACMPS ILGSOGGP TTRGOEFETRTj 

GNMAKPDAWVDP 


13023 


26924 


A 


13162 


372 


218 


FWKRSWISKKGLFKRGVSLSLSIiS/CDT 
HTHTHTHTHTHAHS G YLCVLRKVK 


13024 


26925 


A 


13163 


247 


89 


QGLVLS PWL KCRGG I TAHCGLLGS SHPT 
TSA\QVAGTTGTQHHAQLKFFILIIL 1 


13025 


26926 


A 


13164 


122 


409 

* 


FPSTAIANSH/CSSPQGCSAYGVAIRIiF 
I^SNKLAL\LYGLAIj/NSFFCNFQEPS 
\LGSLSGPLSDNKSNNLCCFKSPSSWLF 
AIAVLGNYYSLNTVRSGPJIDPGTPNACS 


13026 


26927 


A 


13165 


69 . 


313 


PSVSLFFPPNHNIVIHMGLLKINS I FGE 
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SEQID 
NO: of 

n itrlpfttirfp 

II ULICUUUC 

sequence 


SEQID 
NO: of 

sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 

location 
corresnond 

ing to first 
amino acid 
residue of 
nentidft 

sequence 


Predict- 
ed end 

nude- 1 

II UVIV 

otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

senuence 


Amino acid sequence (A=Aianine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid. F^Phenvlalanine. 
G=Glycine, H=Histidine, I=Isoleucine, 
K— Lvsine. L^Leucine, M=Methionine. 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine t S=Serine, 
T=Threonine, V=VaIine, 
W=TrvntoDhan. Y=Tvroslne. 
X=Unknown, *=&op codon, /^possible 
nucleotide deletion, V=possible 

nucleotide insertion 














GLALS S RMECS VT ITVHCXjLDLLGSGG P 
PSAAS / R VAVTAGTCHHS WLMFLS FV 


1 jUZ / 




n 


IjlOO 




Oik 


T KSMTPLPATiNVS Y YLP PLRPCR.ERG 
MVQDRFFCFLFCCCCFI FS ETGSHSAAQ 
AVSQNKKTTGWAWWLTPVI PATHEAEAG 
\ESLERGRQRLQ 


13028 


26929 


A 


13167 


296 


68 


KNGPLNLATKS LL I LTMKCLGVyTiGGKP 
NCSKLKKNKI KTKKWGWAPWLTPILPA/. 
IiLKAEAGRSLEARNIjRPVWT 


13029 


26930 


A 


13168 


2 


235 


GATPCPREPSLFPLAGWAPLSCWAPPPH 
CTQGTTPPGLMGTG\ AP IAI IGGNARSP 
PMPFLPLSGTRRALSAOECPTF 


13030 


26931 


A 


13169 


248 


406 


TERCGFDLLSLGQVQWIiTSVIPALE/EA 
EAGE LLES R S WR PAWATWQNL I SLKK 


13031 


26932 


A 


13170 


33 


308 


NSLQGAEMAAAQSSLFSASEGYRGERGR 

GRGRGRPGRG / RSEGDRGGRGRPGRGRG 
ninrT.T.T.D t rxrau dttt .p t .wimp fip Tnvrsnp 

VTYLMESEI 


13032 


26933 


A 


13171 


294 


56 


ICKRNYLFFIYFLREGLTLLPRIiECSGA 
CALSSRDALISLCYPGWSQAPGLK 


13033 


26934 


A 


13172 


71 


476 


DSLNYDNHI YQM P L I Y INMGLAVT I SLL 
G\ILVYRSHLIS SLLCLEGI ILSLFI IA 

X Xi X X XuN X MO iMJOJH X V «r Jiii JLi V r /mv&nn 

VGWEECGFRSRPTALILEGPLRSCQPRT 
CVRRPACTRRGPAFVGKNRHSIGG 


13U34 






1 "2 1 T2 


SO 


3 15 


YICIYTYKYICIYTYKYICICVYIHVYM 
YMCVYVYICICMCVYICIDSKGR 














NTMW P£IM7TP lCi\ nT.PMPTNPTiQQPQT.T. 

REKDPPTTSGPQTDQPKKHLTNFKSGKR 
PLFTLFSNLPHYPSTSFSFQS WRHTS I S 
PFS 


13036 


26937 


A 


13175 


128 


325 


FLRVI L I CHKM YGC VCVCM/ YACI CNGM 
CIYMCVCVKSWKLKPELKCLCISDEVGW 


13037 


26938 


A 


13176 


134 


311 


INSNSEKMGSHYVAQDGVE/WQGLFTGH 
GlT^CNLlCx^SSNFSASISQGAGTTGI 
VANG 


13038 


26939 


A 

• 


13177 


577 

• 


1052 

- 


SPTSTRTGSGVAMWLSQPMGRT\CRSKV 
ASKSRLHGPLIWAHPPIVRCPMLRHHY\ 
KAOAGRGLSLEE \ LRVAGI YKKVAOT IG 
IS\EDARRRNQST\QALQAKVQRX,K\BD 
RSSLI LFPRK\ PLAPKRGDS S AEELELD 
TQLTGPEMPIGNVYKEKARVIAD 


13039 


26940 


A 


13178 


496 


678 


TLNFWJAQGLKITRPWKNAPVFPVIW\R 
LRRiCNLLTLKGEICSDPKWPYCLPPWKT 
KKNLV 


13040 


26941 


A 


13179 


192 


421 


GIYTFFLSFFLFLRQSbTLSPRAGVQWC 
NLCLPGSSDSPASASRVPG\ATGT\CHH 
GSANFLYFLVGDRVFGYVGPR 


13041 


26942 


A 


13180 


144 


5 


LQLAI KLLKMRLGVVAHICNPS / TLGGW 
G\GRSQGQEFETSLANIVKP 


13042 


26943 


A 


13181 


351 


51 


KKKKKI FFGTKKRRFFFLKGVWGPPPPK 
VSPPPPPKISPNPPKNNPLLKA/ IKSPQ 
NWLFFFGPPSCKFFPPFFIiNPKNSPWEN 
PPSGGEFSPKKKKKKSAI 


13043 


26944 


A 


13182 


2 


218 


IHFGCFVFLFFETEFCSCHPGCRAGVQW 
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Predict- 
ed end 
nucle- 
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locauoD 
correspon 
ding to 
last amino 
acta 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanioe 
OCysteine, D=Aspartic Acid, 

iv — oiuiamic ACiu, j 1 — jrncnyiaianine) 
G=Glycine, H=Histidine,I=Iso leucine, 

X/ — "1 vcina 1 i — 1 mi ri n a 1V1=1V1o^Ii t/>ntn m 

iv — juysiiiC) i^ijeucine, iTieinioinne, 
N»Asparagine, PHProline, 
Q^GIutamine, R=Arginine, S=Serine, 
^Threonine, V=Vallne, 
yv— lrypiopnan, i = iyrosine f 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V-possible 
nucleotide insertion 














EITGMSHHACPI FKKHS 


13044 


26945 


A . 


13183 

i 


33 


304 


E KHG I FRAVK I L CM I LQRWT YATHLS RP 
rG YTAPKVNPKl*VFrijGUWDV&M/ SGSlii 
VTNTALGWGLLI RRSSACMGQGDMGNLC 
TFLSILL 


13045 


26946 


A 


13184 


111 


13 


FSFFLIVEMGSCYIAQAGLELLGSSNPP 
S S AS Q SAG I TGM S HHP AWLATLKRAFVS 
D \NS LS F PSS ENAHLSI LAOHFCLMGFF 
FFQ 


13046 


26947 

* 


A . 


13185 


140 


361 


ITCCCVCIYTHIHTYVYVCVYIYTHTYI 
RVCV / YY IHTHTYI HI Y IHVF I YI YI YT 
FMHTYIYPYVCMVLLQLTIi 


13047 

- 


26948 

♦ 


A 

- 


13186 

• 


64 

* 


759 

r 


CLSAESAPTS TMPMTLGYWD IRGLAHAI 


RLLLEYTDSSYVEKKYTLGDAPDYDRSQ 
WLNEKFKLGLDFPNLPYLIDGAHKITQS 
NAILRYIARKHNIiCGBTEBEKIRVDILB 
NQVMDNHMELVRLCYDPDFEKLKPKYI»E 
ELPEKLKALTS\EF3GGKRPWF/3QGDK 
ITFVDFLAYD VLDMKRI FEP KCLDAFLN 
LJCD F I SRFEGLKKI SAYMKS SQFtiRGLL 
FGKSATWNSK 


13048 


26949 


A 


13187 


41 


426 

* 


LFFF FFESGPCWVTQAGGRR/ WNDHGSL 
Q PG FPGLKGS PCLTLRRS WNYRPGMCRH 
TQLVFAFFFFPREKKFLLGAPTGFKTRG 
LRGPSRFGFQKGGNKGQEPPPRAKOLIF 
LEPERGOAFFGVFFGPTQ 


13049 


26950 


A 


13188 


39 


197 


FFFLFKTRSCS FIQAGVQWCEHSSLQPQ 
TPG\SSAAPASASCKAGTTEAALKL 


13050 


26951 


A 


13189 


221 


396 


GIQLQGAEGLWDVTNSPFCWRLYDQKTV 
YHE CRMWANSHCP \ P PKGLLRDI TPRCW 
AP 


13051 

• 


26952 


A 


13190 

- 


2 

* 


826 


PGSTI SSRRRGACGSRGGHFPSPRGGSG 
VASLERAESWSTEPAKAIKPIDRKSVHQ 
I CSGQWLSLS TAVKKI VENS LDAGATN 
IDLKLKDYGMDL I EV SGNG CGVEEENFK 
GL\ TLSALKHHTS K IQEFADLTRVETFG 
FRGEALSSLCALSDVTISTCHVSAKVGT 
RliVFDHDGKI IQKTPYPHPRGTTVSVKQ 
LFSTLPVRHKEFQRNIKKKRACFPFAFC 
KJJL-Ur JjbuoFAMJjr VUf \AJ!Uj J, FK£> 1 Ft* 
HPCSLEDNVITVFSSVKNGPGSSR 


13052 


26953 


A 


13191 


235 


1 


S PCARQCCPPNPAGQEPRRRLERGPGKW 

UDDOTVT7DT DHYf /O JlOITDT.T.'DQaOT.OnT. 

VSGRVGGRVGRPGKAVQVSGGLN 


13053 


26954 


A 


13192 


2 


386 


VIYLLLFFETESCCVAQARDGVSPCWPG 
Wr U I FG \obUIiPiioAiri\./ V/UjX 1 o V brlK 
TRPDVRGFKKC/Gl^LEMSHIiALLIj/TT 
SHRFW YFVLLFS FVS KGFFI FFFFLFFF 
FFLGKGFFFFFPRGGGGGQI 


13054 


26955 


A 


13193 


749 


506 


GQTKAFVLS YCGAS PS IKQ/ PCPQAKDH 
PLEPSMHPEGTQLQSCSIMLGPRQIjSSE 
KQPLLPPRSHLKSSPMIiRACKGLTS 


13055 


26956 


A 


13194 


73 


297 


RMPGFGALGSPFSCQRIECPC\AGAGSC 
TYASFCKCKEYKCTSCKKSECGAFPGNQ 
GAGQSQRREPRAQQAGAGQ . 


13056 


26957 


A 


13195 


670. 


390 


PRETYIKPFARS IGCQPPKTIHPTSRRP 
STRPPARPHARPPVHTP/APSVHMSDRP 



1337 



t 



WO 01/64835 PCT/US01/04927 

■ 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 

USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- ". 
otide 
location 
correspon 
ding to 
last amino 
acid 

-— - — 2 — ^1 mt m 4h " m^ 

residue ot 

peptide 

sequence 
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OCysteine, D=Aspartfc Acid, 

tT^-f li »+<i mm I j* A mitt 17— DknnnlAU.:. 

£^=i>iuTamic Acid, rnenyiaianine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
i\r- juysine, Leucine, ivi— ivieinionine, 
N=Asparagine, P»Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonfne, V=Valine, 
W=Tryptopban, y«Tyrosine, 
A— Unknown, —atop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














STCPPSIH\RPPSTDVSRPQYSLSPSTS 
LALPGQSCLGR 


13057 


26958 


A 


13196 


2 


438 


LS FTS I £ FCHLLFLNQLI CNYVI FTkS S 
TIS CEALGRLLWYPWTQRFPES FGDLS 
TPDAVMGNPKVKAHGKKVLGAFSDGLAH 
L\ P PLKPTFAXVS ELHCAKLHVDPENFT 
LLGNVLVCV/MKHNFGK 


13058 


26959 


A 


13197 


57 


499 


S PS WPRSSARPCSGHCLLPGLWDHGKAP 
CLRAARHSQGHG FRKGRKBGCGLWAAGW 
TKNPVPTWRAG/S RRGPGVTS PPPVPS V 
QNP P\P PQRRHGGSQATRP VLFTPPPKH 
MEGPNAPILGDQRITAPPPBILYRPPRA 
ATCDALRRG 


13059 

• 


26960 


A 


13198 

■ 


34 


465 


GILPYSPCVLRAPNSKRVRKCGCTTPRH 
L/CKATRKCGCTTPRAPLNETRKCGCTT 
PRHLSRQLGNAGALPRGHLNETRKCGCT 
TPRAPLNATRKCGCSTPRHLSRQLGNAG 
ALPRGHPLKETRKCGGATPADAFYETKK 
IGGGL 


13060 


26961 


A 


13200 

• 


177 


1075 

■ 


PTSS SMAFPKKKLQC-iLVAATI TPMTENG 
EINFSVIGQYVDYLVJCEQGVKNIFVNGT 
TGEGLSLSVSERRQVAEEWVTKGKDKLD 
QVI IHVGALS LKE S QE LAQHAAE I GADG 
IAVIAPFFLKPWTKDI LINFLKEVAAAA 
PALPFYYYHI PALTGVKIRAEELLDG I L 

DKIPTFQGLKFSDTDLLDFGQCVDQNRQ 
QQFAFL FGVDEQ LL S ALVMGATG AVGS T 

ynylgkktnqiv\eafeqkdfslalnyq 
fciqrf infwklgfg vsqtkai mtlvs 
gdsngptpastcrkppgeft 


13061 


26962 


A 


13201 


35 

i 


464 


VQEFKTS LGNVAKPCLYKKKKKKS PLGG 
GVPPP P KKTWVGGTLS PQKIRPAGPLI F 
PPPLPRGQQGEPLS PI KGGGGGGSS \YP 
PPPPPCLTGRA/PPPSPISAKSSPSPPP 
CQYVYHPPLLPLFFFLHALRRATAPLFL 
FLVRA 


13062 

■ 

> 


26963 


A 


13202 


1153 

• 


259 

■ 


AG3VLRLG WTGS RMAS DSGNQGTLCTIi 
EFAVQMTCQSCVDAVRKSLQGVAGVQDV 
EVHLEDQMVLVHTTIiPSQEVQALL\E3T 
GRQPVLKGMG\TGQFSESGGQPYAI PGG 
GLGTVQGVVRFLQLTPERCLIEGTl\PG 
LE PGLHGLHVHQ YGDLTNNCNS CGNHFN 
PDGAS \HGGPQDSDRH\RGDLG\NVGAN 
AAGGAF FRMED E Q LKVWD VI GRS L \ I ID 
EG\EDDPGAREGQSLYPKITRELPGSRL 
SCGI\IAKSAGLFPEPQSKICSCDGLT\ 
I WEERGRPIAGKG\ RKESAQP PAHIi 


13063 


26964 


A 


13203 


281 


420 


VDGSKKYNEMPVIPALWEAKAGGLL\EP 

KlvoKlAWAliCCilJovb 1 JvJM 


13064 


26965 


A 


13204 


3 


1125 


SDSPQTPRMRVMAPRTLILLLSGALALT 
ETWACSHSMRYFYTAVSRPGRGEPRFIA 
VGYVDDTQFVRFDSDAASPRGEPRAP\W 
VEQEG PE YWDRETQKYKRQAQTDRVSLR 
NIoRGYYNQSEAGSHTLQWMYGCDLGPDG 
RLLRG YDQS AYDG KD Y I ALNEHLRS CTA 
ADTAAQ I TQRKWEAARAAEQWRAYLEGT 
CVEWLRRYLENGKETLQRAEHPKTHVTH 
HPVSDHEATLRCWALGFYPAE ITLTWQR 



1338 



WO 01/64835 PCTYUS01/04927 



SEQID 
NO: of 
nucleotide 
sequence 

i- 


SEQ ID 
NO: of 
peptide 
sequence 


M , 
etfa 

Ou 

> 


SEQID 
NO: in 

09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
C=Cysteine, D=Aspartic Acid, 
Kj— Lriucamic aciq, r— rnenyiaianiDe, 
G=Glycine, H=Histidine, I=I*o leucine, 
. K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P-Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i— inreontne, v—vaune, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


! 






i 






WPSGEEQRYTCHVQHSGLPBPLTLRWB 
PSSQPTIPIVGIVAGLAVLAVLAVLQAV 
MAWMCRRKSSGGKGGSCSQAASSNSAQ 
GSDESLIACKA 


13065 


26966 

■ 


A 


13205 


417 


13 


GHPRVMLQCHMGS AS LS IMT PP P IRS YQ 
PALTRSSATKPA\TSAPSITPMTPVTAN 
I S / VWPTANTTTTKLSNPTSS /STLSTP 
IWHPSILPAPPSTPAPLTTVTNSATTST 
ICFHVQLFQAHHCHHCIAAALEYTS 


13066 

• 


26967 


A 

» 


13206 


1139 


559 


GLSELSIHPSVARVQEGHGATPBRSIPW 
PGGKPRRGGRGTGLREWGGGSHRSLGQT 
KKKGAKAKRDHCAHGPIAAFQGPFPRAT 
QLLS PCHGCS WKENQPP*GPPGHACTAW 
EKTVTHGSCRPVTKPSPDPGFVGQAPGT 
KGNPAPGMDG CAGFRI S CHPVP I TKPLS 
NTGICVTjLCTrEFGFLlAlVGCWGFK 


13067 


26968 


A 


13207 


306 


332 


ENGEIRI YX YXCLYL I KKHWEEAGCGSV 
HL * SQHLKRLKWED YLRPGVPDQPGQHR 
ETPSQKLNK* INNDF 


13068 


26969 


A 


13208 


48 


233 


GKQNHLKCLWCRLLFLLSAVISRNVYIL 
T * LSLSLSQCVCVCVCVCVCVCVCVAIL 
ENYPR 


13069 


26970 


A 


13209 


250 


2 


YCEGDLGSSRPGVS KCREFSPRRDKRXM 
PWPGAVT YAYNPSTLGGRDGWIA* A* EF 
ETS LGN I V KP HL YTHTHTHTHTHTHT 


13070 


26971 


A 


13210 


615 


120 


SVFWPLLGGVSWSGYTEVRGPLEKAVCP 
LS ELE YCAGRSAALFRAIRLFKDALSLLi 
KLCPQPPFPLGAPLSP*EALVCVVPLHV 
STCLS*WCPTPCPPERSWCVLFPSMYPR 
VCPDGLLPPVPLRGPGVCWPLHVSMCL 

T OT CVTC5T T TIT T1 rpfWIT, 7T.TTJ OX T PT IT T Tf* 

JjSIjFNS LiiRIiRTCG VWFb VPViiVC 


13071 


26972 


A 


13211 


239 


429 


FI LLRNKLQPGTVACGCNPSTLGSQGKW 
IA*AQGFETSLDN1WKPCFYLPQLKKKK 
KIKNLGV 


13072 


26973 


A 


13212 


954 


503 


VQL.LPLCGRDLYLFAFNLPTYRCWQHFL 
SKPRAPILAGGSDRWPCPCCPFWLP*WC 
HVSAHAPQPFPCAKPEAPLPVGCPCSPA 
P S LBAALLI LRAGIiWLRAPFCFS AHLL 
SCHSQCCQFPCSEKRSCLGGRDRHAASN 
PLSAVWPPGE 


13073 

■ 


26974 


A 


13213 

* 


1844 

• 


1147 


LHSQIYSTAKKASLSMKGSRDKTRAASS 
RPVPSVIX3VPPWSTLLQHPQNMWPGPAQ 
QQGQPSGRQAWCTPGEAPGAEAAPQ*QP 
HPEEDHSGGPQASAAIiAIiPPSPPSQRDV 
QGETGMQGRS APRS AS SSACCACRWSRIi 
PCPQLLQRHPGLRLVSPAHSRPPGPAPS 
S SSGSGLVPG YLPQKGLAGLSAGAVLC P 

RGKRFLRF 


13074 


26975 


A 


13214 


413 


60 


LCS KS CHGS FAMDCE TLLTFPRLF YP I L 
HMPTVPGQALPVHTPFL*LKLPLLPGIP 
SCHSPLFLD*AQESPPPGGLPGHPPGSG 
VLCLWFHSLKHPVLSGSSVSTPGSASPS 
RSRAL 


13075 


26976 


A 


13215 


67 


196 


GLSLSSPKVTHNATLMGCIFQN*KAFVY 
HSMKQKKI I LLFNMA* P* YPLDFGEQWP 
LHGSHAYSTIL*LDLFCKKEEE*DEIPY 
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13076 


26977 


A 


13216 


161 


2 


S VFFLF I DRRLYL I KS VWAAYNS STIK3G 


13077 


26978 


A 


13217 


57 


184 


CLTVCKDKVSLCYPGWSPW*SYLTAAS 
NSWSKAILPPQPPE 


13078 


26979 


A 


13218 


292 . 


400 


LKVKQGRGHLDLNQGPLDL * SNS LPLSH 

liroxlKKlv 


13079 


26980 


A 


13219 


186 


394 


YLVLSFLFYFLFIFLSRVSLPIFFILEI . 
* LG I VAYTCNPSTLGSQGGR I A* AQE FD 
TSLGNIVRPPSLI 


13080 


26981 


A 


13220 


315 


448 


KTKDLGWDHVYNPSTFGG*GGRIT*GQ 
KFETSLDNRARSCLiYK 


13081 


26982 


A 


13221 


3 


13 


LRPLRSFVSVERLWASCCLGSELWDKT 
KRKKRRELSEEHKQEIKDAFELFDTDKD 
EVI D YHE LKVAM I ALGFDVKKADVLNI L 
KDYDR* VTGKI SFGHSIE I VTDFQL*R 


13082 

■ 


26983 

* 


A 


13222 


1 


715 


VDMHSRGSPELTHASTHASGKMAAPWAS 
LRLVAPMWNGRI RG I HRLGAAVAPEGNQ 
KKKRT I LQFLTNY F YD VEALRD YtiLQRE 
MYKVHEKNRSYTWLEKQHGPYGAGAFFI 
LKQGGAVKFRDKEWIRPDKYGHFSQEFW 
NFCE VPVEAVDAGD CDINYEGLDNLLRL 
KELQSLSLQRCCHVDDWCXSRLYPLADF 

t r7v/^t7i T7T5LT7\ f<fiT3D T f TDTV 2i VXTl^* T71TLTT 

V AvjAt KWAol-JrK If K 1 vaAfUnlSj r HrLL vj w JJ 

LSSGWATCPGTSPT 


13083 


26984 


A 


13223 


1019 


1238 


LTGAE YKKSDGLTE S VTANLE PEAPKVF 
SRLDDEAPVTVLPARIiPLPAP * HTARPD 
P PQRRRS MQIiAPARP rXi 


13084 


26985 


A 


13224 


1534 

> 


1181 


RWNS E PD YSAGGPRAGRGARLLP VPKS P 
GGGGAVAPGPQTP LRAERTAVRIiS ISS * 
NS9 TAP ADFUAAP PIjriFU FAKAU iWvoA 
FLRVI FCFPSLGFGCWEKKI AFVPFPPT 
DEKCL 


13085 


26986 

• 


A 


13225 


328 


22 


CLESPRDRPFQGPAFPKHPPPPLAFLSQ 

nfunfftvppppMfiTfOT eo/M?Ttretooot!Ti*T' 
PGKVQYFr BSMuKoijSbUiS XKMoKoVFT 

SPPAPSPPPPPFLYFSIiTAEGGEV*KET 

LDSDRAPPSSREAPICQV 


13086 


26987 


A 


13226 


180 


5 


N5 1 D YivII^KWoWKPGAVArl x CN PSx JjQ 
SQGRRIT*AKEFETSLGNIVLLIYTIIY 
YI 


13087 


26988 


A 


13227 


1*1*7 

237 


coo 


HKNLYWHIDLVMYFSRIS FYFTLSAAI 
CLS L YLG SHI LS P FL * IiFYCAYTTL YLC 
LITTLFFLIPKV 


13088 . 

* 


26989 


A 


13228 


140 


366 


NATCLWHPMSPYAPTY*C*LSPGIiLRWV 
LAITPI VLMILISS YNHVS IS YGGTFRS 
NFFLDQI ILTFLI YELHIV 


13089 


26990 


A 


13229 


A^ ^ 

257 


A, #* 

25 


LPAHSLGRDLSAQPYSnPRrGGELiKASG 
OSSL CS VARS S L * LHVCVCVCVCVCART 
CVNVFACVLLS TKV CLHPL PE 


13090 


26991 


A 


13230 


561 


52 


WPSCSSSGSPMLIiPPVPPGSLG* *SFSP 
LPAAAALPAPCAAPRPAPLRPCGPAPTP 
AAAPAPAPPAAASLS AARAAAGS PPGSR 
PS ARGARRRPSGP CAP PEPRLG PRAPS S 
APRTRTPS * GRARSGGSAGNAPS ARRTP 
QGP PRAACS LARS W I WPAGRGGS CGRAL 
GA 


13091 


26992 


A 


13231 


335 


103 


CQKYSTS YCNKWIATIiVIICSRKI *DGL 
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C=Cysteine, D=Aspartic Acid, 

IT— rinMmSn A^irl IT— Phonvlalonina 

JC/ — oiuum ic aciu, r—rnciiyiaianuie, 
(^Glycine, H-Histidine, l^lsoteucioe, 

\{a I vein a Y a¥ aiirinp MsMpf hinnina 
XV IjJ 3 II 1 C, JL/^lvCULIIICj ITJ ITJClIIIUlJlIlC, 

N=Asparagine, P=Proiine, 
Q^GIutamine, R=Arginme, S=Serine, 
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nucleotide deletion, ^possible 
nucleotide insertion ; 














KGQAWWLVPVI PMLWKAEAVGLPEARSL ] 
SSPCNMARAPALKklFLIGQV 


13092 


26993 


A . 


13232 


452 


712 


VGIii.GED*PPFSPAQSQbljSCWFDCSCP 
PSWPPDQRPGERPQPPDPPESHGABEEG 

TQ j 


13093 


. 26994 

■ 

* 


A 


13233 


480 


2 

- 


SCFSEDWNPLKFKLQLSTSLSYRKSHTL 
HCAHHFRGQCHSD*AFIjVFSIjTCI 
SLR FTAGD VS P S Y P E KLR VL * EGRVAD F 
HVMARKGQQAAIAILLQKARTGYVWWLT 

rvl 4* ALiWKAJSAovJo V JurxCS) JjKu 1 nUXlWc. 

TRLLDRGESASQSAGMTGM . 


13094 


26995 


A 


13234 


692 


824 


QIMAIjPFPSPPVPPLVM*CKGGKIPPPP 
SKNPRTCVGNTVI I KA 


4 f% f\ r\ J™ 

13095 


26996 


A 


13235 


657 


872 


PRGPRLDVLYACE PQLI 1 ClihliGl PLKV 
RFWLEVVDHTYNPSTLGDQGGRTA*GQE 
FKTSVGNSKTLSLKKK 


13096 


26997 


A 


13236 


168 


365 


QESLCILQTCTCAHQNLK* KFGGKSRLG 
AVAHACNPSTLGGQGGRI I *GQEFETSM 
VNMVKPYLY 


m m Ana 

13097 


26998 

■ 


A 


13237 


31 


338 


V^IVYICVCVCVCIYIYEIYMIYI*YI* 
Y I Y I KHTVQ * H * HLAFS TFTLIjCNCHHC 
PSIiELFHYPTLTYHTH*I ITPHCSSC* Y 
yPLir irrtbx VrtJxoKxJjV 


13098 


26999 


A 


13238 


714 

■ 


1001 


SSGV S CLNRAGW I TGVNHH I HS LVLXiKR 
EK*TQNIITEVCliMFLLLPHLAIEATVT 
WPGMVAHACNPSTLGSQGRRSV*AREFE 

V isnDN JLAKFVJlt 


13099 


27000 


A 


13239 


250 


406 


NLA* HGGVHMWS SASQVAGI TGTRCHAQ 


13100 


27001 


A 


13240 


256 


193 


WINFISLPLSSLT*TLSLCVCACVYVCV 
CVCLRPCVHIVGGNLKRYIVFARSVTIiG 

rWPUT TJTOT TTMMnT 

uvr 1 nlirlloJ-tl iJMWDJ. 


13101 


27002 


A 


13241 


1 ■ 


257 


GMDIiWQMCHS LSItPLI FFFFFFS EKTRS 
NFVPQAGFQLRALRDPPA*ASREAGITG 

T TtTLTTrD TV'f^T T3VTTTlTT»T .W"D r PTT .T .DT.T.'D'D 

K 


13102 


27003 


A 


13242 


319 


406 


KRGWT * WLMP VI PTLWEAE AGGS PDVRS 

T 
Li 


13103 


27004 


a : 


13243 


228 


56 


KGAPPVLKPGYPKAAKNPTPFKPPGSKV 
GENPPLF*NPCPGGGNPFPTFFFFFFFF 

r 


13104 


27005 


A 


13244 


1687 


612 


ILGNQCCKFDAYNALANESTMIS IKLNE 

*\rO RP PP PITPITR VPP VPRP JfPPPHinfirr. 

TSSKGTGSTATFHV 


13105 


27006 


A 


13245 


220 


1 


TKDPSSTMPPQPNPLdiSFKSQFIFLIFL 
QPRSS RPAWATWRNP I F 




7^nA7 

A f\i\JI 


A 




XU*f 




ASWPPGLOYELRI I KCLLHCWTRAMIFR 
ERERRERERERERERERERE*VHLKRKG 
A 


13107 


27008 


A 


13247 


158 


2 


IWVFKETFFFQNFFFFFGKENNFNGFFF 
FFFFFFFFF*DRVSLTLHPSLGYR 


13108 


27009 


A 


13248 


221 


3 

4 


NPSQGFPLIiKIFFFFIiRWSLAIiSPRIjE* 
NSI*KN*KI*KISWAWWRTPWPGAMEA 
EPGE SLE PGRQRLQNE 


13109 


27010 


A 


13249 


81 


341 . 


GELNDT I HVKYLG * Y I KVL * RNRTNRVC 
VCVCVCVCVCVYACTQKDLF*GTGSCNC 
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kt— vviu ihitiic /\ciu, r xnenyiaianinej 
G^CIycine, H=Histidine, I=Isoleucine, 

K=T/vcinp f jsl .ai ifinp 1W— <*th inn in* 

N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
•^Threonine, V=Valine, 

W=Trvntr\nhgn VsTvn\ein» 

X=Un known, *=Stopcodon,/=possible 
nucleotide deletion, \ppossible 
nucleotide insertion 














vc 


13110 


27011 


A 


13250 


299 


1 


ATARLRFLHGAWWYAS WP PTQEAEARG 

FLDP RS SRME *AM ILPRCTPASVTDKI P 
c r Jh en tup d com pvt t .t . c otr& dtpj* d c 

RWPPPSPLCPAGSRA 


13111 


27012 


A 


13251 


106 


146 


LCVLKQVKQATWTKYSSSQI FAI LDCMT 
TIjS SHP ALKYCNYVL I SNQ S PNHLRL * C 

OnH/ITCnOPXTT/T DT* * GTVTOD» C A OOTrTV^r/^ 
ovjfl X oAxlL-TJ LiCi-i Ftj w o JN o F Ala avAvL 

FKASQASYLD 


13112 


27013 

t 


A 


13252 r 

t 


217 


158 


LFEIFCIPFFFFFFNFFFETRSHSVSQA 
GVQWCKQGSLQP* IPINFFWGKEGVIDK 
LtlLL S YGNAQGr KV AVaFVLTa iALQPGR 
QSETLSQKRN 


13113 


27014 


A 


13253 


150 


56 


TIL I Li It JjLi x b 1 ii b Qb VAQDG V Q W CDLGS 
IQSPPRSSCHSPA*AGVNRLRQENGMSF 

CM? T^TT O TlT>D r P r P7\ T UWTyiTCFT etITTT 

CtAr«X VajoJtUKa liUjHPGL^lb IL&nXX X 

IILILSTISFHQLLH 


13114 


27015 


A 


13254 


174 


145 


PCLKMTTYDFKSALE IAFFCFTAILRYK 
LLIKVTHFKWIJ^VAAHDCNPNTIiGSGGG 
RITWGHEFKTSLANMA* PTTRS YSQRVG 


13115 


27016 


A 


13255 


179 


273 


GQWLTP VI SAL * B VE VGGLLETRS LKQA 
WAT 


13116 


27017 


A 


13256 


264 


382 


oluvX WWrx F V X FTL w n VKARGLL 
E PRS LKS AWAK 


13117 

■ 


27018 


A 


13257 


1199 

- 


1563 


YLWGPAVPPLDLEPGQAGATCGPVSHY 

JlXr/uUjWAV(jbXXUJX^F(jFlS w l7&c LkbVr 

TSLCFIYGVTYPFHSEQPHPASPSASGL 
LLLKGRAAWLVSTLC IGSPAPMLTGX FP 

WKT/VKTOT .P T 


13118 


27019 


A 


13258 


194 


242 . 


I LYVETGS H YVAQAGLELLGSGNP PAS T 
S E I AG I E A * ADE F I YVGVSWQDEG I KSM 
KQVAHAS I PAISEVEAGGLPEPRS SRPA 


1311? 


Z/U2U 






322 


120 


Ijr i F v LiJJiVr y UiLLo r 1 ok*JsjJ\L)\j X XirA 
FPIRKQGGTQWPBCP VI PTLWEAKAGRFL 
♦PRSFRPAWAT 


13120 

* i 


27021 


A 


13260 


237 


1 


TSKKISLGWWGKQGI PGTWEG*AKRPFE 
KKKKQKQKTLKLCTHSRITYSRA 






7V 

» 


1 jzol 


D2 




PSSWDYRGITRELFQRFPWTFLQLITAV 

T Q ^ TVTilCNT.RTi AAVR Rc;HVP VTMMaV 

PINPF 


13122 


27023 


A 


13262 


274 


1 


KKKKNS PVWWWTPVI PGSPG *AGELLbP 

n v y itny v v t i-i>vxv v v trw a in. v v yuv<r j_i 

AliQPGEHQGIFVSQKKKNPKTKRKYWAL 
FCSLPSC 






A 




717 
fit 


R77 
Off 


NSTKEMAHWPGWAHTCNPSTFGG*GGW 
IT*AHEFETSLPNMVKPHLYKKYKK 


13124 


27025 


A 


13264 


288 


131 


SLYIWHSKRLITITNNKISGVWWCLPW 
S S TWEVEAGGS LEPRR* RPAWATK 


13125 


27026 


A 

t 


13265 


80 


287 


FMNGEAS * KTSILSQAWWCAPIVPAAQ* 
NEAGELLE PRS E WLVWATRALRI s SRGL 
RFRFRLRRFTSTR 


13126 


27027 


A 


13266 


3 

♦ 


241 


VGLFLFFFETESCSVTQAGVQWC^L^*!^ 
* PPPPGIPJ3SPASASQVAGTTGTHHHTW 
LIFLYF* +SSGFHYVGLGRSSNS 
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Predict- 
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nucle- 
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sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=Aspartic Acid, 
Jb-Ulutamic Acid, f—rnenylalanine, . 
G=Glycine, H=Histidine, I=Isofeucine, 
K=Lysine, L=Leucine, ^Methionine, 
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Uj=Oiutamine, K=Arginine, 5>=oerine, 
Threonine, V=Valine, 
w— iryptopnan, x—iyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \~possible 
nucleotide insertion 


13127 


27028 


A 


13267 


103 


359 


. ICVYVYVCIYVYEMRATSKDFLNQVWFS 
rLiK i VAVJ-ir Ea X Jj JL VXi *MWI»5 rnlHIYI 
HTHIHIFIYVYTHVLIYVYIHTYSYMCI 
HIYI 


13128 


27029 


A 


. 13268 


172 


12 


SARSSQAIQIQKS VTSGQVKPQW* QS * P 
TPFLPTLIALTTLLLPISPFILI IL 


13129 


, 27030 


A 


13269 


2 


203 


R I EN I RS YKNIiAALFMTAKKMGT I QMPH 
NFBWINKMWYIHTLEH*SDKKKKKKKOT 
RAEIPKLIIQK 


13130 


27031 


A 


13270 


236 


365 


KI YRQGAVAHTCNPNTLjGGQDGW I T*GQ 
KFETRLANMVKPHLY 


13131 


27032 

i 

■ 


A 


13271 


371 


15 


QNQINFALILI INTLLALLLI I ITF*LP 
QLNGYIBKSTPYECGFDPISPARVPFS I 
KFFLVAITFLLFDLEIAT,T,T.PLP»ALQT 
TNLP L I VMS SLLL III LALSLAYE * LQK 
GLD*AE 


13132 


27033 


A 


13272 


2 


16 


PRVRTS SRSRAALELI FFFFLGPPNLPV 
YNGPLGRTKPGTGELDTGGSPILCVGQG 
RHP YWKGGAKPLAPFGKGGGP * GTRPLA 
GPIVPPKAGLQSEMPGMAPFCGSFG*PT 
RpD 


13133 


27034 


A 


13273 


329 


290 


SSALVPRLECKGI FS AHCNLCLMGSgNS 
PTSASRVAGITGVAGQATDKTPQTLS*R 
SQPSHMDPVRWSP 


13134 


27035 


A 


13274 


175 


13 


APCDHRPCPPENNPL*L*FSITFPNPIK 
RPHPYLPLLTLFSDSAHLHPGEIEQR 


13135 


27036 


A 


13275 

- 


355 


861 


PLTTTPAAPRAPCPPSRLSGQPLTGPTE 
GSRSRIiSPNISEQGEPPLALTVGHPLST 
QPGPTVPSELEPIQGPRG*GDCPTPSQS 
A*GGVLSCTPESHTEFKPPPTGGGRRWA 
RLGLNGAT * GREE PLQTRLPAEYPGPGP 
IDPLQPPPISTASMATAFSDFLLLGRDP 
A 


13136 


27037 


A 


13276 


*\ m* m» 

865 


m* m* mm 

667 


KFLCILLDFLFF IF* EMRSQHRLE CS CA 
1 I AHCS LKLLASTDP PTSAS * IAGFTGM 
CHCAQLFLTF 


13137 


27038 


A 


• ^% mm mm 

13277 


103 


A mm "* 

375 


WSRRLPWRRGLGYIELFQGLEIRHHFLF 
GPHYLRRTQCQGPVI PS ELDGQGWD YMS 
PGV*DQPGQHDETPSLQKI * KTSSAWWH 
APWPAT 


13138 


27039 


A 


13278 


177 


3 


QQTEGSRAHSNSHRRPGAvVtiASNPWTL 
GGRGG* NI * SQKFKTSLVNMVKPCLYGR 
V 


13139 


27040 


A . 


13279 


379 


372 


SR*WVCMVAHACDPSIIiGGQGGRIT*AQ 
E FETSLRNMVRPCIiCLGNTNI YI Y IHTH 
TYIFIQN 


13140 


27041 


A 


13280 


83 


1184 

i 

p 


PPAHAARAS PPSYTWLC YEVKI KRGRSN . 
LLWDTG VFRG PVLP KRQSNHRQEVSSWE 
CRKHI S KMS GGGIjSTVYFFHRRFQ ITWF 
VSWNPCLPC^VKVTKFIAEHPNVTLTIS 
AARLYYYRDRDWRWVLLRLHKAGARVKI 
MDYEGERCRGQGSMTGRNSLRDGWICNA 
INYASLHRTLKEILR*GSPSGLIVSLLS 
P PAHPPEDS PGNES WLCFTMEVTKHHSA 
VFRKRGVFRHQVMKS YliHPK* EL S S WE 
RRKHNT*HTNYEVTWYTSWSPCPECAGE 
VAE FLARHSNVNLT I FTARLC YFWDTD Y 
QEGLCSLSQEGASVKIMGYKDFVSCWKN 
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E=Glutamic Acid, ^Phenylalanine, 
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K=Lysine, L=Leucine, M-Methiomne, 
N**Asparagine, P=*Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














FVYSDDEPFKPWKGLQTNFRLLKRRLRE 
ILQ 


13141 


27042 


A 


13282 


1558 


1824 


S F PYLAS FP PCLGL PLETLFHPI/3L * KD 
♦SDTHTOTHTHTHTHSHTHAHFPSFPDP 
LFQSSPFSSGFIDEYKYPHLWPVMSVTC 
CRFCV 


13142 

■ 


27043 

■ 


A 

• 


13283 


1 

• 


519 


PHFFLPQGFWGFFSPFPL*KSSSPLKAL 
IFLGGFSPFFPPPKKRFFSKIPRGVFFP 
PPKGKKFFFPPPG* IWPPQGFFLKGPPS 
SSSSSSSSSSSSSSSSSSSSSS 


13143 


27044 


A 


13284 


349 


159 


CPLE PKKNYP FCFF F* ETGS CYVA* AGL 
VLVASSNPPASASQIAGIQGTSRHAQPC 
RIFLSKT 


13144 


27045 


A 


13285 


748 


869 


WLG WAHACNP S S LGGRGGQ IV * PQE FE 
TSRGNMARPQSL 


13145 


27046 


A 


13286 


315 


410 


VQWFTPVNSALWESEAGGLLEIRS*RTT 
WAT 


13146 


27047 


A 


13287 


355 


499 


ARCGGMHLYSQI FGRLRWEHCLR* KVLS 
QVWWHAPVFADIWEVEVGALIjEARSLRL 
* CAMLLPVNSHGPPTWAMQ 


13147 


27048 


A 


13288 


880 


701 


KS YFSSHLHFEGKKQVSS ILD* S ILVLM 
CSWLMNYTHTHTYIYIYTRTHICVHVYV 
HNF 


13148 


27049 


A 


13289 . 


379 


434 


KRGP P P PKKRGCFFS EKFF * GPP KT PFF 
FFFPKKKKKKKKKKKKKKKKKKGGRSRS 
RLADAWADAW 


13149 


27050 


A 


13290 


253 


344 


GLFYNLQKIKI FYVKNLFFFFSNS ITEA 
GVQWPNLGSLQPPPSGSNDSPASE*RIV 
V 


13150 


27051 


A 


13291 


561 

■ 


745 

• 


AWEPSLVGETNVNSFNQKYINWPGAVAH 
TYNAGTLGGQGGWI T*GQEFETTLANMV 
KPSPY 


13151 


27052 


A 


13292 


289 


2 


TLPQGEDFNKFVFGSIKQKLQINL*NTD 
FIGNVFQSWAQQYTPWPPSWVAEVEGS 
LAARSSRPLCTI ITPLNSHCSLAWATQQ 
DPAGRVGRPRV 


13152 


27053 


A 


13293 


281 


229 


LKNKNVNKEKLRQGWAHACNPS ILGGR 
GGWIT*GEM 


13153 

i 


27054 


A 


13294 


276 


441 


GLFPKFLIQKINQNWPDAWHAYKPSTL 
GVQGRW I T * GQ * FKTS LANRGRKS ENK 


13154 


27055 


A 


13295 


258 


389 


NMVEKRLGQAHAYNLSTLRGQGERIT *A 
WEFETSLGNWRPCI 


13155 


27056 


A 


13296 


358 


353 


SFLAFFNGAFLPLRGFFWKTFFFGGVCW 
RRPPL*RKKKKKKKKKKKKKKKKKKKKE 
MFKRNIGGEERGGGG 


13156 


27057 

* 


A 


13297 


196 

* 


1 


LMGKAPFLGGFI PPAFFFFLKKKKGGPG 
AVAYTCNLSTLGGRGRWIT * GQEFETSI* 
ANIAKPCSC 


13157 


27058 


A 


13298 


182 


436 


GVTI LNVRHRHRKKVTLYRDLKKVRKCP 
KLHGNLRKVFQAEGIASSKTLKGHVWWL 
MPVIPAL* EAKMEGLLEARS LRS AWATQ 


13158 

• 


27059 


A 


13299 


334 


440 


RHYWLNF*AFYHSSIAPAPQGGGHWPPP 
G I TPLNS LHVPLLNTSA* LASGV* LT* A 
HHRLVPNNRALf I QALRITRILGLYSTL 
L PA* KNFEAP FT I S DGVYGSAFF* ATGC 
HGLNVI IGSTFLTICFIRQLIFHCTSKQ 
•LWLEPPAWDW 


13159 


27060 


A 


13300 


359 


372 


KKKKICGGKKNPPNKKKVKPRGEKTPLK 
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Amino acid sequence (A s Alanfne 
C=Cysteine, D=Aspartic Acid, 
E=G utamlc Acid. F=Pnenytalanine. 
G=Glycine, H=Histidine, I=lsoIeucine, 
K=Lysine, X=Leuci n e, M-Metb ionin e, . 
N=Asparagine, P=Pro!ine, 
O^GIutaraine* R=Areinine« S^Serine. 
T=Th reon ine, V=Valine, 
W=Tryptophan, Y=Tyroslne» 
X=Unknown. *=StoDcodon.A s nossible 
nucleotide deletion, \=possible 
nucleotide insertion 




• 

■ 


- 




• 




GV*PFFCLEGFFSPHKFFFFFIFSPNIC 
SPKKS FFLKF^PALFFFNLPFLKNI PSP 
PFLKKASF*BI FFCGPPPFFFFFQNTLF 
FFPPFFFFFFFFFFFFFFLDKFAVY 


13160 


27061 


A 


, 13301 


233 


406 


VTVEGMEEVLAGSSEGMGSRPTS * V* FL 
KYLLGAVAHAHKP S TLGGRGGRIT* AQE 
LKTSLCNKVRPRLYKINK 


13161 


27062 


A 


13302 


92 


429 


GSHFNSELIEMKFYIIKRNIFIILNCFY 
L I KHQH FHYTKFLL VSV INKF * LATVAH 

mmm mmm mm ^mmr * * * V W VA«k V A^h A Mi A mm W • AAA 

AYNLNTLGGQGGRSA*THEFKTSLGNTV 
RSCLYKNLKMSWVWWCTAIWEABVRGLP 


13162 


27063 . 


A 


13303 


302 


140 


GGAVNS VQS QAWWHMPWP VP WEAETGL 
LKPRSLSL*CPMIVHVNSHCTPAWAT 


* 




A 




i lis 


111 


LRS PS FVIiS PRLKCGGTI MVHCS FDLI/3 

♦RNPPAIiPSKWETIGLHHBALLRIiLIF 
FF*KWWLFF 

A A> V V Ar 


13164 


27065 


A 


13305 


90 


24* 


PG WAHACNPS TLGGRGGR I T * GQE FKS 
SLCKIAPPPERKEGRKEGGKKKERG 


13165 


27066 

■ 


A 


13306 


• 


310 


DSS FS P IMCFGMS VNKTGLWLGKKKKKK 
KKKKGGPP* KKPIjGGPNTiPGGGKKKFFP 

♦RGAKKNPPGDFWKiCrLFIiGGSKLGPPP 
POK*NPFFGGGKIGPTPPPKIKPPGBKK 

Aj *«* Ai A WWWlWAWfa A At A> A A^A4M AJ VM*W\ 

KF 


13166 


27067 


A 


13307 


179 


56 


W VLVYKKQS I LG * AQWVMS VI S ALWBAE 
AGGS LVS TSLRLA . 


13167 


27068 


A 


13308 


425 


290 


KKNIYLAPPGYFWPPQRFFLSPPPPPNV 
VIFFFFLiPFPPFIjFFFFPFIjTOYJXRGIG 

V ± L i A X Ai4 A» A> A> AV aJa k £f *V 4JJ aV U Jk«^Aj\7 JL \J 

VMRGRGEGKMGR*W*KFFGRRVLNIRSI 
LLANFKHWPYW 


13168 . 


27069 


A 


13309 


214 


371 


IYF* * RW I SQAWWLMPVI PKL WEAEAGG 
* LEPMSLPGG YNETS PL 


13169 

• 


27070 


A 


13310 


975 


572 


AWGMWGVGKHSLVS LGIEECQASTALS 
LDKSGWWEEAARREDVLCEDRRCLLCHV 
PAGVRGS LKPELGSRKGKNGOOSGSKPS 
VPSLGPRPQGPGNPALSKGTRPN*AICF 
LCQTPADHS AKKQAPHTL IPIR 


13170 


27071 


A 


13311 


270 


10 


TPNKSLI * RLQFSNI KLANLGLGTMAHS 
CNLNTLGGQGARITSGEEFKTGLGNIDP 
I PIESKVI KTLAGMVSPDAWVDAIKRTG 
TSG 


13171 


27072 


A 


13312 


103 


290 


LTEWSGVIL*NCIYLLPQYTSTRVHAY 
IHTYIHTHTHTSGLSSTSVGSTNHKSKI 
FGGKKG 


13172 


27073 


A . 


13313 


136 


1 


SWISS SLVTQAGGQ WCELGSL* PLPPKF 
KRFSdiSLPSSWDYRRL 


13173 


27074 


A 


13314 


88 

* 


183 


RVSTLLKKSCFFFFFFFFFFFFFFFFFF 
FFFFSSQKKKNLKNKKRE*KKSCFFFFF 
FFFFFFFFFFFFFFFFF 


13174 


27075 


A 


13315 


481 


517 


XiLS P * RP KGKP KG KKRKPNLKE KKIGEA 
RGGKERQRKTPQRTGKQKKRRKYRRIGR 
RNYR 


13175 


27076 


A 


13316 


307 


19* 


GMHLYLPRTW*WVRVIPGSREIEVHAPL 
EPGSQRLLVAEMTPMHSSLDNMNKSPFP 
QPPKTEKQTKNPPKPHIIA 


13176 


27077 


A 


13317 


2 


209 

*- 


PARALDLKGSP YMES CSDAQAGVQDS I Y 
GDHL*LRAPAV**GQTVFVASPSKVGGI 
TGASHHPGLFFLF 


13177 


27078 


A 


13318 


3 


178 


SFCFSVLREIIGQSLIMKTLP* ITHTHT 
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. HTHTHTHTHTQ RE I ERERDRERGESQVL 
GC 


13178 

> 


27079 


A 


13319 


40 


382 

♦ ■ ■ 


DSVPSLIiLKLLMETGPHHVVQASTTTPS 
*FFILEGFHHVAHAGLKLPSSSRPPHIA 
SQSAGI YGCEPLRSARLI INYIGSRLWG 
LRNICLSODHEECLLCFILOAISMYRPC 

LG 


13179 


27080 


A 


13320 


107 


1 


PPL IMANSGWAQWLTPWPTLWEAKAGG 
♦SEVRSS 


i ^ i sin 








lit 
in 


5oJ. 


AS «5 OLIjE LATOVPPGT WT P PTT^ftV fVtfD™ 
SGKTLVTS KI IAGLKMPKDSRSRPRTVA 
HTYNSSTLRGQDRRIT*AQEFKTSLGKI 


13181 


27082 


A 


13322 


49 


108 


CGLY* FFF*ATPJSWHVAQAGLEFLGLSS 
FPASASSVAGTTGMCHHTQLHY*F*V 


13182 


27083 


A . 


13323 


1 


257 


ARGERERERERERERERERBRERERERE 
RERERERERERERERERERERERERERE 

D T?P 21 P PP T? * PI"*V+ PrtPTl/OnP PDDITPP & 

PPRVLGAVAIYIEWCLASLIQ 


13183 


27084 


A 


13324 


1 


248 


EFQYILCSALI FGRSKILK* ATKEVKKS 
KJII PRI S CQDTRICniWSAKAKRRNTTGT 
GRMRHLKIVYRRFRHGLHEENLFLKH 


Ml Oil 

13184 


27085 


A 


13326 


375 


• 
I 


PFKKKv»itltt^bPPl\.Vr FPFQWN* 11 PPP 
QLIICYL*RGGVFFSU?LII*APPAFFF 
FFQFFFFFFFFFFFFFFLQGQYWQFRDT 

RWGQCVPFPPTRP 




1 /UoO 


n 


1551/ 






NWl3KT^PPT.RPnNWi?T .T? IfTDOT VPnT.VKTP 
r» ±r Ivr! V_ it i_in..r ini r IjHIV.1 Jfy IF C V^lXIVXS C 

FFFFPKKVFYPPKKPFLEKPWPPVLN*K 
KPPPPFFFGWGPGGPQYCPFKKAPPLFI 
RGKERFPLWGKGD FFOI PCPPGPIiKKKK 
RAAARDLGTS 


13186 


27087 


A 


13328 


2 


220 


GRVGSSRARAVALFFFFFFFFCFFFFFF 
ONPPI1KRGEKKKNTPPLKKKRPI1RGGFK 
KQKEC*EKKKKLSGPK 


13187 

l J JO/ 


Jm 1 UOO 


a 






7.RQ 


I OLWSSL * K * LHTHTHTHTHTHTNTCFL 
TVMKBLSTHPGNK 


13188 


27089 


A 


13330 


203 


269 


INPGPPREPGGFNQKPQFPSAPGFEPWN 
PPQGPKP* KKKKRKKPFKVWAPGGGSKV 
ONPGLRETGVFG 


13189 


27090 


A 


13331 


216 


418 


DVPVHYCRLYKPCTLRLHSI YKKQ ILLW 
PGAVAHACNPTTLGGRGLWI T*GQEFEA 
SLANWKPHLY 


13 ion 




A 




lOU 




werol fk i aosgiarwltpvi pat.wrar 
EGGLFEYTSLR*IjWATQQDPISTKMFK 


13191 


27092 


A 


13333 


339 


62 


GMLPLFVPPQKRGSPPYPCYGVYNSPPL 
KKQRFFSSLGIVLPPIVFITPPPPAFFF 
FFFCFFFFFFFFFFFFFLI*RRLMRRRM 
FSCFLHCP 


13192 


27093 


A 


13334 


45 


384 


DPS VRINTLLAT 1T1T 1 1 RI TF* LPQLNGYI 
EKSTPYECGFDPISPARVPFSIKFFLGA 
ITFLLTOLEIAIJ^PLP+ALQTTNLPLI 
GMASLLLI 1 1 LALSLAYE*LQKGLH*AE 


13193 


27094 


A 


13335 


270 


660 


AGSRRPLRVPGFSLSMMSPSLCRPVCVA 
QC IRVS FLLTVGAVLRPGFQCLDWP CLV 
YARLSGWTG FPPCRYGKGCCRYDEGCCR 
FGEGCCRCDDRCCRCGEGCCRCDDGCCR 
YDEG * CRCDDGCCH YGE 
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nucleotide insertion 


13194 


27095 


A 


13336 


298 


349 


KTTP F YGLRGS KIGTL * CSGVI I AH * SL 
KLIX3S S DPLS L VAQAARTTGACHHDQLT 
NAFF FF PLERVPTPLKNNPFL wVKGVKN 
WDLD 


13195 

■ 


27096 


A 


13337 


865 


518 


KSATMRMVEDTHKMPAWGGYSGSLQQER 
GELSSLPCP* PES PPPLLI FLKLPSNPL 
PCSPLPCTQGPGQPAPYVSIHTSLKFPN 
AKLS VCLSRAI E FS 


13196 


27097 


A 


13338 

i 


154 


354 


FNLQVKPDGYFVLFICHILPTDSKSGKA 
PGAPCHSRNFKIRKTLGWVQWLTPVILA 
LWET * FSHQENSGLGAVAHTCNP STLGN 
QGGWIA 


13197 


27098 


A 


13339 


262 


11 


QFHFSRKLTP * KM* LSAHS SNTHEAYVA 
FP KAHKPGLNS KTAF * LGTEAHACNPN I 
LGGQGGQITRGQAFKTSLANMAKPRLY 


13198 


27099 


A 


13340 


358 


164 


TKKGKPRFFLKNS KI SRGWWWAPVISGP 
WEGEGGAFV*TGRPKIWLTEVGPLPFNL 
GKKKENPF 


13199 


27100 


A 


13341 


147 


2 


FRPQKVSVRVGVLAHACNPTILGAQGGH 
II*GQEF*ASMANMVKPSSC 


13200 


27101 


A 


13342 


367 


17 


TSRS WNLFHVLVRFEJ.TADKDIRE IGS FT 
KKRS L I DLTI PYAWAS FTLMMEGKKEQV 
TSS INGSRQKVRSQSGELLSLTPSDFSW 
AHWLMPVFPALW*SETGSXjFEVRPSRPA 
WPTW 


13201 

• 


27102 


A 


13343 


166 


492 


EGTQETLCGCI I CL VRGDALNL FHL KCS 
WVGWRGAI CCMELRETAKQKLS VWKLYP 
FEIVFS F SNNNPRPGAVAH I CNPNTLGG 
♦GGRIARAQEFETNLCNIVRPHLFR 


13202 


27103 


A 


13344 


804 


1124 


TFFFCFFL* DRVWDVAPGWES SDMIMGP 
LQALTSWGSTDPPTLGLQSGLGDLQDTW 
PPYPTSFYYFFLQGRGLTMLPRLVLNSC 
TQAI LPPQPGQRSKTPISIiATN 


13203 


27104 


A 


13345 


280 


448 


GGGGKKKKTPKPPLEKNNFS P PP * FFPP 
KKQIKPPPPF*GGGGKKKKTPKPPLEKN 
NFS PPPFKPRKGI FFLFPPLS WVKKKGD 
PPGGTRPLAPL 


13204 


27105 


A 


13346 


232 


400 


AELLRYCS CGHFL * S MARYKTKPGVRVHj 
TPVI PVFWEAEAR * LHEPRS SRPASATQ 


13205 


27106 


A 


13347 


138 


332 


RAWKPSFEIFYFFF*RAHTF* *SCVCL 
CVCVCVCVCV* THTHTHTHTHKHTHDY 


13206 


27107 


A 


13348 


314 


395 


WLGIVAHAYNPSTLGT*GGWIT*GPEV 


13207 


27108 

• 


A 


13349 


388 

* 


371 


IMNGWQDKELVSRVIQTGIKK* KPRNRP 
E FRAPP * WHERNGRKERERMEGRKERER 
KKGKGKERKRKKEACIYKVKIEISNNLA 
NLIINVER*DRRRPDILSWVSLSSRVSS 
VFIT^VTCLrSPQFFLSLHLNQILFPlAi 
KNL 


13208 


27109 


A 


13350 


412 


i/V 

40 


M * WAFGDIMYTFDLLFI KKTKNNCKLW 
QGCKEEGSLIHCWRECKLVQPL*RPI *R 
SLKKLQMGLPYDPAISLLETYPKERKSV 
Y * RNI CTSGRVG 


13209 


27110 


A 


13351 


372 


148 


FFFFFFFFFFFFFFFFFFATGS CRVA* P 
GVKWLISGTVPLLISTGVLTCS ISDU3Q 
PVHTSLGNLWPYSQEIjPY 


13210 


27lll 


A 


13352 


209 


206 


KKKKKNFFSFHGKNPGNGGPFGGPPPPP 
F*TFFKKGGGLPKGPPPKGFFWNPPQNG 
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■ V3uv7Jr J. v?i7£»JYiw tr r^cjL^jy ur r r r r x\x r 
KKRPTAFFFLKKKIFF 


13211 

■ 


27112 


A . 


13353 

* 

■ 


43 


334 

• 


KKKKKKKKKKKKKKKKKRGGGPFIKISG 
rsan ppp^p kttpppppt -/Trsfzw tpp^ppp 

KKPFFWGGNFWPPPPQKNFPFGEKKKFL 

GGGGGKNRVFFWGEKIFSLGFFLKKFF* 
Tfifpp<'iirNPPPPinrwPVPQPPPP*irpppp 

P KGE I FLGGGG PKI P PPKKRFFFKKPPG 
GFFS PP * KKKKKFFLPPEKLGPPRNFYK 
RPPPPFFFFFFFFFFFFFFFFF 


1 7010 • 


0*71 
2/1 1.5 






AOH 


AAA 


KPT I liRVTHFIiEGCEKYEFLLPYAFSNF 
SEPLKRYYYWLGAEAHVCNPSTLBGROG 
QIT*GQEFETSLVNMVKTCLY 


13213 


27114 


A 


13355 


183 


9 


IRS P KI PEIRVFFFSRWS YALVAQVGVQ 
WfiENrr,<5 * MWPPPAPFOPIiSSIiSVPST*ICY 

R 




001 1 < 


a 


1 JJJO 


Ofifi 
ZUo 


30R 


PITLGGQGGQTT+GRAFETSIiDIMEKRY 

L' 


13215 


27116 


A 


13357 


256 


341 


KGQTQ WLT P VI PTLWEA* AGGLIiESRS S 


13216 


27117 




13358 




"3 A A 

344 


FLFPLFSCLCKFFFFFFSPBTEFCFVPQ 
AGVO * HNLG* LKP P P P0LKOFS CLTLP S 
SWNYR 


13217 


27118 


A 


13359 


259 


291 


AQ*IiMPWPALYEAEAGGSSljEPRSLLS 
VWAT 


nil o 


27119 






AAQ 

44 o 


lO 1 

121 


YCRIYIYVYIYVYICIHICVYM*TYMCM, 

YICIYMYTYTHVYTHVYMCIHTYMCI*A 

YICVYIYIF*KACLPLAISTLSSGH 


13219 


2712U 




1 Jiol 


125 


2E)2 


P Q RO Zl Q VTT JTT Q VTVM Q WVD WT .TP T T QTT 

csrund vor idxvi v nan vunxix jt jlxo 4 a. 
* EAQAGGSLEARNSRPD 


13220 


27121 

1 


A 


13362 


362 


46 


ARAKGPKKIGFSGKMGPP*GAPPPKMGK" 
?TT * T TP P PTTPT .PP PT /SKTK I TOJPPMGPW 

PFGFPKKKRGGGARR*KTPSPFREKPPP 

QKLKRFKTPFPPLFFFKNPRP 


1 JZZI 


OOIOO 


a 


1 jjOj 


J to 




PTOCA AH PTTU^TRlCPPGFVCLFWDFFWRd 
SHSVTOAGVQ * CDLNLHPPGS SDS PAAA 
SQVAWTTGTHHHTQLI FIFI FCRNKIS P 
SLLKKYKKNLPGWAGACNPSHLGG 


ID AAA 


001 O^ 
At IAj> 




1 J JO** 


1 il 




SLTFIIiEKRECYYPRSLFTVANFTVAKR 
* QTLNCPSTDKW INKMCHIHTVEYYSDI 
KRNEIPMRATCRQTLKA 


13223 


27124 


A 


13365 


76 


280 


PLLP FKAWQWVGCHYW* KLD * INCYFR 
ATJ^APSEYMYIYIYTPTHTHT* IYIYIY 
I YKLTYSHLKVG 


1JZZ4 


ooio*i 

2 / l AD 


a 


1 

IjjOO 


i jj 


30A ■ 


x lilt iijv iwuUJiruuuvwuv<i>4iUnvc<i . 

IATLITLNTHSLLINIVPIAILDIiAAGQ 
AAVG LALL VS KKKKKG P PLKKP P LGAQ I 
SPANQRKKFPPKRKPIKTRRGTF 


13225 


27126 


"a 


13367 


488 


190 


FSYAACFLPQCVFVCTFIYLHADGFLLL 
I IFFENSVFILCHS CWVFCFLLQWFLLM 
NLFSLFLLMLMHPVS*MECSIYSFSSLI 
IFHFSLMLLFFCFMI 


13226 


27127 


A 


13368 


229 


372 


Y YATKD FSRPRAVAHTCNPS TLGGRGRQ 
IT*GRQFETSLANWKPQLY 


13227 


27128 


A 


13369 


122 


2 


KEECVGSGTWWVMPVI PAL* EAEGRGSL 
EPRSLRPAWTKK 
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Amino acid sequence (A- Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=G!utamfne, R=Argininc, S=Serine, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V^possible 
nucleotide insertion 


13228 


27129 


A 


13370 


43 


412 


RPTRPKRQNYGDSENISGCQGLGKKGMN 
GKSTEDFEGSBTTLYDAITVGTCHNKF I 
RSHIVYNTKSGP* HKLWTWGGHDVSV* V 

«i m mm k mm mmm_ mm mmmm ii ■ ■ i r urn j _ m m j~mt mmm mm. AfcM #M MvKWI f <V 

PQLRKTYHPGERCL * *GRLSMYRGRB Y I 
GNLCIPCSLLL 


13229 


27130 


A 


13371 


136 


1 


KS PTW PGAMAHTCNLSTLEGQGG WIT * G 
QGFETSLANMVKPSDAW 


13230 


27131 


A 


13372 


228 


385 


GLLIHKLEFKHFWLGTVAHTCNPSTLGG 
QGRHIT*GQGpETSLiANMAEPCLy 


13231 


27132 


A 


13373 


277 


429 


LETTTKAGLPTIIFTSSGQMSIWFFQNK 
PWDSKKLFNLLSISSP**SSKNY*PWQS 
TVAHTCNPSTLGGQGRWIT*GQEFEISL 
ANMVKPARVGRHVIRGLQVS . 


13232 


27133 


A 


13374 


194 


3 


NQENY I Y I YTHTHTYTHTYI YI L * LLLQ 
WVYLAYFLCLSYPSLF7VQGFTHTTYTHT 
HMHTLIL 


13233 


27134 


A 


13375 


377 


228 


DRVLLLL PRLE CRG 1 I MAHCRLPRLVSN 
S*APALSFQSAESTGVPNVPS 


13234 


27135 


A 


13376 


255 


465 


NAWKCPF I YKI TFVI FLIHVS CKKFRNY 
RQREWKLPTVRPLSPS*AI IFPVTCTYT 
SRWPEATKDPQKK „ 


13235 


27136 


A 


13377 

- 


336 


38 

- 


VWWCTPVVPATQEAEVGGSIjESGRLRLQ 
* AVI TI»VNEHRES ALASRRGPEETS SVK 
PPLPTILAHTCFSLPRTGQDITSRFLAQ 
RNTEENLELQMEARA 


13236 


27137 


A 


13378 


376 


293 


FFFFFFFFYYFFFFK*KFLTKKKILSSQ 
YI 


13237 


27138 


A 


13379 


215 


16 


HLTWSFTTATEGSKTVTQHSVYRKAKLG 
IX3AVTHTCOTSTLGGLGRWIT*GQEYKG 
IPPHGLEDVQ 


13238 


27139 


A 


13380 


184 


64 


VDESLEGWMMDEWMSGWRGGCINRYMHA 
WMDG*GDGWIGG 


13239 


27140 


A 


13381 


233 


417 


LKPITKGRKPRGFFLPFKPKQKKYFWGF 
KKKKKKKKKKKKKKKKKKASRAPF*KKG 
PQKTP 


13240 


27141 


A 


13382 


34 


360 


RWNTTNADHDLKDN I LSPPQINLYIRQN 
S SRLFCRH * QMDS KS PVKI PAGFVLYI * 
I YRYTHTHTHTHTHTD SELYMETHTRMA 
NTIVRKNNSLEHS YYLMLRMTI KLP 


13241 


27142 


A 


13383 


214 


71 


QTLNTD S G P GW I HACNPSNLGGHGGKI 
I * GQGFETS LANMVKRCLY 


13242 


27143 


A 


13384 


352 


312 


DKQLTIiHRTDSLYIFYTY*PPQSCEPIS 
YNNFPCLSQYLYLCPSLHTHTHTHTHTH 
THTHTHTHTHTHVSVGRRSLFFSGPHTE 

*^ MW^ m W m k v B m*m* mm mmmm. m. 

APRSRVSVYT 


13243 


27144 


A 


13385 


393 


271 


IEGQIQYTSTIGNKFKDFYLILCKEGIM 
SREISPPSSCHLRQQ*RVRLRERDRERK 
RQRERQRERE * GRS VLHPHVT 


13244 


27145 


A 


13386 


180 


440 


PVEERTLCEDI LCFPFSVLLCIQFHLLI 
QHAC^KYPNPNSRFGSMPGAVAHICNPS 
TLGGGGRWI T*NQEFEARLSNMVKPRLY 
KNI 


13245 


27146 


A 


13387 


190 


47 


EGEKGVPSTILKMETLLGTVAYPCHPST 
LGGOGGRIAEAQEF*DHLE 


13246 


27147 


A 


13388 


188 


470 


ARPPCKGRDSSAEGP PGPPFPS WSSLGC 
WTRE P PGRG E P I Q VAVRREES AQDWAR P 
ELI I KE WWPGLVAHTCNPSTLGGRGGW I 
A*A*EFENSQ * 
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SEQID 
NO: of 
nucleotide 
sequence 

i 

■ 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: Id 
USSN 
09/515,1 
26 

- 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
co r res p on 
ding to 
last amino 
acid 

residue of 
peptide 
sea ue nee 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine. 
G=Glycine, H=Histidine, Msolencine, 
K=Lysioe, L=Leucine, M=Methionine, 
N=Asparagine, ps=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine. V=Valine. 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V= possible 
nucleotide insertion 


13247 


27148 


A 


13389 


454 


367 


NKKKPPPRPP* PPPPPLGKTPFFKKKNP 
NFPRGGGNSPGSPSFGGGKGKDPPPGGF 
FPPPPYPRGEGGNPFFKKKKKKKRASLD 
P*WSSQPCPAYBKELNFAPVLATVTLPP 
GHSS 


13248 


27149 


A 


13390 


34 


354 


HTLLALLLI I ITF * LPQLNGYI EKSTPY 
E OGFDP I S PARVP FS I KFFLVAI TFLLF 
ELEIALLLPLP * ALOTTNLPLIVMSSliL 
LIIILALSLAYB*LQKGLD*TEY 


13249 


27150 


A 


13391 


1089 


792 


THGFGPQVGSPLDLKQPQKAMPRGPSLG 
NPPPGCOOHSHGAGGP*VPPGPROPVMA 
SRNLCLPLALPRTHPPSPGQTRDKPRTQ 
DRSCWRDPRNGPGV 


13250 


27151 


A 


13392 


299 


428 


W I RCRGRGRS CLP VPHSAP ALLS PL WD 

EGSGCRSQALPREAAEARRBFKHGGIPR 
H*LTESLVSWEFRHA 


13251 


27152 


A 


13393 


382 


413 


HGKTHLYKKFKNKKFKATMPA* WLTFVI 
PTLQEAEAEASLKPKSSRPAWVTWQDPS 
LQKI 


13252 


27153 


A 


13394 


64 


401 


GGIPKEDSQAGAFTGIGERQQGLVIPLL 

AGEGGGQGQECS LGLLPEANRCHPCNPS 
TLGGRSGQIA*GQEFE ISLGNTVRSCP Y 


13253 


27154 


A 


13395 


146 


18 


FSPGWAHACNPSTLGGQGGQIT*GQAF 
KTSTiANTVTCPKP PO 


13254 


27155 


A 


13396 


1056 


679 


DIQVPERPL*NVPEPEAKGEPPDRAVGE 
HIDRDCRSDPAQQKRKIFTNKCERAGCR 

SGEGHPTSRAGLAAI SRAQAVASTSTVP 
SPSQTMPSCTSPSR 


11255 




a 

• * 


i jjyt 






IHSPPTORRV*ORERERERERERERRRH 
ALAERNRTREG I SGTTGERGNLMLVGHH 
LPGPLSSRRLGFCPDGFGGQHCLAQGDQ 
VRL 


13256 


27157 


A 


13398 


2 


256 


I ETLGSAVE FI P YENT YQTYLKKKNRNS 
HMQSQPNRKGHIRLKYIAAWA*WLTPVI 
PALWEAEAGRSLESRSRNPVSTKKKIQK 


13257 


27158 


A 


13399 


102 


922 


LSFFLFSETGSCSVAQPEVQWCNHSSLQ 
P *TPGGPSMQFQLPQEQGQRTAVWHGGR 
A 


13258 


27159 


A 


13400 


198 


55 


KDYSAWHTCNPSTLGGGGGWIT*GQEL 
KTSQANMVKPHLYKNPKIEF 






A 1 

** 




1Q 


*rrf 


E SALNLPSAGI IGVSHRTOP I FC I T.NATi 
ALGVL ILERSPLQS PQVPPSHSHTKPGS 
LPSVTPGRGPGRPRAAEPTAQGRRYNSN 
PVLISPGSVHPASFALPAEPP*TGAPST 
PRPCVLRPLLLGNAMDLLCTKLSSRLPP 
YRT 


13260 


27161 


A 


13402 


93 


410 


DSNLNYSLFFHGEADLGTNQVLTHPSTT 
AMYFEHYCQPP*rVHGTINT*PPVVHKN 
PIHIITPSPCLRASTAINLLLSHINCYS 
♦ATPHPLGYQQTYLPLTVHST 


13261 


27162 


A 


13403 


55 


387 


SNSHTYS LKKSAG I TKFQNWHI VNCI CI 
CQVWPLEMVNRHTVILCQYPVKPRILY 
QHHTAILVTI LTFTLRPG WSHACMPST 
LGGQGGRI I *AQEFKI SLGNI VRPCLY 


13262 


27163 


A 


13404 


39 


365 


SGDRRVRLLLKI I T F * LS QLNG Y I EKS T 
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M 
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00 
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correspond 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
it— (jriutamic Acid, * — t nenyiaianine, 
G=Glycine, H=Hjstidine, I=lsoleucine, 
KHLysine, LHLeucine, M=Methionine, 
rN^Asparagine, Jr— iTOiine, 
Q=Glutamlne, R=Arginine, S=Seririe," 
T=Threonine, V=VaIine, 
W=Tryptophan, y=Tyrosine, 
a— Unknown, w =Mop coaon, /^possioie 
nucleotide deletion, ^possible 
nucleotide insertion 


■ 












RYECGFDPISPARVPFSIKFFLAAITFL 
LFDLE IALLLFLP * ALOTTNLPLI GMS S 
iilili 1 1 1 JjAXjSXaAx K*ijyiUjliLJ ^ w Ac 


13263 


27164 


A 


13405 


32 


350 


HEFATSTSEMTKGTS* IVKRRIKTHTLC 
RRCGS KAYHLQKSTCGKCGYPAKRKRKY 
NWSATAKRRNTTGTGRMRHLKIVYRRFR 
HGFREGTTPKPKRAAVAASSSS 


13264 

* 


27165 


A 


13406 

i 


326 


668 


LQFKIFCMPGIFTISEVSSMSN*RTAYG 
SQSFHKI VLI *LATFGQALSC* IL* IHL 
CTLKNiCTKHRALVAHACNPirrLiGS 
IA+GQEFETSLGSIVRLLLSTKNLKKKK 
NM 


13265 


27166 


A 

♦ 


13407 


170 


472 


KLI S VWSEESS YCVEHGE PWPQEKEYS 
WLE I G KAEME RS TLES LLGATLEALS SN | 
PTRKMQGLKLKLQRASAFKEEPWLGWA 
HACNPSTLGG*GGRIA 


13266 


27167 


A 


13408 


178 


422 


FYNRYF * YTKTYQTEN I YCKI MSFRLLV 
* KI * SLPGAMAHACNPSTLGGRGGRI I * 
GREFETRNS 


13267 


27168 


A 


13409 


274 


444 


LFSQYI VFVHYSS FVLILFV* * LFSQYI 
VFVHYSSFVL I LFVJ LHNYTYI YIHTHT 
HTYTLIFIiLIFSQISLGWMKLIL 


13268 


27169 


A 


13410 


205 


23 


KEFLKFHRKCICQQVKKIWPRAVAHACN 
PS TLGGQGKR IT* GQEJjETS LANVE KPH 
FSNS 


13269 


27170 


A 


13411 


25 


426 


SVWWNSLETRSSRLRLLKKI *NLQNKKR 
KFEHRHAQRKYDLETQGKDAI YKLRRER 
RKEEE * EKEEKKRRRRKB KEKKKKKPTL 
LCUCKQRNQHLDLRUPASRTVKKLISW 
♦ATHSWPCYGSPSKLIQQSIH 


13270 


27171 


A 


13412 


227 

• 


52 

> 


E IGSHS CHPGWS TVAQS * LTLTSNS * TQ 
LILPPQPPRELHYNCVPPCPAFCRQALA 
MF 


13271 

• 


27172 

• 


A 


13413 


345 


67 


SKCC/VVAHDCI LLDLI CLYFLELVGCV j 
DCFSSSLGHY* PLFv* IFFLSPSIiLFWY 
SHYEHICVPHFSKAPFIFLHFFPSPEFP 

uJ-iCiKrlVx V 


13272 


27173 

w 


A 


13414 


345 

* 


33 


KVFFFLI ICKMRMQNKQKNKHGICLLNS 

LFYLFHFIEMGSCPGWSAVA* SQPSATP 
VQMQriVTT.r.DnpppnT/?KS 

v orloyv ILiLjrUc'rCiL'iAwld 


13273 


27174 


A 


13415 


187 


415 


FNQGWAFLFPFCVIFSYPWKDRNRTIFS 
IKITSFQESVLKLFCIFEVS*ILKIGLG 


13274 


27175 


A 


13416 


143 


50 


CMKLQSS I VMLG WAHACNPSTLGGQGG 
WIT*GQGGWIT 


13275 


27176 


A 


13417 


170 

• 


17 


*IKKM*YICTMECYSALRPEFPE 


1D/./0 


9*7177 


A 


i*4dift 

Uflo 


7 


1UJ 


IK* IWYI * TMEYYSAMKRKKIMSSAAI W 
MKLEAI ILGEVMQEWKTKNLMFSFISGS 


13277 


27178 


A 


13419 


901 


709 


SGRKTS LGNIARS CL YTKS FKI HCLP W 
PATQEAEAGGSLKPRSLRLQ*ARSHHCS 
PVWVTTRP 


13278 


27179 


A 


13420 


236 


4 


VIRAKTGSHNSLFPPHTFFFQGKNLCFF 
QKKKKKKIYIQSKPGAMAHACNPSTLGG 
Q IT * DQE FETSIATMVKPNLY 


13279 


27180 


A 


13421 


46 


141 


NSVISAHCNLCLPGSSDSPSSASRVAGI 
TGWC* HCNLCLPGSSDSPSSASRVMIT 
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Predict- 
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nucle- 
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correspon 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, {^Phenylalanine, 
G=Glycrae, H=Histidine, I=Iso leucine, 
K=Lystne, L=Leucine, M=Meth4onine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Try ptophan, Y=Tyroslne, 
X=Un known, *=Stop coidoo, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














GWC 


13280 

• 


27181 

■ 


A 


13422 

• 


604 


1121 

■ ■ 


LGSGDLP*EINPLSSCSLLREKDPPTTS 
GPQTDQPKKHLTNFKSAPYKTITDAELR 
VTLTVEAHLHPGE INSHVAHTKPVWWSL 
HTDAHEIWCRDSDLKLVPTLPLIPLEAA 
LRNI THS LS I PP PKNFRRPNTS TLPCVI 
FLINIRRQEyQASEPKPSHRIPCDLHVY 
AQMA 


13281 


27182 


A 


13423 


214 


22 


IRESSSTPSSPTLPESCNSIPFPF*KFI 
WSQVVAHAYNIiNTLGG*GRKIARAQEFE ' 
NSTRAGLW 


13282 


27183 


A 


13424 


397 


330 


SEYNSECS * VQWHTLWSATQEAEAEGS 
LEARS 3 RIiLCAL 1 1 PVNCHCTPAWAFPM 


13283 


27184 


A 


13425 


729 


885 


AKCYPVRNISLWPGAVAHACNPSTLGGQ 
GGRI T * GQEFRTS IANMVKPHLY 


13284 


27185 


A 


13426 


231 


2 


. ASVFFKATLVDLHW* I yiytyi ytyihi 
YTHTYTHIYIYIHTYIYLNVNQYICLYV 
HINANQQGYIYTGRTPICFF 


13285 


27186 


A 


13427 


184 


171 


LS*YIYIYTHTHTHTHTHTYILYIYTYE 
QPSSQSS 


13286 


27187 

• 


A 


13428 


372 


166 


CKFR PRK INTTFS H I CGS * KVD 1 1 EAA* 
HWLGTVAHACNPSTLGGQGRWTI *VWEF 
KASLANLLTPLLQ 


13287 


27188 


A 


13429 


391 


437 


MYSLGVGFFHSTI * LLKDI FWSGWAHT 
CNPSTL * GQDGR IT* GRKLETRPGNRAK 
LYLYKKNSN 


13288 

• 4 


27189 

• 


A 


13430 


39 


254 


EFIPRAQDLETSQGNRVRPRLYRKFKNN j 
♦VWWCASWPAMWEAEAGGLSEPGRLRI* 
HSNMDNGERS CLKKK 


13289 


27190 


A 


13431 


336 


488 


DIWPGMVAHACNPS TLGGQGGRIA* A* E 
FETSLSNMARPWLKKKLARHSGA 


13290 


27191 


A 


13432 


154 


481 


FFFFFFFQRQKNGFF*KGIFFFFKLEGR 
GAIFV*PNIiTLPG*GKSPASPL*KGGKK 
GGAPPP*LIFFFLKKKKLPLLGQGGLKL 
RALGKP PPFPSQRGG INGVKTP FGL 


13291 


27192 


A 


13433 


388 


280 


KSKLKNGCCVALCLLGLI* 3VLLA*FKT 
♦LFCVSLDMYIDTTCSLSISIYLSIYIiS 
IYLSIYLSIV*SVS 


13292 


27193 


A 


13434 


187 


48 


SISSTKGPGAVAHAYNPNTWGGGGGRIT 
♦GQEFETSLANMVKPRNS 


13293 


27194 


A 


13435 


71 


309 


DSVSEEEEEEEVELAHPLAAERYHCERA 
ET*VKALLWLELCDERVSSRSHTEEDCT 
EELFDFLHARDHCVDHKLFSNLK 


13294 

• 


27195 


A 


13436 

• 


18 

■ 


375 

• 


RPAVPGRPTRP INTLLGLLLI I ITF*LP 
QLSGYIEKSTPYECGFDPISPARVPFSI 
KFCLLAMTFLLFDLEIALLLPLP*ALQT 
TNLPLIVMASLLLI IILALSLAYE*LQK 
GLD*AE 


13295 


27196 


A 


13437 


373 


27 


THPYYSHQEYQSP* PLTGALSALLMTSG 
LAM*FHFHSITLLILGLLTNTLTIYQ*W 
RDVTRESTYQGHHTPPVQKGLRYGIILF 
ITSEVFFFAGFF*AFDHSSLAPTPQLGG 
HWP PTGI TPLNPLEVPLLNTS VLLASGV 
SIT*AHDP 


13296 


27197 


A 


13438 


575 


699 

i 


LGTVAHTCNSSSLGGRGGWIT*GQEFET 
SLANMVKTCPQKK 


13297 


27198 




13439 


247 


168 


LENL I YTRVLERHREAKVHFPFSNISYS 
DKRTDTFVLTKTH+HTHTHTHTHTHTHT 
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UULlli 
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peptide 
senile nee 


Amino acid sequence (A B Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F^Phenvlalanine. 
(^Glycine, H=Histidine, I=Isoleucine, 
KMLvsine, L= Leu cine, M=Methionine. 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S= s Serine, 
T=Threonine. V=Valine« 
W=Tryptophan,Y=Tyrosine, . 
X— Unknown. *=SioD codon. /=D0SSible 
nucleotide deletion, ^possible 
nucleotide insertion 


• 






• 






HTRGSHFPY * HTHTHTHTHTHTHTHTRG 
SHFFY 


13298 


27199 


A 


13440 


391 


168 


TKKKKHPS PRGAREVFF* QPPPPPFCVL 
★FFIVFFLSQGGGGGGGEFPPFFFPPPT 
SPPSPPPRGVWKKTPPFS 






n 


tIAAt 
1 1 


WJ 


94fi 


LFPTAIIGGPPGFPPPPF*NPAPEFFFW 
GPKKKKKLSPPPAIKMVFFKGPPP 


1 linn 




JX 
n 


1 1AA*> 




A1 A 


KNNFHPS ILGG*GGW IT* GOALETTSLG 

•VXAlAvX liC pVp\p#WW Vrf\pTf « pW pV W^TpUU pV p» UpJV 

NMVKPHIY 


13 Jul 


0*70 fV> 


•n 


1344.5 


AW 




PG J TVAHAYNIiSTLEG*GTMITOGOEF 1 * 1 
SLVNTVIPHLYQKKKKKGR 




Z7ZU3 

' • * 


ft 


1 1AAA 
1,3 444 




IT 


DPSYI1AWHMVTPHOVSTOHTLPRPRPCI1 
PEMNFLVSLKIWQCWPGTVAHACNPSTL i 
RGRGGWIT*GQEFETSLANMTKTRTRGS 

pTppVV 


13303 


27204 


A 


13445 


256 


388 


FCGFWPGAEAHACCPCTLGGRGGWIT*G 
QKFRTSIiAHMVKPRVC 


13304 


27205 


A 


1 a /i ac 
13440 


277 


374 


PLLY 


13305 


27206 


A 


13447 


14 


387 


PQVIHSPWPPKVLGI^QA*TTSQAWWLTP 
VIPALWEAKANGSLEyRSS 


13306 


27207 


ft 


t "> A A O 

13448 


lo5 


379 


L* NTKKKKRGRPFKKKKLQSLVWQNKI F 
FliAELKLLW 


13307 


27208 


7\ 

A 


13449 


J OO 


340 


T GTT .T3TMWMTQT VTMT.WT.'PTnPTiK'KRrjP 

F I FPYFFFFFFFFFFGGNHSL* SLI VIF 
LIC 


1330S 


27209 


ft 
A 


13450 


57 


1 

170 


VliPQP Pflpr^nTJXTPT.VPPQTRUiTRAPP 
D LLTG KRKP P PLGRM S RAR VQWHNHF * T 
ESCSVAQARVQQRNIiGSL*TLPPKIKQF 


13309 


27210 


A 


13451 


411 


241 


PPLFFFFFFKQKFPSVP*GGGQGWDFGS 
LOPPPPEVKOI FCPKI P PPW PPKKGGVP 
G 


13310 


272H 


A 


13452 


307 


298 


PTHSIVRNNKSL*IN*TSSMCSGLKVMR 
R* SQKQEDP* SGWAHACNLS TIiGGRGG 
WIT*GQEFSTFLCPSA 


1*311 1 

1 






1 1AS. A 
1 j4j4 


lOl 


ins 
XUo 

* 


GWSOTPGLGRS SLLS LPOWWDYRMOCRG 
VILAPHCGLDLLGSGDPPASASQVAGTA 
AFENCATS FWLP * FLNTQCFKIYKTKN 

p^p>p> *^P>^ ^PWT P> P> Prf * F» P«P> P* ^**Pl^ Pb WPH P>«^» * **■ ^ ~ 


13312 


27213 


A 


13455 


242 


400 


IATLLDYLQIPNTGPGAVAHACNPSTIX3 
GOGRW 1 1 * GQE FENSLANTVKS CL 


13313 


27214 


A 


13456 


1 

• 


346 


YVTTARCSCWAGTESHMGAKDFYRQEKH 
PFHADSGIGWA*DRLRQRYGRFGRRM*F 
SWARGAGEKHSLA*AQWLMPWTTFWEV 
KEGRSLEPRSLRPAWATWRNVSTKIKKK 
KKG 


13314 


27215 


A 


13457 


166 


289 


VRSS KLNTWVRWIjVP 1 1 LTLWEA* AGGF 
LEPRI SRLVWATQ 


13315 


27216 


A 


13458 


220 


76 


TASLKLSKSWLDWAHACNPSTLGGPGR 
WIT*GQEIKTSLANMVKPSP 


13316 


27217 


A 


13459 


219 


46 


SPPLKKKNNFFPPGVMGAPPRFFLKGPP 
QNFFFFFFFFFFFFFFFFFLKQ* EQTFL 
LIY 


13317 


27218 


A 


13460 


126 

■ 


1 


WSLGAVARSCSPSTLGVQGG* IT*SQE 
FETSLANMVRTLTS 


13318 


27219 


A 


13461 


330 


420 


N I CS WAQWLTPVTPAL* EAEAGG5 P EVT 
SS 
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corresDOn 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=Pbenylalanine. 
G=Glycine, H=Histidine, I=Isoleueine, 
K=Lysine, L= Leucine, M=Methionine, 
N=AsDflraeine, P=ProIine, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
\V=TrYDtODhan, Y=Tyrosine. 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion. V=D0Ssible 
nucleotide insertion 


13319 


27220 


A 


13462 


577 


766 


AQWHTS IVLAA* EAAAAGLLEPRS * SRL 
KCTMI I P INSHCT PAW QHMRPCLLKKNF ' 
SNSFISK 


133Zv 






I3**03 

i 




Jot 


VTVGE1EAEAGRGHGLLTCCVLKTQKWL 
GTVAHAYNPSTLGG*GGRIT*DQEFBTS 
FLG5 


13321 


,27222 


A 


13464 


167 


41 


DFWLGAVTHACNPNSVGG*GGQIT*GQE 
FETSLANMVKPWNS 


13322 


27223 

■ 




1 1 ACS. 


480 


/y 


KGQYWSTQEERLEPGVMGEFTPGPCPCL 
VSOGPSLCPS PPP YS FRWSRLSOAPGSI 
RVLI *RPAAATSAPPPAQLGLPDTAPPP 
PLLSRDGQWAAGRQGDRDKGLP 


13323 

i 


27224 




13456 


3 


ITT 
3 /3 


TF* LPQLNGYIBESTPYECGIiBPISPAR 
VPLYIKFFLVAITFLLLDLERALLLPLP 
♦ALLTTNLPLIVMSSLLLIIILALSLGY 
E * LRKGLD * AE 


13324 


27225 




13467 


214 




TP r>KT PNT .P<3 S * KKPliI PPARAKDFLFF 
I FFFFFF FFFFFFFFFS CS RXLLHR 


13325 


27226 

■ 




1 1 ACQ 

1 3465 


iii 
111 


35o 


VMTCVFYT* TOPTiVTOAYTFVKSHOTAHI 
LFHSMYVLP * IKMFWPGTVAYTYNPSTL 
GGHSRRI S * AQEFETSLGNTVS PHLY 


13326 


27227 


A 


13469 


199 


607 


RIRGTSQKLLLLKKLISLIPKSQPTRGT 
p * TTFPPPNTTNFPPPPVAATAPDP9PA 
HFVSS P YNPDLGS P * PECPSPGRLQREI 
EQCKKD I QNFPFPTTS RI CSNDLSLKGS 
AS RRRGHL F C EPJ? VNQ FRS PKP KK 


13327 








100 
XjLo 




EDWMWWLT P VI PTPOE VE VRGSLE PRSS 
ELQ*AVIVPCTPSE 


13328 


27229 


A 


13471 


94 


388 


KGSGKTEELWSWRSHRGTHLCLIiADISA 
LPIiHYYYYYYYFWQRWSLPDQPGQHGET 
PCTjPKTOI INPAYWHVPVVPTDOEADAR 
* LPEPWRQRSQRPE 


13329 


27230 


A 


13472 


359 


73 

• 


ATE PGQLF YF * ES ESHNVTQAGVQWLDH 
GSLOPPTPGLKPSFPLIHTGITAASHYT 
WLNQLLPMNVSIiLTFYESPQLKIQRSHI 
LCNSIH I KFLE 


13330 


27231 


A 


13473 


332 


10 


ILSKRGFPKPGRYCEVNPFLCVQGAK*V 
RP*EDTSLHCEIYHTHTHTHTHTHTHTN 
LKP*TSKELI PQKTESLFFSLCIKKSLL 
RFQTFPDAGSTGNSVGCACFISK 


13331 


27232 


A 


13474 


355 


385 


IMVCIKQSCVH*KNN* FWPGTVARVCNP 
RTLGGQDRQTA* AQELKTS LDNMAKPCL 
YKKFF 


T 111. 1 } 




A 




207 


425 


RKSAWWLMP WSPL* EAEAGGFLEPGSL 
RSAWATWGDPHLYKK 


13333 


27234 


A 


13476 


160 


1 


KSTGRLGMVAHICIPSTLGSQGGWIA*A 
QEFETSLGKILSLPEIQKSASIiITW 


13334 


27235 


A 


13477 


334 


79 


GMKEQ IN FLYQKRG I KPNKLOAENWLI 
HYN * KLLSNS FK* LQDTAS I * YFFGNYL 
NRPGMVAHACNPSTLGGRGGWIT*GQEF 
L 


13335 


27236 


A 


13478 


234 


385 


YAKFHVS INEKKPDVAHACNPGTLGGQG 
KWIT*GQEFKTSLANMTKPRIY 


13336 


27237 


A 


13479 


408 


129 

* 


KRSINRKKGGEIjSSSHFLIitiLLPPLDEE 
PSPPPFCPPLPPFPPPCPRLPQSG*GAC 
GRLHPC*MDLGPCKCPARKVFSHLSCSL 



1354 



WO 01/64835 PCT/US01/04927 



SEQ1D 
NO: of 
nucleotide 
sequence 

■ 

• 


SEQID 
NO: of 
peptide 
sequence 


M 

.eth 

od 

■ 

■ 


SEQID 
NO: In 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to tirst 

amino acid 

residue of 

peptide 

sequence 

i 


Predict- 
ed end 
nude- . 
otiae 
location 
correspon 
ding to 
last amino 
acid 

■ 

residue oi 

peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=Aspartic Acid, 
Hr-viiucamic acio, r— rnenyiaiamne, 
vj — vviycine, n — insuuine^ i— isQieucinc, 
K=Lysine, L=Leucine, M=Methionine, 
i^"~Asparagine, r = tiuiine ) 
Q=Giutamine, R=Arginine, S^Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
a— un Known, — oiop cooon, ^possioie 
nucleotide deletion, V=possible 
nucleotide insertion 














VLQVCRSP 


13337 


27238 


A 


13480 


dp* mt 

251 


384 


T I TP W KGAVAHVCNPSTIajGQSRUMT * G 
QE PKTS LSNMVKPRLY 


13338 


27239 


A 


13481 


164 


300 


ALjUUoUNblVKjJLjJLfVsKlHJSlKr J? &(JLA?PvJPl 

ttkfivpalweaevggwleptslrlr+a 
liallqctpa*gtvydykvyrpstl03r 

GRRMA 


13339 


27240 


A 


13482 


2 


30 


TGCHQFRFIuNNPFSPHPVLQLVSPGPDC. 
LIFRYFKF*LHQLM*INPNANFCGARSF 
LCKFRVWIS*LRYPSLVALSVLFVFK*S 
VQX * VKK±GUCKKKK1CKKKKKKKKK*K 


13340 


27241 


A 


13483 


39 


199 


LDVGTCSPIVPASQEAEAGRLHEPTSLR 
LW+AMIVQLYYSLGDRARPCLYLKK 


13341 


27242 


A 


13484 


18 


72 


TRTRSYPGSTGCQLHTFGLVSIiYNCVSQ 
FLITNLYISIYRDIY*YIYIYLYREIYL 
YI YIEI YTYIBRYVYIHI * ILGLQVANC 
IPLD 


13342 


27243 


A 


13485 


266 


496 


mw mmm. jmm. mm mm. ta>» mm* SWm m^. mm. mmmm. mf-mt. m mm m*m\ m. W W mfm WmM*m mm^mmm MkdB M V V 

LDSASDITQAGVQWCNHS FL* PQTPGLK 
* AS TS AS * VAE I TGLSQHVWVKI IMS VS 
LSSYTFIQLPSILLYGNTAP 


13343 


27244 


A 


13486 


286 


72 


HILSHCTVLWFLTP^GTTHYKNTTGQP 
S IVAHTHNLS T LGG * GRWTT *GQEFKTS 
LANTVKCS DAWVDRD 


13344 

* 


27245 


A 


13487 


309 


6 


RFFFFFPPGKKGVFFQRFFFGFPRGFPP 
PRFFKTGPEIFFFGPLKKKKNFPPPGGK 
IVPF*RGAPPFFFFFFFFLFFFFFFFLK 
KNYLLLEFCVQS PEKEG 


13345 

• 


27246 


A 


13488 

- 


183 


459 


RRVKKI CKCI I WQNNYL * KG INICSMPQ 
KKKKKKKKKKKKKGAPFKKKKKKKIS *G 
GGGPFF* KTKI KPPGGRGFFFFFGEKIM 
LFPAVFI 


13346 


27247 


A 


13489 


348 


1 


CEGKDGLP IQFAS WFKYAGFHSLES I FQ 
S FCQTKKLEAPKEIiS PCSQL YRYNWQLT 

CRKMKIjD PHJjLi P YTKJ.NJsKFGAAAHA* n 

PSTLGG*GGWI I *GQEFETSLTNMGETP 

f IKr 


13347 


27248 


A 


13490 


313 


320 


KQGGPPIiAQARVQWYNLSSGETSLPLPG 
* DREKGDQRKL 


13348 


27249 


A 


13491 


187 


350 


GNPVKK* IKNTTSSSGWVWWLMPIIPAL 
WDAEVGGLLEPRSLRPAWATERDSVS 


13349 


27250 


A 


13492 


211 


83 


VCVCVCVTTCIKNI 


13350 


27251 


A 


13493 


286 


407 


LGVEAYACNLNTLGG * GGRI T * GQE FBT 
GLGNTARPCLYF 


13351 


27252 


A 


13494 


897 


1033 


VPTDi/ , niuirtriTnntl(?OT nn * rnTif T 7rtOor*T 
Kb XFKQnUWUDHooLiyP* 1 PiSliJ\.UooUlj 

RLP STWD YRRE PLHLAM 


13352 


27253 


A 


13495 


265 


3 


HLILLEEDLTWQRGKWLKGRVSLCSLGS 

#1Y*iQT ^rr.lTIff^PriTJ^l'lA/2iV2VDQ'rT/3f3Pfi 
1 V JjI\J\AjI\OJJ\J V V/iVi\lrO 1 J-A3VjX\VS 

RWVALAQELETSLGNMVKLSLSKIQKIiA 
GMV 


13353 


27254 


A 


13496 


319 


1 


C FKLWDT CAQCAGLLHRYTCVMVVCYTH 
HPEFCYNIMGLPKYMWSYLWPKGCHAVY 
H FRYI KNLLPG CVAHACNPSTLGGRGGW 
IT*GKEFKRRMKTKV*IGIYFI 


13354 


27255 


A 


13497 


265 


11 


VU3EVSNQHLLGAPSNRTPSEVPLAQAA 
QGRPSLVPPSAIVFPSYFTICYAFLSIS 
MPI I P* VWEAEVGELLEPRSLRQA*ATW 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 

'i 


SEQ ID 
NO: of 
peptide 
sequence 


M, 
eth 
od 

- 


SEQID 
NO: in 
USSN 
09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino add 

residue of 

peptide 

sequence 


Predict- 
ed end 

* 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pnenylalanine, 
G=Glyctne, H=Histidine, l=Iso leucine, 
K^Lysine, L=Leucine, M=Metnionine, 
N«Asparagine, FMPnoline, 
Q=GIutamlne, R=Argihine, S=Serine, 
1>=Threonine, V=Valine, 
W^ryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /-possible 
nucleotide deletion, Nppossible 
nucleotide insertion 


13355 


27256 


A 


13498 


164 


22 


G I KRKLI KS GWAHARNS S S LAGRGGWL 
T *GQEFETSLAKMME I PS V 


13356 


27257 

• 


A . 


13499 


277 


363 


TYLCYIFNI CLFVGEIEKQR*_* i?HKS 
TLNGHILYVRPVSPNPSQAVASPVTCTY 
MPTWPEWBESQKK 


13357 


27258 

• 

• 


A 


13500 


I 


270 

■ 


YTCS VLLLVT VNLIiLCL I YKLNF I IGML 
R KKTVYI Y I ER I Q YYLKFQAS TCCLGVY 
PTV* PYMYTYTYMYM YMYI YI YIYI YI L 
CLCLDG 


13358 


27259 

* 


A 


13501 


170 


383 


IIiYSSKLTYPSPQIVIFFCGKST*GWVW 
WIoMPVIPAI*EAEVGGSLESTS_QAWA 
TGRKAFFVFYKEYKN 


13359 


27260 

• 


A 

• 


13502 


31 


402 

• 


GD YLYLREGWQE P ARREQHAQNLBPWRT 
FTPHlilCLTNYRLFSLAWSECEERRNET 
TDKNQTVKDLALDMKRVLPFS *OHQ__VT 
KVG WAYACNPS TLGGQGQW I TRGBEFK 
TSLINMAKPYLY 


13360 


27261 


A 


13503 


232 


446 


RNR PGAVAHACNP STFRGRGGRIMRLGD 
*RS 


13361 


27262 


A 


13504 


196 

■ 


390 


VNKAEKKHIiYFVARKEILRPGVWHTCN 
ESTLGGRGGWII *GQEFETSLPLPKTQK 
LAECGRGCL 


13362 


27263 


A 


13505 


146 


7 


LKYVLY WLGAMAYTCNTS TLGGQGGW I S 
*GQE FETSLANMVKPCLF 


13363 

■ 


27264 


A 


13506 


177 


18 

* 


SQNFGRPRQQDHFRLDGRHSETPSLQEK 
VKKLTRHGGACVQSQLLERLRREDHLGP 
GGGGCSVP*SVNII 


13364 


27265 


A 


13507 


117 


338 


NKI LKKKKGGGRFKES KFTS PGLQGNSF. 

_V_M_. m .___ _■__ ___. _*_*__* _^ _ W_. _— _, _ , ^_.-__» _*_W W_ — l_t ^ M_t _«__* ___fc_T M_t ___ *_> *-_fe 

FMGPPKIiNSRAGV* QRREGKNLGVPQLK 
PFEANPLFARGPNTKNP 


13365 


27266 


A 


13508 


306 


443 


LARYEPAVRTRAC* AQWLTAVI PVIi* EA 
EAGGPLEARNLRPAWAT 


13366 


27267 


A 


13509 


146 


390 


KMFKGHEQAAHRKKKKRGGRFKGSKFTS 
ACLQRNT FFLGP P KLNS RAGV* QRGD WK 
NPGVTQFNRFEENPLFARGPNTQKP 


13367 


27268 


A 


13510 


119 


356 


NEDRNLRGGC PGR * LLRTEGVCSNPAGW 
SGIRWCEESGGLFWRWRGTESVLFSGL 
FVCCVFAQEKAKATGRAEVSLCP 


13368 


27269 


A 


13511 


221 


21 


EDLQRDKPLGSC YS TCGWAEQWYLQHPG 
GAGSKCR I WLGAVAHAYNPS TLGGQGRW 
IT*GQEFIiSIi 


13369 


27270 


A 


13512 


163 


418 


TFPDDQCLMLQDEACVKRSIQSA*YLTP 
AVLAHWEAEAGGLLEPTSLRPAWAT 


13370 


27271 


A 


13513 


239 


21 


FGNLGGPGGRTACIQFSLGNIVRPYFYK 
TKPKPKPNQNKTKI S *AWWCMPI VPATR 
KAEVGESLEPRRSRLQ 


13371 


27272 


A 


13514 


135 


11 


KFFFWPGVLAHAYNPSSLGAQGGRIT*G 
QEYETSIANMVKS 


13372 


27273 


A 


13515 


___________ 

162 


SI 


EGTLRSRRPPIaX5WVT*GQEFETSPANM 
VKRCLY 


13373 


27274 


A 


13516 


424 


54 


PKRGFFPTPFIWVPPVFPLPPFFKPPPR 
IFFLGPPKKKFFSPPPGLKIFFF*KGPP 
PFFFFFFFFLFFFFKKKKKRLGGFFFFF 
FWARPFSFLSLFFFKETIPLYYNIWMHK 
DSCKAVI HHYH 


13374 


27275 


A 


13517 


310 


349 


GGGPLKKKLFFSRGGERFFFF*GAPPFF 
FF FFL * S S KI F I FFLKS FFFFFFFF F FC 
FFFFFSQILFFISSPCFVFFFLNYTSRT 
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DUCIcOllUc 
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SEQ ID 
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M 
eth 

- 
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SEQ ID 
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beginning 

n II i* Uififi rfp 

11 UtICUllUC 
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correspond 
ino to first 
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peptide 

vvU Uvllww 
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Predict- 
ed end 

■I U Lib 

otide 

v »■ w w 

location 
corresoon 
ding to - " 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystefne, D=Aspartic Acid, 

E^dtttAitiir Arid K~Phpnvlal^ninp 
G = Glvcine. H==Histidine. 1=1 co leucine 
K=Lysine, L=Leucine, M=Methionine, 
N = As oa rapine* P— Proline. 
Q=Glutamine, R=Arginine, S=Serine, 
TMThreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Un known. *=StoD codon. /=nossib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 














CSYVCALILFKGNTYSRKARHILW 


13375 


27276 


A 


13518 


273 


393 


I KCLS YRKKATTLGGQGRW IT* SQEFET 
S LSNM * KPOL Y 


13376 


27277 


A 


13519 


450 


139 


I PP IALVS PFAKHTWQQQMGLVRRSS KG 
NNMSKGWEI IHPSRI FCSNHTLLRIALW 
LGAWHTCNPS TLVGRGGW IT* GWEFKP 


13377 

* 


27278 


A 


13520 

i 


47 


427 


KMKQLFEMLVFQNWTIFLSHS IFRTIR* . 
LTLE PLMFRYYFKQ * RTLLP PKF * DRVW 
LCHAGGC L* CS GTVS AHCNLRLLGS SDP 

QTfl 21 Q * "\TEi.(Vr\m ArtWWTMT . T T .WMBPT.PTT 
Oloiw 1 Vnul Vvj/\ikjJlrlJ.rliJ J.J-u>ini<ri I if IS. 

QKFFINSSRNLFFT 


13378 


27279 


A 


13521 


199 


98 


S LPS VKLS ICCEFPEDIVSKDP I P WLP 

■ 

V VJN^iiouoljV^iijFiNijUFxs.^iil\.l NJLUV " 

VFQSTI KAKAQLGMVAHACNPS TLGGRG 
GWIT*GLR 


13379 


272o0 


A 


13522 


314 


1 o 

lo 


T,V*V* *NYPF*flTiTf , FTFSWHF.TONT* * 

KNLI I WAGAVAHTCNCNTIiGDQGRRIAL 
AQEIEPSLGNIAIQ 






n 

A. 


1 JDZj 


1 5 
15 


Oil 
231 


P pp A VFRT R T EO TO T tK YT) T KT> * K" TROAW 

WCAPVI PGTQAE VGGSLE IRSSRL*CAM 
STSMNRHCVPAWAT 


1 J JO I 


2/282 


A 


ioD*£*? . 


2.57 


HOD 


RVOVS E CTjIjHRB KCIVYFOLGIiS * POT 
GWLKQQKLPGTVAHTCNPGTLRGRGKRI 
T*GQEFETSLANIVKPRLY 


1 oooJL 




a 








■3T iP T*RT VAHTn\TP«»Tr/3n*RnW TTR 

OLIZvUu X VAI11 uii ru 1 Uuv VJVJVT J. J. £uJU 


13383 


27284 


A 


13526 


342 


369 


CTLNTLLLYS ISDFGVSAFLATGGDITR 
NKVRKT * LRLGTVAHT YNPNTLGGsQGRR 
LT*AQEFKNSLSNIGRPHLYKTKICTKKT 


1 jjo4 


27285 




1332/ 


241 

i 


Oil 

231 


IYVHTNSHIYTHMYIHTHTHTHICIYKL 


13385 


27286 


A 


13528 


241 


231 


EYVCIV3fSHSSLYFSSSLYIHM*VCVCVY 
IYVHTNSHIYTHMYIHTHTHTHICIYKL 
EEKYKEL 


13386 

* 


27287 


A 


13529 


34 


299 

• 


SAPAS AS PWGIGVLYRPLACPEDRFSPG 

PF ANPfiF T KTTF, T,T? VTTiS PP TjF C *>f?T TT 

APCSLDLQGSRDPPPSAS*VSETTGASH 
HAOL 


13387 


27288 


A 


13530 


158 


45 


SGHRSRARWL* PVI S ALWEASAGGSPE I 
KSLRNRRPG 


13388 


27289 


A 


13531 


342 


1 


C FFLKGAP P FF F FY F FNFFF F F FLVGTD 
KLI LKVTWKWKG P K I DKAVP K KNNQARG 
LP* PDI KI FYKVWLGAVANAYNP SNLGG 
* GGWIA* E * EFETS LANMTKPQ YKKYKK 
LA 


13389 


27290 


A 


13532 


589 


742 


RIMKMLRIKICGTGPGMVAHAVNPSSLG 
DGWIT* DQEFKAS LANMIKPHLY 


13390 


27291 


A 


13533 


74 


446 


HVGIPSRLTSVSHPRHCKHPRTESEHVY 
CAIEARHYQCKVSRPSDAAGSEGRPRDM 
TQDTEVADTPPPNI SSI TATRTMAT IGV 
GWAHAYNPNTLGGQGRRIA*AQEFKTS 
LGNLAKPCLYKH 


13391 


27292 


A 


13534 


1941 


1040 


AFHLLPYLSSGFYCSLGPCFGLGSSHMA 
AWHKEPLHNACSDSRRSAPIRQGLGSPS 
ATHDTHTRTLAH I CKE I FKERLHE I KBR 
E IDS *RG * NTQR * VERE EAENKYKPTE I 
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SEQID 
NO: of 
nucleotide 
sequence 

"i 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

eth 

oa 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

KB « 1 rife. A rik 

nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=CysteIne, D=Aspartic Add, 
Er=<jiutamic Acia, r—rnenyialanine, 
G=Glycine, H=Histidine, I=lso!eucine, 
K=Lysine, L=Leucme, M=Methionine, 
N=Asparagine, ^Proline, 
Q=Glutamine, R=Arginine, S= t Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


■ 

■ 






■ 

* 




- • 


. NTE I ENHKQQEMLRHAGEERENS PREAE 
RKCFJreKCEIKSRNSRKEARLRKKVKFK 
TNEKGKG*QIITVHKEKREVGRERRGRQ 

.- 

NKKRTKSTQHVTQKGDMR* SVKESPHLR 
RAKRNSNKMKNERYVLRQDTDHSCEwKE 
KMRQRHTRKKLARETW KWVRHTLNREQR 
AKPRKKETKEIH IHS AI LKC 


13392 

• 


27293 


A 

- 


13535 

■ 


1192 

■ 


1476 

• 


E I KRKWG PGAS CHACNP ITGRPRRVDHL 
RSGVQGQPGQRGETPSLLKNIQISWAWW 
O^PVI*EIKRKWGPGASCHACNPITGRP 
RRVDHLRSGVQGQPGQRGETPSLLKNIQ 
I S WAWW CAP VI P ATW EAEAGGS LETGS G 
GCTEPRSHHCTQAwVTE 


13393 


27294 


A 


13536 


55 


132 


RAES CS IiAEDGVQ * GNLSSLQ PP PPK 


13394 

* 


27295 


A 


13537 


7 . 


325 


S P VE FRLGE PTKGTS S FGKRRNKTHTLC 
RRCGSKAYHLQKSTCGK*GYPAKRKRKY 
NWS AKAKRRNT TGTGRMRHLKI VYRTF K 
HGFREGTTPKPKRAAVAASSSS 


13395 


27296 


A 


13538 


259 

« 


274 


KFEAAVCYHCTTLSL* *CHTVCFL3FFL 
LE IGYHYI TQAQ SGAN I AHCSLKLLGS S 
NPPVPAF*VAETRSACYNAWLVFVKFLM 
GL*ELWKDMS 


13396 


27297 

• 


A 

■ 


13539 


3 


68 

• 


QASVLPQKQLfoSWQKLSEStfmFWNLV 
KTGRLGEE I SCCLWREYYSI SDCLAT I 
KLPASHLSMRKPRHKDFKSTFLISTKNQ 
GNHGHGTKGNEVNNFQQRISV* KTYFKK 
YNMRPGAVAHTCNPS TLGG * DWW I T*GK 
NCQNQLS 


13397 


27298 


A 


13540 


268 


468 


YRRL I S KKF FKHRWKS VSFFKFS F FWLG 
EVAHACNPSTLGGQGGKIA*AQEFIiTSL 
GNI VGPCLYKK 


13398 


27299 


A 


13541 


308 


437 


FS WAFE I GS QAEVQWHDLHSIi * PLPPRF 
KRFSHLSLPSSWDYR 


13399 


27300 


A 


13542 


7 


244 


AKTAPLFF * FETVSCFVAQARGQWHDPG 
SLQPRPPGQETSMIKTSSDPPPPASQVA 
G ATGMGHHAQK I * FLVE TGS PHVS QGGL 
ALLK 


13400 


27301 


A 


13543 


50 


175 


ALPAIHTAVGQCSVEGFCVLSDPEGDGA 
ARMKLVRFLMKLSHETVTIELKNGTQVH 
GTITGVDVSMNTHLKAVKMTLKNREPVQ 

y Hum ott otT>TTnvT?T t t*t>£»t tit win r itn 

LETLS I LGNNIRYFI LP DSL PLDTLLAD 
AE PNVKS MNREAVAG * R *G*ARE I FDE I 
ES 


13401 


27302 


A 


13544 


47 


362 


VAFQGIiQVPATNPANFF FFWKGGFFFVP 
QGGRQGEKIGLRDPWPPGVRKI PGPTLS 
GTREKGAPPPPPIYF*FFWKKGGQKGGP 
GGr w TWDPKGPPo PTPFKGolJ 


13402 


27303 


A 


13545 


97 


351 


NKKKKGPPPFF*KKRVGKKKKKGARAGG 

PPPTfPPPE'fifiPPfiriOPP.fJONfilCTPB PfTfi 

GNPPFKKKKKKRNI FPAGPMGLKGGGHL 


13403 


27304 


A 


13546 


214 


32 


NYIKISSVDGGAKIK*WT*NLKCLVLSV 
KN I TRWVDCLS SGVQDQPGQHGBTPSIiL 
KTRKTS WA* WRLGLI QRWRKMPWPSR 


13404 


27305 


A 


13547 


350 


159 

§ 


QTGTL£QKTKQKKQAQWLMTVILGL*EA 
KAGGLLE P KS LRPAWATWQ DP I YKKKFF 
LERKTWR 


13405 


27306 


A 


13548 


367 


149 


FFFFFFFF*TGSHYVAQNWTQIPGLNLP 
ASASQVTATTGMHHHTRHTPFPSLTSHT 
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Hr~\jrl U LH ID 1C ACKJj J 1- * Ucl)jrli)l«D lllc, 

n=nivrinfk H=HktiHine 1=1<;nlpiicina~ 

K«Lysine, L=Leucine, M=Methionine, 
N^Asparagine, PHProline, 
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1 J. 11 1 Cull 111 C) V***VllllDv> 

W=Tryptophan,Y=Tyrosine, 

X=Un known, *«Stop codon, /-possible 

IJUUCUUUC UClCllUllf V—pOaalUIC 

nucleotide insertion 














QSVSTASWTYLONSGRH 


1J4U0 








ii t 


Z 


TA*MVQ* LQLPDKVQI YPAHPLFINDMP 
P * WLLK T P <3 PC PGTVMPYNPNT'LGGRG 
RRTTRDREFBTSLANWKPHLH ■ 


ii/ini 
1J4U/ 








ZIO - . 


1 1 


f3V <3GVP TVP <3MP Vnnn Qrt PPT S T PM 

GKAG INQ 1 1 PKTTIiGGRGGWIA*GQE PK 


13408 


27309 


A 


13551 


199 


391 


KDFS I F IACFKRPFITGSQGQEQWIjI FP 
HAYNPS I LGGRGRQIT*GQEFLTPARLG 
NMVKPCLY 


13409 


27310 


TV 




70 


239 


7V r Pf"»VBrTJ * O TC U & WWPVP WP ft TWP A a A 
A 1 ^ V AW w K 1 o W A W W ^» V ir V V if A 1 AEinrinXj 

6LLE PRS S RLQCS VI AS VERHRTSVWTT 


13410 


27311 


7\ 
A 


13553 


1 AC 

105 


OCT 

257 


W JbrHr V ± FAJj W DiUinLiuo r HrKb L_»K 

LA*ATVPGPNNFLKRAFHSLLNLFLP 


13411 


27312 


A 


13554 


499 


165 


Y+MESHFVTQGGLQWCSLGSLQAPPPEV 
GGWLELGRQRLPMKPKI AP I WTPSPG 


13412 


27313 


A 


13555 


415 


719 


YTDKS I RS I SLLP VKGSTHS MCP VKF I Q 

IX2KAGGIHWAKLSGQRVDREW 

TCNRS HLEGKGGQ I X* AQ VFKTS IX5NMA 

BCSCLCKKYQKLSWI^QH 


1 13413 


27314 


A 


13556 


224 


523 


DKVSIiCHPGWECGiwQSQliTTASiSQG * 
TILFPQSPE 


13414 


27315 


A 


13558 


355 

■ 


1 


QVSLQIiPYCVLFQFHIYKKQAALLQRGY 
VSAxNI^LIKAIWQiVNbljKJTlCBENQ^ 
LQNMPF*HRNYFGEAGHGGSCL*SQLLG 
RLRH * NCLNLGGGGC5ELRLRHCTPAWA 


13415 


27316 


A 


13559 


73 


259 


KYVPHKGKISERRLCSVAQAGVQWCDHG 
SRSLEVLGSNDLLASAS * VAGITGCWGS 


13416 


27317 


A 


13560 


205 


21 


WK I S ED FLLGDAKKWAYQE D I KRLRLG 
RGAVSHTCNPSTWRGRGGWIT*DQEFET 
SLPKC 


13417 


27318 


a 
J\ 


liDOl 


254 


4ol 


KEIFILLFF*DGVSHUiPRLECSGMISA 
YCHLCLQGSSDRRDFLKKQ 


13418 


27319 


A 


13562 


174 


369 


TFFSPSMFVEPGPYYIAQAGVRWLFTGA 

T TirSIPQ T .V J .T .21 T.CnC DTQ Ji <5 * VTHTTYIM 

PPHLAFTVK 


13419 


27320 


A 


13563 


371 


109 


LFSKAGRCILQNLLCCMVSGKCWPMNGI 

AQWLMPVIPAI*EANAGESLEPKSLRIiA 
♦ATVPGPRSLSQTLLHRSELCGL 


13420 


27321 


A 


13564 


206 


486 


RQLAAVH I LVTPLPDHVTWANYVTSLSL 
LSS S ENG VNNDNTH F I GLLLRD YK* GWV 

GSKARPRFV 


• 




A 

*• 


13565 






SRVRGCFP FNLPKSSCI* ECGQGAVAHA 
CNPSTLGGRGGQIA*AQEFKTSQOJVAK 
PHLYKI 


13422 


27323 


A 


13566 


26 


395 


YSPVHTDKCSGVRKLGLFFFFFFEKEFH 
FFS PAWKARAKIGLNGTPLS QGKGNFRP 
KPPKKRGKRGRAPQPGKILNFKKKRGFS 
MGAKVNPNFGPKGNPPP*PPKGAGBCKGG 
TPRPGPFFFFF 


13423 


27324 


A 


13567 


400 


195 


ATRVSLAPTKNKINWGWWPTP VI PPPQE 
GEAGBFL* PQKWRVRUTKIVPLPSRPRD 
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peptide 
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Amino acid sequence (A=Alanine J 
OCystei n e, D=As pa rtic Acid, 

Ks^tlltamSp A fill G^PhpnvlnloniMA ' 1 

kj — vyiuutiuic Aciu 9 r— r neoyiaianine, 1 
G=Glvcinc H=HUHdlne Ts=lcnlpnrinp 1 

K=Lyslne, L= Leu cine, M=Methionine, 
N=Asparagine } P^Proline, } 

O— C^tiita m inr> R=Aroinin*i QnQorina — ! 

^ v**u uiiii iiicj iv~r\rgiijincj o™"3erincj s 
T=Threonine, V«Valine, 
W=Tryptophan, Y=Tyrusine, 

X — 1 In L nnwn 's$fnn fAfinn /snn6*iklA 1 
KJli ivllUrr II, *— Oiop WUUQD} / = pOS5IDie 

nucleotide deletion, V=possible 
nucleotide insertion J 














KTRLCFKKKTCKN 1 


13424 


27325 


A 


13568 


316 


154 


SQPF PGLRWEDRLSRGVQNQPKQ* LMPV 
IPAFSGAEVGGSLEPRSSKPA*ATNNIiK 


13425 


27326 


A 


13569 


297 


475 


FDYFFIY*ETKSHSAAQAGVQ*CKHSSL 
QP* PPGLKQSLLPWPPKSAGLTGVSYGT 
WPI • 


13426 


27327 


A 


13570 


116 


332 


NRVLLLPKGEYRGTISAHCSLKLLGSKR 

DT3TC1 *<^T^T^T?'T M TY^"tfy*'tTLI*Prn7#tI»T v*i*di? S 

FF 1 jAA * (jUL/r, 1 ICyViJnnTPV* rUCTF F I 
I E I RPG YVPRVGHWRF | 






a 




nil 


405 


! lVP&JUKASCRIjIUtKKKKKKG 

GFPGPGPRV*GPEAKNPGGGGPGIPKOP 

G . ! 


13428 


27329 


A 


13572 


408 


3 


TFCDYFKTPPPGKFFFINGPPSFFLNRG 
P S FFFFPG FP PPFLKNFFFFPLWFKGG 
GiKRGVFPPKKKjGPPQPLFFFFFFFLI * 
KKTRPGTVAHPCNPSTLGGQGRWIT*OQ 
EFLSSLAKMAGRWRPGNCBSV 










2.16 


406 


IKEFKAERGGHS*NTSTFGGQGRWIA*S ( 
QEFETSLANMVKPHLY J 


13430 


27331 


A 


.13574 


385 


38 


K£NSPPPPGKNFFFF*GPPPSPFFFFFF j 
FFFFFFFLGGKKTFFFPPPSRFFFFLKP 
1 FPPKNQGPKNFFPPKKKIGD*ARPPPLT 

CNS 




£.1 J0£. 








53 


Ji WKVJjKBNKQ X F TKI S FKMNAI KDS VGQ 

AQWLMPVIPAL*EPDMGGPVEPKSLRPA | 
WAT 


13432 


27333 


A 


13576 


406 


33 


VNSIVWVSSPFRVSTFISFFELKSCSVT 

<M\PW4MiinT crrnT/MPn^r vnnntnnn/i 1 
yACjV*WHDIjSLiI^LQTPGuKRSPAPSRG J 

LSFYLSS I FYPHSSQTELFVGTLNAASF 

TNS RLLYLSLLLCX3KYH PLVPS LPL3 T I 

WYFSLRVSHHFP | 


13433 


27334 


A 


13577 


426 


294 


GGFPSPPVFKPPPQFFFFGPLKKKISPP j 
IrirUfiiAJxX If r ^K/UrFV j 


13434 


27335 


A 


13578 


268 
# 


3 


LQAKGPAMGE AGARCS SE VWGWPRKGFD 1 
L*VSR*PSFDSHVRIGRVQRLMPWPVL 
WETEADI S FE PCSSRPAWAS WRKTYI QK j 
PQRI J 






A 
*i 




• 




£*iU\Ar r VL.r Pvjr PC?NPPNPSPQFF* KGL j 
ISPSGSLRTRRGFFP*RVFFFFLKKQFP j 
LSPRVECNGI IPDP*PPFGEVTSTPQVA 

FFF 1 


13436 


27337 

■ 


A 


13580 


68 


466 


GASPAQGSTLHLVTLPRYFFKIPTVRTE 
SFSGSLVTSPPPLHFLPLNKERGGPG I P 
LPCPQLLQVLGGTSLPPVPSLPDTSQDK 

WDT.U^2\7DT5r , iri7PO*T3T 7\ rTv/^ A XkTD Ti <? TkTTVr 
W J^JjUVj V ±^ FLjjI v to w F ijivjUvaAW JrFSirliW 

IPLNPGTSKSLQPAPPWNSA | 


13437 


27338 


A 


13581 


67 


249 


ATAPGLLCS YKI FSCQLQWCVPVTQIiAW 
ETEAGGSLEARSSRL* *TMITPVNRHCI j 
LAWAT 1 


13438 


27339 


A 


13582 


359 


3 


KQDSQWVAAAPASVAVSTCWRGI*Pti>WS ' 
GGQNRG KGS LPLQVHRGA* GTENKNQGG 
TPRPGGGPGPSAPRGSSLGAHRKLPVHH . 
TTLSSSRSALPPTPRHPAPSSPPCTEEF 1 
HRTRPI 


13439 


27340 


A 


13583 


174 


1 


FFFFFVETESHSVAQARVLEYSGAIPAQ | 
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G=GIycine, H=Histidine, I=IsoIeucine, 

\f — T ,vc!no T aiminn {VT— Mp^h inn in a 

N»Asparagine, P=Proline, 
Q=G!ut*mine, R=Arginine, S=Serine, 

T=:Thrwininp VsVnlinP' 
x — a iir evil iuc» v— vhiiiic, 

W=Tryptophan, Y=Tyrosine, 

X=Ud known, *=5>top codon, /=possible 

nuplpn^ifl a #1 olof i n n YsnnccihlA 
MUwieuiiue ucieiiun, \— jjuaaiuie 

nucleotide insertion 














CNLDNLSSSELPASAS*AS*VAAXTGT* 
KR 


13440 


27341 


A 


13584 


192 


206 


IiFFLBVGSCYVAQAGFBFPGSGHLPLSA 

c 


1 1AA 1 

1J441 


i734Z 


TV 


17COC 


Z44 


Zo 


GAVAYNCNP STLRD * GGR I T * AQE FKTN 
LGNMARPRTRGSTRTF 


13442 


27343 


A 


13586 

i 


312 


701 


RGPEET+PRTRKSSPRPTRSCESVSKLA 
RPPRRCSAAPAQVPRLSLRSPKPDPPRV 
S b Ail KCJAPljPJUbPb Coil IWUjFKNbr lit- 
QNASSPLLSLGLPPSPTVQALQPRALHQ 
HLGSTNKEDAHVAPAKKK 


13443 


27344 


A 


13587 


410 


292 


GYMI KQQTVYHE CRM* ANSHLHLPPEGL 
LRAVTLWRRAP 


13444 


A mm) 0% m 

27345 


A 


13588 


428 


36 


TPPPFFFFWAQKKKKI FPPPGGKIFFFL 
KGPPPLFFFFFFFFFFFFFFLKKFFGLK 

ALLPQQRS FVPGLGPTRLFWGLVFWFFG 
FFFL P FGLCFFFAL I FNC 


13445 


27346 


A 


13589 


3 


364 


TAPDDQGRP YQAGE PAAH VRADSTHGLG 
A* RRVAGHL P PRVGAP HHPGAS S RBGAP 
PLHPPPNL*HRRPNGSCRFLPGPAAPPQ 

VTPSWRSS 


13446 


27347 


A 


13590 


778 


910 


DRIRRSGAVAHTCNPSTIiGGQGERTA*G 


13447 


27348 


A 


13591 


230 


3 


KLVNI LVEKSLVAMKSTVSEGKMVIRSP 
AS LGMVAHTHS P S TLGGQGRQ IA * AQE F 
ETILGNMAEPCLYKKHSVY 


tO A AO 

13448 


27349 


A 


13592 


225 


340 


iiltilrLrWnoilV VbolJij i>lrUAv InA 

YNPRTLGIRSGRIT*GHEFQTTLSHTMK 
TRLYHELL 


13449 


27350 


a 
>i 


13593 


■a < 
33 




PLTPAS * GDGTTDTCHHARL I FTGADFC 
IDASS 


1 1AZJ\ 

1J450 


27531 


a 


1 3 CO/1 


0*7 

57 


509 


LFNFP I FS KKI AS CKVKKLTRE * KYKSQ 
ARWCT P VI PATEAE VRGQLE PRSLRPAW 
ATYQPHFKS 


13451 


27352 


A 


13595 


261 


489 


SWVNERMWG* *KHETSQAQWLMPI IPA 
LCEAEVGGSLEARSWETSLCNIVRLHLS 
KKKRGKGRKRGGNQIAPSRE 


13452 


27353 


A 


13596 


238 


2 


NDCLWWLFRI PANVSTFGLLHMSLKVNT 

FtaJMiNKr JS2s rjlAalKVJXjlrll L4Niro XiAru^a 

GWIT*GQEFVTSMQKGSAVKNQ 


13453 


27354 




13597 


36 


687 


iDriTrLJTDC r PV/^&r , o vr\T*4r»7r , D/™ , T?7vi70T coco 

RGAYSCPVP ITFAEGKTRMGRDKMRLIL 
GLS WGPSLTIiLIjP ITNAPRRPGM * E PAL 
PGNSTS * SGAVRFPGGGRK*EDSAESWS 
CFCSHPSPSSGPNPPSPPSAVCMLPGIiS 
GLQR* PKQLSPPPALSLPVSSLLVRLSP 
WPPTSSHLLPQALPQSLHPQGSHRAVND 
S FSM*GMVLGSRRNRGCMEA 


13454 


27355 


A 


13598 


367 


468 


KKQRRGREPWLTPAI PALWKAKAGGSP* 
VQSSRP 


13455 


27356 


A 


13599 


183 


380 


LTSMLAVDNSSVEKTCPQAMRISRPGAV 
AHTCNPS ILGGQGGLIA* VQEYETNLGS 
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W=Tryptophan, Y=Tyrosine, 

-» * WT ■ -L. A** ft ft § m% ft 

X=Unknown, *=Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














MAKTHLYKN 

• 


13456 


27357 


A 


13600 

* 


161 

■ 


657 


LIQGCWCSLFFSTRVFLAAIGMKSQWCR 
PVAMDLGVYQLRHFS I SFLSSLLGTENA 
S VRLDN S S SGAS WAI DNKI EQAMDLVK 
SHLMYAV* E EVE VLKEQ IKBLI E KNS QL 
EQENNLLKTLAS PEQLAQFQAQLQTGS P 
PATTQPQGTTQPPAQPASQGSGPTA 


13457 


27358 


A 


13602 


12] 


423 


MIKVNSFGLYLYSQVIYIYIMKYYTAIY 
LKDFFFKDGVLTRHPRWNTVTTQCSLE I 
LGSTD P PAS AS ♦AAGTTNTSHHGQLYSY 
FKKRPTYAEMEQYKIKK 


13458 


27359 


A 


13603 


1889 


1717 


VPRIRPLSWTPPSS I *RPKPPSSTFSHP 
GKPSMGPPGLNRC P VQRALLPGNYQHCG 
R 


13459 


27360 


A 


13604 


81 


267 


ETCC I KNNTLK* I * SQMLWCAPIVPATW 
QAEVGGSLDPRSSSL*CAMITSVNNHCT 
PAWAR 


13460 


27361 


A 


13605 


267 


418 


THASGMVTHTCNPGTLRVSAGRIT*GQE 
FKTSLGN I ARPHL YKI I INKY I 


13461 


27362 


A 


13606 


428 

■ 


24 


CPESNPRKREPTLKSEPILTLPITGLIL 
DDGNSPTAAAGSVFAETLQPQCSSLCC* 
VKSSPWPTASIiPQPFGSAPQTLPVRYAL 
IPGPSLVSTSLGPRLAPGLPGSPSPPSP 
QVTLD S ERYPS PT P C I APALED 


13462 


27363 


A 


13607 


267 


442 


RPPPTLKVPWLGVEVHACNPRTLGGQGG 
S IS *AQKLENSLSNI LRPHLYQKQQKQQ 
KK 


13463 


27364 


A 


13608 


3 


280 


YRl^Vi C^DPMRBREQTCFP&PPSFPPN . 
RYCSTRDGGMEGSQGDFRKGFPA*APCR 
SGGGWRGCCSPAGAPGSPLCGSSQGGPV 
SPWGQKRS 


13464 


27365 


A 

> 


13609 


291 


248 

i 


EKRAHGSQPMKGRERNSMCKSSEFRWA 
WSIiAGVKGDGGR*GSGAGGSPKLG*AV* 
LDPEGHAGQAVFCGEAGARTQIGGKG*R 
RDQAPLPLSPNLCSGSCLPTKHSItARMP 
LRVKSYSSAQLGRTTCTRPSPPTITFDS 
S 


13465 


27366 


A 


13610 


267 


3 


HTDLL YNTPTHP I S PRCDPKHTAI PDKQ 
SLLFFFFEMESCSVARAGVRWHDLRSLR 
PLPPRF* KFFCCS I PSLYCGRSRGSSQT 
YTS 
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G=Glycine, H=Histidine, l=Lsoleucine, 
K=Lysine, LHLeucine, MHVIethionine, 

Q=Glutamine, R=Arginine, S^Serine, 

T&Thronntnik VsVa!ln« 

i 63 1 iircuiiine, v*~~vuiidc, 
W=Tryptophan, Y«=Tyrosine, . 
X=Un known, *=&op codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 


13470 


27371 


A 


13615 


197 


428 


KTMVGLKTSAS FHQDF I Q I PLNHNLS VP 
RGGW IT * GQEFETS LATMVKP 


13471 


27372 


A 


13616 


445 


550 


IS ITRGRARWLTPVILVLWEAEAGGS P * 

tjrKooKxjr* 


13472 

■ 

r 


27373 

■ 


A 

» 


13617 


3 


435 


TMVLS PADKTNVKAAWRKVGAHAGE YGA 

EAIiERMFIjSFrTTKTx r rnr DLSHGSAQ 
xrucvxn. v tn? an a t /PKrava wvimMDwa i\c. v 

V AunU NftCnUnU 1 JMM WU1 V L/l^PlfJXrUiO V. 

LSDLHAHKLRVDPVNFKLLSHCLLVTLD 
RHT.PAE FTP AVHAS LDKFLAS VSTVLTS 

KYR 


13473 


27374 

t 


A 


13618 


719 


918 


CEGRREKWKI GRERKGRGRRKGEQMGRE 

fSVd'i'D 1?TVS/^10 K"D D <T C T? PP G Q G TJ TVO CTJ T T 
laJMjlCCi JJVJOKJVKlr L. J. OK Jr o o KL/KolX X X 

ISEHASAVEA 


13474 


27375 


A 


13619 


847 


938 


W1>!PAVPALLEAEVGISI*EAQSCRPPWA 
TQ 


13475 


27376 


A 


13620 


349 


409 


NLWSWPGAVAHVCNPSTLGG . 


13476 


27377 


B 


13621 


25 


189 


MVLS PAD E TNVKAAWG KVGAHAGEYG AE 
ALERMFLCFPTTMTYFPDFDLIHGSAQ* 


13477 


27378 


A 


13622 


142 


356 


RIVENEKINAEKSSfCQKVDLQSLPTRAY 
UXJTVVPII^O^IAyiiAKERPPNPIEFL 
ASYLLKNKAQFEDRN 


13478 

■ 


27379 


A ' 


13623 

» 


18 


1353 

> 


AGAAQCEWS AGEAGARTMS EADGLRQR 
RPLRPQ WTDDDGQAPEAKDGS SFSGRV 
FRVTFLMLAVS L TVTPLLGAMMLLE S P I D 
PQPLRQISGIALFCSFKBPPI.r.TfiVLHP 
NTKLRQAE RLFENQLVGPES IAHIGDVM 
FTGTADGR WKLENGE I ET I ARFGSGP C 
KTRDDEPVCX3RPLGIRAGPNGTLFVADA 
YKGLFEVNPWKRJ2VKLLLSSETPIEGKN 
MS FVNDLTVTQDGRKI YFTDS S S KWQRR 
DYIJaLVMEGTDDGRl^YDTVTREVKVL 
LDQLRFPNGVQLS PAEDFVLVAETTMAR 
I RRVYVSGI/1KGGADLFVENMPGFPDN I 
RPSSSGGYWVGMSTIRPNPGFSMLDFLS 
ERPWIKRMIFKLFSQBTVMKFVPRYSLV 
LELSDSGAFRRSLHDPDGLVATYISBVH 
E HDGHL YLGS FRS P FLCRLS LQAV 


13479 

- 


27380 

- 


A 


13624 


18 

• 


1353 


AGAAQCEWSAGEAGARTMSEADGLRQR 
RPLRPQWTDDDGQAPEAKDGSSFSGRV 
FRVTFLMLAVSLTVPLLGAMMLLESPID 
PQPLRQISGIALFCSFKEPPLLLGVLHP 
NTKLRQAERLFENQLVG PB S IAHIGDVM 
FTGTADGR WKLENGE IETIARFGSGPC 
KTRDDEPVCXSRPl^IRAGPNGTLFVADA 

ykglfevnpwkrevklllssetpiegkn 
msfvndltvtqdgrkiyftdssskwqrr 
dylllvmegtddgrlleydtvtrevkvl 

IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPSSSGGYWVGMSTIRPNPGFSMLDFLS 
ERPWIKRMIFKLFSQBTVMKFVPRYSLV 
LELSDSGAFRRSLHDPDGLVATYISEVH 
EHDGHLYLGS FRS P FLCRLS LQAV 


13480 


27381 


A 


13625 


1 


384 


QSFRGTGRKRERERKRMSLSDWHIAAKL 
ADQPLTPKSILRLPETELGEYSLGGYS I 
SFLKQLIAGKLQESVPDPELIDLIYCGR 
KLLDDQTLDFYGIQPGSTVHVLRKSWPE 
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SEQ ID 
NO: of 
nucleotide 

i 

sequence 

"i 


SEQ ID 
NO: of 
peptide 
sequence 


eth 

00 

• 


SEQID 
NO: in 

TTCCW 
U7/JlJjl . 

26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to iirsT 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otioe 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIaoine 
C=Cysteine, D=Aspartic Add, 
jc^=v»iutaroic Acid, r— rnenyiaianine, 
vjr— vjriycinej n — msnoine^ i=isoreiicinet 
K=Lysine, D=Leucine, M=Metbjonine, 
iN^Asparagine, Jr==rroune, 
Q=Giutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 














PDQKPE P VDKEAAMRD 


13481 


27382 


A 


13626 


980 


1089 


KRIRIQLTGGLYPVPPPHPPPQSPPIFP 
RPTSPTRT 


13482 

■ 


27383 

■ 


A 


13627 


116 


587 

* 


VCGELRADSWPVPSQPEQASGPQKQAFI 

rvrrjT? 7\ nono7vnT nTnwmvunooTyviwpo 
WPEAP SRS ARIiP X xiTJJi WDoivL^TUBS 

QMIajSMARKKPRNTSRLPIjAIiNPIjKSKD 

V LAV LAERNEAI VP VGAWVE PAS rGSSE 

I PAYTSAYL I EEELKEQLRKKQEALKHF 

QKQVKYRVNQQITLRKQ 


13483 


27384 


A 


13628 


150 


484 


VAF, PQAS RGVRKTE VGEQQGQGTACGG C 
GHQCPPPTWHIQRAHCVFSTCGALTO^ 
QGRASCRRPPSQSPQVYQTQVORQDPHP 
GVGRDSRGELAWS S FIHP YLLSVCNPL 


13484 


27385 


A 


13629 


330 


397 


ARCE WLTPVI PALWEABAGGSH 


13485 


27386 


A 


13630 


10 


137 


S FTGAV I LI I AHGLTSS LL FCLANSNYE 
RTHNLFFLNDFFFS 


13486 


27387 


A 


13631 


380 


98 


FQWLVSFTATLWFEERHRKNKIEREERR 
AKGERGDRKEERREEGGBERGRRGGERS 
DKRE P KKKSKEESNHPKWKFS PICS FC 
FLTPFFPVFF 


13487 


27388 


A 


13632 

k 


56 


827 


PL FEAFTACGFVHDCGLL I HPEE TCGLQ 
PISSDYIEAILQSELKRCPSGDMKGQWI 
VPCLS CSDNRTCDWREI TWQPHNCQYGV 
LTKPQLQQCLGGRKILFIGDSTNRGIMY 
YLI ERIiNETLQE WQKVHGTKFYHNVNGG 
KTLISYSYYPQFWISPSLRPTFENALBH 
LLQRSRPLENTGQTVLWGGVQWLNSNH 
LQIIHKVLKSPFTTLNQPVTKSCLQAIY 
FPRLSPTLHSNCLDLVYSFTKSFNIYFV 
VQFLN 


13488 

* 

* 


27389 

» 


A 


13634 


3 


2718 

* 


SGPCRTTVAPLLRAAPVBHCVAALRPTD 
STMLKKFDKKDEESGGGSNPFQHLBKSA 
VLQEARVFNETPINPRKCAHILTKILYL 
INQGEHLGTTEATEAFFAMTKL FQSNDP 
TLRRMCYLTIKEMSCIAEDVI IVTSSLT 
KDMTGKEDNYRGPAVRALCQITDSTMLQ 
AIERYMKQAI VDKYP SVSS SALVS S LHL 
LKC S FD WKRWVNEAQE AAS SDN IMVQ Y 
HALGLLYHVRKNDRLAVNKM I S KVTRHG 
LKSP FAYCMM I RVAS KQLEEEDGSRDS P 
LFDFIESCLRNKHEMVVYEAASAIVNLP 
GCSAKELAPAVS VLQLFCS S PKAALRYA 
AVRTLNKVAMKHP S AVTACNLD LENLVT 
DSNRSIATLAITTLLKTGSESSIDRLMK 
QISSFMSEISDEFKWWQAISALOQKY 
PRKHAVLMNFLFTMLREEGGFEYKRAIV 
DCI IS I IEENSESKETGLSHLCEFIEDC 
EFWLATRILHLLGQEGPKTTNPSKYIR 
c ± YIvK V V LiEinbr. V KACaA vb AltAJvpTiAyW 

EEMLPS ILVLLKRCVMDDDNEVRDRATF 
YLNVLEQKQKALNAGYILNGLTVSIPGL 
ERALQQ YTLE PSE KPFDLKS VPIATAPM 
AEQRTESTPITAVKQPEKVAATRQEIFQ 
EQLAAVPE FRGLGPLFKSS PEPVALTES 
ETEYVI RCTKHTFTNHMVFQFDCTNTLN 
DQTLENVTVQMEPTEAYEVLCYVPARSL 
P YNQP GT CYTL VAL P KEDPTAVACTFS C 
MMKFTVKDCDPTTGETDDEGYEDEYVLE 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NO: of 
pepnae 
sequence 

• 


M 
oa 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last ammo 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A a Alanine 
OCysteine, D=Aspartic Acid, 
su— vfiuiamic Acta, r— rnenyiaianine, 
G=Glycine, H=Htstidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i — inreonine, v— valine, 
W=Tryptophan, Y=Tyroslne, . 
X=Un known, *=£top codon, /^possible 
nucleotide deletion, \=possible . 
nucleotide insertion 




• 






■ 


* 


DLEVTVADHIQKVMKLNFEAAWDEVGDE 
FEKEETFTLSTIKTLEEAVGNIVKFLGM 
rtr C K KS D KVT? DNKNTHTLIj LAG Vr RGGH 

D I L VRS RLIiLLD TVTMQVTARS LE ELP V 

httt 7\cir/"* 
DJ.±J_iAo VCj 


13489 

■ 


27390 


A 


13635 


118 

■ 


629 


LWALQLH PEPELPSRRGTGAAG VWTGVA 
MIRF ILIQNRAGKTRLAKWYMQFDDDEK 
UKLtlKEVHAVV^l VRDAKHTNFVErRNr K . 
I IYRRYAGLYFCICVDVNDNNLAYLEAI 
HNFVEVLNEYFHNVCELDLVFNFYKVYT 
VVDEMFLAGEIRETSQTKVLKQIiLMLQS 

T C 


13490 


27391 


A 


13636 


57 


221 


LHHCTPPWAEVEETJjKRIjQSQKGVQGI I 
WNTEGGWEPLGHCGDRSRPPAQGCP 


13491 


27392 


A 


13639 


344 


544 


IiSGGHAGALSSLVTVHLYCLLSSQQLLGN 
VLVTVLAIHFGKEFTPEVQASWQKMVTG 
VCSALCFRYH 


13492 


27393 


A 


13640 


319 


623 


DMEEAS EGGGNDRVRNLQS E VEGVKNI M 
TQNVER I LARGENLEHIjRNKTEDLEATS 
EHFKTTSQKVARKFWWKNVKMIVLICVI 
VFI I I LFIVLFATGAFS 


13493 

• 


27394 

• 


A 


13641 

■ 


2099 


769 


TRLAGRVSVASRPCRGPAVGGLLVERSK 
ARRPLLESRVAMAAVPELLQQQEEDRSK 
LRSVSVDLNVDPSLQIDIPDALSERDKV 
KFTVHTKTTLPTFQSPEFSVTRQHEDFV 
WLHDTL IETTD YAGL I IPPAPTKPDFDG 
PREKMQKLGEGEGSMTKEEFAKMKQELE 
AEYIiAVFKKTVSSHEVFLQRLSSHPVLS 
KDRNFHVFLEYDQDLSVRRKNTKEMFGG 
FFKS WKSADEVLFTG VKEVDDFFEQE K 
NFLINYYNRIKDSCVKADKMTRSHKNVA 
DD Y IHTAACLHS LALE E PTV I KKYLLKV 
AELFE KLRKVEGRVS SDEDLKLTELLRY 
YMLNIEAAKDIiLYRRTKALIDYENSNKA 
LDKARLKS KDVKLAEAHQQE C CQ KFEQL 
SESAKEELINFKRKRVAAFRKNLIEMSB 
LEIKKARNNVSLLQSCIDIiFKNN 


13494 

| * 


27395 


A 


13642 


210 


772 


SVKMVRYSLDPENPTKSCKSRGSNLRVH 
F KNTRE T AQAI KGMH I RKATKYLKD VTL 
QKQCVPFRRYNGGVGRCAQAKQWGWTQG 
RWPKKSAE FLLHMLKNAESNAELKGLDV 
DSLVIBHIQVNKAPKMRRRTYRAHGRIN 
PYMSSPCHIEMILTEKEQIVPKPEEEVA 
QKKKI SQKKLKKQKLMARE 


13495 

• 


27396 


A 


13643 


168 


2172 


SPLCEVSVPSFCFRVFCKKHKKHKSDKH 
LYEEYVEKPLKLVLKVGGNEVTELSTGS 
S GHDSS LFEDKNDHDKHKDRKR KKRKKG 
EKQ I PGEE KGRKRRRVKEDKKKRDRDRV 
tWil»A£ijUJijwwlAir VKljiibrlriSJVrijIoblj 
AKQEEVEQTPLQEALNQLMRQLQRKDPS 
AFFSFPVTDFIAPGYSMIIKHPMDFSTM 
KEKIKNNDYQSIEELKDNFKLMCTNAMI 
YNKPETIYYKAAKKLLHSGMKILSQERI 
QSLKQSIDFMADLQKTRKQKDGTDTSQS 
GEDGGCWQREREDSGDAEAHAFKSPSKE 
NKKKDKDMLEDKFKSNNLEREQEQLDRI 
VKESGGKLTRRLVNSQCEFERRKPDGTT 
TLGLLHPVDPIVGEPGYCPVRLGMTTGR 
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SEQID 
NO: of 

LI UVICUMUC 

sequence 


SEQED 
NO: of 

|IC|IUUC 

sequence 


M 

eth 

od 

uu 


SEQID 
NO: in 
USSN 
(09/515,1 
26 


Predicted 
beginning 

II U LIVUIIUC 

location 
correspond 
ing to first 
amino acid 
residue of 
. peptide 
sequence 


Predict- 
ed end 

niir !&•» 

UUUV 

otide 
location 
correspon 
ding to — 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid. F=PhenvlaIanine. 
G=Glycine, H=Histidfne, I^Isoleudne, 
K=Lysine, D=Leucihe, M=Methionine, . 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S«Serfoe, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


■ 










* 


PYSSYAPHYDSTFANISKDDSDLIYSTY 
GEDSDLPSDFS IHEFLATCQDYPYVMAD 
SLLDVLTKGGHS RTLQEMEMS L PEDEGH 
TRTLDTAKEMEQITEVEPPGRLDSSTQD 
RL I ALKAVTNFGVP VEVFDS EEAE I FQK 
KLDETTRLLRELQSAQNBRLSTRPPPNM 
I CLLG PS YREMHLAEQVTNNLKELAQQ V 
TPGD I VSTYGVRKAMG X S I PS PVMENNF 
VDLTEDTEE P KKTDVAECGPGGS 


13496 

• 


27397 

1 

\ 


A 


13644 

• 


1032 


207 


ILSASGACKLIDSLHSYCFSSRQNKSQV 
CCLREQVEKKNGELKSLRQRVSRSDSQV 

PKMNP WEPLS WMLGTWLSDP PGAGTYP 
TTjQPFQYLEEVHISHVGQPMLNFSFNSF 
HPDTRKPMHRECGFIRLKPDTNKVAFVS 
AQNTGWEVEEGEVNGQBLCIASHS IAR 
IS FAKEPHVEQ I TRKFRLNSEGKLEQTV 
SMATTTQPMTQHLHVTYKKVTP 


13497 


27398 


A 


13646 


148 


380 


RGSWREVPESASLPSRGAKGKWRGLCCC 
CCCCCCCCCCCCYHCHQEQGDIjKHQADL 
WRSGRTQNQAG I WQEHQTLEG 


13498 


27399 


A 


13647 


2099 


769 


TRLAGRVSVASRPCRGPAVGGIiLVERS K 
ARRPLLESRVAMAAVPELLQQQEEDRS k 
LRSVS VDLNVDPSLQIDI PDALSERDKV 
KFTVHTKTTLPTFQS PE FS VTRQHEDFV 
WLHDTL I ETTD YAGLI I PPAPTKPD FDG 
PREKMQKLGEGEGSMTKBEFAKMKQELE 

7VT?VT TV T TZ?X/VT r \ TC 7T?T-/"tP T CGUDTfT.C 

AUXJUAVr JsJxX VoortBVr jjUKliobMrVjJO 
KDRNFHVFLEYDQDLSVRRKNTKEMFGG 
FFKSWKSADEVLFTX3VKEVDDFFEQEK 
NFLINYYNRIKDSCVKADKMTRSHKNVA 
DDYIHTAACLHSLALEEPTVIKKYLLKV 
AELFE KLRKVE GRVSSDEDLKLTELLR Y 
YMLN I EAAKDLL YRRTKAL I D YENSNKA 
LDKARLKSKDVKLAEAHQQECCQKFEQL 
SESAKE ELINFKRKRVAAFRKNLIEMS E 
LEIKHARNNVSLLQSCIDLFKNN 


13499 

• 


27400 


A 

<• 

• 


13648 


1 

■ 


1206 


MSTSQS PCES I CD YVTSHDKSNFTDMI K 
LNILRCEVILDYPAPGGGSLGAKHCCSC 
YTVS S GVTEGERNAGEKGVKLNADGAR I 
RGTPGRGRRAEAEASSPAPAAVAAACW 
AAAAASRQLASGNRTRVSSGVPAPAFLG 

ACFHCETCKMTLNMKNYKGYEKKPYCNA 
HYPKQSFTMVADTPENLRLKQQSELQSQ 
VRYKEEFEKNKGKGFSVVADTPELQRIK 
KTQDQ ISNIKYHEEFEKSRMGPSGGEGM 
EPERRDSQDGSS YRRPLEQQQPHHI PTS 
APVYQQPQQQPVAQSYGGYKEPAAPVS I 
QRSAPGGGGKRYRAVYDYSAADEDEVSF 
QDGDTIVNVQQIDDGWMYGTVERTGDTG 
MLPANYVEAI 


13500 


27401 


A 


.13649 


3 


394 


GDGGGHLGSGRNGGGSMNAPPAFESFLL 
FEGEKITINKDTKVPNACLFTMNKEDHT 
LGNI IKSQLLKDPQVLFAGYKVPHPLEH 
KII IRVQTTPDYSPQEAFTNAITDLISE 
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SEQIB 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

• 


SEQID 
NO: in 
USSN 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- ' 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartfc Acid, 
E— Glutamic Acid, F=rnenylalanine, 
Kj— Glycine, H=iiistidine, I— isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N»Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, " 
T=*Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 

XT' IT 1 + OA— _ 1— I _ — !L1« ' 

X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LSLLEERFRVRAGPGGAD 


13501 


27402 


A 


13650 


1 


471 


SRPTGLREAAGSGPREAPRRSGCKSPGL 
GTVGMLRPKALTQVLSQANTGGVQSTLL 
LNNEGS LLAYSGYGDTDARVTAAIASNI 
WAAYDRNGNQAFNEDNLKF ILMDCMEGR 
VAITRVANLLLCMYAKETVGFGMLKAKA 
QALVQYLEEPLTQVAAS 


13502 

- 


27403 


A 


13651 

- 


1334 

• 


82 

* 

• 


CYTGGTQSLWPGSSCASSVARPSSLFRS 
AWSCEWSVRCARACTMSVPAFIDISEED 
QAAELRAYLKS KGAE X SEENS EGGLHVD 
LAQI IEACDVCLKEDDKDVESVMNSWS 
LLLILEPDKQEALIESLCEKLVKFREGE 
RPSLRLQLLSNLFHGMDKNTPVRYTVYC 
SLI KVAAS CGAI QY I PTELDQVRKW I SD 
WNLTTEKKHTLLRLLYEALVDCKKSDAA 
S KVMVE LLGS YTEDNASQ AR VDAHR C I V 
RALKDPNAFLFDHLLTLKPVKFLEGELI 
HDLLTI FVSAKLAS YVKFYQNNKDFIDS 
LGLLHEQNMAKMRLLTFMGMAVENKEIS 
FDTMQQELQ I GADDVEAPVIDAVRTKMV 
YCKIDQTQRKWVSHSTHRTFGKQQWQQ 
LYDTI^AWKQNLNKVKNSLLSLSDT 


13503 

* 


27404 


A 


13652 


1 


377 


TTASGRSGVKGSTMSAEVPEAASAEEQK 
EME DKVTS PE KAE EAKLKAR Y PHLGQKP 
GGSDFLRKRLQKGQKYFDSGDYNMAKAK 
MKNKQLPTAAPDKTEVTGDHIPTPQDLP 
QRKPSLVASKLAG 


13504 

■ 


27405 


A 

• 


13653 


424 


596 


SLKNI YGLSCRKKKKGAVKKI I LVQAWW 
LMP VI TVLWEAE VGGLLEARGLRPTRAT 
W 


13505 


27406 


A 


13654 


190 


448 


LRS YPAPHLGS PELR I RKGRGHSHCLAG 
AAGPQRTALCGLS APLCPPS PTP PGAGA 
PRYCSGSDAP PCLLRGAGP P I PGMGDPE 
TS 


13506 


27407 


A 


13657 


1612 


563 


SMPGWRLIiTQVGAQVLGRLGDGLGAALG 
PGNRTHIWLFVRGLHGKSGTWWDSHLSE 
ENVPFI KQLVSDEDKAQLASKLCPLKDE 
P WP IHPWE PGS FRVGLIALKLGMMPLWT 
KDGQKHVVTLLQVQDCHVLKYTSKENCN ! 
GKMATLSVGGKTVSRFRKATSIIiEFYRE ; 
LGLP PKQTVKI FNI TDNAAI KPGTPLYA 
AHFRPGQYVDVTAKT IGKGFQGVMKRWG 
FKGQPATHGQTKTHRRPGAVATGDIGRV 
WPGTKMPGKMGNIYRTEYGLKVWRINTK 
HNI I YVNGS VPGHKNCIjVKVKDS KLPAY 
KDLGKNLPFPTYFPDGDEEELPEDLYDE 
NVCQPGAPS ITFA 


13507 


27408 


A 


13658 


128 


2626 


NSHRWVYVRARRWRRRGKQREQPEDRGV 

rivivo n turn t tj o ti rwx t cv*»t>pp y>r\ u l»\t/> i j u 

P NKRAAlnAIiHo PQ Y X FGDF S PDEFNQFF 
VTPRSSVELPPYSGTVLCX3TQAVDKLPD 
GQEYQR I E PGVDE VI E PSDTLPRTPS YS 
I SSTLNPQAPEFI LGCTASKITPDGITK 
EASYGS IDCQYPGSALALDGSSNVEAEV 
LENDGVSGGLGQRERKKKKKRPPGYYSY . 
LKDGGDDS I STEAL VNGHANSAVPNSVS 
AEDAEFMGDMPPS VTPRTCNSPQNSTDS 
VS D IVP DSP FPGALG S DTRTAGQ PEGGP 
GADFGQ S C FPAEAGRDTLSRTAGAQPCV 
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SEQID 
NO: of 
nucleotide 
sequence 

'i 


SEQID 
NO: of 
peptide 
sequence 


M, 
eth 

00 


SEQID 
NO: in 

ITCC1M 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
cs-oiutaniic Acia, r— rnenyiaianine, 
G=Glycine, H=Histidine, Msoleueine, 
K=Lysine, L^Leucine, M^Methfomne, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


■ 


• 

i 

• 


• 

• 


* 

• 

• 




■ 


GTDTTENLGVANGQILESSGEGTATNGY 
ELHTTES I DLDPT KPESAS PPADGTGSA 
SGTLPVSQPKSWASLFHDSKPSSSSPVA 
YVET KYS P P AI S PLVS EKQVEVKEGLVP 
VSEDPVAI KIAELLENVTL IHKPVSLQP 
RGLINKGNWCYINATLQALVACPPMYHL 
MKFIPLYSKVQRPCTSTPMIDSFVRLMN 
EFTNMPVPPKPRQALGDKIVRDIRPGAA 
FEPTYIYRLLTVNKSSLSEKGRQEDAEE 
YLGFILNGLHEEMLNLKKLLSPSNEKLT 
I SNGPKNHS VNEEEQEEQGBGSEDEWEQ 
VGPRNKTSVTRQADFVQTPITGIFGOHI 
RSWYQQS S KES ATLQP FFTLQLDIQSD 
KIRTVQDALESLVARESVQGYTTKTKQE 
VEISRRVTLEKLPPVLVLHLKRFVYBKT 
GGCQKLIKNIEYPVDLEISKELLSPGVK 
NKNFKCHRTYRLFAWYHHGNSATGGHY 
TTDVFQIGLNGWLRIDDQTVKVINQYQV 
VKPTAERTAYLLYYRRVDLL 


13508 


27409 

• 


A 


13659 


42 


382 


TLWLKTIQIYLTISIjGLGSDYGIiAflFSA 
LGCHQAAVKMLTAF I L IQGLDLGRSHFQ 
AHSGCWQDS FPCDSRIHGGLUjHVQQES 
PWFQSAERVSCITKCNHRSDTHHLCSIIi 
L 


13509 


27410 


A 


13660 


178 


349 


DMGPCYVAQXVLKLLGSSDPPTSASQEA 
GTTCHHAHLLSHPFLMIYRNFSFIQCLE 

mt 

wmm 

T 


13510 


27411 


A 


13661 


268 


2 


RQQKVILSSSGSLMSEMGRTVLPLKSPG 
MNPSSPLLiASGGCWKSLACGHIIPFSSF 
ILMWPSPLCVSVSSSLPMRKPLLLHLRS 
TLFQ 


13511 


27412 


A 


13662 


112 


483 


AGVGALRMVQRLTYRRRLSYNTASNKTR 
LSRTPGNRIVYLYTKKVGKAPKSACGVC 
PGRLRG VRAVRP KVLMRLSKTKKHVSRA 
YGGSMCAKCVRDRIKRAFLIEEQKIWK 
VLKAQAQS QKAK 


13512 

- 


27413 

• 


A 


13663 

- 


2 


873 


S VEE FDRGCTGRGCG ADARAGAAMVKIS 
FQPAVAGI KGD KAD KAS AS APAP AS ATE 
ILLTPAREEQPPQHRSKRGSSVGGVCYL 
SMGMVVliliMGLVFASVYIYRYFFLAQLA 
RDNFFRCGVLYEDSLSSQVRTQMELEED 
VKI YLDENYERINVPVPQFGGGDPADI I 
HDFQRGLTAYHD I S LDKCYVI ELNTTIV 
LPPRNFWELLMNVKRGTYLPQTYI IQEE 
MWTEHVSDKEALGSFIYHLCNGKDTYR 
LRRRATRRR INKRGAKNONAI RHFENTF 
WETLICGW 


13513 


27414 


A 


13664 


118 


3 


AWSLI PVIPWREAKAGGIiLEPRSLRPT 
WAlWUJJrVol 


13514 


27415 


A 


13665 


1 


2876 

• 


I RQR INFS RKNKWD SRKLDQAGVS ELAT 
NQKLILVCGRYEGIDERVIQTE IDEEWS 
I GD YVLSG GELPAMTL I DS VSRF I PGVL 
GHEASATEDSFAEGLLDCPHYTRPEVLE 
GMEVPPVLLSGNHAEIRRWRLKQSLGRT 
WLRRP E LLENLALT EEQARLLAE FKTEH 
AQ QQHKHDGMAAD EAGRT FLRAD FNM I E 
EGDR IMVCLSGGKDS YTMLE I LRNLQQS 
APINFSLVAVNLDQKQPGFPEHVLPEYL 
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SEQID 
NO: of 
nucleotide 
sequence 

* 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 

00 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 
nucieonac 

location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 

• 


Predict- 
ed end 

pit iaIa 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 

jv— ijf luiamic aciO) i*— ** ijeiiyiaianine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M-Methionine, 
N-Asparagine, P*=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonlne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stopcodon,/=possibIe 
nucleotide deletion, V= possible 
nucleotide insertion 




♦ 

* 




1 

• 


- 




E KLGVE YKI VEENTYG I VKEKI PEGKTT 
C S LCS RLRRGI LYRTATELGATKIAIX3H 
HRDDILQTLFLNMFYGGKMKGMPPKLMS 
DDGKH XVI RPLAYCREKDIQRFADAKAF 
PI I PCNLCGSQPNLQRQVIADMLRDWDK 
RYPGR I ETMFSAMQNWPSHL CDTNLFD 
FKGITEGS EWNGGDLAFDREE I PLQPA 
CWQPEEDENQLDELRLNWEPDGPRHRH . 
APDARRFCQYLRHQARYLLSWPNLDVA 
VTE PIiANGDGLNVLI KRE WGFRANTVE 
KTGENQ YRVWPNEMPADLHKI RPHHPLN 
RNLDHFFPLDLTNSTFSGGYVHVLKGVL 
S DDLLLKS FQKMG YVRRDSHRLMVTALP 
. PACQLVQVALGCFALRLECEIIiGEVIAQ 
LGTS VLPAE E LLQARRASGDVAS CQRLA 
QDEEPPPLP PRGS PAAYRAPLDLYRDLQ 
EDEGSEDASLYGEPS PGPDSPPAELAYR 
P PLWEQ SAKL WGTGGRAWE P P AEE LPQA 
S S P P YGALE EGLE P E PS AFS FLS LRREL 
SRPGDLATPE SSAAAS PRRIRAEGVPAS 
AYRSVS E P PG YQGTQLP VPWRPAHPLLR 
HLSPAACCPLCSPARL1PSRPI1AACAAWR 
RPAALVATACTDGHSALQQPRGPALGQL 
GPRSRALGRQLPWCFSIjIjGRGLGLWLPG 
P WGS PGHSRGLES PGQ PLLEVGLS VYGR 
LQGQQA 


13515 


27416 


A 


13667 


76 


289 


SGTPASPCLEMDPNCSCSPVGSCACAGS 
CKCKECKCTSCKKSCCfiCCPVGCAKCAQ 
GCICKGTSDKCSCCA 


13516 


27417 


A 


13669 


2 

1 


667 


GRVDAEQSRLGATERAAAAAMNPEYDYIi 

FKLLL I GDSGVGKS CLLLRFADDTYTES 

YISTIGVDFKIRTIELDGKTIKLQIWDT 

AJGQEKifK T I To 0 X Y Kwuvil JL V V x U \T1 D 

QESYANVKQWLQEIDRYASENVNKLLVG 
wire nr.TTTrTnn/nMTT a v pp ati 01 T /3 T dt?t. 

ETSAKNATNVEQAFMTMAAEI KKRMGPG 

A & QrUTKR PUT »lf TD QTPVff PAGGGPf* 


13517 

< 


27418 


A 


13670 


176 


398 


R I LKTQLQENNQPTTTT KNRQKTRTDTL 
PKGI YRRHI SLWKLGTWLTPVI PALCEA 
NAGGRLE S RS SRPVWATY 


13518 


27419 


A 


13671 


149 


247 


RKGLALS PRLDCSGAI T VNCSLNLQGLS 
NIPI 


13519 


27420 


A 


13672 


257 


359 


PRCSRP 


13520 


27421 


A 


13673 


142 


1 


HLKGSGLAKHQ WLMPVT PALWEAEVGGS 
LEPRSSRPVWATLQDPISS 


13521 


27422 


A 


13674 


619 

■ 


168 


MF I ESCCTPKI YTSFVxiRQSHS FAQAGV 
Q WLDLG S LQ PliS LG S S N S PASAS C I Q VIi 
FVIYTSVKLGVGGJiliGNKKVMSSWKKFV 
CECRGWI I LP FSVPSLRFQDGGWDHGVS 
S AE KALQAGRTAGHRVG VPPLGAPCGGA 
PCTTPPAWRS 


13522 


27423 


A 


13675 


3 


370 


ARALLHPRLHQWQCLCHLQGTGGCPQQS 
PSSEGRADLLSPALEEGYPPPASGCFIP . 
HFLCGCS I FN CLP P I MRHAHKS PDALLE 
TLGPCRAWRLMP VI PTLWEAKVDRLLES 
RSSRPSWPTW 


13523 


27424 


A 


13676 


3 


281 


IGLRPELSVSLGRQRLTAIRLQPliLPAR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

oa 

* 


SEQID 
NO: in 

I1CCM 

09/515,1 
26 

* 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to iirsc 

amino acid 

resioue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 

til rrtJo|Jtiii 

ding to 

I act aminn 

acid 

■ cMUUC Ul 

peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
Sj — oiuujiiiii. aciii, r — riieiiyiaianine, 
G=Glycine, H=Histidine, I=Isoleurine, 
. K=Lysine, L=Leucine, M^Methtonioe, 

NsA cnsrooSnft Ps'PVAtttiA 

Q=Glutaniine, R^Arginine, S«Serine, 

1 A III CvlIIIJCf T — THIIHC, 

W=Tryptophan, Y=Tyrosine, 
nucleotide deletion, \=possible 

niiflpnftHp incpff !nn 

1IUVICUIIUC 1II9CI UVII 




» 










TPCGFLRARVALALMADETDLPLLPRLE 
YSQMITAYCSIiDFLQSSDPPTLQSGVAQ 
TTENCCED 


13524 


27425 


A 


13677 


272 


389 


YCRQ I EKS KKKRC WLGTVAHACNPSTWG 
GRGGRITRSRD 


13525 


27426 

* 


A 


13678 


261 


363 


NSSWGHARWLTSVI PALWEAEVGGSPEV 


13526 


27427 


A 


13679 


149 


44 


SCVMSVSVLKKMYWLGAVAHACNPSTLG 


13527 


27428 


C 


13680 


118 


267 


MSHCTWLMCVCVCVCVCGVYAYMYTYSP 
LRWDLPMLSRLECSG YSQAQ+ ♦ . 


13528 


27429 


A 


13682 


16 

• 


346 


NHHLTQPHPHLDINDFFFFFFFFFFFFF 
FFFFFFFFFFFLKKS PRPAGKKKKKKKK 
KKKPPRFFFLGGAPPPIiFFFLINFFFFF 
LCKI ISLVYNRRGGPCLSKRFFLKERKS 
PL 


13529 


27430 


A 


13683 


244 


140 


GRVDVFHHVAQAGLKLLS SGDLPAWP PK 

«T ATT 

ELRJjQV 


13530 

■ 


27431 

■ 


A 


13684 

« 


191 


777 


NSDEHVYRCLYGHKLSFLLS KYLKSGLA 
GVLLPLDAAVDMEK I EEQ FANIjHI VKCS 
LGTKEPTYLLGI DTS KTVQAGKENIiVAV 
LCSNGS I RI YDKRR T iNVLRE FSGYPGLL 
NGVRFANSCDSVYSACTDGTVKCWDARV 
AREKPVQIjPKGYPSNIFISFDINCNDHI 
ICAGTEKVDDDALFGGFGDARMEFLQN 


13531 


27432 


A 


13685 


150 


ry c O 

253 


VGWGJjWJjVP V I PAr WhAi^UbS IjDFKNJj 
LPCWAT 


13532 


27433 


A 


13686 


r* f\ r\ 

500 


321 


I EMGSH YVAQAGJjEIjIjS S SW P PTS VS QS 
AGIVGMSHHALAAI S KSSDAS I TSHHPM ! 
TTP 


13533 


27434 


A 


13687 


6 


424 


msllqcggircfkmpepaksapapkkgs ; 
kkavtkaqkkdgkkrkrsrkesysvyvy 
kvlkqvhpdtgisskamgimnspvndif 
eriageasrlahynkrst itsre iqtav 

KIjJjJjPGbljAJ\HA.V5>bGxlvAV 1 Jvi loaJv 


13534 


27435 


A 


13688 


178 


293 


TGYSSQAQWLMTVLTATWEAEAGGSLQP 

T3CT DDMaTATY^ 


13535 

• 


27436 


A 


13689 


663 


144 


KELSAVSAGI PHS CGS QGCGGGS VAACV 

GVNAPPPPVCSSVRIiHVCQSDRLWVRLA 

RPSESLRPTSNAASLFRSSVPTVLSHSV 
PLAASLGKRRACGGREHASVAVYLSVCL 
SLPT 


13536 


27437 


A 


13690 


125 


3 


FAHQGHAPGQAWV7LRP VI PVLREAELGG 
RPEPRSSRTARA 


13537 


27438 


A 


13691 


136 


24 


GGPPFFFFFFFFFFFFFFFFFFFPFPFF 
EFPGKLQQDFQYYVK 




27439 






144 




TL IALTTLLLP ISPFILIIIi 


13539 


27440 


A 


13693 


1476 


380 


N S WSTLASELTIjWAATSAD PERKS QAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
VNSLEKFHFKSWMSKHRKTYSTEEYHHR | 
LQTFASNWRKINAHNNGNHTFKMALNQF ; 
SDMSFAEIKHKYLWSEPQNCSATKSNYIi 
RGTGP YP PS VD WRKKGNFVS P VKNQGAC I 
GSCWTFSTTGALESAIAIATGKMLSLAE 
QQLVDCAQDFNNHGCQGGLPSQAFEYIL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
neotide 
sequence 

• 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1. 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

oeotide 

sequence 

< 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=TrvDtoDhaOi y=Tvrosine, 
X^Unknown, * =5 SJtop codon, /^possible 
nucleotide deletion, Y=possibIe 
nucleotide insertion 


: 


- 




• 






YNKGIMGBDTYPYQGKDGYCKFQPGKAI 
GFVKD VAN I T I YD E EAMVEAVALYNP VS 
FAFEVTQDFMMYRTGIYSSTSCHKTPDK 
VNHAVLAVGYGE KNGIP YW I VKNSWGPQ 
WGMNGYFLI ERGKNMCGLAACAS YP I PL 
V 


13540 


27441 


A 


13694 


87 


2 


PGWAWWFTPVILALWETBAGGSPEFKSL 


13541 


27442 


A 


13696 


130 


2 

mm 


CNILIII IQVCWAWWLMPVI PALWHAEP 
GGSPEVESSRPALV 


13542 

> 


27443 


A 


13697 


282 


107 


GLTKCPGTVCRSLKWISYQLSIICLSM 
MEHSGTILAHCSLKLLGSSDPPTSVSRV 
AS 


13543 


27444 

• 


A 


13698 


198 


3 


CSITVNKVKVMTLFRSSMSliEIVCDIDL 
AVLRRDLGS LQD P PPRFTPLS CLSL PS S 
WDYRRPLV 


13544 


27445 


A 


13699 


2 


120 


ARGDGVSLWLPRLKCNGAISAHCNLRPT 
GSSDKNFKTRF 


13545 


27446 


A 


13700 


60 


341 


PDMGLEDERKMLTESGDPBEEEEEQEEL 

ERVS S RSHTDDD CPEELFDFLHAKDHCV 
ALKLFNNLO 


13546 


27447 

* 


A 


13701 


6 


540 


KNSRTLGQCGG I RGS LCR PRKPG VGGTQ 
TRAVR P AVC S ADS ARPHL P LRRADMKD S 
LVLLGRVPAHPDSRCWFLAWNPAGTLLA 
SCGGDRRIRIWGTEGDSWICKSVLSEGH 
QRTVRKVAWSPCGNYLASAS FDATTCIW 
KKLTLRI YNILRKLBACIKPLCCALKYG 
CLEEKQLHS 


13547 


27448 


A 


13702 


239 


346 


AKRGWLMPVI PALWEAKVGKSLEPRISR 
PLWATWQN 


13548 


27449 


A 


13703 


210 

i 


300 


LMPVI S ALWEAKARGLLE PRTLRPAWAT 
W 


13549 


27450 


A 


13704 


141 


440 


PSAFEHFEKINLYFLKFCISQGFPERQN 
NRDRE IHRERYERDRERHRERYERGLRE 
LAHVIVEAEKPHHRPFITWETLGCWWSG 
S VQVQKPWE PGKLMV 


13550 


27451 


A 


13705 


204 


438 


LLNVLLTQLFFL FLSLRQS CS VAQARGQ 
WYKHGS IjQP S THGASNP P TS AS Q SVGTT 
GMSHDHGLFLHFSTLDIFSSL 


13551 


27452 


A 


13706 


1707 

# 


1821 


AKAGG S QHLE I LANAVKP CL YWE CKMAG 
PWWCAPVGG 


13552 


27453 


A 


13707 


167 


334 


I RRANFKI LTE IGWTQWLTP VI PALWBA 
EGGGLLE PGS PSLVFLFF ILTT I LMRKK 


13553 


27454 


A 


13708 


224 


344 


KDTAMEEE IKDTEKTGKPWLGAVAHACN 
PSTLGDQGRQIA 


13554 


27455 


A 


13709 


1476 


380 


NS WS TLASELTLWAATS ADP ERKSQAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
VNSLEKFHFKSWMSKHRKTYSTEEYHHR 
LQTFASNWRKINAHNNGNHTFKMALNQF 
SDMSFAEIKHKYLWSEPQNCSATKSNYli 
RGTGPYP PSVD WRKKGNFVS P VKNQGAC 
GSCWTFSTTGALESAIAIATGKMLSLAE 
QQLVDCAQDFNNHGCQGGLPSQAFEYIL 
YNKGIMGEDTYPYQGKDGYCKFQPGKAI 
GFVKDVANITIYDEEAMVRAVALYNPVS 
FAFEVTQDFMMYRTGIYSSTSCHKTPDK 
VNHAVLAVGYGEKNGIPYWIVKNSWGPQ 
WGMNGYFLIERGKNNCGLAACASYPIPL 



1371 
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PCT/US01/04927 



SEQID 
NO: of 

n 11 *« Ion ti rl a 

3CUUCIILC 

• 

i 


SEQID 
NO: of 

pcpiiuc 


M 

eth 

uu 

• 


SEQID 
NO: in 

09/51 5.1 
26 


Predicted 
beginning 

nnrtpnfirip 
location 
correspond 
ing to first 
amino acid 
residue of 
. peptide 

CPflllMlf A 
5C*| UGI1US 


Predict- 
ed end 

mini A— 

otide 
location 
correspon 
ding to 
last amino 
acid 

I GalUUC UI 

peptide 
sequence 


Amino acid sequence (A m Alanine 
OCysteine, D=Aspartic Acid, 

G = Glvcin.e lfsTTistidine. I~Isoleucine_ 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R»Arginine;$=*Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyroslne, 

YcsTTnlrn Awn *sStnn Mwlnn ' festtAtslKlo 

nucleotide deletion, V=possible 
nucleotide insertion 














V 


13555 


27456 

1 




■ 1 1*71 fk 

1 .5711) 

i 


/Oj 


JLjI 

■ 


LSSAAKHSVDGSSLSSELQQIiOLPKEHA 
ASIiOTnYEEKOQPI^PnrJJlVCSLRMNRTi 

AGVGWRVDYTLSSSLLQIRGKSPWCHCG 
WKVATCPQGPQAPACLPCPPSSKTKPPR 
SSLARTES RPKTLMKIjPWAEEIK3VPRPV 


13556 


27457 


A 


13711 

■ 


133 


401 


VLSKSGNP YWGLSTNHEI KGQEGCAWQL 

WLP.RE I KLDLLKNGCGPS K I TQGGPMVA 
GSLKL 


13557 


27458 


A 


13712 


212 


74 


RRINLAPPRVFLKGPPPFFFFFFFFFFF 
FFAWGNPLPTjFNSTFGRT 


13558 


27459 


A 


13713 


324 


141 


DRVLPRLECSGVTTARYRLNLLGSRDPP 

x ofuuxAUw x vj x v*rLnftni*jr x xjr x x v c ■ 

FIIL 


13559 


27460 


A 


13714 


299 


431 


I RL I WLS AVDHACN S S TK3GHGVQMTKG 

^L/f Dill U/UN IfUi XT air X 


13560 


27461 


A 


13715 


133 


1 


ILVWAWWDMPAIPVLWEAETEGIxIiEPRS 
LRPAWATWKNTISTKN 


IJDoI 


Z74oz 


a 


. i ill a 

1.5 /lo 


15 


i as. 
105 


SVLYMASERECRTYNLMLNK 


13562 


27463 


A 


13717 


119 


1 


KPNVSQAQWLAPVI PALWEARVGVLFKP 


13563 

* 


27464 


A 


13718 


271 


64 


VLRS I STLNEIIiAKTRNPPLQ ILMKPRK 
TKISWVWWRTPVI 


13564 


27465 


A 


13719 


21 


209 


KDGPKGTGGSETNESVEMTECCSVTQAG 
VQWHDLGSLQPPPPGFKRFS CLGLPSSW 

DYSRKD 


13565 


27466 


A 


13720 


391 


151 


RGFFLKGEKKKNFFPIILGKKILGSPGK 
KGEKKKKKRGKI FFFLKNPLGFFPICKKV 
LGEKKNP YSGVWGGKKKRGP PPHP 


13566 


27467 


A 


13721 


109 


2 


RRLGWAHACNPSTLGGEVGGSQGPGNC 
GQVTKVS 


13567 


27468 


A 


13722 


69 


369 


RDILQ ADGAHSL FPTTPSLNAYI X FSS P 
IGPHTHRPYHATPTYLPFYTNNLIKKKK 
KKKKKKKKKKKKKKKKKKKKKKKGGGPF 
KKTPGGAP I FGGGGK 


13568 

■ 


27469 


A . 


13723 

- 


58 

■ 


1208 

• 


F WNENS PAS ELAPNGGGSVTS VPRLEDY 
LTRPQLYKLTGVBG PS RANS RDS FHDRK 
TYKTPSANM>1VLKVEELVTGKKNGNGEA 
GEFLPEDFPJDGEYEAAVTLEKQEDLKTL 

Ixntlr V 1 JbVj£UUWiUbx!iI^XVB>lE^ 

QRSKLENLEDLE 1 1 IQLKKRKKYRKTKV 
P WKE PE PE I ITE P VDVPTFLKAALENK 
LPVVEKFLSDKNNPDVCDEYKRTALHRA 

AI HWAS RGGNLDVLKLLLNKGAK I SARD 
KLI^TALHVAWTGHYSCAEHLIACEAD 
LNAKDREGDTPLJIDAVRLNRYKMIRLLI 
MYGADLNI KNCAGKTPMDLVLHWQNGTK 
AI FDSLRENS YKTS R IATF 


13569 


27470 


A 


13724 


400 


135 

* 


KWNGKBLNGVDGIGWDWNEMYWKGLEFH 
ETEWNBLERNGLEWNGMSWDGIKCNGIK 
WNGIECSGMLWNGMECIGMECPLMEWIR 
VEWN 


13570 


27471 


A 


13726 


186 


344 


S ASLGLWR CRDCRRS L VHS VLNVAQAG V 
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NO: of 
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sequence 
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SEQ ID 
NO: of 
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sequence 

• 


M 
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SEQID 
NO: in 

TIC CM 


Predicted 
beginning 

M ■ • aI A A Hfl A 

nucieotiae 
location 
correspond 
ing to first 
amino aciu 

residue of 

f\ an +ttt A 

pepuue 
sequence 

* 


Predict- 
ed end 
nude- . 
otioe 
location 
correspon 

A\w%n 4-rt ' " 

ding to 
last amino 
aciu 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 

OCysteine, D=Aspartic Acid, 

cj— uriuiamic aciu, r— sr nenyiaianine, 

§ — ■# ^ ]ir/t|M A H 11 Sri 1 ifll gift T T^A Iaii Al n n 

v> — oiycine, n— nisiiuiue, i — is o leucine, 

K=0Lysine, L=Leucine, ^Methionine, 

N=Asparagine, ^Proline, 

V£ — oiuianiine, tv^Arginine, 3 — oenne, 

T^Threonine, V=Valine, 

w — jrypEupnan, i — tyrosine, • 

X=Un known, *=Stop codon, /^possible 

nucleotide deletion, V=possible 

nucleotide insertion 
















13571 


27472 


A 


13727 


380 


168 


APPQKKVFSSKFPQGFFFAPPNRKKLFF 

O DPP Tf TITO DD VT?TTT .tJ 7A DDT. T?I?T?T7I?W T7CT? 

FFFFFFWYSRTDYK 


13572 


27473 


A 


13728 


324 • 

* 


219 


RIMFFIFFYFFFFFFFFVFFLFFFFFKR 

DDDDVV 


13573 


27474 


A 


13729 


2 


270 


RIjQEASLS P VPE I LGGALPSAKRTS YLL 
IjSTJjL.JLiJjLic>il lAoRGNoLTGLiGHRSDHY 
NS VRSGGQ SLySACP I VTT IHGTCYGGK 
AKCCK 


13574 


27475 


A 


13730 


134 


3 


AVHRCKKWGRGWGRWLTPVI PTLWEAEV 
GGPLEPRSLRPAIiAT 


13575 


27476 


A 


13731 


429 


641 


RKKAVCFMNDL I C FLDNTFKNNVLSQAW 
WCVHLVPTIWEAEAGGSIjEPRSLKLQCP 
vvapvknctpawat 


13576 


27477 


A 


13732 


188 


32 


I FFFFLFFYFLFFFFFFFPRSRHCTLQP 
GQQSBTRSQNKKQKTKNKQTKNTjP 


13577 


27478 


A 


* a A 

13733 


341 


3 


GTVTPEE PKKMGDQTPRGLPRPERTLGR 
LSTSBSRVDPTPRVGGGSSGGRTAPWGP 
PRERGRVAGSSLHRACNFSTSNTIISVG 
LWLTPVI PTLGEAE^GGSPEVRSSTHAS 


13578 


27479 


A 


13734 


208 


2 


KLYKNFFFRFTKNPPPLLKKGFYLYNFF 
FKKKKKIVKKKKKKKKKKKKQKKKKKKK 
GRPLAIKNYPRV 


13579 


27480 


A 


* a a ^ 

13735 


16 


212 


EGVSLLLPRLECRGTISAHCNLRHRGRR 
CTPAS AS Q I LRRL KQENHLNLGGGG YRD 
PIKLQRTAS 


13580 


27481 


A 


13736 


22 


A • ^ 

214 


pdrpalptrpggw i twgrevrdq pgqcg 
ETPSI^KIEKLAGCDGTCLEQNCXjQNNN 
SNNKvGIR 


13581 


27482 


A 


13737 


386 


51 


WGNS I TQQALFBHLCFSGHTS LPAVPCP 
HQACFPTLPLICRGI I IAHCNLGLU3S S 

r\ririT C TV O r»TrV">TlT«TV^T unuKnr DP TUT T f.TV 

DSP 1 S AS P I DGNTGIjH PHAKJU FS FLiLWY 
RWGLTMMPKRSLS PGLKPSSRLADAWV 


1J5o2 


27483 




1 1 *T"T O 

13738 


204 


431 


LYKYKKINQGWAWWLTPVI P PLWEGQVG 
GSPEGKELKTPLRNIjGKPHL 


1 KOI 

14585 


274B4 


TV 


13739 


69 


159 


E 


Ii3o4 


274o5 


TV 

A 


13740 


51 


IOC 

285 


r r JLkb cr Jjr r r r r r J. KJsJ? r SUU 
KSPQNLFFLKKNPPPFFFPPPKKPIiWGF 


13585 


27486 


A 


13741 


372 


213 


yrscvllqqnpplfyqswwnfslcvcvc 

17T*\/f , \7l* , Vn?TY2 QTiTOT17T »T ZIYI WIT1T1? 


13586 

■ 


27487 


A 


13742 


29 


237 


TKRKG yktnkgywawawwltpvi palfi 

AEGNLILQMQKQAQPEAGCX3CVCVCVCA 
CMCVCVCVFKVFF 


13587 


27488 


A 


13743 


322 


183 


HTHTHTHTHTHTHTHTNP IHVFWT >T iT iKY 
P K T OSHOFHISAFODPIjTj 


13588 


27489 


A 


13744 


305 


373 


VGWGWWLTPVI PALWEAKVGGS 


13589 


27490 


A 


13745 


172 


1 


KNKIEEQMKWEMRQEAVAHSQJPSTLGG 
QGEFETSLVNMMKPCIiYKKMQKISQVCS 
C 


13590 


27491 


A 


13746 


285 


1045 


FQHPFGLS QS EMAAVKASTSKATRPW YS 
HPVYARYWQHYHQAMAWMQSHHNAYRKA 
VESCFNLPWYLPSAIiLPQSSYDNEAAYP 
QS FYDHHVAWQDYPCSS SHFRRSGQHPR 
YSSRIQASTKEDQALSKEEEMETESDAE 
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SEQ ID 
NO: of 
nucleotide 
sequence 

* 


SEQ ID 
NO: of 
peptide 
sequence 


M, 
etb 
od 


SEQ ID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

mm ill aa4I ^1 

nucleotide 
location 
correspond 
ing to first 
amino acid 

^^***m A jI ^ ■ ^» 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
C=Cystelne, D=Aspartlc Acid, 
m<— u ramie acio, r— rnenyiaianine, 
G=Glycine, H=Histidine, I=Isoleudne, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
l—l nreomne, V— valine, 
W=Tryptophan, Y=Tyrosi ne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=*possible 

.....Ui>l!J. * ' * 

nucleotide insertion 


■ 












VECDLSNMEITEELRQYFAETERHRBER 
RRQQQLDAEKLDSYVNADHDLYCNTRRS 
VEAPTERPGERRQAEMKRLYGDSAAKIQ 
AMKAAVQI^FDKHCDRKU^^ * WF VI PIiK 
F 


13591 

• 


27492 


A 


13747 


2 


305 


GRVGS FS VRD VELSDPARERGEMPVAVQ 
PYGQSQPSCFDRVTCMQrVMUtAVGWAAQ 
ALFGTFSCLRIGMRGRELMGGIGKTMMQ 
SGGTFGTFMAIGMGIRC 


13592 


27493 

* 


A 


13748 


2 


305 


GRVGS FS VRDVELSDPARERGEMPVAVG 
PYGQSQPSCFDRVKMGFVMGCAVGMAAG 
Ala FUTr S CLKlGMKGR&.LiNiiG iGKTMMU 
SGGTFGTFMAIGMGIRC 


13593 


27494 


A 


13750 


238 


423 


AVSWDQLTWGTGVQEKKIQAQAWGLMPV 
I PTLWEGEVGGSPEVKSSRPACPHGKTP 
FXLKMQ 


13594 


27495 


A 


13751 


455 


248 


ISVGPGIFQLNFCFLEPYGTFLSEKKFF 
IMETGSRFVIQAGVQWGS YSS PQPQFFG " 
FESSLKPPHSGVK 


13595 


27496 


A 


13752 


2 


98 


IP TP T IHTHTHS HTHTHTHTHTS FLYMP 
PDLK 


13596 


27497 


A 


13753 


196 


2 


IWAPPKIFLKKPPLFFFFFFFFFFFFFF 
FFFFPFFI ISFIPWPLTRKQKLERSWVF 
FKDSACSA 


13597 

■ 


27498 

• 


A 


13754 

■ 


917 

■ 


379 


KYKKCS LQKNLLLVGCKKYSLCYRRHKL 
HLVTHGERKKPAAVNSFFLFFS FFFPLN 
LVRNTE ITKKRVNLHETKADAES CNQDC 
TTSS ETAS EIiEQ I R SGKHNSGKWAGEGA 
AGgRREGERMEWTEMRRGARGRRGGRBR 
EMERARVRGGEERE KE IDL YKKVTSKI E 


13598 


27499 


A 


13755 


175 


21 


KKKNFFFPPRVILGPPKVFFKRAPIjFFF 

E'E'PPPPPPPPgPPT WCUDDT TV T 

c f r r ccc c rr r rr LKKSWKlrliAj. 


13599 


27500 


A 


13757 


234 


341 


E IGWVQWLMPVI PAVWEAE VGGLLEPRS 
LRAARAI 


13600 


27501 

« 


A 


13758 


1 


365 


PAPNRRGGHI QDRATNSTE LGANQCF FF 
S PRPPSLEKKTPE INKEPRPAPQPSNPG 
NDGTREGGDSWAGTTRCLRRTDEGDTYR 
1 E»Jr f 1 AJjoWUy LblAc c rVVAJurALjJSJSitti 
RNLLKTQFF 


13601 


27502 


A 


13759 


87 


181 


SHTHTHTHTHTHTHTF YLHWIHFDME I 


13602 


27503 


A 


13760 


1 


228 


ARGERERERERERERERERERERERERE 
KxKnWjJCULil-jl MbliVjh 1 tKAl iril ire* 
PSIKAWRLCEPRPRAGILCX3RNIF 


13603 


27504 


A 


13761 


383 


197 


RCD P PAWVSQSARI TG VS YRAQ PATS VL 

Mr*l?17TM?T CTMTIT7T 1\ fT7T3 T.7\ T3ft\7T T O VT.Ot? 

VSVAMT 






A 




Z /O 


J / 


InV iAJUOOuCUUiijr V C V C C C C C C V c c 

FKKQGFGLMPIiGEDRGPNMGFFYFPMVG 
DKTKLPFKKKKKKRPR 


13605 


27506 


A 


13763 


377 


209 


PRP AYGPAFLFTEGFS WNP F F FFQMES R 
S VAQAGVQW CYLGS LQ PPP P WFPGTLLN 


13606 


27507 


A 


13764 


160 


202 


MEKYNVHPHSGILHSHEKEQAALFTIAK 
RWRQPS YPS IDE WKNIMS IHTVEYYTAM 
KRNKQLYSQ 


13607 


27508 


A 


13765 


390 


284 


ESGGHFLS LSLS I Y I Y IHI YI YVCVY I Y 
THTSHTI 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQID 
NQ: of 
peptide 
sequence 

■ 


M 

eth 

od 


SEQID 
NO: in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acia 

residue of 

peptide 

sequence 

< 


Predict- 
ed end 
nucie- . 
otide 
location 
correspon 
oiog to 
last amino 
add 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
Hi— vTiuiainic ACiu, r— rnenyiaianine, 
G=Glycine, H=Histidioe, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
i^^vjriuianiine, rv = Arginine, exocrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
a— un Known, — otop couon, /=possiDie 
nucleotide deletion, V=posslble 
nucleotide insertion 


13608 

• 


27509 


A 


13766 


350 


41 


YWILQENRNTQNRKLLPKSTELCMLLQ 
TGS FYVTQAKVRWLFTGMITVHCSLKLL 


13609 


27510 


A 


13767 


330 


156 


KITQAWWCVHWPDNREABVTGVIRLGR 

nKJjJc'Juav7nL.l£4v«VJk'OnOl\iSUL>£' VoxlINJalS. 

K 


13610 

• 


27511 


A 


13768 


25 


292 


KCFFLSWRGGSRLQSQHFGRPRRADHLR . 
PGVPDQPGQHGFLLVKMSDKPDLSEVBK 

VNIQD 


13611 


27512 


A 


13769 


585 


680 


KCLGSRTRWLT P VI P TL WE AEAGGSLRP 
RSS 


13612 


27513 


A 


13770 


488 


359 


PSPRERKFLPFFKKKKGWGPPPPPKKNR 
GRGPQQKEGPQKPKP 


13613 


27514 


A 


13771 


794 

• 


515 


PDMGLEDEQKMLTESGDPEEEEEEEEEL 
VD PLTT\n<EQCEQLEKCvKARERLELCD 
ERVSSRSHTEEDCTEELFDFLHARDHCV 
AHKLFNNLK 


13614 


27515 


A 


13772 


130 


397 


WGLTPLCFCMRAS L PRHS KPQARKTKK 
KKKKKKKKKKKKKGGSLKKKI FGGAKKS 
GATKKKNFPKKRGQNKKNPGFFKNEIFF 

GGGAl 


13615 


27516 


A 


13773 . 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNKIPLELFPSNQTFFHFVRNWKKRGNN 
WLSNYS I S STVTRHFPTLFQF 


13616 


27517 


A 


13774 


1 


247 


GLQSLADLLSGLLQKVCLPLSNGIMDIiY 
LiLYYLFTFETGS CSVAQARVQWCEHGSL 
QPHTS AS SDPPTS ATTTCHHARLI IW 


13617 


27518 


A 


13775 


*>fe 

376 


145 


TRVGGGCS EQRLCHCTPS WATEPNPVSN 
NNKI PLELFPSNQTFFHFVRNWKKRGNN 
WboW YS 1 SST VU Kttlf JrTIiFQr 


13618 


27519 


A 


13776 


107 

« 


468 


EKARS PERARDQEGGTERDRNSEREKI L 
PKLREELPWVSGGWRCPWPHQGWSHWED . 
NPG WG I PTGPS VG WGE KKG PGEGRSHKY 
GTGRTKCELGVSiGNSAFTFLLHFYFKH 
RKRERKQI 


13619 


27520 


T4 

A 


13777 


125 


2 


NIr r PiirELY xr Kv-liWARWLTrVI PALW 
EAKEGGS PBVRSS 


13620 


27521 


A 


13778 


166 


74 


GRiCYSHSLSKHTHTHraTkTHTSkiiSF 
IP 


13621 


27522 


A 


13779 


1 

• 


341 


ARGERERERERERERERERERERERERE 
RERERERERERBRERERERERERERERE 
RERERLS FSLGGGGALKKKRI FLCVTLE 

fVZXT tStlT TlWY^PPr'PM'I'UTT CMVPD13W< 

GKHJjPPJjTnTVar r GJ^THTJjoHKTKEXjG 
FICAGGDERRSAS VMRAYI YRECE PHTH 

V 


13622 


27523 


A 


13780 


"614 


337 


RRCSSALC YRRHGNHKVKIRS KQGASVS 

MYVCMYVCMYVCISIYLSIYLSIYLSIY 
LSISISSR 


1 3623 


27524 


A 


13781 


390 


158 


VLIRLS WGRI C FQAHSGCWQNSGS CSHK 
TEGLCLLLAGSCSWLLGGALKGWASKEE 
FYTMQHS CQSDTLSPLPYS IH 


1 3624 


27525 


A 


13782 


308 


163 


TKFYLLFFFFFFFIiFIFFFFFFFLKKKF 
LFFIQGGGKIKTPGTIjTFLG 


13625 


27526 


A 


13783 


45 


277 


IASGRPFFFFLHLPPFQKAFVFGGGGGT 
PFEYQNFVAYIKGKQGNPFLFGCGDLFN 
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SEQID 
NO: of 
oucicoiiue 

i 

Self UCtlCc 

'l i 


SEQID 
NO: of 
pepnoe 

S6«|U6I1Ip-C 


M 
eth 

nrf 


SEQID 
NO: in 

09/515 1 

26 


Predicted 
beginning 

UUHCUUtlC 
IIH.allVM 

correspond 
ing to first 

ftminA ariH 
niiiiliv mill 

residue of 
peptide . 
sequence 


Predict- 
ed end 

IlUUC* 

Afiria 
vuuc 

location 
correspon 

Hino tt% 

Ulllg IU 

last amino 
acid . 
residue of 
peptide 

OCQU CIICC 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 

tT— (^Iliitji mir Arid F=Phpnvlnlnntnp 
XV — \jt u xa lint /\x. i U ^ r — x iicti ^ uiiaiiiuC} 

0=f»lvrinp> Hsffistidine. I=Iso leucine 
K=Lysine, L=Leucine, M=Methk>nine, 
N=Asparagine, P=Proline, 

^Threonine, V=Va!»ne, 
W-Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 

uuLicuiiuc inocriiiin 














. APQL I KQLSQLGQKY PKKPUi 


13626 


27527 


A 


13784 


413 


218 


PNFPPFLFFSGKFFFFAQNLPFGFFPPP 
GEGKAFPGQGCPVGGGGFAOPTFFFGPP 


13627 


27528 

* 


A 


. 13785 


1011 


1368 

* 


IDWENDLTPMVLCHGPSIYNKYSFSQNI 
FPISANFFFIAHINLTLLQLVSELSHKI 

T T .P WF V T FDT JTHT T .WT .VT i TT. MUUT .PHT . 

CKSTAICRYLCVCIYIHLYIHTHTHIHI 
YTHIRL 


13628 


27529 

* 


A 


137oo 


161 


425 


DiarJQPVPDl^lVDUT.VT FFT.T.T.T.T.VTA/T.FM 

GVLPLLPRLECSGMITVHCNLEFLGTSY 

DDT <? VQOTKTTTf ICR P FTT,PPiYPWU"lTnT.P 
LRGE . 


13629 


27530 


A 


TOT 

1 3787 


70 


31 1 


r>T.TQTT2ir.\7TC;DKrVB T VTVT..QT VT.QTVT..C 

IYLSIIYLIFISTNLLWVLSLWETLNKT 


13630 


27531 


A 


13788 


25 


219 


RNMAAATLTS KL YS LLFRRTSTFALTI I 
VGVMFFERAFDQGADAIYDHVNEGKLMK 
HIKHKYENK 


13631 


27532 


A 


13789 


279 


138 


RRSLALSPRWDCGLQWRNLGSLQALLPG 


13632 


27533 ■■ 


A 


13790 


138 


28 


GG WVR WLTPV I PALWBARVGGLXiBPGSS 


13633 


27534 


A 


13791 


133 


7 


YSCQRCLTHGQAQWLTTVISAPWEAKAG 

PTT.F PP. ^ VR ^ ZV W A T 


13634 


27535 


A 


13792 


86 


290 


EHVHRTL I EASVSG WGWWLTP VI SAAWE 
TTESLGSFAATS 




Z O JO 




13 /y3 




3 / 


PCT T YPSOARWLMPVI PALCEAEVGPJLT.E 
PSSSRlxAWAV 


13636 


27537 


A 


13794 


216 


77 


PTQHPTEG YMI KQQTVYHBCRMWANSHL 
(^TjPEGIiljRAVTPWCHAP 


13637 


27538 


A 


13795 


251 


347 


VARAWGLAPVI PALWEAEVGGSPEGRSL 


13638 


27539 


A 


13797 


177 


274 


VQWHMS WPAI QEAEVGGS PE PRSLRPA 
WATY 


13639 


27540 


A 


13798 


307 


406 


KFKKGWAWWLMPVI PALWEAKAGGSPES 
RSLRP 


13040 


27541 




13795* 




3 


wpw ff p fp ppMn h YTvnAP vnwr.WTDix 

crrrtr re uAjnv^onx x vy>ti\vynur jli/h 
NIVHCSLQLIASSDPPVSTSQVGLQACA 
DDAQNPE 


13641 


27542 


A 


13800 


333 


198 


LPSVFFFFFQTESCSVTQAGVEWC3DLGS 
LHSSLGNHARIiCLKIK 


13642 

• 


27543 


A 

■ 


13801 


I 

■ 


335 


ETERGRERETERGRERETBRGRERETSR 

GKKNRDREENRRRETERVGDWCVCPHP 
NLII^C5PYISHVliREGx73KTYLNYGGS 
cirlxiV-oly 


1 J 0*0 








14 




RVFFDRS RYRSRTLGS THASALLGI LVY 
RSHLISSLLCLEGI ILSLFI I ATLITLN 
THS L LANI VP IA I L VFAACEAAVGLALL 
VS KKKKKGAPVL KN P WGAQS LRGQARKY 
FFPYREPNKNLPGNLGKEPFLGGGDIIiG 
QPPYKN 


13644 


2754^ 


A 


13804 


1474 


1661 


TDFHNSHLLKCCKNCI FSTLNYYI RREH 
FSIWIFCYICVVTCVHYIRRELFRCLYF 
LLYMCC 


13645 


27546 


A 


13805 


360 


3 


LWGKRGGPFSPPPFFFFLBLFFFIxKKMG 
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SEQ n> 
NO: of 
nucieotiae 
sequence 


SEQID 
NO: of 
peptiae 
sequence 

■ 


M 

eth 

oa 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

■I u cieuLiQc 

location 
correspond 
ing to first 

residue of 

peptide 

sequence 

* 


Predict- 
ed end 

nut-ic"" 
otide 
location 
correspon 

rlino tn 
UHlg LU 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 

17s=r^hitnmir A riff IT^Phpnvlnlfliiinp 

Gr^UIUIHIIIII. £\K-llMf JT — A HGII JF IAIHII 11)6} 

G=Glycine, H=Histidine, Msoteurine, 
K=Lysine, LHLeucine, M-Methionine, 
N=Asparagine, P^Proline, 

0=r^liiinminr» "R=A r<jininp_ S=Sprinp 

VJ I U Kt\ III Hi C, n ,£\m gllllUV, U^ytl IIIC, 

T=ThreonIne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=S>top codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion j 














nan f f p wrcp p kyzp t p f fv f pw a po k 

KKKKKP P P F I FWGGGPPP FFFFPP PKQ K 
KKKNKJOCKKKKQKXKKKKKKKRAARWPA 
RSRSSA 


13646 


27547 


A 


I38U6 


t 
l 


AOS. 


DT ft!rtVAT.DGaaTV3aP\/CflPPPf!nTfT,T.PlJ 
£\J_iv7V7 VJUinonnUunr VOunr V^.vjVjI\ Li 1 lK. w 

CLVTDFPPDSCTTCSYSRRSTPGCSPQQ 

LYLLGWLTLLS I FVRVMES LEGLLBS P j 

SPGTSWTTRSQLANTEPTKGLPDHPSRS 

M 


13647 . 


27548 

i 


A 


13807 

i 


5 


313 


EKPYLQGITFTSKTTHLIYEFTPYTI IV 
KEYSTNYVLLIGNTQITKIYHIVFRKFS 
MT P I YKFQDLAQ WLTPVI PTLWEAKPGG 
IiLEPRSLQSS YSL 1 1 L I L 


13648 


27549 


A 


13808 


182 


314 


V I TQLD KTE RAQ W LMPVI LVL YEAEVGG 
LLEAMS S RPAWAT KTW 


13649 

■ 

■ 

a 


27550 

• 


A 


13809 

■ 

■ 


219 

■ 


1812 


LPPPESGAMSGFNFGGTGAPTGGFTFGT 
AKTATTTPATGF S F STSGTGGFNFGAP F 
QPATS T P STGLFS IATQTPATQTTGFTF 
GTATLAS GGTGF F LG IGAS KLNLSNTAA 
TPAMANPSGFGLGSSNLTKAISSTVTSS 
QGTAPTGFVFGPS TTSVAPATTSGGFS F 
TGGSTAQPSGFNIGSAGNSAQPTAPATL 
PLTPATPAATTAGATQPAAPTPTAT ITS 
TGPSLFAS IATAPTSSATTGLSLCTPVT 
TAGAF TAGTQGFS LKAPGAASGTSTTTS 
TAATATATTTTSS STTGFALNLKPIiAPA 

(jJ..foJM I AftAV 1 AfFijlAjAA/UxrUViDoAn 

TYAQLESIjINKWSLELBDQERHFIiQQAT 

DQKRLDQELDFILSQQKELEDLLSPLEE 

DAQLKRMAQDLKDI I EHLNTS GAP ADT S 
DPIiQQ I CKILNAHMDSLQW I DQNSALLQ 
RKVEEVTKVCEGRRKEQERS FRITFD 


1 36 jU 


27551 


A 


1 JolU 


1 1A 
1 J4 




GGLLE PRGLRPTWVT 


13651 


27552 


A 


13811 


210 


51 


TLSHRKPISFEAYFARVRGFIIiEVSETR 
NPPIGWTRWLTPVI PALWEAEAGG 


13652 


27553 


A 


loot o 
13812 


OA 1 


1 10 


T ■HP AflT i A PM T P AT iWR VKVGGIjTjRPRS PR 

PAWASW 


13653- 


27554 


A 


13813 

- 


539 


256 


RTTQMSTAAGFTEAPNWKQPRCPWTEEQ 
TWKMM Q T.WAMPCASAMKWNEVLTOPAVR 

MARENS RRKPGDMHD I WS VWSAGEWL PG 
TjGRSTGKGSD 




Z7555 




1 JO 1 J 


CO 




VRETPLKTH YLEE I S SPAS PTAI POS LL 
FSFFISPPSSLATGSGHSGHPVHSIiHHP 
PETEPSPSVCLWAGPKVPPGAAGKGSPD 
SNPLVIRSLLAPPPASL 


13655 


27556 


A 


13817 


258. 


3 


AESAP PFSTNSLFFTHTHTHTHTHTHTH 
THS LRTTRWAPS QVCRPQTS QRGGECVG 
PTAFAP S PTLLKPHHPS S HVHLPSQPRR 


13656 


27557 


A 


13818 


516 


665 


WFKSGSFWLGMVAHTCNPSTLGGQGGWI 
TSGRSRTS VTTTTTS QTCAPA 


13657 


27558 


A 


13819 


295 


361 


WLTPVI PALLEAVTGGSLBPRS 


13658 


27559 


A 


13820 


166 


368 


GWWATQHSAQYWSSSSIGALCIiGESCS 
VTQAGVQWCDLSS PQPSHPGFKRFLCLS 
LPSSWDHRGDL 
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nucicoiioe 

sequence 
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SEQID 
NO: of 
pepiiuc 
sequence 


M < 
eth 

uu 


SEQID 
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09/515,1 
26' 


Predicted 
beginning 

n n j*Ia/vH rfa 

11 ULICUUUC 

location 
correspond 
ing to first 
amino acid" 
rcsiuue oi 
peptide 
sequence 


Predict- 
ed end 

IIUUC" 

otide 
location 
correspon 
ding to 

MSI HUM Hf 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A B Aianine 
C=Cysteine, D=Aspartic Acid, 

E=falutamic Acid T^Phenvbtlflninp 

*J \JIUMIIIJIV /ttlUj Jl M UbU f M Mill IUIm 

G=Glycine, H=Histidine, I=Isoteudne, 
K=Lysine, Lr=Leucine, M=Metbionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 

1 ~~J 111 Cull 1 IIC, T ""V OllUV| 

W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


13659 


27560 


A 


13821 


133 


530 


. RKYCGQCLNMALNNVSLSSGDQRSRVAY 
RSSHGDLRPRASALAMVSGDQFLVSRPE 
AIHLGP RQAVRP S VRAES RRVDGGGRS P 
REPDGRGRSRQARFSPYPIPAVEPDLLR 
SVLQQRLIALGGVIAARISV 


13660 

• 


27561 

* 

t 

i 


A 


13822 

■ 


76 

- 


1958 

■ 


RQELI WPLCSPPQGDRFLQKSWI FFRPV 
MADKLTRIAIVNHDKCKPKKCRQECKKS 
CPWRMGKLCIEVTPQSKIAWISETLCI 
GCGICIKKCPFGAIiSIVNLPSNLiEKBTT 
HRYCANAFKLHRLPIPRPGEVLGLVGTN 
GIGKSTALKILAGKQKPNLGKYDDPPDW 
QEILTYFRGSELQNYFTKILEDDLKAI I 
KPQ YVDQ I PKAAKGTVGS I LDRKDBTKT 
QAIVCQQLDLTHLKERNVEDLSGGELQR 
FACAWCIQKADIFMFDBPSSYLDVKQR 
LKAAIT I RSLINPDRYI I WEHDLSVIiD 
YLSD F I CCL YGVPS AYG WTM P FSVRBG 
INI FLDG YVPT ENLiRFRDAS LVFKVAET 
/^NEEEVKKMCMYKYPGKKKKMGEFELAI 
VAGEFTDSE IMVMLGENGTGKTTFIRML 
AGRLKPDEGGEVPVLNVSYKPQKISPKS 
TGSVRQLLHEKIRDAYTHPQFVTDVMXP 

T OTPMT TfV^t?tfrt>Pf OfV' BY />T5 "I TT\. T T ^IT 

JjQIENI IDQEVQTIjSC3GBIjQRVAIiALCIj 
GKPADVYLIDEPSAYLDSEQRLMAARW 

I VFDGVPS KNTVANS PQTLLAGMNKFLS 
QLE I T FRRDPNNYRPRINKLNS I KDVEQ 


13661 


27562 


A 


13823 


3 


292 


KCWDYVHEPPRPAPMQTYQVDLRCCFVS 
TINLLVCGERAS LP S VESL VFSGLAE VW 

ARPACRGLVFSA 


13662 


27563 


A 


13824 


245 


91 


YKVNFGPPRGSFFRPPPFFFFEVFHFFF 
I IFFFFFFFFFFFS FFLGCFMYV 


13663 


27564 


A 


13825 


276 


100 


PGQ K KKTLS P KKKKS I YLY I HTHI Y I YT 
HTHYI YKYIFVCVYICVS I YIHTHIVPR 
IFS 


13664 


27565 


A 


13826 


202 


375 


IYOTYIHTHTHTHTHTHNVYI.YIQSGDC 
IYAVLLGELSFIYIFVFLSSLISILFHL 
TY 


J J 00 J 


2/jOO 


a 

A 


lJO£f 


9751 
X to 




KPRGLS PAGVSAORAWGPHS CPRLPD PR 
TELQVRQ I PALVGS S AFQGRRGRKGPVG 
GSGP SRPRAPPQ PRPGCCQCTLLQI 


13666 


27567 


A 


13828 


62 


307 


GGARAI FLNVPHLKRPMFFLELCLS TKC 
PVGTEFALDSLLRLYNSNTFCWVQWLMP 
TI PALCKAEMGGLLEARSLI PAWAT 


1 JDO/ 


Z/Ooo 




l Jozy 


joO 


4/5. 


MR TGMV«5HACNP STI^fiGOGRRIMRSGDO 
DHS 


13668 


27569 


A 


13830 


364 


484 


NFNI WLG WAW WLMP VTPALWEAKAGGS L i 
EVRGSGPAANMV 


13669 


27570 


A 


13831 


33 


486 


PARSAEFGTRRERERERERERERERERE 
RERERERERERERERERERERERERERG 
VTCVCPCAIiCVR^SDPLSQKKYCVCVF 
VWYTESLALSPFO^CLALLIFLEGM 
WRKIWAPSFLSLCVGALFFSQHPPPFWC 
LCVLS LVRPLPLS LSGGVFVSQRALI 


13670 


27571 


A 


13832 


6 


718 


YSAVE FAMAGVGAG PLRAMGRQALLLLA 
LCATGAQGLYFHIGETEKRCFI EE I PDE 



1378 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

• 


M 
eth 

• 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 

* 

nude- . 

otide 

location 

correspon 

ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid, F-Phenylalanme, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L^Leucine, M=Methionioe, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


- 


■ 




■ 


■ 




TMVIGNYRTQMWDKQKEVFLPSTPGLGM 
HVE VKDPDGKMLQ WLSRQYGS EGRFT F 
TSHTPGDHQI CLHSNSTRMALFAGQKLR 
VHLDIQVGEHANNYPEIAAKDKLTBLQL 
RARQLLDQVEQIQKEQDYQRYREERFRL 
TSES TNQRVLWWS IAQTVIL I LTGIWQM 
RHLKS FFEAKKL V 


13671 


27572 


A 


13833 


17 


130 


RLQE FGTRRERERERERERERERERERE . 
RDTHSFLHG 


13672 


27573 


A 

• 


13834 

i 


283 


460 


LWRLAICKNYSLKRETEPGSVAHVYNP 
STLGGQGGRTARGQEFKTSLDNIARPPS 
LQK 


13673 


27574 

* 

• 


A 


13835 


3 

• 


435 

• 


WPRFCTALQEFGTRRERERERERERERB 
RERERERERERERERERATS LSRARP S F 
LFPSACVSHTPYVGERENLSLHTXiKHTH 
TREKHSLSYTNASARDTRARFSEGRAPP 
IYSVYLTHGVFFFFFFSLCLSVRHAEGL 
AHV 


13674 


27575 


A 


13837 


351 


57 


RTLVH I S KFTGNLS DL YYVQS E WQDLGT 
ADLSIMTSVLHCLPSLESIKNTDSWPSS 
VAHTCOTSTLGGRGRCRDQHGQHSKQPS 
LLKIQKIARRCGG 


13675 


27576 


A 


13838 


336 


150 


TFCYKKYUWGAHCFIFRTTGLP IKNFYF 
GQAWCLLPVISARWEAETGGSLEPRSLR 
VAWAS 


13676 


27577 


A 


13839 


310 


454 


HCS LGLPQVLL 1 1 LYKILLGWARWLTPV 
I PALWE VEAGGLLEARSSRP 


13677 


27578 


A 


13840 


163 


297 


I I FFLLDLKLQGAMTI YVS IYLSIYLS I 
YLS I YL YTHTLWNTLQL 


13678 


27579 


A 


13841 


33 


316 


LDQHPTPRSPLLCHSLRKTSSSQGGKSE 
LVKQSLKKPKLPEGRFDAPEDSHIiEKEP 
LEKFPDDVNPVTKE KGGPRGPE PTRYGD 
WERKGRCIDF 


13679 

♦ 

■ 


27580 


A 


13842 

- 


4474 

• 


2586 


DGGSGCVKMEFPGGNDNYLTITGPSHPF 
LSGAETFHTPSLGDEEFEI PP ISLDSDP 
SLAVSDWGHFDDLADPSSSQDGSFSAQ 
YGVQTLDMPVGMTHGLMEQGGGLLiSGGL 
TMDLDHS IGTQYSANPPVT IDVPMTDMT 
SGLMGHSQLTTIDQSELSSQIjGLSLGGG 
TILPPAQSPEDRLSTTPSPTSSLHEDGV 
EDFRRQLPSQKTVWEAGKKQKAPKKRK 
KKDPNEPQKPVSAYALFFRDTQAAIKGQ 
NPNATFGEVS K I VASMWDSLGEEQKQI Y 
KRKTEAAKKEYLKAXiAAYKDNQECQATV 

ETVBLDPAPPSQTPSPPPMATVDPASPA 
PAS IEPPALSPS I WNSTLS S YVANQAS 
SGAGGQPNITKLIITKC3MLPSSITMSQG 
GMVTVI PATWTSRGLQLGQTSTATIQP 

£»/"WM» /\T1T*T»TaoTTT /*\1V IV TV IV IV 7V IV TV TV OUftf 

S UUAQ J- VTRS VXiQAAAAAAAAAb MQliF V 
PRLQPPPLQQMPQPPTQQQVTILQQPPP 
LQAMQQPPPQKVRINLQQQPPPLQIKSV 
PLPTLKMQTTLVPPTVESS PERPMNNSP 
EAHTVEAPSPETI CEMITDWPEVESPS 
QMDVELVSGSPVALSPQPRCVRSGCENP 
PIVSKBWDNEYCSNECWKHCRDVFLAW 
VASRNSNTWFVK 


13680 


27581 


A 


13843 


54 


251 


EFYRMNSPSPLWLCVCVCVCACLFMCLC 
ALTCMCVWYKSLLHSNLTQVLSSKFYDS 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 


M, 
eth 
od 

r 


SEQID 
NO: in 
USSN 

i 

09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

- » Arm j 

mg to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=GIutflmic Acid, ^Phenylalanine, 
G=Glycine, H=Histidme, J=Iso leucine, 
. K=Lysine, L=Leucine, M=Meth|onine, 
N^Asparagine, P»Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosme, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possibte 
nucleotide insertion 














ISPAENPGK 


13681 


27582 


A 


13844 


154 


26 


DPHVPPTERSEGSQARWLPPVIPTLWEA 
EAQGS PEVKSSNST 


13682 


27583 


A 


13845 

• 


475 


225 


GHTELYGGALFQLLQAAI PEGGGTSCLF 
NPSSDGGS S AAGGWGGAGDLARSSRVP 
SPTSSRPVPSPDPSPPSLYLTPWKETH 


13683 


27584 


A 


13846 


479 


319 


RDCRRS LTHSVLSG AQAGVQ WRDLGS LQ 
PPPPSRLPWPPKAPRWQPLPGHHPI 


13684 

• 


27585 

■ 


A 

• 


13847 


182 


499 


LLCVKLCDS CNENKALREQTRAQVQGGH 
SRAQVQQVQGGHSAHTWQ IRSAASRRK 
AFSTCSSHLGMVLLFYGTGSSTYMRPTT 
RYS PLEGRLAAVFYS I LI PTLN 


13685 


27586 


A 


13848 


388 


494 


RDSWERWLEPLI PVLWEAQAGRLLEPRS 
LRLAWAT 


13686 


27587 


A 


13849 


198 


63 


HHATTWMNLEDIMFRKVSQTQKNKYYMI 
PLIWKTKKVLSHREQK 


13687 


27588 


A 


13850 


555 


353 


RCPPGAQAPLPAPSPAPPVHIALSPLSQ 
GVSWPRELTAHPPEPPPPPPFKIHPHPQ 
LSVGKRGQMTP 


13688 


27589 


A 


13851 


169 


66 


LLAGSTGQARWIiTPVI PELWKAEVGGSL 
EIRSSR 


13689 


27590 


A 


13853 


12 


105 


IASGLHDFFKKKKI KKKKKKKKKKKKGG 
AL 


13690 


27591 


A 


13854 


256 


363 


ICLFI KDSDRGQTQWLTPVILTLWKAEA 
GGSLEPKS 


13691 


27592 


A 


13856 


23 


436 


IDRLS DSHE ERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERESSLLCVRIITCFAPPTYIFLWETLR 
VCPTP CVWGALS VFS HTERAR I FFYTCG 
GS VARAP VCAHTRI FS SRARARALCARP 
PtSRERRAHTRVFFI 


13692 


27593 


A 


13857 


91 


428 


PDPDTSPLTTDRGETPLGATLPSCYCX3G 
QQ S FE P KTKKTTNQ KKKKKGKKFT PQKK 
NPKIFPPKPFCPKGPRGENQPKKTQNQP 
KKKKKI LGGGPAPNS PLGGGKKNS WAGF 


13693 


27594 


A 


13859 

• 


465 


43 


KRVTTENPQHICSAETAIPWSIiSQNCNF 
HILWWGHPNHPIYITTSIYIYIYIPHIY 
GMVYIYTRICWCVCIYVHICIHTHTHIY 
IYIHTHTHIYMERERETVSCSVAQAGVQ 
WCDLGSLQPPNKRLKRFSCXSLPSSWDY 

* 

W 


13694 


27595 


A 


13860 


290 


52 


WYKYLPGTRGFLIWLNYLKCIGRIELLV 
GYFSFI YI*S I YLS I YLS IYLSSRS IAQP 
GVKRCNHSSLQPRTPGIiMHLSNS 


13695 


27596 


A 


13861 


108 


22 


IFFLFSAVKTRFSYAFPKEFPYRMNHVS 
I 


13696 


27597 


A 


13862 


167 


33 


VFTVPGTYMCVSVRDLGQAWWLTPVIPA 
LWEAEAGGSLEEFLRT 


13697 


27598 


A 


13863 


a An 

448 


355 


PLN 


13698 


27599 


A 


13864 


321 


445 


SQ S TQAWCLMPV I PALWETKS S ES FEPR 
S LRP A W ATGHNP I 


13699 


27600 


A 


13865 


380 


80 


MILSPSPKFPSFLFSLFSATiTiTiWHOCiEL 
PFLSNVGKGMQNDPVGLSPTSTPSDLSR 
IYLFFCFFVCLFVCFETGFPSVAQAGVQ 
WCDPLGLKWFLPPQSP 


13700 


27601 


A 


13866 


372 


264 


KMENPBDKNFFIFFMFFIVFLLFFFFFL 
FCLTQFF 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 

» 


SEQID 
NO: of 
peptide 
sequence 

• 


•M 
00 

■ 


SEQID 
NO: in 

09/515,1 

ZD 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acia 

residue of 

pepuae 

sequence 

i 


Predict- 
ed end 
nucle- 
otide 
location 
correspoo 
ding to 
last amino 
acia 

residue of 

pepaae 

sequence 


Amino acid sequence (A B Alanine 
C=Cysteine, D=Aspartic Acid, 
fc/=GIutamic Acid, r— rnenyiaianine, 
G=Glycine, H=Histidine, jNIsoleucine, 
k= Lysine, ir=JLeucine, m— ivietnionine, 
N=Asparagine, PHProline, 
Q=oiutamine, K^Argimne, a^aerme, 
T=Threonine, V=Valine, 
w— j ryptopnan, jr— lyrosine, . 
X= Unknown, *=£top codon,/=possibte 
nucieonae deletion, ^Fpossioie 
nucleotide insertion 


1J7U1 


27602 


A 




1 £.1 

101 


304 


SERTGITDGATEILPQLRL 


1 1*7 AO 


27603 


TV 


1 

ioooo 

■ 




34o 


vo JfJ-s/Niavr v^vjlxcv i rc v fur v» v vhisk.\joA. 
YNNGSTELHSK 


' 1 MAI 

13703 

* 


27604 


—A 




loo 


O^A 

260 


CY 


13704 


27605 


A 


13870 


140 


16 


LKYGQWRDLGSLQPPPPRFKQFSCLSLIi 
a SKNCTTMTKDiilS 


13705 


27606 


A 


13871 


143 


30 


K I LKWVS QAWWLI P VI PALWEAEVGGS 

T DTIBTODT 71 


13706 


27607 


A 


13872 

i 


158 


59 


HLHGCIHLLKLRLDAVSHACNSSTLGGQ 
GGRIA 


13707 


27608 

» 


A 


13873 


232 




APFFFKGAPIFFFFFFFFFFFIFFFLIR 
GKPGNDVGVEG 


13708 


27609 


A 


13874 


1619 


1375 


KTT WRRHLE KLQDMSLLRHR VHVDS SQG 
LFPFFQRQGLLPRIiKCNGDI I PYCNLKL 
LGWSNS PASAFRVARTTSLCRHTWL 


13709 


27610 


A 


13875 


219 


84 


NPLQSTST I CNRAWDRVTCLWSQLLRRL 
RWEDHLSPGVPSCSAL 


13710 


27611 

- 


A 


13876 


166 

- 


496 


KKKKKKKKKGGG P LKKT PGGAKI NRGRK 
KKI F FQ KGGQ KKT PCjJG I LKKKP FLGGGK 
KGPNPPKKI KGLREKKKFKRGKGAKPAQ j 
NPWGKKISPPGFFLKKFFPRGRGFFI 


13711 


27612 


A 


13877 . 


132 


348 

* 


PSKKKKRGGPFKRTPGGPKFNGGGKGNS 
FP LMGGG I KPNLG I LG KNP YLGGGTNGN 
NPPTEI KGFGEKKKF 

• 


13712 


27613 


A 


13878 


287 


134 


QQGPPISPGPKEENRMGPPSPGWGNPGI 
RSPPGAGPNPGAFQDGMFPSKKQ 


13713 


27614 


A 


13879 


333 


480 


VEPLRSABGPRPESNMLHIiTLKSKQQWL 
TP VI PAL WEAKAGGSQGQE I E 


13714 


27615 


A 


13880 


1 


227 


PKIHCKIQHIWLVSFFPSFFLSFFQGE 
SCSAAQAGVQWHDLGSLQAPPPGFKWFR 
FIELLGLGFHSRRMRGKDS 


13715 


27616 


A 


13881 


276 


378 


GWPWWLTPVI PALLEABVGGLLEPRS LR 
LALAT 


13716 


27617 


A 


13882 


126 


389 


GS VLGTGCX^GS VLGTVCRGS VLGTVL IA 
SATPDRYLRLSSGSSLTR I TNSDLVP WL 
TPVIPALWEAKAGVSIjEPKIWKPARKTW 
EDPH 


13717 


27618 


A 


t toot 

13883 

• 


414 


150 


AKUAFKWCjIMF firirvjJf JjTJirFIrlsJNr FJj 
GPQKKKIFFPTPPPPHFFFFKGPPPPFF 

LTE 


1 "2*71 O 


27619 


J\ 




i at 
197 


363 


I P VI P ILWKAEAGRS LKPRS LRPAWAT 


13719 1 


27620 


A 


1 lOQC 
IJOOJ 


101 


236 


IHTHTHTHTHTRTTHSL 




Z /Oxl 




1 JOO / 


1*7 
i / 


i An 


PIMRSGALGNFCL 


13721 


27622 


A 


13888 


135 


1 


VLHAGLKLIjGSSTSPTSASKGAGTVGIH 
YHTQLAFQFFLFCFLIR 


13722 


27623 


A 


13889 


411 


276 


TLYFKHGEMGFHHVGQAGLELLASSDPP 
ALEFETSLHNKVRPRXj 


13723 


27624 


A 


13890 


216 


54 


EFKLCYTLFDHSYFPLEELVSGQVWRLR 
PAIPSFWEAKEGGSLEPRSSKHEIV 


13724 


27625 


A 


13891 


3 


149 


SSDPRPSSSWLAVJVGLWSMVWRVPPFLL 
PILFLASHVQKSPLSPSPHF 
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SEQID 
NO: of 
nucleotide 

sequence 

• 


SEQID 
NO: of 

nentide 

sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 . 
26 

• 

* 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C= Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanfne, 
G=Glyrine, H=Histidine, I=lsoleuclDe, 
K=Lysine, 1/= Leucine, M=Meth ionise, . 
N=Asparagine, P=Proline, 
"Q=Glutamine, R=Arginine, S°Serine, 
T=Tbreonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


13725 


27626 


A 


13892 


166 


1 


LFSKI FWWLLPVI PALREAKVGELLEPG 
SSRPAWATWKN P VYTKNTKI SQAWQCT 


13726 


27627 


A . 


13893 


128 


2 


FSTTYPNP I KLPHPYLPLLTLFLDSAHL 
HPS E I NSLVAQ KKK 


13727 


27628 


A 


13894 


225 


391 


HLSIHILTFPFSLPEQWWLVPIIPVLWE 
VKMGGLLEPRSSRPAWATQQDPQVSRK 


13728 


i 27629 

■ 


A 

V 


13895 


192 


1 


TLVCVSSSVPKNYNNIGLACWFMPIIPA 
LWEVESGSLLELKSSPPAWPTWWVPLKS 
KNTKISLA 






A 


13896 


152 


245 


DQS IAI)VLMYYTHTHTHTUTirrHTLQNQ 
VPQ 


13730 


27631 


A 


13897 


270 


406 


ISTRFFLYRDRASLCHPGWSAGVQSWLP 
GSSNSQAQAILLPQPPE 




77/W^ 


A 


1J070 




73 


EAEGPRDVTSSTFHWRLYDQTTNYHECR 
MWANSRLRLTPEGUjKVITPWCHAH 






A 




107 


7 


LGOAOWLMP 1 1 PALWEAKAGGSLEPRTS 
RPAWA 




77A14 




I J7W 


67 


774 


PTRP I LAHCNLCVLGS DDPPASASOSAG 
VTGMSHCAWSNLI IHLPSRSTAICTL 


13734 


27635 


A 


13901 


450 


194 

• 


KRQVKSHSE VKTTHVQ WPG I DPGS PAWE 
ARILPLNHQRLSGTTPWRITBKSNHKDL 
ETVLSGFFKCRLKANKDIQTKCVPIGNF 

Y. 


13735 


27636 , 


A 


13902 


300 


404 


AAPGRAPQKKKKXKKKKKKKKKKKKKAP 
FIKRGR 


13736 


27637 


A 


13903 


131 


29 


ALGS LQPLS PRFKRFS CLS LPS S WNSTA 
LTVTTM 


13737 


27638 


A 


13904 


267 


74 


VWVLVPPFTSYGVLDKPILIVNNIYVHV 
CVCLCMCVCVCVCVCCTVIRRHCWHNMI 
IIISQHDQ 


13738 


27639 


A 


13905 


193 


64 


IRSHQIffiCI^FKKFRiliGWTWWL^pVlP 
APWEAEEGRS PEPSS 


13739 


27640 


A 


13906 


309 


424 


QS TDFLENRRVGWAWWLTPVI PADWEAE 
VGGLPEIKNS 


1174fl 


77<*4l 


A 


11007 

lJ7v / 


404 




SCLISLRSWDYRHVPPQPANFVFFFFEM 
ESCSVAQSGVQWQEENSISKINK j 






A 

* * 


i jyxjo 


471 


59 


GPPQAKKGKGFPTPGP PQKKGGGQPKSG 
VLGFSPPPLGPCPGLGPPKGWGFWGGSF 
FPFFFFRKWEHKNELFPFSKTEKPRRAG 
EARKGSQSTKYAGLSRGERQASPTLRPG 
HLAKASAGGWLQYTRWQGKLPEPGN 


13742 


27643 


A 


13909 


479 


231 


QEYCMLIFCEHFGDFDLNIVFCLSIO^V" 
GVIADF YLLS S FTGRPQWLMPVIPTLWE 
AEVGGSLEAKSFTPAWATWQDPISMF 


13743 


27644 


A 


13910 


167 

• VI 


19 


CI VLGIMLDSLWE PRQWVPSWVRWFTPV 
IPALWEAEAGGSPEVRSSNSS 


13744 


27645 


A 


13911 


328 


446 


GSFSKITDLSQTWWYVPWLALQESEVG 
RSLEPRSSRLQ 


13745 


27646 


A 


13912 


145 


460 


NAFMSTGCEVLSYSELRQRNLVWWFLPV 
IPATPQAEAGRLLGPRSSRPPWATKGVR 
ALSHKQTNTKTNKLSHSTGVSFVFTHHI 
HSPGILWRLSQWLLAMNLATN 


13746 


27647 


A 


13913 


319 


210 


KLGFFKGPPLFFFFFFFFFYFFFFFGFL 
TVSCNFTL 


13747 


27648 


A 


13914 


372 


10 


PNTTCTPITYGKKCCPREISYYLCHEFI 
RIRILTFFDGESCYIAQAQLKLLGSR3P 
PTSASGVAGNTDVCHHTQPILFYILETG 



* 
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SEQID 
NO: of 
nucleotide 
sequence 

• 

• 

- 


SEQID 
NO: of 
peptide 
sequence 

• 


M 


SEQID 
NO: in 

09/515,1 
26 


Predicted 
beginning 

Q UCIcOllUc 

location 

f*n i*t*AQ n a n H 

ing to first 

■•aminn arid 

a III 1 LI v aUU 

residue of 
peptide 

1 


Predict- 
ed end 

IIUCIC - 

otide 

lUvaUUU 

correspon 

last amino 
acid 

rcaiuuc ui 

peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartfc Acid, 

R=fHiitflmir Arid P=Phpnvlfl1nninf* 

G=Glycine, H=*Histidine, I=Isoieucine, 
K=l . vs i n e. I ^Leucine. M=lVfethionine- 

N»Asparagine, PHProline, 

0=C» lutafninc Pv— Arpinine S=Scrine. 

•^Threonine, V=VaIIne, 
W=Tryptophan, Y=Tyrosine, 

X=l JnUnown *=Stftn codnn /=nr»ssible ' 

nucleotide deletion, \=possible 

niirlpotiHp tn<;prtinn 

IIUVlVVIIUv IJI^WI nun 














LALLPRLILNSWSQAILLPPQPLRSYFN 
FWLFFCLYN ' 


13748 


27649 


A 


13915 


594 


471 


PSYIAYMCICMCVCVYVYINICVCIYTH 
VHTPYNPVISKN 


13749 

* 


27650 


A 


13916 


235 


3 


EANNLIRKTYTSGWFQRTSKRPPAVRRM 
PPGLNRSSHLSFPSSSDYRWA 


13750 ■ 


27651 


A 


13917 


272 


184 

* 


KTTAWAQWLTPVISTLWEAKVGGSLEAG . 


13751 

• 

• 


27652 

• 


A 


13918 


46 


321 


SRVQGTGGHRGRG C I QAG VGEQEAKRE T 

PrtPPWPttTr.TPVT QTfT.WPAK'AOOQr.FPVfl 

SRRATWNLLKRIYYTDEREYLIPLAIiNT 

if or 


13732 


I /o53 




i3yiy 


35/ 


AHA 


FTP t TOWaRWLMPVIPTLWEREAGGSPE 
VRSS 


13753 


27554 




I3yzu 


142 


321 


FT.PCITWPWT.^f'WflT.PQNVWOQT^TF^YT. 

KVRL YGQAWWFT P I ILALWEDKAOGSLD 
PRSL 


i inc. a 

13754 


27655 


TV 

A 




OTA 


All 


T pnprv-T VQ TT.HTKTTiPPnn AR WT.TPVT P 

JLXZiLIXViVJ. X OXJjni O X XJI2u\v3\/AEv?V XJ X *r V JL F 

AFWEVKAGGS FEPRSLRPV 


13755 


27656 


A 


13922 


304 


396 


NFGWAQWLTPVIPALjHESEAGGSLEPRG 

i-LExO 


13756 


27657 


A 


13923 


305 


387 


VVWLTPVIPALWEAEAGGSPEPRSSRP 


13757 


27658 


A 


13924 


103 


3 


NKS PE LGWAWWLAP V I PAL WEAEAGGS P 
EVRSS 


13758 


27659 


A 


13925 


195 


400 


CLLI LSGLVAS C I I ERHGKGQAWWLMPV 
x, f 1 xjnr>/\l\VuV7oxjJ2ixxluNljKoi\W<aX wxtrio 
IS I KYTEQLAQG 


13759 


27660 


A 


13926 


294 


489 


KWAKMQl^HKGRYACVCICVCVCVCVYT 
ILSRMWSNQ 


13760 


27661 


A 


13927 


363 


2 

- 


NWGPPGFPPPPFLKTGPVFPIFGAPKKK 

PPT.CTPP ZiT.WPVT «T.KT3f3PT.PPPPPPT.WV 

FTjVKTGSHCVAQPDLELLSSSNPVLVSQ 
nao T TOM Q POVOPIiDIiPPPPS PPKTGSR 

S YAW ADA W 


1 1*1£.\ 

!37ol 






137ZO 


I 


215 


LR PKKR KTNKP KMWS VHTME YHSALKRK 
EIRTQO^WimEDIVLSEIRVAVTKGR 
ILYDS TOMRNLEESNL 


13762 


27663 


A 


13929 


219 


408 


TFCHFIAFLHVYKMFVVFFFSFIMFFFP 
LNCNKAFERKIUCKNTCK^ S 
LSPPAPP 










161 




WHP P ASASHVAGTTGACKHERI^QFJ)HF 
SlxRVQGCSEPl^aiHTPP 


13764 


27665 


A 


13931 


116 


\ 


INTGWARWLTPVIPAFWEAAVGGSLEPR 
STS OAWATWK 


13765 


27666 


A 


13932 


181 


38 


PWPQRATQPKISPLPPAISAIiWFAEAJGG ; 
ALE PRSSRPAWAKSDAWVD 


13766 


27667 


A 


13933 


267 


397 


TVFDRTLFFIKKI IHSWAPWLTPWPAL 
WEAEVCGSPEVRSSG 


13767 


27668 


A 


13934 


129 


12 


NIGVGWALWLTSVI PALWEAEAGGPPKV 
RSSRTARPMS 


13768 


27669 


A 


13935 


179 


1 


KT YYGG I PPPGPPKKGE KKP PFFFKKFF 
LFFISFFIFPFFFFLRQSIAKMQSASVW 
KVS 


13769 


27670 


A 


13936 


245 


616 


DD1<KKKEAAQKKATEQKIKVPEQIKPSV 
S Q PQ PANSIWGTSTATSTNNNAKRATAN 
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SEQID ' 
NO; of 

nil pijm^Bfi A 

nUCIcOUGc 

sequence 1 

'i 

• 


SEQ ID 
NO; of 
peptide 
sequence 

— 


M 
eth 

OO 


SEQ DO 
NO: in 

09/515,1 . 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide • 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystefne, D=Aspartic Acid, 
L=L»iutamic Acid, F= Phenylalanine, 
OGIycine, H=Histidine, I=Isoleacine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, Pyrrol ine, 
Q=Glutamine, R=Arginlne, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptopn a n, Y= Tyrosine, ♦ 
X=Unknown, *=Stop codoo, /=possible 
n ucleo tide deletion , V=possible 
nucleotide insertion 














NQQPQQQQQQQQPQQQQPQQQPQPQPQQ 
QQPQQQPQALPRYPREVPPRFRHQEHKQ 
LLKRGQHFPVEF 


13770 


27671 


A 


13937 


121 


292 


NTDI.DLAIALIVRVLTSDDNFKNWGWVQ 
WLMPVFPAPWEAEVGGSSEATNLRPAWA 
T 


13771 


, 27672 


A 


13938 


421 


202 


GPREAPIiFPSTQGI SRPLFFFFFFETES 
RTVTRAAAQWHNLGSPQPPLCSLDILKT 
SQSVCMPEAAILVSQIT 


13772 

- 


27673 


A 

- 


13939 

■ 


205 


543 

■ 


I LE, SHRRLGWS EALP ENQLLCRAS S WES 
LGPPDPPCLKPPS PAFWGAAGGWRAKST 
SVSEPVPVGTLLVALFLEMAVGLLLGLP 
FCFHS LGAPHLLCEPDG PS VLPABARQS 
R 


13773 


27674 


A 


13940 


140 


2 


LSQHSEEYTLPP^L^ElteSSR^PPQH 
TPHPPPPWPWHPDIADIQ 


13774 


27675 


A 


13941 


47 


303 


GLIFLIIWTCIVKTSTDFPRMEDCSQCI 
HQVTEESNKRMGFLSYYANPHHGSSRLL 
W PQHAAP WDDGRRGKP VFS LGFVSFPFP 
Q 


13775 


27676 


A 


13942 


1488 


1719 


PJjVS FS KNTYPCLGNVKQtlQKTT5 t CSRW 
KlaHVLSSNLSSPAEVTVVASINLVVSEQ 
SFQNSQLPAMLCLIVHLQMS 


13776 


27677 


A 


13943 


127 


402 


Q P VTHP PNRTMCSS MI FL FLYLCFIiSLS 
PFPSHLSLYAPPKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKAGGGGPFK 
KKIHFPPGGGRI IFFY 


13777 

■ • 


27678 

- 


A 


13944 


852 


209 


EQTVYCTRlDLOQKtVFLRl^Ai?CLLSQ 
GDPGS PMMCQLQQFDLWVLRGVLNFGGE 
TCPGLFLYTKVEDYSKWITSKAERAGPP 
LSSLHHWEKLI SFSHHGPNATMTQKTYS 
DSELGHVGSYLQGQRRTITHSRLGNSSR 
DSLDVREKDVKESGRS P EAS VQPLY YD Y 
YGGE VGEGRI FAGQNRL YQ P E E I ILGSL 
RACFLFAAVSS PGATPPN 


13778 


27679 


A 


13945 


68 


260 


INIFWHCVLLYVILYAISQFLLFPCVKN 
IASWLGSVAHACNPSTLGGQAGESLEPR 
NLRPAWAT 


13779 


27680 


A 


13946 


420 


156 


FLP PGVKYGS FKRAPP FFFFFFFFLVET 
LSCYIAHTGLELPDSSNYPTSASQSAVI 
TGMS HHTWP LEGACLAI PQALR I LLVLV 
PLH 


J 3780 


27681 


A 

* 


13947 


317 


17 


AQHGGS TCNPSQPI VTAVP INLWGRDLL 
QQWGAQAF I PEQLYS PQSQHTMQEMGYV 
PGMGLEKNLKGLKKPLQAEGQNSHQGLG 
YNFSWQPLLSLQNLYL 


13781 


27682 


A 


13948 


55 


224 


NFRLTSS FS YLKMS FWLGMWLTPVIPAL 
WEAKTGGLFEARS LRLVWWQS KKAGSHL 




/ /Do J 


n 


I WW 


1464 


one 

805 


KAi c> V KVjAoKbKo v~l?AAW5 P Ao 1 G P FLK 
RSVLPNAHFRSSREGGMAASTDMAGLEE 
S FRKFAIHGDPKASGQEMNGKNWAKLCK 
DCKVADGKS VTGTD VD I VFS KVKGKSAR 
VINYEBFKKALEELATKRFKGKSKEEAF 
DAI CQLVAG KEP ANVG VTKAKTGGAVDR 
LTDTSRYTGSHKERFDESGKGKGIAGRQ . 
D I LDDSG YVS AYKNAGT YD&KVKK 


13783 


27684 


A 


13950 


356 

* 


98 


NAGPGI YF WGP I KKI LPCPPAGVKLGSL 
KRAPLFFF FVEMG FHHVAQAGLKLLS SR 



1384 . 



* 



WO 01/64835 PCTYUS01/04927 



SEQID 
NO: of 
nucleoli at: 
sequence 

■ 

■ 

■ 
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SEQID 
NO: of 
pepuue 
sequence 
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M 

nrl 
uu 

• 


SEQID 
NO: in 

USSN 

09/515,1 
26 


Predicted 
beginning 

UUCIcUUUc 

location 

f nri*pcnnn<l 

VVI I OUvUU 

ing to first 

ant inn tinirf 

residue of 

■ III M V 1/1 

peptide 
seauence 

) 


Predict- 
ed end 
nucle- 
otide 

ivvauvii 

correspon 

HSncr fn 

last amino 
acid 

residue of 

peptide 

seauence 


Amino acid sequence (A B Alanine 
C=Cystelne, D=Aspartic Acid, 

P=f^ hi m if Artri F*=PhPnvlnlfininf* 

G=Glycine, H=Histidine, l=lsoleucf ne, 
K=Lvsine. I^=f murine. M=Methionine. 

N=Asparagine, PHProIine, 
0=("«liitamin*> R=Arpinine. S==Serine 

T=Threonine. V=Valine, 
W=Tryptophan, Y=Tyrosine, . 
X— Unknown. *=StoD codon. /=oossible 
nucleotide deletion, ^possible 
nucleotide insertion 

■ • w »^ V V IlltfVl II VII 














DSPIPVSQSIRITSTCEPPCIiAKIKSSL 
RL 


13784 

* 


27685 


A 


13951 


194 


48 


NFLEGWPGQRLTPVI PALWKAEAGGSLE 
PKTFRPVWAMWRDRLRKPQH 


13785 


27686 


A 


13952 


341 


418 


DQPGQHGETPS ILKIQKIAGRGSTCL 


i37oo 


27687 


a 
A 


J3953 


i JO . 


52.5 


LPLLPLKKKKKKKKKKKKKKKKKKI KK 


13787 


27688 


A 


: i3so4 


231 


ICC 


EYYSAVKGMKQLI 


13788 


27689 


TV 


13955 

i 


493 


39 


rAAr obWbWA^Uo ouuW\.JL 1 rAon fir UJj 

LPPAPVSSMAAPDLDSFPLPLCWTFLRV 
LCRVFSSSTLAFIiSCVSTSFGALTPAAS 
SNIFSISLPHWASPPLPTARLTSAVSPE 
PLPPLPSRLQRQPRGAPSHPGMSMGPIL 


13789 


27690 


A 


13956 


497 


372 


NTKISWASWYAPVIPATWBVSNSRPQAI 
HSHQPPKRIGLQA 


13790 


27691 


A 

* 


13957 


438 


301 


GTXx Jmjx UTtI uUi Vb Vt-iA.1. 1 IU IKtJUo 
YIYTYVYTHTHTHTHMYI 


13791 


27692 


A 


13958 


84 


466 


QPLGRSGKVPQLHLWEMQRLPAAFFRSF 
AAQGLGESVHINTPT^LSLRGRPFISSS 
GPGARARVTCAPCLHAARILCCFWRGLH 
RLKCSNVI S AHCNLRFSGS S DS PAS ASQ 
VAGITGRSSLBQLLE 


13792 


27693 


A 


13959 


216 


28 


KYVKNYVLSS 1 1 FWMQE I FHNFKKS S RT 

U W Li 1 P V 1 fAb W fciAKAbCjo lr £• FXis aKJrAb 

SIQLENS 


13793 

• 

■ 


27694 


A 


13960 

■ 


320 

1 

* 


943 


VLS FS LS DRAQRCNCRMGKQNS KLAPEV 

GRLNLEE FQQL YVKFFPYGDASKFAQHA 
FRTFDKNGDGTIDFREFICALS ITSRGS 

T?Cf"WT .TvTU7rVTITVnVrVT~lT TVTY^TrTTPTrBPMT.IPT 

r ny n at in w 1 Ui-iiAjLAjrwJ. x x v d*Uj£*i. 
I EAI Y KMVGTVI MMKMNEOGLTP EQRVD 

IVLLLQCDIQK 


13794 


27695 


A 


13961 


191 


8 


GLFRFRRLSEBVKHCCCCCCCCCCCCCC 
CCCCCCCCrcCYLRIAPQSPAAPAPELH 
TPIiPGSRRNR 


i 




7A 
A 




oo'a 
Jsjo 


Z5 


v mjiiynr \^,c vrvr niuvvrvr r r xjim j\ 1 i i try 

GGFFFFFFFFFLVF I DNS WVFLGEGDLA 
GS 


1 J /JrO 


77£G7 


A 




1ft 
IO 


171 
1/1 


GIRHEERERERERERERERERERERERH 
PARE IDMCVSKRDTRGALCALF 


1-3707 


77#\OR 
Z fQyo 


A 




9S 


7QQ 


HDFCTRRERERERERERERERERERBRE 
RER ERERERERERERE RERERERSRERE 
RERE RE RERERE RERE THTLS L Y I YMCV 
FLYIPLYFSFLYIHTQREAPPPIiSATLS 
LSIL 


13798 


27699 


A 


13965 


246 


410 


AYNGQGS RGVTP VI PS LWEAKTGGSIjE P 
RSWRSAWATWHTPPVPGPSKEISLYI 


13799 


27700 


A 


13966 


91 


26 


FCCCGCCCCCCFYCKKEKTLV 


13800 


27701 


A 


13967 


158 


24 


PFCHIYIYFLESCFVAMLECSGKILAHC 
YLRLLGSSNSLVPSLA 


13801 


27702 


A 


13968 


129 


7 


S WAQWLMP I XPTI1WEHEAGRSI1EAMSSR 
PAWATQQDPVSKK 


13802 


27703 


A 


139^ 


445 


268 


KIQLTKLKNASESLPSRINQAEEKISGW 
AWCLMPVI PVLWEAKAGGSLEPRS LRRG 
QVT 
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SEQID ' 
NO: of 

ft ± » _» 

nucleotide 
sequence 1 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 

- 


SEQID 
NO: in 
USSN . 
09/515,1 
26 

1 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide . 
sequence 


Predict- 
ed end 

to 

nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine j 
OCysteine, D»Aspartic Acid, 
E=Glutaroic Acid, ^Phenylalanine, 
G=Glydne, H-Histidine, I=Isoleudne, 
K-Lysine, ^Leucine, M=Methionine, . 
N=Asparagine, P=Prolirie, 
Q=Glutamine,-K : =Arginine, S=Senne, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *«Stop codon,/=posslble 
nucleotide deletion, ^possible 
nucleotide insertion 


13803 


27704 


A 


13970 


1628 


1309 


HRRKKCAGYSSLSSALLRERERERBRER 
ERERESVCVCVLGTITCFRRQYCVIQGI 
SGVRQNCVQI PALSDPDLPLFLSLSVLH 
SNTWWRPTLRVTERNKGAMCV 


13804 


27705 


A 


13971 


314 


140 


QCHNCYILTQLVKI KGDQAW WLMP 1 1 PA 
LGAKAGG SLE AGS LRSAWATWRAPMSTK 
IF 


13805 


27706 


A 


13972 


292 


482 


YKS VRTQDS CAHGVDRIjVGKI RYVNLFV 
YI CQHSDGRTWWLTPVI P PLWETKGGGS 
SBVGRSG 


13806 

■ 


27707 

■ 


A 


13973 


502 


303 


STAIGPSFSPFLCTHGSFHGFLFFFFCF 
CFFLVEMGSHYVAPAGLXLI/3S SOP PAS 
ASPSHLGLQV 


13807 


27708 


A 


13974 


493 


289 


IPPTALGTSFSPFLCTHGSPHGFLFFFF 
CFCFFLVEMGSHYVAPAGLKLLGSSDPP 
AS AS PSHLGLQV 


13808 


27709 


A 


13975 


405 


121 


IREAAQSESLFVRAAPHTGLISGPCITE 
IYAVKDTRCMQRRILVRELECAAHGLTE 
CLSIDTHTHTHTHTHTHTHTHTHTHTHT 
HGTRWEAGQSLKGES 


13809 


27710 


A 


13976 


135 


254 


TVTCTNSSWSLTFTSFLFLMPGWIQKGR 
RVCVCVCVCVC 


13810 


27711 


A 


13977 


179 


58 


VPLS INSPQRQCI FCCCCCCCCCFLVLQ 
KIISERSWAEIG 


13811 


27712 


A 


13978 


97 


186 


KI IFWLGWAHACNPSTLGGQGGRIMRG 
RD 


13812 


27713 


A 


13979 


289 


420 


ELGIiKIHGFPSQAPWFMPI IPALWEAEA 
GGLLESRSSRIAWAMW 


13813 


27714 

■ 


A 


13980 


421 

■ 


106 


SGGRNFFFFLGGGF YKKKFPP P PFLRGK 
KKKPPLPKKKNPTPIFFPPGGGPPFSPP 
PKKGGGKIFFFFGGKNFIMQNFGGFPPL 
WGKKKKFLPKKKNPLFFFFFF . 


13814 


27715 


A 


13981 


309 


135 


KHIYI YLSI YLS I YLS I YLSI YLPAYLP 
TYLHAHPL I HANAYKN I HI D Y P FKKG FE 
Y 


13815 


27716 


A 


13982 


231 


21 


QHMYCLKS I WHKKLSRKPYCYI ISBYSS 
MVKLRVS ISQ V QWLiMPV I PALWEAKAGG 
LLKARSSRPMWAI 


13816 


27717 


A 


13983 


209 


359 


HLHTHTHTHTPNGVKQ I MTQ I P I LPLTG 
PLILEKFLNFSKPIjSHPGKRDP 


13817 


27718 


A 


13984 


391 


249 


IWPILPPQKIFFFFFETESCSVTQARVQ 
WCNIjGSLQPPNPVFPSLAK 


13818 


27719 


A 


13985 


181 


403 


YLVLDSPESTLGILSQPKDYITKDQTLT 
FRPVWAHACNPSTLGSQGKWITSVQEV 

/ <lL|l|l1\T)n/^T VAWVW 

ETSLCNTARPCJjYQKYKN 


13819 


27720 


A 


13986 


441 


358 


ETGSRCVILAGAQWRDLSSVQPLPPRFK 


13820 


27721 


A 


13987 


7 

■ 


233 


ASIPCI YQKTKI PNTKQTKNKNTKITPA . 
WWCTP I APATQKAE I P PLHS S LGD FLRL 


13821 


27722 


A 


13988 


152 


44 


PKAPLFFFFFFFFFFYETRSHSVAQAKV 
QRHDHGSL 


13822 


27723 


A 


13989 


188 


379 


NVSCHAAETRVWWEKERPLPVPFGAYLL 
DGMPLVWLLSASSILSIWLSIYLSIYLS 
IYLSIYF 


13823 


27724 


A 


13991 


296 


53 


SS ILFHASKYHFSGRLFSLLSCKRRNIL 
QLVPHMYSPYLYMYI HTHKHI YE YMYVH 
IYIYMNLRNCRIYLHSCTPKENQAR 


13824 


27725 


A 


13992 


37 


290 


RLG S PAS RHQQIQCLI RACliKDGTLG WV 
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SEQID 
NO: of 

IJ U VrIC U II U C 

sequence 

• 

- 


SEQID 
NO: of 

sequence 

■ 


M 
nrl 


SEQ ID 
NO: in 
USSN 

09/515,1 
26 


Predicted 
beginning 

niif l(*Atirfp 

11 uucuuuc 

location 

VVI I vapvuu 

ing to first 
amino acid 

1 vOIUUC VI 

peptide 

1 • 


Predict- 
ed end 

nn r»l a_ 

otide 

location 

IVtrH LIU II 

correspon 
-ding to 

Inct nmmn 

add 

residue 4\f 

peptide 
sea ue nee 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=frlutflmic Acid F^Phenvlalanine. 

G=GJycine, H=Histidine, I-Isoleucine, 
K=LfVsine. L=L,eucine. M=Methioninc 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threoninc V= Valine. 

M M. Ill will U T T fillllv) 

W=Tryptophan, Y^Tyrosine, 

Xe=Un known. *=Ston cndon. /=nossihle * 
nucleotide deletion, Impossible 
nucleotide insertion 














WWLI PVI PALWEAKVGGSLB PRSS TPAW 
ATQGDLILKKDGTLLQCPHVAKGHSSHS 






A 

*• 


1399"* 




370 


SRNAIFKKRWESSCOSSFNSLGVFPQAA 
TKGWVRWLTLVIPALWEAKAGGSLEPRS 
SRPA 


13R7£ 
1 jOZO 


77777 


A 

** 




17Q 


7S 


SKHNKKNIOICLLCSKRSMPVHVCIHIH 
ICVYVYRYAYIYIGMCIHI YIHS 


13827 


27728 

• 


A 


13995 


159 


2 


EDPLS PGVQDQHGHHRKSLSGRGGKISL . 
GOE FKS SMAAM YFGRSGGCKLERR 


13828 


27729 


A 


13996 

i 


307 


99 


FIWEHFVNYTMPFTYKKPTSQAVfWLMPV. 
VPALWETEAGGS PE VRSPRP PGQHDDVR 
PLGPSLHVYLQMD 


13829 


27730 


A 


13997 


*> 1 o 


Zl 


SPLASSILDLIPPPFPIDEPVDLAFQSV 
TLGVQDTTS 


i loin 


27731 




U99o 


00*7 

Z27 




PVCVTT.T TJ^T.TnT TOP TaTJfJTP Vm?PT7P 

ERERERERERVCS CLMHELASHLLFFFL 
FRE 


13831 


27732 

- 


A 


13999 


400 


668 


P VGRRWELC3GMGIiGVRTWQS S LPCQGPT 
ooovLoLmKu 1 ruioM i v^r JrKo/iliSlAa 
LPAQVP FPNVQS QDQDLS FS LGCFPRTY 
vrprjTT. 


13832 


27733 


A 


14000 


225 


414 


NRVLLCHTGWST VAI LAYCS FKIiLGSRD 
KGGRRL 




Z / /^)**> 








lis 

1 15 


T .F*5r»PFFFD FCS R PFOR PTVP FCliT iT iYNO 

MQPYLLRSAFVDSYYMYDSFLRBRVSFC 
OS GWSTVAPSRLTAALNSIiAOVI LLPOP 
LNVLCGISR 


13834 


27735 


A 


14002 


3 


173 


YVFYRQKQMVCFYKI R I KNMFNRNRLNL 
GQWCVPIVPATQVAEAGGSPEPTSLRPI 


13835 


27736 . 


A 


14003 


440 


290 


YTHTL Y I Y I Y IHTHTHTHTY I YLCTYVJG 
QFDLLVLDNKL YKE I Y YLS VF I 




1 Jo^o 


77*71*7 
Li IS I 






i, 


11/: 
1 10 


AP T.VAMP F1CVRF T KDFLLTARR'KTIAFCSV 
/inxivxu'iirr iViciciAi/r i n )aiuuu\ i i\*t> v 

KIKKNKSAAV 


13837 


27738 


A 


14005 


279 


422 


SAKAPLSCLRTNSLLLKNGLRGWAWWIT 
PVIPTLWEAEVGGSPDVTSP 


13833 


27739 


TV 


14-UUo 

w 


3 




AAAQPCTPTMDVFKKGFS IAKEGWGAV 
EKTKQGVTFAAEKTKEGVMYVGAKTKEN 
WOSVTS VAR KTKEOANAVSEAWSS VN 
WATKTVEEAENIAVTSGVVRKEDIiRPS 
APQQEGEASKEKEEVAEEAQSGGD 










1 
1 


707 


SLSIYMSVRVCIYMYMCVCVCIRIYTYI 
HTHIHIRVYIYTIPYIWGIYIYIYVDLV 
IYMGWCGCPPPSI 


13840 


27741 


A 


14008 


57 


250 


KI FFFLMNHT WGG VP VVLATWEAEVGGS 
LEPRSLGLQETMMTSLLSRLPRQQGEIL 
SLILKRNY 


13841 


27742 


A 


14009 


184 


2 


AVPGRWPCQQVPGLLPS IQDTLKQGRTG 
QWLTPVTPMPWEAEAGGSLDARSLRPAW 
ATGK 


13842 


27743 


A 


14010 


123 


3 


KEPSRPGRWLTSI IPACWEAEAGGPPEA 
RSSTRE FRTAK 


13843 


27744 


A 


140U 


403 


168 

• 


GTGLRASRCISQPVMSGLLSPSEFPFPE 
I ESHLHVSQRETAARIiRDRERERARERE 
RERERGKNASSGKPPGSASCVP 
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SEQJDs 
NO: of 
nucleotide 
sequence , 



13844 



13845 



13846 
13847" 



SEQID 
NO: of 
peptide 
sequence 



M 
eth 

od 



SEQID 
NO: in 
USSN ' 
09/515,1 
26 



27745 



27746 



A I 14012 



A I 14013 



27747 



27748 



13848 



13849 



13850 



13851 



13852 
T385T 



27749 



27750 



27751 



27752 



27753 



13854 



13855 



27754 



27755 



14014 



14015 



14016 



Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 



207 



200 



319 



63 



14017 



14018 



14019 



14020 



416 



196 



163 



14021 



203 



179 



27756 



13856 
"13857 
T3858 



13859 



13860 



27757 
27758 
27759 



14022 



14023 



A 
A" 

X 



14024 
74025 
14026 



27760 



27761 



13861 



27762 



A 14027 



A 14028 



A 14029 



358 



98 



254 
740 
579 



30 



46 



Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 



422 



405 



176 



401 



63 



425 



233 



83 



431 



1746 



Amino acid sequence (A a Alanine 
C-Cysteine, D=Aspartic Add, 
E=Glutaralc Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methiooine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
TVThreonine, V=Vaiine, 
W=Tryptopban, Y=0>rosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 

EIRRGSLAGPLSLNFILSVMQSHSKYLR 
KVIGRAQWLMPWPALWEAEVGKLHELR 
RLGHTWAAW ' 

WLCVLGWSLLALS PRt^CtdP 1 SAHCNL 

HLLRSSDSPASASOALQBAWYQHLLLMM 
PQAAS TQVGRKK 

KPSTITFLFIiHLFISSRESRPVTQTBVK" 

WLNHSS PQPPTPRFSL PSS 

E Q ENKNTLNS P GWDG VH I HVtk FLNTRM"" 

PLGHLSFSEGVSGSGSSGRRCQAGAGGR 

SATAAARPCQCLGLKGLPPAVPSCGIJEG 

KAP E RAG YS AS HP P HG CBTGNGWVL YVN 



VSKVYPKINHTQRTVRHETFKIAMPKYY 

SNLCSILRFNKDRLAFMRHYTKQCSRTS 

ITEYIAMFFFVCLFVLKIGAHCHPGWNE 

WLVCSQVTAASTSRAQAILPPQPPKIL 
GMQQA 



SLMFRPPLFSFCCFCIRYPFIFVFFSLL' 
CFAFSWCSLLLFFFHYLKLFIiLLFLLFF 
LPVLLCLDS FLVHDFFLLSS 



SS RMGRGTKT I afdmrws ce vdrarhrd 

REREREREREREREKGAEPQEFPVIS PW 
VDRPCIiAGILHLKCGC GHRL 
AERWHDQICVERE I WGWVQ WIjMPVI P Vtf 
WEAKAKNtiEPKNLKLQRNRRPGNSR 



LKKKKNKSQAQWLTSVVLASQEAEAGGS 
LEPRNSRPAWAT 

VTWQRELRLQMELRLLI CRAWWfTP VI P 

VPWEAKAGGSPEPRSSRPAWETRNRRPG 
NS 



WHKKKNQNWLGTGAHACNPSTLGG 

GRPAGWLTPVAPTLWEAEMGGSLEPRSL" 
GHE ■ 

RENWAEPSSHSLLNI FLQLGTVARACNP 
STLGGRGGWITRSGD 

KGVLDGPRGLCVFCFCFFFLCVCVCVCV 
CVCVWRRSLTL SPRIiQC I 

QREWVGWAGKEGEGWVSHVPASAQLPAN 
GQRGQPHPS PLGGTGS TWAQEGAYCCLS 

SCSHCCCCSSCCCCCPPGFCLFPSIiGAI 
CHLYHL 



663 ! LR IRALRELPASHI PGS LT ICCVPRPPI7 

PCS STKPDAKGYKPLAQQKLGSQRTGPC 
LGHAPCYS PL WEtiRGHGCS P YGP S PSGS 

VSLWQEAMRLPKNTPEEKDRRTAALQEG 
LRRAVSVPLTLAETVASLWPALQBLARC 
GNLACRS DLQ VAAKALEMGVFGAYFNVL 
INLRD ITDEAFKDQIHHRVSSLLQEAKT 

QAALVLDCXtBTRQE 

267 ~ | AVGVHHAFHLPHCFFASLLESPVSPRUT 
MDPNCS CAAGVSCTCAGS CKCKECKCTS 

CKKSCCSCCPVGCSKCAQGCVCKGPJSEK 
CSCCD 



PAAGAATMEFRQEE FRKLAGRALGKLHR - 
IiLEKRQEGAETLELSADGRPVTTQTRDP • 
PWDCTCFGLPRRYI XAIMSGLGFCISF 
GIRCNI^VAIVSMVNNSTTHRGGHVVVQ 
KAQFS WD PETVGL IHGS FFWGYIVTQI P 



1388 



WO 01/64835 PCT/US01/04927 



SEQH) 
NO: of 
nucleotide 
sequence 


SEQ1D 
NO: of 
nentide 
sequence 


M 

eth 

od 


SEQH) 
NO: in 
USSN 
09/515,1 
26 


Predicted 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 

u ***** w 

otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine,D=AsparticAcid t . 
E=Clutamic Acid* ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, ^Methionine, 
N=Asparagine, P^ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine t 
W«Tryptophan, Y=Tyroslne, . 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


■ 












GGPICQKFAANRVFGFAIVATSTLNMLI 
PSAARVHYGCVI FVRILQGLVEGVT YPA 
CHG I WS KWAP PLERS RLATTAFCGS YAG 
AWAMPIiAGVLVQ YSGWS S VFYVYGS FG 
I FWYIiFWLLVS YESPALHPS I SEEERKY 
IEDAIGESAKLMNPLTKFSTPWRRFFTS 

EEVFGFE I S KVGLVS ALPHLVMTI I VP I 
GGQIADFT^SRRIMSTTNVRKLMNCGGF 

FSGFAISGFNVNHLDIAPRYASILMGIS 
NGVGTLSGMVCP 1 1 VGAMTKHKTREE WO 
YVFLI ASLVHYGGVI FYGVFASGEKQPW 
AEPEEMSEEKCGFVGHDQLAGSDDSEME 
DEAE P PGAPPAP P PS YGATHSTFQP PRP 
PPPVRDY 


13862 


27763 


A 


14030 


240 


63 


VFTAQSNGGVELNRFNEGPLSAVLKGRV 
AWAQWLTPVI PTLWEVKAGGLLEARSLS 

PA • 


13863 


27764 


A 


14031 


289 


414 


HAEMGRVQWLTPAI PALCKAEAGGP pep 
RSLRPAWATY 


13864 

■ 


27765 


A 


14032 


386 


1765 


LGDARAPEKMS AX QAAWP5GTECIAKYN 
FHGTAEQDLP FCKGDVLTI VAVTKDPNW 
YKAKNKVGREG 1 1 PANYVQKREGVKAGT 
KLSLMPWFHGKITREQAERLLYPPETGL 
FLVPJE STNYPGD YTLCVS CDGKVEHYRI 
MYHASKLSIDEEVYFENLMQLVEHYTSD 
ADGLCTRLI KPKVMEGTVAAQDE FYRSG 

WAT .MMVPTtVT .T J~\T T CZ KT2 PPTinVMTi'Sn VP 

GNKVAVKC I KNDATAQAFLAEAS VMTQXi 
RHSNLVQLLGVIVEEK33GLYIVTEYMAK 
GSLVD YI*PSPGP S VTinGDC!T»I.KPSTiD VC 

UuU V X LMViJXVwXvO V L WJlwAJl.* 1 1 1.IJVJ7 ij i J I./ V V* 

EAMEYLEGNNFVHRDLAARNVLVSEDNV 
AKVS DFGLTKEASS TQDTGKLPVKWTAP 
EALRE KKFSTKS DVWS FGI LLWE IYS FG 
RVP YPRI PLKDWPRVEKGYKMDAPDGC 
P PAVYEVMKNCWHLDAAMRPS FLQLREQ 
LEHIKTHELHL . 


13865 


27766 


A 


14033 


619 


414 


EQARCLEQHACTS PRQPRPQLQCSELKG 
HSLGMEHOGHVPCLHPCSPCPPVKASPA 
CWSSNPAPLCPH 


13866 


27767 


A 


14034 

• 

■ 


2 

■ 

• 


614 


LESRPDGRPSSTHPASPSAFSAPGKPHP 
P EAKMS S KRAKAKTTKKRPORATSNVFA 
MFDQSQIQEFKEAFNMIDQNRDGFIDKE 
DLHDMLASLGKNPTDEYLSGMMSEAPGP 
INFTMFLTMFGEKLNGTDPEDVIRNAFA 
CFCEEASGFIHEDHLREXLTTMGDRFTD 
EEVDEMYREAP IDKKGNFNYVEFTRILK 
HGAKDKDD 


13867 


27768 


A 


14035 


1 


254 


AVEFGPAGPGSLGRAAAMI I PVRCFTCG 
KIVGNKWEAYLGLLQAEYTEGDALDALG 
LKRYCCRRMLLAHVDLIEKLLNYAPLEK 


13868 


27769 


A 


14036 


494 


148 


FRFLSDCGVFAEGH I ELQVESGVPLGFS 
TMAEDMETKI KNYKTAPFDSRFPNQNQT 
RNCWQNYIiDFHRCQKAMTAKGGDISVCE 
WYQRVYQSLCPTSWVTDWDEQRAEGTFP 
GKI 


13869 


27770 


A 


14037 


130 


3969 


IMGDVKNFLYAWCGKRKMTPSYEIRAVG 
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sequence 
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SEQH) 
NO: of 
peptide 
sequence 


eth 

00 


SEQID 
NO: in 

1 ICCW 

USSN 

09/515,1 

26 


Predicted 

beginning 
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location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A°Alanine 
OCysteine, D=Aspartic Add, 
r>— tiiutamic Acid, r=rnenytalanine, 
G=GIycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P^Proline, 
Q^Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=TyrosIne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=p05sibte 
nucleotide insertion 


* 

■ 

• 

• 

■ 


i 

■ 

• 




♦ 


• 

* 




NKNRQ KFMCEVQVEG YNYTGMGNS TNKK 
D AQ S NAARDFVNYL VRINE I KSE2VPAF 
GVAS P PPLTDTPDTTANAEGDLPTTMGG 
P L P P HLAL KAENNS BVGASG YGVPGPTW 
DRGANLKDYYSRKEEQEVQATLBSEBVD 
LNAGLHGNWTLENAKARLNQYFQKEKIQ 
GEYKYTQVGPDHNRS F I AEMTI YI KQLG 
RRIFAREHGSNKKIiAAQSCALSLVRQLY 
HIiG WEAYS GLTKKKEGETVE PYKVNLS 
QDLEHQLQNIIQELNLEILPPPEDPSVP 
VALN IGKLAQFEP SQRQNQVGVVPWSPP 
QSNWNPWTSSNIDEGPLAFATPEQISMD 
LKNELMYQLEQDHDLQAILQERELLPVK 
KPE SE I LEAIS QNSWI IRGATGCGKTT 
QVPQFILDDFIQNDRAAECNIWTQPRR 
I SAVSVAERVAFERGEEPGKS CGYSVRF 
ES VLPRPHAS IMFCTVGVLLRKLEAGIR 
GISHVIVDEIHERDINTSFLLWLRDW 
QAYPEVRIVFMSATIDTSMFCEYFFNCP 
SLKLWRTYPVQEYFLEDCIQMTHFVPPP 
KDKKKKDKDDDGGEDDDANCNL I CGDE Y 
GPETRLSMS QLNE KETPFEL I EALLKY I 
ETLNVPGAVLVFLPGWNLIYTMQKHLEM 
NPHFGSHRYQILPLHSQIPREEQRKVFD 
PVP VGVT KVTLSTNIAETS I T INDWYV 
IDS CKQKVKLFTAHlh©1TNYSTVWASKT 
NLEQRKGRAGRSTAGFCFHLCSRARFER 
LETHMTPEMFRTPLHE IALS I RLLRLGG 
IGQFLAKAIEPPPLDAVTEAEHTLRELD 
ALDANDELTPLGRXIiAKLPIBPRFGKMM 
IMGC I FYVGDAI CTIAAATCFPEPFVNE 
GKQLGYIHRNFAGNRFSDHVALLSVFQA 
WDDARMGGEEAE I RFCEHKRLNMATLRM 
TWEAKVQ LKE I L I NSG FPED CLLTQVFT 
NTG PDNNLD WI S LLAFGVYPNVC YHKE 
KRK I LTTEGRNAL I HKS S VNCPFS SQDM 
NYPSPFFVFGEKIRTRAISAKGMTLVPP 
LQLLLFA3KKVQSDGQIVLVDDWIKLQI 
SHEAAACITGLRAAMEALWEVTKQPAI 
ISQLDPVNERMLNMIRQISEPSAAGINL 
MIGSTRYGDGPRPPKMARYDNGSGYRRG 
GS S YSGGG YGGG YSSGGYGSGGYGGSAT 
PSGRI CAGVGGGYRGVSRGGFRGNSGGD 
YRGPSGGYRGSGGFQRGGGRGAYGTGYL 
DIEEEVAAIKIjGYVSSVCRQ 


13870 


27771 


A 


14038 

• 


431 


542 


EGITPGWARWLTPVI PGLWEEEAGGSPG 
REIDIGLAN 


13871 


27772 


A 


14040 


238 


362 


RRCTFLS FRMLAPRVYS WGKRAFS PS V 
L.VKAnui\.UUxoXo 


13872 


27773 


A 


14041 


6290 


3514 


FRAAGSSSTNSGRICPLPSGALLYQSEG 
LLARP HGKG S FQ VGRQQHQAVRVGS AHA 
SSAGYVCVSAFPFVFLRFCVGGGPGISR 
VYALFYGE CNPTREWAVSS ELSPS FQEQ 
NKMNKVEQKSQESVSFipVTVGFTQEEW 
QHLDP SQRALYRDVMLENYSNLVS VGYC 
VHKPE VI FRLQQGEEPWKQEEBFPSQSF 
PEVWTADHLKERSQENQSKHLWEWFIN 
NEMLTKEQGDVTGIPFNVDVSSFPSRKM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first • 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Fnenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q^GIutamine, R«Arginine, S=Serine, 
^Threonine, V~Valine, 
W=Tryptophan, Y=Tyroslne, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


i 

• 

- 


• 




t 


• 


■ 

• 


FCQCDSCGMSFNTVSELVISKINYLGKK 

SDE FNACG KLLLN I KHDETHTRE KNEVL 

KNRNTIiSHRENTLQHEKIQTLDHNFEYS 

ICQETLLEKAVFNTRKRENAEENNCDYN 

EPGRTPCDSSSLLFHQISPSRDNHYEFS 

DCEKFLCVKSTLS KPHGVSMKRYDCGE S 

GNNFRRKLCLSHLQKGDKGEKHF3CNEC 

GKAFWEKSHLTRHQRVHTGQKP FQCNE C . 

EKAFWDKSNLTKHQRSHTGBKPFECNBC 

GKAFSHKSALTLHQRTHTGEKPYQCNAC 

GETFYQKSDLTKHQRTHTGQKPYECYEC 

GKSFCMNSHLTVHQRTHTGBKPFECLEC 

GKS F CQKSHLTQHQRTHIGDKP YECNAC 

GKTFYHKSVLTRHQ I IHTGLKP YECYBC 

GKTFdiKSDLTIHQRTHTGEKPFACPEC 

GKFFSHJCSTL5QHYRTHTGEKPYECHEC 

GKIFYNKSYLTKHNRTHTGEKPYECNEC 

GKTFCQKSQLTQHQRIHIGEKPYECNEC 

GKAFCHKSAL I VHQRTHTQEKP YKCNE C 

GKSFCVKSGLI FHERKHTGE KP YECNE C 

GKFFRHKSSLTVHHRAHTGEKSCQCNEC 

GKIFYRKSELAQHQRfeHTGEKPYECNTC 

RKTFS QKSNL I VHQRRHIGENLMNEMD I 

RNFQPQVSLHNASEYSHCGES PDDILNV 

Q 


13873 


27774 


A 


14043 


187 


422 


LTIPKLHSAQEPAGPVSPPPPPSPTFQI 
GGRSRAE PRNQ YCPQVAPVPALRGCLPL 
SPGEPPHDTSSPREXTCGGIVF 


13874 


27775 


A 


14044 


340 


442 


NKFLS WVRLLMP VI PTFWEAKVGGWLEA 
RSLRMQ 


13875 


27776 

■ 


A 


14045 

• 


3 

■ 

■ 


1240 

■ 


LVEGAAGQGVSDGARLRKCGTRSFPGSE 
EVLSSMARGSAFPAAAIiWLWS I LLCLLA 
LRAEAGP PQEE S L YLW I DAHQARVL IG F 
EED I L I VS EGKMAP FTHD FRKAQQRMP A 
ISVNIHSMNFTWQAAGQAEYFYEFLSLR 
SLDKG IMAD PTVNVPLIiGTVPHKAS WQ 
VGFPCLGKQDGVAAFEVDVTVMNSEGNT 
ILQTPQNAI FFKT CQQAECPGGCRNGG F 
GNERRI CECPDGFHGPHCEKALCT PRCM 
NGGLCVT PGFC I CP PGFYGVNCDKANCS 
TTCFNGGTCFYPGKCI CPPGLEGEQCEI 
SKCPQP CRNGGKCI GKfi KCKCS KG YQGD 
IiCSKPVCEPGCX3AHGTCHEPNKCQCQEG 
WHGRHCNKRYEASL IHALRPAGAQLRQH 
TPSLKKAEERRDPPESNYIW 


13876 

• 


27777 


A 


14046 

• 




274 


YSQEVLCEMNRSG WSNI KRGWAWWQLT 
TTIPALWEAEVGGLIiEPRSLRLAS PQPP 
QHEDYSMS YCAQPRFSLCIRGFI FYSLW 
r 


13877 


27778 


A 


14047 


177 


441 


GGSHYAWPNITFI FTKLIAQFFCFFEMQ 
SCSVARLECRGAISAHCNLHLPGSSNSP 
VMAQIIiKNQRKSNFDEDATLLGLSYTLL 
GEI 


13878 


27779 


A 


14048 


136 


319 


DVFLLLFHSNYFLFFETGSHSVSQAGVK 
WCDLSSLQPPPLGSSDPPTSASRVPALF 
FFLEF 


13879 


27780 


A 


14049 


388 


128 


ELKLPSYKGQSPQLSLRRYFADLIAIVS 
NRFTLCPSARHLAVYLLDLFMDRYDISI 
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ivcauun 

correspon 
ding to 
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peptide 
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Amino acid sequence (A=A!anine 
OCysteine, D=Aspartic Acid, 

I?-— nf o mi/* A />!H ITrrPHanvlalonin* 
II/-— VrlUlil ul 1C AKCIU, F— -Jr IJcnyifllaDIDVf 

G=Glycine, H=Histidine, I^Isoleuctne, 

1^=1 .vein a I^=1^nrinA IMalvlAth Inning. 

N=>Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V*=Vaiine; 

X=Un known, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














nr\r .wt lV& t . q pt .T.T.ac irvi? ctyi .wnwinyTtM 
w W iJTLu v ajjo ^jjjj rv i Co jjunisw jvp un 

FI 


13880 


27781 


A . 


14050 . 

« 


262 


3 


VIYNYISVRPFPNIYHTFYTLSLFLYLS 
THTHTHTHTHTHTQRETGHVCV CI YTYM 
YMCMYAKWWYKFTFLISGKARIHTHLHP 
CI 


13881 


.27782 


A 


14051 


149 


423 


YPHLCAFLLLPSISSFRAGVFFFSYLSL 
TAYPLTWKPPEPPATPQPTPNPPPGHPA 
TRGPKSALLDHSYACASAPPLGTEGTAP 
SPSFALI 


13882 

■ 


27783 

• 

i 


A 


14052 


2 


1440 


FVAVTAMAAPCLLRQGRAGALKTMLQEA 
QVFRGLASTVSLSAESGKSEKGQPQNSK 
KQSPPKNWEPKERGKLLATQTAABLSK 
NLSSPSSYPPAVNKGRKVASPSPSGSVL 
FTDEGVPKFLSRKTLVEFPQKVIiS P PRK 
QGSDSEARQVGRKVTSPSSSSSSSSSDS 
ES DDEADVS EVTPRWS KGRGGLRKPEA 
SHSFENRAFRVTVSAKEKTLLQKPHVDI 
TDPEKPHQPKKKGS PAKPSEGRENARPK 
TTMPRSQVDEEFLKQSLKBKQLQKTFRIi 
NEIDKESQKPFEVKGPLPVHTKSGLSAP 
PKGS PAPAVLAEEARAEGQLQAS PPGAA 
EGHLBKPVPEPQRKAAPPLPRKETSGTQ 
G IEGHLKGGQAI VEDQIPPSNLETVPVE 
NNHGFHEKTAALKLEAEGEAMEDAAAPG 
NDRGGTQE PAP VPAEPFDNTTYKNLQHH 
DYSTYTFLDLNLELS KFRMPQPS SGRES 
PRH 


13883 

■ 


27784 

• 


A 

- 


14053 


177 

• 


1253 

• 


EKKRTLPVRSVTGBLQGKSLSDLAAGTM 
DSEKKRFTEEATKYFRERVSPVHLQILL 
TNNEAWKRFVTAAELPRCEABALYEALK 
KLRTYAAI EDE YVQQKDEQFREWFLKEF 
PQVKRKIQESIBKLRALANGIEEVHRGC 
T I SNWS S STGAASG I MSLAGLVLAP FT 
AGTSLALTAAGVGLGAASAVTGITTSIV 
EHS YTSSAEAEASRLTATS IDRLKVFKE 
VMRDITPNLLSLLNNYYEATQTIGSEIR 
AIROARARARLPVTTWRISAGSGGQAER 

1 JLAu 1 1KAV oKvj*AK1.LiS>A1 1 ova J. c liAltU 

WNLVYES KHLHEGAKSASAEELRRQAQ 
ELEENLMELTQIYQRLNPCHTH 


13884 

• 


27785 


A 


14054 


1585 


2099 


ICVKTFPPLALQVRMAAAEHHHSSGLPY 
WP YLTAETLKNRMGHQ PPPPTQQHS ITD 
NSLSLKTPSERLLYPLPPSAPPSADDNL 
KTPPECLLTPLPPSALPSADDNLKTPAE 
CLLYPLPPSADDNLKTPPECLFTPLPPS 
APPSVDDNLKTPPECVCSLPFHPQRMII 
SRN 


13o&5 

■ 


OTTO *T 

27786 




14055 


<-> 
2 


2oo5 


aT.Drv2f2ii Qva^np aF^PDtv p^tctjxbpi? 

KTEGAVAAVGGGP S S FRCCYG CCHEARL 
GRTSLPRGVIMLTEASLSIWGWGSLGIV 
LFLITFGPFVIFYLTFYILCFVGGGLW 
TLtiFGKTNSEKYLEQCEHSFLPPTSPGV 
P KCLEEM KREART I KIDRRLTGANI IDE 
PLQQVIQFSLRDYVQYWYYTLSDDESFL 
LEIRQTLQNALIQFATRSKEIDWQPYFT 
TRIVDDFGTHLRVFRKAQQKITEKDDQV 
KGTAEDLVDTFFEVEVEMEKEVCRDLVC 
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SEQ ID 
NO: of 
nucleotide 
sequence 

* 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 

Ou 


SEQ ID 
NO: in 

09/515,1 
26 

■ 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucie- . 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 

X? — t~* 1 ■ i m its A mlA tT^—Ok nnvlnlnninn 

n^Hjiuiamic ackj, r— r nenyiaianine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=As para gin e, P»Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
"^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


i 




r 




• 




TS PKDE EGFLRDLCE VLL YLLLPPGDFQ 
NKIMRYFVREILARGILLPLINQLSDPD 
YINQYVIWMIRDSNCNYEAFMNI IKLSD 
NIGELEAVRDKAAEELQYLRSLDTAGDD 
INTIKNQINSLLFVKKVCDSRIQRLQSG 
KEINTVKLAANFGKLCTVPLDSILVDNV 
ALQFFMDYMQQTGGQAKLFFWMTVEGYR 
VTAQQQLEVLLSRQRDGKHQTNQTKGLL 
RAAAVG I YEQYLS E KAS PRVTVDD YLVA 
KLADTLNHEDPT PE I FDD I QRKVYBLML 
RDERFYPSFRQNALYVRMLAELDMLKDP 
S FRGSDDGDGES FNGS PTGS INLSLDDL 
SNVS SDDS VQLHAY I SDTVYADYDP YAV 
AGVCNDHGKTYAL YAI TVHRRNLNS EEM 
WKTYRRYSDFHDFHMRITEQFESLSS IL 
KLPEKKTFNNMDRDFLEKRKKDLNAYLQ 
LLLAP E MMKAS PALAHYVYD FLENKAYS 
KGKGD FARKMDT FVNPLRNSMRNVSNAV 
KSLPDSIAEGMTKMSDNMGKMSERLGQD 
IKQS FFKVPPLIPKTDSDPEHRRVSAQL 
DDNVDDNI PLR VML LLMD EVFD LKERNQ 
WLRRNIKNLLQQLIRATYGDTINRKIVD 
HVDWMTS PEQVADSVKRFRDAFWPNGIL 
AEAVPCRDKS I RMRTRVAG KTKLLA I MP 
GE 


13886 


27787 


A 


14057 


311 


150 


FLCFKYRRGFATLHSGLKLLGSSDPPSS 
TSQIPG ITGTSHCVQPTHLFFLALS 


13887 


27788 


A 


14058 


53 


211 


RQHITCLDIFFIHMNYKYKYINIYLSIY 
LSIYLS IYLI YLSXSHSSYITERA 


13888 

- 


27789 

• 

- 


A 


14059 


1 

* 

> 

■ 

• 


1642 


RDGRKMATAT IALQ VNGQQGGGSE PAAA 
AAWAAGD KWKP PQ GTDS I KMENGQ S TA 
AKLGLPPLTPEQQEALQKAKKYAMEQSI 
KSVLVKQTIAHQQQQLTNLQMAAQRQRA 
LAIMCRVYVGS I YYELGEDTIRQAFAPF 
GPIKSIDMSWDSVTMKHKGFAFVEYEVP 
EAAQLALEQMNSVMLGGRNIKVGRPSNI 
GQAQPI IDQLAEEARAFNRI YVASVHQD 
LSDDDI KSVFEAFGKI KSCTLARDPTTG 
KHKGYGFIEYEKAQSSQDAVSSMNLFDL 
GGQYLRVGKAVTPPMPLLTPATPGGIiPP 
AAAVAAAAATAKI TAQEAVAGAAVLGTL 
GTPGLV5 P ALT LAQ P LGTLPQAVMAAQA . 
PGVITGVTPARP P I PVTI PS VGWNP IL 
ASPPTLGLLEPKKEKEEEBIjFPESERPE 

STVMVLRNMVDPKDIDDDLEGEVTEECG 
KFGAVNRVIIYQEKQGEEEDAEIIVKIF 
VEFSIASETHKAIQALNGRWFAGRKWA 


13889 


27790 


A 


14060 


1238 


1474 


VLALQ EGRP WRRE PAS IDACRLNFQRLR 
RGKFSNVLFPGLAQEALYS GGYHLKFAD 
ELMGGNLKKSTADASG SRGHQL 


13890 


27791 


A 


14061 


266 


400 


GQWARP VI PALWEAKAGGSLE PSMLRPA 
GQHIKTPSVLITSKKT 


13891 


27792 


A 


14062 


2178 


1 881 


VLQAPS ILLDAPRTDGGDMGRAMVARLG 
LWLIiLLALLLPTQIYSSErriXiTSSNSS 
QSTSNTGLAPNPTNATTKAAGGAIiQSTA 
SLFWSLSLLHLYS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M, 
eth 
od 


SEQID 
NO:in 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Crlutamic Acid, t~ rnenylaianine, 
G=Glycine, H-Histidine, t=Iso leucine, 
K^Lysine, LHLeucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, K— Arglnlne, 5— oertne, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, ^possible 
nucleotide deletion, \~possible 
nucleotide insertion 


13892 


27793 

* 


A 


14063 


261 


1 


• NSVLTIiALLFSLHVHLPPPPSGVTDTAG 
ALSTGGPPSPTSVTGRGPAHSHASQLPP 
APGE FAP LNE S CRGWAG EAFLERFPDAW 
VDP 


13893 

■ 


27794 

i 

j 


A 


14064 


2384 


1115 

■ 


QHFSRRGLCWEQRS S VTS S WTSGAWS P 
PCPPSNASCNTLHTRDWASPDPGGQGSL 
GESPGPAPPGQLHTLDTDLHSIiAQIGGK 
SPVAGVGNGGSLWPRESPGTANGHSPEH 
TPPGPGPPGPCPTKRRIiLPAGEAPDVSS 
E EEG PAPRRRRGSLGHPTAANS SDAKAT 
P FWSHLL PGP KB PVLDPTDCGPMGRRLK 
GARRLKLSPLRS LRKGPGLLS PPRASPV 
PTPAVS RTLLGNFEE S LLRGRFAPSGH I 
EGFTAE IGASGS YCPQHVTLPVTVTFFD 
VSEQNAPAP FLG I VDLNPLGRKG YS VP K 
VGTV Q VTLFN PNQTWKMFLVT PD FSDM 
PAAHMTFLRHRLFLVPVGEBGNANPTHR 
LLCYLLHLRFRSSRSGRLSLHGDIRLLF 
S RRS LELDTGLP YELQAVTEAPHNPRYS 
PLP 


13894 


27795 


A 


14065 


232 


416 


GWAVQLG I GTEGKEGTS S S BRQREGRGD 
GNRKTGRRREGRRSTKTDAPS P YPAHRP 
RSKLI 


13895 


27796 


A 


14067 


238 


4* 


IQTCGSHSFCIPSIWEAKAGGLLEPRSL 
SPACPTQKDPISHLGSELPQSFRLLIRL 
VPLFLSAC 


13896 

■ 


27797 

• 

• 


A 

■ 


14068 

* 

• 

- 

■ 

■ 


3 

• 

■ 

• 


1705 


SCESKATPWRAVSASQELQRPQGGQRSP 
LPGDLTPARPNPAYPLTVECQRSCSRPV 
PAPLPPHTHPPGSSCFYSSFSFITKATA 
PGAQRRAVTQAE RGRMGFLGTGTW I LVL 
VLPIQAFPKPGGSQDKSLHNRELSAERP 
LNEQIAEAEEDKIKKTYPPENKPGQSNY 
S FVDNLNLLKAI TE KEKIEKERQS IRSS 
PLDNKLNVED VD S TKNRKL I DD YD S TKS 
GLDHKFQDDPDGLHQLDGTPLTAEDIVH 
KIAARI YEENDRAVFDKXVS KLLNLGLI 
TESQAHTLEDEVAEVTjQKLISKEANNYE 
EDPNKPTS WTENQAGK I PEKVTPMAAIQ 
DGLAKGENDETVSNTLTLTNGLERRTKT 
YSEDNFEELQYFPNFYALLKSIDSEKEA 
KEKE^TLITIMKTLIDFVKMMVKYGTISP 
EEGVS YLENIiDEMI ALQTKNKLE KNATD 
NISKLFPAPSEKSHEETDSTKEEAAKME 
KEYGSLKDSTKDDNSNPGGKTDEPKGKT 

TP7VVT 1?7\ TO VKTTI7UT ZTWTWVr'U 1/ ur\VrtT 

SKMRDFINKQADAYVEKGILDKEEAEAI 


13897 


27798 


A 


14069 


1 


140 


NTSVAIQTIEYYSAFKRKETLTHVPLWM 
NLKDTMLREI SOSGKDTV 


13898 


27799 


A 


14071 


212 


104 


HTNHQCQTVLPAGQALATPQGLAPSPVF 
PQSCLRMV 


13899 


27800 


A 


14072 


358 


112 


FPLTVMLNVFLWWQNLKHFCWFIQEQLM 
FFFFFFLRTGSHYVVQAGLBLLGSTNLP 
ASVSRVAGTTGIHNCTQFNPLHTLHL 


13900 


27801 


A 


14073 


3 


151 


YMGFHHVGYAGLELLTSSDLPPWPPKCW 
DYRHE PS CLAMFFYFAL I APE 


13901 


27802 


A 


14074 


8 


1493 


VTIHHLFVGQAVRADTLKKKSPSCLLCI 
VFI PVPRIDLTNIDQVAVT FKHHFPVGR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystelne, D=Aspartic Acid, 
E=GIutamic Acid, {^Phenylalanine, 
G=GJycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N»Asparagine, PHProline, 
Q=GIutamine, R=Arginine, S=Serlne ( 
T=Threonine, V=Valine, 
W=Tryptophan,Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 








i 






GDAVLKTWAPAQCLCSRMGPAWLWLLGT 
GILASVHCQPLLAHGDKSLQGPQPPRHQ 
LSEPAPAYHRITPTI TNFALRLYKELAA 
DAPGWIFPSPVSISTTLALLSLGAQANT 
SALILEGU3FNLTETPEADIHQGFRSLL 
HTLALPSPKLELKVGNSLFLDKRLKPRQ 
HYLDS I KELYGAFAFSANFTDSVTTGRQ 
INDYLRRQTYGQVVDCLPEFSQDTFMVL 
ANYIFFKAKWKHPFSRYQTQKQESFFVD 
ERTS LQVPMMHQ KEMHRFLYDQDLACTV 
LQ I E YRGNALALLVL PDPGKMKQVEAAL 
QPQTLRKWGQLLLPSLLDLHLPRFSISG 
TYNLEDILPQIGLTNIIzNLEADFSGVTG 
QLNKT I S KVS HKAM VDMSE KG TEAGAAS 
GLLSQPPSLNTMSDPHAHFNRPFLLLLW 
EVTTQSLLFLGKWNPVAG 



I 
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* 

WHAT IS CLAIMED IS: 

> • • 

1 . An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-13901 , a mature protein coding portion of SEQ ED NO: 1-13901 , 
an active domain of SEQ ID NO: 1 -1 3901 , and complementary sequences thereof. 

5 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 

■ 

conditions. 

■ 

• * ■ 

T 

10 3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1 . 

■ 

- 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. ' 

■ » 1 

* ■ 

15 5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

c 

6. A vector comprising the polynucleotide of claim 1 . 
20 7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively 
25 associated with a regulatory sequence that modulates expression of the polynucleotide in the host 

cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 
30 (b) a polypeptide encoded by a polynucleotide hybridizing under stringent 

conditions with any one of SEQ ID NOS: 1-13901. 

■ 

11. A composition comprising the polypeptide of claim 1 0 and a carrier. 

35 12. An antibody directed against the polypeptide of claim 10. 
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■ • 

\ 

13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

i • 

• a) contacting the sample with a compound that binds to and forms a complex 

with the polynucleotide of claim 1 for a period sufficient to form the complex; and 
5 b) detecting the complex, so that if a complex is detected, the polynucleotide 

of claim 1 is detected. 

* 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
10 nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the polynucleotide of 

claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

15 sample. 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. 

20 16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to form the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 

25 

17. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 1 0 under 
conditions sufficient to form a polypeptide/compound complex; and 
30 b) detecting the complex, so that if the polypeptide/compound complex is 

detected, a compound that binds to the polypeptide of claim 10 is identified. 

18. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 
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a) contacting the compound with the polypeptide of claim 1 0, in a cell, under 

conditions sufficient to form a polypeptide/compound complex, wherein the complex drives 

■ 

expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 
of claim 10 is identified. 



1 9. A method of producing the polypeptide of claim 1 0, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected from 
1 0 the group consisting of a polynucleotide sequence of SEQ ID NO: 1-1 390 1 , a mature protein 
coding portion of SEQ ID NO: 1-13901 , an active domain of SEQ ID NO: 1-13901 , 

complementary sequences thereof and a polynucleotide sequence hybridizing under stringent 

i ■ 
conditions to SEQ ID NO: 1-13901 , under conditions sufficient to express the polypeptide in 

said cell; and 

15 b) isolating the polypeptide from the cell culture or cells of step (a). 



20 



20. An isolated polypeptide comprising an amino acid sequence ^selected from the 
group consisting of SEQ ID NOS: 13902-27802, the mature protein portion 
thereof, or the active domain thereof. 

■ • 

21 . The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 



25 



22. A collection of polynucleotides, wherein the collection comprises the 

sequence information of at least one of SEQ ID NOS: 1 -1 3901 . 



23 . The collection of claim 22, wherein the collection is provided on a nucleic 

acid array. 



30 



24. The collection of claim 23, wherein the array detects full-matches to any 

one of the polynucleotides in the collection. 



35 



25. The collection of claim 23, wherein the array detects mismatches to any 

one of the polynucleotides in the collection. 
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• ■ 

« . ■ 

26. The collection of claim 22, wherein the collection is provided in a 

computer-readable format , 



27. A method of treatment comprising administering- to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising a polypeptide of claim 
1 0 or 20 and a pharmaceutical^ acceptable carrier. 



28. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising an antibody that 
specifically binds to a polypeptide of claim 1 0 or 20 and a pharmaceutical^ 
acceptable carrier. 
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